                         SEQUENCE LISTING

<110>  KWS SAAT SE
 
<120>  Optimized plant CRISPR/Cpf1 systems

<130>  KWS0284PCT

<150>  US 62/616,136
<151>  2018-01-11

<160>  158   

<170>  PatentIn version 3.5

<210>  1
<211>  1413
<212>  DNA
<213>  Brachypodium distachyon

<400>  1
ggagaagaac tcgagaggga attgcagatc atgaggcaga tggctatttt tgtgtcacat       60

atgcgcaaaa agagaggcta tatttgtgtc cctaggttct tcgttgtatt gcagtttcca      120

tatcaatctg acttggtcgc atgagaaatt gatggttaaa taatttgaat ctctcatgta      180

gtatcaacta ttagatatta ttttcaccaa atatatttcc atcggagaag aagaggctac      240

agaggaagca gaagagaggg gtgggagaat ttttacactt ttgtacaccc acttaaacag      300

caaaatccgt atgaaaacag gcccaccaaa acaatgccac gataacaatc cgtagaaaca      360

aaagcttcat ttaacagcgg cgcaacaaag cacgcttatc catggtagtt gtagtccgta      420

tgcgatccaa agatcacgat tcacgcgtga cggacggacg acgcgtgcca caccacaact      480

aacggcatcc atggtagttg tagtccgtat gcgatccaaa gatcacgatt cacgcgtgac      540

ggacggacga cgcgcgccac accacaacta acagcgtgag ccagcgtcca aactccggat      600

ggcaacgggg acgaaacccg tcgggtagtc actgcccaaa cccgtccccg caaccttcat      660

cccaaacccg tccccgtttc cggtcgcggg tttcagtttt ctaccagacc cgtccccatc      720

gggtttttca tccccgtcgg gaaatccgaa cccgccagca tttcagcacc aagccaaagt      780

tgcagcagca acatgaataa aaaacaaccc gtttcaacac caagataaaa caaaacatta      840

taatttagac aacatttcac acgtataaca ataacatata gttctcacat ataacaacac      900

catttcacac ataaaacaac accatttggg ataaaaatat gggctatatc aggccatttt      960

tatgggccat attgagtttt cgtgggtttc acaggtaccg gatttgtaga atgctgaacc     1020

gggtttgaac cgtaaaatcc gcgggtattg aatttgaccc aatcccgtcg tcccctggtg     1080

gggtaaaaac accatcttga gtccaaacgg ccaccaacca aactccgacg gcaacaaaca     1140

aacggcgttg ctttgctcct cggtatctcc gtgaccgctc aatctcccgg ctgtttcccc     1200

ggaattgcgt ggactctctc atccacacgc aaaccgcctc tccctcctct ctcgtcctat     1260

ccgccccggt gccgtagcct cacgggactc ttcttcctcc cttgctataa aatccccgcc     1320

ccctcccgtc tcctctccac acatccaaac tctcaatcgc accgagaaaa atctcctagc     1380

gatcgaagcg aagcctctcc cgatcctctc aag                                  1413


<210>  2
<211>  981
<212>  DNA
<213>  Zea mays

<400>  2
tgcagcgtga cccggtcgtg cccctctcta gagataatga gcattgcatg tctaagttat       60

aaaaaattac cacatatttt ttttgtcaca cttgtttgaa gtgcagttta tctatcttta      120

tacatatatt taaactttac tctacgaata atataatcta tagtactaca ataatatcag      180

tgttttagag aatcatataa atgaacagtt agacatggtc taaaggacaa ttgagtattt      240

tgacaacagg actctacagt tttatctttt tagtgtgcat gtgttctcct ttttttttgc      300

aaatagcttc acctatataa tacttcatcc attttattag tacatccatt tagggtttag      360

ggttaatggt ttttatagac taattttttt agtacatcta ttttattcta ttttagcctc      420

taaattaaga aaactaaaac tctattttag tttttttatt taataattta gatataaaat      480

agaataaaat aaagtgacta aaaattaaac aaataccctt taagaaatta aaaaaactaa      540

ggaaacattt ttcttgtttc gagtagataa tgccagcctg ttaaacgccg tcgatcgacg      600

agtctaacgg acaccaacca gcgaaccagc agcgtcgcgt cgggccaagc gaagcagacg      660

gcacggcatc tctgtcgctg cctctggacc cctctcgaga gttccgctcc accgttggac      720

ttgctccgct gtcggcatcc agaaattgcg tggcggagcg gcagacgtga gccggcacgg      780

caggcggcct cctcctcctc tcacggcacc ggcagctacg ggggattcct ttcccaccgc      840

tccttcgctt tcccttcctc gcccgccgta ataaatagac accccctcca caccctcttt      900

ccccaacctc gtgttgttcg gagcgcacac acacacaacc agatctcccc caaatccacc      960

cgtcggcacc tccgcttcaa g                                                981


<210>  3
<211>  1247
<212>  DNA
<213>  Oryza sativa

<400>  3
tagctagcat actcgaggtc attcatatgc ttgagaagag agtcgggata gtccaaaata       60

aaacaaaggt aagattacct ggtcaaaagt gaaaacatca gttaaaaggt ggtataagta      120

aaatatcggt aataaaaggt ggcccaaagt gaaatttact cttttctact attataaaaa      180

ttgaggatgt tttgtcggta ctttgatacg tcatttttgt atgaattggt ttttaagttt      240

attcgcgatt tggaaatgca tatctgtatt tgagtcggtt tttaagttcg ttgcttttgt      300

aaatacagag ggatttgtat aagaaatatc tttaaaaaac ccatatgcta atttgacata      360

atttttgaga aaaatatata ttcaggcgaa ttccacaatg aacaataata agattaaaat      420

agcttgcccc cgttgcagcg atgggtattt tttctagtaa aataaaagat aaacttagac      480

tcaaaacatt tacaaaaaca acccctaaag tcctaaagcc caaagtgcta tgcacgatcc      540

atagcaagcc cagcccaacc caacccaacc caacccaccc cagtgcagcc aactggcaaa      600

tagtctccac ccccggcact atcaccgtga gttgtccgca ccaccgcacg actcgcagcc      660

aaaaaaaaaa aaagaaagaa aaaaaagaaa aagaaaaaca gcagctgggt ccgggtcgtg      720

ggggccggaa aagcgaggag gatcgcgagc agcgacgagg cccggccctc cctccgcttc      780

caaagaaacg ccccccatcg ccactatata catacccccc cctctcctcc catcccccca      840

accctaccac caccaccacc accacctcct cccccctcgc tgccggacga cgagctcctc      900

ccccctcccc ctccgccgcc gccggtaacc accccgcccc tctcctcttt ctttctccgt      960

tttttttttc gtcacggtct cgatctttgg ccttggtagt ttgggtgggc gagagcggct     1020

tcgtcgccca gatcggtgcg cgggaggggc gggatctcgc ggctggcgac tccgggcgtg     1080

agtcggcccg gatcctcgcg gggaatgggg ctctcggatg tagatcttct ttctttcttc     1140

tttttgtggt agaatttgaa tccctcagca ttgttcatcg gtagtttttc ttttcatgat     1200

ttgtgacaaa tgcagcctcg tgcggagctt ttttgtaggc ctagaag                   1247


<210>  4
<211>  691
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  p35S promoter sequence

<400>  4
gttcagaaga ccagagggct attgagactt ttcaacaaag ggtaatatcg ggaaacctcc       60

tcggattcca ttgcccagct atctgtcact tcatcgaaag gacagtagaa aaggaagatg      120

gcttctacaa atgccatcat tgcgataaag gaaaggctat cgttcaagat gcctctaccg      180

acagtggtcc caaagatgga cccccaccca cgaggaacat cgtggaaaaa gaagacgttc      240

caaccacgtc ttcaaagcaa gtggattgat gtgatacatg gtggagcacg acactctcgt      300

ctactccaag aatatcaaag atacagtctc agaagaccag agggctattg agacttttca      360

acaaagggta atatcgggaa acctcctcgg attccattgc ccagctatct gtcacttcat      420

cgaaaggaca gtagaaaagg aagatggctt ctacaaatgc catcattgcg ataaaggaaa      480

ggctatcgtt caagatgcct ctaccgacag tggtcccaaa gatggacccc cacccacgag      540

gaacatcgtg gaaaaagaag acgttccaac cacgtcttca aagcaagtgg attgatgtga      600

tatctccact gacgtaaggg atgacgcaca atcccactat ccttcgcaag acccttcctc      660

tatataagga agttcatttc atttggagag g                                     691


<210>  5
<211>  519
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Ubi1 intron promoter sequence

<400>  5
gtacgccgct cgtcctcccc cccccccctc tctaccttct ctagatcggc gttccggtcc       60

atggttaggg cccggtagtt ctacttctgt tcatgtttgt gttagatccg tgtttgtgtt      120

agatccgtgc tgctagcgtt cgtacacgga tgcgacctgt acgtcagaca cgttctgatt      180

gctaacttgc cagtgtttct ctttggggaa tcctgggatg gctctagccg ttccgcagac      240

gggatcgatc taggataggt atacatgttg atgtgggttt tactgatgca tatacatgat      300

ggcatatgca gcatctattc atatgctcta accttgagta cctatctatt ataataaaca      360

agtatgtttt ataattattt tgatcttgat atacttggat gatggcatat gcagcagcta      420

tatgtggatt tttttagccc tgccttcata cgctatttat ttgcttggta ctgtttcttt      480

tgtcgatgct caccctgttg tttggtgtta cttctgcag                             519


<210>  6
<211>  1500
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pZmUbi1+Ubi1 intron promoter sequence

<400>  6
tgcagcgtga cccggtcgtg cccctctcta gagataatga gcattgcatg tctaagttat       60

aaaaaattac cacatatttt ttttgtcaca cttgtttgaa gtgcagttta tctatcttta      120

tacatatatt taaactttac tctacgaata atataatcta tagtactaca ataatatcag      180

tgttttagag aatcatataa atgaacagtt agacatggtc taaaggacaa ttgagtattt      240

tgacaacagg actctacagt tttatctttt tagtgtgcat gtgttctcct ttttttttgc      300

aaatagcttc acctatataa tacttcatcc attttattag tacatccatt tagggtttag      360

ggttaatggt ttttatagac taattttttt agtacatcta ttttattcta ttttagcctc      420

taaattaaga aaactaaaac tctattttag tttttttatt taataattta gatataaaat      480

agaataaaat aaagtgacta aaaattaaac aaataccctt taagaaatta aaaaaactaa      540

ggaaacattt ttcttgtttc gagtagataa tgccagcctg ttaaacgccg tcgatcgacg      600

agtctaacgg acaccaacca gcgaaccagc agcgtcgcgt cgggccaagc gaagcagacg      660

gcacggcatc tctgtcgctg cctctggacc cctctcgaga gttccgctcc accgttggac      720

ttgctccgct gtcggcatcc agaaattgcg tggcggagcg gcagacgtga gccggcacgg      780

caggcggcct cctcctcctc tcacggcacc ggcagctacg ggggattcct ttcccaccgc      840

tccttcgctt tcccttcctc gcccgccgta ataaatagac accccctcca caccctcttt      900

ccccaacctc gtgttgttcg gagcgcacac acacacaacc agatctcccc caaatccacc      960

cgtcggcacc tccgcttcaa ggtacgccgc tcgtcctccc ccccccccct ctctaccttc     1020

tctagatcgg cgttccggtc catggttagg gcccggtagt tctacttctg ttcatgtttg     1080

tgttagatcc gtgtttgtgt tagatccgtg ctgctagcgt tcgtacacgg atgcgacctg     1140

tacgtcagac acgttctgat tgctaacttg ccagtgtttc tctttgggga atcctgggat     1200

ggctctagcc gttccgcaga cgggatcgat ctaggatagg tatacatgtt gatgtgggtt     1260

ttactgatgc atatacatga tggcatatgc agcatctatt catatgctct aaccttgagt     1320

acctatctat tataataaac aagtatgttt tataattatt ttgatcttga tatacttgga     1380

tgatggcata tgcagcagct atatgtggat ttttttagcc ctgccttcat acgctattta     1440

tttgcttggt actgtttctt ttgtcgatgc tcaccctgtt gtttggtgtt acttctgcag     1500


<210>  7
<211>  2503
<212>  DNA
<213>  Brachypodium distachyon

<400>  7
ggagaagaac tcgagaggga attgcagatc atgaggcaga tggctatttt tgtgtcacat       60

atgcgcaaaa agagaggcta tatttgtgtc cctaggttct tcgttgtatt gcagtttcca      120

tatcaatctg acttggtcgc atgagaaatt gatggttaaa taatttgaat ctctcatgta      180

gtatcaacta ttagatatta ttttcaccaa atatatttcc atcggagaag aagaggctac      240

agaggaagca gaagagaggg gtgggagaat ttttacactt ttgtacaccc acttaaacag      300

caaaatccgt atgaaaacag gcccaccaaa acaatgccac gataacaatc cgtagaaaca      360

aaagcttcat ttaacagcgg cgcaacaaag cacgcttatc catggtagtt gtagtccgta      420

tgcgatccaa agatcacgat tcacgcgtga cggacggacg acgcgtgcca caccacaact      480

aacggcatcc atggtagttg tagtccgtat gcgatccaaa gatcacgatt cacgcgtgac      540

ggacggacga cgcgcgccac accacaacta acagcgtgag ccagcgtcca aactccggat      600

ggcaacgggg acgaaacccg tcgggtagtc actgcccaaa cccgtccccg caaccttcat      660

cccaaacccg tccccgtttc cggtcgcggg tttcagtttt ctaccagacc cgtccccatc      720

gggtttttca tccccgtcgg gaaatccgaa cccgccagca tttcagcacc aagccaaagt      780

tgcagcagca acatgaataa aaaacaaccc gtttcaacac caagataaaa caaaacatta      840

taatttagac aacatttcac acgtataaca ataacatata gttctcacat ataacaacac      900

catttcacac ataaaacaac accatttggg ataaaaatat gggctatatc aggccatttt      960

tatgggccat attgagtttt cgtgggtttc acaggtaccg gatttgtaga atgctgaacc     1020

gggtttgaac cgtaaaatcc gcgggtattg aatttgaccc aatcccgtcg tcccctggtg     1080

gggtaaaaac accatcttga gtccaaacgg ccaccaacca aactccgacg gcaacaaaca     1140

aacggcgttg ctttgctcct cggtatctcc gtgaccgctc aatctcccgg ctgtttcccc     1200

ggaattgcgt ggactctctc atccacacgc aaaccgcctc tccctcctct ctcgtcctat     1260

ccgccccggt gccgtagcct cacgggactc ttcttcctcc cttgctataa aatccccgcc     1320

ccctcccgtc tcctctccac acatccaaac tctcaatcgc accgagaaaa atctcctagc     1380

gatcgaagcg aagcctctcc cgatcctctc aaggtacgcc cgtttcccgt cgatcctcct     1440

ccttccgttc gtgttctgta gccgatcgat tcgattccct tacacccgtt cgtgttctct     1500

cgtggatcga tcgattgttt gttgctagaa ggaactcgta gatctggcgt ttatgaactg     1560

tgattcgggt tagtccagat cgattcaggt cggtcgtcgt tgagcctctc ggctatgtct     1620

ggattatcgt gtagatctgc tggttcagtt gattatgttc ttctaggagt aatttcgttg     1680

ggtcagcgcg atttctgctt aatctatgct gcttattgcg cctgtaccta tctactaagc     1740

tatgtgcacc tgtaattttg ctagattatt cgttcatcct cgtagttggt ttgtcacagt     1800

aatccgtatg ggttctgacg atgttattgt tggtcatacc taggcttctc cagattttat     1860

tttgttaaaa ttggatagat ctgctactga tagttgatga tggaatttgg tgctgaatct     1920

atgctattta ttgcgcctat acctgatcta tcgggctatg tacggctgta gtttactgga     1980

ttattcgttc atcctcggta gttggttcat cgtttgggtt ctgacgataa tattgttgat     2040

tatgcgtagg cttctgcaga ttgttgttaa aattggatac atcggttact gatggttgat     2100

gatagatttg tgctgaacct atctgtttat tgctcctata cctgatctat agggctatgt     2160

atgcctgtaa tttaccagat tattcgttca tcctcgtagt tggttcatct ctataattcg     2220

tatgggttct tatgatgtta tcgttgatta tgcctagtct tatacagatt attgtgtcaa     2280

gattgaatat acctgctact gatcggtgat aatttggtta gtagtttgca atctgctagg     2340

aacacgttac cactgtaatc tgtaaacatg gtttgccaga gtagtttgtt ctactactct     2400

tgatatggtt gctgatttta gtcgcctcct tttggatcat gtattgatgt ccttgcagat     2460

ttccgtgtac ttaccccggc ttttgtgtac ttcgtgttaa cag                       2503


<210>  8
<211>  535
<212>  DNA
<213>  Zea mays

<400>  8
gtccgccttg tttctcctct gtctcttgat ctgactaatc ttggtttatg attcgttgag       60

taattttggg gaaagcttcg tccacagttt tttttcgatg aacagtgccg cagtggcgct      120

gatcttgtat gctatcctgc aatcgtggtg aacttatttc ttttatatcc tttactccca      180

tgaaaaggct agtaatcttt ctcgatgtaa catcgtccag cactgctatt accgtgtggt      240

ccatccgaca gtctggctga acacatcata cgatctatgg agcaaaaatc tatcttccct      300

gttctttaat gaaggacgtc attttcatta gtatgatcta ggaatgttgc aacttgcaag      360

gaggcgtttc tttctttgaa tttaactaac tcgttgagtg gccctgtttc tcggacgtaa      420

ggcctttgct gctccacaca tgtccattcg aattttaccg tgtttagcaa gggcgaaaag      480

tttgcatctt gatgatttag cttgactatg cgattgcttt cctggacccg tgcag           535


<210>  9
<211>  1090
<212>  DNA
<213>  Brachypodium distachyon

<400>  9
gtacgcccgt ttcccgtcga tcctcctcct tccgttcgtg ttctgtagcc gatcgattcg       60

attcccttac acccgttcgt gttctctcgt ggatcgatcg attgtttgtt gctagaagga      120

actcgtagat ctggcgttta tgaactgtga ttcgggttag tccagatcga ttcaggtcgg      180

tcgtcgttga gcctctcggc tatgtctgga ttatcgtgta gatctgctgg ttcagttgat      240

tatgttcttc taggagtaat ttcgttgggt cagcgcgatt tctgcttaat ctatgctgct      300

tattgcgcct gtacctatct actaagctat gtgcacctgt aattttgcta gattattcgt      360

tcatcctcgt agttggtttg tcacagtaat ccgtatgggt tctgacgatg ttattgttgg      420

tcatacctag gcttctccag attttatttt gttaaaattg gatagatctg ctactgatag      480

ttgatgatgg aatttggtgc tgaatctatg ctatttattg cgcctatacc tgatctatcg      540

ggctatgtac ggctgtagtt tactggatta ttcgttcatc ctcggtagtt ggttcatcgt      600

ttgggttctg acgataatat tgttgattat gcgtaggctt ctgcagattg ttgttaaaat      660

tggatacatc ggttactgat ggttgatgat agatttgtgc tgaacctatc tgtttattgc      720

tcctatacct gatctatagg gctatgtatg cctgtaattt accagattat tcgttcatcc      780

tcgtagttgg ttcatctcta taattcgtat gggttcttat gatgttatcg ttgattatgc      840

ctagtcttat acagattatt gtgtcaagat tgaatatacc tgctactgat cggtgataat      900

ttggttagta gtttgcaatc tgctaggaac acgttaccac tgtaatctgt aaacatggtt      960

tgccagagta gtttgttcta ctactcttga tatggttgct gattttagtc gcctcctttt     1020

ggatcatgta ttgatgtcct tgcagatttc cgtgtactta ccccggcttt tgtgtacttc     1080

gtgttaacag                                                            1090


<210>  10
<211>  517
<212>  DNA
<213>  Zea mays

<400>  10
gtacgccgct cgtcctcccc cccccctctc taccttctct agatcggcgt tccggtccat       60

ggttagggcc cggtagttct acttctgttc atgtttgtgt tagatccgtg tttgtgttag      120

atccgtgctg ctagcgttcg tacacggatg cgacctgtac gtcagacacg ttctgattgc      180

taacttgcca gtgtttctct ttggggaatc ctgggatggc tctagccgtt ccgcagacgg      240

gatcgatcta ggataggtat acatgttgat gtgggtttta ctgatgcata tacatgatgg      300

catatgcagc atctattcat atgctctaac cttgagtacc tatctattat aataaacaag      360

tatgttttat aattattttg atcttgatat acttggatga tggcatatgc agcagctata      420

tgtggatttt tttagccctg ccttcatacg ctatttattt gcttggtact gtttcttttg      480

tcgatgctca ccctgttgtt tggtgttact tctgcag                               517


<210>  11
<211>  258
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  NosT terminator sequence

<400>  11
cgatcgttca aacatttggc aataaagttt cttaagattg aatcctgttg ccggtcttgc       60

gatgattatc atataatttc tgttgaatta cgttaagcat gtaataatta acatgtaatg      120

catgacgtta tttatgagat gggtttttat gattagagtc ccgcaattat acatttaata      180

cgcgatagaa aacaaaatat agcgcgcaaa ctaggataaa ttatcgcgcg cggtgtcatc      240

tatgttacta gatcgatc                                                    258


<210>  12
<211>  204
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  35S terminator sequence

<400>  12
gtcgatcgac aagctcgagt ttctccataa taatgtgtga gtagttccca gataagggaa       60

ttagggttcc tatagggttt cgctcatgtg ttgagcatat aagaaaccct tagtatgtat      120

ttgtatttgt aaaatacttc tatcaataaa atttctaatt cctaaaacca aaatccagta      180

ctaaaatcca gatcccccga atta                                             204


<210>  13
<211>  3771
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  codon optimized polynulceotide

<400>  13
atggcatcta gcatggcacc aaagaaaaaa aggaaagttt ccaaacttga aaaatttaca       60

aactgctact ccctttccaa gacgcttagg tttaaagcga tccccgttgg caagacccaa      120

gagaatatcg ataacaaaag acttctggtc gaagatgaaa aaagggccga agactacaag      180

ggggtcaaga agttgctcga tcgctattat ctttccttta tcaacgatgt gcttcattca      240

atcaaactga agaacttgaa taactacatt agccttttca gaaagaaaac gaggactgaa      300

aaggagaaca aggaacttga gaatcttgaa ataaaccttc gcaaagaaat tgcaaaagcc      360

ttcaagggga acgaaggata taaatctctt ttcaaaaaag acattataga aacaattttg      420

cctgagtttc ttgacgacaa ggatgaaatt gcgctcgtca atagctttaa cggatttaca      480

actgccttca cagggttctt cgacaatagg gagaatatgt ttagcgagga ggcaaaaagc      540

acatccatcg cattcagatg catcaatgaa aatcttaccc ggtacatatc gaatatggac      600

atatttgaaa aagtggatgc aatattcgat aagcacgaag tccaggagat aaaggaaaag      660

atactgaata gcgactatga tgtcgaagat tttttcgaag gtgagttctt caactttgtc      720

ctgactcaag aaggcattga tgtctataat gcaataattg gaggttttgt gactgagtct      780

ggcgagaaga taaagggctt gaacgagtat atcaatctct acaaccagaa gactaagcaa      840

aagttgccta aatttaaacc gctttacaag caagttttga gcgaccggga aagcctttcc      900

ttttacggtg aaggatacac gagcgatgaa gaagtcctcg aagtcttccg caacacactc      960

aacaagaact cagaaatctt ttcctcaatt aaaaaattgg agaagctttt caagaacttc     1020

gatgaatact cttcggcggg gatttttgtg aagaacggcc cggcaatttc cacaatatct     1080

aaagacattt tcggagaatg gaacgtgata agagacaagt ggaatgcgga gtatgatgac     1140

atacacctga agaagaaggc agttgtgact gaaaaatacg aagatgacag gagaaaaagc     1200

tttaaaaaga tcgggtcctt ttcactggaa cagctgcagg agtatgccga cgccgatctt     1260

tcggttgtcg aaaagctcaa agaaataatt atccagaagg tcgatgaaat ctacaaggtg     1320

tacggctcaa gcgagaagct ctttgatgct gacttcgtgt tggagaagtc tcttaaaaaa     1380

aacgacgcag tcgtcgcgat aatgaaagat ttgctggatt cagtgaaatc cttcgagaat     1440

tatatcaaag ccttcttcgg cgaggggaag gagacaaaca gggatgagtc cttctatgga     1500

gacttcgttc tggcttacga catccttctt aaggtcgacc acatctatga cgcaattcgg     1560

aactatgtga cgcagaagcc gtattcgaaa gataagttca agctctattt ccaaaaccct     1620

caatttatgg gtgggtggga taaagacaaa gagaccgatt accgggcaac aattttgcgg     1680

tacgggtcta aatattacct cgctataatg gataagaaat acgctaaatg tctccagaaa     1740

attgacaaag atgacgtcaa cggcaattat gaaaaaatca attataaact ccttcctggc     1800

ccaaataaaa tgctcccgaa ggtgtttttt tccaaaaagt ggatggccta ttataatcca     1860

tcagaggata ttcagaaaat ctataaaaat gggaccttta agaagggtga catgtttaac     1920

ctgaacgatt gccacaagct tatagatttt ttcaaagact ctattagccg ctatcccaaa     1980

tggtctaatg cttatgattt caacttctct gaaactgaaa agtacaaaga tattgcagga     2040

ttctaccgcg aagttgaaga acaaggttat aaggtttcct ttgagtctgc gtccaagaaa     2100

gaggtcgata agttggtcga agaagggaaa ttgtatatgt ttcaaattta caataaagac     2160

ttttccgaca agtcccatgg tacacctaat ctgcatacca tgtacttcaa actgctgttc     2220

gatgagaata atcacggtca gattcgcctg agcggagggg cggaactctt catgaggaga     2280

gcatcgttga aaaaagagga gctcgtcgtg catccggcta acagccccat tgctaacaag     2340

aatccggata atccaaagaa gactactacc ctctcctatg acgtctataa ggataagaga     2400

ttctctgagg accagtacga gttgcacatc cctattgcga taaataaatg ccctaagaac     2460

atctttaaaa tcaatactga ggtcagagtc ctgcttaagc acgacgacaa cccgtatgtg     2520

atcgggattg ataggggtga aaggaacttg ctttatattg tggttgtcga tggaaaaggt     2580

aatatagtgg aacaatactc tctgaatgaa attatcaaca acttcaatgg cattaggatc     2640

aagaccgact atcattctct gttggacaag aaagagaaag agcgcttcga ggcacggcaa     2700

aactggacgt ctattgagaa catcaaggag cttaaggctg gttacatttc tcaggttgtg     2760

cacaaaattt gcgaactggt cgagaaatat gatgccgtta tcgcacttga agatctcaac     2820

agcggattta agaattctcg ggtgaaagtc gaaaaacagg tgtatcaaaa attcgaaaag     2880

atgctgatcg acaagctcaa ttatatggtt gataaaaaga gcaacccatg cgccacgggg     2940

ggtgcgctta agggctatca gattacgaac aaatttgaat ccttcaagtc aatgtcgacg     3000

caaaatgggt ttatattcta tataccggcg tggcttacat ctaaaataga tcctagcact     3060

gggttcgtga acctgctgaa aaccaagtac acttcaatcg cagattctaa aaaatttata     3120

agcagcttcg acagaatcat gtatgtgccc gaggaagacc tcttcgagtt tgcccttgat     3180

tacaaaaatt tctcaagaac ggatgcagac tacataaaga agtggaagct gtactcttat     3240

gggaaccgga ttcggatatt cagaaatccg aaaaaaaaca atgtctttga ttgggaggaa     3300

gtttgtctta cctctgctta caaagagctg ttcaataaat atggcattaa ttaccagcaa     3360

ggtgatatcc gggcgctcct ttgcgaacag tctgacaaag ctttctattc ttcatttatg     3420

gcgctcatgt cattgatgct gcagatgagg aatagcatta cggggaggac tgatgttgac     3480

tttctgatct cgcccgtgaa aaattctgat ggaatcttct acgattccag gaattatgag     3540

gcccaggaaa atgctatcct tcccaagaac gcagacgcaa atggcgcgta caatatagct     3600

cgcaaggttt tgtgggctat aggccaattc aagaaagccg aagacgaaaa gctggacaaa     3660

gttaagattg ctatatctaa caaagagtgg cttgagtatg cgcaaacatc tgttaaacac     3720

aaacgccccg cggctacaaa gaaggctggc caggcaaaga agaagaagtg a              3771


<210>  14
<211>  3684
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  codon optimized polynucleotide

<400>  14
atgtccaaac ttgaaaaatt tacaaactgc tactcccttt ccaagacgct taggtttaaa       60

gcgatccccg ttggcaagac ccaagagaat atcgataaca aaagacttct ggtcgaagat      120

gaaaaaaggg ccgaagacta caagggggtc aagaagttgc tcgatcgcta ttatctttcc      180

tttatcaacg atgtgcttca ttcaatcaaa ctgaagaact tgaataacta cattagcctt      240

ttcagaaaga aaacgaggac tgaaaaggag aacaaggaac ttgagaatct tgaaataaac      300

cttcgcaaag aaattgcaaa agccttcaag gggaacgaag gatataaatc tcttttcaaa      360

aaagacatta tagaaacaat tttgcctgag tttcttgacg acaaggatga aattgcgctc      420

gtcaatagct ttaacggatt tacaactgcc ttcacagggt tcttcgacaa tagggagaat      480

atgtttagcg aggaggcaaa aagcacatcc atcgcattca gatgcatcaa tgaaaatctt      540

acccggtaca tatcgaatat ggacatattt gaaaaagtgg atgcaatatt cgataagcac      600

gaagtccagg agataaagga aaagatactg aatagcgact atgatgtcga agattttttc      660

gaaggtgagt tcttcaactt tgtcctgact caagaaggca ttgatgtcta taatgcaata      720

attggaggtt ttgtgactga gtctggcgag aagataaagg gcttgaacga gtatatcaat      780

ctctacaacc agaagactaa gcaaaagttg cctaaattta aaccgcttta caagcaagtt      840

ttgagcgacc gggaaagcct ttccttttac ggtgaaggat acacgagcga tgaagaagtc      900

ctcgaagtct tccgcaacac actcaacaag aactcagaaa tcttttcctc aattaaaaaa      960

ttggagaagc ttttcaagaa cttcgatgaa tactcttcgg cggggatttt tgtgaagaac     1020

ggcccggcaa tttccacaat atctaaagac attttcggag aatggaacgt gataagagac     1080

aagtggaatg cggagtatga tgacatacac ctgaagaaga aggcagttgt gactgaaaaa     1140

tacgaagatg acaggagaaa aagctttaaa aagatcgggt ccttttcact ggaacagctg     1200

caggagtatg ccgacgccga tctttcggtt gtcgaaaagc tcaaagaaat aattatccag     1260

aaggtcgatg aaatctacaa ggtgtacggc tcaagcgaga agctctttga tgctgacttc     1320

gtgttggaga agtctcttaa aaaaaacgac gcagtcgtcg cgataatgaa agatttgctg     1380

gattcagtga aatccttcga gaattatatc aaagccttct tcggcgaggg gaaggagaca     1440

aacagggatg agtccttcta tggagacttc gttctggctt acgacatcct tcttaaggtc     1500

gaccacatct atgacgcaat tcggaactat gtgacgcaga agccgtattc gaaagataag     1560

ttcaagctct atttccaaaa ccctcaattt atgggtgggt gggataaaga caaagagacc     1620

gattaccggg caacaatttt gcggtacggg tctaaatatt acctcgctat aatggataag     1680

aaatacgcta aatgtctcca gaaaattgac aaagatgacg tcaacggcaa ttatgaaaaa     1740

atcaattata aactccttcc tggcccaaat aaaatgctcc cgaaggtgtt tttttccaaa     1800

aagtggatgg cctattataa tccatcagag gatattcaga aaatctataa aaatgggacc     1860

tttaagaagg gtgacatgtt taacctgaac gattgccaca agcttataga ttttttcaaa     1920

gactctatta gccgctatcc caaatggtct aatgcttatg atttcaactt ctctgaaact     1980

gaaaagtaca aagatattgc aggattctac cgcgaagttg aagaacaagg ttataaggtt     2040

tcctttgagt ctgcgtccaa gaaagaggtc gataagttgg tcgaagaagg gaaattgtat     2100

atgtttcaaa tttacaataa agacttttcc gacaagtccc atggtacacc taatctgcat     2160

accatgtact tcaaactgct gttcgatgag aataatcacg gtcagattcg cctgagcgga     2220

ggggcggaac tcttcatgag gagagcatcg ttgaaaaaag aggagctcgt cgtgcatccg     2280

gctaacagcc ccattgctaa caagaatccg gataatccaa agaagactac taccctctcc     2340

tatgacgtct ataaggataa gagattctct gaggaccagt acgagttgca catccctatt     2400

gcgataaata aatgccctaa gaacatcttt aaaatcaata ctgaggtcag agtcctgctt     2460

aagcacgacg acaacccgta tgtgatcggg attgataggg gtgaaaggaa cttgctttat     2520

attgtggttg tcgatggaaa aggtaatata gtggaacaat actctctgaa tgaaattatc     2580

aacaacttca atggcattag gatcaagacc gactatcatt ctctgttgga caagaaagag     2640

aaagagcgct tcgaggcacg gcaaaactgg acgtctattg agaacatcaa ggagcttaag     2700

gctggttaca tttctcaggt tgtgcacaaa atttgcgaac tggtcgagaa atatgatgcc     2760

gttatcgcac ttgaagatct caacagcgga tttaagaatt ctcgggtgaa agtcgaaaaa     2820

caggtgtatc aaaaattcga aaagatgctg atcgacaagc tcaattatat ggttgataaa     2880

aagagcaacc catgcgccac ggggggtgcg cttaagggct atcagattac gaacaaattt     2940

gaatccttca agtcaatgtc gacgcaaaat gggtttatat tctatatacc ggcgtggctt     3000

acatctaaaa tagatcctag cactgggttc gtgaacctgc tgaaaaccaa gtacacttca     3060

atcgcagatt ctaaaaaatt tataagcagc ttcgacagaa tcatgtatgt gcccgaggaa     3120

gacctcttcg agtttgccct tgattacaaa aatttctcaa gaacggatgc agactacata     3180

aagaagtgga agctgtactc ttatgggaac cggattcgga tattcagaaa tccgaaaaaa     3240

aacaatgtct ttgattggga ggaagtttgt cttacctctg cttacaaaga gctgttcaat     3300

aaatatggca ttaattacca gcaaggtgat atccgggcgc tcctttgcga acagtctgac     3360

aaagctttct attcttcatt tatggcgctc atgtcattga tgctgcagat gaggaatagc     3420

attacgggga ggactgatgt tgactttctg atctcgcccg tgaaaaattc tgatggaatc     3480

ttctacgatt ccaggaatta tgaggcccag gaaaatgcta tccttcccaa gaacgcagac     3540

gcaaatggcg cgtacaatat agctcgcaag gttttgtggg ctataggcca attcaagaaa     3600

gccgaagacg aaaagctgga caaagttaag attgctatat ctaacaaaga gtggcttgag     3660

tatgcgcaaa catctgttaa acac                                            3684


<210>  15
<211>  1255
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  codon optimized polypeptide

<400>  15

Met Ala Ser Ser Met Ala Pro Lys Lys Lys Arg Lys Val Ser Lys Leu 
1               5                   10                  15      


Glu Lys Phe Thr Asn Cys Tyr Ser Leu Ser Lys Thr Leu Arg Phe Lys 
            20                  25                  30          


Ala Ile Pro Val Gly Lys Thr Gln Glu Asn Ile Asp Asn Lys Arg Leu 
        35                  40                  45              


Leu Val Glu Asp Glu Lys Arg Ala Glu Asp Tyr Lys Gly Val Lys Lys 
    50                  55                  60                  


Leu Leu Asp Arg Tyr Tyr Leu Ser Phe Ile Asn Asp Val Leu His Ser 
65                  70                  75                  80  


Ile Lys Leu Lys Asn Leu Asn Asn Tyr Ile Ser Leu Phe Arg Lys Lys 
                85                  90                  95      


Thr Arg Thr Glu Lys Glu Asn Lys Glu Leu Glu Asn Leu Glu Ile Asn 
            100                 105                 110         


Leu Arg Lys Glu Ile Ala Lys Ala Phe Lys Gly Asn Glu Gly Tyr Lys 
        115                 120                 125             


Ser Leu Phe Lys Lys Asp Ile Ile Glu Thr Ile Leu Pro Glu Phe Leu 
    130                 135                 140                 


Asp Asp Lys Asp Glu Ile Ala Leu Val Asn Ser Phe Asn Gly Phe Thr 
145                 150                 155                 160 


Thr Ala Phe Thr Gly Phe Phe Asp Asn Arg Glu Asn Met Phe Ser Glu 
                165                 170                 175     


Glu Ala Lys Ser Thr Ser Ile Ala Phe Arg Cys Ile Asn Glu Asn Leu 
            180                 185                 190         


Thr Arg Tyr Ile Ser Asn Met Asp Ile Phe Glu Lys Val Asp Ala Ile 
        195                 200                 205             


Phe Asp Lys His Glu Val Gln Glu Ile Lys Glu Lys Ile Leu Asn Ser 
    210                 215                 220                 


Asp Tyr Asp Val Glu Asp Phe Phe Glu Gly Glu Phe Phe Asn Phe Val 
225                 230                 235                 240 


Leu Thr Gln Glu Gly Ile Asp Val Tyr Asn Ala Ile Ile Gly Gly Phe 
                245                 250                 255     


Val Thr Glu Ser Gly Glu Lys Ile Lys Gly Leu Asn Glu Tyr Ile Asn 
            260                 265                 270         


Leu Tyr Asn Gln Lys Thr Lys Gln Lys Leu Pro Lys Phe Lys Pro Leu 
        275                 280                 285             


Tyr Lys Gln Val Leu Ser Asp Arg Glu Ser Leu Ser Phe Tyr Gly Glu 
    290                 295                 300                 


Gly Tyr Thr Ser Asp Glu Glu Val Leu Glu Val Phe Arg Asn Thr Leu 
305                 310                 315                 320 


Asn Lys Asn Ser Glu Ile Phe Ser Ser Ile Lys Lys Leu Glu Lys Leu 
                325                 330                 335     


Phe Lys Asn Phe Asp Glu Tyr Ser Ser Ala Gly Ile Phe Val Lys Asn 
            340                 345                 350         


Gly Pro Ala Ile Ser Thr Ile Ser Lys Asp Ile Phe Gly Glu Trp Asn 
        355                 360                 365             


Val Ile Arg Asp Lys Trp Asn Ala Glu Tyr Asp Asp Ile His Leu Lys 
    370                 375                 380                 


Lys Lys Ala Val Val Thr Glu Lys Tyr Glu Asp Asp Arg Arg Lys Ser 
385                 390                 395                 400 


Phe Lys Lys Ile Gly Ser Phe Ser Leu Glu Gln Leu Gln Glu Tyr Ala 
                405                 410                 415     


Asp Ala Asp Leu Ser Val Val Glu Lys Leu Lys Glu Ile Ile Ile Gln 
            420                 425                 430         


Lys Val Asp Glu Ile Tyr Lys Val Tyr Gly Ser Ser Glu Lys Leu Phe 
        435                 440                 445             


Asp Ala Asp Phe Val Leu Glu Lys Ser Leu Lys Lys Asn Asp Ala Val 
    450                 455                 460                 


Val Ala Ile Met Lys Asp Leu Leu Asp Ser Val Lys Ser Phe Glu Asn 
465                 470                 475                 480 


Tyr Ile Lys Ala Phe Phe Gly Glu Gly Lys Glu Thr Asn Arg Asp Glu 
                485                 490                 495     


Ser Phe Tyr Gly Asp Phe Val Leu Ala Tyr Asp Ile Leu Leu Lys Val 
            500                 505                 510         


Asp His Ile Tyr Asp Ala Ile Arg Asn Tyr Val Thr Gln Lys Pro Tyr 
        515                 520                 525             


Ser Lys Asp Lys Phe Lys Leu Tyr Phe Gln Asn Pro Gln Phe Met Gly 
    530                 535                 540                 


Gly Trp Asp Lys Asp Lys Glu Thr Asp Tyr Arg Ala Thr Ile Leu Arg 
545                 550                 555                 560 


Tyr Gly Ser Lys Tyr Tyr Leu Ala Ile Met Asp Lys Lys Tyr Ala Lys 
                565                 570                 575     


Cys Leu Gln Lys Ile Asp Lys Asp Asp Val Asn Gly Asn Tyr Glu Lys 
            580                 585                 590         


Ile Asn Tyr Lys Leu Leu Pro Gly Pro Asn Lys Met Leu Pro Lys Val 
        595                 600                 605             


Phe Phe Ser Lys Lys Trp Met Ala Tyr Tyr Asn Pro Ser Glu Asp Ile 
    610                 615                 620                 


Gln Lys Ile Tyr Lys Asn Gly Thr Phe Lys Lys Gly Asp Met Phe Asn 
625                 630                 635                 640 


Leu Asn Asp Cys His Lys Leu Ile Asp Phe Phe Lys Asp Ser Ile Ser 
                645                 650                 655     


Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp Phe Asn Phe Ser Glu Thr 
            660                 665                 670         


Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr Arg Glu Val Glu Glu Gln 
        675                 680                 685             


Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser Lys Lys Glu Val Asp Lys 
    690                 695                 700                 


Leu Val Glu Glu Gly Lys Leu Tyr Met Phe Gln Ile Tyr Asn Lys Asp 
705                 710                 715                 720 


Phe Ser Asp Lys Ser His Gly Thr Pro Asn Leu His Thr Met Tyr Phe 
                725                 730                 735     


Lys Leu Leu Phe Asp Glu Asn Asn His Gly Gln Ile Arg Leu Ser Gly 
            740                 745                 750         


Gly Ala Glu Leu Phe Met Arg Arg Ala Ser Leu Lys Lys Glu Glu Leu 
        755                 760                 765             


Val Val His Pro Ala Asn Ser Pro Ile Ala Asn Lys Asn Pro Asp Asn 
    770                 775                 780                 


Pro Lys Lys Thr Thr Thr Leu Ser Tyr Asp Val Tyr Lys Asp Lys Arg 
785                 790                 795                 800 


Phe Ser Glu Asp Gln Tyr Glu Leu His Ile Pro Ile Ala Ile Asn Lys 
                805                 810                 815     


Cys Pro Lys Asn Ile Phe Lys Ile Asn Thr Glu Val Arg Val Leu Leu 
            820                 825                 830         


Lys His Asp Asp Asn Pro Tyr Val Ile Gly Ile Asp Arg Gly Glu Arg 
        835                 840                 845             


Asn Leu Leu Tyr Ile Val Val Val Asp Gly Lys Gly Asn Ile Val Glu 
    850                 855                 860                 


Gln Tyr Ser Leu Asn Glu Ile Ile Asn Asn Phe Asn Gly Ile Arg Ile 
865                 870                 875                 880 


Lys Thr Asp Tyr His Ser Leu Leu Asp Lys Lys Glu Lys Glu Arg Phe 
                885                 890                 895     


Glu Ala Arg Gln Asn Trp Thr Ser Ile Glu Asn Ile Lys Glu Leu Lys 
            900                 905                 910         


Ala Gly Tyr Ile Ser Gln Val Val His Lys Ile Cys Glu Leu Val Glu 
        915                 920                 925             


Lys Tyr Asp Ala Val Ile Ala Leu Glu Asp Leu Asn Ser Gly Phe Lys 
    930                 935                 940                 


Asn Ser Arg Val Lys Val Glu Lys Gln Val Tyr Gln Lys Phe Glu Lys 
945                 950                 955                 960 


Met Leu Ile Asp Lys Leu Asn Tyr Met Val Asp Lys Lys Ser Asn Pro 
                965                 970                 975     


Cys Ala Thr Gly Gly Ala Leu Lys Gly Tyr Gln Ile Thr Asn Lys Phe 
            980                 985                 990         


Glu Ser Phe Lys Ser Met Ser Thr  Gln Asn Gly Phe Ile  Phe Tyr Ile 
        995                 1000                 1005             


Pro Ala  Trp Leu Thr Ser Lys  Ile Asp Pro Ser Thr  Gly Phe Val 
    1010                 1015                 1020             


Asn Leu  Leu Lys Thr Lys Tyr  Thr Ser Ile Ala Asp  Ser Lys Lys 
    1025                 1030                 1035             


Phe Ile  Ser Ser Phe Asp Arg  Ile Met Tyr Val Pro  Glu Glu Asp 
    1040                 1045                 1050             


Leu Phe  Glu Phe Ala Leu Asp  Tyr Lys Asn Phe Ser  Arg Thr Asp 
    1055                 1060                 1065             


Ala Asp  Tyr Ile Lys Lys Trp  Lys Leu Tyr Ser Tyr  Gly Asn Arg 
    1070                 1075                 1080             


Ile Arg  Ile Phe Arg Asn Pro  Lys Lys Asn Asn Val  Phe Asp Trp 
    1085                 1090                 1095             


Glu Glu  Val Cys Leu Thr Ser  Ala Tyr Lys Glu Leu  Phe Asn Lys 
    1100                 1105                 1110             


Tyr Gly  Ile Asn Tyr Gln Gln  Gly Asp Ile Arg Ala  Leu Leu Cys 
    1115                 1120                 1125             


Glu Gln  Ser Asp Lys Ala Phe  Tyr Ser Ser Phe Met  Ala Leu Met 
    1130                 1135                 1140             


Ser Leu  Met Leu Gln Met Arg  Asn Ser Ile Thr Gly  Arg Thr Asp 
    1145                 1150                 1155             


Val Asp  Phe Leu Ile Ser Pro  Val Lys Asn Ser Asp  Gly Ile Phe 
    1160                 1165                 1170             


Tyr Asp  Ser Arg Asn Tyr Glu  Ala Gln Glu Asn Ala  Ile Leu Pro 
    1175                 1180                 1185             


Lys Asn  Ala Asp Ala Asn Gly  Ala Tyr Asn Ile Ala  Arg Lys Val 
    1190                 1195                 1200             


Leu Trp  Ala Ile Gly Gln Phe  Lys Lys Ala Glu Asp  Glu Lys Leu 
    1205                 1210                 1215             


Asp Lys  Val Lys Ile Ala Ile  Ser Asn Lys Glu Trp  Leu Glu Tyr 
    1220                 1225                 1230             


Ala Gln  Thr Ser Val Lys His  Lys Arg Pro Ala Ala  Thr Lys Lys 
    1235                 1240                 1245             


Ala Gly  Gln Ala Lys Lys Lys  
    1250                 1255 


<210>  16
<211>  1228
<212>  PRT
<213>  Lachnospiraceae bacterium

<400>  16

Met Ser Lys Leu Glu Lys Phe Thr Asn Cys Tyr Ser Leu Ser Lys Thr 
1               5                   10                  15      


Leu Arg Phe Lys Ala Ile Pro Val Gly Lys Thr Gln Glu Asn Ile Asp 
            20                  25                  30          


Asn Lys Arg Leu Leu Val Glu Asp Glu Lys Arg Ala Glu Asp Tyr Lys 
        35                  40                  45              


Gly Val Lys Lys Leu Leu Asp Arg Tyr Tyr Leu Ser Phe Ile Asn Asp 
    50                  55                  60                  


Val Leu His Ser Ile Lys Leu Lys Asn Leu Asn Asn Tyr Ile Ser Leu 
65                  70                  75                  80  


Phe Arg Lys Lys Thr Arg Thr Glu Lys Glu Asn Lys Glu Leu Glu Asn 
                85                  90                  95      


Leu Glu Ile Asn Leu Arg Lys Glu Ile Ala Lys Ala Phe Lys Gly Asn 
            100                 105                 110         


Glu Gly Tyr Lys Ser Leu Phe Lys Lys Asp Ile Ile Glu Thr Ile Leu 
        115                 120                 125             


Pro Glu Phe Leu Asp Asp Lys Asp Glu Ile Ala Leu Val Asn Ser Phe 
    130                 135                 140                 


Asn Gly Phe Thr Thr Ala Phe Thr Gly Phe Phe Asp Asn Arg Glu Asn 
145                 150                 155                 160 


Met Phe Ser Glu Glu Ala Lys Ser Thr Ser Ile Ala Phe Arg Cys Ile 
                165                 170                 175     


Asn Glu Asn Leu Thr Arg Tyr Ile Ser Asn Met Asp Ile Phe Glu Lys 
            180                 185                 190         


Val Asp Ala Ile Phe Asp Lys His Glu Val Gln Glu Ile Lys Glu Lys 
        195                 200                 205             


Ile Leu Asn Ser Asp Tyr Asp Val Glu Asp Phe Phe Glu Gly Glu Phe 
    210                 215                 220                 


Phe Asn Phe Val Leu Thr Gln Glu Gly Ile Asp Val Tyr Asn Ala Ile 
225                 230                 235                 240 


Ile Gly Gly Phe Val Thr Glu Ser Gly Glu Lys Ile Lys Gly Leu Asn 
                245                 250                 255     


Glu Tyr Ile Asn Leu Tyr Asn Gln Lys Thr Lys Gln Lys Leu Pro Lys 
            260                 265                 270         


Phe Lys Pro Leu Tyr Lys Gln Val Leu Ser Asp Arg Glu Ser Leu Ser 
        275                 280                 285             


Phe Tyr Gly Glu Gly Tyr Thr Ser Asp Glu Glu Val Leu Glu Val Phe 
    290                 295                 300                 


Arg Asn Thr Leu Asn Lys Asn Ser Glu Ile Phe Ser Ser Ile Lys Lys 
305                 310                 315                 320 


Leu Glu Lys Leu Phe Lys Asn Phe Asp Glu Tyr Ser Ser Ala Gly Ile 
                325                 330                 335     


Phe Val Lys Asn Gly Pro Ala Ile Ser Thr Ile Ser Lys Asp Ile Phe 
            340                 345                 350         


Gly Glu Trp Asn Val Ile Arg Asp Lys Trp Asn Ala Glu Tyr Asp Asp 
        355                 360                 365             


Ile His Leu Lys Lys Lys Ala Val Val Thr Glu Lys Tyr Glu Asp Asp 
    370                 375                 380                 


Arg Arg Lys Ser Phe Lys Lys Ile Gly Ser Phe Ser Leu Glu Gln Leu 
385                 390                 395                 400 


Gln Glu Tyr Ala Asp Ala Asp Leu Ser Val Val Glu Lys Leu Lys Glu 
                405                 410                 415     


Ile Ile Ile Gln Lys Val Asp Glu Ile Tyr Lys Val Tyr Gly Ser Ser 
            420                 425                 430         


Glu Lys Leu Phe Asp Ala Asp Phe Val Leu Glu Lys Ser Leu Lys Lys 
        435                 440                 445             


Asn Asp Ala Val Val Ala Ile Met Lys Asp Leu Leu Asp Ser Val Lys 
    450                 455                 460                 


Ser Phe Glu Asn Tyr Ile Lys Ala Phe Phe Gly Glu Gly Lys Glu Thr 
465                 470                 475                 480 


Asn Arg Asp Glu Ser Phe Tyr Gly Asp Phe Val Leu Ala Tyr Asp Ile 
                485                 490                 495     


Leu Leu Lys Val Asp His Ile Tyr Asp Ala Ile Arg Asn Tyr Val Thr 
            500                 505                 510         


Gln Lys Pro Tyr Ser Lys Asp Lys Phe Lys Leu Tyr Phe Gln Asn Pro 
        515                 520                 525             


Gln Phe Met Gly Gly Trp Asp Lys Asp Lys Glu Thr Asp Tyr Arg Ala 
    530                 535                 540                 


Thr Ile Leu Arg Tyr Gly Ser Lys Tyr Tyr Leu Ala Ile Met Asp Lys 
545                 550                 555                 560 


Lys Tyr Ala Lys Cys Leu Gln Lys Ile Asp Lys Asp Asp Val Asn Gly 
                565                 570                 575     


Asn Tyr Glu Lys Ile Asn Tyr Lys Leu Leu Pro Gly Pro Asn Lys Met 
            580                 585                 590         


Leu Pro Lys Val Phe Phe Ser Lys Lys Trp Met Ala Tyr Tyr Asn Pro 
        595                 600                 605             


Ser Glu Asp Ile Gln Lys Ile Tyr Lys Asn Gly Thr Phe Lys Lys Gly 
    610                 615                 620                 


Asp Met Phe Asn Leu Asn Asp Cys His Lys Leu Ile Asp Phe Phe Lys 
625                 630                 635                 640 


Asp Ser Ile Ser Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp Phe Asn 
                645                 650                 655     


Phe Ser Glu Thr Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr Arg Glu 
            660                 665                 670         


Val Glu Glu Gln Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser Lys Lys 
        675                 680                 685             


Glu Val Asp Lys Leu Val Glu Glu Gly Lys Leu Tyr Met Phe Gln Ile 
    690                 695                 700                 


Tyr Asn Lys Asp Phe Ser Asp Lys Ser His Gly Thr Pro Asn Leu His 
705                 710                 715                 720 


Thr Met Tyr Phe Lys Leu Leu Phe Asp Glu Asn Asn His Gly Gln Ile 
                725                 730                 735     


Arg Leu Ser Gly Gly Ala Glu Leu Phe Met Arg Arg Ala Ser Leu Lys 
            740                 745                 750         


Lys Glu Glu Leu Val Val His Pro Ala Asn Ser Pro Ile Ala Asn Lys 
        755                 760                 765             


Asn Pro Asp Asn Pro Lys Lys Thr Thr Thr Leu Ser Tyr Asp Val Tyr 
    770                 775                 780                 


Lys Asp Lys Arg Phe Ser Glu Asp Gln Tyr Glu Leu His Ile Pro Ile 
785                 790                 795                 800 


Ala Ile Asn Lys Cys Pro Lys Asn Ile Phe Lys Ile Asn Thr Glu Val 
                805                 810                 815     


Arg Val Leu Leu Lys His Asp Asp Asn Pro Tyr Val Ile Gly Ile Asp 
            820                 825                 830         


Arg Gly Glu Arg Asn Leu Leu Tyr Ile Val Val Val Asp Gly Lys Gly 
        835                 840                 845             


Asn Ile Val Glu Gln Tyr Ser Leu Asn Glu Ile Ile Asn Asn Phe Asn 
    850                 855                 860                 


Gly Ile Arg Ile Lys Thr Asp Tyr His Ser Leu Leu Asp Lys Lys Glu 
865                 870                 875                 880 


Lys Glu Arg Phe Glu Ala Arg Gln Asn Trp Thr Ser Ile Glu Asn Ile 
                885                 890                 895     


Lys Glu Leu Lys Ala Gly Tyr Ile Ser Gln Val Val His Lys Ile Cys 
            900                 905                 910         


Glu Leu Val Glu Lys Tyr Asp Ala Val Ile Ala Leu Glu Asp Leu Asn 
        915                 920                 925             


Ser Gly Phe Lys Asn Ser Arg Val Lys Val Glu Lys Gln Val Tyr Gln 
    930                 935                 940                 


Lys Phe Glu Lys Met Leu Ile Asp Lys Leu Asn Tyr Met Val Asp Lys 
945                 950                 955                 960 


Lys Ser Asn Pro Cys Ala Thr Gly Gly Ala Leu Lys Gly Tyr Gln Ile 
                965                 970                 975     


Thr Asn Lys Phe Glu Ser Phe Lys Ser Met Ser Thr Gln Asn Gly Phe 
            980                 985                 990         


Ile Phe Tyr Ile Pro Ala Trp Leu  Thr Ser Lys Ile Asp  Pro Ser Thr 
        995                 1000                 1005             


Gly Phe  Val Asn Leu Leu Lys  Thr Lys Tyr Thr Ser  Ile Ala Asp 
    1010                 1015                 1020             


Ser Lys  Lys Phe Ile Ser Ser  Phe Asp Arg Ile Met  Tyr Val Pro 
    1025                 1030                 1035             


Glu Glu  Asp Leu Phe Glu Phe  Ala Leu Asp Tyr Lys  Asn Phe Ser 
    1040                 1045                 1050             


Arg Thr  Asp Ala Asp Tyr Ile  Lys Lys Trp Lys Leu  Tyr Ser Tyr 
    1055                 1060                 1065             


Gly Asn  Arg Ile Arg Ile Phe  Arg Asn Pro Lys Lys  Asn Asn Val 
    1070                 1075                 1080             


Phe Asp  Trp Glu Glu Val Cys  Leu Thr Ser Ala Tyr  Lys Glu Leu 
    1085                 1090                 1095             


Phe Asn  Lys Tyr Gly Ile Asn  Tyr Gln Gln Gly Asp  Ile Arg Ala 
    1100                 1105                 1110             


Leu Leu  Cys Glu Gln Ser Asp  Lys Ala Phe Tyr Ser  Ser Phe Met 
    1115                 1120                 1125             


Ala Leu  Met Ser Leu Met Leu  Gln Met Arg Asn Ser  Ile Thr Gly 
    1130                 1135                 1140             


Arg Thr  Asp Val Asp Phe Leu  Ile Ser Pro Val Lys  Asn Ser Asp 
    1145                 1150                 1155             


Gly Ile  Phe Tyr Asp Ser Arg  Asn Tyr Glu Ala Gln  Glu Asn Ala 
    1160                 1165                 1170             


Ile Leu  Pro Lys Asn Ala Asp  Ala Asn Gly Ala Tyr  Asn Ile Ala 
    1175                 1180                 1185             


Arg Lys  Val Leu Trp Ala Ile  Gly Gln Phe Lys Lys  Ala Glu Asp 
    1190                 1195                 1200             


Glu Lys  Leu Asp Lys Val Lys  Ile Ala Ile Ser Asn  Lys Glu Trp 
    1205                 1210                 1215             


Leu Glu  Tyr Ala Gln Thr Ser  Val Lys His 
    1220                 1225             


<210>  17
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  HH ribozyme encoding sequence

<400>  17
ctgatgagtc cgtgaggacg aaacgagtaa gctcgtc                                37


<210>  18
<211>  37
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  HH ribozyme sequence

<400>  18
cugaugaguc cgugaggacg aaacgaguaa gcucguc                                37


<210>  19
<211>  53
<212>  DNA
<213>  Oryza sativa

<400>  19
ccgccaacac tgccaatgcc ggtcccaagc ccggataaaa gtggaggggg cgg              53


<210>  20
<211>  53
<212>  RNA
<213>  Oryza sativa

<400>  20
ccgccaacac ugccaaugcc ggucccaagc ccggauaaaa guggaggggg cgg              53


<210>  21
<211>  63
<212>  DNA
<213>  Helianthus annuus

<400>  21
gcggggggcg tcagtcctac tctgcacctc ctcgtggtgt cgcctgggaa ccctctttcg       60

caa                                                                     63


<210>  22
<211>  63
<212>  RNA
<213>  Helianthus annuus

<400>  22
gcggggggcg ucaguccuac ucugcaccuc cucguggugu cgccugggaa cccucuuucg       60

caa                                                                     63


<210>  23
<211>  86
<212>  DNA
<213>  Helianthus annuus

<400>  23
gcggggggcg tcagtcctac tctgcacctc ctcgtggtgt cgcctgggaa ccctctttcg       60

caagaaagag gagccaagca gagagg                                            86


<210>  24
<211>  86
<212>  RNA
<213>  Helianthus annuus

<400>  24
gcggggggcg ucaguccuac ucugcaccuc cucguggugu cgccugggaa cccucuuucg       60

caagaaagag gagccaagca gagagg                                            86


<210>  25
<211>  80
<212>  DNA
<213>  Cynara scolymus

<400>  25
ggcgtcagtc ctactctgca cctcctcgtg gtgtcgcctg ggaaccctct ttcacaagaa       60

agaggagcca agcagagagg                                                   80


<210>  26
<211>  80
<212>  RNA
<213>  Cynara scolymus

<400>  26
ggcgucaguc cuacucugca ccuccucgug gugucgccug ggaacccucu uucacaagaa       60

agaggagcca agcagagagg                                                   80


<210>  27
<211>  68
<212>  DNA
<213>  Hepatitis Delta Virus

<400>  27
ggccggcatg gtcccagcct cctcgctggc gccggctggg caacatgctt cggcatggcg       60

aatgggac                                                                68


<210>  28
<211>  68
<212>  RNA
<213>  Hepatitis Delta Virus

<400>  28
ggccggcaug gucccagccu ccucgcuggc gccggcuggg caacaugcuu cggcauggcg       60

aaugggac                                                                68


<210>  29
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  scaffold RNA encoding sequnece

<400>  29
taatttctac taagtgtaga t                                                 21


<210>  30
<211>  21
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  scaffold RNA sequence

<400>  30
uaauuucuac uaaguguaga u                                                 21


<210>  31
<211>  717
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  tdTomato polynucleotide

<400>  31
atggtgagca agggcgagga ggtcatcaaa gagttcatgc gcttcaaggt gcgcatggag       60

ggctccatga acggccacga gttcgagatc gagggcgagg gcgagggccg cccctacgag      120

ggcacccaga ccgccaagct gaaggtgacc aagggcggcc ccctgccctt cgcctgggac      180

atcctgtccc cccagttcat gtacggctcc aaggcgtacg tgaagcaccc cgccgacatc      240

cccgattaca agaagctgtc cttccccgag ggcttcaagt gggagcgcgt gatgaacttc      300

gaggacggcg gtctggtgac cgtgacccag gactcctccc tgcaggacgg cacgctgatc      360

tacaaggtga agatgcgcgg caccaacttc ccccccgacg gccccgtaat gcagaagaag      420

accatgggct gggaggcctc caccgagcgc ctgtaccccc gcgacggcgt gctgaagggc      480

gagatccacc aggccctgaa gctgaaggac ggcggccact acctggtgga gttcaagacc      540

atctacatgg ccaagaagcc cgtgcaactg cccggctact actacgtgga caccaagctg      600

gacatcacct cccacaacga ggactacacc atcgtggaac agtacgagcg ctccgagggc      660

cgccaccacc tgttcctgta cggcatggac gagctgtaca agtctagagg tacctga         717


<210>  32
<211>  720
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  mNeoGreen polynucleotide

<400>  32
atggcgtcct ccgtgagtaa aggagaagaa gataacatgg cttcgcttcc agccacacat       60

gagcttcaca tcttcggttc catcaacggc gttgacttcg atatggtcgg acaaggcact      120

gggaacccta atgacggata cgaagagctg aacctcaaga gcaccaaagg tgatcttcag      180

ttttctccat ggattctggt gccacacatt ggctacggat tccatcaata ccttccatac      240

cctgacggaa tgagtccatt ccaagcagcc atggttgatg gctccggata ccaagtccac      300

aggacaatgc agtttgagga cggtgcttcg ctcaccgtca actaccgtta cacttacgaa      360

gggagccaca tcaaaggaga agcccaagtg aaggggacag gctttcctgc tgatggacct      420

gtcatgacca actccttaac tgccgctgat tggtgccggt ccaagaaaac ctaccctaac      480

gacaagacca tcattagtac cttcaaatgg tcttacacca caggcaatgg caagagatat      540

cgctctacag ccaggactac ctacacattc gctaaaccaa tggccgctaa ctaccttaag      600

aaccaaccca tgtacgtgtt ccgtaagact gagttgaaac attccaagac cgaacttaac      660

ttcaaggagt ggcagaaggc atttaccgac gtaatgggca tggatgaact atacaaataa      720


<210>  33
<211>  5693
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pGEP424 tdT embedded gRNA delivery vector

<400>  33
ctgacgcgcc ctgtagcggc acgtcacctg catcaggaat tcaggagaag aactcgagag       60

ggaattgcag atcatgaggc agatggctat ttttgtgtca catatgcgca aaaagagagg      120

ctatatttgt gtccctaggt tcttcgttgt attgcagttt ccatatcaat ctgacttggt      180

cgcatgagaa attgatggtt aaataatttg aatctctcat gtagtatcaa ctattagata      240

ttattttcac caaatatatt tccatcggag aagaagaggc tacagaggaa gcagaagaga      300

ggggtgggag aatttttaca cttttgtaca cccacttaaa cagcaaaatc cgtatgaaaa      360

caggcccacc aaaacaatgc cacgataaca atccgtagaa acaaaagctt catttaacag      420

cggcgcaaca aagcacgctt atccatggta gttgtagtcc gtatgcgatc caaagatcac      480

gattcacgcg tgacggacgg acgacgcgtg ccacaccaca actaacggca tccatggtag      540

ttgtagtccg tatgcgatcc aaagatcacg attcacgcgt gacggacgga cgacgcgcgc      600

cacaccacaa ctaacagcgt gagccagcgt ccaaactccg gatggcaacg gggacgaaac      660

ccgtcgggta gtcactgccc aaacccgtcc ccgcaacctt catcccaaac ccgtccccgt      720

ttccggtcgc gggtttcagt tttctaccag acccgtcccc atcgggtttt tcatccccgt      780

cgggaaatcc gaacccgcca gcatttcagc accaagccaa agttgcagca gcaacatgaa      840

taaaaaacaa cccgtttcaa caccaagata aaacaaaaca ttataattta gacaacattt      900

cacacgtata acaataacat atagttctca catataacaa caccatttca cacataaaac      960

aacaccattt gggataaaaa tatgggctat atcaggccat ttttatgggc catattgagt     1020

tttcgtgggt ttcacaggta ccggatttgt agaatgctga accgggtttg aaccgtaaaa     1080

tccgcgggta ttgaatttga cccaatcccg tcgtcccctg gtggggtaaa aacaccatct     1140

tgagtccaaa cggccaccaa ccaaactccg acggcaacaa acaaacggcg ttgctttgct     1200

cctcggtatc tccgtgaccg ctcaatctcc cggctgtttc cccggaattg cgtggactct     1260

ctcatccaca cgcaaaccgc ctctccctcc tctctcgtcc tatccgcccc ggtgccgtag     1320

cctcacggga ctcttcttcc tcccttgcta taaaatcccc gccccctcct gtctcctctc     1380

cacacatcca aactctcaat cgcaccgaga aaaatctcct agcgatcgaa gcgaagcctc     1440

tcccgatcct ctcaaggtac gcccgtttcc cgtcgatcct cctccttccg ttcgtgttct     1500

gtagccgatc gattcgattc ccttacaccc gttcgtgttc tctcgtggat cgatcgattg     1560

tttgttgcta gaaggaactc gtagatctgg cgtttatgaa ctgtgattcg ggttagtcca     1620

gatcgattca ggtcggtcgt cgttgagcct ctcggctatg tctggattat cgtgtagatc     1680

tgctggttca gttgattatg ttcttctagg agtaatttcg ttgggtcagc gcgatttctg     1740

cttaatctat gctgcttatt gcgcctgtac ctatctacta agctatgtgc acctgtaatt     1800

ttgctagatt attcgttcat cctcgtagtt ggtttgtcac agtaatccgt atgggttctg     1860

acgatgttat tgttggtcat acctaggctt ctccagattt tattttgtta aaattggata     1920

gatctgctac tgatagttga tgatggaatt tggtgctgaa tctatgctat ttattgcgcc     1980

tatacctgat ctatcgggct atgtacggct gtagtttact ggattattcg ttcatcctcg     2040

gtagttggtt catcgtttgg gttctgacga taatattgtt gattatgcgt aggcttctgc     2100

agattgttgt taaaattgga tacatcggtt actgatggtt gatgatagat ttgtgctgaa     2160

cctatctgtt tattgctcct atacctgatc tatagggcta tgtatgcctg taatttacca     2220

gattattcgt tcatcctcgt agttggttca tctctataat tcgtatgggt tcttatgatg     2280

ttatcgttga ttatgcctag tcttatacag attattgtgt caagattgaa tatacctgct     2340

actgatcggt gataatttgg ttagtagttt gcaatctgct aggaacacgt taccactgta     2400

atctgtaaac atggtttgcc agagtagttt gttctactac tcttgatatg gttgctgatt     2460

ttagtcgcct ccttttggat catgtattga tgtccttgca gatttccgtg tacttacccc     2520

ggcttttgtg tacttcgtgt taacagctct agaggatcct ctcaacacaa catatacaaa     2580

acaaacgaat ctcaagcaat caagcattct acttctattg cagcaattta aatcatttct     2640

tttaaagcaa aagcaatttt ctgaaaattt tcaccattta cgaacgatag ggcgcgatcc     2700

cgccaccatg gtgagcaagg gcgaggaggt catcaaagag ttcatgcgct tcaaggtgcg     2760

catggagggc tccatgaacg gccacgagtt cgagatcgag ggcgagggcg agggccgccc     2820

ctacgagggc acccagaccg ccaagctgaa ggtgaccaag ggcggccccc tgcccttcgc     2880

ctgggacatc ctgtcccccc agttcatgta cggctccaag gcgtacgtga agcaccccgc     2940

cgacatcccc gattacaaga agctgtcctt ccccgagggc ttcaagtggg agcgcgtgat     3000

gaacttcgag gacggcggtc tggtgaccgt gacccaggac tcctccctgc aggacggcac     3060

gctgatctac aaggtgaaga tgcgcggcac caacttcccc cccgacggcc ccgtaatgca     3120

gaagaagacc atgggctggg aggcctccac cgagcgcctg tacccccgcg acggcgtgct     3180

gaagggcgag atccaccagg ccctgaagct gaaggacggc ggccactacc tggtggagtt     3240

caagaccatc tacatggcca agaagcccgt gcaactgccc ggctactact acgtggacac     3300

caagctggac atcacctccc acaacgagga ctacaccatc gtggaacagt acgagcgctc     3360

cgagggccgc caccacctgt tcctgtacgg catggacgag ctgtacaagt ctagaggtac     3420

ctgataattt ctactaagtg tagatgagac ggagctcagt ctgaccgcgg cgtctcttaa     3480

tttctactaa gtgtagatcg aatttccccg atcgttcaaa catttggcaa taaagtttct     3540

taagattgaa tcctgttgcc ggtcttgcga tgattatcat ataatttctg ttgaattacg     3600

ttaagcatgt aataattaac atgtaatgca tgacgttatt tatgagatgg gtttttatga     3660

ttagagtccc gcaattatac atttaatacg cgatagaaaa caaaatatag cgcgcaaact     3720

aggataaatt atcgcgcgcg gtgtcatcta tgttactaga tcgctcgacg cggccgccat     3780

ggcctctagt ggatcaggtg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc     3840

ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg     3900

ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg     3960

cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg     4020

actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac     4080

cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca     4140

tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt     4200

gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc     4260

caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag     4320

agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac     4380

tagaaggaca gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt     4440

tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa     4500

gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg     4560

gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa     4620

aaggatcttc acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat     4680

atatgagtaa acttggtctg acagttacca atgcttaatc agtgaggcac ctatctcagc     4740

gatctgtcta tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga taactacgat     4800

acgggagggc ttaccatctg gccccagtgc tgcaatgata ccgcgagacc cacgctcacc     4860

ggctccagat ttatcagcaa taaaccagcc agccggaagg gccgagcgca gaagtggtcc     4920

tgcaacttta tccgcctcca tccagtctat taattgttgc cgggaagcta gagtaagtag     4980

ttcgccagtt aatagtttgc gcaacgttgt tgccattgct acaggcatcg tggtgtcacg     5040

ctcgtcgttt ggtatggctt cattcagctc cggttcccaa cgatcaaggc gagttacatg     5100

atcccccatg ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg ttgtcagaag     5160

taagttggcc gcagtgttat cactcatggt tatggcagca ctgcataatt ctcttactgt     5220

catgccatcc gtaagatgct tttctgtgac tggtgagtac tcaaccaagt cattctgaga     5280

atagtgtatg cggcgaccga gttgctcttg cccggcgtca atacgggata ataccgcgcc     5340

acatagcaga actttaaaag tgctcatcat tggaaaacgt tcttcggggc gaaaactctc     5400

aaggatctta ccgctgttga gatccagttc gatgtaaccc actcgtgcac ccaactgatc     5460

ttcagcatct tttactttca ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc     5520

cgcaaaaaag ggaataaggg cgacacggaa atgttgaata ctcatactct tcctttttca     5580

atattattga agcatttatc agggttattg tctcatgagc ggatacatat ttgaatgtat     5640

ttagaaaaat aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cac            5693


<210>  34
<211>  4959
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pGEP425 vector control guide RNA delivery without tdT

<400>  34
ctgacgcgcc ctgtagcggc acgtcacctg catcaggaat tcaggagaag aactcgagag       60

ggaattgcag atcatgaggc agatggctat ttttgtgtca catatgcgca aaaagagagg      120

ctatatttgt gtccctaggt tcttcgttgt attgcagttt ccatatcaat ctgacttggt      180

cgcatgagaa attgatggtt aaataatttg aatctctcat gtagtatcaa ctattagata      240

ttattttcac caaatatatt tccatcggag aagaagaggc tacagaggaa gcagaagaga      300

ggggtgggag aatttttaca cttttgtaca cccacttaaa cagcaaaatc cgtatgaaaa      360

caggcccacc aaaacaatgc cacgataaca atccgtagaa acaaaagctt catttaacag      420

cggcgcaaca aagcacgctt atccatggta gttgtagtcc gtatgcgatc caaagatcac      480

gattcacgcg tgacggacgg acgacgcgtg ccacaccaca actaacggca tccatggtag      540

ttgtagtccg tatgcgatcc aaagatcacg attcacgcgt gacggacgga cgacgcgcgc      600

cacaccacaa ctaacagcgt gagccagcgt ccaaactccg gatggcaacg gggacgaaac      660

ccgtcgggta gtcactgccc aaacccgtcc ccgcaacctt catcccaaac ccgtccccgt      720

ttccggtcgc gggtttcagt tttctaccag acccgtcccc atcgggtttt tcatccccgt      780

cgggaaatcc gaacccgcca gcatttcagc accaagccaa agttgcagca gcaacatgaa      840

taaaaaacaa cccgtttcaa caccaagata aaacaaaaca ttataattta gacaacattt      900

cacacgtata acaataacat atagttctca catataacaa caccatttca cacataaaac      960

aacaccattt gggataaaaa tatgggctat atcaggccat ttttatgggc catattgagt     1020

tttcgtgggt ttcacaggta ccggatttgt agaatgctga accgggtttg aaccgtaaaa     1080

tccgcgggta ttgaatttga cccaatcccg tcgtcccctg gtggggtaaa aacaccatct     1140

tgagtccaaa cggccaccaa ccaaactccg acggcaacaa acaaacggcg ttgctttgct     1200

cctcggtatc tccgtgaccg ctcaatctcc cggctgtttc cccggaattg cgtggactct     1260

ctcatccaca cgcaaaccgc ctctccctcc tctctcgtcc tatccgcccc ggtgccgtag     1320

cctcacggga ctcttcttcc tcccttgcta taaaatcccc gccccctcct gtctcctctc     1380

cacacatcca aactctcaat cgcaccgaga aaaatctcct agcgatcgaa gcgaagcctc     1440

tcccgatcct ctcaaggtac gcccgtttcc cgtcgatcct cctccttccg ttcgtgttct     1500

gtagccgatc gattcgattc ccttacaccc gttcgtgttc tctcgtggat cgatcgattg     1560

tttgttgcta gaaggaactc gtagatctgg cgtttatgaa ctgtgattcg ggttagtcca     1620

gatcgattca ggtcggtcgt cgttgagcct ctcggctatg tctggattat cgtgtagatc     1680

tgctggttca gttgattatg ttcttctagg agtaatttcg ttgggtcagc gcgatttctg     1740

cttaatctat gctgcttatt gcgcctgtac ctatctacta agctatgtgc acctgtaatt     1800

ttgctagatt attcgttcat cctcgtagtt ggtttgtcac agtaatccgt atgggttctg     1860

acgatgttat tgttggtcat acctaggctt ctccagattt tattttgtta aaattggata     1920

gatctgctac tgatagttga tgatggaatt tggtgctgaa tctatgctat ttattgcgcc     1980

tatacctgat ctatcgggct atgtacggct gtagtttact ggattattcg ttcatcctcg     2040

gtagttggtt catcgtttgg gttctgacga taatattgtt gattatgcgt aggcttctgc     2100

agattgttgt taaaattgga tacatcggtt actgatggtt gatgatagat ttgtgctgaa     2160

cctatctgtt tattgctcct atacctgatc tatagggcta tgtatgcctg taatttacca     2220

gattattcgt tcatcctcgt agttggttca tctctataat tcgtatgggt tcttatgatg     2280

ttatcgttga ttatgcctag tcttatacag attattgtgt caagattgaa tatacctgct     2340

actgatcggt gataatttgg ttagtagttt gcaatctgct aggaacacgt taccactgta     2400

atctgtaaac atggtttgcc agagtagttt gttctactac tcttgatatg gttgctgatt     2460

ttagtcgcct ccttttggat catgtattga tgtccttgca gatttccgtg tacttacccc     2520

ggcttttgtg tacttcgtgt taacagctct agaggatcct ctcaacacaa catatacaaa     2580

acaaacgaat ctcaagcaat caagcattct acttctattg cagcaattta aatcatttct     2640

tttaaagcaa aagcaatttt ctgaaaattt tcaccattta cgaacgatag taatttctac     2700

taagtgtaga tgagacggag ctcagtctga ccgcggcgtc tcttaatttc tactaagtgt     2760

agatcgaatt tccccgatcg ttcaaacatt tggcaataaa gtttcttaag attgaatcct     2820

gttgccggtc ttgcgatgat tatcatataa tttctgttga attacgttaa gcatgtaata     2880

attaacatgt aatgcatgac gttatttatg agatgggttt ttatgattag agtcccgcaa     2940

ttatacattt aatacgcgat agaaaacaaa atatagcgcg caaactagga taaattatcg     3000

cgcgcggtgt catctatgtt actagatcgc tcgacgcggc cgccatggcc tctagtggat     3060

caggtgtcgt tcggctgcgg cgagcggtat cagctcactc aaaggcggta atacggttat     3120

ccacagaatc aggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca     3180

ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag gctccgcccc cctgacgagc     3240

atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc     3300

aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg     3360

gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc tcacgctgta     3420

ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg     3480

ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac     3540

acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg aggtatgtag     3600

gcggtgctac agagttcttg aagtggtggc ctaactacgg ctacactaga aggacagtat     3660

ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcttgat     3720

ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag cagattacgc     3780

gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgctcagt     3840

ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt atcaaaaagg atcttcacct     3900

agatcctttt aaattaaaaa tgaagtttta aatcaatcta aagtatatat gagtaaactt     3960

ggtctgacag ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc tgtctatttc     4020

gttcatccat agttgcctga ctccccgtcg tgtagataac tacgatacgg gagggcttac     4080

catctggccc cagtgctgca atgataccgc gagacccacg ctcaccggct ccagatttat     4140

cagcaataaa ccagccagcc ggaagggccg agcgcagaag tggtcctgca actttatccg     4200

cctccatcca gtctattaat tgttgccggg aagctagagt aagtagttcg ccagttaata     4260

gtttgcgcaa cgttgttgcc attgctacag gcatcgtggt gtcacgctcg tcgtttggta     4320

tggcttcatt cagctccggt tcccaacgat caaggcgagt tacatgatcc cccatgttgt     4380

gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag ttggccgcag     4440

tgttatcact catggttatg gcagcactgc ataattctct tactgtcatg ccatccgtaa     4500

gatgcttttc tgtgactggt gagtactcaa ccaagtcatt ctgagaatag tgtatgcggc     4560

gaccgagttg ctcttgcccg gcgtcaatac gggataatac cgcgccacat agcagaactt     4620

taaaagtgct catcattgga aaacgttctt cggggcgaaa actctcaagg atcttaccgc     4680

tgttgagatc cagttcgatg taacccactc gtgcacccaa ctgatcttca gcatctttta     4740

ctttcaccag cgtttctggg tgagcaaaaa caggaaggca aaatgccgca aaaaagggaa     4800

taagggcgac acggaaatgt tgaatactca tactcttcct ttttcaatat tattgaagca     4860

tttatcaggg ttattgtctc atgagcggat acatatttga atgtatttag aaaaataaac     4920

aaataggggt tccgcgcaca tttccccgaa aagtgccac                            4959


<210>  35
<211>  10434
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pGEP372 LbCpf1 RR construct

<400>  35
actgctgcag tgcagcgtga cccggtcgtg cccctctcta gagataatga gcattgcatg       60

tctaagttat aaaaaattac cacatatttt ttttgtcaca cttgtttgaa gtgcagttta      120

tctatcttta tacatatatt taaactttac tctacgaata atataatcta tagtactaca      180

ataatatcag tgttttagag aatcatataa atgaacagtt agacatggtc taaaggacaa      240

ttgagtattt tgacaacagg actctacagt tttatctttt tagtgtgcat gtgttctcct      300

ttttttttgc aaatagcttc acctatataa tacttcatcc attttattag tacatccatt      360

tagggtttag ggttaatggt ttttatagac taattttttt agtacatcta ttttattcta      420

ttttagcctc taaattaaga aaactaaaac tctattttag tttttttatt taataattta      480

gatataaaat agaataaaat aaagtgacta aaaattaaac aaataccctt taagaaatta      540

aaaaaactaa ggaaacattt ttcttgtttc gagtagataa tgccagcctg ttaaacgccg      600

tcgatcgacg agtctaacgg acaccaacca gcgaaccagc agcgtcgcgt cgggccaagc      660

gaagcagacg gcacggcatc tctgtcgctg cctctggacc cctctcgaga gttccgctcc      720

accgttggac ttgctccgct gtcggcatcc agaaattgcg tggcggagcg gcagacgtga      780

gccggcacgg caggcggcct cctcctcctc tcacggcacc ggcagctacg ggggattcct      840

ttcccaccgc tccttcgctt tcccttcctc gcccgccgta ataaatagac accccctcca      900

caccctcttt ccccaacctc gtgttgttcg gagcgcacac acacacaacc agatctcccc      960

caaatccacc cgtcggcacc tccgcttcaa ggtacgccgc tcgtcctccc cccccccccc     1020

tctctacctt ctctagatcg gcgttccggt ccatggttag ggcccggtag ttctacttct     1080

gttcatgttt gtgttagatc cgtgtttgtg ttagatccgt gctgctagcg ttcgtacacg     1140

gatgcgacct gtacgtcaga cacgttctga ttgctaactt gccagtgttt ctctttgggg     1200

aatcctggga tggctctagc cgttccgcag acgggatcga tctaggatag gtatacatgt     1260

tgatgtgggt tttactgatg catatacatg atggcatatg cagcatctat tcatatgctc     1320

taaccttgag tacctatcta ttataataaa caagtatgtt ttataattat tttgatcttg     1380

atatacttgg atgatggcat atgcagcagc tatatgtgga tttttttagc cctgccttca     1440

tacgctattt atttgcttgg tactgtttct tttgtcgatg ctcaccctgt tgtttggtgt     1500

tacttctgca ggtcgaagct tgaagcaaac atggcatcta gcatggcacc aaagaaaaaa     1560

aggaaagttt ccaaacttga aaaatttaca aactgctact ccctttccaa gacgcttagg     1620

tttaaagcga tccccgttgg caagacccaa gagaatatcg ataacaaaag acttctggtc     1680

gaagatgaaa aaagggccga agactacaag ggggtcaaga agttgctcga tcgctattat     1740

ctttccttta tcaacgatgt gcttcattca atcaaactga agaacttgaa taactacatt     1800

agccttttca gaaagaaaac gaggactgaa aaggagaaca aggaacttga gaatcttgaa     1860

ataaaccttc gcaaagaaat tgcaaaagcc ttcaagggga acgaaggata taaatctctt     1920

ttcaaaaaag acattataga aacaattttg cctgagtttc ttgacgacaa ggatgaaatt     1980

gcgctcgtca atagctttaa cggatttaca actgccttca cagggttctt cgacaatagg     2040

gagaatatgt ttagcgagga ggcaaaaagc acatccatcg cattcagatg catcaatgaa     2100

aatcttaccc ggtacatatc gaatatggac atatttgaaa aagtggatgc aatattcgat     2160

aagcacgaag tccaggagat aaaggaaaag atactgaata gcgactatga tgtcgaagat     2220

tttttcgaag gtgagttctt caactttgtc ctgactcaag aaggcattga tgtctataat     2280

gcaataattg gaggttttgt gactgagtct ggcgagaaga taaagggctt gaacgagtat     2340

atcaatctct acaaccagaa gactaagcaa aagttgccta aatttaaacc gctttacaag     2400

caagttttga gcgaccggga aagcctttcc ttttacggtg aaggatacac gagcgatgaa     2460

gaagtcctcg aagtcttccg caacacactc aacaagaact cagaaatctt ttcctcaatt     2520

aaaaaattgg agaagctttt caagaacttc gatgaatact cttcggcggg gatttttgtg     2580

aagaacggcc cggcaatttc cacaatatct aaagacattt tcggagaatg gaacgtgata     2640

agagacaagt ggaatgcgga gtatgatgac atacacctga agaagaaggc agttgtgact     2700

gaaaaatacg aagatgacag gagaaaaagc tttaaaaaga tcgggtcctt ttcactggaa     2760

cagctgcagg agtatgccga cgccgatctt tcggttgtcg aaaagctcaa agaaataatt     2820

atccagaagg tcgatgaaat ctacaaggtg tacggctcaa gcgagaagct ctttgatgct     2880

gacttcgtgt tggagaagtc tcttaaaaaa aacgacgcag tcgtcgcgat aatgaaagat     2940

ttgctggatt cagtgaaatc cttcgagaat tatatcaaag ccttcttcgg cgaggggaag     3000

gagacaaaca gggatgagtc cttctatgga gacttcgttc tggcttacga catccttctt     3060

aaggtcgacc acatctatga cgcaattcgg aactatgtga cgcagaagcc gtattcgaaa     3120

gataagttca agctctattt ccaaaaccct caatttatgc gtgggtggga taaagacaaa     3180

gagaccgatt accgggcaac aattttgcgg tacgggtcta aatattacct cgctataatg     3240

gataagaaat acgctaaatg tctccagaaa attgacaaag atgacgtcaa cggcaattat     3300

gaaaaaatca attataaact ccttcctggc ccaaataaaa tgctcccgag ggtgtttttt     3360

tccaaaaagt ggatggccta ttataatcca tcagaggata ttcagaaaat ctataaaaat     3420

gggaccttta agaagggtga catgtttaac ctgaacgatt gccacaagct tatagatttt     3480

ttcaaagact ctattagccg ctatcccaaa tggtctaatg cttatgattt caacttctct     3540

gaaactgaaa agtacaaaga tattgcagga ttctaccgcg aagttgaaga acaaggttat     3600

aaggtttcct ttgagtctgc gtccaagaaa gaggtcgata agttggtcga agaagggaaa     3660

ttgtatatgt ttcaaattta caataaagac ttttccgaca agtcccatgg tacacctaat     3720

ctgcatacca tgtacttcaa actgctgttc gatgagaata atcacggtca gattcgcctg     3780

agcggagggg cggaactctt catgaggaga gcatcgttga aaaaagagga gctcgtcgtg     3840

catccggcta acagccccat tgctaacaag aatccggata atccaaagaa gactactacc     3900

ctctcctatg acgtctataa ggataagaga ttctctgagg accagtacga gttgcacatc     3960

cctattgcga taaataaatg ccctaagaac atctttaaaa tcaatactga ggtcagagtc     4020

ctgcttaagc acgacgacaa cccgtatgtg atcgggattg ataggggtga aaggaacttg     4080

ctttatattg tggttgtcga tggaaaaggt aatatagtgg aacaatactc tctgaatgaa     4140

attatcaaca acttcaatgg cattaggatc aagaccgact atcattctct gttggacaag     4200

aaagagaaag agcgcttcga ggcacggcaa aactggacgt ctattgagaa catcaaggag     4260

cttaaggctg gttacatttc tcaggttgtg cacaaaattt gcgaactggt cgagaaatat     4320

gatgccgtta tcgcacttga agatctcaac agcggattta agaattctcg ggtgaaagtc     4380

gaaaaacagg tgtatcaaaa attcgaaaag atgctgatcg acaagctcaa ttatatggtt     4440

gataaaaaga gcaacccatg cgccacgggg ggtgcgctta agggctatca gattacgaac     4500

aaatttgaat ccttcaagtc aatgtcgacg caaaatgggt ttatattcta tataccggcg     4560

tggcttacat ctaaaataga tcctagcact gggttcgtga acctgctgaa aaccaagtac     4620

acttcaatcg cagattctaa aaaatttata agcagcttcg acagaatcat gtatgtgccc     4680

gaggaagacc tcttcgagtt tgcccttgat tacaaaaatt tctcaagaac ggatgcagac     4740

tacataaaga agtggaagct gtactcttat gggaaccgga ttcggatatt cagaaatccg     4800

aaaaaaaaca atgtctttga ttgggaggaa gtttgtctta cctctgctta caaagagctg     4860

ttcaataaat atggcattaa ttaccagcaa ggtgatatcc gggcgctcct ttgcgaacag     4920

tctgacaaag ctttctattc ttcatttatg gcgctcatgt cattgatgct gcagatgagg     4980

aatagcatta cggggaggac tgatgttgac tttctgatct cgcccgtgaa aaattctgat     5040

ggaatcttct acgattccag gaattatgag gcccaggaaa atgctatcct tcccaagaac     5100

gcagacgcaa atggcgcgta caatatagct cgcaaggttt tgtgggctat aggccaattc     5160

aagaaagccg aagacgaaaa gctggacaaa gttaagattg ctatatctaa caaagagtgg     5220

cttgagtatg cgcaaacatc tgttaaacac aaacgccccg cggctacaaa gaaggctggc     5280

caggcaaaga agaagaagtg agtcgaccga tcgttcaaac atttggcaat aaagtttctt     5340

aagattgaat cctgttgccg gtcttgcgat gattatcata taatttctgt tgaattacgt     5400

taagcatgta ataattaaca tgtaatgcat gacgttattt atgagatggg tttttatgat     5460

tagagtcccg caattataca tttaatacgc gatagaaaac aaaatatagc gcgcaaacta     5520

ggataaatta tcgcgcgcgg tgtcatctat gttactagat cgatcccggg atatcgcggc     5580

cgcgtcgttc ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc     5640

acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg     5700

aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat     5760

cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag     5820

gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga     5880

tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg     5940

tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt     6000

cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac     6060

gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc     6120

ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag gacagtattt     6180

ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc     6240

ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc     6300

agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg     6360

aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat cttcacctag     6420

atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg     6480

tctgacagtt accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt     6540

tcatccatag ttgcctgact ccccgtcgtg tagataacta cgatacggga gggcttacca     6600

tctggcccca gtgctgcaat gataccgcga gacccacgct caccggctcc agatttatca     6660

gcaataaacc agccagccgg aagggccgag cgcagaagtg gtcctgcaac tttatccgcc     6720

tccatccagt ctattaattg ttgccgggaa gctagagtaa gtagttcgcc agttaatagt     6780

ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg     6840

gcttcattca gctccggttc ccaacgatca aggcgagtta catgatcccc catgttgtgc     6900

aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg     6960

ttatcactca tggttatggc agcactgcat aattctctta ctgtcatgcc atccgtaaga     7020

tgcttttctg tgactggtga gtactcaacc aagtcattct gagaatagtg tatgcggcga     7080

ccgagttgct cttgcccggc gtcaatacgg gataataccg cgccacatag cagaacttta     7140

aaagtgctca tcattggaaa acgttcttcg gggcgaaaac tctcaaggat cttaccgctg     7200

ttgagatcca gttcgatgta acccactcgt gcacccaact gatcttcagc atcttttact     7260

ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata     7320

agggcgacac ggaaatgttg aatactcata ctcttccttt ttcaatatta ttgaagcatt     7380

tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa     7440

ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgcgccctg tagcggcacg     7500

tctaattcgg gggatctgga ttttagtact ggattttggt tttaggaatt agaaatttta     7560

ttgatagaag tattttacaa atacaaatac atactaaggg tttcttatat gctcaacaca     7620

tgagcgaaac cctataggaa ccctaattcc cttatctggg aactactcac acattattat     7680

ggagaaactc gagcttgtcg atcgacatga tcagggagct ctagattatt tgtatagttc     7740

atccatgccc attacgtcgg taaatgcctt ctgccactcc ttgaagttaa gttcggtctt     7800

ggaatgtttc aactcagtct tacggaacac gtacatgggt tggttcttaa ggtagttagc     7860

ggccattggt ttagcgaatg tgtaggtagt cctggctgta gagcgatatc tcttgccatt     7920

gcctgtggtg taagaccatt tgaaggtact aatgatggtc ttgtcgttag ggtaggtttt     7980

cttggaccgg caccaatcag cggcagttaa ggagttggtc atgacaggtc catcagcagg     8040

aaagcctgtc cccttcactt gggcttctcc tttgatgtgg ctcccttcgt aagtgtaacg     8100

gtagttgacg gtgagcgaag caccgtcctc aaactgcatt gtcctgtgga cttggtatcc     8160

ggagccatca accatggctg cttggaatgg actcattccg tcagggtatg gaaggtattg     8220

atggaatccg tagccaatgt gtggcaccag aatccatgga gaaaactgaa gatcaccttt     8280

ggtgctcttg aggttcagct cttcgtatcc gtcattaggg ttcccagtgc cttgtccgac     8340

catatcgaag tcaacgccgt tgatggaacc gaagatgtga agctcatgtg tggctggaag     8400

cgaagccatg ttatcttctt ctcctttact cacggaggac gccatggtgg cgggatcgcg     8460

ccctatcgtt cgtaaatggt gaaaattttc agaaaattgc ttttgcttta aaagaaatga     8520

tttaaattgc tgcaatagaa gtagaatgct tgattgcttg agattcgttt gttttgtata     8580

tgttgtgttg agaggatcct caagcttcga cctgcagaag taacaccaaa caacagggtg     8640

agcatcgaca aaagaaacag taccaagcaa ataaatagcg tatgaaggca gggctaaaaa     8700

aatccacata tagctgctgc atatgccatc atccaagtat atcaagatca aaataattat     8760

aaaacatact tgtttattat aatagatagg tactcaaggt tagagcatat gaatagatgc     8820

tgcatatgcc atcatgtata tgcatcagta aaacccacat caacatgtat acctatccta     8880

gatcgatatt tccatccatc ttaaactcgt aactatgaag atgtatgaca cacacataca     8940

gttccaaaat taataaatac accaggtagt ttgaaacagt attctactcc gatctagaac     9000

gaatgaacga ccgcccaacc acaccacatc atcacaacca agcgaacaaa agcatctctg     9060

tatatgcatc agtaaaaccc gcatcaacat gtatacctat cctagatcga tatttccatc     9120

catcatcttc aattcgtaac tatgaatatg tatggcacac acatacagat ccaaaattaa     9180

taaatccacc aggtagtttg aaacagaatt ctactccgat ctagaacgac cgcccaacca     9240

gaccacatca tcacaaccaa gacaaaaaaa agcatgaaaa gatgacccga caaacaagtg     9300

cacggcatat attgaaataa aggaaaaggg caaaccaaac cctatgcaac gaaacaaaaa     9360

aaatcatgaa atcgatcccg tctgcggaac ggctagagcc atcccaggat tccccaaaga     9420

gaaacactgg caagttagca atcagaacgt gtctgacgta caggtcgcat ccgtgtacga     9480

acgctagcag cacggatcta acacaaacac ggatctaaca caaacatgaa cagaagtaga     9540

actaccgggc cctaaccatg gaccggaacg ccgatctaga gaaggtagag aggggggggg     9600

aggacgagcg gcgtaccttg aagcggaggt gccgacgggt ggatttgggg gagatccact     9660

agttctagag cggccgccac cgcggtggaa ttctcgaggt cctctccaaa tgaaatgaac     9720

ttccttatat agaggaaggg tcttgcgaag gatagtggga ttgtgcgtca tcccttacgt     9780

cagtggagat atcacatcaa tccacttgct ttgaagacgt ggttggaacg tcttcttttt     9840

ccacgatgct cctcgtgggt gggggtccat ctttgggacc actgtcggca gaggcatctt     9900

gaacgatagc ctttccttta tcgcaatgat ggcatttgta ggtgccacct tccttttcta     9960

ctgtcctttt gatcaagtga ccgatagctg ggcaatggaa tccgaggagg tttcccgata    10020

ttaccctttg ttgaaaagtc tcaatagccc tttggtcttc tgagactgta tctttgatat    10080

tcttggagta gacgagagtg tcgtgctcca ccatgttatc acatcaattc acttgctttg    10140

aagacgtggt tggaacgtct tctttttcca cgatgctcct cgtgggtggg ggtccatctt    10200

tgggaccact gtcggcagag gcatcttgaa cgatagcctt tcctttatcg caatgatggc    10260

atttgtaggt gccaccttcc ttttctactg tccttttgat caagtgacag atagctgggc    10320

aatggaatcc gaggaggttt cccgatatta ccctttgttg aaaagtctca atagcccttt    10380

ggtcttctga gacttgcagg caagcaagca tgaatgcctg ggcgcgccga tatc          10434


<210>  36
<211>  10434
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pGEP373 LbCpf1 RVR construct

<400>  36
actgctgcag tgcagcgtga cccggtcgtg cccctctcta gagataatga gcattgcatg       60

tctaagttat aaaaaattac cacatatttt ttttgtcaca cttgtttgaa gtgcagttta      120

tctatcttta tacatatatt taaactttac tctacgaata atataatcta tagtactaca      180

ataatatcag tgttttagag aatcatataa atgaacagtt agacatggtc taaaggacaa      240

ttgagtattt tgacaacagg actctacagt tttatctttt tagtgtgcat gtgttctcct      300

ttttttttgc aaatagcttc acctatataa tacttcatcc attttattag tacatccatt      360

tagggtttag ggttaatggt ttttatagac taattttttt agtacatcta ttttattcta      420

ttttagcctc taaattaaga aaactaaaac tctattttag tttttttatt taataattta      480

gatataaaat agaataaaat aaagtgacta aaaattaaac aaataccctt taagaaatta      540

aaaaaactaa ggaaacattt ttcttgtttc gagtagataa tgccagcctg ttaaacgccg      600

tcgatcgacg agtctaacgg acaccaacca gcgaaccagc agcgtcgcgt cgggccaagc      660

gaagcagacg gcacggcatc tctgtcgctg cctctggacc cctctcgaga gttccgctcc      720

accgttggac ttgctccgct gtcggcatcc agaaattgcg tggcggagcg gcagacgtga      780

gccggcacgg caggcggcct cctcctcctc tcacggcacc ggcagctacg ggggattcct      840

ttcccaccgc tccttcgctt tcccttcctc gcccgccgta ataaatagac accccctcca      900

caccctcttt ccccaacctc gtgttgttcg gagcgcacac acacacaacc agatctcccc      960

caaatccacc cgtcggcacc tccgcttcaa ggtacgccgc tcgtcctccc cccccccccc     1020

tctctacctt ctctagatcg gcgttccggt ccatggttag ggcccggtag ttctacttct     1080

gttcatgttt gtgttagatc cgtgtttgtg ttagatccgt gctgctagcg ttcgtacacg     1140

gatgcgacct gtacgtcaga cacgttctga ttgctaactt gccagtgttt ctctttgggg     1200

aatcctggga tggctctagc cgttccgcag acgggatcga tctaggatag gtatacatgt     1260

tgatgtgggt tttactgatg catatacatg atggcatatg cagcatctat tcatatgctc     1320

taaccttgag tacctatcta ttataataaa caagtatgtt ttataattat tttgatcttg     1380

atatacttgg atgatggcat atgcagcagc tatatgtgga tttttttagc cctgccttca     1440

tacgctattt atttgcttgg tactgtttct tttgtcgatg ctcaccctgt tgtttggtgt     1500

tacttctgca ggtcgaagct tgaagcaaac atggcatcta gcatggcacc aaagaaaaaa     1560

aggaaagttt ccaaacttga aaaatttaca aactgctact ccctttccaa gacgcttagg     1620

tttaaagcga tccccgttgg caagacccaa gagaatatcg ataacaaaag acttctggtc     1680

gaagatgaaa aaagggccga agactacaag ggggtcaaga agttgctcga tcgctattat     1740

ctttccttta tcaacgatgt gcttcattca atcaaactga agaacttgaa taactacatt     1800

agccttttca gaaagaaaac gaggactgaa aaggagaaca aggaacttga gaatcttgaa     1860

ataaaccttc gcaaagaaat tgcaaaagcc ttcaagggga acgaaggata taaatctctt     1920

ttcaaaaaag acattataga aacaattttg cctgagtttc ttgacgacaa ggatgaaatt     1980

gcgctcgtca atagctttaa cggatttaca actgccttca cagggttctt cgacaatagg     2040

gagaatatgt ttagcgagga ggcaaaaagc acatccatcg cattcagatg catcaatgaa     2100

aatcttaccc ggtacatatc gaatatggac atatttgaaa aagtggatgc aatattcgat     2160

aagcacgaag tccaggagat aaaggaaaag atactgaata gcgactatga tgtcgaagat     2220

tttttcgaag gtgagttctt caactttgtc ctgactcaag aaggcattga tgtctataat     2280

gcaataattg gaggttttgt gactgagtct ggcgagaaga taaagggctt gaacgagtat     2340

atcaatctct acaaccagaa gactaagcaa aagttgccta aatttaaacc gctttacaag     2400

caagttttga gcgaccggga aagcctttcc ttttacggtg aaggatacac gagcgatgaa     2460

gaagtcctcg aagtcttccg caacacactc aacaagaact cagaaatctt ttcctcaatt     2520

aaaaaattgg agaagctttt caagaacttc gatgaatact cttcggcggg gatttttgtg     2580

aagaacggcc cggcaatttc cacaatatct aaagacattt tcggagaatg gaacgtgata     2640

agagacaagt ggaatgcgga gtatgatgac atacacctga agaagaaggc agttgtgact     2700

gaaaaatacg aagatgacag gagaaaaagc tttaaaaaga tcgggtcctt ttcactggaa     2760

cagctgcagg agtatgccga cgccgatctt tcggttgtcg aaaagctcaa agaaataatt     2820

atccagaagg tcgatgaaat ctacaaggtg tacggctcaa gcgagaagct ctttgatgct     2880

gacttcgtgt tggagaagtc tcttaaaaaa aacgacgcag tcgtcgcgat aatgaaagat     2940

ttgctggatt cagtgaaatc cttcgagaat tatatcaaag ccttcttcgg cgaggggaag     3000

gagacaaaca gggatgagtc cttctatgga gacttcgttc tggcttacga catccttctt     3060

aaggtcgacc acatctatga cgcaattcgg aactatgtga cgcagaagcc gtattcgaaa     3120

gataagttca agctctattt ccaaaaccct caatttatgc gtgggtggga taaagacgta     3180

gagaccgatc gccgggcaac aattttgcgg tacgggtcta aatattacct cgctataatg     3240

gataagaaat acgctaaatg tctccagaaa attgacaaag atgacgtcaa cggcaattat     3300

gaaaaaatca attataaact ccttcctggc ccaaataaaa tgctcccgaa ggtgtttttt     3360

tccaaaaagt ggatggccta ttataatcca tcagaggata ttcagaaaat ctataaaaat     3420

gggaccttta agaagggtga catgtttaac ctgaacgatt gccacaagct tatagatttt     3480

ttcaaagact ctattagccg ctatcccaaa tggtctaatg cttatgattt caacttctct     3540

gaaactgaaa agtacaaaga tattgcagga ttctaccgcg aagttgaaga acaaggttat     3600

aaggtttcct ttgagtctgc gtccaagaaa gaggtcgata agttggtcga agaagggaaa     3660

ttgtatatgt ttcaaattta caataaagac ttttccgaca agtcccatgg tacacctaat     3720

ctgcatacca tgtacttcaa actgctgttc gatgagaata atcacggtca gattcgcctg     3780

agcggagggg cggaactctt catgaggaga gcatcgttga aaaaagagga gctcgtcgtg     3840

catccggcta acagccccat tgctaacaag aatccggata atccaaagaa gactactacc     3900

ctctcctatg acgtctataa ggataagaga ttctctgagg accagtacga gttgcacatc     3960

cctattgcga taaataaatg ccctaagaac atctttaaaa tcaatactga ggtcagagtc     4020

ctgcttaagc acgacgacaa cccgtatgtg atcgggattg ataggggtga aaggaacttg     4080

ctttatattg tggttgtcga tggaaaaggt aatatagtgg aacaatactc tctgaatgaa     4140

attatcaaca acttcaatgg cattaggatc aagaccgact atcattctct gttggacaag     4200

aaagagaaag agcgcttcga ggcacggcaa aactggacgt ctattgagaa catcaaggag     4260

cttaaggctg gttacatttc tcaggttgtg cacaaaattt gcgaactggt cgagaaatat     4320

gatgccgtta tcgcacttga agatctcaac agcggattta agaattctcg ggtgaaagtc     4380

gaaaaacagg tgtatcaaaa attcgaaaag atgctgatcg acaagctcaa ttatatggtt     4440

gataaaaaga gcaacccatg cgccacgggg ggtgcgctta agggctatca gattacgaac     4500

aaatttgaat ccttcaagtc aatgtcgacg caaaatgggt ttatattcta tataccggcg     4560

tggcttacat ctaaaataga tcctagcact gggttcgtga acctgctgaa aaccaagtac     4620

acttcaatcg cagattctaa aaaatttata agcagcttcg acagaatcat gtatgtgccc     4680

gaggaagacc tcttcgagtt tgcccttgat tacaaaaatt tctcaagaac ggatgcagac     4740

tacataaaga agtggaagct gtactcttat gggaaccgga ttcggatatt cagaaatccg     4800

aaaaaaaaca atgtctttga ttgggaggaa gtttgtctta cctctgctta caaagagctg     4860

ttcaataaat atggcattaa ttaccagcaa ggtgatatcc gggcgctcct ttgcgaacag     4920

tctgacaaag ctttctattc ttcatttatg gcgctcatgt cattgatgct gcagatgagg     4980

aatagcatta cggggaggac tgatgttgac tttctgatct cgcccgtgaa aaattctgat     5040

ggaatcttct acgattccag gaattatgag gcccaggaaa atgctatcct tcccaagaac     5100

gcagacgcaa atggcgcgta caatatagct cgcaaggttt tgtgggctat aggccaattc     5160

aagaaagccg aagacgaaaa gctggacaaa gttaagattg ctatatctaa caaagagtgg     5220

cttgagtatg cgcaaacatc tgttaaacac aaacgccccg cggctacaaa gaaggctggc     5280

caggcaaaga agaagaagtg agtcgaccga tcgttcaaac atttggcaat aaagtttctt     5340

aagattgaat cctgttgccg gtcttgcgat gattatcata taatttctgt tgaattacgt     5400

taagcatgta ataattaaca tgtaatgcat gacgttattt atgagatggg tttttatgat     5460

tagagtcccg caattataca tttaatacgc gatagaaaac aaaatatagc gcgcaaacta     5520

ggataaatta tcgcgcgcgg tgtcatctat gttactagat cgatcccggg atatcgcggc     5580

cgcgtcgttc ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc     5640

acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg     5700

aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat     5760

cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag     5820

gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga     5880

tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg     5940

tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt     6000

cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac     6060

gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc     6120

ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag gacagtattt     6180

ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc     6240

ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc     6300

agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg     6360

aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat cttcacctag     6420

atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg     6480

tctgacagtt accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt     6540

tcatccatag ttgcctgact ccccgtcgtg tagataacta cgatacggga gggcttacca     6600

tctggcccca gtgctgcaat gataccgcga gacccacgct caccggctcc agatttatca     6660

gcaataaacc agccagccgg aagggccgag cgcagaagtg gtcctgcaac tttatccgcc     6720

tccatccagt ctattaattg ttgccgggaa gctagagtaa gtagttcgcc agttaatagt     6780

ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg     6840

gcttcattca gctccggttc ccaacgatca aggcgagtta catgatcccc catgttgtgc     6900

aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg     6960

ttatcactca tggttatggc agcactgcat aattctctta ctgtcatgcc atccgtaaga     7020

tgcttttctg tgactggtga gtactcaacc aagtcattct gagaatagtg tatgcggcga     7080

ccgagttgct cttgcccggc gtcaatacgg gataataccg cgccacatag cagaacttta     7140

aaagtgctca tcattggaaa acgttcttcg gggcgaaaac tctcaaggat cttaccgctg     7200

ttgagatcca gttcgatgta acccactcgt gcacccaact gatcttcagc atcttttact     7260

ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata     7320

agggcgacac ggaaatgttg aatactcata ctcttccttt ttcaatatta ttgaagcatt     7380

tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa     7440

ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgcgccctg tagcggcacg     7500

tctaattcgg gggatctgga ttttagtact ggattttggt tttaggaatt agaaatttta     7560

ttgatagaag tattttacaa atacaaatac atactaaggg tttcttatat gctcaacaca     7620

tgagcgaaac cctataggaa ccctaattcc cttatctggg aactactcac acattattat     7680

ggagaaactc gagcttgtcg atcgacatga tcagggagct ctagattatt tgtatagttc     7740

atccatgccc attacgtcgg taaatgcctt ctgccactcc ttgaagttaa gttcggtctt     7800

ggaatgtttc aactcagtct tacggaacac gtacatgggt tggttcttaa ggtagttagc     7860

ggccattggt ttagcgaatg tgtaggtagt cctggctgta gagcgatatc tcttgccatt     7920

gcctgtggtg taagaccatt tgaaggtact aatgatggtc ttgtcgttag ggtaggtttt     7980

cttggaccgg caccaatcag cggcagttaa ggagttggtc atgacaggtc catcagcagg     8040

aaagcctgtc cccttcactt gggcttctcc tttgatgtgg ctcccttcgt aagtgtaacg     8100

gtagttgacg gtgagcgaag caccgtcctc aaactgcatt gtcctgtgga cttggtatcc     8160

ggagccatca accatggctg cttggaatgg actcattccg tcagggtatg gaaggtattg     8220

atggaatccg tagccaatgt gtggcaccag aatccatgga gaaaactgaa gatcaccttt     8280

ggtgctcttg aggttcagct cttcgtatcc gtcattaggg ttcccagtgc cttgtccgac     8340

catatcgaag tcaacgccgt tgatggaacc gaagatgtga agctcatgtg tggctggaag     8400

cgaagccatg ttatcttctt ctcctttact cacggaggac gccatggtgg cgggatcgcg     8460

ccctatcgtt cgtaaatggt gaaaattttc agaaaattgc ttttgcttta aaagaaatga     8520

tttaaattgc tgcaatagaa gtagaatgct tgattgcttg agattcgttt gttttgtata     8580

tgttgtgttg agaggatcct caagcttcga cctgcagaag taacaccaaa caacagggtg     8640

agcatcgaca aaagaaacag taccaagcaa ataaatagcg tatgaaggca gggctaaaaa     8700

aatccacata tagctgctgc atatgccatc atccaagtat atcaagatca aaataattat     8760

aaaacatact tgtttattat aatagatagg tactcaaggt tagagcatat gaatagatgc     8820

tgcatatgcc atcatgtata tgcatcagta aaacccacat caacatgtat acctatccta     8880

gatcgatatt tccatccatc ttaaactcgt aactatgaag atgtatgaca cacacataca     8940

gttccaaaat taataaatac accaggtagt ttgaaacagt attctactcc gatctagaac     9000

gaatgaacga ccgcccaacc acaccacatc atcacaacca agcgaacaaa agcatctctg     9060

tatatgcatc agtaaaaccc gcatcaacat gtatacctat cctagatcga tatttccatc     9120

catcatcttc aattcgtaac tatgaatatg tatggcacac acatacagat ccaaaattaa     9180

taaatccacc aggtagtttg aaacagaatt ctactccgat ctagaacgac cgcccaacca     9240

gaccacatca tcacaaccaa gacaaaaaaa agcatgaaaa gatgacccga caaacaagtg     9300

cacggcatat attgaaataa aggaaaaggg caaaccaaac cctatgcaac gaaacaaaaa     9360

aaatcatgaa atcgatcccg tctgcggaac ggctagagcc atcccaggat tccccaaaga     9420

gaaacactgg caagttagca atcagaacgt gtctgacgta caggtcgcat ccgtgtacga     9480

acgctagcag cacggatcta acacaaacac ggatctaaca caaacatgaa cagaagtaga     9540

actaccgggc cctaaccatg gaccggaacg ccgatctaga gaaggtagag aggggggggg     9600

aggacgagcg gcgtaccttg aagcggaggt gccgacgggt ggatttgggg gagatccact     9660

agttctagag cggccgccac cgcggtggaa ttctcgaggt cctctccaaa tgaaatgaac     9720

ttccttatat agaggaaggg tcttgcgaag gatagtggga ttgtgcgtca tcccttacgt     9780

cagtggagat atcacatcaa tccacttgct ttgaagacgt ggttggaacg tcttcttttt     9840

ccacgatgct cctcgtgggt gggggtccat ctttgggacc actgtcggca gaggcatctt     9900

gaacgatagc ctttccttta tcgcaatgat ggcatttgta ggtgccacct tccttttcta     9960

ctgtcctttt gatcaagtga ccgatagctg ggcaatggaa tccgaggagg tttcccgata    10020

ttaccctttg ttgaaaagtc tcaatagccc tttggtcttc tgagactgta tctttgatat    10080

tcttggagta gacgagagtg tcgtgctcca ccatgttatc acatcaattc acttgctttg    10140

aagacgtggt tggaacgtct tctttttcca cgatgctcct cgtgggtggg ggtccatctt    10200

tgggaccact gtcggcagag gcatcttgaa cgatagcctt tcctttatcg caatgatggc    10260

atttgtaggt gccaccttcc ttttctactg tccttttgat caagtgacag atagctgggc    10320

aatggaatcc gaggaggttt cccgatatta ccctttgttg aaaagtctca atagcccttt    10380

ggtcttctga gacttgcagg caagcaagca tgaatgcctg ggcgcgccga tatc          10434


<210>  37
<211>  10432
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pGEP315 Corn strategy LbCpf1 vector

<400>  37
actgctgcag tgcagcgtga cccggtcgtg cccctctcta gagataatga gcattgcatg       60

tctaagttat aaaaaattac cacatatttt ttttgtcaca cttgtttgaa gtgcagttta      120

tctatcttta tacatatatt taaactttac tctacgaata atataatcta tagtactaca      180

ataatatcag tgttttagag aatcatataa atgaacagtt agacatggtc taaaggacaa      240

ttgagtattt tgacaacagg actctacagt tttatctttt tagtgtgcat gtgttctcct      300

ttttttttgc aaatagcttc acctatataa tacttcatcc attttattag tacatccatt      360

tagggtttag ggttaatggt ttttatagac taattttttt agtacatcta ttttattcta      420

ttttagcctc taaattaaga aaactaaaac tctattttag tttttttatt taataattta      480

gatataaaat agaataaaat aaagtgacta aaaattaaac aaataccctt taagaaatta      540

aaaaaactaa ggaaacattt ttcttgtttc gagtagataa tgccagcctg ttaaacgccg      600

tcgatcgacg agtctaacgg acaccaacca gcgaaccagc agcgtcgcgt cgggccaagc      660

gaagcagacg gcacggcatc tctgtcgctg cctctggacc cctctcgaga gttccgctcc      720

accgttggac ttgctccgct gtcggcatcc agaaattgcg tggcggagcg gcagacgtga      780

gccggcacgg caggcggcct cctcctcctc tcacggcacc ggcagctacg ggggattcct      840

ttcccaccgc tccttcgctt tcccttcctc gcccgccgta ataaatagac accccctcca      900

caccctcttt ccccaacctc gtgttgttcg gagcgcacac acacacaacc agatctcccc      960

caaatccacc cgtcggcacc tccgcttcaa ggtacgccgc tcgtcctccc ccccccctct     1020

ctaccttctc tagatcggcg ttccggtcca tggttagggc ccggtagttc tacttctgtt     1080

catgtttgtg ttagatccgt gtttgtgtta gatccgtgct gctagcgttc gtacacggat     1140

gcgacctgta cgtcagacac gttctgattg ctaacttgcc agtgtttctc tttggggaat     1200

cctgggatgg ctctagccgt tccgcagacg ggatcgatct aggataggta tacatgttga     1260

tgtgggtttt actgatgcat atacatgatg gcatatgcag catctattca tatgctctaa     1320

ccttgagtac ctatctatta taataaacaa gtatgtttta taattatttt gatcttgata     1380

tacttggatg atggcatatg cagcagctat atgtggattt ttttagccct gccttcatac     1440

gctatttatt tgcttggtac tgtttctttt gtcgatgctc accctgttgt ttggtgttac     1500

ttctgcaggt cgaagcttga agcaaacatg gcatctagca tggcaccaaa gaaaaaaagg     1560

aaagtttcca aacttgaaaa atttacaaac tgctactccc tttccaagac gcttaggttt     1620

aaagcgatcc ccgttggcaa gacccaagag aatatcgata acaaaagact tctggtcgaa     1680

gatgaaaaaa gggccgaaga ctacaagggg gtcaagaagt tgctcgatcg ctattatctt     1740

tcctttatca acgatgtgct tcattcaatc aaactgaaga acttgaataa ctacattagc     1800

cttttcagaa agaaaacgag gactgaaaag gagaacaagg aacttgagaa tcttgaaata     1860

aaccttcgca aagaaattgc aaaagccttc aaggggaacg aaggatataa atctcttttc     1920

aaaaaagaca ttatagaaac aattttgcct gagtttcttg acgacaagga tgaaattgcg     1980

ctcgtcaata gctttaacgg atttacaact gccttcacag ggttcttcga caatagggag     2040

aatatgttta gcgaggaggc aaaaagcaca tccatcgcat tcagatgcat caatgaaaat     2100

cttacccggt acatatcgaa tatggacata tttgaaaaag tggatgcaat attcgataag     2160

cacgaagtcc aggagataaa ggaaaagata ctgaatagcg actatgatgt cgaagatttt     2220

ttcgaaggtg agttcttcaa ctttgtcctg actcaagaag gcattgatgt ctataatgca     2280

ataattggag gttttgtgac tgagtctggc gagaagataa agggcttgaa cgagtatatc     2340

aatctctaca accagaagac taagcaaaag ttgcctaaat ttaaaccgct ttacaagcaa     2400

gttttgagcg accgggaaag cctttccttt tacggtgaag gatacacgag cgatgaagaa     2460

gtcctcgaag tcttccgcaa cacactcaac aagaactcag aaatcttttc ctcaattaaa     2520

aaattggaga agcttttcaa gaacttcgat gaatactctt cggcggggat ttttgtgaag     2580

aacggcccgg caatttccac aatatctaaa gacattttcg gagaatggaa cgtgataaga     2640

gacaagtgga atgcggagta tgatgacata cacctgaaga agaaggcagt tgtgactgaa     2700

aaatacgaag atgacaggag aaaaagcttt aaaaagatcg ggtccttttc actggaacag     2760

ctgcaggagt atgccgacgc cgatctttcg gttgtcgaaa agctcaaaga aataattatc     2820

cagaaggtcg atgaaatcta caaggtgtac ggctcaagcg agaagctctt tgatgctgac     2880

ttcgtgttgg agaagtctct taaaaaaaac gacgcagtcg tcgcgataat gaaagatttg     2940

ctggattcag tgaaatcctt cgagaattat atcaaagcct tcttcggcga ggggaaggag     3000

acaaacaggg atgagtcctt ctatggagac ttcgttctgg cttacgacat ccttcttaag     3060

gtcgaccaca tctatgacgc aattcggaac tatgtgacgc agaagccgta ttcgaaagat     3120

aagttcaagc tctatttcca aaaccctcaa tttatgggtg ggtgggataa agacaaagag     3180

accgattacc gggcaacaat tttgcggtac gggtctaaat attacctcgc tataatggat     3240

aagaaatacg ctaaatgtct ccagaaaatt gacaaagatg acgtcaacgg caattatgaa     3300

aaaatcaatt ataaactcct tcctggccca aataaaatgc tcccgaaggt gtttttttcc     3360

aaaaagtgga tggcctatta taatccatca gaggatattc agaaaatcta taaaaatggg     3420

acctttaaga agggtgacat gtttaacctg aacgattgcc acaagcttat agattttttc     3480

aaagactcta ttagccgcta tcccaaatgg tctaatgctt atgatttcaa cttctctgaa     3540

actgaaaagt acaaagatat tgcaggattc taccgcgaag ttgaagaaca aggttataag     3600

gtttcctttg agtctgcgtc caagaaagag gtcgataagt tggtcgaaga agggaaattg     3660

tatatgtttc aaatttacaa taaagacttt tccgacaagt cccatggtac acctaatctg     3720

cataccatgt acttcaaact gctgttcgat gagaataatc acggtcagat tcgcctgagc     3780

ggaggggcgg aactcttcat gaggagagca tcgttgaaaa aagaggagct cgtcgtgcat     3840

ccggctaaca gccccattgc taacaagaat ccggataatc caaagaagac tactaccctc     3900

tcctatgacg tctataagga taagagattc tctgaggacc agtacgagtt gcacatccct     3960

attgcgataa ataaatgccc taagaacatc tttaaaatca atactgaggt cagagtcctg     4020

cttaagcacg acgacaaccc gtatgtgatc gggattgata ggggtgaaag gaacttgctt     4080

tatattgtgg ttgtcgatgg aaaaggtaat atagtggaac aatactctct gaatgaaatt     4140

atcaacaact tcaatggcat taggatcaag accgactatc attctctgtt ggacaagaaa     4200

gagaaagagc gcttcgaggc acggcaaaac tggacgtcta ttgagaacat caaggagctt     4260

aaggctggtt acatttctca ggttgtgcac aaaatttgcg aactggtcga gaaatatgat     4320

gccgttatcg cacttgaaga tctcaacagc ggatttaaga attctcgggt gaaagtcgaa     4380

aaacaggtgt atcaaaaatt cgaaaagatg ctgatcgaca agctcaatta tatggttgat     4440

aaaaagagca acccatgcgc cacggggggt gcgcttaagg gctatcagat tacgaacaaa     4500

tttgaatcct tcaagtcaat gtcgacgcaa aatgggttta tattctatat accggcgtgg     4560

cttacatcta aaatagatcc tagcactggg ttcgtgaacc tgctgaaaac caagtacact     4620

tcaatcgcag attctaaaaa atttataagc agcttcgaca gaatcatgta tgtgcccgag     4680

gaagacctct tcgagtttgc ccttgattac aaaaatttct caagaacgga tgcagactac     4740

ataaagaagt ggaagctgta ctcttatggg aaccggattc ggatattcag aaatccgaaa     4800

aaaaacaatg tctttgattg ggaggaagtt tgtcttacct ctgcttacaa agagctgttc     4860

aataaatatg gcattaatta ccagcaaggt gatatccggg cgctcctttg cgaacagtct     4920

gacaaagctt tctattcttc atttatggcg ctcatgtcat tgatgctgca gatgaggaat     4980

agcattacgg ggaggactga tgttgacttt ctgatctcgc ccgtgaaaaa ttctgatgga     5040

atcttctacg attccaggaa ttatgaggcc caggaaaatg ctatccttcc caagaacgca     5100

gacgcaaatg gcgcgtacaa tatagctcgc aaggttttgt gggctatagg ccaattcaag     5160

aaagccgaag acgaaaagct ggacaaagtt aagattgcta tatctaacaa agagtggctt     5220

gagtatgcgc aaacatctgt taaacacaaa cgccccgcgg ctacaaagaa ggctggccag     5280

gcaaagaaga agaagtgagt cgaccgatcg ttcaaacatt tggcaataaa gtttcttaag     5340

attgaatcct gttgccggtc ttgcgatgat tatcatataa tttctgttga attacgttaa     5400

gcatgtaata attaacatgt aatgcatgac gttatttatg agatgggttt ttatgattag     5460

agtcccgcaa ttatacattt aatacgcgat agaaaacaaa atatagcgcg caaactagga     5520

taaattatcg cgcgcggtgt catctatgtt actagatcga tcccgggata tcgcggccgg     5580

tcgttcggct gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag     5640

aatcagggga taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc     5700

gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca     5760

aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt     5820

ttccccctgg aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc     5880

tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc     5940

tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc     6000

ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact     6060

tatcgccact ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg     6120

ctacagagtt cttgaagtgg tggcctaact acggctacac tagaagaaca gtatttggta     6180

tctgcgctct gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca     6240

aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa     6300

aaaaaggatc tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg     6360

aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc     6420

ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg     6480

acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat     6540

ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc ttaccatctg     6600

gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa     6660

taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta tccgcctcca     6720

tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt aatagtttgc     6780

gcaacgttgt tgccattgct acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt     6840

cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa     6900

aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat     6960

cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct     7020

tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga     7080

gttgctcttg cccggcgtca atacgggata ataccgcgcc acatagcaga actttaaaag     7140

tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga     7200

gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca     7260

ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg     7320

cgacacggaa atgttgaata ctcatactct tcctttttca atattattga agcatttatc     7380

agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag     7440

gggttccgcg cacatttccc cgaaaagtgc cacctgacgc gccctgtatc ggcacgtcta     7500

attcggggga tctggatttt agtactggat tttggtttta ggaattagaa attttattga     7560

tagaagtatt ttacaaatac aaatacatac taagggtttc ttatatgctc aacacatgag     7620

cgaaacccta taggaaccct aattccctta tctgggaact actcacacat tattatggag     7680

aaactcgagc ttgtcgatcg acatgatcag ggagccctag attatttgta tagttcatcc     7740

atgcccatta cgtcggtaaa tgccttctgc cactccttga agttaagttc ggtcttggaa     7800

tgtttcaact cagtcttacg gaacacgtac atgggttggt tcttaaggta gttagcggcc     7860

attggtttag cgaatgtgta ggtagtcctg gctgtagagc gatatctctt gccattgcct     7920

gtggtgtaag accatttgaa ggtactaatg atggtcttgt cgttagggta ggttttcttg     7980

gaccggcacc aatcagcggc agttaaggag ttggtcatga caggtccatc agcaggaaag     8040

cctgtcccct tcacttgggc ttctcctttg atgtggctcc cttcgtaagt gtaacggtag     8100

ttgacggtga gcgaagcacc gtcctcaaac tgcattgtcc tgtggacttg gtatccggag     8160

ccatcaacca tggctgcttg gaatggactc attccgtcag ggtatggaag gtattgatgg     8220

aatccgtagc caatgtgtgg caccagaatc catggagaaa actgaagatc acctttggtg     8280

ctcttgaggt tcagctcttc gtatccgtca ttagggttcc cagtgccttg tccgaccata     8340

tcgaagtcaa cgccgttgat ggaaccgaag atgtgaagct catgtgtggc tggaagcgaa     8400

gccatgttat cttcttctcc tttactcacg gaggacgcca tggtggcggg atcgcgccct     8460

atcgttcgta aatggtgaaa attttcagaa aattgctttt gctttaaaag aaatgattta     8520

aattgctgca atagaagtag aatgcttgat tgcttgagat tcgtttgttt tgtatatgtt     8580

gtgttgagag gatcctcaag cttcgacctg cagaagtaac accaaacaac agggtgagca     8640

tcgacaaaag aaacagtacc aagcaaataa atagcgtatg aaggcagggc taaaaaaatc     8700

cacatatagc tgctgcatat gccatcatcc aagtatatca agatcaaaat aattataaaa     8760

catacttgtt tattataata gataggtact caaggttaga gcatatgaat agatgctgca     8820

tatgccatca tgtatatgca tcagtaaaac ccacatcaac atgtatacct atcctagatc     8880

gatatttcca tccatcttaa actcgtaact atgaagatgt atgacacaca catacagttc     8940

caaaattaat aaatacacca ggtagtttga aacagtattc tactccgatc tagaacgaat     9000

gaacgaccgc ccaaccacac cacatcatca caaccaagcg aacaaaaagc atctctgtat     9060

atgcatcagt aaaacccgca tcaacatgta tacctatcct agatcgatat ttccatccat     9120

catcttcaat tcgtaactat gaatatgtat ggcacacaca tacagatcca aaattaataa     9180

atccaccagg tagtttgaaa cagaattcta ctccgatcta gaacgaccgc ccaaccagac     9240

cacatcatca caaccaagac aaaaaaaagc atgaaaagat gacccgacaa acaagtgcac     9300

ggcatatatt gaaataaagg aaaagggcaa accaaaccct atgcaacgaa acaaaaaaaa     9360

tcatgaaatc gatcccgtct gcggaacggc tagagccatc ccaggattcc ccaaagagaa     9420

acactggcaa gttagcaatc agaacgtgtc tgacgtacag gtcgcatccg tgtacgaacg     9480

ctagcagcac ggatctaaca caaacacgga tctaacacaa acatgaacag aagtagaact     9540

accgggccct aaccatggac cggaacgccg atctagagaa ggtagagagg gggggggggg     9600

aggacgagcg gcgtaccttg aagcggaggt gccgacgggt ggatttgggg gagatccact     9660

agttctagag cggccgccac cgcggtggaa ttctcgaggt cctctccaaa tgaaatgaac     9720

ttccttatat agaggaaggg tcttgcgaag gatagtggga ttgtgcgtca tcccttacgt     9780

cagtggagat atcacatcaa tccacttgct ttgaagacgt ggttggaacg tcttcttttt     9840

ccacgatgtt cctcgtgggt gggggtccat ctttgggacc actgtcggta gaggcatctt     9900

gaacgatagc ctttccttta tcgcaatgat ggcatttgta gaagccatct tccttttcta     9960

ctgtcctttc gatgaagtga cagatagctg ggcaatggaa tccgaggagg tttcccgata    10020

ttaccctttg ttgaaaagtc tcaatagccc tctggtcttc tgagactgta tctttgatat    10080

tcttggagta gacgagagtg tcgtgctcca ccatgtatca catcaatcca cttgctttga    10140

agacgtggtt ggaacgtctt ctttttccac gatgttcctc gtgggtgggg gtccatcttt    10200

gggaccactg tcggtagagg catcttgaac gatagccttt cctttatcgc aatgatggca    10260

tttgtagaag ccatcttcct tttctactgt cctttcgatg aagtgacaga tagctgggca    10320

atggaatccg aggaggtttc ccgatattac cctttgttga aaagtctcaa tagccctctg    10380

gtcttctgaa cctgcaggca agcaagcatg aatgcctggg cgcgccgata tc            10432


<210>  38
<211>  3684
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  LbCpf1 RR coding sequence codon optimized

<400>  38
atgtccaaac ttgaaaaatt tacaaactgc tactcccttt ccaagacgct taggtttaaa       60

gcgatccccg ttggcaagac ccaagagaat atcgataaca aaagacttct ggtcgaagat      120

gaaaaaaggg ccgaagacta caagggggtc aagaagttgc tcgatcgcta ttatctttcc      180

tttatcaacg atgtgcttca ttcaatcaaa ctgaagaact tgaataacta cattagcctt      240

ttcagaaaga aaacgaggac tgaaaaggag aacaaggaac ttgagaatct tgaaataaac      300

cttcgcaaag aaattgcaaa agccttcaag gggaacgaag gatataaatc tcttttcaaa      360

aaagacatta tagaaacaat tttgcctgag tttcttgacg acaaggatga aattgcgctc      420

gtcaatagct ttaacggatt tacaactgcc ttcacagggt tcttcgacaa tagggagaat      480

atgtttagcg aggaggcaaa aagcacatcc atcgcattca gatgcatcaa tgaaaatctt      540

acccggtaca tatcgaatat ggacatattt gaaaaagtgg atgcaatatt cgataagcac      600

gaagtccagg agataaagga aaagatactg aatagcgact atgatgtcga agattttttc      660

gaaggtgagt tcttcaactt tgtcctgact caagaaggca ttgatgtcta taatgcaata      720

attggaggtt ttgtgactga gtctggcgag aagataaagg gcttgaacga gtatatcaat      780

ctctacaacc agaagactaa gcaaaagttg cctaaattta aaccgcttta caagcaagtt      840

ttgagcgacc gggaaagcct ttccttttac ggtgaaggat acacgagcga tgaagaagtc      900

ctcgaagtct tccgcaacac actcaacaag aactcagaaa tcttttcctc aattaaaaaa      960

ttggagaagc ttttcaagaa cttcgatgaa tactcttcgg cggggatttt tgtgaagaac     1020

ggcccggcaa tttccacaat atctaaagac attttcggag aatggaacgt gataagagac     1080

aagtggaatg cggagtatga tgacatacac ctgaagaaga aggcagttgt gactgaaaaa     1140

tacgaagatg acaggagaaa aagctttaaa aagatcgggt ccttttcact ggaacagctg     1200

caggagtatg ccgacgccga tctttcggtt gtcgaaaagc tcaaagaaat aattatccag     1260

aaggtcgatg aaatctacaa ggtgtacggc tcaagcgaga agctctttga tgctgacttc     1320

gtgttggaga agtctcttaa aaaaaacgac gcagtcgtcg cgataatgaa agatttgctg     1380

gattcagtga aatccttcga gaattatatc aaagccttct tcggcgaggg gaaggagaca     1440

aacagggatg agtccttcta tggagacttc gttctggctt acgacatcct tcttaaggtc     1500

gaccacatct atgacgcaat tcggaactat gtgacgcaga agccgtattc gaaagataag     1560

ttcaagctct atttccaaaa ccctcaattt atgcgtgggt gggataaaga caaagagacc     1620

gattaccggg caacaatttt gcggtacggg tctaaatatt acctcgctat aatggataag     1680

aaatacgcta aatgtctcca gaaaattgac aaagatgacg tcaacggcaa ttatgaaaaa     1740

atcaattata aactccttcc tggcccaaat aaaatgctcc cgagggtgtt tttttccaaa     1800

aagtggatgg cctattataa tccatcagag gatattcaga aaatctataa aaatgggacc     1860

tttaagaagg gtgacatgtt taacctgaac gattgccaca agcttataga ttttttcaaa     1920

gactctatta gccgctatcc caaatggtct aatgcttatg atttcaactt ctctgaaact     1980

gaaaagtaca aagatattgc aggattctac cgcgaagttg aagaacaagg ttataaggtt     2040

tcctttgagt ctgcgtccaa gaaagaggtc gataagttgg tcgaagaagg gaaattgtat     2100

atgtttcaaa tttacaataa agacttttcc gacaagtccc atggtacacc taatctgcat     2160

accatgtact tcaaactgct gttcgatgag aataatcacg gtcagattcg cctgagcgga     2220

ggggcggaac tcttcatgag gagagcatcg ttgaaaaaag aggagctcgt cgtgcatccg     2280

gctaacagcc ccattgctaa caagaatccg gataatccaa agaagactac taccctctcc     2340

tatgacgtct ataaggataa gagattctct gaggaccagt acgagttgca catccctatt     2400

gcgataaata aatgccctaa gaacatcttt aaaatcaata ctgaggtcag agtcctgctt     2460

aagcacgacg acaacccgta tgtgatcggg attgataggg gtgaaaggaa cttgctttat     2520

attgtggttg tcgatggaaa aggtaatata gtggaacaat actctctgaa tgaaattatc     2580

aacaacttca atggcattag gatcaagacc gactatcatt ctctgttgga caagaaagag     2640

aaagagcgct tcgaggcacg gcaaaactgg acgtctattg agaacatcaa ggagcttaag     2700

gctggttaca tttctcaggt tgtgcacaaa atttgcgaac tggtcgagaa atatgatgcc     2760

gttatcgcac ttgaagatct caacagcgga tttaagaatt ctcgggtgaa agtcgaaaaa     2820

caggtgtatc aaaaattcga aaagatgctg atcgacaagc tcaattatat ggttgataaa     2880

aagagcaacc catgcgccac ggggggtgcg cttaagggct atcagattac gaacaaattt     2940

gaatccttca agtcaatgtc gacgcaaaat gggtttatat tctatatacc ggcgtggctt     3000

acatctaaaa tagatcctag cactgggttc gtgaacctgc tgaaaaccaa gtacacttca     3060

atcgcagatt ctaaaaaatt tataagcagc ttcgacagaa tcatgtatgt gcccgaggaa     3120

gacctcttcg agtttgccct tgattacaaa aatttctcaa gaacggatgc agactacata     3180

aagaagtgga agctgtactc ttatgggaac cggattcgga tattcagaaa tccgaaaaaa     3240

aacaatgtct ttgattggga ggaagtttgt cttacctctg cttacaaaga gctgttcaat     3300

aaatatggca ttaattacca gcaaggtgat atccgggcgc tcctttgcga acagtctgac     3360

aaagctttct attcttcatt tatggcgctc atgtcattga tgctgcagat gaggaatagc     3420

attacgggga ggactgatgt tgactttctg atctcgcccg tgaaaaattc tgatggaatc     3480

ttctacgatt ccaggaatta tgaggcccag gaaaatgcta tccttcccaa gaacgcagac     3540

gcaaatggcg cgtacaatat agctcgcaag gttttgtggg ctataggcca attcaagaaa     3600

gccgaagacg aaaagctgga caaagttaag attgctatat ctaacaaaga gtggcttgag     3660

tatgcgcaaa catctgttaa acac                                            3684


<210>  39
<211>  3684
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  LbCpf1 RVR coding sequence codon optimized

<400>  39
atgtccaaac ttgaaaaatt tacaaactgc tactcccttt ccaagacgct taggtttaaa       60

gcgatccccg ttggcaagac ccaagagaat atcgataaca aaagacttct ggtcgaagat      120

gaaaaaaggg ccgaagacta caagggggtc aagaagttgc tcgatcgcta ttatctttcc      180

tttatcaacg atgtgcttca ttcaatcaaa ctgaagaact tgaataacta cattagcctt      240

ttcagaaaga aaacgaggac tgaaaaggag aacaaggaac ttgagaatct tgaaataaac      300

cttcgcaaag aaattgcaaa agccttcaag gggaacgaag gatataaatc tcttttcaaa      360

aaagacatta tagaaacaat tttgcctgag tttcttgacg acaaggatga aattgcgctc      420

gtcaatagct ttaacggatt tacaactgcc ttcacagggt tcttcgacaa tagggagaat      480

atgtttagcg aggaggcaaa aagcacatcc atcgcattca gatgcatcaa tgaaaatctt      540

acccggtaca tatcgaatat ggacatattt gaaaaagtgg atgcaatatt cgataagcac      600

gaagtccagg agataaagga aaagatactg aatagcgact atgatgtcga agattttttc      660

gaaggtgagt tcttcaactt tgtcctgact caagaaggca ttgatgtcta taatgcaata      720

attggaggtt ttgtgactga gtctggcgag aagataaagg gcttgaacga gtatatcaat      780

ctctacaacc agaagactaa gcaaaagttg cctaaattta aaccgcttta caagcaagtt      840

ttgagcgacc gggaaagcct ttccttttac ggtgaaggat acacgagcga tgaagaagtc      900

ctcgaagtct tccgcaacac actcaacaag aactcagaaa tcttttcctc aattaaaaaa      960

ttggagaagc ttttcaagaa cttcgatgaa tactcttcgg cggggatttt tgtgaagaac     1020

ggcccggcaa tttccacaat atctaaagac attttcggag aatggaacgt gataagagac     1080

aagtggaatg cggagtatga tgacatacac ctgaagaaga aggcagttgt gactgaaaaa     1140

tacgaagatg acaggagaaa aagctttaaa aagatcgggt ccttttcact ggaacagctg     1200

caggagtatg ccgacgccga tctttcggtt gtcgaaaagc tcaaagaaat aattatccag     1260

aaggtcgatg aaatctacaa ggtgtacggc tcaagcgaga agctctttga tgctgacttc     1320

gtgttggaga agtctcttaa aaaaaacgac gcagtcgtcg cgataatgaa agatttgctg     1380

gattcagtga aatccttcga gaattatatc aaagccttct tcggcgaggg gaaggagaca     1440

aacagggatg agtccttcta tggagacttc gttctggctt acgacatcct tcttaaggtc     1500

gaccacatct atgacgcaat tcggaactat gtgacgcaga agccgtattc gaaagataag     1560

ttcaagctct atttccaaaa ccctcaattt atgcgtgggt gggataaaga cgtagagacc     1620

gatcgccggg caacaatttt gcggtacggg tctaaatatt acctcgctat aatggataag     1680

aaatacgcta aatgtctcca gaaaattgac aaagatgacg tcaacggcaa ttatgaaaaa     1740

atcaattata aactccttcc tggcccaaat aaaatgctcc cgaaggtgtt tttttccaaa     1800

aagtggatgg cctattataa tccatcagag gatattcaga aaatctataa aaatgggacc     1860

tttaagaagg gtgacatgtt taacctgaac gattgccaca agcttataga ttttttcaaa     1920

gactctatta gccgctatcc caaatggtct aatgcttatg atttcaactt ctctgaaact     1980

gaaaagtaca aagatattgc aggattctac cgcgaagttg aagaacaagg ttataaggtt     2040

tcctttgagt ctgcgtccaa gaaagaggtc gataagttgg tcgaagaagg gaaattgtat     2100

atgtttcaaa tttacaataa agacttttcc gacaagtccc atggtacacc taatctgcat     2160

accatgtact tcaaactgct gttcgatgag aataatcacg gtcagattcg cctgagcgga     2220

ggggcggaac tcttcatgag gagagcatcg ttgaaaaaag aggagctcgt cgtgcatccg     2280

gctaacagcc ccattgctaa caagaatccg gataatccaa agaagactac taccctctcc     2340

tatgacgtct ataaggataa gagattctct gaggaccagt acgagttgca catccctatt     2400

gcgataaata aatgccctaa gaacatcttt aaaatcaata ctgaggtcag agtcctgctt     2460

aagcacgacg acaacccgta tgtgatcggg attgataggg gtgaaaggaa cttgctttat     2520

attgtggttg tcgatggaaa aggtaatata gtggaacaat actctctgaa tgaaattatc     2580

aacaacttca atggcattag gatcaagacc gactatcatt ctctgttgga caagaaagag     2640

aaagagcgct tcgaggcacg gcaaaactgg acgtctattg agaacatcaa ggagcttaag     2700

gctggttaca tttctcaggt tgtgcacaaa atttgcgaac tggtcgagaa atatgatgcc     2760

gttatcgcac ttgaagatct caacagcgga tttaagaatt ctcgggtgaa agtcgaaaaa     2820

caggtgtatc aaaaattcga aaagatgctg atcgacaagc tcaattatat ggttgataaa     2880

aagagcaacc catgcgccac ggggggtgcg cttaagggct atcagattac gaacaaattt     2940

gaatccttca agtcaatgtc gacgcaaaat gggtttatat tctatatacc ggcgtggctt     3000

acatctaaaa tagatcctag cactgggttc gtgaacctgc tgaaaaccaa gtacacttca     3060

atcgcagatt ctaaaaaatt tataagcagc ttcgacagaa tcatgtatgt gcccgaggaa     3120

gacctcttcg agtttgccct tgattacaaa aatttctcaa gaacggatgc agactacata     3180

aagaagtgga agctgtactc ttatgggaac cggattcgga tattcagaaa tccgaaaaaa     3240

aacaatgtct ttgattggga ggaagtttgt cttacctctg cttacaaaga gctgttcaat     3300

aaatatggca ttaattacca gcaaggtgat atccgggcgc tcctttgcga acagtctgac     3360

aaagctttct attcttcatt tatggcgctc atgtcattga tgctgcagat gaggaatagc     3420

attacgggga ggactgatgt tgactttctg atctcgcccg tgaaaaattc tgatggaatc     3480

ttctacgatt ccaggaatta tgaggcccag gaaaatgcta tccttcccaa gaacgcagac     3540

gcaaatggcg cgtacaatat agctcgcaag gttttgtggg ctataggcca attcaagaaa     3600

gccgaagacg aaaagctgga caaagttaag attgctatat ctaacaaaga gtggcttgag     3660

tatgcgcaaa catctgttaa acac                                            3684


<210>  40
<211>  1228
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  LbCpf1 RR polypeptide

<400>  40

Met Ser Lys Leu Glu Lys Phe Thr Asn Cys Tyr Ser Leu Ser Lys Thr 
1               5                   10                  15      


Leu Arg Phe Lys Ala Ile Pro Val Gly Lys Thr Gln Glu Asn Ile Asp 
            20                  25                  30          


Asn Lys Arg Leu Leu Val Glu Asp Glu Lys Arg Ala Glu Asp Tyr Lys 
        35                  40                  45              


Gly Val Lys Lys Leu Leu Asp Arg Tyr Tyr Leu Ser Phe Ile Asn Asp 
    50                  55                  60                  


Val Leu His Ser Ile Lys Leu Lys Asn Leu Asn Asn Tyr Ile Ser Leu 
65                  70                  75                  80  


Phe Arg Lys Lys Thr Arg Thr Glu Lys Glu Asn Lys Glu Leu Glu Asn 
                85                  90                  95      


Leu Glu Ile Asn Leu Arg Lys Glu Ile Ala Lys Ala Phe Lys Gly Asn 
            100                 105                 110         


Glu Gly Tyr Lys Ser Leu Phe Lys Lys Asp Ile Ile Glu Thr Ile Leu 
        115                 120                 125             


Pro Glu Phe Leu Asp Asp Lys Asp Glu Ile Ala Leu Val Asn Ser Phe 
    130                 135                 140                 


Asn Gly Phe Thr Thr Ala Phe Thr Gly Phe Phe Asp Asn Arg Glu Asn 
145                 150                 155                 160 


Met Phe Ser Glu Glu Ala Lys Ser Thr Ser Ile Ala Phe Arg Cys Ile 
                165                 170                 175     


Asn Glu Asn Leu Thr Arg Tyr Ile Ser Asn Met Asp Ile Phe Glu Lys 
            180                 185                 190         


Val Asp Ala Ile Phe Asp Lys His Glu Val Gln Glu Ile Lys Glu Lys 
        195                 200                 205             


Ile Leu Asn Ser Asp Tyr Asp Val Glu Asp Phe Phe Glu Gly Glu Phe 
    210                 215                 220                 


Phe Asn Phe Val Leu Thr Gln Glu Gly Ile Asp Val Tyr Asn Ala Ile 
225                 230                 235                 240 


Ile Gly Gly Phe Val Thr Glu Ser Gly Glu Lys Ile Lys Gly Leu Asn 
                245                 250                 255     


Glu Tyr Ile Asn Leu Tyr Asn Gln Lys Thr Lys Gln Lys Leu Pro Lys 
            260                 265                 270         


Phe Lys Pro Leu Tyr Lys Gln Val Leu Ser Asp Arg Glu Ser Leu Ser 
        275                 280                 285             


Phe Tyr Gly Glu Gly Tyr Thr Ser Asp Glu Glu Val Leu Glu Val Phe 
    290                 295                 300                 


Arg Asn Thr Leu Asn Lys Asn Ser Glu Ile Phe Ser Ser Ile Lys Lys 
305                 310                 315                 320 


Leu Glu Lys Leu Phe Lys Asn Phe Asp Glu Tyr Ser Ser Ala Gly Ile 
                325                 330                 335     


Phe Val Lys Asn Gly Pro Ala Ile Ser Thr Ile Ser Lys Asp Ile Phe 
            340                 345                 350         


Gly Glu Trp Asn Val Ile Arg Asp Lys Trp Asn Ala Glu Tyr Asp Asp 
        355                 360                 365             


Ile His Leu Lys Lys Lys Ala Val Val Thr Glu Lys Tyr Glu Asp Asp 
    370                 375                 380                 


Arg Arg Lys Ser Phe Lys Lys Ile Gly Ser Phe Ser Leu Glu Gln Leu 
385                 390                 395                 400 


Gln Glu Tyr Ala Asp Ala Asp Leu Ser Val Val Glu Lys Leu Lys Glu 
                405                 410                 415     


Ile Ile Ile Gln Lys Val Asp Glu Ile Tyr Lys Val Tyr Gly Ser Ser 
            420                 425                 430         


Glu Lys Leu Phe Asp Ala Asp Phe Val Leu Glu Lys Ser Leu Lys Lys 
        435                 440                 445             


Asn Asp Ala Val Val Ala Ile Met Lys Asp Leu Leu Asp Ser Val Lys 
    450                 455                 460                 


Ser Phe Glu Asn Tyr Ile Lys Ala Phe Phe Gly Glu Gly Lys Glu Thr 
465                 470                 475                 480 


Asn Arg Asp Glu Ser Phe Tyr Gly Asp Phe Val Leu Ala Tyr Asp Ile 
                485                 490                 495     


Leu Leu Lys Val Asp His Ile Tyr Asp Ala Ile Arg Asn Tyr Val Thr 
            500                 505                 510         


Gln Lys Pro Tyr Ser Lys Asp Lys Phe Lys Leu Tyr Phe Gln Asn Pro 
        515                 520                 525             


Gln Phe Met Arg Gly Trp Asp Lys Asp Lys Glu Thr Asp Tyr Arg Ala 
    530                 535                 540                 


Thr Ile Leu Arg Tyr Gly Ser Lys Tyr Tyr Leu Ala Ile Met Asp Lys 
545                 550                 555                 560 


Lys Tyr Ala Lys Cys Leu Gln Lys Ile Asp Lys Asp Asp Val Asn Gly 
                565                 570                 575     


Asn Tyr Glu Lys Ile Asn Tyr Lys Leu Leu Pro Gly Pro Asn Lys Met 
            580                 585                 590         


Leu Pro Arg Val Phe Phe Ser Lys Lys Trp Met Ala Tyr Tyr Asn Pro 
        595                 600                 605             


Ser Glu Asp Ile Gln Lys Ile Tyr Lys Asn Gly Thr Phe Lys Lys Gly 
    610                 615                 620                 


Asp Met Phe Asn Leu Asn Asp Cys His Lys Leu Ile Asp Phe Phe Lys 
625                 630                 635                 640 


Asp Ser Ile Ser Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp Phe Asn 
                645                 650                 655     


Phe Ser Glu Thr Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr Arg Glu 
            660                 665                 670         


Val Glu Glu Gln Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser Lys Lys 
        675                 680                 685             


Glu Val Asp Lys Leu Val Glu Glu Gly Lys Leu Tyr Met Phe Gln Ile 
    690                 695                 700                 


Tyr Asn Lys Asp Phe Ser Asp Lys Ser His Gly Thr Pro Asn Leu His 
705                 710                 715                 720 


Thr Met Tyr Phe Lys Leu Leu Phe Asp Glu Asn Asn His Gly Gln Ile 
                725                 730                 735     


Arg Leu Ser Gly Gly Ala Glu Leu Phe Met Arg Arg Ala Ser Leu Lys 
            740                 745                 750         


Lys Glu Glu Leu Val Val His Pro Ala Asn Ser Pro Ile Ala Asn Lys 
        755                 760                 765             


Asn Pro Asp Asn Pro Lys Lys Thr Thr Thr Leu Ser Tyr Asp Val Tyr 
    770                 775                 780                 


Lys Asp Lys Arg Phe Ser Glu Asp Gln Tyr Glu Leu His Ile Pro Ile 
785                 790                 795                 800 


Ala Ile Asn Lys Cys Pro Lys Asn Ile Phe Lys Ile Asn Thr Glu Val 
                805                 810                 815     


Arg Val Leu Leu Lys His Asp Asp Asn Pro Tyr Val Ile Gly Ile Asp 
            820                 825                 830         


Arg Gly Glu Arg Asn Leu Leu Tyr Ile Val Val Val Asp Gly Lys Gly 
        835                 840                 845             


Asn Ile Val Glu Gln Tyr Ser Leu Asn Glu Ile Ile Asn Asn Phe Asn 
    850                 855                 860                 


Gly Ile Arg Ile Lys Thr Asp Tyr His Ser Leu Leu Asp Lys Lys Glu 
865                 870                 875                 880 


Lys Glu Arg Phe Glu Ala Arg Gln Asn Trp Thr Ser Ile Glu Asn Ile 
                885                 890                 895     


Lys Glu Leu Lys Ala Gly Tyr Ile Ser Gln Val Val His Lys Ile Cys 
            900                 905                 910         


Glu Leu Val Glu Lys Tyr Asp Ala Val Ile Ala Leu Glu Asp Leu Asn 
        915                 920                 925             


Ser Gly Phe Lys Asn Ser Arg Val Lys Val Glu Lys Gln Val Tyr Gln 
    930                 935                 940                 


Lys Phe Glu Lys Met Leu Ile Asp Lys Leu Asn Tyr Met Val Asp Lys 
945                 950                 955                 960 


Lys Ser Asn Pro Cys Ala Thr Gly Gly Ala Leu Lys Gly Tyr Gln Ile 
                965                 970                 975     


Thr Asn Lys Phe Glu Ser Phe Lys Ser Met Ser Thr Gln Asn Gly Phe 
            980                 985                 990         


Ile Phe Tyr Ile Pro Ala Trp Leu  Thr Ser Lys Ile Asp  Pro Ser Thr 
        995                 1000                 1005             


Gly Phe  Val Asn Leu Leu Lys  Thr Lys Tyr Thr Ser  Ile Ala Asp 
    1010                 1015                 1020             


Ser Lys  Lys Phe Ile Ser Ser  Phe Asp Arg Ile Met  Tyr Val Pro 
    1025                 1030                 1035             


Glu Glu  Asp Leu Phe Glu Phe  Ala Leu Asp Tyr Lys  Asn Phe Ser 
    1040                 1045                 1050             


Arg Thr  Asp Ala Asp Tyr Ile  Lys Lys Trp Lys Leu  Tyr Ser Tyr 
    1055                 1060                 1065             


Gly Asn  Arg Ile Arg Ile Phe  Arg Asn Pro Lys Lys  Asn Asn Val 
    1070                 1075                 1080             


Phe Asp  Trp Glu Glu Val Cys  Leu Thr Ser Ala Tyr  Lys Glu Leu 
    1085                 1090                 1095             


Phe Asn  Lys Tyr Gly Ile Asn  Tyr Gln Gln Gly Asp  Ile Arg Ala 
    1100                 1105                 1110             


Leu Leu  Cys Glu Gln Ser Asp  Lys Ala Phe Tyr Ser  Ser Phe Met 
    1115                 1120                 1125             


Ala Leu  Met Ser Leu Met Leu  Gln Met Arg Asn Ser  Ile Thr Gly 
    1130                 1135                 1140             


Arg Thr  Asp Val Asp Phe Leu  Ile Ser Pro Val Lys  Asn Ser Asp 
    1145                 1150                 1155             


Gly Ile  Phe Tyr Asp Ser Arg  Asn Tyr Glu Ala Gln  Glu Asn Ala 
    1160                 1165                 1170             


Ile Leu  Pro Lys Asn Ala Asp  Ala Asn Gly Ala Tyr  Asn Ile Ala 
    1175                 1180                 1185             


Arg Lys  Val Leu Trp Ala Ile  Gly Gln Phe Lys Lys  Ala Glu Asp 
    1190                 1195                 1200             


Glu Lys  Leu Asp Lys Val Lys  Ile Ala Ile Ser Asn  Lys Glu Trp 
    1205                 1210                 1215             


Leu Glu  Tyr Ala Gln Thr Ser  Val Lys His 
    1220                 1225             


<210>  41
<211>  1228
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  LbCpf1 RVR polypeptide

<400>  41

Met Ser Lys Leu Glu Lys Phe Thr Asn Cys Tyr Ser Leu Ser Lys Thr 
1               5                   10                  15      


Leu Arg Phe Lys Ala Ile Pro Val Gly Lys Thr Gln Glu Asn Ile Asp 
            20                  25                  30          


Asn Lys Arg Leu Leu Val Glu Asp Glu Lys Arg Ala Glu Asp Tyr Lys 
        35                  40                  45              


Gly Val Lys Lys Leu Leu Asp Arg Tyr Tyr Leu Ser Phe Ile Asn Asp 
    50                  55                  60                  


Val Leu His Ser Ile Lys Leu Lys Asn Leu Asn Asn Tyr Ile Ser Leu 
65                  70                  75                  80  


Phe Arg Lys Lys Thr Arg Thr Glu Lys Glu Asn Lys Glu Leu Glu Asn 
                85                  90                  95      


Leu Glu Ile Asn Leu Arg Lys Glu Ile Ala Lys Ala Phe Lys Gly Asn 
            100                 105                 110         


Glu Gly Tyr Lys Ser Leu Phe Lys Lys Asp Ile Ile Glu Thr Ile Leu 
        115                 120                 125             


Pro Glu Phe Leu Asp Asp Lys Asp Glu Ile Ala Leu Val Asn Ser Phe 
    130                 135                 140                 


Asn Gly Phe Thr Thr Ala Phe Thr Gly Phe Phe Asp Asn Arg Glu Asn 
145                 150                 155                 160 


Met Phe Ser Glu Glu Ala Lys Ser Thr Ser Ile Ala Phe Arg Cys Ile 
                165                 170                 175     


Asn Glu Asn Leu Thr Arg Tyr Ile Ser Asn Met Asp Ile Phe Glu Lys 
            180                 185                 190         


Val Asp Ala Ile Phe Asp Lys His Glu Val Gln Glu Ile Lys Glu Lys 
        195                 200                 205             


Ile Leu Asn Ser Asp Tyr Asp Val Glu Asp Phe Phe Glu Gly Glu Phe 
    210                 215                 220                 


Phe Asn Phe Val Leu Thr Gln Glu Gly Ile Asp Val Tyr Asn Ala Ile 
225                 230                 235                 240 


Ile Gly Gly Phe Val Thr Glu Ser Gly Glu Lys Ile Lys Gly Leu Asn 
                245                 250                 255     


Glu Tyr Ile Asn Leu Tyr Asn Gln Lys Thr Lys Gln Lys Leu Pro Lys 
            260                 265                 270         


Phe Lys Pro Leu Tyr Lys Gln Val Leu Ser Asp Arg Glu Ser Leu Ser 
        275                 280                 285             


Phe Tyr Gly Glu Gly Tyr Thr Ser Asp Glu Glu Val Leu Glu Val Phe 
    290                 295                 300                 


Arg Asn Thr Leu Asn Lys Asn Ser Glu Ile Phe Ser Ser Ile Lys Lys 
305                 310                 315                 320 


Leu Glu Lys Leu Phe Lys Asn Phe Asp Glu Tyr Ser Ser Ala Gly Ile 
                325                 330                 335     


Phe Val Lys Asn Gly Pro Ala Ile Ser Thr Ile Ser Lys Asp Ile Phe 
            340                 345                 350         


Gly Glu Trp Asn Val Ile Arg Asp Lys Trp Asn Ala Glu Tyr Asp Asp 
        355                 360                 365             


Ile His Leu Lys Lys Lys Ala Val Val Thr Glu Lys Tyr Glu Asp Asp 
    370                 375                 380                 


Arg Arg Lys Ser Phe Lys Lys Ile Gly Ser Phe Ser Leu Glu Gln Leu 
385                 390                 395                 400 


Gln Glu Tyr Ala Asp Ala Asp Leu Ser Val Val Glu Lys Leu Lys Glu 
                405                 410                 415     


Ile Ile Ile Gln Lys Val Asp Glu Ile Tyr Lys Val Tyr Gly Ser Ser 
            420                 425                 430         


Glu Lys Leu Phe Asp Ala Asp Phe Val Leu Glu Lys Ser Leu Lys Lys 
        435                 440                 445             


Asn Asp Ala Val Val Ala Ile Met Lys Asp Leu Leu Asp Ser Val Lys 
    450                 455                 460                 


Ser Phe Glu Asn Tyr Ile Lys Ala Phe Phe Gly Glu Gly Lys Glu Thr 
465                 470                 475                 480 


Asn Arg Asp Glu Ser Phe Tyr Gly Asp Phe Val Leu Ala Tyr Asp Ile 
                485                 490                 495     


Leu Leu Lys Val Asp His Ile Tyr Asp Ala Ile Arg Asn Tyr Val Thr 
            500                 505                 510         


Gln Lys Pro Tyr Ser Lys Asp Lys Phe Lys Leu Tyr Phe Gln Asn Pro 
        515                 520                 525             


Gln Phe Met Arg Gly Trp Asp Lys Asp Val Glu Thr Asp Arg Arg Ala 
    530                 535                 540                 


Thr Ile Leu Arg Tyr Gly Ser Lys Tyr Tyr Leu Ala Ile Met Asp Lys 
545                 550                 555                 560 


Lys Tyr Ala Lys Cys Leu Gln Lys Ile Asp Lys Asp Asp Val Asn Gly 
                565                 570                 575     


Asn Tyr Glu Lys Ile Asn Tyr Lys Leu Leu Pro Gly Pro Asn Lys Met 
            580                 585                 590         


Leu Pro Lys Val Phe Phe Ser Lys Lys Trp Met Ala Tyr Tyr Asn Pro 
        595                 600                 605             


Ser Glu Asp Ile Gln Lys Ile Tyr Lys Asn Gly Thr Phe Lys Lys Gly 
    610                 615                 620                 


Asp Met Phe Asn Leu Asn Asp Cys His Lys Leu Ile Asp Phe Phe Lys 
625                 630                 635                 640 


Asp Ser Ile Ser Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp Phe Asn 
                645                 650                 655     


Phe Ser Glu Thr Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr Arg Glu 
            660                 665                 670         


Val Glu Glu Gln Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser Lys Lys 
        675                 680                 685             


Glu Val Asp Lys Leu Val Glu Glu Gly Lys Leu Tyr Met Phe Gln Ile 
    690                 695                 700                 


Tyr Asn Lys Asp Phe Ser Asp Lys Ser His Gly Thr Pro Asn Leu His 
705                 710                 715                 720 


Thr Met Tyr Phe Lys Leu Leu Phe Asp Glu Asn Asn His Gly Gln Ile 
                725                 730                 735     


Arg Leu Ser Gly Gly Ala Glu Leu Phe Met Arg Arg Ala Ser Leu Lys 
            740                 745                 750         


Lys Glu Glu Leu Val Val His Pro Ala Asn Ser Pro Ile Ala Asn Lys 
        755                 760                 765             


Asn Pro Asp Asn Pro Lys Lys Thr Thr Thr Leu Ser Tyr Asp Val Tyr 
    770                 775                 780                 


Lys Asp Lys Arg Phe Ser Glu Asp Gln Tyr Glu Leu His Ile Pro Ile 
785                 790                 795                 800 


Ala Ile Asn Lys Cys Pro Lys Asn Ile Phe Lys Ile Asn Thr Glu Val 
                805                 810                 815     


Arg Val Leu Leu Lys His Asp Asp Asn Pro Tyr Val Ile Gly Ile Asp 
            820                 825                 830         


Arg Gly Glu Arg Asn Leu Leu Tyr Ile Val Val Val Asp Gly Lys Gly 
        835                 840                 845             


Asn Ile Val Glu Gln Tyr Ser Leu Asn Glu Ile Ile Asn Asn Phe Asn 
    850                 855                 860                 


Gly Ile Arg Ile Lys Thr Asp Tyr His Ser Leu Leu Asp Lys Lys Glu 
865                 870                 875                 880 


Lys Glu Arg Phe Glu Ala Arg Gln Asn Trp Thr Ser Ile Glu Asn Ile 
                885                 890                 895     


Lys Glu Leu Lys Ala Gly Tyr Ile Ser Gln Val Val His Lys Ile Cys 
            900                 905                 910         


Glu Leu Val Glu Lys Tyr Asp Ala Val Ile Ala Leu Glu Asp Leu Asn 
        915                 920                 925             


Ser Gly Phe Lys Asn Ser Arg Val Lys Val Glu Lys Gln Val Tyr Gln 
    930                 935                 940                 


Lys Phe Glu Lys Met Leu Ile Asp Lys Leu Asn Tyr Met Val Asp Lys 
945                 950                 955                 960 


Lys Ser Asn Pro Cys Ala Thr Gly Gly Ala Leu Lys Gly Tyr Gln Ile 
                965                 970                 975     


Thr Asn Lys Phe Glu Ser Phe Lys Ser Met Ser Thr Gln Asn Gly Phe 
            980                 985                 990         


Ile Phe Tyr Ile Pro Ala Trp Leu  Thr Ser Lys Ile Asp  Pro Ser Thr 
        995                 1000                 1005             


Gly Phe  Val Asn Leu Leu Lys  Thr Lys Tyr Thr Ser  Ile Ala Asp 
    1010                 1015                 1020             


Ser Lys  Lys Phe Ile Ser Ser  Phe Asp Arg Ile Met  Tyr Val Pro 
    1025                 1030                 1035             


Glu Glu  Asp Leu Phe Glu Phe  Ala Leu Asp Tyr Lys  Asn Phe Ser 
    1040                 1045                 1050             


Arg Thr  Asp Ala Asp Tyr Ile  Lys Lys Trp Lys Leu  Tyr Ser Tyr 
    1055                 1060                 1065             


Gly Asn  Arg Ile Arg Ile Phe  Arg Asn Pro Lys Lys  Asn Asn Val 
    1070                 1075                 1080             


Phe Asp  Trp Glu Glu Val Cys  Leu Thr Ser Ala Tyr  Lys Glu Leu 
    1085                 1090                 1095             


Phe Asn  Lys Tyr Gly Ile Asn  Tyr Gln Gln Gly Asp  Ile Arg Ala 
    1100                 1105                 1110             


Leu Leu  Cys Glu Gln Ser Asp  Lys Ala Phe Tyr Ser  Ser Phe Met 
    1115                 1120                 1125             


Ala Leu  Met Ser Leu Met Leu  Gln Met Arg Asn Ser  Ile Thr Gly 
    1130                 1135                 1140             


Arg Thr  Asp Val Asp Phe Leu  Ile Ser Pro Val Lys  Asn Ser Asp 
    1145                 1150                 1155             


Gly Ile  Phe Tyr Asp Ser Arg  Asn Tyr Glu Ala Gln  Glu Asn Ala 
    1160                 1165                 1170             


Ile Leu  Pro Lys Asn Ala Asp  Ala Asn Gly Ala Tyr  Asn Ile Ala 
    1175                 1180                 1185             


Arg Lys  Val Leu Trp Ala Ile  Gly Gln Phe Lys Lys  Ala Glu Asp 
    1190                 1195                 1200             


Glu Lys  Leu Asp Lys Val Lys  Ile Ala Ile Ser Asn  Lys Glu Trp 
    1205                 1210                 1215             


Leu Glu  Tyr Ala Gln Thr Ser  Val Lys His 
    1220                 1225             


<210>  42
<211>  27
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  SV40 NLS

<400>  42
atggcaccaa agaaaaaaag gaaagtt                                           27


<210>  43
<211>  48
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Nucleoplasmin NLS

<400>  43
aaacgccccg cggctacaaa gaaggctggc caggcaaaga agaagaag                    48


<210>  44
<211>  3849
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pGEP296 Ribozyme strategy vector

<400>  44
ctgacgcgcc ctgtagcggc ctgcagtgca gcgtgacccg gtcgtgcccc tctctagaga       60

taatgagcat tgcatgtcta agttataaaa aattaccaca tatttttttt gtcacacttg      120

tttgaagtgc agtttatcta tctttataca tatatttaaa ctttactcta cgaataatat      180

aatctatagt actacaataa tatcagtgtt ttagagaatc atataaatga acagttagac      240

atggtctaaa ggacaattga gtattttgac aacaggactc tacagtttta tctttttagt      300

gtgcatgtgt tctccttttt ttttgcaaat agcttcacct atataatact tcatccattt      360

tattagtaca tccatttagg gtttagggtt aatggttttt atagactaat ttttttagta      420

catctatttt attctatttt agcctctaaa ttaagaaaac taaaactcta ttttagtttt      480

tttatttaat aatttagata taaaatagaa taaaataaag tgactaaaaa ttaaacaaat      540

accctttaag aaattaaaaa aactaaggaa acatttttct tgtttcgagt agataatgcc      600

agcctgttaa acgccgtcga tcgacgagtc taacggacac caaccagcga accagcagcg      660

tcgcgtcggg ccaagcgaag cagacggcac ggcatctctg tcgctgcctc tggacccctc      720

tcgagagttc cgctccaccg ttggacttgc tccgctgtcg gcatccagaa attgcgtggc      780

ggagcggcag acgtgagccg gcacggcagg cggcctcctc ctcctctcac ggcaccggca      840

gctacggggg attcctttcc caccgctcct tcgctttccc ttcctcgccc gccgtaataa      900

atagacaccc cctccacacc ctctttcccc aacctcgtgt tgttcggagc gcacacacac      960

acaaccagat ctcccccaaa tccacccgtc ggcacctccg cttcaaggta cgccgctcgt     1020

cctccccccc cccccctctc taccttctct agatcggcgt tccggtccat ggttagggcc     1080

cggtagttct acttctgttc atgtttgtgt tagatccgtg tttgtgttag atccgtgctg     1140

ctagcgttcg tacacggatg cgacctgtac gtcagacacg ttctgattgc taacttgcca     1200

gtgtttctct ttggggaatc ctgggatggc tctagccgtt ccgcagacgg gatcgatcta     1260

ggataggtat acatgttgat gtgggtttta ctgatgcata tacatgatgg catatgcagc     1320

atctattcat atgctctaac cttgagtacc tatctattat aataaacaag tatgttttat     1380

aattattttg atcttgatat acttggatga tggcatatgc agcagctata tgtggatttt     1440

tttagccctg ccttcatacg ctatttattt gcttggtact gtttcttttg tcgatgctca     1500

ccctgttgtt tggtgttact tctgcaggga tccaaattac tgatgagtcc gtgaggacga     1560

aacgagtaag ctcgtctaat ttctactaag tgtagatgag acggagctca gtctgaccgc     1620

ggcgtctctg gccggcatgg tcccagcctc ctcgctggcg ccggctgggc aacatgcttc     1680

ggcatggcga atgggaccga tcgttcaaac atttggcaat aaagtttctt aagattgaat     1740

cctgttgccg gtcttgcgat gattatcata taatttctgt tgaattacgt taagcatgta     1800

ataattaaca tgtaatgcat gacgttattt atgagatggg tttttatgat tagagtcccg     1860

caattataca tttaatacgc gatagaaaac aaaatatagc gcgcaaacta ggataaatta     1920

tcgcgcgcgg tgtcatctat gttactagat cgatcgtcgt tcggctgcgg cgagcggtat     1980

cagctcactc aaaggcggta atacggttat ccacagaatc aggggataac gcaggaaaga     2040

acatgtgagc aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt     2100

ttttccatag gctccgcccc cctgacgagc atcacaaaaa tcgacgctca agtcagaggt     2160

ggcgaaaccc gacaggacta taaagatacc aggcgtttcc ccctggaagc tccctcgtgc     2220

gctctcctgt tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa     2280

gcgtggcgct ttctcatagc tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct     2340

ccaagctggg ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta     2400

actatcgtct tgagtccaac ccggtaagac acgacttatc gccactggca gcagccactg     2460

gtaacaggat tagcagagcg aggtatgtag gcggtgctac agagttcttg aagtggtggc     2520

ctaactacgg ctacactaga agaacagtat ttggtatctg cgctctgctg aagccagtta     2580

ccttcggaaa aagagttggt agctcttgat ccggcaaaca aaccaccgct ggtagcggtg     2640

gtttttttgt ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt     2700

tgatcttttc tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg     2760

tcatgagatt atcaaaaagg atcttcacct agatcctttt aaattaaaaa tgaagtttta     2820

aatcaatcta aagtatatat gagtaaactt ggtctgacag ttaccaatgc ttaatcagtg     2880

aggcacctat ctcagcgatc tgtctatttc gttcatccat agttgcctga ctccccgtcg     2940

tgtagataac tacgatacgg gagggcttac catctggccc cagtgctgca atgataccgc     3000

gagacccacg ctcaccggct ccagatttat cagcaataaa ccagccagcc ggaagggccg     3060

agcgcagaag tggtcctgca actttatccg cctccatcca gtctattaat tgttgccggg     3120

aagctagagt aagtagttcg ccagttaata gtttgcgcaa cgttgttgcc attgctacag     3180

gcatcgtggt gtcacgctcg tcgtttggta tggcttcatt cagctccggt tcccaacgat     3240

caaggcgagt tacatgatcc cccatgttgt gcaaaaaagc ggttagctcc ttcggtcctc     3300

cgatcgttgt cagaagtaag ttggccgcag tgttatcact catggttatg gcagcactgc     3360

ataattctct tactgtcatg ccatccgtaa gatgcttttc tgtgactggt gagtactcaa     3420

ccaagtcatt ctgagaatag tgtatgcggc gaccgagttg ctcttgcccg gcgtcaatac     3480

gggataatac cgcgccacat agcagaactt taaaagtgct catcattgga aaacgttctt     3540

cggggcgaaa actctcaagg atcttaccgc tgttgagatc cagttcgatg taacccactc     3600

gtgcacccaa ctgatcttca gcatctttta ctttcaccag cgtttctggg tgagcaaaaa     3660

caggaaggca aaatgccgca aaaaagggaa taagggcgac acggaaatgt tgaatactca     3720

tactcttcct ttttcaatat tattgaagca tttatcaggg ttattgtctc atgagcggat     3780

acatatttga atgtatttag aaaaataaac aaataggggt tccgcgcaca tttccccgaa     3840

aagtgccac                                                             3849


<210>  45
<211>  3834
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pGEP374 vector with rice HDV-like ribozyme sequence

<400>  45
ctgacgcgcc ctgtagcggc ctgcagtgca gcgtgacccg gtcgtgcccc tctctagaga       60

taatgagcat tgcatgtcta agttataaaa aattaccaca tatttttttt gtcacacttg      120

tttgaagtgc agtttatcta tctttataca tatatttaaa ctttactcta cgaataatat      180

aatctatagt actacaataa tatcagtgtt ttagagaatc atataaatga acagttagac      240

atggtctaaa ggacaattga gtattttgac aacaggactc tacagtttta tctttttagt      300

gtgcatgtgt tctccttttt ttttgcaaat agcttcacct atataatact tcatccattt      360

tattagtaca tccatttagg gtttagggtt aatggttttt atagactaat ttttttagta      420

catctatttt attctatttt agcctctaaa ttaagaaaac taaaactcta ttttagtttt      480

tttatttaat aatttagata taaaatagaa taaaataaag tgactaaaaa ttaaacaaat      540

accctttaag aaattaaaaa aactaaggaa acatttttct tgtttcgagt agataatgcc      600

agcctgttaa acgccgtcga tcgacgagtc taacggacac caaccagcga accagcagcg      660

tcgcgtcggg ccaagcgaag cagacggcac ggcatctctg tcgctgcctc tggacccctc      720

tcgagagttc cgctccaccg ttggacttgc tccgctgtcg gcatccagaa attgcgtggc      780

ggagcggcag acgtgagccg gcacggcagg cggcctcctc ctcctctcac ggcaccggca      840

gctacggggg attcctttcc caccgctcct tcgctttccc ttcctcgccc gccgtaataa      900

atagacaccc cctccacacc ctctttcccc aacctcgtgt tgttcggagc gcacacacac      960

acaaccagat ctcccccaaa tccacccgtc ggcacctccg cttcaaggta cgccgctcgt     1020

cctccccccc cccccctctc taccttctct agatcggcgt tccggtccat ggttagggcc     1080

cggtagttct acttctgttc atgtttgtgt tagatccgtg tttgtgttag atccgtgctg     1140

ctagcgttcg tacacggatg cgacctgtac gtcagacacg ttctgattgc taacttgcca     1200

gtgtttctct ttggggaatc ctgggatggc tctagccgtt ccgcagacgg gatcgatcta     1260

ggataggtat acatgttgat gtgggtttta ctgatgcata tacatgatgg catatgcagc     1320

atctattcat atgctctaac cttgagtacc tatctattat aataaacaag tatgttttat     1380

aattattttg atcttgatat acttggatga tggcatatgc agcagctata tgtggatttt     1440

tttagccctg ccttcatacg ctatttattt gcttggtact gtttcttttg tcgatgctca     1500

ccctgttgtt tggtgttact tctgcaggga tccaaattac tgatgagtcc gtgaggacga     1560

aacgagtaag ctcgtctaat ttctactaag tgtagatgag acggagctca gtctgaccgc     1620

ggcgtctctc cgccaacact gccaatgccg gtcccaagcc cggataaaag tggagggggc     1680

ggcgatcgtt caaacatttg gcaataaagt ttcttaagat tgaatcctgt tgccggtctt     1740

gcgatgatta tcatataatt tctgttgaat tacgttaagc atgtaataat taacatgtaa     1800

tgcatgacgt tatttatgag atgggttttt atgattagag tcccgcaatt atacatttaa     1860

tacgcgatag aaaacaaaat atagcgcgca aactaggata aattatcgcg cgcggtgtca     1920

tctatgttac tagatcgatc gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg     1980

cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag     2040

gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc     2100

gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag     2160

gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga     2220

ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc     2280

atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg     2340

tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt     2400

ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca     2460

gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca     2520

ctagaagaac agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag     2580

ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca     2640

agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg     2700

ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg agattatcaa     2760

aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca atctaaagta     2820

tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca cctatctcag     2880

cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag ataactacga     2940

tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac ccacgctcac     3000

cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc agaagtggtc     3060

ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct agagtaagta     3120

gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc gtggtgtcac     3180

gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat     3240

gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa     3300

gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg     3360

tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag     3420

aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat aataccgcgc     3480

cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct     3540

caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat     3600

cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg     3660

ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc     3720

aatattattg aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta     3780

tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccac           3834


<210>  46
<211>  3844
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pGEP375 vector with sunflower HDV-like sequence

<400>  46
ctgacgcgcc ctgtagcggc ctgcagtgca gcgtgacccg gtcgtgcccc tctctagaga       60

taatgagcat tgcatgtcta agttataaaa aattaccaca tatttttttt gtcacacttg      120

tttgaagtgc agtttatcta tctttataca tatatttaaa ctttactcta cgaataatat      180

aatctatagt actacaataa tatcagtgtt ttagagaatc atataaatga acagttagac      240

atggtctaaa ggacaattga gtattttgac aacaggactc tacagtttta tctttttagt      300

gtgcatgtgt tctccttttt ttttgcaaat agcttcacct atataatact tcatccattt      360

tattagtaca tccatttagg gtttagggtt aatggttttt atagactaat ttttttagta      420

catctatttt attctatttt agcctctaaa ttaagaaaac taaaactcta ttttagtttt      480

tttatttaat aatttagata taaaatagaa taaaataaag tgactaaaaa ttaaacaaat      540

accctttaag aaattaaaaa aactaaggaa acatttttct tgtttcgagt agataatgcc      600

agcctgttaa acgccgtcga tcgacgagtc taacggacac caaccagcga accagcagcg      660

tcgcgtcggg ccaagcgaag cagacggcac ggcatctctg tcgctgcctc tggacccctc      720

tcgagagttc cgctccaccg ttggacttgc tccgctgtcg gcatccagaa attgcgtggc      780

ggagcggcag acgtgagccg gcacggcagg cggcctcctc ctcctctcac ggcaccggca      840

gctacggggg attcctttcc caccgctcct tcgctttccc ttcctcgccc gccgtaataa      900

atagacaccc cctccacacc ctctttcccc aacctcgtgt tgttcggagc gcacacacac      960

acaaccagat ctcccccaaa tccacccgtc ggcacctccg cttcaaggta cgccgctcgt     1020

cctccccccc cccccctctc taccttctct agatcggcgt tccggtccat ggttagggcc     1080

cggtagttct acttctgttc atgtttgtgt tagatccgtg tttgtgttag atccgtgctg     1140

ctagcgttcg tacacggatg cgacctgtac gtcagacacg ttctgattgc taacttgcca     1200

gtgtttctct ttggggaatc ctgggatggc tctagccgtt ccgcagacgg gatcgatcta     1260

ggataggtat acatgttgat gtgggtttta ctgatgcata tacatgatgg catatgcagc     1320

atctattcat atgctctaac cttgagtacc tatctattat aataaacaag tatgttttat     1380

aattattttg atcttgatat acttggatga tggcatatgc agcagctata tgtggatttt     1440

tttagccctg ccttcatacg ctatttattt gcttggtact gtttcttttg tcgatgctca     1500

ccctgttgtt tggtgttact tctgcaggga tccaaattac tgatgagtcc gtgaggacga     1560

aacgagtaag ctcgtctaat ttctactaag tgtagatgag acggagctca gtctgaccgc     1620

ggcgtctctg cggggggcgt cagtcctact ctgcacctcc tcgtggtgtc gcctgggaac     1680

cctctttcgc aacgatcgtt caaacatttg gcaataaagt ttcttaagat tgaatcctgt     1740

tgccggtctt gcgatgatta tcatataatt tctgttgaat tacgttaagc atgtaataat     1800

taacatgtaa tgcatgacgt tatttatgag atgggttttt atgattagag tcccgcaatt     1860

atacatttaa tacgcgatag aaaacaaaat atagcgcgca aactaggata aattatcgcg     1920

cgcggtgtca tctatgttac tagatcgatc gtcgttcggc tgcggcgagc ggtatcagct     1980

cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg     2040

tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc     2100

cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga     2160

aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct     2220

cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg     2280

gcgctttctc atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag     2340

ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat     2400

cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac     2460

aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac     2520

tacggctaca ctagaagaac agtatttggt atctgcgctc tgctgaagcc agttaccttc     2580

ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt     2640

tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc     2700

ttttctacgg ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg     2760

agattatcaa aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca     2820

atctaaagta tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca     2880

cctatctcag cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag     2940

ataactacga tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac     3000

ccacgctcac cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc     3060

agaagtggtc ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct     3120

agagtaagta gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc     3180

gtggtgtcac gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg     3240

cgagttacat gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc     3300

gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat     3360

tctcttactg tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag     3420

tcattctgag aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat     3480

aataccgcgc cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg     3540

cgaaaactct caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca     3600

cccaactgat cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga     3660

aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc     3720

ttcctttttc aatattattg aagcatttat cagggttatt gtctcatgag cggatacata     3780

tttgaatgta tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg     3840

ccac                                                                  3844


<210>  47
<211>  3867
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pGEP375 vector with sunflower HDV-like sequence_long

<400>  47
ctgacgcgcc ctgtagcggc ctgcagtgca gcgtgacccg gtcgtgcccc tctctagaga       60

taatgagcat tgcatgtcta agttataaaa aattaccaca tatttttttt gtcacacttg      120

tttgaagtgc agtttatcta tctttataca tatatttaaa ctttactcta cgaataatat      180

aatctatagt actacaataa tatcagtgtt ttagagaatc atataaatga acagttagac      240

atggtctaaa ggacaattga gtattttgac aacaggactc tacagtttta tctttttagt      300

gtgcatgtgt tctccttttt ttttgcaaat agcttcacct atataatact tcatccattt      360

tattagtaca tccatttagg gtttagggtt aatggttttt atagactaat ttttttagta      420

catctatttt attctatttt agcctctaaa ttaagaaaac taaaactcta ttttagtttt      480

tttatttaat aatttagata taaaatagaa taaaataaag tgactaaaaa ttaaacaaat      540

accctttaag aaattaaaaa aactaaggaa acatttttct tgtttcgagt agataatgcc      600

agcctgttaa acgccgtcga tcgacgagtc taacggacac caaccagcga accagcagcg      660

tcgcgtcggg ccaagcgaag cagacggcac ggcatctctg tcgctgcctc tggacccctc      720

tcgagagttc cgctccaccg ttggacttgc tccgctgtcg gcatccagaa attgcgtggc      780

ggagcggcag acgtgagccg gcacggcagg cggcctcctc ctcctctcac ggcaccggca      840

gctacggggg attcctttcc caccgctcct tcgctttccc ttcctcgccc gccgtaataa      900

atagacaccc cctccacacc ctctttcccc aacctcgtgt tgttcggagc gcacacacac      960

acaaccagat ctcccccaaa tccacccgtc ggcacctccg cttcaaggta cgccgctcgt     1020

cctccccccc cccccctctc taccttctct agatcggcgt tccggtccat ggttagggcc     1080

cggtagttct acttctgttc atgtttgtgt tagatccgtg tttgtgttag atccgtgctg     1140

ctagcgttcg tacacggatg cgacctgtac gtcagacacg ttctgattgc taacttgcca     1200

gtgtttctct ttggggaatc ctgggatggc tctagccgtt ccgcagacgg gatcgatcta     1260

ggataggtat acatgttgat gtgggtttta ctgatgcata tacatgatgg catatgcagc     1320

atctattcat atgctctaac cttgagtacc tatctattat aataaacaag tatgttttat     1380

aattattttg atcttgatat acttggatga tggcatatgc agcagctata tgtggatttt     1440

tttagccctg ccttcatacg ctatttattt gcttggtact gtttcttttg tcgatgctca     1500

ccctgttgtt tggtgttact tctgcaggga tccaaattac tgatgagtcc gtgaggacga     1560

aacgagtaag ctcgtctaat ttctactaag tgtagatgag acggagctca gtctgaccgc     1620

ggcgtctctg cggggggcgt cagtcctact ctgcacctcc tcgtggtgtc gcctgggaac     1680

cctctttcgc aagaaagagg agccaagcag agaggcgatc gttcaaacat ttggcaataa     1740

agtttcttaa gattgaatcc tgttgccggt cttgcgatga ttatcatata atttctgttg     1800

aattacgtta agcatgtaat aattaacatg taatgcatga cgttatttat gagatgggtt     1860

tttatgatta gagtcccgca attatacatt taatacgcga tagaaaacaa aatatagcgc     1920

gcaaactagg ataaattatc gcgcgcggtg tcatctatgt tactagatcg atcgtcgttc     1980

ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag     2040

gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa     2100

aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc     2160

gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc     2220

ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg     2280

cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg tatctcagtt     2340

cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc     2400

gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc     2460

cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag     2520

agttcttgaa gtggtggcct aactacggct acactagaag aacagtattt ggtatctgcg     2580

ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa     2640

ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag     2700

gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact     2760

cacgttaagg gattttggtc atgagattat caaaaaggat cttcacctag atccttttaa     2820

attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt     2880

accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag     2940

ttgcctgact ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca     3000

gtgctgcaat gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc     3060

agccagccgg aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt     3120

ctattaattg ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg     3180

ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca     3240

gctccggttc ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg     3300

ttagctcctt cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca     3360

tggttatggc agcactgcat aattctctta ctgtcatgcc atccgtaaga tgcttttctg     3420

tgactggtga gtactcaacc aagtcattct gagaatagtg tatgcggcga ccgagttgct     3480

cttgcccggc gtcaatacgg gataataccg cgccacatag cagaacttta aaagtgctca     3540

tcattggaaa acgttcttcg gggcgaaaac tctcaaggat cttaccgctg ttgagatcca     3600

gttcgatgta acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg     3660

tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac     3720

ggaaatgttg aatactcata ctcttccttt ttcaatatta ttgaagcatt tatcagggtt     3780

attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa ataggggttc     3840

cgcgcacatt tccccgaaaa gtgccac                                         3867


<210>  48
<211>  3861
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pGEP376 vector with artichoke HDV-like sequence

<400>  48
ctgacgcgcc ctgtagcggc ctgcagtgca gcgtgacccg gtcgtgcccc tctctagaga       60

taatgagcat tgcatgtcta agttataaaa aattaccaca tatttttttt gtcacacttg      120

tttgaagtgc agtttatcta tctttataca tatatttaaa ctttactcta cgaataatat      180

aatctatagt actacaataa tatcagtgtt ttagagaatc atataaatga acagttagac      240

atggtctaaa ggacaattga gtattttgac aacaggactc tacagtttta tctttttagt      300

gtgcatgtgt tctccttttt ttttgcaaat agcttcacct atataatact tcatccattt      360

tattagtaca tccatttagg gtttagggtt aatggttttt atagactaat ttttttagta      420

catctatttt attctatttt agcctctaaa ttaagaaaac taaaactcta ttttagtttt      480

tttatttaat aatttagata taaaatagaa taaaataaag tgactaaaaa ttaaacaaat      540

accctttaag aaattaaaaa aactaaggaa acatttttct tgtttcgagt agataatgcc      600

agcctgttaa acgccgtcga tcgacgagtc taacggacac caaccagcga accagcagcg      660

tcgcgtcggg ccaagcgaag cagacggcac ggcatctctg tcgctgcctc tggacccctc      720

tcgagagttc cgctccaccg ttggacttgc tccgctgtcg gcatccagaa attgcgtggc      780

ggagcggcag acgtgagccg gcacggcagg cggcctcctc ctcctctcac ggcaccggca      840

gctacggggg attcctttcc caccgctcct tcgctttccc ttcctcgccc gccgtaataa      900

atagacaccc cctccacacc ctctttcccc aacctcgtgt tgttcggagc gcacacacac      960

acaaccagat ctcccccaaa tccacccgtc ggcacctccg cttcaaggta cgccgctcgt     1020

cctccccccc cccccctctc taccttctct agatcggcgt tccggtccat ggttagggcc     1080

cggtagttct acttctgttc atgtttgtgt tagatccgtg tttgtgttag atccgtgctg     1140

ctagcgttcg tacacggatg cgacctgtac gtcagacacg ttctgattgc taacttgcca     1200

gtgtttctct ttggggaatc ctgggatggc tctagccgtt ccgcagacgg gatcgatcta     1260

ggataggtat acatgttgat gtgggtttta ctgatgcata tacatgatgg catatgcagc     1320

atctattcat atgctctaac cttgagtacc tatctattat aataaacaag tatgttttat     1380

aattattttg atcttgatat acttggatga tggcatatgc agcagctata tgtggatttt     1440

tttagccctg ccttcatacg ctatttattt gcttggtact gtttcttttg tcgatgctca     1500

ccctgttgtt tggtgttact tctgcaggga tccaaattac tgatgagtcc gtgaggacga     1560

aacgagtaag ctcgtctaat ttctactaag tgtagatgag acggagctca gtctgaccgc     1620

ggcgtctctg gcgtcagtcc tactctgcac ctcctcgtgg tgtcgcctgg gaaccctctt     1680

tcacaagaaa gaggagccaa gcagagaggc gatcgttcaa acatttggca ataaagtttc     1740

ttaagattga atcctgttgc cggtcttgcg atgattatca tataatttct gttgaattac     1800

gttaagcatg taataattaa catgtaatgc atgacgttat ttatgagatg ggtttttatg     1860

attagagtcc cgcaattata catttaatac gcgatagaaa acaaaatata gcgcgcaaac     1920

taggataaat tatcgcgcgc ggtgtcatct atgttactag atcgatcgtc gttcggctgc     1980

ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata     2040

acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg     2100

cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct     2160

caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa     2220

gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc     2280

tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt     2340

aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg     2400

ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg     2460

cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct     2520

tgaagtggtg gcctaactac ggctacacta gaagaacagt atttggtatc tgcgctctgc     2580

tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg     2640

ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc     2700

aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt     2760

aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa     2820

aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat     2880

gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct     2940

gactccccgt cgtgtagata actacgatac gggagggctt accatctggc cccagtgctg     3000

caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag     3060

ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc cagtctatta     3120

attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg     3180

ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg     3240

gttcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa gcggttagct     3300

ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca ctcatggtta     3360

tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt tctgtgactg     3420

gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc     3480

cggcgtcaat acgggataat accgcgccac atagcagaac tttaaaagtg ctcatcattg     3540

gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgttgaga tccagttcga     3600

tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg     3660

ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat     3720

gttgaatact catactcttc ctttttcaat attattgaag catttatcag ggttattgtc     3780

tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca     3840

catttccccg aaaagtgcca c                                               3861


<210>  49
<211>  24
<212>  DNA
<213>  Zea mays

<400>  49
ctcgtcacga ttcccctctc ctgg                                              24


<210>  50
<211>  24
<212>  DNA
<213>  Zea mays

<400>  50
cccacctgaa aagttcgacc agga                                              24


<210>  51
<211>  24
<212>  DNA
<213>  Zea mays

<400>  51
tgtgtggtca cacttgccag ccag                                              24


<210>  52
<211>  24
<212>  DNA
<213>  Zea mays

<400>  52
gtggtcggat ttctggcatc gctg                                              24


<210>  53
<211>  23
<212>  DNA
<213>  Zea mays

<400>  53
gtctatgtcg atgaccagca gat                                               23


<210>  54
<211>  24
<212>  DNA
<213>  Zea mays

<400>  54
cctctcctgg tcgaactttt cagg                                              24


<210>  55
<211>  24
<212>  DNA
<213>  Zea mays

<400>  55
accaggagag gggaatcgtg acga                                              24


<210>  56
<211>  24
<212>  DNA
<213>  Zea mays

<400>  56
ttatagcacg acaaaagtaa aaat                                              24


<210>  57
<211>  24
<212>  DNA
<213>  Zea mays

<400>  57
attgtcgtca tcatcggcta acat                                              24


<210>  58
<211>  24
<212>  DNA
<213>  Zea mays

<400>  58
tactttgact tttcccttaa tgac                                              24


<210>  59
<211>  24
<212>  DNA
<213>  Zea mays

<400>  59
gggccggtca taaagcagct ctca                                              24


<210>  60
<211>  24
<212>  DNA
<213>  Zea mays

<400>  60
acggatagcg ctcctcgttg gcgc                                              24


<210>  61
<211>  24
<212>  DNA
<213>  Zea mays

<400>  61
acaatgttag ccgatgatga cgac                                              24


<210>  62
<211>  24
<212>  DNA
<213>  Zea mays

<400>  62
ggtaaccgtc ctccgtacgt cgtc                                              24


<210>  63
<211>  24
<212>  DNA
<213>  Zea mays

<400>  63
cctctctacg acgacgtacg gagg                                              24


<210>  64
<211>  24
<212>  DNA
<213>  Zea mays

<400>  64
gttacgggca gtgcagttga gcaa                                              24


<210>  65
<211>  24
<212>  DNA
<213>  Zea mays

<400>  65
ctgactgtcc agtggccacc taga                                              24


<210>  66

<400>  66
000

<210>  67
<211>  1226
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  p35S+Adh1 intron promoter sequence

<400>  67
gttcagaaga ccagagggct attgagactt ttcaacaaag ggtaatatcg ggaaacctcc       60

tcggattcca ttgcccagct atctgtcact tcatcgaaag gacagtagaa aaggaagatg      120

gcttctacaa atgccatcat tgcgataaag gaaaggctat cgttcaagat gcctctaccg      180

acagtggtcc caaagatgga cccccaccca cgaggaacat cgtggaaaaa gaagacgttc      240

caaccacgtc ttcaaagcaa gtggattgat gtgatacatg gtggagcacg acactctcgt      300

ctactccaag aatatcaaag atacagtctc agaagaccag agggctattg agacttttca      360

acaaagggta atatcgggaa acctcctcgg attccattgc ccagctatct gtcacttcat      420

cgaaaggaca gtagaaaagg aagatggctt ctacaaatgc catcattgcg ataaaggaaa      480

ggctatcgtt caagatgcct ctaccgacag tggtcccaaa gatggacccc cacccacgag      540

gaacatcgtg gaaaaagaag acgttccaac cacgtcttca aagcaagtgg attgatgtga      600

tatctccact gacgtaaggg atgacgcaca atcccactat ccttcgcaag acccttcctc      660

tatataagga agttcatttc atttggagag ggtccgcctt gtttctcctc tgtctcttga      720

tctgactaat cttggtttat gattcgttga gtaattttgg ggaaagcttc gtccacagtt      780

ttttttcgat gaacagtgcc gcagtggcgc tgatcttgta tgctatcctg caatcgtggt      840

gaacttattt cttttatatc ctttactccc atgaaaaggc tagtaatctt tctcgatgta      900

acatcgtcca gcactgctat taccgtgtgg tccatccgac agtctggctg aacacatcat      960

acgatctatg gagcaaaaat ctatcttccc tgttctttaa tgaaggacgt cattttcatt     1020

agtatgatct aggaatgttg caacttgcaa ggaggcgttt ctttctttga atttaactaa     1080

ctcgttgagt ggccctgttt ctcggacgta aggcctttgc tgctccacac atgtccattc     1140

gaattttacc gtgtttagca agggcgaaaa gtttgcatct tgatgattta gcttgactat     1200

gcgattgctt tcctggaccc gtgcag                                          1226


<210>  68
<211>  5770
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pGEP471 mRNA array vector construct

<400>  68
ctgacgcgcc ctgtagcggc acgtcacctg catcaggaat tcaggagaag aactcgagag       60

ggaattgcag atcatgaggc agatggctat ttttgtgtca catatgcgca aaaagagagg      120

ctatatttgt gtccctaggt tcttcgttgt attgcagttt ccatatcaat ctgacttggt      180

cgcatgagaa attgatggtt aaataatttg aatctctcat gtagtatcaa ctattagata      240

ttattttcac caaatatatt tccatcggag aagaagaggc tacagaggaa gcagaagaga      300

ggggtgggag aatttttaca cttttgtaca cccacttaaa cagcaaaatc cgtatgaaaa      360

caggcccacc aaaacaatgc cacgataaca atccgtagaa acaaaagctt catttaacag      420

cggcgcaaca aagcacgctt atccatggta gttgtagtcc gtatgcgatc caaagatcac      480

gattcacgcg tgacggacgg acgacgcgtg ccacaccaca actaacggca tccatggtag      540

ttgtagtccg tatgcgatcc aaagatcacg attcacgcgt gacggacgga cgacgcgcgc      600

cacaccacaa ctaacagcgt gagccagcgt ccaaactccg gatggcaacg gggacgaaac      660

ccgtcgggta gtcactgccc aaacccgtcc ccgcaacctt catcccaaac ccgtccccgt      720

ttccggtcgc gggtttcagt tttctaccag acccgtcccc atcgggtttt tcatccccgt      780

cgggaaatcc gaacccgcca gcatttcagc accaagccaa agttgcagca gcaacatgaa      840

taaaaaacaa cccgtttcaa caccaagata aaacaaaaca ttataattta gacaacattt      900

cacacgtata acaataacat atagttctca catataacaa caccatttca cacataaaac      960

aacaccattt gggataaaaa tatgggctat atcaggccat ttttatgggc catattgagt     1020

tttcgtgggt ttcacaggta ccggatttgt agaatgctga accgggtttg aaccgtaaaa     1080

tccgcgggta ttgaatttga cccaatcccg tcgtcccctg gtggggtaaa aacaccatct     1140

tgagtccaaa cggccaccaa ccaaactccg acggcaacaa acaaacggcg ttgctttgct     1200

cctcggtatc tccgtgaccg ctcaatctcc cggctgtttc cccggaattg cgtggactct     1260

ctcatccaca cgcaaaccgc ctctccctcc tctctcgtcc tatccgcccc ggtgccgtag     1320

cctcacggga ctcttcttcc tcccttgcta taaaatcccc gccccctcct gtctcctctc     1380

cacacatcca aactctcaat cgcaccgaga aaaatctcct agcgatcgaa gcgaagcctc     1440

tcccgatcct ctcaaggtac gcccgtttcc cgtcgatcct cctccttccg ttcgtgttct     1500

gtagccgatc gattcgattc ccttacaccc gttcgtgttc tctcgtggat cgatcgattg     1560

tttgttgcta gaaggaactc gtagatctgg cgtttatgaa ctgtgattcg ggttagtcca     1620

gatcgattca ggtcggtcgt cgttgagcct ctcggctatg tctggattat cgtgtagatc     1680

tgctggttca gttgattatg ttcttctagg agtaatttcg ttgggtcagc gcgatttctg     1740

cttaatctat gctgcttatt gcgcctgtac ctatctacta agctatgtgc acctgtaatt     1800

ttgctagatt attcgttcat cctcgtagtt ggtttgtcac agtaatccgt atgggttctg     1860

acgatgttat tgttggtcat acctaggctt ctccagattt tattttgtta aaattggata     1920

gatctgctac tgatagttga tgatggaatt tggtgctgaa tctatgctat ttattgcgcc     1980

tatacctgat ctatcgggct atgtacggct gtagtttact ggattattcg ttcatcctcg     2040

gtagttggtt catcgtttgg gttctgacga taatattgtt gattatgcgt aggcttctgc     2100

agattgttgt taaaattgga tacatcggtt actgatggtt gatgatagat ttgtgctgaa     2160

cctatctgtt tattgctcct atacctgatc tatagggcta tgtatgcctg taatttacca     2220

gattattcgt tcatcctcgt agttggttca tctctataat tcgtatgggt tcttatgatg     2280

ttatcgttga ttatgcctag tcttatacag attattgtgt caagattgaa tatacctgct     2340

actgatcggt gataatttgg ttagtagttt gcaatctgct aggaacacgt taccactgta     2400

atctgtaaac atggtttgcc agagtagttt gttctactac tcttgatatg gttgctgatt     2460

ttagtcgcct ccttttggat catgtattga tgtccttgca gatttccgtg tacttacccc     2520

ggcttttgtg tacttcgtgt taacagctct agaggatcct ctcaacacaa catatacaaa     2580

acaaacgaat ctcaagcaat caagcattct acttctattg cagcaattta aatcatttct     2640

tttaaagcaa aagcaatttt ctgaaaattt tcaccattta cgaacgatag ggcgcgatcc     2700

cgccaccatg gtgagcaagg gcgaggaggt catcaaagag ttcatgcgct tcaaggtgcg     2760

catggagggc tccatgaacg gccacgagtt cgagatcgag ggcgagggcg agggccgccc     2820

ctacgagggc acccagaccg ccaagctgaa ggtgaccaag ggcggccccc tgcccttcgc     2880

ctgggacatc ctgtcccccc agttcatgta cggctccaag gcgtacgtga agcaccccgc     2940

cgacatcccc gattacaaga agctgtcctt ccccgagggc ttcaagtggg agcgcgtgat     3000

gaacttcgag gacggcggtc tggtgaccgt gacccaggac tcctccctgc aggacggcac     3060

gctgatctac aaggtgaaga tgcgcggcac caacttcccc cccgacggcc ccgtaatgca     3120

gaagaagacc atgggctggg aggcctccac cgagcgcctg tacccccgcg acggcgtgct     3180

gaagggcgag atccaccagg ccctgaagct gaaggacggc ggccactacc tggtggagtt     3240

caagaccatc tacatggcca agaagcccgt gcaactgccc ggctactact acgtggacac     3300

caagctggac atcacctccc acaacgagga ctacaccatc gtggaacagt acgagcgctc     3360

cgagggccgc caccacctgt tcctgtacgg catggacgag ctgtacaagt ctagaggtac     3420

ctgataattt ctactaagtg tagatctcgt cacgattccc ctctcctgga atttctactc     3480

ttgtagattg tgtggtcaca cttgccagcc agaatttcta ctcttgtaga tgtctatgtc     3540

gatgaccagc agattaattt ctactaagtg tagatcgaat ttccccgatc gttcaaacat     3600

ttggcaataa agtttcttaa gattgaatcc tgttgccggt cttgcgatga ttatcatata     3660

atttctgttg aattacgtta agcatgtaat aattaacatg taatgcatga cgttatttat     3720

gagatgggtt tttatgatta gagtcccgca attatacatt taatacgcga tagaaaacaa     3780

aatatagcgc gcaaactagg ataaattatc gcgcgcggtg tcatctatgt tactagatcg     3840

ctcgacgcgg ccgccatggc ctctagtgga tcaggtgtcg ttcggctgcg gcgagcggta     3900

tcagctcact caaaggcggt aatacggtta tccacagaat caggggataa cgcaggaaag     3960

aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg     4020

tttttccata ggctccgccc ccctgacgag catcacaaaa atcgacgctc aagtcagagg     4080

tggcgaaacc cgacaggact ataaagatac caggcgtttc cccctggaag ctccctcgtg     4140

cgctctcctg ttccgaccct gccgcttacc ggatacctgt ccgcctttct cccttcggga     4200

agcgtggcgc tttctcatag ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc     4260

tccaagctgg gctgtgtgca cgaacccccc gttcagcccg accgctgcgc cttatccggt     4320

aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc agcagccact     4380

ggtaacagga ttagcagagc gaggtatgta ggcggtgcta cagagttctt gaagtggtgg     4440

cctaactacg gctacactag aaggacagta tttggtatct gcgctctgct gaagccagtt     4500

accttcggaa aaagagttgg tagctcttga tccggcaaac aaaccaccgc tggtagcggt     4560

ggtttttttg tttgcaagca gcagattacg cgcagaaaaa aaggatctca agaagatcct     4620

ttgatctttt ctacggggtc tgacgctcag tggaacgaaa actcacgtta agggattttg     4680

gtcatgagat tatcaaaaag gatcttcacc tagatccttt taaattaaaa atgaagtttt     4740

aaatcaatct aaagtatata tgagtaaact tggtctgaca gttaccaatg cttaatcagt     4800

gaggcaccta tctcagcgat ctgtctattt cgttcatcca tagttgcctg actccccgtc     4860

gtgtagataa ctacgatacg ggagggctta ccatctggcc ccagtgctgc aatgataccg     4920

cgagacccac gctcaccggc tccagattta tcagcaataa accagccagc cggaagggcc     4980

gagcgcagaa gtggtcctgc aactttatcc gcctccatcc agtctattaa ttgttgccgg     5040

gaagctagag taagtagttc gccagttaat agtttgcgca acgttgttgc cattgctaca     5100

ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat tcagctccgg ttcccaacga     5160

tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag cggttagctc cttcggtcct     5220

ccgatcgttg tcagaagtaa gttggccgca gtgttatcac tcatggttat ggcagcactg     5280

cataattctc ttactgtcat gccatccgta agatgctttt ctgtgactgg tgagtactca     5340

accaagtcat tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaata     5400

cgggataata ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg aaaacgttct     5460

tcggggcgaa aactctcaag gatcttaccg ctgttgagat ccagttcgat gtaacccact     5520

cgtgcaccca actgatcttc agcatctttt actttcacca gcgtttctgg gtgagcaaaa     5580

acaggaaggc aaaatgccgc aaaaaaggga ataagggcga cacggaaatg ttgaatactc     5640

atactcttcc tttttcaata ttattgaagc atttatcagg gttattgtct catgagcgga     5700

tacatatttg aatgtattta gaaaaataaa caaatagggg ttccgcgcac atttccccga     5760

aaagtgccac                                                            5770


<210>  69
<211>  5770
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pGEP472 mRNA array vector construct

<400>  69
ctgacgcgcc ctgtagcggc acgtcacctg catcaggaat tcaggagaag aactcgagag       60

ggaattgcag atcatgaggc agatggctat ttttgtgtca catatgcgca aaaagagagg      120

ctatatttgt gtccctaggt tcttcgttgt attgcagttt ccatatcaat ctgacttggt      180

cgcatgagaa attgatggtt aaataatttg aatctctcat gtagtatcaa ctattagata      240

ttattttcac caaatatatt tccatcggag aagaagaggc tacagaggaa gcagaagaga      300

ggggtgggag aatttttaca cttttgtaca cccacttaaa cagcaaaatc cgtatgaaaa      360

caggcccacc aaaacaatgc cacgataaca atccgtagaa acaaaagctt catttaacag      420

cggcgcaaca aagcacgctt atccatggta gttgtagtcc gtatgcgatc caaagatcac      480

gattcacgcg tgacggacgg acgacgcgtg ccacaccaca actaacggca tccatggtag      540

ttgtagtccg tatgcgatcc aaagatcacg attcacgcgt gacggacgga cgacgcgcgc      600

cacaccacaa ctaacagcgt gagccagcgt ccaaactccg gatggcaacg gggacgaaac      660

ccgtcgggta gtcactgccc aaacccgtcc ccgcaacctt catcccaaac ccgtccccgt      720

ttccggtcgc gggtttcagt tttctaccag acccgtcccc atcgggtttt tcatccccgt      780

cgggaaatcc gaacccgcca gcatttcagc accaagccaa agttgcagca gcaacatgaa      840

taaaaaacaa cccgtttcaa caccaagata aaacaaaaca ttataattta gacaacattt      900

cacacgtata acaataacat atagttctca catataacaa caccatttca cacataaaac      960

aacaccattt gggataaaaa tatgggctat atcaggccat ttttatgggc catattgagt     1020

tttcgtgggt ttcacaggta ccggatttgt agaatgctga accgggtttg aaccgtaaaa     1080

tccgcgggta ttgaatttga cccaatcccg tcgtcccctg gtggggtaaa aacaccatct     1140

tgagtccaaa cggccaccaa ccaaactccg acggcaacaa acaaacggcg ttgctttgct     1200

cctcggtatc tccgtgaccg ctcaatctcc cggctgtttc cccggaattg cgtggactct     1260

ctcatccaca cgcaaaccgc ctctccctcc tctctcgtcc tatccgcccc ggtgccgtag     1320

cctcacggga ctcttcttcc tcccttgcta taaaatcccc gccccctcct gtctcctctc     1380

cacacatcca aactctcaat cgcaccgaga aaaatctcct agcgatcgaa gcgaagcctc     1440

tcccgatcct ctcaaggtac gcccgtttcc cgtcgatcct cctccttccg ttcgtgttct     1500

gtagccgatc gattcgattc ccttacaccc gttcgtgttc tctcgtggat cgatcgattg     1560

tttgttgcta gaaggaactc gtagatctgg cgtttatgaa ctgtgattcg ggttagtcca     1620

gatcgattca ggtcggtcgt cgttgagcct ctcggctatg tctggattat cgtgtagatc     1680

tgctggttca gttgattatg ttcttctagg agtaatttcg ttgggtcagc gcgatttctg     1740

cttaatctat gctgcttatt gcgcctgtac ctatctacta agctatgtgc acctgtaatt     1800

ttgctagatt attcgttcat cctcgtagtt ggtttgtcac agtaatccgt atgggttctg     1860

acgatgttat tgttggtcat acctaggctt ctccagattt tattttgtta aaattggata     1920

gatctgctac tgatagttga tgatggaatt tggtgctgaa tctatgctat ttattgcgcc     1980

tatacctgat ctatcgggct atgtacggct gtagtttact ggattattcg ttcatcctcg     2040

gtagttggtt catcgtttgg gttctgacga taatattgtt gattatgcgt aggcttctgc     2100

agattgttgt taaaattgga tacatcggtt actgatggtt gatgatagat ttgtgctgaa     2160

cctatctgtt tattgctcct atacctgatc tatagggcta tgtatgcctg taatttacca     2220

gattattcgt tcatcctcgt agttggttca tctctataat tcgtatgggt tcttatgatg     2280

ttatcgttga ttatgcctag tcttatacag attattgtgt caagattgaa tatacctgct     2340

actgatcggt gataatttgg ttagtagttt gcaatctgct aggaacacgt taccactgta     2400

atctgtaaac atggtttgcc agagtagttt gttctactac tcttgatatg gttgctgatt     2460

ttagtcgcct ccttttggat catgtattga tgtccttgca gatttccgtg tacttacccc     2520

ggcttttgtg tacttcgtgt taacagctct agaggatcct ctcaacacaa catatacaaa     2580

acaaacgaat ctcaagcaat caagcattct acttctattg cagcaattta aatcatttct     2640

tttaaagcaa aagcaatttt ctgaaaattt tcaccattta cgaacgatag ggcgcgatcc     2700

cgccaccatg gtgagcaagg gcgaggaggt catcaaagag ttcatgcgct tcaaggtgcg     2760

catggagggc tccatgaacg gccacgagtt cgagatcgag ggcgagggcg agggccgccc     2820

ctacgagggc acccagaccg ccaagctgaa ggtgaccaag ggcggccccc tgcccttcgc     2880

ctgggacatc ctgtcccccc agttcatgta cggctccaag gcgtacgtga agcaccccgc     2940

cgacatcccc gattacaaga agctgtcctt ccccgagggc ttcaagtggg agcgcgtgat     3000

gaacttcgag gacggcggtc tggtgaccgt gacccaggac tcctccctgc aggacggcac     3060

gctgatctac aaggtgaaga tgcgcggcac caacttcccc cccgacggcc ccgtaatgca     3120

gaagaagacc atgggctggg aggcctccac cgagcgcctg tacccccgcg acggcgtgct     3180

gaagggcgag atccaccagg ccctgaagct gaaggacggc ggccactacc tggtggagtt     3240

caagaccatc tacatggcca agaagcccgt gcaactgccc ggctactact acgtggacac     3300

caagctggac atcacctccc acaacgagga ctacaccatc gtggaacagt acgagcgctc     3360

cgagggccgc caccacctgt tcctgtacgg catggacgag ctgtacaagt ctagaggtac     3420

ctgataattt ctactaagtg tagatgtcta tgtcgatgac cagcagataa tttctactct     3480

tgtagatctc gtcacgattc ccctctcctg gaatttctac tcttgtagat tgtgtggtca     3540

cacttgccag ccagtaattt ctactaagtg tagatcgaat ttccccgatc gttcaaacat     3600

ttggcaataa agtttcttaa gattgaatcc tgttgccggt cttgcgatga ttatcatata     3660

atttctgttg aattacgtta agcatgtaat aattaacatg taatgcatga cgttatttat     3720

gagatgggtt tttatgatta gagtcccgca attatacatt taatacgcga tagaaaacaa     3780

aatatagcgc gcaaactagg ataaattatc gcgcgcggtg tcatctatgt tactagatcg     3840

ctcgacgcgg ccgccatggc ctctagtgga tcaggtgtcg ttcggctgcg gcgagcggta     3900

tcagctcact caaaggcggt aatacggtta tccacagaat caggggataa cgcaggaaag     3960

aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg     4020

tttttccata ggctccgccc ccctgacgag catcacaaaa atcgacgctc aagtcagagg     4080

tggcgaaacc cgacaggact ataaagatac caggcgtttc cccctggaag ctccctcgtg     4140

cgctctcctg ttccgaccct gccgcttacc ggatacctgt ccgcctttct cccttcggga     4200

agcgtggcgc tttctcatag ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc     4260

tccaagctgg gctgtgtgca cgaacccccc gttcagcccg accgctgcgc cttatccggt     4320

aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc agcagccact     4380

ggtaacagga ttagcagagc gaggtatgta ggcggtgcta cagagttctt gaagtggtgg     4440

cctaactacg gctacactag aaggacagta tttggtatct gcgctctgct gaagccagtt     4500

accttcggaa aaagagttgg tagctcttga tccggcaaac aaaccaccgc tggtagcggt     4560

ggtttttttg tttgcaagca gcagattacg cgcagaaaaa aaggatctca agaagatcct     4620

ttgatctttt ctacggggtc tgacgctcag tggaacgaaa actcacgtta agggattttg     4680

gtcatgagat tatcaaaaag gatcttcacc tagatccttt taaattaaaa atgaagtttt     4740

aaatcaatct aaagtatata tgagtaaact tggtctgaca gttaccaatg cttaatcagt     4800

gaggcaccta tctcagcgat ctgtctattt cgttcatcca tagttgcctg actccccgtc     4860

gtgtagataa ctacgatacg ggagggctta ccatctggcc ccagtgctgc aatgataccg     4920

cgagacccac gctcaccggc tccagattta tcagcaataa accagccagc cggaagggcc     4980

gagcgcagaa gtggtcctgc aactttatcc gcctccatcc agtctattaa ttgttgccgg     5040

gaagctagag taagtagttc gccagttaat agtttgcgca acgttgttgc cattgctaca     5100

ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat tcagctccgg ttcccaacga     5160

tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag cggttagctc cttcggtcct     5220

ccgatcgttg tcagaagtaa gttggccgca gtgttatcac tcatggttat ggcagcactg     5280

cataattctc ttactgtcat gccatccgta agatgctttt ctgtgactgg tgagtactca     5340

accaagtcat tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaata     5400

cgggataata ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg aaaacgttct     5460

tcggggcgaa aactctcaag gatcttaccg ctgttgagat ccagttcgat gtaacccact     5520

cgtgcaccca actgatcttc agcatctttt actttcacca gcgtttctgg gtgagcaaaa     5580

acaggaaggc aaaatgccgc aaaaaaggga ataagggcga cacggaaatg ttgaatactc     5640

atactcttcc tttttcaata ttattgaagc atttatcagg gttattgtct catgagcgga     5700

tacatatttg aatgtattta gaaaaataaa caaatagggg ttccgcgcac atttccccga     5760

aaagtgccac                                                            5770


<210>  70
<211>  3938
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pGEP473 ribozyme array vector construct

<400>  70
ctgacgcgcc ctgtagcggc ctgcagtgca gcgtgacccg gtcgtgcccc tctctagaga       60

taatgagcat tgcatgtcta agttataaaa aattaccaca tatttttttt gtcacacttg      120

tttgaagtgc agtttatcta tctttataca tatatttaaa ctttactcta cgaataatat      180

aatctatagt actacaataa tatcagtgtt ttagagaatc atataaatga acagttagac      240

atggtctaaa ggacaattga gtattttgac aacaggactc tacagtttta tctttttagt      300

gtgcatgtgt tctccttttt ttttgcaaat agcttcacct atataatact tcatccattt      360

tattagtaca tccatttagg gtttagggtt aatggttttt atagactaat ttttttagta      420

catctatttt attctatttt agcctctaaa ttaagaaaac taaaactcta ttttagtttt      480

tttatttaat aatttagata taaaatagaa taaaataaag tgactaaaaa ttaaacaaat      540

accctttaag aaattaaaaa aactaaggaa acatttttct tgtttcgagt agataatgcc      600

agcctgttaa acgccgtcga tcgacgagtc taacggacac caaccagcga accagcagcg      660

tcgcgtcggg ccaagcgaag cagacggcac ggcatctctg tcgctgcctc tggacccctc      720

tcgagagttc cgctccaccg ttggacttgc tccgctgtcg gcatccagaa attgcgtggc      780

ggagcggcag acgtgagccg gcacggcagg cggcctcctc ctcctctcac ggcaccggca      840

gctacggggg attcctttcc caccgctcct tcgctttccc ttcctcgccc gccgtaataa      900

atagacaccc cctccacacc ctctttcccc aacctcgtgt tgttcggagc gcacacacac      960

acaaccagat ctcccccaaa tccacccgtc ggcacctccg cttcaaggta cgccgctcgt     1020

cctccccccc cccccctctc taccttctct agatcggcgt tccggtccat ggttagggcc     1080

cggtagttct acttctgttc atgtttgtgt tagatccgtg tttgtgttag atccgtgctg     1140

ctagcgttcg tacacggatg cgacctgtac gtcagacacg ttctgattgc taacttgcca     1200

gtgtttctct ttggggaatc ctgggatggc tctagccgtt ccgcagacgg gatcgatcta     1260

ggataggtat acatgttgat gtgggtttta ctgatgcata tacatgatgg catatgcagc     1320

atctattcat atgctctaac cttgagtacc tatctattat aataaacaag tatgttttat     1380

aattattttg atcttgatat acttggatga tggcatatgc agcagctata tgtggatttt     1440

tttagccctg ccttcatacg ctatttattt gcttggtact gtttcttttg tcgatgctca     1500

ccctgttgtt tggtgttact tctgcaggga tccaaattac tgatgagtcc gtgaggacga     1560

aacgagtaag ctcgtctaat ttctactaag tgtagatctc gtcacgattc ccctctcctg     1620

gaatttctac tcttgtagat tgtgtggtca cacttgccag ccagaatttc tactcttgta     1680

gatgtctatg tcgatgacca gcagatggcg tcagtcctac tctgcacctc ctcgtggtgt     1740

cgcctgggaa ccctctttca caagaaagag gagccaagca gagaggcgat cgttcaaaca     1800

tttggcaata aagtttctta agattgaatc ctgttgccgg tcttgcgatg attatcatat     1860

aatttctgtt gaattacgtt aagcatgtaa taattaacat gtaatgcatg acgttattta     1920

tgagatgggt ttttatgatt agagtcccgc aattatacat ttaatacgcg atagaaaaca     1980

aaatatagcg cgcaaactag gataaattat cgcgcgcggt gtcatctatg ttactagatc     2040

gatcgtcgtt cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc     2100

cacagaatca ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag     2160

gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca     2220

tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca     2280

ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg     2340

atacctgtcc gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag     2400

gtatctcagt tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt     2460

tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca     2520

cgacttatcg ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg     2580

cggtgctaca gagttcttga agtggtggcc taactacggc tacactagaa gaacagtatt     2640

tggtatctgc gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc     2700

cggcaaacaa accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg     2760

cagaaaaaaa ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg     2820

gaacgaaaac tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta     2880

gatcctttta aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg     2940

gtctgacagt taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg     3000

ttcatccata gttgcctgac tccccgtcgt gtagataact acgatacggg agggcttacc     3060

atctggcccc agtgctgcaa tgataccgcg agacccacgc tcaccggctc cagatttatc     3120

agcaataaac cagccagccg gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc     3180

ctccatccag tctattaatt gttgccggga agctagagta agtagttcgc cagttaatag     3240

tttgcgcaac gttgttgcca ttgctacagg catcgtggtg tcacgctcgt cgtttggtat     3300

ggcttcattc agctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg     3360

caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt     3420

gttatcactc atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag     3480

atgcttttct gtgactggtg agtactcaac caagtcattc tgagaatagt gtatgcggcg     3540

accgagttgc tcttgcccgg cgtcaatacg ggataatacc gcgccacata gcagaacttt     3600

aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct     3660

gttgagatcc agttcgatgt aacccactcg tgcacccaac tgatcttcag catcttttac     3720

tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat     3780

aagggcgaca cggaaatgtt gaatactcat actcttcctt tttcaatatt attgaagcat     3840

ttatcagggt tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca     3900

aataggggtt ccgcgcacat ttccccgaaa agtgccac                             3938


<210>  71
<211>  3938
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pGEP474 ribozyme array vector construct

<400>  71
ctgacgcgcc ctgtagcggc ctgcagtgca gcgtgacccg gtcgtgcccc tctctagaga       60

taatgagcat tgcatgtcta agttataaaa aattaccaca tatttttttt gtcacacttg      120

tttgaagtgc agtttatcta tctttataca tatatttaaa ctttactcta cgaataatat      180

aatctatagt actacaataa tatcagtgtt ttagagaatc atataaatga acagttagac      240

atggtctaaa ggacaattga gtattttgac aacaggactc tacagtttta tctttttagt      300

gtgcatgtgt tctccttttt ttttgcaaat agcttcacct atataatact tcatccattt      360

tattagtaca tccatttagg gtttagggtt aatggttttt atagactaat ttttttagta      420

catctatttt attctatttt agcctctaaa ttaagaaaac taaaactcta ttttagtttt      480

tttatttaat aatttagata taaaatagaa taaaataaag tgactaaaaa ttaaacaaat      540

accctttaag aaattaaaaa aactaaggaa acatttttct tgtttcgagt agataatgcc      600

agcctgttaa acgccgtcga tcgacgagtc taacggacac caaccagcga accagcagcg      660

tcgcgtcggg ccaagcgaag cagacggcac ggcatctctg tcgctgcctc tggacccctc      720

tcgagagttc cgctccaccg ttggacttgc tccgctgtcg gcatccagaa attgcgtggc      780

ggagcggcag acgtgagccg gcacggcagg cggcctcctc ctcctctcac ggcaccggca      840

gctacggggg attcctttcc caccgctcct tcgctttccc ttcctcgccc gccgtaataa      900

atagacaccc cctccacacc ctctttcccc aacctcgtgt tgttcggagc gcacacacac      960

acaaccagat ctcccccaaa tccacccgtc ggcacctccg cttcaaggta cgccgctcgt     1020

cctccccccc cccccctctc taccttctct agatcggcgt tccggtccat ggttagggcc     1080

cggtagttct acttctgttc atgtttgtgt tagatccgtg tttgtgttag atccgtgctg     1140

ctagcgttcg tacacggatg cgacctgtac gtcagacacg ttctgattgc taacttgcca     1200

gtgtttctct ttggggaatc ctgggatggc tctagccgtt ccgcagacgg gatcgatcta     1260

ggataggtat acatgttgat gtgggtttta ctgatgcata tacatgatgg catatgcagc     1320

atctattcat atgctctaac cttgagtacc tatctattat aataaacaag tatgttttat     1380

aattattttg atcttgatat acttggatga tggcatatgc agcagctata tgtggatttt     1440

tttagccctg ccttcatacg ctatttattt gcttggtact gtttcttttg tcgatgctca     1500

ccctgttgtt tggtgttact tctgcaggga tccaaattac tgatgagtcc gtgaggacga     1560

aacgagtaag ctcgtctaat ttctactaag tgtagatgtc tatgtcgatg accagcagat     1620

aatttctact cttgtagatc tcgtcacgat tcccctctcc tggaatttct actcttgtag     1680

attgtgtggt cacacttgcc agccagggcg tcagtcctac tctgcacctc ctcgtggtgt     1740

cgcctgggaa ccctctttca caagaaagag gagccaagca gagaggcgat cgttcaaaca     1800

tttggcaata aagtttctta agattgaatc ctgttgccgg tcttgcgatg attatcatat     1860

aatttctgtt gaattacgtt aagcatgtaa taattaacat gtaatgcatg acgttattta     1920

tgagatgggt ttttatgatt agagtcccgc aattatacat ttaatacgcg atagaaaaca     1980

aaatatagcg cgcaaactag gataaattat cgcgcgcggt gtcatctatg ttactagatc     2040

gatcgtcgtt cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc     2100

cacagaatca ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag     2160

gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca     2220

tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca     2280

ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg     2340

atacctgtcc gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag     2400

gtatctcagt tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt     2460

tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca     2520

cgacttatcg ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg     2580

cggtgctaca gagttcttga agtggtggcc taactacggc tacactagaa gaacagtatt     2640

tggtatctgc gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc     2700

cggcaaacaa accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg     2760

cagaaaaaaa ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg     2820

gaacgaaaac tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta     2880

gatcctttta aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg     2940

gtctgacagt taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg     3000

ttcatccata gttgcctgac tccccgtcgt gtagataact acgatacggg agggcttacc     3060

atctggcccc agtgctgcaa tgataccgcg agacccacgc tcaccggctc cagatttatc     3120

agcaataaac cagccagccg gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc     3180

ctccatccag tctattaatt gttgccggga agctagagta agtagttcgc cagttaatag     3240

tttgcgcaac gttgttgcca ttgctacagg catcgtggtg tcacgctcgt cgtttggtat     3300

ggcttcattc agctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg     3360

caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt     3420

gttatcactc atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag     3480

atgcttttct gtgactggtg agtactcaac caagtcattc tgagaatagt gtatgcggcg     3540

accgagttgc tcttgcccgg cgtcaatacg ggataatacc gcgccacata gcagaacttt     3600

aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct     3660

gttgagatcc agttcgatgt aacccactcg tgcacccaac tgatcttcag catcttttac     3720

tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat     3780

aagggcgaca cggaaatgtt gaatactcat actcttcctt tttcaatatt attgaagcat     3840

ttatcagggt tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca     3900

aataggggtt ccgcgcacat ttccccgaaa agtgccac                             3938


<210>  72
<211>  3759
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  codon optimized LbCpf1

<400>  72
atggcgccga agaagaagcg caaggtgtcc aagctcgaga agttcacgaa ctgctactcc       60

ctctccaaga ccctccgctt caaggccatc cccgtgggca agacccagga gaacatcgac      120

aacaagcgcc tcctggtcga ggacgagaag agggcggagg actacaaggg cgtgaagaag      180

ctcctggacc gctactacct ctccttcatc aacgacgtcc tgcacagcat caagctcaag      240

aacctgaaca actacatctc cctgttccgc aagaagacga ggaccgagaa ggagaacaag      300

gagctcgaga acctggagat caacctccgc aaggagatcg ccaaggcgtt caagggcaac      360

gagggctaca agagcctgtt caagaaggac atcatcgaga cgatcctccc ggagttcctg      420

gacgacaagg acgagatcgc cctcgtgaac tccttcaacg gcttcaccac ggcgttcacc      480

ggcttcttcg acaaccgcga gaacatgttc agcgaggagg ccaagtccac gagcatcgcg      540

ttccgctgca tcaacgagaa cctgaccagg tacatctcca acatggacat cttcgagaag      600

gtcgacgcca tcttcgacaa gcacgaggtg caggagatca aggagaagat cctcaacagc      660

gactacgacg tcgaggactt cttcgagggc gagttcttca acttcgtcct gacgcaggag      720

ggcatcgacg tgtacaacgc catcatcggt ggcttcgtga ccgagtccgg cgagaagatc      780

aagggcctca acgagtacat caacctgtac aaccagaaga ccaagcagaa gctcccgaag      840

ttcaagcccc tctacaagca ggtcctgtcc gaccgcgagt ccctgagctt ctacggcgag      900

ggctacacga gcgacgagga ggtcctcgag gtgttcagga acaccctgaa caagaacagc      960

gagatcttct ccagcatcaa gaagctcgag aagctgttca agaacttcga cgagtactcc     1020

agcgccggca tcttcgtcaa gaacggcccg gcgatctcca cgatcagcaa ggatatcttc     1080

ggcgagtgga acgtgatcag ggacaagtgg aacgccgagt acgacgacat ccacctcaag     1140

aagaaggcgg tggtcaccga gaagtacgag gacgaccgca ggaagtcctt caagaagatc     1200

ggctccttca gcctcgagca gctgcaggag tacgccgacg cggacctctc cgtggtcgag     1260

aagctgaagg agatcatcat ccagaaggtc gacgagatct acaaggtgta cggctccagc     1320

gagaagctgt tcgacgccga cttcgtcctc gagaagtccc tgaagaagaa cgacgccgtg     1380

gtcgcgatca tgaaggacct cctggactcc gtgaagagct tcgagaacta catcaaggcg     1440

ttcttcggcg agggcaagga gacgaaccgc gacgagtcct tctacggcga cttcgtcctc     1500

gcctacgaca tcctcctgaa ggtggaccac atctacgacg cgatcaggaa ctacgtgacc     1560

cagaagccgt acagcaagga caagttcaag ctgtacttcc agaaccccca gttcatgggc     1620

ggctgggaca aggacaagga gacggactac cgcgccacca tcctccgcta cggcagcaag     1680

tactacctgg ccatcatgga caagaagtac gcgaagtgcc tccagaagat cgacaaggac     1740

gacgtcaacg gcaactacga gaagatcaac tacaagctcc tgccgggccc caacaagatg     1800

ctgccgaagg tgttcttctc caagaagtgg atggcctact acaaccccag cgaggacatc     1860

cagaagatct acaagaacgg cacgttcaag aagggcgaca tgttcaacct caacgactgc     1920

cacaagctga tcgacttctt caaggactcc atcagccgct acccgaagtg gtccaacgcc     1980

tacgacttca acttcagcga gacagagaag tacaaggaca tcgcgggctt ctacagggag     2040

gtcgaggagc agggctacaa ggtgtccttc gagtccgcca gcaagaagga ggtcgacaag     2100

ctcgtggagg agggcaagct gtacatgttc cagatctaca acaaggactt ctccgacaag     2160

agccacggca cgcccaacct ccacaccatg tacttcaagc tcctgttcga cgagaacaac     2220

cacggccaga tccgcctctc cggcggcgcc gagctgttca tgaggagggc gagcctcaag     2280

aaggaggagc tggtggtcca ccccgctaac agcccaatcg cgaacaagaa cccggacaac     2340

cccaagaaga ccacgaccct ctcctacgac gtgtacaagg acaagcgctt cagcgaggac     2400

cagtacgagc tgcacatccc gatcgccatc aacaagtgcc ccaagaacat cttcaagatc     2460

aacaccgagg tcagggtgct cctgaagcac gacgacaacc cctacgtgat cggcatcgac     2520

cgcggcgaga ggaacctcct gtacatcgtg gtcgtggacg gcaagggcaa catcgtggag     2580

cagtactccc tgaacgagat catcaacaac ttcaacggca tccgcatcaa gacggactac     2640

cacagcctcc tggacaagaa ggagaaggag cgcttcgagg ccaggcagaa ctggacctcc     2700

atcgagaaca tcaaggagct caaggcgggc tacatcagcc aggtcgtgca caagatctgc     2760

gagctggtcg agaagtacga cgccgtgatc gcgctcgagg acctgaactc cggcttcaag     2820

aacagcaggg tcaaggtgga gaagcaggtc taccagaagt tcgagaagat gctcatcgac     2880

aagctgaact acatggtgga caagaagtcc aacccgtgcg ctacgggcgg cgcgctcaag     2940

ggctaccaga tcaccaacaa gttcgagagc ttcaagtcca tgagcaccca gaacggcttc     3000

atcttctaca tcccggcctg gctgacgtcc aagatcgacc ccagcaccgg cttcgtcaac     3060

ctcctgaaga cgaagtacac ctccatcgcg gacagcaaga agttcatctc cagcttcgac     3120

cgcatcatgt atgtgccgga ggaggacctc ttcgagttcg ccctggacta caagaacttc     3180

tccaggacgg acgcggatta catcaagaag tggaagctct acagctacgg caaccgcatc     3240

aggatcttcc gcaaccccaa gaagaacaac gtcttcgact gggaggaggt gtgcctcacc     3300

tccgcctaca aggagctgtt caacaagtac ggcatcaact accagcaggg cgacatcagg     3360

gcgctcctgt gcgagcagag cgacaaggcc ttctactcca gcttcatggc gctcatgtcc     3420

ctcatgctgc agatgcgcaa cagcatcacg ggcaggaccg acgtcgactt cctgatctcc     3480

ccggtgaaga acagcgacgg catcttctac gacagccgca actacgaggc ccaggagaac     3540

gcgatcctgc caaagaacgc ggacgccaac ggcgcctaca acatcgcgag gaaggtgctg     3600

tgggccatcg gccagttcaa gaaggcggag gacgagaagc tcgacaaggt caagatcgcc     3660

atctccaaca aggagtggct ggagtacgcg cagacctcgg tgaagcacaa gaggcccgct     3720

gccaccaaga aggcgggcca ggccaagaag aagaagtga                            3759


<210>  73
<211>  3759
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  codon optimimized LbCpf1

<400>  73
atggccccca agaagaagcg caaggtgagc aagctggaga agtttacaaa ctgctactcc       60

ctgtctaaga ctctgcgctt caaggccatc cctgtgggca agacccagga gaacatcgac      120

aataagcggc tgctggtgga ggacgagaag agagccgagg attataaggg cgtgaagaag      180

ctgctggatc gctactatct gtcttttatc aacgacgtgc tgcacagcat caagctgaag      240

aatctgaaca attacatcag cctgttccgg aagaaaacca gaaccgagaa ggagaataag      300

gagctggaga acctggagat caatctgcgg aaggagatcg ccaaggcctt caagggcaac      360

gagggctaca agtccctgtt taagaaggat atcatcgaga caatcctgcc agagttcctg      420

gacgataagg acgagatcgc cctggtgaac agcttcaatg gctttaccac agccttcacc      480

ggcttctttg ataacagaga gaatatgttt tccgaggagg ccaagagcac atccatcgcc      540

ttcaggtgta tcaacgagaa tctgacccgc tacatctcta atatggacat cttcgagaag      600

gtggacgcca tctttgataa gcacgaggtg caggagatca aggagaagat cctgaacagc      660

gactatgatg tggaggattt ctttgagggc gagttcttta actttgtgct gacacaggag      720

ggcatcgacg tgtataacgc catcatcggc ggcttcgtga ccgagagcgg cgagaagatc      780

aagggcctga acgagtacat caacctgtat aatcagaaaa ccaagcagaa gctgcctaag      840

tttaagccac tgtataagca ggtgctgagc gatcgggagt ctctgagctt ctacggcgag      900

ggctatacat ccgatgagga ggtgctggag gtgtttagaa acaccctgaa caagaacagc      960

gagatcttca gctccatcaa gaagctggag aagctgttca agaattttga cgagtactct     1020

agcgccggca tctttgtgaa gaacggcccc gccatcagca caatctccaa ggatatcttc     1080

ggcgagtgga acgtgatccg ggacaagtgg aatgccgagt atgacgatat ccacctgaag     1140

aagaaggccg tggtgaccga gaagtacgag gacgatcgga gaaagtcctt caagaagatc     1200

ggctcctttt ctctggagca gctgcaggag tacgccgacg ccgatctgtc tgtggtggag     1260

aagctgaagg agatcatcat ccagaaggtg gatgagatct acaaggtgta tggctcctct     1320

gagaagctgt tcgacgccga ttttgtgctg gagaagagcc tgaagaagaa cgacgccgtg     1380

gtggccatca tgaaggacct gctggattct gtgaagagct tcgagaatta catcaaggcc     1440

ttctttggcg agggcaagga gacaaacagg gacgagtcct tctatggcga ttttgtgctg     1500

gcctacgaca tcctgctgaa ggtggaccac atctacgatg ccatccgcaa ttatgtgacc     1560

cagaagccct actctaagga taagttcaag ctgtattttc agaaccctca gttcatgggc     1620

ggctgggaca aggataagga gacagactat cgggccacca tcctgagata cggctccaag     1680

tactatctgg ccatcatgga taagaagtac gccaagtgcc tgcagaagat cgacaaggac     1740

gatgtgaacg gcaattacga gaagatcaac tataagctgc tgcccggccc taataagatg     1800

ctgccaaagg tgttcttttc taagaagtgg atggcctact ataaccccag cgaggacatc     1860

cagaagatct acaagaatgg cacattcaag aagggcgata tgtttaacct gaatgactgt     1920

cacaagctga tcgacttctt taaggatagc atctcccggt atccaaagtg gtccaatgcc     1980

tacgatttca acttttctga gacagagaag tataaggaca tcgccggctt ttacagagag     2040

gtggaggagc agggctataa ggtgagcttc gagtctgcca gcaagaagga ggtggataag     2100

ctggtggagg agggcaagct gtatatgttc cagatctata acaaggactt ttccgataag     2160

tctcacggca cacccaatct gcacaccatg tacttcaagc tgctgtttga cgagaacaat     2220

cacggacaga tcaggctgag cggaggagca gagctgttca tgaggcgcgc ctccctgaag     2280

aaggaggagc tggtggtgca cccagccaac tcccctatcg ccaacaagaa tccagataat     2340

cccaagaaaa ccacaaccct gtcctacgac gtgtataagg ataagaggtt ttctgaggac     2400

cagtacgagc tgcacatccc aatcgccatc aataagtgcc ccaagaacat cttcaagatc     2460

aatacagagg tgcgcgtgct gctgaagcac gacgataacc cctatgtgat cggcatcgat     2520

aggggcgagc gcaatctgct gtatatcgtg gtggtggacg gcaagggcaa catcgtggag     2580

cagtattccc tgaacgagat catcaacaac ttcaacggca tcaggatcaa gacagattac     2640

cactctctgc tggacaagaa ggagaaggag aggttcgagg cccgccagaa ctggacctcc     2700

atcgagaata tcaaggagct gaaggccggc tatatctctc aggtggtgca caagatctgc     2760

gagctggtgg agaagtacga tgccgtgatc gccctggagg acctgaactc tggctttaag     2820

aatagccgcg tgaaggtgga gaagcaggtg tatcagaagt tcgagaagat gctgatcgat     2880

aagctgaact acatggtgga caagaagtct aatccttgtg caacaggcgg cgccctgaag     2940

ggctatcaga tcaccaataa gttcgagagc tttaagtcca tgtctaccca gaacggcttc     3000

atcttttaca tccctgcctg gctgacatcc aagatcgatc catctaccgg ctttgtgaac     3060

ctgctgaaaa ccaagtatac cagcatcgcc gattccaaga agttcatcag ctcctttgac     3120

aggatcatgt atgtgcccga ggaggatctg ttcgagtttg ccctggacta taagaacttc     3180

tctcgcacag acgccgatta catcaagaag tggaagctgt actcctacgg caaccggatc     3240

agaatcttcc ggaatcctaa gaagaacaac gtgttcgact gggaggaggt gtgcctgacc     3300

agcgcctata aggagctgtt caacaagtac ggcatcaatt atcagcaggg cgatatcaga     3360

gccctgctgt gcgagcagtc cgacaaggcc ttctactcta gctttatggc cctgatgagc     3420

ctgatgctgc agatgcggaa cagcatcaca ggccgcaccg acgtggattt tctgatcagc     3480

cctgtgaaga actccgacgg catcttctac gatagccgga actatgaggc ccaggagaat     3540

gccatcctgc caaagaacgc cgacgccaat ggcgcctata acatcgccag aaaggtgctg     3600

tgggccatcg gccagttcaa gaaggccgag gacgagaagc tggataaggt gaagatcgcc     3660

atctctaaca aggagtggct ggagtacgcc cagaccagcg tgaagcacaa aaggccggcg     3720

gccacgaaaa aggccggcca ggcaaaaaag aaaaagtga                            3759


<210>  74
<211>  3783
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  codon optimized LbCpf1

<400>  74
atggctccta agaagaagcg gaaggttggt attcacgggg tgcctgcggc ttcaaagctc       60

gagaaattca ccaactgtta ttcgttgagc aaaacactgc ggtttaaagc gattccagtc      120

ggcaagactc aagagaatat agacaataag cggctgttgg tggaagatga aaagcgcgcg      180

gaagactaca aaggggtgaa gaagttgttg gacagatact acctctcttt tatcaatgat      240

gtcttgcact caatcaaatt gaagaatctg aacaactaca tctccctctt cagaaagaaa      300

acaaggacag aaaaggagaa taaggaactt gaaaatttgg agatcaatct gaggaaagag      360

atcgcgaaag cctttaaagg caacgaagga tacaaaagtc tgttcaagaa ggatataatt      420

gagacaattt tgccagagtt cctcgatgac aaggacgaga ttgcgctggt caattcgttc      480

aacggattca caacagcatt cacaggcttc tttgataatc gggaaaatat gttctctgag      540

gaggcaaagt ccacttctat tgcgttcagg tgtatcaatg agaatctcac taggtacatt      600

tccaacatgg atatctttga gaaggttgac gcaatttttg acaagcacga agttcaggag      660

attaaggaga agatcctcaa ttccgattat gacgttgagg acttcttcga aggtgagttt      720

tttaatttcg tgctcactca agagggtatc gacgtgtata atgcgatcat cggtgggttc      780

gtgactgagt ccggtgaaaa gattaaggga ttgaacgagt atatcaacct ttacaaccaa      840

aagacgaaac agaagctgcc aaagttcaag cctctttaca aacaggttct ttcagaccgc      900

gagtcactct cgttctatgg ggagggctac acttcggatg aggaagtcct ggaggtgttc      960

aggaatactc tcaataagaa ttcggagatt ttctcttcta taaaaaaact ggaaaagttg     1020

tttaagaatt ttgacgaata ctctagcgcc ggcatatttg tgaaaaacgg cccggccata     1080

tcaacgataa gtaaagatat cttcggcgaa tggaacgtga tcagagacaa atggaacgcg     1140

gagtatgacg atattcacct gaagaagaag gctgtcgtaa cggagaagta cgaggatgat     1200

cgcaggaaaa gcttcaaaaa gatcggaagt ttcagcctgg aacagttgca ggagtatgct     1260

gacgccgatc ttagcgtcgt cgagaagttg aaggagataa tcatccaaaa ggtcgacgag     1320

atatataaag tctatggatc aagtgaaaaa ctgttcgacg ccgacttcgt tttggagaag     1380

tccctgaaga agaacgacgc tgttgttgcc attatgaagg atctgctcga cagcgtgaag     1440

agtttcgaga actatattaa ggcttttttc ggggagggga aggagactaa cagagatgag     1500

tccttctacg gagacttcgt cctcgcgtac gatatactcc ttaaggtaga ccacatctac     1560

gacgcaatca gaaattacgt gacacaaaag ccgtacagca aggacaagtt caaactctac     1620

ttccagaacc cccagttcat gggcggctgg gacaaggaca aggaaacgga ttacagggct     1680

acgatcttga ggtatggttc aaaatactac ttggcgatta tggacaagaa gtacgccaag     1740

tgtctccaga agattgacaa agacgatgtc aatggcaatt atgagaagat caactacaag     1800

ctgcttccgg gtccgaacaa gatgctccca aaggttttct tcagcaagaa atggatggcc     1860

tactataacc caagcgagga catccagaag atttataaga acggtacgtt caagaagggc     1920

gacatgttca atcttaacga ctgtcacaag ctgatcgact tcttcaaaga ctcaattagc     1980

cggtacccaa agtggtctaa cgcctatgac ttcaactttt cggaaaccga gaagtacaag     2040

gatatagccg gattttatag agaggtggaa gagcagggct acaaggtgtc attcgagtcc     2100

gccagcaaga aggaagtgga caagctcgtg gaagagggta agctctacat gttccagatt     2160

tataataaag actttagcga taagagccac gggacaccta atctccacac aatgtatttc     2220

aagctgctct tcgacgagaa taaccacggc caaatcaggt tgtcaggagg ggctgaactc     2280

ttcatgcggc gcgctagcct taagaaggag gagcttgtag tccaccctgc gaatagtcca     2340

attgcgaata agaacccgga caatcctaaa aagactacaa cattgagcta cgacgtgtac     2400

aaggataaga ggttttccga ggatcagtac gagctccaca tcccgattgc gatcaacaag     2460

tgcccaaaga atattttcaa gataaacaca gaggtgcgtg tactcctgaa gcatgacgac     2520

aatccttacg tcattgggat tgatcggggc gagaggaacc tcctctatat tgtggtggtg     2580

gacgggaagg ggaacatagt cgaacagtac tcccttaacg aaataattaa caatttcaac     2640

ggcatccgta tcaagaccga ctaccattcg ttgctggaca agaaggagaa ggagagattt     2700

gaggcgcggc aaaattggac aagtatcgag aacatcaagg aactcaaagc aggttatatc     2760

tctcaagttg tgcataagat atgcgagctg gttgagaagt atgacgcagt gatcgctctt     2820

gaggacctca actcgggctt taagaattct agagttaaag tggagaagca ggtctatcaa     2880

aagttcgaga agatgcttat agataagctc aactacatgg tcgataagaa atcgaaccca     2940

tgtgccaccg gcggcgcact caaaggttac caaataacaa acaaattcga gtccttcaaa     3000

tcgatgagta ctcagaatgg gttcatattt tatataccgg cgtggcttac gtctaagatc     3060

gacccgtcaa ctggttttgt caacctgttg aagacgaaat acacgtccat tgccgattcg     3120

aaaaagttca tatctagttt tgatcgtatt atgtacgtcc cagaggaaga tcttttcgag     3180

tttgctctcg actacaaaaa cttttcgcgg accgatgcgg attacattaa aaaatggaaa     3240

ctctattcgt acggcaacag aatcaggatt tttcgcaacc ctaagaagaa taacgtcttt     3300

gattgggagg aagtttgctt gactagcgcg tacaaggagc tctttaataa gtatggcatt     3360

aactaccaac agggtgatat cagagcactg ctttgcgaac aatctgacaa ggctttctac     3420

tcatccttca tggctttgat gagcctgatg ctccagatga gaaattcaat tacaggcaga     3480

accgacgtgg atttcttgat ctccccggtt aaaaattctg atggcatctt ttacgatagc     3540

aggaactatg aagcgcaaga gaatgcgatt ctgccaaaaa atgcagacgc caacggtgcc     3600

tataacatcg ccaggaaagt cctgtgggcg atcggccagt tcaaaaaggc cgaagacgaa     3660

aaattggaca aggtcaaaat cgctatcagc aacaaagagt ggctggagta tgctcagaca     3720

tccgtaaagc ataagcgtcc tgctgccacc aaaaaggccg gacaggctaa gaaaaagaag     3780

tga                                                                   3783


<210>  75
<211>  3759
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  codon optimized LbCpf1

<400>  75
atggccccga agaagaagag gaaggtcagc aagctcgaga agttcaccaa ctgctacagc       60

ctgagcaaga ccctgaggtt caaggctatc ccggtgggca agacccaaga gaacatcgac      120

aacaagaggc tgctggtcga ggacgagaag cgcgctgagg attacaaggg cgtgaagaag      180

ctgctggaca ggtactacct gagcttcatc aacgacgtgc tgcacagcat caagctgaag      240

aacctgaaca actacatcag cctgttccgc aagaaaacca ggaccgagaa agagaacaaa      300

gagcttgaga acctcgagat caacctgagg aaagagatcg ccaaggcctt caagggcaac      360

gagggctaca agagcctgtt caagaaggac atcatcgaga ctatcctgcc agagttcctg      420

gacgacaagg acgagatcgc cctggtgaac agcttcaacg gcttcacgac cgccttcacc      480

ggtttcttcg acaaccgcga gaatatgttc agcgaggaag ccaagagcac ctctatcgcc      540

ttccgctgca tcaacgagaa cctgacgcgc tacatctcca acatggatat cttcgagaag      600

gtggacgcca tcttcgataa gcacgaggtg caagagatca aagaaaagat cctgaacagc      660

gactacgacg tcgaggactt cttcgagggc gagttcttca acttcgtgct cacccaagag      720

ggcatcgatg tgtacaacgc catcatcggc ggcttcgtga ctgagagcgg cgagaagatc      780

aagggcctga acgagtacat caacctctac aatcaaaaga ccaagcagaa gctgccgaag      840

ttcaagccgc tgtacaagca ggttctgagc gaccgcgaga gcctgtcttt ctacggcgag      900

ggttacacca gcgacgaaga ggtgttggag gttttccgca acaccctgaa caagaacagc      960

gagatcttca gctccatcaa gaagctggaa aagctgttta agaacttcga cgagtacagc     1020

agcgccggca tcttcgtgaa gaacggccca gctatcagca ccatcagcaa ggacatcttc     1080

ggcgagtgga acgtgatcag ggacaagtgg aacgccgagt acgacgacat ccacctgaag     1140

aaaaaggccg tggtgaccga gaagtacgag gacgacaggc gcaagagctt caagaagatc     1200

ggctccttca gcctcgagca gctgcaagag tacgctgacg ctgacctgag cgtggtcgag     1260

aagctcaaag agatcatcat ccagaaggtc gacgagatct acaaggtgta cggcagcagc     1320

gagaagcttt tcgacgccga cttcgtcctt gagaagtccc tcaagaaaaa cgacgccgtg     1380

gtggccatca tgaaggacct gctggactcc gtgaagtcct tcgagaacta cattaaggct     1440

ttcttcggtg agggcaaaga gactaacagg gacgagagct tctacgggga tttcgtgctg     1500

gcctacgaca tcctgctcaa ggtggaccac atctacgacg ccatccgcaa ctacgtgacc     1560

cagaagccgt actccaagga caagtttaag ctgtacttcc agaatccgca gttcatgggc     1620

ggctgggaca aagacaaaga aaccgactac agggccacca tcctgaggta cggctccaag     1680

tactacctcg ccatcatgga caagaaatac gccaagtgcc tgcagaagat cgataaggac     1740

gacgtgaacg gcaactacga gaagattaac tacaagctgc tgccagggcc gaacaagatg     1800

ctcccgaagg tgttctttag caagaaatgg atggcctact acaacccgag cgaggatatc     1860

cagaaaatct acaagaacgg caccttcaag aaaggcgaca tgttcaacct gaacgactgc     1920

cacaagctga tcgatttctt caaggacagc atctctcgct acccgaagtg gtccaacgcc     1980

tacgatttca acttcagcga gactgaaaag tacaaggata tcgccggctt ctaccgcgag     2040

gtcgaggaac agggttacaa ggtgagcttc gagagcgcca gcaagaaaga ggtggacaag     2100

ctggtcgaag agggcaagct gtacatgttc cagatctata acaaggactt ctccgacaag     2160

agccacggca ccccaaacct gcacaccatg tacttcaagt tgctgttcga cgagaacaac     2220

cacggccaga tcaggctttc tggcggcgct gagcttttca tgagaagggc cagcctgaaa     2280

aaagaggaac tggtcgttca cccggcgaac agcccaatcg ccaacaagaa cccggacaac     2340

ccgaaaaaga ccaccacgct gagctacgac gtgtacaagg acaaaaggtt ctccgaggac     2400

cagtacgagc tgcacatccc gatcgccatc aacaagtgcc cgaagaacat cttcaagatc     2460

aacaccgagg tgagggtgct gctgaagcac gacgacaacc catacgtgat cggcatcgat     2520

aggggcgagc gcaacctgct ctacatcgtg gtggttgacg gcaagggcaa tatcgtcgag     2580

cagtacagcc ttaacgagat cattaacaac ttcaatggca tcaggatcaa gaccgactac     2640

cacagcctgc tcgacaagaa agaaaaagag cgcttcgagg ccaggcagaa ctggaccagc     2700

atcgagaata tcaaagagct gaaggccggc tacattagcc aggtggtgca caagatctgc     2760

gagctggtgg aaaagtacga cgcggtgatc gctctcgagg acctgaactc cgggttcaag     2820

aactcccgcg tgaaggttga gaagcaggtc taccaaaagt tcgagaagat gctgatcgac     2880

aagctcaact acatggtgga caaaaagagc aacccctgcg ccacaggcgg cgctcttaag     2940

ggctaccaga tcacgaacaa gttcgagtcc ttcaagagca tgagcaccca gaatggcttc     3000

atcttctaca tcccggcctg gctgaccagc aagatcgatc catctaccgg cttcgtcaac     3060

ctcctcaaga ccaagtacac cagcattgcc gacagcaaga agttcatctc cagcttcgac     3120

aggatcatgt acgtgccgga agaggacctg ttcgagttcg cgctcgatta caagaacttc     3180

agcaggaccg acgcggacta tattaagaag tggaagctct acagctacgg caacaggatc     3240

cgcatcttca gaaacccgaa gaaaaacaac gtgttcgact gggaagaagt gtgcctgacc     3300

agcgcctaca aagaactgtt caacaagtac ggcatcaact accagcaggg cgacatcagg     3360

gctctgctgt gcgagcagtc tgacaaggcg ttctacagct ccttcatggc cctgatgagc     3420

ctgatgctgc agatgaggaa cagcatcacc ggcaggacgg acgtcgactt cctgatcagc     3480

ccagtgaaga attccgacgg cattttctac gactctagga actacgaggc tcaagagaac     3540

gccatcctgc cgaagaacgc cgatgctaac ggcgcgtaca acattgcccg caaggtgctg     3600

tgggctatcg gccagtttaa gaaggccgag gacgaaaaac tggacaaggt gaagatcgcc     3660

attagcaaca aagagtggct cgagtacgcc cagaccagcg tgaagcacaa aaggccagcc     3720

gccactaaga aggctggcca ggccaaaaag aagaagtga                            3759


<210>  76
<211>  3759
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  codon optimized LbCpf1

<400>  76
atggcaccca agaagaagcg caaagtgtca aagctcgaga agttcacaaa ctgttactcc       60

ttatctaaaa ccctgcgctt taaagcaatt cctgtcggta aaacccaaga gaacatcgac      120

aacaagagac tgctcgttga agatgaaaag agagccgagg attacaaggg cgtgaagaag      180

ctcctcgatc gctattacct gtccttcatt aacgatgtgc tccatagcat caagctcaag      240

aaccttaaca actatatctc attattccgc aagaaaacta gaacggagaa ggaaaacaaa      300

gagctagaga accttgaaat caacctcaga aaggaaatag ccaaggcgtt taaggggaat      360

gaaggctaca agagtttgtt taagaaagac atcattgaga caatattacc tgaattcctt      420

gatgacaagg acgagatcgc tctagtgaac agctttaatg gtttcaccac tgcgttcacg      480

ggcttcttcg ataacaggga gaatatgttt tctgaggaag ccaagtcaac ttccatcgcg      540

tttcgctgca tcaacgagaa tctgacccgt tacataagta acatggacat attcgagaaa      600

gttgacgcta tcttcgacaa gcatgaggtg caagaaatca aggaaaagat ccttaactct      660

gactacgacg tcgaggactt cttcgaggga gaattcttca atttcgttct gacccaggag      720

ggcatcgacg tgtacaatgc tattatcggt ggattcgtga cagagtccgg agaaaagatt      780

aagggcctta acgagtatat caacctttat aaccaaaaga cgaagcaaaa actccccaag      840

tttaaacctc tttacaaaca agttctatcg gatagagaaa gcctttcgtt ttacggggaa      900

ggatacacct ctgatgaaga ggttctcgag gtgttccgta acaccctgaa caagaactcc      960

gagatattct cgtcgattaa gaaactcgaa aagttgttca aaaacttcga cgaatactca     1020

tctgccggaa tttttgtgaa gaacggcccg gctatttcga ccatttccaa ggatatcttc     1080

ggagagtgga acgttatacg agataagtgg aacgcagagt atgatgatat ccaccttaag     1140

aagaaggcgg tcgtgacgga aaaatacgag gacgatcgta ggaagtcttt caagaagatt     1200

ggtagcttca gcctcgagca actgcaggaa tacgcggatg ctgatctgag cgtggtcgag     1260

aagcttaagg agataatcat ccaaaaggtt gacgagatat acaaggttta tggttcgtca     1320

gagaagttgt tcgacgccga cttcgtcctt gagaagtccc tgaagaagaa tgacgcggtc     1380

gtggcaatca tgaaagacct cctcgactcc gtcaaatcct ttgagaatta tattaaggcg     1440

ttcttcggcg aagggaagga gacaaatagg gatgagagtt tttacggcga ttttgtgcta     1500

gcttacgaca ttctgctgaa agttgaccac atatacgacg ctatccgaaa ctatgtcacc     1560

caaaagcctt actcaaaaga caagttcaag ctgtactttc aaaacccgca gttcatggga     1620

ggatgggata aggacaagga aactgactac agagccacga ttctccgcta cgggtccaaa     1680

tactacctcg ccattatgga taagaagtac gcgaagtgcc tgcagaagat cgacaaagac     1740

gacgtgaacg gaaactacga gaagatcaac tataagctgc tgccagggcc caacaagatg     1800

ctgccgaagg ttttctttag caagaagtgg atggcctact acaacccgtc cgaggacata     1860

cagaaaatct acaaaaatgg aactttcaaa aagggcgaca tgttcaactt gaacgattgc     1920

cacaagttaa ttgacttctt caaggacagc atttcgcgat atccgaagtg gtctaatgcc     1980

tatgacttca atttttctga aaccgagaag tataaggata tcgcgggttt ttatagggaa     2040

gttgaggagc aaggttacaa agtatcattt gaatctgcct ccaaaaagga ggtcgacaaa     2100

ctggttgaag agggcaaact atacatgttt cagatctaca acaaagattt ctcggataag     2160

tcgcacggga cgccgaactt acacaccatg tacttcaaac tgctgtttga tgagaacaat     2220

cacggccaga tccgtctaag cggcggtgcc gagcttttca tgcgccgcgc gagcttgaaa     2280

aaggaggagt tggtggtcca ccctgctaat tcaccgattg ctaacaagaa ccccgataac     2340

cccaaaaaga ccaccactct tagctacgat gtctataagg ataagcgctt cagcgaagat     2400

cagtatgagt tgcatattcc gattgccatc aacaaatgcc ccaaaaatat tttcaagatc     2460

aacactgagg tccgcgtgct gctcaaacat gacgataacc cgtacgtaat cggcatcgat     2520

agaggcgaac ggaacttact atacatcgtg gtagtagacg ggaagggaaa tatcgtcgaa     2580

cagtacagtc tgaatgaaat tattaacaat ttcaacggga tccggatcaa gacagactac     2640

cactccttgc tcgacaagaa ggaaaaggag cggttcgagg cccgacaaaa ctggacttcg     2700

attgagaaca ttaaggagct caaagcgggg tacatctccc aagtggtgca taaaatctgt     2760

gaactggttg agaaatatga tgcagtgata gctctcgagg acctcaattc tggcttcaag     2820

aactccaggg ttaaggtaga aaaacaggtc taccaaaaat ttgaaaagat gcttattgat     2880

aagttaaatt acatggtcga caaaaagtcg aatccgtgtg ccacaggagg cgccctcaag     2940

ggataccaga ttacgaataa gtttgaatcc ttcaagtcta tgagcacaca gaacggcttc     3000

attttctata tccccgcgtg gctcacttct aaaatcgacc catccaccgg cttcgtgaat     3060

ttgcttaaga caaagtacac tagcatcgcg gactcgaaga aattcatttc gtcgtttgac     3120

cggattatgt atgtgccaga agaagatcta ttcgagtttg ctctcgacta taaaaacttc     3180

tcccgcaccg acgccgacta cataaagaag tggaagttgt atagctacgg gaaccgcatc     3240

aggatattcc ggaatccgaa gaagaacaat gtgtttgatt gggaggaggt ctgcctcacg     3300

tcagcttaca aggagctgtt caacaaatac ggtataaact atcagcaggg cgacatccgg     3360

gcccttctgt gtgaacagag cgacaaagca ttttactctt cttttatggc tctgatgtcc     3420

ttgatgctgc aaatgcgcaa ttcaatcacg gggagaaccg atgtagactt tctgattagt     3480

ccggtcaaga atagcgacgg catattctac gattcaagga attacgaagc ccaggagaac     3540

gcgatcctgc caaaaaatgc agatgcgaat ggtgcataca atattgcaag gaaagtgtta     3600

tgggccatcg gccagttcaa gaaagctgag gacgagaagc ttgacaaggt caagatcgca     3660

atctcaaaca aggaatggct tgaatatgcg cagactagtg tgaaacataa gcgcccagct     3720

gccaccaaga aggccggcca ggccaagaaa aagaagtga                            3759


<210>  77
<211>  24
<212>  DNA
<213>  Triticum aestivum

<400>  77
cagcatggca tggagggtga cgat                                              24


<210>  78
<211>  24
<212>  DNA
<213>  Triticum aestivum

<400>  78
agcatggcat ggagggtgac gatg                                              24


<210>  79
<211>  24
<212>  DNA
<213>  Triticum aestivum

<400>  79
cgcaggagga ggaggagctc atcg                                              24


<210>  80
<211>  24
<212>  DNA
<213>  Triticum aestivum

<400>  80
cgcaccgctt cagccctgca gcac                                              24


<210>  81
<211>  24
<212>  DNA
<213>  Triticum aestivum

<400>  81
gcaccgcttc agccctgcag cacg                                              24


<210>  82
<211>  21
<212>  DNA
<213>  Beta vulgaris

<400>  82
gctgctaaac aatcaacatt t                                                 21


<210>  83
<211>  21
<212>  DNA
<213>  Beta vulgaris

<400>  83
taaacaatca acatttaggt a                                                 21


<210>  84
<211>  20
<212>  DNA
<213>  Beta vulgaris

<400>  84
atttaggtat ggttgtccaa                                                   20


<210>  85
<211>  20
<212>  DNA
<213>  Beta vulgaris

<400>  85
tttagcagca ttatcttaac                                                   20


<210>  86
<211>  20
<212>  DNA
<213>  Beta vulgaris

<400>  86
tatagaacct atcttcccat                                                   20


<210>  87
<211>  86
<212>  RNA
<213>  Helianthus annuus

<400>  87
gcggggggcg ucaguccuac ucugcaccuc cucguggugu cgccugggaa cccucuuucg       60

caagaaagag gagccaagca gagagg                                            86


<210>  88
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  crALS1 protospacer

<400>  88
caccctaatt gtagccaact cttg                                              24


<210>  89
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  crALS2 protospacer

<400>  89
gcagcattat cttaactggg agat                                              24


<210>  90
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  crALS3 protospacer

<400>  90
ggtatggttg tccaatggga agat                                              24


<210>  91
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  crALS4 protospacer

<400>  91
caaggtatgt atgtgcccgg ttag                                              24


<210>  92
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  crALS5 protospacer

<400>  92
gaagggtttc caaggtatgt atgt                                              24


<210>  93
<211>  8828
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  expression plasmid

<400>  93
gattgaaaga aatatagttt aaatatttat tgataaaata acaagtcagg tattatagtc       60

caagcaaaaa cataaattta ttgatgcaag tttaaattca gaaatatttc aataactgat      120

tatatcagct ggtacattgc cgtagatgaa agactgagtg cgatattatg tgtaatacat      180

aaattgatga tatagctagc ttagctcatc gggtcagaag aactcgtcaa gaaggcgata      240

gaaggcgatg cgctgcgaat cgggagcggc gataccgtaa agcacgagga agcggtcagc      300

ccattcgccg ccaagctctt cagcaatatc acgggtagcc aacgctatgt cctgatagcg      360

gtccgccaca cccagccggc cacagtcgat gaatccagaa aagcggccat tttccaccat      420

gatattcggc aagcaggcat cgccatgagt cacgacgaga tcctcgccgt cgggcatgcg      480

cgccttgagc ctggcgaaca gttcggctgg cgcgagcccc tgatgctctt cgtccagatc      540

atcctgatcg acaagaccgg cttccatccg agtacgtgct cgctcgatgc gatgtttcgc      600

ttggtggtcg aatgggcagg tagccggatc aagcgtatgc agccgccgca ttgcatcagc      660

catgatggat actttctcgg caggagcaag gtgagatgac aggagatcct gccccggcac      720

ttcgcccaat agcagccagt cccttcccgc ttcagtgaca acgtcgagca cagctgcgca      780

aggaacgccc gtcgtggcca gccacgatag ccgcgctgcc tcgtcctgaa gttcattcag      840

ggcaccggac aggtcggtct tgacaaaaag aaccgggcgc ccctgcgctg acagccggaa      900

cacggcggca tcagagcagc cgattgtctg ttgtgcccag tcatagccga atagcctctc      960

cacccaagcg gccggagaac ctgcgtgcaa tccatcttgt tcaatccaag ctcccatttt     1020

ctgtgatttt ggatgtgttt ttgtatggat tgagagtgaa tatgagactc taattggata     1080

ccgaggggaa tttatggaac gtcagtggag catttttgac aagaaatatt tgctagctga     1140

tagtgacctt aggcgacttt tgaacgcgca ataatggttt ctgacgtatg tgcttagctc     1200

attaaactcc agaaacccgc ggctgagtgg ctccttcaac gttgcggttc tgtcagttcc     1260

aaacgtaaaa cggcttgtcc cgcgtcatcg gcgggggtca ttccggactg tggggccata     1320

tcccagaact ggttgagtcg gtccaacacc tgggtgccaa tcatgtcgat ggtggggtat     1380

ggccaatttt ttttcaattc aaaaatgtag atgtccgcag cgttattata aaatgaaagt     1440

acattttgat aaaacgacaa attacgatcc gtcgtattta taggcgaaag caataaacaa     1500

attattctaa ttcggaaatc tttatttcga cgtgtctaca ttcacgtcca aatgggggct     1560

tagatgagaa acttcacgat cggctctaga ggccatggcg gccgctcgag cgatctagta     1620

acatagatga caccgcgcgc gataatttat cctagtttgc gcgctatatt ttgttttcta     1680

tcgcgtatta aatgtataat tgcgggactc taatcataaa aacccatctc ataaataacg     1740

tcatgcatta catgttaatt attacatgct taacgtaatt caacagaaat tatatgataa     1800

tcatcgcaag accggcaaca ggattcaatc ttaagaaact ttattgccaa atgtttgaac     1860

gatcggggaa attcgatcta cacttagtag aaattacaag agttggctac aattagggtg     1920

atctacactt agtagaaatt atcaggtacc tctagacttg tacagctcgt ccatgccgta     1980

caggaacagg tggtggcggc cctcggagcg ctcgtactgt tccacgatgg tgtagtcctc     2040

gttgtgggag gtgatgtcca gcttggtgtc cacgtagtag tagccgggca gttgcacggg     2100

cttcttggcc atgtagatgg tcttgaactc caccaggtag tggccgccgt ccttcagctt     2160

cagggcctgg tggatctcgc ccttcagcac gccgtcgcgg gggtacaggc gctcggtgga     2220

ggcctcccag cccatggtct tcttctgcat tacggggccg tcggggggga agttggtgcc     2280

gcgcatcttc accttgtaga tcagcgtgcc gtcctgcagg gaggagtcct gggtcacggt     2340

caccagaccg ccgtcctcga agttcatcac gcgctcccac ttgaagccct cggggaagga     2400

cagcttcttg taatcgggga tgtcggcggg gtgcttcacg tacgccttgg agccgtacat     2460

gaactggggg gacaggatgt cccaggcgaa gggcaggggg ccgcccttgg tcaccttcag     2520

cttggcggtc tgggtgccct cgtaggggcg gccctcgccc tcgccctcga tctcgaactc     2580

gtggccgttc atggagccct ccatgcgcac cttgaagcgc atgaactctt tgatgacctc     2640

ctcgcccttg ctcaccatgg atcccctctc caaatgaaat gaacttcctt atatagagga     2700

agggtcttgc gaaggatagt gggattgtgc gtcatccctt acgtcagtgg agatatcaca     2760

tcaatccact tgctttgaag acgtggttgg aacgtcttct ttttccacga tgttcctcgt     2820

gggtgggggt ccatctttgg gaccactgtc ggtagaggca tcttgaacga tagcctttcc     2880

tttatcgcaa tgatggcatt tgtagaagcc atcttccttt tctactgtcc tttcgatgaa     2940

gtgacagata gctgggcaat ggaatccgag gaggtttccc gatattaccc tttgttgaaa     3000

agtctcaata gccctctggt cttctgagac tgtatctttg atattcttgg agtagacgag     3060

agtgtcgtgc tccaccatgt atcacatcaa tccacttgct ttgaagacgt ggttggaacg     3120

tcttcttttt ccacgatgtt cctcgtgggt gggggtccat ctttgggacc actgtcggta     3180

gaggcatctt gaacgatagc ctttccttta tcgcaatgat ggcatttgta gaagccatct     3240

tccttttcta ctgtcctttc gatgaagtga cagatagctg ggcaatggaa tccgaggagg     3300

tttcccgata ttaccctttg ttgaaaagtc tcaatagccc tctggtcttc tgaacactag     3360

taaggcctta agggccagat cccccgggct gcaggaattc gatctggcac gacaggtttc     3420

ccgactggaa agcgggcagt gagcgcaacg caattaatgt gagttagctc actcattagg     3480

caccccaggc tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt gtgagcggat     3540

aacaatttca cacaggaaac agctatgaca tgattacgaa ttcaaaaatt acggatatga     3600

atataggcat atccgtatcc gaattatccg tttgacagct agcaacgatt gtacaattgc     3660

ttctttaaaa aaggaagaaa gaaagaaaga aaagaatcaa catcagcgtt aacaaacggc     3720

cccgttacgg cccaaacggt catatagagt aacggcgtta agcgttgaaa gactcctatc     3780

gaaatacgta accgcaaacg tgtcatagtc agatcccctc ttccttcacc gcctcaaaca     3840

caaaaataat cttctacagc ctatatatac aaccccccct tctatctctc ctttctcaca     3900

attcatcatc tttctttctc tacccccaat tttaagaaat cctctcttct cctcttcatt     3960

ttcaaggtaa atctctctct ctctctctct ctctgttatt ccttgtttta attaggtatg     4020

tattattgct agtttgttaa tctgcttatc ttatgtatgc cttatgtgaa tatctttatc     4080

ttgttcatct catccgttta gaagctataa atttgttgat ttgactgtgt atctacacgt     4140

ggttatgttt atatctaatc agatatgaat ttcttcatat tgttgcgttt gtgtgtacca     4200

atccgaaatc gttgattttt ttcatttaat cgtgtagcta attgtacgta tacatatgga     4260

tctacgtatc aattgttcat ctgtttgtgt ttgtatgtat acagatctga aaacatcact     4320

tctctcatct gattgtgttg ttacatacat agatatagat ctgttatatc atttttttta     4380

ttaattgtgt atatatatat gtgcatagat ctggattaca tgattgtgat tatttacatg     4440

attttgttat ttacgtatgt atatatgtag atctggactt tttggagttg ttgacttgat     4500

tgtatttgtg tgtgtatatg tgtgttctga tcttgatatg ttatgtatgt gcagcgaatt     4560

cggcgcgcca aacaatggct tcctccatgg ctcctaagaa gaagaggaag gttagcaagc     4620

tcgagaagtt taccaactgc tacagcctct ctaagaccct caggttcaag gctatccctg     4680

tgggaaagac ccaagagaat atcgacaaca agaggctcct cgtcgaggat gagaagagag     4740

ctgaagatta caagggcgtg aagaagctcc tcgacaggta ctacctcagc ttcatcaacg     4800

atgtgctcca cagcatcaag ctcaagaacc tcaacaacta catcagcctc ttccgtaaga     4860

aaaccaggac cgagaaagag aacaaagagc ttgagaacct cgagatcaac ctccgtaaag     4920

agatcgccaa ggctttcaag ggaaacgagg gatacaagag cctcttcaag aaggatatta     4980

tcgagacaat cctgcctgag ttcctggacg ataaggatga gatcgctctc gtgaacagct     5040

tcaacggatt cactactgcc ttcaccggat tcttcgacaa cagggaaaac atgttcagcg     5100

aagaggccaa gagcacctct atcgctttca gatgcatcaa cgagaacctc acgcgttaca     5160

tcagcaacat ggacatcttc gagaaggtgg acgccatctt cgataagcac gaggtgcaag     5220

aaatcaaaga gaagatcctc aacagcgact acgacgtcga ggactttttt gaaggggagt     5280

tcttcaactt cgttctcacc caagagggca tcgacgtgta caacgctatt atcggaggat     5340

tcgtgaccga gtctggggag aagattaagg gactcaacga gtacatcaac ctgtacaacc     5400

agaaaacgaa gcagaagctc ccgaagttca agccgctcta caagcaggtt ctctctgatc     5460

gtgagagcct ctcattttac ggtgagggtt acacctctga cgaggaagtg cttgaggttt     5520

tccgtaacac cctcaacaag aacagcgaga tcttctcgtc catcaagaag ttggagaaac     5580

ttttcaagaa cttcgacgag tacagcagcg ctgggatctt cgttaagaac ggacctgcta     5640

tcagcaccat cagcaaggat attttcggcg agtggaacgt gatcagggac aagtggaatg     5700

ctgagtacga tgacatccac ctcaagaaga aggctgtcgt cactgagaag tacgaggatg     5760

acaggcgtaa gtcgttcaag aagatcggct ctttcagcct cgagcagctt caagaatacg     5820

ctgatgctga tctcagcgtg gtcgagaagc tcaaagagat catcatccag aaggtcgacg     5880

agatctacaa ggtgtacggg tcctctgaga agttgttcga tgctgatttc gtcctcgaga     5940

agagtctgaa gaagaacgac gctgtcgtcg cgatcatgaa ggatttgctc gacagcgtga     6000

agtccttcga gaactatatc aaggccttct tcggagaggg caaagagact aatagggacg     6060

agtctttcta cggggatttc gtgctcgctt acgatatcct cctcaaggtg gaccatatct     6120

acgacgccat cagaaactac gtgacccaga agccttacag caaggacaag ttcaagttgt     6180

actttcagaa cccgcagttc atgggcggat gggacaaaga caaagagaca gattacaggg     6240

ccaccatcct caggtacggg tctaagtact acctggccat catggacaag aaatacgcca     6300

agtgcctcca aaagatcgac aaggatgacg tgaacgggaa ctatgagaag atcaactaca     6360

agctccttcc gggaccgaac aagatgcttc ctaaggtgtt cttcagcaag aaatggatgg     6420

cctactacaa cccgtctgag gacatccaga aaatctacaa gaacgggacc ttcaagaaag     6480

gcgacatgtt caacctcaac gactgccaca agctcatcga tttcttcaag gacagcatct     6540

cgcgttaccc gaagtggtct aacgcttacg actttaactt cagcgagaca gaaaagtaca     6600

aggatatcgc cgggttctac cgtgaggttg aggaacaggg ttacaaggtt agcttcgaga     6660

gcgcctccaa gaaagaggtt gacaagttgg tcgaagaggg caagctctac atgttccaga     6720

tctataacaa ggacttctcc gacaagagcc acggaactcc taacctccat acgatgtact     6780

tcaagctgct tttcgacgag aacaaccacg ggcagatcag actttctggt ggtgctgaac     6840

tcttcatgcg tagggcctca ctcaagaaag aagagttggt tgttcacccg gccaactctc     6900

caatcgctaa caagaatcct gacaacccga aaaagaccac cacgctgtct tacgacgtct     6960

acaaggacaa aaggttcagc gaggaccagt acgagcttca tatcccgatc gctatcaaca     7020

agtgcccgaa gaacatcttc aagatcaata ccgaggtgag ggtgctgctc aagcacgatg     7080

ataaccctta cgtgatcgga atcgatcgtg gtgagagaaa cctcctctac atcgttgtgg     7140

tggacggaaa gggaaacatc gtcgagcagt acagcctgaa cgagattatc aacaatttca     7200

acggcatcag gatcaagacc gactaccact cactcctcga taagaaagaa aaagagcgtt     7260

tcgaggccag gcagaactgg acttctatcg aaaacatcaa agagttgaag gccggctaca     7320

tctctcaggt ggtgcataag atctgcgagc tggtggaaaa gtacgatgct gtgatcgctc     7380

ttgaggacct caactctggg ttcaagaaca gtagagtgaa ggttgagaag caggtctacc     7440

aaaagttcga gaagatgctc atcgacaagc tcaactacat ggtggacaaa aagagcaacc     7500

cttgcgctac cggtggtgct cttaagggat accagatcac gaacaagttc gagtccttca     7560

agagcatgag cacccagaac ggcttcatct tctatatccc tgcttggctc accagcaaga     7620

tcgatccttc tactggtttc gtgaacctgc tcaagaccaa gtacacctcg atcgccgaca     7680

gcaagaagtt catctcgtct ttcgacagga tcatgtacgt gccggaagag gatcttttcg     7740

agttcgctct cgactataag aacttcagca ggaccgacgc cgactacatt aagaagtgga     7800

agctctactc ctacgggaac cgtatcagga tcttccgaaa tccgaagaaa aacaacgtgt     7860

tcgactggga agaagtgtgc ctcacctctg cctacaaaga actgttcaac aagtacggca     7920

tcaactacca gcagggtgat atcagggctc ttttgtgcga gcagagcgac aaggcattct     7980

acagctcatt catggccctc atgtctctca tgctccagat gaggaactct atcaccggaa     8040

ggaccgatgt ggacttcctt atctctccgg tcaagaactc tgacgggatc ttctacgaca     8100

gccgtaacta tgaggctcaa gagaacgcta tcctgccgaa gaatgctgat gcaaacgggg     8160

cttacaacat tgcgagaaag gttctctggg ctatcgggca gtttaagaaa gcggaagatg     8220

agaagctcga caaggtgaag atcgccatct ccaacaaaga gtggcttgag tacgctcaga     8280

cctccgttaa gcacaagagg cctgctgcta ctaagaaagc tggccaggcc aaaaagaaga     8340

agtgaggcgc gccgagctcc aggcctccca gctttcgtcc gtatcatcgg tttcgacaac     8400

gttcgtcaag ttcaatgcat cagtttcatt gcccacacac cagaatccta ctaagtttga     8460

gtattatggc attggaaaag ctgttttctt ctatcatttg ttctgcttgt aatttactgt     8520

gttctttcag tttttgtttt cggacatcaa aatgcaaatg gatggataag agttaataaa     8580

tgatatggtc cttttgttca ttctcaaatt attattatct gttgttttta ctttaatggg     8640

ttgaatttaa gtaagaaagg aactaacagt gtgatattaa ggtgcaatgt tagacatata     8700

aaacagtctt tcacctctct ttggttatgt cttgaattgg tttgtttctt cacttatctg     8760

tgtaatcaag tttactatga gtctatgatc aagtaattat gcaatcaagt taagtacagt     8820

ataggctt                                                              8828


<210>  94
<211>  8201
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  expression plasmid

<400>  94
ttgaaagaaa tatagtttaa atatttattg ataaaataac aagtcaggta ttatagtcca       60

agcaaaaaca taaatttatt gatgcaagtt taaattcaga aatatttcaa taactgatta      120

tatcagctgg tacattgccg tagatgaaag actgagtgcg atattatgtg taatacataa      180

attgatgata tagctagctt agctcatcgg gtcagaagaa ctcgtcaaga aggcgataga      240

aggcgatgcg ctgcgaatcg ggagcggcga taccgtaaag cacgaggaag cggtcagccc      300

attcgccgcc aagctcttca gcaatatcac gggtagccaa cgctatgtcc tgatagcggt      360

ccgccacacc cagccggcca cagtcgatga atccagaaaa gcggccattt tccaccatga      420

tattcggcaa gcaggcatcg ccatgagtca cgacgagatc ctcgccgtcg ggcatgcgcg      480

ccttgagcct ggcgaacagt tcggctggcg cgagcccctg atgctcttcg tccagatcat      540

cctgatcgac aagaccggct tccatccgag tacgtgctcg ctcgatgcga tgtttcgctt      600

ggtggtcgaa tgggcaggta gccggatcaa gcgtatgcag ccgccgcatt gcatcagcca      660

tgatggatac tttctcggca ggagcaaggt gagatgacag gagatcctgc cccggcactt      720

cgcccaatag cagccagtcc cttcccgctt cagtgacaac gtcgagcaca gctgcgcaag      780

gaacgcccgt cgtggccagc cacgatagcc gcgctgcctc gtcctgaagt tcattcaggg      840

caccggacag gtcggtcttg acaaaaagaa ccgggcgccc ctgcgctgac agccggaaca      900

cggcggcatc agagcagccg attgtctgtt gtgcccagtc atagccgaat agcctctcca      960

cccaagcggc cggagaacct gcgtgcaatc catcttgttc aatccaagct cccattttct     1020

gtgattttgg atgtgttttt gtatggattg agagtgaata tgagactcta attggatacc     1080

gaggggaatt tatggaacgt cagtggagca tttttgacaa gaaatatttg ctagctgata     1140

gtgaccttag gcgacttttg aacgcgcaat aatggtttct gacgtatgtg cttagctcat     1200

taaactccag aaacccgcgg ctgagtggct ccttcaacgt tgcggttctg tcagttccaa     1260

acgtaaaacg gcttgtcccg cgtcatcggc gggggtcatt ccggactgtg gggccatatc     1320

ccagaactgg ttgagtcggt ccaacacctg ggtgccaatc atgtcgatgg tggggtatgg     1380

ccaatttttt ttcaattcaa aaatgtagat gtccgcagcg ttattataaa atgaaagtac     1440

attttgataa aacgacaaat tacgatccgt cgtatttata ggcgaaagca ataaacaaat     1500

tattctaatt cggaaatctt tatttcgacg tgtctacatt cacgtccaaa tgggggctta     1560

gatgagaaac ttcacgatcg gctctagagg ccatggcggc cgctcgagcg atctagtaac     1620

atagatgaca ccgcgcgcga taatttatcc tagtttgcgc gctatatttt gttttctatc     1680

gcgtattaaa tgtataattg cgggactcta atcataaaaa cccatctcat aaataacgtc     1740

atgcattaca tgttaattat tacatgctta acgtaattca acagaaatta tatgataatc     1800

atcgcaagac cggcaacagg attcaatctt aagaaacttt attgccaaat gtttgaacga     1860

tcgcctctct gcttggctcc tctttcttgt gaaagagggt tcccaggcga caccacgagg     1920

aggtgcagag taggactgac gcccaagagt tggctacaat tagggtgatc tacacttagt     1980

agaaattaga cgagcttact cgtttcgtcc tcacggactc atcagtaatt tggatcccct     2040

ctccaaatga aatgaacttc cttatataga ggaagggtct tgcgaaggat agtgggattg     2100

tgcgtcatcc cttacgtcag tggagatatc acatcaatcc acttgctttg aagacgtggt     2160

tggaacgtct tctttttcca cgatgttcct cgtgggtggg ggtccatctt tgggaccact     2220

gtcggtagag gcatcttgaa cgatagcctt tcctttatcg caatgatggc atttgtagaa     2280

gccatcttcc ttttctactg tcctttcgat gaagtgacag atagctgggc aatggaatcc     2340

gaggaggttt cccgatatta ccctttgttg aaaagtctca atagccctct ggtcttctga     2400

gactgtatct ttgatattct tggagtagac gagagtgtcg tgctccacca tgtatcacat     2460

caatccactt gctttgaaga cgtggttgga acgtcttctt tttccacgat gttcctcgtg     2520

ggtgggggtc catctttggg accactgtcg gtagaggcat cttgaacgat agcctttcct     2580

ttatcgcaat gatggcattt gtagaagcca tcttcctttt ctactgtcct ttcgatgaag     2640

tgacagatag ctgggcaatg gaatccgagg aggtttcccg atattaccct ttgttgaaaa     2700

gtctcaatag ccctctggtc ttctgaacac tagtaaggcc ttaagggcca gatcccccgg     2760

gctgcaggaa ttcgatctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca     2820

acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc     2880

cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg     2940

acatgattac gaattcaaaa attacggata tgaatatagg catatccgta tccgaattat     3000

ccgtttgaca gctagcaacg attgtacaat tgcttcttta aaaaaggaag aaagaaagaa     3060

agaaaagaat caacatcagc gttaacaaac ggccccgtta cggcccaaac ggtcatatag     3120

agtaacggcg ttaagcgttg aaagactcct atcgaaatac gtaaccgcaa acgtgtcata     3180

gtcagatccc ctcttccttc accgcctcaa acacaaaaat aatcttctac agcctatata     3240

tacaaccccc ccttctatct ctcctttctc acaattcatc atctttcttt ctctaccccc     3300

aattttaaga aatcctctct tctcctcttc attttcaagg taaatctctc tctctctctc     3360

tctctctgtt attccttgtt ttaattaggt atgtattatt gctagtttgt taatctgctt     3420

atcttatgta tgccttatgt gaatatcttt atcttgttca tctcatccgt ttagaagcta     3480

taaatttgtt gatttgactg tgtatctaca cgtggttatg tttatatcta atcagatatg     3540

aatttcttca tattgttgcg tttgtgtgta ccaatccgaa atcgttgatt tttttcattt     3600

aatcgtgtag ctaattgtac gtatacatat ggatctacgt atcaattgtt catctgtttg     3660

tgtttgtatg tatacagatc tgaaaacatc acttctctca tctgattgtg ttgttacata     3720

catagatata gatctgttat atcatttttt ttattaattg tgtatatata tatgtgcata     3780

gatctggatt acatgattgt gattatttac atgattttgt tatttacgta tgtatatatg     3840

tagatctgga ctttttggag ttgttgactt gattgtattt gtgtgtgtat atgtgtgttc     3900

tgatcttgat atgttatgta tgtgcagcga attcggcgcg ccaaacaatg gcttcctcca     3960

tggctcctaa gaagaagagg aaggttagca agctcgagaa gtttaccaac tgctacagcc     4020

tctctaagac cctcaggttc aaggctatcc ctgtgggaaa gacccaagag aatatcgaca     4080

acaagaggct cctcgtcgag gatgagaaga gagctgaaga ttacaagggc gtgaagaagc     4140

tcctcgacag gtactacctc agcttcatca acgatgtgct ccacagcatc aagctcaaga     4200

acctcaacaa ctacatcagc ctcttccgta agaaaaccag gaccgagaaa gagaacaaag     4260

agcttgagaa cctcgagatc aacctccgta aagagatcgc caaggctttc aagggaaacg     4320

agggatacaa gagcctcttc aagaaggata ttatcgagac aatcctgcct gagttcctgg     4380

acgataagga tgagatcgct ctcgtgaaca gcttcaacgg attcactact gccttcaccg     4440

gattcttcga caacagggaa aacatgttca gcgaagaggc caagagcacc tctatcgctt     4500

tcagatgcat caacgagaac ctcacgcgtt acatcagcaa catggacatc ttcgagaagg     4560

tggacgccat cttcgataag cacgaggtgc aagaaatcaa agagaagatc ctcaacagcg     4620

actacgacgt cgaggacttt tttgaagggg agttcttcaa cttcgttctc acccaagagg     4680

gcatcgacgt gtacaacgct attatcggag gattcgtgac cgagtctggg gagaagatta     4740

agggactcaa cgagtacatc aacctgtaca accagaaaac gaagcagaag ctcccgaagt     4800

tcaagccgct ctacaagcag gttctctctg atcgtgagag cctctcattt tacggtgagg     4860

gttacacctc tgacgaggaa gtgcttgagg ttttccgtaa caccctcaac aagaacagcg     4920

agatcttctc gtccatcaag aagttggaga aacttttcaa gaacttcgac gagtacagca     4980

gcgctgggat cttcgttaag aacggacctg ctatcagcac catcagcaag gatattttcg     5040

gcgagtggaa cgtgatcagg gacaagtgga atgctgagta cgatgacatc cacctcaaga     5100

agaaggctgt cgtcactgag aagtacgagg atgacaggcg taagtcgttc aagaagatcg     5160

gctctttcag cctcgagcag cttcaagaat acgctgatgc tgatctcagc gtggtcgaga     5220

agctcaaaga gatcatcatc cagaaggtcg acgagatcta caaggtgtac gggtcctctg     5280

agaagttgtt cgatgctgat ttcgtcctcg agaagagtct gaagaagaac gacgctgtcg     5340

tcgcgatcat gaaggatttg ctcgacagcg tgaagtcctt cgagaactat atcaaggcct     5400

tcttcggaga gggcaaagag actaataggg acgagtcttt ctacggggat ttcgtgctcg     5460

cttacgatat cctcctcaag gtggaccata tctacgacgc catcagaaac tacgtgaccc     5520

agaagcctta cagcaaggac aagttcaagt tgtactttca gaacccgcag ttcatgggcg     5580

gatgggacaa agacaaagag acagattaca gggccaccat cctcaggtac gggtctaagt     5640

actacctggc catcatggac aagaaatacg ccaagtgcct ccaaaagatc gacaaggatg     5700

acgtgaacgg gaactatgag aagatcaact acaagctcct tccgggaccg aacaagatgc     5760

ttcctaaggt gttcttcagc aagaaatgga tggcctacta caacccgtct gaggacatcc     5820

agaaaatcta caagaacggg accttcaaga aaggcgacat gttcaacctc aacgactgcc     5880

acaagctcat cgatttcttc aaggacagca tctcgcgtta cccgaagtgg tctaacgctt     5940

acgactttaa cttcagcgag acagaaaagt acaaggatat cgccgggttc taccgtgagg     6000

ttgaggaaca gggttacaag gttagcttcg agagcgcctc caagaaagag gttgacaagt     6060

tggtcgaaga gggcaagctc tacatgttcc agatctataa caaggacttc tccgacaaga     6120

gccacggaac tcctaacctc catacgatgt acttcaagct gcttttcgac gagaacaacc     6180

acgggcagat cagactttct ggtggtgctg aactcttcat gcgtagggcc tcactcaaga     6240

aagaagagtt ggttgttcac ccggccaact ctccaatcgc taacaagaat cctgacaacc     6300

cgaaaaagac caccacgctg tcttacgacg tctacaagga caaaaggttc agcgaggacc     6360

agtacgagct tcatatcccg atcgctatca acaagtgccc gaagaacatc ttcaagatca     6420

ataccgaggt gagggtgctg ctcaagcacg atgataaccc ttacgtgatc ggaatcgatc     6480

gtggtgagag aaacctcctc tacatcgttg tggtggacgg aaagggaaac atcgtcgagc     6540

agtacagcct gaacgagatt atcaacaatt tcaacggcat caggatcaag accgactacc     6600

actcactcct cgataagaaa gaaaaagagc gtttcgaggc caggcagaac tggacttcta     6660

tcgaaaacat caaagagttg aaggccggct acatctctca ggtggtgcat aagatctgcg     6720

agctggtgga aaagtacgat gctgtgatcg ctcttgagga cctcaactct gggttcaaga     6780

acagtagagt gaaggttgag aagcaggtct accaaaagtt cgagaagatg ctcatcgaca     6840

agctcaacta catggtggac aaaaagagca acccttgcgc taccggtggt gctcttaagg     6900

gataccagat cacgaacaag ttcgagtcct tcaagagcat gagcacccag aacggcttca     6960

tcttctatat ccctgcttgg ctcaccagca agatcgatcc ttctactggt ttcgtgaacc     7020

tgctcaagac caagtacacc tcgatcgccg acagcaagaa gttcatctcg tctttcgaca     7080

ggatcatgta cgtgccggaa gaggatcttt tcgagttcgc tctcgactat aagaacttca     7140

gcaggaccga cgccgactac attaagaagt ggaagctcta ctcctacggg aaccgtatca     7200

ggatcttccg aaatccgaag aaaaacaacg tgttcgactg ggaagaagtg tgcctcacct     7260

ctgcctacaa agaactgttc aacaagtacg gcatcaacta ccagcagggt gatatcaggg     7320

ctcttttgtg cgagcagagc gacaaggcat tctacagctc attcatggcc ctcatgtctc     7380

tcatgctcca gatgaggaac tctatcaccg gaaggaccga tgtggacttc cttatctctc     7440

cggtcaagaa ctctgacggg atcttctacg acagccgtaa ctatgaggct caagagaacg     7500

ctatcctgcc gaagaatgct gatgcaaacg gggcttacaa cattgcgaga aaggttctct     7560

gggctatcgg gcagtttaag aaagcggaag atgagaagct cgacaaggtg aagatcgcca     7620

tctccaacaa agagtggctt gagtacgctc agacctccgt taagcacaag aggcctgctg     7680

ctactaagaa agctggccag gccaaaaaga agaagtgagg cgcgccgagc tccaggcctc     7740

ccagctttcg tccgtatcat cggtttcgac aacgttcgtc aagttcaatg catcagtttc     7800

attgcccaca caccagaatc ctactaagtt tgagtattat ggcattggaa aagctgtttt     7860

cttctatcat ttgttctgct tgtaatttac tgtgttcttt cagtttttgt tttcggacat     7920

caaaatgcaa atggatggat aagagttaat aaatgatatg gtccttttgt tcattctcaa     7980

attattatta tctgttgttt ttactttaat gggttgaatt taagtaagaa aggaactaac     8040

agtgtgatat taaggtgcaa tgttagacat ataaaacagt ctttcacctc tctttggtta     8100

tgtcttgaat tggtttgttt cttcacttat ctgtgtaatc aagtttacta tgagtctatg     8160

atcaagtaat tatgcaatca agttaagtac agtataggct t                         8201


<210>  95
<211>  87
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  expression casssette

<400>  95
cggggaaatt cgatctacac ttagtagaaa ttacaagagt tggctacaat tagggtgatc       60

tacacttagt agaaattatc aggtacc                                           87


<210>  96
<211>  141
<212>  DNA
<213>  Beta vulgaris

<400>  96
ggacaaccat acctaaatgt tgattgttta gcagcattat cttaactggg agattttcca       60

ccctaattgt agccaactct tgaacattca taataaaact gccatcccca tcaatatcga      120

caaccactgc atctggtcga g                                                141


<210>  97
<211>  135
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel1

<400>  97
ggacaaccat acctaaatgt tgattgttta gcagcattat cttaactttt tccaccctaa       60

ttgtagccaa ctcttgaaca ttcataataa aactgccatc cccatcaata tcgacaacca      120

ctgcatctgg tcgag                                                       135


<210>  98
<211>  134
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel2

<400>  98
ggacaaccat acctaaatgt tgattgttta gcagcattat cttagatttt ccaccctaat       60

tgtagccaac tcttgaacat tcataataaa actgccatcc ccatcaatat cgacaaccac      120

tgcatctggt cgag                                                        134


<210>  99
<211>  126
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel3

<400>  99
ggacaaccat acctaaatgt tgattgttta gcagcattat ttccacccta attgtagcca       60

actcttgaac attcataata aaactgccat ccccatcaat atcgacaacc actgcatctg      120

gtcgag                                                                 126


<210>  100
<211>  103
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel4

<400>  100
ggacaaccat acctaaatgt tgattgttta gcagcattat cttgaacatt cataataaaa       60

ctgccatccc catcaatatc gacaaccact gcatctggtc gag                        103


<210>  101
<211>  83
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel5

<400>  101
ggacaaccat acctaaatgt tgattgttta gcagcattat ctgccatccc catcaatatc       60

gacaaccact gcatctggtc gag                                               83


<210>  102
<211>  76
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel6

<400>  102
ggacaaccat acctaaatgt tgattgttta gcagcattat ccccatcaat atcgacaacc       60

actgcatctg gtcgag                                                       76


<210>  103
<211>  64
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel7

<400>  103
ggacaaccat acctaaatgt tgattgttta gcagcattat cgacaaccac tgcatctggt       60

cgag                                                                    64


<210>  104
<211>  59
<212>  DNA
<213>  Zea mays

<400>  104
tttaatttac tgtcacgatt cccctctcct ggtcgaactt ttcaggtggg gaaagctgc        59


<210>  105
<211>  53
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel at crGEP5 target locus

<400>  105
tttgatttac tgtcacgatt ccctggtcga acttttcagg tggggaaagc tgc              53


<210>  106
<211>  58
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel at crGEP5 target locus

<400>  106
tttgatttac tgtcacgatt cccctctcct gtcgaacttt tcaggtgggg aaagctgc         58


<210>  107

<400>  107
000

<210>  108

<400>  108
000

<210>  109
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel at crGEP5 target locus

<400>  109
tttgatttac tgtctaactt ttcaggtggg gaaagctgc                              39


<210>  110
<211>  58
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel at crGEP5 target locus

<400>  110
tttgatttac tgtcatgatt cccctccctg gtcgaacttt tcaggtgggg aaagctgc         58


<210>  111
<211>  60
<212>  DNA
<213>  Zea mays

<400>  111
ctgtggcact accaagctcc tgctcaccat tccaagaatc cttgagcttg ctgaagagct       60


<210>  112
<211>  54
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel at crGEP75 target locus

<400>  112
ctgtggcact accaagctcc tgctcaccag aatccttgag cttgctgaag agct             54


<210>  113
<211>  56
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel at crGEP75 target locus

<400>  113
ctgtggcact accaagctcc tgctcacctt ggaatccttg agcttgctga agagct           56


<210>  114
<211>  51
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel at crGEP75 target locus

<400>  114
ctgtggcact accaagctcc tgctcagaat ccttgagctt gctgaagagc t                51


<210>  115
<211>  36
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel at crGEP75 target locus

<400>  115
ctgtggcact accaagctcc tgcttgctga agagct                                 36


<210>  116
<211>  53
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel at crGEP75 target locus

<400>  116
ctgtggcact accaagctcc tgctcaatga atccttgagc ttgctgaaga gct              53


<210>  117
<211>  57
<212>  DNA
<213>  Zea mays

<400>  117
gcaggagctg cctggaggcg ggctcctcgt gtaccagagc ttctgtgctg aagacgc          57


<210>  118
<211>  49
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel at crGEP77 target locus

<400>  118
gcaggagctg cctggaggcg ggctcctcga gcttctgtgc tgaagacgc                   49


<210>  119
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel at crGEP77 target locus

<400>  119
gcaggagctg cctggaggcg ggctctgtgc tgaagacgc                              39


<210>  120
<211>  51
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel at crGEP77 target locus

<400>  120
gcaggagctg cctggaggcg ggctcctcgt gagcttctgt gctgaagacg c                51


<210>  121
<211>  48
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel at crGEP77 target locus

<400>  121
gcaggagctg cctggaggcg ggctcctgag cttctgtgct gaagacgc                    48


<210>  122
<211>  51
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel at crGEP77 target locus

<400>  122
gcaggagctg cctggaggcg ggctcctcgg gagcttctgt gctgaagacg c                51


<210>  123
<211>  48
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel at crGEP77 target locus

<400>  123
gcaggagctg cctggaggcg ggctcctcag cttctgtgct gaagacgc                    48


<210>  124
<211>  50
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel at crGEP77 target locus

<400>  124
gcaggagctg cctggaggcg ggctcctcgt agcttctgtg ctgaagacgc                  50


<210>  125
<211>  60
<212>  DNA
<213>  Triticum aestivum

<400>  125
gctcttcccg caccgcttca gccctgcagc acgcacccat ccatgacgca acacacatca       60


<210>  126
<211>  57
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel in A genome

<400>  126
gctcttcccg caccgcttca gccctgcacg cacccatcca tgacgcaaca cacatca          57


<210>  127
<211>  53
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel in A genome

<400>  127
gctcttcccg caccgcttca gccccgcacc catccatgac gcaacacaca tca              53


<210>  128
<211>  55
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel in A genome

<400>  128
gctcttcccg caccgcttca gccctacgca cccatccatg acgcaacaca catca            55


<210>  129
<211>  56
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel in A genome

<400>  129
gctcttcccg caccgcttca gcctgcacgc acccatccat gacgcaacac acatca           56


<210>  130
<211>  54
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel in A genome

<400>  130
gctcttcccg caccgcttca gccctggcac ccatccatga cgcaacacac atca             54


<210>  131
<211>  46
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel in A genome

<400>  131
gctcttcccg caccgcttca gcccatccat gacgcaacac acatca                      46


<210>  132
<211>  48
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel in A genome

<400>  132
gctcttcccg caccgcttca gcccccatcc atgacgcaac acacatca                    48


<210>  133
<211>  48
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel in A genome

<400>  133
gctcttcccg caccgcttca gcacccatcc atgacgcaac acacatca                    48


<210>  134
<211>  60
<212>  DNA
<213>  Triticum aestivum

<400>  134
gctcttcccg caccgcttca gccctgcagc acgcacccat ccatgacgca acacacatca       60


<210>  135
<211>  50
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel in B genome

<400>  135
gctcttcccg caccgcttca gcccacccat ccatgacgca acacacatca                  50


<210>  136
<211>  54
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel in B genome

<400>  136
gctcttcccg caccgcttca gcccacgcac ccatccatga cgcaacacac atca             54


<210>  137
<211>  53
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel in B genome

<400>  137
gctcttcccg caccgcttca gccacgcacc catccatgac gcaacacaca tca              53


<210>  138
<211>  51
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel in B genome

<400>  138
gctcttcccg caccgcttca gccgcaccca tccatgacgc aacacacatc a                51


<210>  139
<211>  50
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel in B genome

<400>  139
gctcttcccg caccgcttca gcgcacccat ccatgacgca acacacatca                  50


<210>  140
<211>  60
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel in B genome

<400>  140
catccatgac gcaccacaca tcagcccagc acgcacccat ccatgacgca acacacatca       60


<210>  141
<211>  60
<212>  DNA
<213>  Triticum aestivum

<400>  141
gctcttcccg caccgcttca gccctgcagc acgcacccat ccatgacgca acacacatca       60


<210>  142
<211>  53
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel in D genome

<400>  142
gctcttcccg caccgcttca gccccgcacc catccatgac gcaacacaca tca              53


<210>  143
<211>  47
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel in D genome

<400>  143
gctcttcccg caccgcttca gccccatcca tgacgcaaca cacatca                     47


<210>  144
<211>  51
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  InDel in D genome

<400>  144
gctcttcccg caccgcttca gccctaccca tccatgacgc aacacacatc a                51


<210>  145
<211>  96
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  expression cassette

<400>  145
aattcgatct acacttagta gaaattaatc tcccagtaga aattaatctc ccagttaaga       60

taatgctgca tctacactta gtagaaatta tcaggt                                 96


<210>  146
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  protospacer

<400>  146
caccctaatt gtagccaact cttg                                              24


<210>  147
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  protospacer

<400>  147
gcagcattat cttaactggg agat                                              24


<210>  148
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  protospacer

<400>  148
ggtatggttg tccaatggga agat                                              24


<210>  149
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  protospacer

<400>  149
caaggtatgt atgtgcccgg ttag                                              24


<210>  150
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  protospacer

<400>  150
gaagggtttc caaggtatgt atgt                                              24


<210>  151
<211>  187
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  expression cassettte

<400>  151
cgatcgcctc tctgcttggc tcctctttct tgtgaaagag ggttcccagg cgacaccacg       60

aggaggtgca gagtaggact gacgcccaag agttggctac aattagggtg atctacactt      120

agtagaaatt agacgagctt actcgtttcg tcctcacgga ctcatcagta atttggatcc      180

cctctcc                                                                187


<210>  152
<211>  1252
<212>  PRT
<213>  Lachnospiraceae bacterium

<400>  152

Met Ala Pro Lys Lys Lys Arg Lys Val Ser Lys Leu Glu Lys Phe Thr 
1               5                   10                  15      


Asn Cys Tyr Ser Leu Ser Lys Thr Leu Arg Phe Lys Ala Ile Pro Val 
            20                  25                  30          


Gly Lys Thr Gln Glu Asn Ile Asp Asn Lys Arg Leu Leu Val Glu Asp 
        35                  40                  45              


Glu Lys Arg Ala Glu Asp Tyr Lys Gly Val Lys Lys Leu Leu Asp Arg 
    50                  55                  60                  


Tyr Tyr Leu Ser Phe Ile Asn Asp Val Leu His Ser Ile Lys Leu Lys 
65                  70                  75                  80  


Asn Leu Asn Asn Tyr Ile Ser Leu Phe Arg Lys Lys Thr Arg Thr Glu 
                85                  90                  95      


Lys Glu Asn Lys Glu Leu Glu Asn Leu Glu Ile Asn Leu Arg Lys Glu 
            100                 105                 110         


Ile Ala Lys Ala Phe Lys Gly Asn Glu Gly Tyr Lys Ser Leu Phe Lys 
        115                 120                 125             


Lys Asp Ile Ile Glu Thr Ile Leu Pro Glu Phe Leu Asp Asp Lys Asp 
    130                 135                 140                 


Glu Ile Ala Leu Val Asn Ser Phe Asn Gly Phe Thr Thr Ala Phe Thr 
145                 150                 155                 160 


Gly Phe Phe Asp Asn Arg Glu Asn Met Phe Ser Glu Glu Ala Lys Ser 
                165                 170                 175     


Thr Ser Ile Ala Phe Arg Cys Ile Asn Glu Asn Leu Thr Arg Tyr Ile 
            180                 185                 190         


Ser Asn Met Asp Ile Phe Glu Lys Val Asp Ala Ile Phe Asp Lys His 
        195                 200                 205             


Glu Val Gln Glu Ile Lys Glu Lys Ile Leu Asn Ser Asp Tyr Asp Val 
    210                 215                 220                 


Glu Asp Phe Phe Glu Gly Glu Phe Phe Asn Phe Val Leu Thr Gln Glu 
225                 230                 235                 240 


Gly Ile Asp Val Tyr Asn Ala Ile Ile Gly Gly Phe Val Thr Glu Ser 
                245                 250                 255     


Gly Glu Lys Ile Lys Gly Leu Asn Glu Tyr Ile Asn Leu Tyr Asn Gln 
            260                 265                 270         


Lys Thr Lys Gln Lys Leu Pro Lys Phe Lys Pro Leu Tyr Lys Gln Val 
        275                 280                 285             


Leu Ser Asp Arg Glu Ser Leu Ser Phe Tyr Gly Glu Gly Tyr Thr Ser 
    290                 295                 300                 


Asp Glu Glu Val Leu Glu Val Phe Arg Asn Thr Leu Asn Lys Asn Ser 
305                 310                 315                 320 


Glu Ile Phe Ser Ser Ile Lys Lys Leu Glu Lys Leu Phe Lys Asn Phe 
                325                 330                 335     


Asp Glu Tyr Ser Ser Ala Gly Ile Phe Val Lys Asn Gly Pro Ala Ile 
            340                 345                 350         


Ser Thr Ile Ser Lys Asp Ile Phe Gly Glu Trp Asn Val Ile Arg Asp 
        355                 360                 365             


Lys Trp Asn Ala Glu Tyr Asp Asp Ile His Leu Lys Lys Lys Ala Val 
    370                 375                 380                 


Val Thr Glu Lys Tyr Glu Asp Asp Arg Arg Lys Ser Phe Lys Lys Ile 
385                 390                 395                 400 


Gly Ser Phe Ser Leu Glu Gln Leu Gln Glu Tyr Ala Asp Ala Asp Leu 
                405                 410                 415     


Ser Val Val Glu Lys Leu Lys Glu Ile Ile Ile Gln Lys Val Asp Glu 
            420                 425                 430         


Ile Tyr Lys Val Tyr Gly Ser Ser Glu Lys Leu Phe Asp Ala Asp Phe 
        435                 440                 445             


Val Leu Glu Lys Ser Leu Lys Lys Asn Asp Ala Val Val Ala Ile Met 
    450                 455                 460                 


Lys Asp Leu Leu Asp Ser Val Lys Ser Phe Glu Asn Tyr Ile Lys Ala 
465                 470                 475                 480 


Phe Phe Gly Glu Gly Lys Glu Thr Asn Arg Asp Glu Ser Phe Tyr Gly 
                485                 490                 495     


Asp Phe Val Leu Ala Tyr Asp Ile Leu Leu Lys Val Asp His Ile Tyr 
            500                 505                 510         


Asp Ala Ile Arg Asn Tyr Val Thr Gln Lys Pro Tyr Ser Lys Asp Lys 
        515                 520                 525             


Phe Lys Leu Tyr Phe Gln Asn Pro Gln Phe Met Gly Gly Trp Asp Lys 
    530                 535                 540                 


Asp Lys Glu Thr Asp Tyr Arg Ala Thr Ile Leu Arg Tyr Gly Ser Lys 
545                 550                 555                 560 


Tyr Tyr Leu Ala Ile Met Asp Lys Lys Tyr Ala Lys Cys Leu Gln Lys 
                565                 570                 575     


Ile Asp Lys Asp Asp Val Asn Gly Asn Tyr Glu Lys Ile Asn Tyr Lys 
            580                 585                 590         


Leu Leu Pro Gly Pro Asn Lys Met Leu Pro Lys Val Phe Phe Ser Lys 
        595                 600                 605             


Lys Trp Met Ala Tyr Tyr Asn Pro Ser Glu Asp Ile Gln Lys Ile Tyr 
    610                 615                 620                 


Lys Asn Gly Thr Phe Lys Lys Gly Asp Met Phe Asn Leu Asn Asp Cys 
625                 630                 635                 640 


His Lys Leu Ile Asp Phe Phe Lys Asp Ser Ile Ser Arg Tyr Pro Lys 
                645                 650                 655     


Trp Ser Asn Ala Tyr Asp Phe Asn Phe Ser Glu Thr Glu Lys Tyr Lys 
            660                 665                 670         


Asp Ile Ala Gly Phe Tyr Arg Glu Val Glu Glu Gln Gly Tyr Lys Val 
        675                 680                 685             


Ser Phe Glu Ser Ala Ser Lys Lys Glu Val Asp Lys Leu Val Glu Glu 
    690                 695                 700                 


Gly Lys Leu Tyr Met Phe Gln Ile Tyr Asn Lys Asp Phe Ser Asp Lys 
705                 710                 715                 720 


Ser His Gly Thr Pro Asn Leu His Thr Met Tyr Phe Lys Leu Leu Phe 
                725                 730                 735     


Asp Glu Asn Asn His Gly Gln Ile Arg Leu Ser Gly Gly Ala Glu Leu 
            740                 745                 750         


Phe Met Arg Arg Ala Ser Leu Lys Lys Glu Glu Leu Val Val His Pro 
        755                 760                 765             


Ala Asn Ser Pro Ile Ala Asn Lys Asn Pro Asp Asn Pro Lys Lys Thr 
    770                 775                 780                 


Thr Thr Leu Ser Tyr Asp Val Tyr Lys Asp Lys Arg Phe Ser Glu Asp 
785                 790                 795                 800 


Gln Tyr Glu Leu His Ile Pro Ile Ala Ile Asn Lys Cys Pro Lys Asn 
                805                 810                 815     


Ile Phe Lys Ile Asn Thr Glu Val Arg Val Leu Leu Lys His Asp Asp 
            820                 825                 830         


Asn Pro Tyr Val Ile Gly Ile Asp Arg Gly Glu Arg Asn Leu Leu Tyr 
        835                 840                 845             


Ile Val Val Val Asp Gly Lys Gly Asn Ile Val Glu Gln Tyr Ser Leu 
    850                 855                 860                 


Asn Glu Ile Ile Asn Asn Phe Asn Gly Ile Arg Ile Lys Thr Asp Tyr 
865                 870                 875                 880 


His Ser Leu Leu Asp Lys Lys Glu Lys Glu Arg Phe Glu Ala Arg Gln 
                885                 890                 895     


Asn Trp Thr Ser Ile Glu Asn Ile Lys Glu Leu Lys Ala Gly Tyr Ile 
            900                 905                 910         


Ser Gln Val Val His Lys Ile Cys Glu Leu Val Glu Lys Tyr Asp Ala 
        915                 920                 925             


Val Ile Ala Leu Glu Asp Leu Asn Ser Gly Phe Lys Asn Ser Arg Val 
    930                 935                 940                 


Lys Val Glu Lys Gln Val Tyr Gln Lys Phe Glu Lys Met Leu Ile Asp 
945                 950                 955                 960 


Lys Leu Asn Tyr Met Val Asp Lys Lys Ser Asn Pro Cys Ala Thr Gly 
                965                 970                 975     


Gly Ala Leu Lys Gly Tyr Gln Ile Thr Asn Lys Phe Glu Ser Phe Lys 
            980                 985                 990         


Ser Met Ser Thr Gln Asn Gly Phe  Ile Phe Tyr Ile Pro  Ala Trp Leu 
        995                 1000                 1005             


Thr Ser  Lys Ile Asp Pro Ser  Thr Gly Phe Val Asn  Leu Leu Lys 
    1010                 1015                 1020             


Thr Lys  Tyr Thr Ser Ile Ala  Asp Ser Lys Lys Phe  Ile Ser Ser 
    1025                 1030                 1035             


Phe Asp  Arg Ile Met Tyr Val  Pro Glu Glu Asp Leu  Phe Glu Phe 
    1040                 1045                 1050             


Ala Leu  Asp Tyr Lys Asn Phe  Ser Arg Thr Asp Ala  Asp Tyr Ile 
    1055                 1060                 1065             


Lys Lys  Trp Lys Leu Tyr Ser  Tyr Gly Asn Arg Ile  Arg Ile Phe 
    1070                 1075                 1080             


Arg Asn  Pro Lys Lys Asn Asn  Val Phe Asp Trp Glu  Glu Val Cys 
    1085                 1090                 1095             


Leu Thr  Ser Ala Tyr Lys Glu  Leu Phe Asn Lys Tyr  Gly Ile Asn 
    1100                 1105                 1110             


Tyr Gln  Gln Gly Asp Ile Arg  Ala Leu Leu Cys Glu  Gln Ser Asp 
    1115                 1120                 1125             


Lys Ala  Phe Tyr Ser Ser Phe  Met Ala Leu Met Ser  Leu Met Leu 
    1130                 1135                 1140             


Gln Met  Arg Asn Ser Ile Thr  Gly Arg Thr Asp Val  Asp Phe Leu 
    1145                 1150                 1155             


Ile Ser  Pro Val Lys Asn Ser  Asp Gly Ile Phe Tyr  Asp Ser Arg 
    1160                 1165                 1170             


Asn Tyr  Glu Ala Gln Glu Asn  Ala Ile Leu Pro Lys  Asn Ala Asp 
    1175                 1180                 1185             


Ala Asn  Gly Ala Tyr Asn Ile  Ala Arg Lys Val Leu  Trp Ala Ile 
    1190                 1195                 1200             


Gly Gln  Phe Lys Lys Ala Glu  Asp Glu Lys Leu Asp  Lys Val Lys 
    1205                 1210                 1215             


Ile Ala  Ile Ser Asn Lys Glu  Trp Leu Glu Tyr Ala  Gln Thr Ser 
    1220                 1225                 1230             


Val Lys  His Lys Arg Pro Ala  Ala Thr Lys Lys Ala  Gly Gln Ala 
    1235                 1240                 1245             


Lys Lys  Lys Lys 
    1250         


<210>  153
<211>  1252
<212>  PRT
<213>  Lachnospiraceae bacterium

<400>  153

Met Ala Pro Lys Lys Lys Arg Lys Val Ser Lys Leu Glu Lys Phe Thr 
1               5                   10                  15      


Asn Cys Tyr Ser Leu Ser Lys Thr Leu Arg Phe Lys Ala Ile Pro Val 
            20                  25                  30          


Gly Lys Thr Gln Glu Asn Ile Asp Asn Lys Arg Leu Leu Val Glu Asp 
        35                  40                  45              


Glu Lys Arg Ala Glu Asp Tyr Lys Gly Val Lys Lys Leu Leu Asp Arg 
    50                  55                  60                  


Tyr Tyr Leu Ser Phe Ile Asn Asp Val Leu His Ser Ile Lys Leu Lys 
65                  70                  75                  80  


Asn Leu Asn Asn Tyr Ile Ser Leu Phe Arg Lys Lys Thr Arg Thr Glu 
                85                  90                  95      


Lys Glu Asn Lys Glu Leu Glu Asn Leu Glu Ile Asn Leu Arg Lys Glu 
            100                 105                 110         


Ile Ala Lys Ala Phe Lys Gly Asn Glu Gly Tyr Lys Ser Leu Phe Lys 
        115                 120                 125             


Lys Asp Ile Ile Glu Thr Ile Leu Pro Glu Phe Leu Asp Asp Lys Asp 
    130                 135                 140                 


Glu Ile Ala Leu Val Asn Ser Phe Asn Gly Phe Thr Thr Ala Phe Thr 
145                 150                 155                 160 


Gly Phe Phe Asp Asn Arg Glu Asn Met Phe Ser Glu Glu Ala Lys Ser 
                165                 170                 175     


Thr Ser Ile Ala Phe Arg Cys Ile Asn Glu Asn Leu Thr Arg Tyr Ile 
            180                 185                 190         


Ser Asn Met Asp Ile Phe Glu Lys Val Asp Ala Ile Phe Asp Lys His 
        195                 200                 205             


Glu Val Gln Glu Ile Lys Glu Lys Ile Leu Asn Ser Asp Tyr Asp Val 
    210                 215                 220                 


Glu Asp Phe Phe Glu Gly Glu Phe Phe Asn Phe Val Leu Thr Gln Glu 
225                 230                 235                 240 


Gly Ile Asp Val Tyr Asn Ala Ile Ile Gly Gly Phe Val Thr Glu Ser 
                245                 250                 255     


Gly Glu Lys Ile Lys Gly Leu Asn Glu Tyr Ile Asn Leu Tyr Asn Gln 
            260                 265                 270         


Lys Thr Lys Gln Lys Leu Pro Lys Phe Lys Pro Leu Tyr Lys Gln Val 
        275                 280                 285             


Leu Ser Asp Arg Glu Ser Leu Ser Phe Tyr Gly Glu Gly Tyr Thr Ser 
    290                 295                 300                 


Asp Glu Glu Val Leu Glu Val Phe Arg Asn Thr Leu Asn Lys Asn Ser 
305                 310                 315                 320 


Glu Ile Phe Ser Ser Ile Lys Lys Leu Glu Lys Leu Phe Lys Asn Phe 
                325                 330                 335     


Asp Glu Tyr Ser Ser Ala Gly Ile Phe Val Lys Asn Gly Pro Ala Ile 
            340                 345                 350         


Ser Thr Ile Ser Lys Asp Ile Phe Gly Glu Trp Asn Val Ile Arg Asp 
        355                 360                 365             


Lys Trp Asn Ala Glu Tyr Asp Asp Ile His Leu Lys Lys Lys Ala Val 
    370                 375                 380                 


Val Thr Glu Lys Tyr Glu Asp Asp Arg Arg Lys Ser Phe Lys Lys Ile 
385                 390                 395                 400 


Gly Ser Phe Ser Leu Glu Gln Leu Gln Glu Tyr Ala Asp Ala Asp Leu 
                405                 410                 415     


Ser Val Val Glu Lys Leu Lys Glu Ile Ile Ile Gln Lys Val Asp Glu 
            420                 425                 430         


Ile Tyr Lys Val Tyr Gly Ser Ser Glu Lys Leu Phe Asp Ala Asp Phe 
        435                 440                 445             


Val Leu Glu Lys Ser Leu Lys Lys Asn Asp Ala Val Val Ala Ile Met 
    450                 455                 460                 


Lys Asp Leu Leu Asp Ser Val Lys Ser Phe Glu Asn Tyr Ile Lys Ala 
465                 470                 475                 480 


Phe Phe Gly Glu Gly Lys Glu Thr Asn Arg Asp Glu Ser Phe Tyr Gly 
                485                 490                 495     


Asp Phe Val Leu Ala Tyr Asp Ile Leu Leu Lys Val Asp His Ile Tyr 
            500                 505                 510         


Asp Ala Ile Arg Asn Tyr Val Thr Gln Lys Pro Tyr Ser Lys Asp Lys 
        515                 520                 525             


Phe Lys Leu Tyr Phe Gln Asn Pro Gln Phe Met Gly Gly Trp Asp Lys 
    530                 535                 540                 


Asp Lys Glu Thr Asp Tyr Arg Ala Thr Ile Leu Arg Tyr Gly Ser Lys 
545                 550                 555                 560 


Tyr Tyr Leu Ala Ile Met Asp Lys Lys Tyr Ala Lys Cys Leu Gln Lys 
                565                 570                 575     


Ile Asp Lys Asp Asp Val Asn Gly Asn Tyr Glu Lys Ile Asn Tyr Lys 
            580                 585                 590         


Leu Leu Pro Gly Pro Asn Lys Met Leu Pro Lys Val Phe Phe Ser Lys 
        595                 600                 605             


Lys Trp Met Ala Tyr Tyr Asn Pro Ser Glu Asp Ile Gln Lys Ile Tyr 
    610                 615                 620                 


Lys Asn Gly Thr Phe Lys Lys Gly Asp Met Phe Asn Leu Asn Asp Cys 
625                 630                 635                 640 


His Lys Leu Ile Asp Phe Phe Lys Asp Ser Ile Ser Arg Tyr Pro Lys 
                645                 650                 655     


Trp Ser Asn Ala Tyr Asp Phe Asn Phe Ser Glu Thr Glu Lys Tyr Lys 
            660                 665                 670         


Asp Ile Ala Gly Phe Tyr Arg Glu Val Glu Glu Gln Gly Tyr Lys Val 
        675                 680                 685             


Ser Phe Glu Ser Ala Ser Lys Lys Glu Val Asp Lys Leu Val Glu Glu 
    690                 695                 700                 


Gly Lys Leu Tyr Met Phe Gln Ile Tyr Asn Lys Asp Phe Ser Asp Lys 
705                 710                 715                 720 


Ser His Gly Thr Pro Asn Leu His Thr Met Tyr Phe Lys Leu Leu Phe 
                725                 730                 735     


Asp Glu Asn Asn His Gly Gln Ile Arg Leu Ser Gly Gly Ala Glu Leu 
            740                 745                 750         


Phe Met Arg Arg Ala Ser Leu Lys Lys Glu Glu Leu Val Val His Pro 
        755                 760                 765             


Ala Asn Ser Pro Ile Ala Asn Lys Asn Pro Asp Asn Pro Lys Lys Thr 
    770                 775                 780                 


Thr Thr Leu Ser Tyr Asp Val Tyr Lys Asp Lys Arg Phe Ser Glu Asp 
785                 790                 795                 800 


Gln Tyr Glu Leu His Ile Pro Ile Ala Ile Asn Lys Cys Pro Lys Asn 
                805                 810                 815     


Ile Phe Lys Ile Asn Thr Glu Val Arg Val Leu Leu Lys His Asp Asp 
            820                 825                 830         


Asn Pro Tyr Val Ile Gly Ile Asp Arg Gly Glu Arg Asn Leu Leu Tyr 
        835                 840                 845             


Ile Val Val Val Asp Gly Lys Gly Asn Ile Val Glu Gln Tyr Ser Leu 
    850                 855                 860                 


Asn Glu Ile Ile Asn Asn Phe Asn Gly Ile Arg Ile Lys Thr Asp Tyr 
865                 870                 875                 880 


His Ser Leu Leu Asp Lys Lys Glu Lys Glu Arg Phe Glu Ala Arg Gln 
                885                 890                 895     


Asn Trp Thr Ser Ile Glu Asn Ile Lys Glu Leu Lys Ala Gly Tyr Ile 
            900                 905                 910         


Ser Gln Val Val His Lys Ile Cys Glu Leu Val Glu Lys Tyr Asp Ala 
        915                 920                 925             


Val Ile Ala Leu Glu Asp Leu Asn Ser Gly Phe Lys Asn Ser Arg Val 
    930                 935                 940                 


Lys Val Glu Lys Gln Val Tyr Gln Lys Phe Glu Lys Met Leu Ile Asp 
945                 950                 955                 960 


Lys Leu Asn Tyr Met Val Asp Lys Lys Ser Asn Pro Cys Ala Thr Gly 
                965                 970                 975     


Gly Ala Leu Lys Gly Tyr Gln Ile Thr Asn Lys Phe Glu Ser Phe Lys 
            980                 985                 990         


Ser Met Ser Thr Gln Asn Gly Phe  Ile Phe Tyr Ile Pro  Ala Trp Leu 
        995                 1000                 1005             


Thr Ser  Lys Ile Asp Pro Ser  Thr Gly Phe Val Asn  Leu Leu Lys 
    1010                 1015                 1020             


Thr Lys  Tyr Thr Ser Ile Ala  Asp Ser Lys Lys Phe  Ile Ser Ser 
    1025                 1030                 1035             


Phe Asp  Arg Ile Met Tyr Val  Pro Glu Glu Asp Leu  Phe Glu Phe 
    1040                 1045                 1050             


Ala Leu  Asp Tyr Lys Asn Phe  Ser Arg Thr Asp Ala  Asp Tyr Ile 
    1055                 1060                 1065             


Lys Lys  Trp Lys Leu Tyr Ser  Tyr Gly Asn Arg Ile  Arg Ile Phe 
    1070                 1075                 1080             


Arg Asn  Pro Lys Lys Asn Asn  Val Phe Asp Trp Glu  Glu Val Cys 
    1085                 1090                 1095             


Leu Thr  Ser Ala Tyr Lys Glu  Leu Phe Asn Lys Tyr  Gly Ile Asn 
    1100                 1105                 1110             


Tyr Gln  Gln Gly Asp Ile Arg  Ala Leu Leu Cys Glu  Gln Ser Asp 
    1115                 1120                 1125             


Lys Ala  Phe Tyr Ser Ser Phe  Met Ala Leu Met Ser  Leu Met Leu 
    1130                 1135                 1140             


Gln Met  Arg Asn Ser Ile Thr  Gly Arg Thr Asp Val  Asp Phe Leu 
    1145                 1150                 1155             


Ile Ser  Pro Val Lys Asn Ser  Asp Gly Ile Phe Tyr  Asp Ser Arg 
    1160                 1165                 1170             


Asn Tyr  Glu Ala Gln Glu Asn  Ala Ile Leu Pro Lys  Asn Ala Asp 
    1175                 1180                 1185             


Ala Asn  Gly Ala Tyr Asn Ile  Ala Arg Lys Val Leu  Trp Ala Ile 
    1190                 1195                 1200             


Gly Gln  Phe Lys Lys Ala Glu  Asp Glu Lys Leu Asp  Lys Val Lys 
    1205                 1210                 1215             


Ile Ala  Ile Ser Asn Lys Glu  Trp Leu Glu Tyr Ala  Gln Thr Ser 
    1220                 1225                 1230             


Val Lys  His Lys Arg Pro Ala  Ala Thr Lys Lys Ala  Gly Gln Ala 
    1235                 1240                 1245             


Lys Lys  Lys Lys 
    1250         


<210>  154
<211>  1260
<212>  PRT
<213>  Lachnospiraceae bacterium

<400>  154

Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala 
1               5                   10                  15      


Ala Ser Lys Leu Glu Lys Phe Thr Asn Cys Tyr Ser Leu Ser Lys Thr 
            20                  25                  30          


Leu Arg Phe Lys Ala Ile Pro Val Gly Lys Thr Gln Glu Asn Ile Asp 
        35                  40                  45              


Asn Lys Arg Leu Leu Val Glu Asp Glu Lys Arg Ala Glu Asp Tyr Lys 
    50                  55                  60                  


Gly Val Lys Lys Leu Leu Asp Arg Tyr Tyr Leu Ser Phe Ile Asn Asp 
65                  70                  75                  80  


Val Leu His Ser Ile Lys Leu Lys Asn Leu Asn Asn Tyr Ile Ser Leu 
                85                  90                  95      


Phe Arg Lys Lys Thr Arg Thr Glu Lys Glu Asn Lys Glu Leu Glu Asn 
            100                 105                 110         


Leu Glu Ile Asn Leu Arg Lys Glu Ile Ala Lys Ala Phe Lys Gly Asn 
        115                 120                 125             


Glu Gly Tyr Lys Ser Leu Phe Lys Lys Asp Ile Ile Glu Thr Ile Leu 
    130                 135                 140                 


Pro Glu Phe Leu Asp Asp Lys Asp Glu Ile Ala Leu Val Asn Ser Phe 
145                 150                 155                 160 


Asn Gly Phe Thr Thr Ala Phe Thr Gly Phe Phe Asp Asn Arg Glu Asn 
                165                 170                 175     


Met Phe Ser Glu Glu Ala Lys Ser Thr Ser Ile Ala Phe Arg Cys Ile 
            180                 185                 190         


Asn Glu Asn Leu Thr Arg Tyr Ile Ser Asn Met Asp Ile Phe Glu Lys 
        195                 200                 205             


Val Asp Ala Ile Phe Asp Lys His Glu Val Gln Glu Ile Lys Glu Lys 
    210                 215                 220                 


Ile Leu Asn Ser Asp Tyr Asp Val Glu Asp Phe Phe Glu Gly Glu Phe 
225                 230                 235                 240 


Phe Asn Phe Val Leu Thr Gln Glu Gly Ile Asp Val Tyr Asn Ala Ile 
                245                 250                 255     


Ile Gly Gly Phe Val Thr Glu Ser Gly Glu Lys Ile Lys Gly Leu Asn 
            260                 265                 270         


Glu Tyr Ile Asn Leu Tyr Asn Gln Lys Thr Lys Gln Lys Leu Pro Lys 
        275                 280                 285             


Phe Lys Pro Leu Tyr Lys Gln Val Leu Ser Asp Arg Glu Ser Leu Ser 
    290                 295                 300                 


Phe Tyr Gly Glu Gly Tyr Thr Ser Asp Glu Glu Val Leu Glu Val Phe 
305                 310                 315                 320 


Arg Asn Thr Leu Asn Lys Asn Ser Glu Ile Phe Ser Ser Ile Lys Lys 
                325                 330                 335     


Leu Glu Lys Leu Phe Lys Asn Phe Asp Glu Tyr Ser Ser Ala Gly Ile 
            340                 345                 350         


Phe Val Lys Asn Gly Pro Ala Ile Ser Thr Ile Ser Lys Asp Ile Phe 
        355                 360                 365             


Gly Glu Trp Asn Val Ile Arg Asp Lys Trp Asn Ala Glu Tyr Asp Asp 
    370                 375                 380                 


Ile His Leu Lys Lys Lys Ala Val Val Thr Glu Lys Tyr Glu Asp Asp 
385                 390                 395                 400 


Arg Arg Lys Ser Phe Lys Lys Ile Gly Ser Phe Ser Leu Glu Gln Leu 
                405                 410                 415     


Gln Glu Tyr Ala Asp Ala Asp Leu Ser Val Val Glu Lys Leu Lys Glu 
            420                 425                 430         


Ile Ile Ile Gln Lys Val Asp Glu Ile Tyr Lys Val Tyr Gly Ser Ser 
        435                 440                 445             


Glu Lys Leu Phe Asp Ala Asp Phe Val Leu Glu Lys Ser Leu Lys Lys 
    450                 455                 460                 


Asn Asp Ala Val Val Ala Ile Met Lys Asp Leu Leu Asp Ser Val Lys 
465                 470                 475                 480 


Ser Phe Glu Asn Tyr Ile Lys Ala Phe Phe Gly Glu Gly Lys Glu Thr 
                485                 490                 495     


Asn Arg Asp Glu Ser Phe Tyr Gly Asp Phe Val Leu Ala Tyr Asp Ile 
            500                 505                 510         


Leu Leu Lys Val Asp His Ile Tyr Asp Ala Ile Arg Asn Tyr Val Thr 
        515                 520                 525             


Gln Lys Pro Tyr Ser Lys Asp Lys Phe Lys Leu Tyr Phe Gln Asn Pro 
    530                 535                 540                 


Gln Phe Met Gly Gly Trp Asp Lys Asp Lys Glu Thr Asp Tyr Arg Ala 
545                 550                 555                 560 


Thr Ile Leu Arg Tyr Gly Ser Lys Tyr Tyr Leu Ala Ile Met Asp Lys 
                565                 570                 575     


Lys Tyr Ala Lys Cys Leu Gln Lys Ile Asp Lys Asp Asp Val Asn Gly 
            580                 585                 590         


Asn Tyr Glu Lys Ile Asn Tyr Lys Leu Leu Pro Gly Pro Asn Lys Met 
        595                 600                 605             


Leu Pro Lys Val Phe Phe Ser Lys Lys Trp Met Ala Tyr Tyr Asn Pro 
    610                 615                 620                 


Ser Glu Asp Ile Gln Lys Ile Tyr Lys Asn Gly Thr Phe Lys Lys Gly 
625                 630                 635                 640 


Asp Met Phe Asn Leu Asn Asp Cys His Lys Leu Ile Asp Phe Phe Lys 
                645                 650                 655     


Asp Ser Ile Ser Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp Phe Asn 
            660                 665                 670         


Phe Ser Glu Thr Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr Arg Glu 
        675                 680                 685             


Val Glu Glu Gln Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser Lys Lys 
    690                 695                 700                 


Glu Val Asp Lys Leu Val Glu Glu Gly Lys Leu Tyr Met Phe Gln Ile 
705                 710                 715                 720 


Tyr Asn Lys Asp Phe Ser Asp Lys Ser His Gly Thr Pro Asn Leu His 
                725                 730                 735     


Thr Met Tyr Phe Lys Leu Leu Phe Asp Glu Asn Asn His Gly Gln Ile 
            740                 745                 750         


Arg Leu Ser Gly Gly Ala Glu Leu Phe Met Arg Arg Ala Ser Leu Lys 
        755                 760                 765             


Lys Glu Glu Leu Val Val His Pro Ala Asn Ser Pro Ile Ala Asn Lys 
    770                 775                 780                 


Asn Pro Asp Asn Pro Lys Lys Thr Thr Thr Leu Ser Tyr Asp Val Tyr 
785                 790                 795                 800 


Lys Asp Lys Arg Phe Ser Glu Asp Gln Tyr Glu Leu His Ile Pro Ile 
                805                 810                 815     


Ala Ile Asn Lys Cys Pro Lys Asn Ile Phe Lys Ile Asn Thr Glu Val 
            820                 825                 830         


Arg Val Leu Leu Lys His Asp Asp Asn Pro Tyr Val Ile Gly Ile Asp 
        835                 840                 845             


Arg Gly Glu Arg Asn Leu Leu Tyr Ile Val Val Val Asp Gly Lys Gly 
    850                 855                 860                 


Asn Ile Val Glu Gln Tyr Ser Leu Asn Glu Ile Ile Asn Asn Phe Asn 
865                 870                 875                 880 


Gly Ile Arg Ile Lys Thr Asp Tyr His Ser Leu Leu Asp Lys Lys Glu 
                885                 890                 895     


Lys Glu Arg Phe Glu Ala Arg Gln Asn Trp Thr Ser Ile Glu Asn Ile 
            900                 905                 910         


Lys Glu Leu Lys Ala Gly Tyr Ile Ser Gln Val Val His Lys Ile Cys 
        915                 920                 925             


Glu Leu Val Glu Lys Tyr Asp Ala Val Ile Ala Leu Glu Asp Leu Asn 
    930                 935                 940                 


Ser Gly Phe Lys Asn Ser Arg Val Lys Val Glu Lys Gln Val Tyr Gln 
945                 950                 955                 960 


Lys Phe Glu Lys Met Leu Ile Asp Lys Leu Asn Tyr Met Val Asp Lys 
                965                 970                 975     


Lys Ser Asn Pro Cys Ala Thr Gly Gly Ala Leu Lys Gly Tyr Gln Ile 
            980                 985                 990         


Thr Asn Lys Phe Glu Ser Phe Lys  Ser Met Ser Thr Gln  Asn Gly Phe 
        995                 1000                 1005             


Ile Phe  Tyr Ile Pro Ala Trp  Leu Thr Ser Lys Ile  Asp Pro Ser 
    1010                 1015                 1020             


Thr Gly  Phe Val Asn Leu Leu  Lys Thr Lys Tyr Thr  Ser Ile Ala 
    1025                 1030                 1035             


Asp Ser  Lys Lys Phe Ile Ser  Ser Phe Asp Arg Ile  Met Tyr Val 
    1040                 1045                 1050             


Pro Glu  Glu Asp Leu Phe Glu  Phe Ala Leu Asp Tyr  Lys Asn Phe 
    1055                 1060                 1065             


Ser Arg  Thr Asp Ala Asp Tyr  Ile Lys Lys Trp Lys  Leu Tyr Ser 
    1070                 1075                 1080             


Tyr Gly  Asn Arg Ile Arg Ile  Phe Arg Asn Pro Lys  Lys Asn Asn 
    1085                 1090                 1095             


Val Phe  Asp Trp Glu Glu Val  Cys Leu Thr Ser Ala  Tyr Lys Glu 
    1100                 1105                 1110             


Leu Phe  Asn Lys Tyr Gly Ile  Asn Tyr Gln Gln Gly  Asp Ile Arg 
    1115                 1120                 1125             


Ala Leu  Leu Cys Glu Gln Ser  Asp Lys Ala Phe Tyr  Ser Ser Phe 
    1130                 1135                 1140             


Met Ala  Leu Met Ser Leu Met  Leu Gln Met Arg Asn  Ser Ile Thr 
    1145                 1150                 1155             


Gly Arg  Thr Asp Val Asp Phe  Leu Ile Ser Pro Val  Lys Asn Ser 
    1160                 1165                 1170             


Asp Gly  Ile Phe Tyr Asp Ser  Arg Asn Tyr Glu Ala  Gln Glu Asn 
    1175                 1180                 1185             


Ala Ile  Leu Pro Lys Asn Ala  Asp Ala Asn Gly Ala  Tyr Asn Ile 
    1190                 1195                 1200             


Ala Arg  Lys Val Leu Trp Ala  Ile Gly Gln Phe Lys  Lys Ala Glu 
    1205                 1210                 1215             


Asp Glu  Lys Leu Asp Lys Val  Lys Ile Ala Ile Ser  Asn Lys Glu 
    1220                 1225                 1230             


Trp Leu  Glu Tyr Ala Gln Thr  Ser Val Lys His Lys  Arg Pro Ala 
    1235                 1240                 1245             


Ala Thr  Lys Lys Ala Gly Gln  Ala Lys Lys Lys Lys  
    1250                 1255                 1260 


<210>  155
<211>  1252
<212>  PRT
<213>  Lachnospiraceae bacterium

<400>  155

Met Ala Pro Lys Lys Lys Arg Lys Val Ser Lys Leu Glu Lys Phe Thr 
1               5                   10                  15      


Asn Cys Tyr Ser Leu Ser Lys Thr Leu Arg Phe Lys Ala Ile Pro Val 
            20                  25                  30          


Gly Lys Thr Gln Glu Asn Ile Asp Asn Lys Arg Leu Leu Val Glu Asp 
        35                  40                  45              


Glu Lys Arg Ala Glu Asp Tyr Lys Gly Val Lys Lys Leu Leu Asp Arg 
    50                  55                  60                  


Tyr Tyr Leu Ser Phe Ile Asn Asp Val Leu His Ser Ile Lys Leu Lys 
65                  70                  75                  80  


Asn Leu Asn Asn Tyr Ile Ser Leu Phe Arg Lys Lys Thr Arg Thr Glu 
                85                  90                  95      


Lys Glu Asn Lys Glu Leu Glu Asn Leu Glu Ile Asn Leu Arg Lys Glu 
            100                 105                 110         


Ile Ala Lys Ala Phe Lys Gly Asn Glu Gly Tyr Lys Ser Leu Phe Lys 
        115                 120                 125             


Lys Asp Ile Ile Glu Thr Ile Leu Pro Glu Phe Leu Asp Asp Lys Asp 
    130                 135                 140                 


Glu Ile Ala Leu Val Asn Ser Phe Asn Gly Phe Thr Thr Ala Phe Thr 
145                 150                 155                 160 


Gly Phe Phe Asp Asn Arg Glu Asn Met Phe Ser Glu Glu Ala Lys Ser 
                165                 170                 175     


Thr Ser Ile Ala Phe Arg Cys Ile Asn Glu Asn Leu Thr Arg Tyr Ile 
            180                 185                 190         


Ser Asn Met Asp Ile Phe Glu Lys Val Asp Ala Ile Phe Asp Lys His 
        195                 200                 205             


Glu Val Gln Glu Ile Lys Glu Lys Ile Leu Asn Ser Asp Tyr Asp Val 
    210                 215                 220                 


Glu Asp Phe Phe Glu Gly Glu Phe Phe Asn Phe Val Leu Thr Gln Glu 
225                 230                 235                 240 


Gly Ile Asp Val Tyr Asn Ala Ile Ile Gly Gly Phe Val Thr Glu Ser 
                245                 250                 255     


Gly Glu Lys Ile Lys Gly Leu Asn Glu Tyr Ile Asn Leu Tyr Asn Gln 
            260                 265                 270         


Lys Thr Lys Gln Lys Leu Pro Lys Phe Lys Pro Leu Tyr Lys Gln Val 
        275                 280                 285             


Leu Ser Asp Arg Glu Ser Leu Ser Phe Tyr Gly Glu Gly Tyr Thr Ser 
    290                 295                 300                 


Asp Glu Glu Val Leu Glu Val Phe Arg Asn Thr Leu Asn Lys Asn Ser 
305                 310                 315                 320 


Glu Ile Phe Ser Ser Ile Lys Lys Leu Glu Lys Leu Phe Lys Asn Phe 
                325                 330                 335     


Asp Glu Tyr Ser Ser Ala Gly Ile Phe Val Lys Asn Gly Pro Ala Ile 
            340                 345                 350         


Ser Thr Ile Ser Lys Asp Ile Phe Gly Glu Trp Asn Val Ile Arg Asp 
        355                 360                 365             


Lys Trp Asn Ala Glu Tyr Asp Asp Ile His Leu Lys Lys Lys Ala Val 
    370                 375                 380                 


Val Thr Glu Lys Tyr Glu Asp Asp Arg Arg Lys Ser Phe Lys Lys Ile 
385                 390                 395                 400 


Gly Ser Phe Ser Leu Glu Gln Leu Gln Glu Tyr Ala Asp Ala Asp Leu 
                405                 410                 415     


Ser Val Val Glu Lys Leu Lys Glu Ile Ile Ile Gln Lys Val Asp Glu 
            420                 425                 430         


Ile Tyr Lys Val Tyr Gly Ser Ser Glu Lys Leu Phe Asp Ala Asp Phe 
        435                 440                 445             


Val Leu Glu Lys Ser Leu Lys Lys Asn Asp Ala Val Val Ala Ile Met 
    450                 455                 460                 


Lys Asp Leu Leu Asp Ser Val Lys Ser Phe Glu Asn Tyr Ile Lys Ala 
465                 470                 475                 480 


Phe Phe Gly Glu Gly Lys Glu Thr Asn Arg Asp Glu Ser Phe Tyr Gly 
                485                 490                 495     


Asp Phe Val Leu Ala Tyr Asp Ile Leu Leu Lys Val Asp His Ile Tyr 
            500                 505                 510         


Asp Ala Ile Arg Asn Tyr Val Thr Gln Lys Pro Tyr Ser Lys Asp Lys 
        515                 520                 525             


Phe Lys Leu Tyr Phe Gln Asn Pro Gln Phe Met Gly Gly Trp Asp Lys 
    530                 535                 540                 


Asp Lys Glu Thr Asp Tyr Arg Ala Thr Ile Leu Arg Tyr Gly Ser Lys 
545                 550                 555                 560 


Tyr Tyr Leu Ala Ile Met Asp Lys Lys Tyr Ala Lys Cys Leu Gln Lys 
                565                 570                 575     


Ile Asp Lys Asp Asp Val Asn Gly Asn Tyr Glu Lys Ile Asn Tyr Lys 
            580                 585                 590         


Leu Leu Pro Gly Pro Asn Lys Met Leu Pro Lys Val Phe Phe Ser Lys 
        595                 600                 605             


Lys Trp Met Ala Tyr Tyr Asn Pro Ser Glu Asp Ile Gln Lys Ile Tyr 
    610                 615                 620                 


Lys Asn Gly Thr Phe Lys Lys Gly Asp Met Phe Asn Leu Asn Asp Cys 
625                 630                 635                 640 


His Lys Leu Ile Asp Phe Phe Lys Asp Ser Ile Ser Arg Tyr Pro Lys 
                645                 650                 655     


Trp Ser Asn Ala Tyr Asp Phe Asn Phe Ser Glu Thr Glu Lys Tyr Lys 
            660                 665                 670         


Asp Ile Ala Gly Phe Tyr Arg Glu Val Glu Glu Gln Gly Tyr Lys Val 
        675                 680                 685             


Ser Phe Glu Ser Ala Ser Lys Lys Glu Val Asp Lys Leu Val Glu Glu 
    690                 695                 700                 


Gly Lys Leu Tyr Met Phe Gln Ile Tyr Asn Lys Asp Phe Ser Asp Lys 
705                 710                 715                 720 


Ser His Gly Thr Pro Asn Leu His Thr Met Tyr Phe Lys Leu Leu Phe 
                725                 730                 735     


Asp Glu Asn Asn His Gly Gln Ile Arg Leu Ser Gly Gly Ala Glu Leu 
            740                 745                 750         


Phe Met Arg Arg Ala Ser Leu Lys Lys Glu Glu Leu Val Val His Pro 
        755                 760                 765             


Ala Asn Ser Pro Ile Ala Asn Lys Asn Pro Asp Asn Pro Lys Lys Thr 
    770                 775                 780                 


Thr Thr Leu Ser Tyr Asp Val Tyr Lys Asp Lys Arg Phe Ser Glu Asp 
785                 790                 795                 800 


Gln Tyr Glu Leu His Ile Pro Ile Ala Ile Asn Lys Cys Pro Lys Asn 
                805                 810                 815     


Ile Phe Lys Ile Asn Thr Glu Val Arg Val Leu Leu Lys His Asp Asp 
            820                 825                 830         


Asn Pro Tyr Val Ile Gly Ile Asp Arg Gly Glu Arg Asn Leu Leu Tyr 
        835                 840                 845             


Ile Val Val Val Asp Gly Lys Gly Asn Ile Val Glu Gln Tyr Ser Leu 
    850                 855                 860                 


Asn Glu Ile Ile Asn Asn Phe Asn Gly Ile Arg Ile Lys Thr Asp Tyr 
865                 870                 875                 880 


His Ser Leu Leu Asp Lys Lys Glu Lys Glu Arg Phe Glu Ala Arg Gln 
                885                 890                 895     


Asn Trp Thr Ser Ile Glu Asn Ile Lys Glu Leu Lys Ala Gly Tyr Ile 
            900                 905                 910         


Ser Gln Val Val His Lys Ile Cys Glu Leu Val Glu Lys Tyr Asp Ala 
        915                 920                 925             


Val Ile Ala Leu Glu Asp Leu Asn Ser Gly Phe Lys Asn Ser Arg Val 
    930                 935                 940                 


Lys Val Glu Lys Gln Val Tyr Gln Lys Phe Glu Lys Met Leu Ile Asp 
945                 950                 955                 960 


Lys Leu Asn Tyr Met Val Asp Lys Lys Ser Asn Pro Cys Ala Thr Gly 
                965                 970                 975     


Gly Ala Leu Lys Gly Tyr Gln Ile Thr Asn Lys Phe Glu Ser Phe Lys 
            980                 985                 990         


Ser Met Ser Thr Gln Asn Gly Phe  Ile Phe Tyr Ile Pro  Ala Trp Leu 
        995                 1000                 1005             


Thr Ser  Lys Ile Asp Pro Ser  Thr Gly Phe Val Asn  Leu Leu Lys 
    1010                 1015                 1020             


Thr Lys  Tyr Thr Ser Ile Ala  Asp Ser Lys Lys Phe  Ile Ser Ser 
    1025                 1030                 1035             


Phe Asp  Arg Ile Met Tyr Val  Pro Glu Glu Asp Leu  Phe Glu Phe 
    1040                 1045                 1050             


Ala Leu  Asp Tyr Lys Asn Phe  Ser Arg Thr Asp Ala  Asp Tyr Ile 
    1055                 1060                 1065             


Lys Lys  Trp Lys Leu Tyr Ser  Tyr Gly Asn Arg Ile  Arg Ile Phe 
    1070                 1075                 1080             


Arg Asn  Pro Lys Lys Asn Asn  Val Phe Asp Trp Glu  Glu Val Cys 
    1085                 1090                 1095             


Leu Thr  Ser Ala Tyr Lys Glu  Leu Phe Asn Lys Tyr  Gly Ile Asn 
    1100                 1105                 1110             


Tyr Gln  Gln Gly Asp Ile Arg  Ala Leu Leu Cys Glu  Gln Ser Asp 
    1115                 1120                 1125             


Lys Ala  Phe Tyr Ser Ser Phe  Met Ala Leu Met Ser  Leu Met Leu 
    1130                 1135                 1140             


Gln Met  Arg Asn Ser Ile Thr  Gly Arg Thr Asp Val  Asp Phe Leu 
    1145                 1150                 1155             


Ile Ser  Pro Val Lys Asn Ser  Asp Gly Ile Phe Tyr  Asp Ser Arg 
    1160                 1165                 1170             


Asn Tyr  Glu Ala Gln Glu Asn  Ala Ile Leu Pro Lys  Asn Ala Asp 
    1175                 1180                 1185             


Ala Asn  Gly Ala Tyr Asn Ile  Ala Arg Lys Val Leu  Trp Ala Ile 
    1190                 1195                 1200             


Gly Gln  Phe Lys Lys Ala Glu  Asp Glu Lys Leu Asp  Lys Val Lys 
    1205                 1210                 1215             


Ile Ala  Ile Ser Asn Lys Glu  Trp Leu Glu Tyr Ala  Gln Thr Ser 
    1220                 1225                 1230             


Val Lys  His Lys Arg Pro Ala  Ala Thr Lys Lys Ala  Gly Gln Ala 
    1235                 1240                 1245             


Lys Lys  Lys Lys 
    1250         


<210>  156
<211>  1252
<212>  PRT
<213>  Lachnospiraceae bacterium

<400>  156

Met Ala Pro Lys Lys Lys Arg Lys Val Ser Lys Leu Glu Lys Phe Thr 
1               5                   10                  15      


Asn Cys Tyr Ser Leu Ser Lys Thr Leu Arg Phe Lys Ala Ile Pro Val 
            20                  25                  30          


Gly Lys Thr Gln Glu Asn Ile Asp Asn Lys Arg Leu Leu Val Glu Asp 
        35                  40                  45              


Glu Lys Arg Ala Glu Asp Tyr Lys Gly Val Lys Lys Leu Leu Asp Arg 
    50                  55                  60                  


Tyr Tyr Leu Ser Phe Ile Asn Asp Val Leu His Ser Ile Lys Leu Lys 
65                  70                  75                  80  


Asn Leu Asn Asn Tyr Ile Ser Leu Phe Arg Lys Lys Thr Arg Thr Glu 
                85                  90                  95      


Lys Glu Asn Lys Glu Leu Glu Asn Leu Glu Ile Asn Leu Arg Lys Glu 
            100                 105                 110         


Ile Ala Lys Ala Phe Lys Gly Asn Glu Gly Tyr Lys Ser Leu Phe Lys 
        115                 120                 125             


Lys Asp Ile Ile Glu Thr Ile Leu Pro Glu Phe Leu Asp Asp Lys Asp 
    130                 135                 140                 


Glu Ile Ala Leu Val Asn Ser Phe Asn Gly Phe Thr Thr Ala Phe Thr 
145                 150                 155                 160 


Gly Phe Phe Asp Asn Arg Glu Asn Met Phe Ser Glu Glu Ala Lys Ser 
                165                 170                 175     


Thr Ser Ile Ala Phe Arg Cys Ile Asn Glu Asn Leu Thr Arg Tyr Ile 
            180                 185                 190         


Ser Asn Met Asp Ile Phe Glu Lys Val Asp Ala Ile Phe Asp Lys His 
        195                 200                 205             


Glu Val Gln Glu Ile Lys Glu Lys Ile Leu Asn Ser Asp Tyr Asp Val 
    210                 215                 220                 


Glu Asp Phe Phe Glu Gly Glu Phe Phe Asn Phe Val Leu Thr Gln Glu 
225                 230                 235                 240 


Gly Ile Asp Val Tyr Asn Ala Ile Ile Gly Gly Phe Val Thr Glu Ser 
                245                 250                 255     


Gly Glu Lys Ile Lys Gly Leu Asn Glu Tyr Ile Asn Leu Tyr Asn Gln 
            260                 265                 270         


Lys Thr Lys Gln Lys Leu Pro Lys Phe Lys Pro Leu Tyr Lys Gln Val 
        275                 280                 285             


Leu Ser Asp Arg Glu Ser Leu Ser Phe Tyr Gly Glu Gly Tyr Thr Ser 
    290                 295                 300                 


Asp Glu Glu Val Leu Glu Val Phe Arg Asn Thr Leu Asn Lys Asn Ser 
305                 310                 315                 320 


Glu Ile Phe Ser Ser Ile Lys Lys Leu Glu Lys Leu Phe Lys Asn Phe 
                325                 330                 335     


Asp Glu Tyr Ser Ser Ala Gly Ile Phe Val Lys Asn Gly Pro Ala Ile 
            340                 345                 350         


Ser Thr Ile Ser Lys Asp Ile Phe Gly Glu Trp Asn Val Ile Arg Asp 
        355                 360                 365             


Lys Trp Asn Ala Glu Tyr Asp Asp Ile His Leu Lys Lys Lys Ala Val 
    370                 375                 380                 


Val Thr Glu Lys Tyr Glu Asp Asp Arg Arg Lys Ser Phe Lys Lys Ile 
385                 390                 395                 400 


Gly Ser Phe Ser Leu Glu Gln Leu Gln Glu Tyr Ala Asp Ala Asp Leu 
                405                 410                 415     


Ser Val Val Glu Lys Leu Lys Glu Ile Ile Ile Gln Lys Val Asp Glu 
            420                 425                 430         


Ile Tyr Lys Val Tyr Gly Ser Ser Glu Lys Leu Phe Asp Ala Asp Phe 
        435                 440                 445             


Val Leu Glu Lys Ser Leu Lys Lys Asn Asp Ala Val Val Ala Ile Met 
    450                 455                 460                 


Lys Asp Leu Leu Asp Ser Val Lys Ser Phe Glu Asn Tyr Ile Lys Ala 
465                 470                 475                 480 


Phe Phe Gly Glu Gly Lys Glu Thr Asn Arg Asp Glu Ser Phe Tyr Gly 
                485                 490                 495     


Asp Phe Val Leu Ala Tyr Asp Ile Leu Leu Lys Val Asp His Ile Tyr 
            500                 505                 510         


Asp Ala Ile Arg Asn Tyr Val Thr Gln Lys Pro Tyr Ser Lys Asp Lys 
        515                 520                 525             


Phe Lys Leu Tyr Phe Gln Asn Pro Gln Phe Met Gly Gly Trp Asp Lys 
    530                 535                 540                 


Asp Lys Glu Thr Asp Tyr Arg Ala Thr Ile Leu Arg Tyr Gly Ser Lys 
545                 550                 555                 560 


Tyr Tyr Leu Ala Ile Met Asp Lys Lys Tyr Ala Lys Cys Leu Gln Lys 
                565                 570                 575     


Ile Asp Lys Asp Asp Val Asn Gly Asn Tyr Glu Lys Ile Asn Tyr Lys 
            580                 585                 590         


Leu Leu Pro Gly Pro Asn Lys Met Leu Pro Lys Val Phe Phe Ser Lys 
        595                 600                 605             


Lys Trp Met Ala Tyr Tyr Asn Pro Ser Glu Asp Ile Gln Lys Ile Tyr 
    610                 615                 620                 


Lys Asn Gly Thr Phe Lys Lys Gly Asp Met Phe Asn Leu Asn Asp Cys 
625                 630                 635                 640 


His Lys Leu Ile Asp Phe Phe Lys Asp Ser Ile Ser Arg Tyr Pro Lys 
                645                 650                 655     


Trp Ser Asn Ala Tyr Asp Phe Asn Phe Ser Glu Thr Glu Lys Tyr Lys 
            660                 665                 670         


Asp Ile Ala Gly Phe Tyr Arg Glu Val Glu Glu Gln Gly Tyr Lys Val 
        675                 680                 685             


Ser Phe Glu Ser Ala Ser Lys Lys Glu Val Asp Lys Leu Val Glu Glu 
    690                 695                 700                 


Gly Lys Leu Tyr Met Phe Gln Ile Tyr Asn Lys Asp Phe Ser Asp Lys 
705                 710                 715                 720 


Ser His Gly Thr Pro Asn Leu His Thr Met Tyr Phe Lys Leu Leu Phe 
                725                 730                 735     


Asp Glu Asn Asn His Gly Gln Ile Arg Leu Ser Gly Gly Ala Glu Leu 
            740                 745                 750         


Phe Met Arg Arg Ala Ser Leu Lys Lys Glu Glu Leu Val Val His Pro 
        755                 760                 765             


Ala Asn Ser Pro Ile Ala Asn Lys Asn Pro Asp Asn Pro Lys Lys Thr 
    770                 775                 780                 


Thr Thr Leu Ser Tyr Asp Val Tyr Lys Asp Lys Arg Phe Ser Glu Asp 
785                 790                 795                 800 


Gln Tyr Glu Leu His Ile Pro Ile Ala Ile Asn Lys Cys Pro Lys Asn 
                805                 810                 815     


Ile Phe Lys Ile Asn Thr Glu Val Arg Val Leu Leu Lys His Asp Asp 
            820                 825                 830         


Asn Pro Tyr Val Ile Gly Ile Asp Arg Gly Glu Arg Asn Leu Leu Tyr 
        835                 840                 845             


Ile Val Val Val Asp Gly Lys Gly Asn Ile Val Glu Gln Tyr Ser Leu 
    850                 855                 860                 


Asn Glu Ile Ile Asn Asn Phe Asn Gly Ile Arg Ile Lys Thr Asp Tyr 
865                 870                 875                 880 


His Ser Leu Leu Asp Lys Lys Glu Lys Glu Arg Phe Glu Ala Arg Gln 
                885                 890                 895     


Asn Trp Thr Ser Ile Glu Asn Ile Lys Glu Leu Lys Ala Gly Tyr Ile 
            900                 905                 910         


Ser Gln Val Val His Lys Ile Cys Glu Leu Val Glu Lys Tyr Asp Ala 
        915                 920                 925             


Val Ile Ala Leu Glu Asp Leu Asn Ser Gly Phe Lys Asn Ser Arg Val 
    930                 935                 940                 


Lys Val Glu Lys Gln Val Tyr Gln Lys Phe Glu Lys Met Leu Ile Asp 
945                 950                 955                 960 


Lys Leu Asn Tyr Met Val Asp Lys Lys Ser Asn Pro Cys Ala Thr Gly 
                965                 970                 975     


Gly Ala Leu Lys Gly Tyr Gln Ile Thr Asn Lys Phe Glu Ser Phe Lys 
            980                 985                 990         


Ser Met Ser Thr Gln Asn Gly Phe  Ile Phe Tyr Ile Pro  Ala Trp Leu 
        995                 1000                 1005             


Thr Ser  Lys Ile Asp Pro Ser  Thr Gly Phe Val Asn  Leu Leu Lys 
    1010                 1015                 1020             


Thr Lys  Tyr Thr Ser Ile Ala  Asp Ser Lys Lys Phe  Ile Ser Ser 
    1025                 1030                 1035             


Phe Asp  Arg Ile Met Tyr Val  Pro Glu Glu Asp Leu  Phe Glu Phe 
    1040                 1045                 1050             


Ala Leu  Asp Tyr Lys Asn Phe  Ser Arg Thr Asp Ala  Asp Tyr Ile 
    1055                 1060                 1065             


Lys Lys  Trp Lys Leu Tyr Ser  Tyr Gly Asn Arg Ile  Arg Ile Phe 
    1070                 1075                 1080             


Arg Asn  Pro Lys Lys Asn Asn  Val Phe Asp Trp Glu  Glu Val Cys 
    1085                 1090                 1095             


Leu Thr  Ser Ala Tyr Lys Glu  Leu Phe Asn Lys Tyr  Gly Ile Asn 
    1100                 1105                 1110             


Tyr Gln  Gln Gly Asp Ile Arg  Ala Leu Leu Cys Glu  Gln Ser Asp 
    1115                 1120                 1125             


Lys Ala  Phe Tyr Ser Ser Phe  Met Ala Leu Met Ser  Leu Met Leu 
    1130                 1135                 1140             


Gln Met  Arg Asn Ser Ile Thr  Gly Arg Thr Asp Val  Asp Phe Leu 
    1145                 1150                 1155             


Ile Ser  Pro Val Lys Asn Ser  Asp Gly Ile Phe Tyr  Asp Ser Arg 
    1160                 1165                 1170             


Asn Tyr  Glu Ala Gln Glu Asn  Ala Ile Leu Pro Lys  Asn Ala Asp 
    1175                 1180                 1185             


Ala Asn  Gly Ala Tyr Asn Ile  Ala Arg Lys Val Leu  Trp Ala Ile 
    1190                 1195                 1200             


Gly Gln  Phe Lys Lys Ala Glu  Asp Glu Lys Leu Asp  Lys Val Lys 
    1205                 1210                 1215             


Ile Ala  Ile Ser Asn Lys Glu  Trp Leu Glu Tyr Ala  Gln Thr Ser 
    1220                 1225                 1230             


Val Lys  His Lys Arg Pro Ala  Ala Thr Lys Lys Ala  Gly Gln Ala 
    1235                 1240                 1245             


Lys Lys  Lys Lys 
    1250         


<210>  157
<211>  3948
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  variant I of Cpf1 (SEQ ID NO: 72) including an intronic sequence


<220>
<221>  Intron
<222>  (862)..(1050)

<400>  157
atggcgccga agaagaagcg caaggtgtcc aagctcgaga agttcacgaa ctgctactcc       60

ctctccaaga ccctccgctt caaggccatc cccgtgggca agacccagga gaacatcgac      120

aacaagcgcc tcctggtcga ggacgagaag agggcggagg actacaaggg cgtgaagaag      180

ctcctggacc gctactacct ctccttcatc aacgacgtcc tgcacagcat caagctcaag      240

aacctgaaca actacatctc cctgttccgc aagaagacga ggaccgagaa ggagaacaag      300

gagctcgaga acctggagat caacctccgc aaggagatcg ccaaggcgtt caagggcaac      360

gagggctaca agagcctgtt caagaaggac atcatcgaga cgatcctccc ggagttcctg      420

gacgacaagg acgagatcgc cctcgtgaac tccttcaacg gcttcaccac ggcgttcacc      480

ggcttcttcg acaaccgcga gaacatgttc agcgaggagg ccaagtccac gagcatcgcg      540

ttccgctgca tcaacgagaa cctgaccagg tacatctcca acatggacat cttcgagaag      600

gtcgacgcca tcttcgacaa gcacgaggtg caggagatca aggagaagat cctcaacagc      660

gactacgacg tcgaggactt cttcgagggc gagttcttca acttcgtcct gacgcaggag      720

ggcatcgacg tgtacaacgc catcatcggt ggcttcgtga ccgagtccgg cgagaagatc      780

aagggcctca acgagtacat caacctgtac aaccagaaga ccaagcagaa gctcccgaag      840

ttcaagcccc tctacaagca ggtaagtttc tgcttctacc tttgatatat atataataat      900

tatcattaat tagtagtaat ataatatttc aaatattttt ttcaaaataa aagaatgtag      960

tatatagcaa ttgcttttct gtagtttata agtgtgtata ttttaattta taacttttct     1020

aatatatgac caaaacatgg tgatgtgcag gtcctgtccg accgcgagtc cctgagcttc     1080

tacggcgagg gctacacgag cgacgaggag gtcctcgagg tgttcaggaa caccctgaac     1140

aagaacagcg agatcttctc cagcatcaag aagctcgaga agctgttcaa gaacttcgac     1200

gagtactcca gcgccggcat cttcgtcaag aacggcccgg cgatctccac gatcagcaag     1260

gatatcttcg gcgagtggaa cgtgatcagg gacaagtgga acgccgagta cgacgacatc     1320

cacctcaaga agaaggcggt ggtcaccgag aagtacgagg acgaccgcag gaagtccttc     1380

aagaagatcg gctccttcag cctcgagcag ctgcaggagt acgccgacgc ggacctctcc     1440

gtggtcgaga agctgaagga gatcatcatc cagaaggtcg acgagatcta caaggtgtac     1500

ggctccagcg agaagctgtt cgacgccgac ttcgtcctcg agaagtccct gaagaagaac     1560

gacgccgtgg tcgcgatcat gaaggacctc ctggactccg tgaagagctt cgagaactac     1620

atcaaggcgt tcttcggcga gggcaaggag acgaaccgcg acgagtcctt ctacggcgac     1680

ttcgtcctcg cctacgacat cctcctgaag gtggaccaca tctacgacgc gatcaggaac     1740

tacgtgaccc agaagccgta cagcaaggac aagttcaagc tgtacttcca gaacccccag     1800

ttcatgggcg gctgggacaa ggacaaggag acggactacc gcgccaccat cctccgctac     1860

ggcagcaagt actacctggc catcatggac aagaagtacg cgaagtgcct ccagaagatc     1920

gacaaggacg acgtcaacgg caactacgag aagatcaact acaagctcct gccgggcccc     1980

aacaagatgc tgccgaaggt gttcttctcc aagaagtgga tggcctacta caaccccagc     2040

gaggacatcc agaagatcta caagaacggc acgttcaaga agggcgacat gttcaacctc     2100

aacgactgcc acaagctgat cgacttcttc aaggactcca tcagccgcta cccgaagtgg     2160

tccaacgcct acgacttcaa cttcagcgag acagagaagt acaaggacat cgcgggcttc     2220

tacagggagg tcgaggagca gggctacaag gtgtccttcg agtccgccag caagaaggag     2280

gtcgacaagc tcgtggagga gggcaagctg tacatgttcc agatctacaa caaggacttc     2340

tccgacaaga gccacggcac gcccaacctc cacaccatgt acttcaagct cctgttcgac     2400

gagaacaacc acggccagat ccgcctctcc ggcggcgccg agctgttcat gaggagggcg     2460

agcctcaaga aggaggagct ggtggtccac cccgctaaca gcccaatcgc gaacaagaac     2520

ccggacaacc ccaagaagac cacgaccctc tcctacgacg tgtacaagga caagcgcttc     2580

agcgaggacc agtacgagct gcacatcccg atcgccatca acaagtgccc caagaacatc     2640

ttcaagatca acaccgaggt cagggtgctc ctgaagcacg acgacaaccc ctacgtgatc     2700

ggcatcgacc gcggcgagag gaacctcctg tacatcgtgg tcgtggacgg caagggcaac     2760

atcgtggagc agtactccct gaacgagatc atcaacaact tcaacggcat ccgcatcaag     2820

acggactacc acagcctcct ggacaagaag gagaaggagc gcttcgaggc caggcagaac     2880

tggacctcca tcgagaacat caaggagctc aaggcgggct acatcagcca ggtcgtgcac     2940

aagatctgcg agctggtcga gaagtacgac gccgtgatcg cgctcgagga cctgaactcc     3000

ggcttcaaga acagcagggt caaggtggag aagcaggtct accagaagtt cgagaagatg     3060

ctcatcgaca agctgaacta catggtggac aagaagtcca acccgtgcgc tacgggcggc     3120

gcgctcaagg gctaccagat caccaacaag ttcgagagct tcaagtccat gagcacccag     3180

aacggcttca tcttctacat cccggcctgg ctgacgtcca agatcgaccc cagcaccggc     3240

ttcgtcaacc tcctgaagac gaagtacacc tccatcgcgg acagcaagaa gttcatctcc     3300

agcttcgacc gcatcatgta tgtgccggag gaggacctct tcgagttcgc cctggactac     3360

aagaacttct ccaggacgga cgcggattac atcaagaagt ggaagctcta cagctacggc     3420

aaccgcatca ggatcttccg caaccccaag aagaacaacg tcttcgactg ggaggaggtg     3480

tgcctcacct ccgcctacaa ggagctgttc aacaagtacg gcatcaacta ccagcagggc     3540

gacatcaggg cgctcctgtg cgagcagagc gacaaggcct tctactccag cttcatggcg     3600

ctcatgtccc tcatgctgca gatgcgcaac agcatcacgg gcaggaccga cgtcgacttc     3660

ctgatctccc cggtgaagaa cagcgacggc atcttctacg acagccgcaa ctacgaggcc     3720

caggagaacg cgatcctgcc aaagaacgcg gacgccaacg gcgcctacaa catcgcgagg     3780

aaggtgctgt gggccatcgg ccagttcaag aaggcggagg acgagaagct cgacaaggtc     3840

aagatcgcca tctccaacaa ggagtggctg gagtacgcgc agacctcggt gaagcacaag     3900

aggcccgctg ccaccaagaa ggcgggccag gccaagaaga agaagtga                  3948


<210>  158
<211>  3948
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  variant III of Cpf1 (SEQ ID NO: 75) including an intronic 
       sequence


<220>
<221>  Intron
<222>  (628)..(816)

<400>  158
atggccccga agaagaagag gaaggtcagc aagctcgaga agttcaccaa ctgctacagc       60

ctgagcaaga ccctgaggtt caaggctatc ccggtgggca agacccaaga gaacatcgac      120

aacaagaggc tgctggtcga ggacgagaag cgcgctgagg attacaaggg cgtgaagaag      180

ctgctggaca ggtactacct gagcttcatc aacgacgtgc tgcacagcat caagctgaag      240

aacctgaaca actacatcag cctgttccgc aagaaaacca ggaccgagaa agagaacaaa      300

gagcttgaga acctcgagat caacctgagg aaagagatcg ccaaggcctt caagggcaac      360

gagggctaca agagcctgtt caagaaggac atcatcgaga ctatcctgcc agagttcctg      420

gacgacaagg acgagatcgc cctggtgaac agcttcaacg gcttcacgac cgccttcacc      480

ggtttcttcg acaaccgcga gaatatgttc agcgaggaag ccaagagcac ctctatcgcc      540

ttccgctgca tcaacgagaa cctgacgcgc tacatctcca acatggatat cttcgagaag      600

gtggacgcca tcttcgataa gcacgaggta agtttctgct tctacctttg atatatatat      660

aataattatc attaattagt agtaatataa tatttcaaat atttttttca aaataaaaga      720

atgtagtata tagcaattgc ttttctgtag tttataagtg tgtatatttt aatttataac      780

ttttctaata tatgaccaaa acatggtgat gtgcaggtgc aagagatcaa agaaaagatc      840

ctgaacagcg actacgacgt cgaggacttc ttcgagggcg agttcttcaa cttcgtgctc      900

acccaagagg gcatcgatgt gtacaacgcc atcatcggcg gcttcgtgac tgagagcggc      960

gagaagatca agggcctgaa cgagtacatc aacctctaca atcaaaagac caagcagaag     1020

ctgccgaagt tcaagccgct gtacaagcag gttctgagcg accgcgagag cctgtctttc     1080

tacggcgagg gttacaccag cgacgaagag gtgttggagg ttttccgcaa caccctgaac     1140

aagaacagcg agatcttcag ctccatcaag aagctggaaa agctgtttaa gaacttcgac     1200

gagtacagca gcgccggcat cttcgtgaag aacggcccag ctatcagcac catcagcaag     1260

gacatcttcg gcgagtggaa cgtgatcagg gacaagtgga acgccgagta cgacgacatc     1320

cacctgaaga aaaaggccgt ggtgaccgag aagtacgagg acgacaggcg caagagcttc     1380

aagaagatcg gctccttcag cctcgagcag ctgcaagagt acgctgacgc tgacctgagc     1440

gtggtcgaga agctcaaaga gatcatcatc cagaaggtcg acgagatcta caaggtgtac     1500

ggcagcagcg agaagctttt cgacgccgac ttcgtccttg agaagtccct caagaaaaac     1560

gacgccgtgg tggccatcat gaaggacctg ctggactccg tgaagtcctt cgagaactac     1620

attaaggctt tcttcggtga gggcaaagag actaacaggg acgagagctt ctacggggat     1680

ttcgtgctgg cctacgacat cctgctcaag gtggaccaca tctacgacgc catccgcaac     1740

tacgtgaccc agaagccgta ctccaaggac aagtttaagc tgtacttcca gaatccgcag     1800

ttcatgggcg gctgggacaa agacaaagaa accgactaca gggccaccat cctgaggtac     1860

ggctccaagt actacctcgc catcatggac aagaaatacg ccaagtgcct gcagaagatc     1920

gataaggacg acgtgaacgg caactacgag aagattaact acaagctgct gccagggccg     1980

aacaagatgc tcccgaaggt gttctttagc aagaaatgga tggcctacta caacccgagc     2040

gaggatatcc agaaaatcta caagaacggc accttcaaga aaggcgacat gttcaacctg     2100

aacgactgcc acaagctgat cgatttcttc aaggacagca tctctcgcta cccgaagtgg     2160

tccaacgcct acgatttcaa cttcagcgag actgaaaagt acaaggatat cgccggcttc     2220

taccgcgagg tcgaggaaca gggttacaag gtgagcttcg agagcgccag caagaaagag     2280

gtggacaagc tggtcgaaga gggcaagctg tacatgttcc agatctataa caaggacttc     2340

tccgacaaga gccacggcac cccaaacctg cacaccatgt acttcaagtt gctgttcgac     2400

gagaacaacc acggccagat caggctttct ggcggcgctg agcttttcat gagaagggcc     2460

agcctgaaaa aagaggaact ggtcgttcac ccggcgaaca gcccaatcgc caacaagaac     2520

ccggacaacc cgaaaaagac caccacgctg agctacgacg tgtacaagga caaaaggttc     2580

tccgaggacc agtacgagct gcacatcccg atcgccatca acaagtgccc gaagaacatc     2640

ttcaagatca acaccgaggt gagggtgctg ctgaagcacg acgacaaccc atacgtgatc     2700

ggcatcgata ggggcgagcg caacctgctc tacatcgtgg tggttgacgg caagggcaat     2760

atcgtcgagc agtacagcct taacgagatc attaacaact tcaatggcat caggatcaag     2820

accgactacc acagcctgct cgacaagaaa gaaaaagagc gcttcgaggc caggcagaac     2880

tggaccagca tcgagaatat caaagagctg aaggccggct acattagcca ggtggtgcac     2940

aagatctgcg agctggtgga aaagtacgac gcggtgatcg ctctcgagga cctgaactcc     3000

gggttcaaga actcccgcgt gaaggttgag aagcaggtct accaaaagtt cgagaagatg     3060

ctgatcgaca agctcaacta catggtggac aaaaagagca acccctgcgc cacaggcggc     3120

gctcttaagg gctaccagat cacgaacaag ttcgagtcct tcaagagcat gagcacccag     3180

aatggcttca tcttctacat cccggcctgg ctgaccagca agatcgatcc atctaccggc     3240

ttcgtcaacc tcctcaagac caagtacacc agcattgccg acagcaagaa gttcatctcc     3300

agcttcgaca ggatcatgta cgtgccggaa gaggacctgt tcgagttcgc gctcgattac     3360

aagaacttca gcaggaccga cgcggactat attaagaagt ggaagctcta cagctacggc     3420

aacaggatcc gcatcttcag aaacccgaag aaaaacaacg tgttcgactg ggaagaagtg     3480

tgcctgacca gcgcctacaa agaactgttc aacaagtacg gcatcaacta ccagcagggc     3540

gacatcaggg ctctgctgtg cgagcagtct gacaaggcgt tctacagctc cttcatggcc     3600

ctgatgagcc tgatgctgca gatgaggaac agcatcaccg gcaggacgga cgtcgacttc     3660

ctgatcagcc cagtgaagaa ttccgacggc attttctacg actctaggaa ctacgaggct     3720

caagagaacg ccatcctgcc gaagaacgcc gatgctaacg gcgcgtacaa cattgcccgc     3780

aaggtgctgt gggctatcgg ccagtttaag aaggccgagg acgaaaaact ggacaaggtg     3840

aagatcgcca ttagcaacaa agagtggctc gagtacgccc agaccagcgt gaagcacaaa     3900

aggccagccg ccactaagaa ggctggccag gccaaaaaga agaagtga                  3948


