                         SEQUENCE LISTING

<110>  INSTITUT PASTEUR
 
<120>  TAL EFFECTOR MEANS USEFUL FOR PARTIAL OR FULL DELETION OF DNA 
       TANDEM REPEATS

<130>  B10179A - FP/BA

<150>  EP13306644
<151>  2013-11-29

<160>  56    

<170>  PatentIn version 3.5

<210>  1
<211>  8810
<212>  DNA
<213>  Artificial

<220>
<223>  plasmid pCLS9996

<400>  1
gcgcacattt ccccgaaaag tgccacctga cgtccgatca aaaatcatcg cttcgctgat       60

taattacccc agaaataagg ctaaaaaact aatcgcatta tcatcctatg gttgttaatt      120

tgattcgttc atttgaaggt ttgtggggcc aggttactgc caatttttcc tcttcataac      180

cataaaagct agtattgtag aatctttatt gttcggagca gtgcggcgcg aggcacatct      240

gcgtttcagg aacgcgaccg gtgaagacga ggacgcacgg aggagagtct tccttcggag      300

ggctgtcacc cgctcggcgg cttctaatcc gtacttcaat atagcaatga gcagttaagc      360

gtattactga aagttccaaa gagaaggttt ttttaggcta atcgacctcg agcagatccg      420

ccaggcgtgt atatagcgtg gatggccagg caactttagt gctgacacat acaggcatat      480

atatatgtgt gcgacgacac atgatcatat ggcatgcatg tgctctgtat gtatataaaa      540

ctcttgtttt cttcttttct ctaaatattc tttccttata cattaggtcc tttgtagcat      600

aaattactat acttctatag acacgcaaac acaaatacac agcggccttg ccaccatggg      660

cgatcctaaa aagaaacgta aggtcatcga taaggagacc gccgctgcca agttcgagag      720

acagcacatg gacagcatcg atatcgccga tctacgcacg ctcggctaca gccagcagca      780

acaggagaag atcaaaccga aggttcgttc gacagtggcg cagcaccacg aggcactggt      840

cggccacggg tttacacacg cgcacatcgt tgcgttaagc caacacccgg cagcgttagg      900

gaccgtcgct gtcaagtatc aggacatgat cgcagcgttg ccagaggcga cacacgaagc      960

gatcgttggc gtcggcaaac agtggtccgg cgcacgcgct ctggaggcct tgctcacggt     1020

ggcgggagag ttgagaggtc caccgttaca gttggacaca ggccaacttc tcaagattgc     1080

aaaacgtggc ggcgtgaccg cagtggaggc agtgcatgca tggcgcaatg cactgacggg     1140

tgccccgctc aacttgaccc cccagcaggt ggtggccatc gccagcaata atggtggcaa     1200

gcaggcgctg gagacggtcc agcggctgtt gccggtgctg tgccaggccc acggcttgac     1260

cccggagcag gtggtggcca tcgccagcca cgatggcggc aagcaggcgc tggagacggt     1320

ccagcggctg ttgccggtgc tgtgccaggc ccacggcttg accccccagc aggtggtggc     1380

catcgccagc aatggcggtg gcaagcaggc gctggagacg gtccagcggc tgttgccggt     1440

gctgtgccag gcccacggct tgacccccca gcaggtggtg gccatcgcca gcaataatgg     1500

tggcaagcag gcgctggaga cggtccagcg gctgttgccg gtgctgtgcc aggcccacgg     1560

cttgaccccg gagcaggtgg tggccatcgc cagccacgat ggcggcaagc aggcgctgga     1620

gacggtccag cggctgttgc cggtgctgtg ccaggcccac ggcttgaccc cccagcaggt     1680

ggtggccatc gccagcaatg gcggtggcaa gcaggcgctg gagacggtcc agcggctgtt     1740

gccggtgctg tgccaggccc acggcttgac cccccagcag gtggtggcca tcgccagcaa     1800

taatggtggc aagcaggcgc tggagacggt ccagcggctg ttgccggtgc tgtgccaggc     1860

ccacggcttg accccggagc aggtggtggc catcgccagc cacgatggcg gcaagcaggc     1920

gctggagacg gtccagcggc tgttgccggt gctgtgccag gcccacggct tgacccccca     1980

gcaggtggtg gccatcgcca gcaatggcgg tggcaagcag gcgctggaga cggtccagcg     2040

gctgttgccg gtgctgtgcc aggcccacgg cttgaccccc cagcaggtgg tggccatcgc     2100

cagcaataat ggtggcaagc aggcgctgga gacggtccag cggctgttgc cggtgctgtg     2160

ccaggcccac ggcttgaccc cggagcaggt ggtggccatc gccagccacg atggcggcaa     2220

gcaggcgctg gagacggtcc agcggctgtt gccggtgctg tgccaggccc acggcttgac     2280

cccccagcag gtggtggcca tcgccagcaa tggcggtggc aagcaggcgc tggagacggt     2340

ccagcggctg ttgccggtgc tgtgccaggc ccacggcttg accccccagc aggtggtggc     2400

catcgccagc aataatggtg gcaagcaggc gctggagacg gtccagcggc tgttgccggt     2460

gctgtgccag gcccacggct tgaccccgga gcaggtggtg gccatcgcca gccacgatgg     2520

cggcaagcag gcgctggaga cggtccagcg gctgttgccg gtgctgtgcc aggcccacgg     2580

cttgaccccc cagcaggtgg tggccatcgc cagcaatggc ggtggcaagc aggcgctgga     2640

gacggtccag cggctgttgc cggtgctgtg ccaggcccac ggcttgaccc ctcagcaggt     2700

ggtggccatc gccagcaatg gcggcggcag gccggcgctg gagagcattg ttgcccagtt     2760

atctcgccct gatccggcgt tggccgcgtt gaccaacgac cacctcgtcg ccttggcctg     2820

cctcggcggg cgtcctgcgc tggatgcagt gaaaaaggga ttgggggatc ctatcagccg     2880

ttcccagctg gtgaagtccg agctggagga gaagaaatcc gagttgaggc acaagctgaa     2940

gtacgtgccc cacgagtaca tcgagctgat cgagatcgcc cggaacagca cccaggaccg     3000

tatcctggag atgaaggtga tggagttctt catgaaggtg tacggctaca ggggcaagca     3060

cctgggcggc tccaggaagc ccgacggcgc catctacacc gtgggctccc ccatcgacta     3120

cggcgtgatc gtggacacca aggcctactc cggcggctac aacctgccca tcggccaggc     3180

cgacgaaatg cagaggtacg tggaggagaa ccagaccagg aacaagcaca tcaaccccaa     3240

cgagtggtgg aaggtgtacc cctccagcgt gaccgagttc aagttcctgt tcgtgtccgg     3300

ccacttcaag ggcaactaca aggcccagct gaccaggctg aaccacatca ccaactgcaa     3360

cggcgccgtg ctgtccgtgg aggagctcct gatcggcggc gagatgatca aggccggcac     3420

cctgaccctg gaggaggtga ggaggaagtt caacaacggc gagatcaact tcgcggccga     3480

ctgataactc gagcgatcct ctagacgagc tcctcgagcc tgcagcagct gaagctttgg     3540

acttcttcgc cagaggtttg gtcaagtctc caatcaaggt tgtcggcttg tctaccttgc     3600

cagaaattta cgaaaagatg gaaaagggtc aaatcgttgg tagatacgtt gttgacactt     3660

ctaaataagc gaatttctta tgatttatga tttttattat taaataagtt ataaaaaaaa     3720

taagtgtata caaattttaa agtgactctt aggttttaaa acgaaaattc ttattcttga     3780

gtaactcttt cctgtaggtc aggttgcttt ctcaggtata gcatgaggtc gctcttattg     3840

accacacctc taccggcatg caagcttggc gtaatcatgg tcatagctgt ttcctgtgtg     3900

aaattgttat ccgctcacaa ttccacacaa catacgagcc ggaagcataa agtgtaaagc     3960

ctggggtgcc taatgagtga gctaactcac attaattgcg ttgcgctcac tgcccgcttt     4020

ccagtcggga aacctgtcgt gccagcagat ctgtttagct tgcctcgtcc ccgccgggtc     4080

acccggccag cgacatggag gcccagaata ccctccttga cagtcttgac gtgcgcagct     4140

caggggcatg atgtgactgt cgcccgtaca tttagcccat acatccccat gtataatcat     4200

ttgcatccat acattttgat ggccgcacgg cgcgaagcaa aaattacggc tcctcgctgc     4260

agacctgcga gcagggaaac gctcccctca cagacgcgtt gaattgtccc cacgccgcgc     4320

ccctgtagag aaatataaaa ggttaggatt tgccactgag gttcttcttt catatacttc     4380

cttttaaaat cttgctagga tacagttctc acatcacatc cgaacataaa caaccatgca     4440

tgggtaagga aaagactcac gtttcgaggc cgcgattaaa ttccaacatg gatgctgatt     4500

tatatgggta taaatgggct cgcgataatg tcgggcaatc aggtgcgaca atctatcgat     4560

tgtatgggaa gcccgatgcg ccagagttgt ttctgaaaca tggcaaaggt agcgttgcca     4620

atgatgttac agatgagatg gtcagactaa actggctgac ggaatttatg cctcttccga     4680

ccatcaagca ttttatccgt actcctgatg atgcatggtt actcaccact gcgatccccg     4740

gcaaaacagc attccaggta ttagaagaat atcctgattc aggtgaaaat attgttgatg     4800

cgctggcagt gttcctgcgc cggttgcatt cgattcctgt ttgtaattgt ccttttaaca     4860

gcgatcgcgt atttcgcctc gctcaggcgc aatcacgaat gaataacggt ttggttgatg     4920

cgagtgattt tgatgacgag cgtaatggct ggcctgttga acaagtctgg aaagaaatgc     4980

ataagctttt gccattctca ccggattcag tcgtcactca tggtgatttc tcacttgata     5040

accttatttt tgacgagggg aaattaatag gttgtattga tgttggacga gtcggaatcg     5100

cagaccgata ccaggatctt gccatcctat ggaactgcct cggtgagttt tctccttcat     5160

tacagaaacg gctttttcaa aaatatggta ttgataatcc tgatatgaat aaattgcagt     5220

ttcatttgat gctcgatgag tttttctaat cagtactgac aataaaaaga ttcttgtttt     5280

caagaacttg tcatttgtat agttttttta tattgtagtt gttctatttt aatcaaatgt     5340

tagcgtgatt tatatttttt ttcgcctcga catcatctgc ccagatgcga agttaagtgc     5400

gcagaaagta atatcatgcg tcaatcgtat gtgaatgctg gtcgctatac tgctgtcgat     5460

tcgatactaa cgccgccatc cagtgtcgaa aacgagctcg aattcatcga tgatatcaga     5520

tccactagtg gcctatgcga ccgcggatct gccggtctcc ctatagtgag tcgtattaat     5580

ttcgataagc caggttaacc tgcattaatg aatcggccaa cgcgcgggga gaggcggttt     5640

gcgtattggg cgctcttccg cttcctcgct cactgactcg ctgcgctcgg tcgttcggct     5700

gcggcgagcg gtatcagcat cgatgaattc cacggactat agactatact agtatactcc     5760

gtctactgta cgatacactt ccgctcaggt ccttgtcctt taacgaggcc ttaccactct     5820

tttgttactc tattgatcca gctcagcaaa ggcagtgtga tctaagattc tatcttcgcg     5880

atgtagtaaa actagctaga ccgagaaaga gactagaaat gcaaaaggca cttctacaat     5940

ggctgccatc attattatcc gatgtgacgc tgcagcttct caatgatatt cgaatacgct     6000

ttgaggagat acagcctaat atccgacaaa ctgttttaca gatttacgat cgtacttgtt     6060

acccatcatt gaattttgaa catccgaacc tgggagtttt ccctgaaaca gatagtatat     6120

ttgaacctgt ataataatat atagtctagc gctttacgga agacaatgta tgtatttcgg     6180

ttcctggaga aactattgca tctattgcat aggtaatctt gcacgtcgca tccccggttc     6240

attttctgcg tttccatctt gcacttcaat agcatatctt tgttaacgaa gcatctgtgc     6300

ttcattttgt agaacaaaaa tgcaacgcga gagcgctaat ttttcaaaca aagaatctga     6360

gctgcatttt tacagaacag aaatgcaacg cgaaagcgct attttaccaa cgaagaatct     6420

gtgcttcatt tttgtaaaac aaaaatgcaa cgcgagagcg ctaatttttc aaacaaagaa     6480

tctgagctgc atttttacag aacagaaatg caacgcgaga gcgctatttt accaacaaag     6540

aatctatact tcttttttgt tctacaaaaa tgcatcccga gagcgctatt tttctaacaa     6600

agcatcttag attacttttt ttctcctttg tgcgctctat aatgcagtct cttgataact     6660

ttttgcactg taggtccgtt aaggttagaa gaaggctact ttggtgtcta ttttctcttc     6720

cataaaaaaa gcctgactcc acttcccgcg tttactgatt actagcgaag ctgcgggtgc     6780

attttttcaa gataaaggca tccccgatta tattctatac cgatgtggat tgcgcatact     6840

ttgtgaacag aaagtgatag cgttgatgat tcttcattgg tcagaaaatt atgaacggtt     6900

tcttctattt tgtctctata tactacgtat aggaaatgtt tacattttcg tattgttttc     6960

gattcactct atgaatagtt cttactacaa tttttttgtc taaagagtaa tactagagat     7020

aaacataaaa aatgtagagg tcgagtttag atgcaagttc aaggagcgaa aggtggatgg     7080

gtaggttata tagggatata gcacagagat atatagcaaa gagatacttt tgagcaatgt     7140

ttgtggaagc ggtattcgca atattttagt agctcgttac agtccggtgc gtttttggtt     7200

ttttgaaagt gcgtcttcag agcgcttttg gttttcaaaa gcgctctgaa gttcctatac     7260

tttctagaga ataggaactt cggaatagga acttcaaagc gtttccgaaa acgagcgctt     7320

ccgaaaatgc aacgcgagct gcgcacatac agctcactgt tcacgtcgca cctatatctg     7380

cgtgttgcct gtatatatat atacatgaga agaacggcat agtgcgtgtt tatgcttaaa     7440

tgcgtactta tatgcgtcta tttatgtagg atgaaaggta gtctagtacc tcctgtgata     7500

ttatcccatt ccatgcgggg tatcgtatgc ttccttcagc actacccttt agctgttcta     7560

tatgctgcca ctcctcaatt ggattagtct catccttcaa tgctatcatt tcctttgata     7620

ttggatcata tgcatagtac cgagaaacta gtgcgaagta gtgatcaggt attgctgtta     7680

tctgatgagt atacgttgtc ctggccacgg cagaagcacg cttatcgctc caatttccca     7740

caacattagt caactccgtt aggcccttca ttgaaagaaa tgaggtcatc aaatgtcttc     7800

caatgtgaga ttttgggcca ttttttatag caaagattga ataaggcgca tttttcttca     7860

aagctttatt gtacgatctg actaagttat cttttaataa ttggtattcc tgtttattgc     7920

ttgaagaatt gccggtccta tttactcgtt ttaggactgg ttcagaattc atcgatgctc     7980

actcaaaggt cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg     8040

tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc     8100

cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga     8160

aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct     8220

cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg     8280

gcgctttctc atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag     8340

ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat     8400

cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac     8460

aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac     8520

tacggctaca ctagaaggac agtatttggt atctgcgctc tgctgaagcc agttaccttc     8580

ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt     8640

tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc     8700

ttttctacgg ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg     8760

agcggataca tatttgaatg tatttagaaa aataaacaaa taggggttcc                8810


<210>  2
<211>  11109
<212>  DNA
<213>  Artificial

<220>
<223>  plasmid pCLS16715

<400>  2
gggttccgcg cacatttccc cgaaaagtgc cacctgacgt ccgatcaaaa atcatcgctt       60

cgctgattaa ttaccccaga aataaggcta aaaaactaat cgcattatca tcctatggtt      120

gttaatttga ttcgttcatt tgaaggtttg tggggccagg ttactgccaa tttttcctct      180

tcataaccat aaaagctagt attgtagaat ctttattgtt cggagcagtg cggcgcgagg      240

cacatctgcg tttcaggaac gcgaccggtg aagacgagga cgcacggagg agagtcttcc      300

ttcggagggc tgtcacccgc tcggcggctt ctaatccgta cttcaatata gcaatgagca      360

gttaagcgta ttactgaaag ttccaaagag aaggtttttt taggctaatc gacctcgagc      420

agatccgcca ggcgtgtata tagcgtggat ggccaggcaa ctttagtgct gacacataca      480

ggcatatata tatgtgtgcg acgacacatg atcatatggc atgcatgtgc tctgtatgta      540

tataaaactc ttgttttctt cttttctcta aatattcttt ccttatacat taggtccttt      600

gtagcataaa ttactatact tctatagaca cgcaaacaca aatacacagc ggccttgcca      660

ccatgggcga tcctaaaaag aaacgtaagg tcatcgatta cccatacgat gttccagatt      720

acgctatcga tatcgccgat ctacgcacgc tcggctacag ccagcagcaa caggagaaga      780

tcaaaccgaa ggttcgttcg acagtggcgc agcaccacga ggcactggtc ggccacgggt      840

ttacacacgc gcacatcgtt gcgttaagcc aacacccggc agcgttaggg accgtcgctg      900

tcaagtatca ggacatgatc gcagcgttgc cagaggcgac acacgaagcg atcgttggcg      960

tcggcaaaca gtggtccggc gcacgcgctc tggaggcctt gctcacggtg gcgggagagt     1020

tgagaggtcc accgttacag ttggacacag gccaacttct caagattgca aaacgtggcg     1080

gcgtgaccgc agtggaggca gtgcatgcat ggcgcaatgc actgacgggt gccccgctca     1140

acttgacccc ccagcaggtg gtggccatcg ccagcaataa tggtggcaag caggcgctgg     1200

agacggtcca gcggctgttg ccggtgctgt gccaggccca cggcttgacc ccccagcagg     1260

tggtggccat cgccagcaat ggcggtggca agcaggcgct ggagacggtc cagcggctgt     1320

tgccggtgct gtgccaggcc cacggcttga ccccccagca ggtggtggcc atcgccagca     1380

ataatggtgg caagcaggcg ctggagacgg tccagcggct gttgccggtg ctgtgccagg     1440

cccacggctt gaccccggag caggtggtgg ccatcgccag caatattggt ggcaagcagg     1500

cgctggagac ggtgcaggcg ctgttgccgg tgctgtgcca ggcccacggc ttgacccccc     1560

agcaggtggt ggccatcgcc agcaatggcg gtggcaagca ggcgctggag acggtccagc     1620

ggctgttgcc ggtgctgtgc caggcccacg gcttgacccc ggagcaggtg gtggccatcg     1680

ccagccacga tggcggcaag caggcgctgg agacggtcca gcggctgttg ccggtgctgt     1740

gccaggccca cggcttgacc ccggagcagg tggtggccat cgccagccac gatggcggca     1800

agcaggcgct ggagacggtc cagcggctgt tgccggtgct gtgccaggcc cacggcttga     1860

ccccggagca ggtggtggcc atcgccagcc acgatggcgg caagcaggcg ctggagacgg     1920

tccagcggct gttgccggtg ctgtgccagg cccacggctt gaccccggag caggtggtgg     1980

ccatcgccag ccacgatggc ggcaagcagg cgctggagac ggtccagcgg ctgttgccgg     2040

tgctgtgcca ggcccacggc ttgaccccgg agcaggtggt ggccatcgcc agccacgatg     2100

gcggcaagca ggcgctggag acggtccagc ggctgttgcc ggtgctgtgc caggcccacg     2160

gcttgacccc ggagcaggtg gtggccatcg ccagccacga tggcggcaag caggcgctgg     2220

agacggtcca gcggctgttg ccggtgctgt gccaggccca cggcttgacc ccggagcagg     2280

tggtggccat cgccagcaat attggtggca agcaggcgct ggagacggtg caggcgctgt     2340

tgccggtgct gtgccaggcc cacggcttga ccccccagca ggtggtggcc atcgccagca     2400

ataatggtgg caagcaggcg ctggagacgg tccagcggct gttgccggtg ctgtgccagg     2460

cccacggctt gaccccggag caggtggtgg ccatcgccag ccacgatggc ggcaagcagg     2520

cgctggagac ggtccagcgg ctgttgccgg tgctgtgcca ggcccacggc ttgaccccgg     2580

agcaggtggt ggccatcgcc agcaatattg gtggcaagca ggcgctggag acggtgcagg     2640

cgctgttgcc ggtgctgtgc caggcccacg gcttgacccc tcagcaggtg gtggccatcg     2700

ccagcaatgg cggcggcagg ccggcgctgg agagcattgt tgcccagtta tctcgccctg     2760

atccggcgtt ggccgcgttg accaacgacc acctcgtcgc cttggcctgc ctcggcgggc     2820

gtcctgcgct ggatgcagtg aaaaagggat tgggggatcc tatcagccgt tcccagctgg     2880

tgaagtccga gctggaggag aagaaatccg agttgaggca caagctgaag tacgtgcccc     2940

acgagtacat cgagctgatc gagatcgccc ggaacagcac ccaggaccgt atcctggaga     3000

tgaaggtgat ggagttcttc atgaaggtgt acggctacag gggcaagcac ctgggcggct     3060

ccaggaagcc cgacggcgcc atctacaccg tgggctcccc catcgactac ggcgtgatcg     3120

tggacaccaa ggcctactcc ggcggctaca acctgcccat cggccaggcc gacgaaatgc     3180

agaggtacgt ggaggagaac cagaccagga acaagcacat caaccccaac gagtggtgga     3240

aggtgtaccc ctccagcgtg accgagttca agttcctgtt cgtgtccggc cacttcaagg     3300

gcaactacaa ggcccagctg accaggctga accacatcac caactgcaac ggcgccgtgc     3360

tgtccgtgga ggagctcctg atcggcggcg agatgatcaa ggccggcacc ctgaccctgg     3420

aggaggtgag gaggaagttc aacaacggcg agatcaactt cgcggccgac tgataactcg     3480

agcgatcctc tagacgagct cctcgagcct gcagcagctg aagctttgga cttcttcgcc     3540

agaggtttgg tcaagtctcc aatcaaggtt gtcggcttgt ctaccttgcc agaaatttac     3600

gaaaagatgg aaaagggtca aatcgttggt agatacgttg ttgacacttc taaataagcg     3660

aatttcttat gatttatgat ttttattatt aaataagtta taaaaaaaat aagtgtatac     3720

aaattttaaa gtgactctta ggttttaaaa cgaaaattct tattcttgag taactctttc     3780

ctgtaggtca ggttgctttc tcaggtatag catgaggtcg ctcttattga ccacacctct     3840

accggcatgc aagcttggcg taatcatggt catagctgtt tcctgtgtga aattgttatc     3900

cgctcacaat tccacacaac atacgagccg gaagcataaa gtgtaaagcc tggggtgcct     3960

aatgagtgag ctaactcaca ttaattgcgt tgcgctcact gcccgctttc cagtcgggaa     4020

acctgtcgtg ccagcagatc tattacatta tgggtggtat gttggaataa aaatcaacta     4080

tcatctacta actagtattt acgttactag tatattatca tatacggtgt tagaagatga     4140

cgcaaatgat gagaaatagt catctaaatt agtggaagct gaaacgcaag gattgataat     4200

gtaataggat caatgaatat taacatataa aatgatgata ataatattta tagaattgtg     4260

tagaattgca gattcccttt tatggattcc taaatcctcg aggagaactt ctagtatatc     4320

tacataccta atattattgc cttattaaaa atggaatccc aacaattaca tcaaaatcca     4380

cattctcttc aaaatcaatt gtcctgtact tccttgttca tgtgtgttca aaaacgttat     4440

atttatagga taattatact ctatttctca acaagtaatt ggttgtttgg ccgagcggtc     4500

taaggcgcct gattcaagaa atatcttgac cgcagttaac tgtgggaata ctcaggtatc     4560

gtaagatgca agagttcgaa tctcttagca accattattt ttttcctcaa cataacgaga     4620

acacacaggg gcgctatcgc acagaatcaa attcgatgac tggaaatttt ttgttaattt     4680

cagaggtcgc ctgacgcata tacctttttc aactgaaaaa ttgggagaaa aaggaaaggt     4740

gagagccgcg gaaccggctt ttcatataga atagagaagc gttcatgact aaatgcttgc     4800

atcacaatac ttgaagttga caatattatt taaggaccta ttgttttttc caataggtgg     4860

ttagcaatcg tcttactttc taacttttct taccttttac atttcagcaa tatatatata     4920

tatatttcaa ggatatacca ttctaatgtc tgcccctaag aagatcgtcg ttttgccagg     4980

tgaccacgtt ggtcaagaaa tcacagccga agccattaag gttcttaaag ctatttctga     5040

tgttcgttcc aatgtcaagt tcgatttcga aaatcattta attggtggtg ctgctatcga     5100

tgctacaggt gtcccacttc cagatgaggc gctggaagcc tccaagaagg ttgatgccgt     5160

tttgttaggt gctgtgggtg gtcctaaatg gggtaccggt agtgttagac ctgaacaagg     5220

tttactaaaa atccgtaaag aacttcaatt gtacgccaac ttaagaccat gtaactttgc     5280

atccgactct cttttagact tatctccaat caagccacaa tttgctaaag gtactgactt     5340

cgttgttgtc agagaattag tgggaggtat ttactttggt aagagaaagg aagacgatgg     5400

tgatggtgtc gcttgggata gtgaacaata caccgttcca gaagtgcaaa gaatcacaag     5460

aatggccgct ttcatggccc tacaacatga gccaccattg cctatttggt ccttggataa     5520

agctaatgtt ttggcctctt caagattatg gagaaaaact gtggaggaaa ccatcaagaa     5580

cgaattccct acattgaagg ttcaacatca attgattgat tctgccgcca tgatcctagt     5640

taagaaccca acccacctaa atggtattat aatcaccagc aacatgtttg gtgatatcat     5700

ctccgatgaa gcctccgtta tcccaggttc cttgggtttg ttgccatctg cgtccttggc     5760

ctctttgcca gacaagaaca ccgcatttgg tttgtacgaa ccatgccacg gttctgctcc     5820

agatttgcca aagaataagg tcaaccctat cgccactatc ttgtctgctg caatgatgtt     5880

gaaattgtca ttgaacttgc ctgaagaagg taaggccatt gaagatgcag ttaaaaaggt     5940

tttggatgca ggtatcagaa ctggtgattt aggtggttcc aacagtacca cggaagtcgg     6000

tgatgctgtc gccgaagaag ttaagaaaat ccttgcttaa aaagattctc tttttttatg     6060

atatttgtac ataaacttta taaatgaaat tcataataga aacgacacga aattacaaaa     6120

tggaatatgt tcatagggta gacgaaacta tatacgcaat ctacatacat ttatcaagaa     6180

ggagaaaaag gaggatgtaa aggaatacag gtaagcaaat tgatactaat ggctcaacgt     6240

gataaggaaa aagaattgca ctttaacatt aatattgaca aggaggaggg caccacacaa     6300

aaagttaggt gtaacagaaa atcatgaaac tatgattcct aatttatata ttggaggatt     6360

ttctctaaaa aaaaaaaaat acaacaaata aaaaacactc aatgacctga ccatttgatg     6420

gagtttaagt caataccttc ttgaaccatt tcccataatg gtgaaagttc cctcaagaat     6480

tttactctgt cagaaacggc cttaacgacg tagtcgacct cctcttcagt actaaatcta     6540

ccaataccaa atctgatgga agaatgggct aatgcatcat ccttacccag cgcatgtaaa     6600

acataagaag gttctaggga agcagatgta caggctgaac ccgaggataa tgcgatatcc     6660

cttagtgcca tcaataaaga ttctccttcc acgtaggcga aagaaacgtt aacacaccct     6720

ggataacgat gatctggaga tccgttcaac gtggtatgtt cagcggataa tagacctttg     6780

actaatttat cggatagtct tttgatgtga gcttggtcgt tgtcaaattc tttcttcatc     6840

aatctcgcag cttcaccaaa tcccgctacc aatggggggg ccaaagtacc agatctgctg     6900

cattaatgaa tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ctcttccgct     6960

tcctcgctca ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagcatcg     7020

atgaattcca cggactatag actatactag tatactccgt ctactgtacg atacacttcc     7080

gctcaggtcc ttgtccttta acgaggcctt accactcttt tgttactcta ttgatccagc     7140

tcagcaaagg cagtgtgatc taagattcta tcttcgcgat gtagtaaaac tagctagacc     7200

gagaaagaga ctagaaatgc aaaaggcact tctacaatgg ctgccatcat tattatccga     7260

tgtgacgctg cagcttctca atgatattcg aatacgcttt gaggagatac agcctaatat     7320

ccgacaaact gttttacaga tttacgatcg tacttgttac ccatcattga attttgaaca     7380

tccgaacctg ggagttttcc ctgaaacaga tagtatattt gaacctgtat aataatatat     7440

agtctagcgc tttacggaag acaatgtatg tatttcggtt cctggagaaa ctattgcatc     7500

tattgcatag gtaatcttgc acgtcgcatc cccggttcat tttctgcgtt tccatcttgc     7560

acttcaatag catatctttg ttaacgaagc atctgtgctt cattttgtag aacaaaaatg     7620

caacgcgaga gcgctaattt ttcaaacaaa gaatctgagc tgcattttta cagaacagaa     7680

atgcaacgcg aaagcgctat tttaccaacg aagaatctgt gcttcatttt tgtaaaacaa     7740

aaatgcaacg cgagagcgct aatttttcaa acaaagaatc tgagctgcat ttttacagaa     7800

cagaaatgca acgcgagagc gctattttac caacaaagaa tctatacttc ttttttgttc     7860

tacaaaaatg catcccgaga gcgctatttt tctaacaaag catcttagat tacttttttt     7920

ctcctttgtg cgctctataa tgcagtctct tgataacttt ttgcactgta ggtccgttaa     7980

ggttagaaga aggctacttt ggtgtctatt ttctcttcca taaaaaaagc ctgactccac     8040

ttcccgcgtt tactgattac tagcgaagct gcgggtgcat tttttcaaga taaaggcatc     8100

cccgattata ttctataccg atgtggattg cgcatacttt gtgaacagaa agtgatagcg     8160

ttgatgattc ttcattggtc agaaaattat gaacggtttc ttctattttg tctctatata     8220

ctacgtatag gaaatgttta cattttcgta ttgttttcga ttcactctat gaatagttct     8280

tactacaatt tttttgtcta aagagtaata ctagagataa acataaaaaa tgtagaggtc     8340

gagtttagat gcaagttcaa ggagcgaaag gtggatgggt aggttatata gggatatagc     8400

acagagatat atagcaaaga gatacttttg agcaatgttt gtggaagcgg tattcgcaat     8460

attttagtag ctcgttacag tccggtgcgt ttttggtttt ttgaaagtgc gtcttcagag     8520

cgcttttggt tttcaaaagc gctctgaagt tcctatactt tctagagaat aggaacttcg     8580

gaataggaac ttcaaagcgt ttccgaaaac gagcgcttcc gaaaatgcaa cgcgagctgc     8640

gcacatacag ctcactgttc acgtcgcacc tatatctgcg tgttgcctgt atatatatat     8700

acatgagaag aacggcatag tgcgtgttta tgcttaaatg cgtacttata tgcgtctatt     8760

tatgtaggat gaaaggtagt ctagtacctc ctgtgatatt atcccattcc atgcggggta     8820

tcgtatgctt ccttcagcac taccctttag ctgttctata tgctgccact cctcaattgg     8880

attagtctca tccttcaatg ctatcatttc ctttgatatt ggatcatatg catagtaccg     8940

agaaactagt gcgaagtagt gatcaggtat tgctgttatc tgatgagtat acgttgtcct     9000

ggccacggca gaagcacgct tatcgctcca atttcccaca acattagtca actccgttag     9060

gcccttcatt gaaagaaatg aggtcatcaa atgtcttcca atgtgagatt ttgggccatt     9120

ttttatagca aagattgaat aaggcgcatt tttcttcaaa gctttattgt acgatctgac     9180

taagttatct tttaataatt ggtattcctg tttattgctt gaagaattgc cggtcctatt     9240

tactcgtttt aggactggtt cagaattcat cgatgctcac tcaaaggtcg gtaatacggt     9300

tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg     9360

ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg     9420

agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat     9480

accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta     9540

ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct     9600

gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc     9660

ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa     9720

gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg     9780

taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag     9840

tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt     9900

gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta     9960

cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc    10020

agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca    10080

cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa    10140

cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat    10200

ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct    10260

taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt    10320

tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat    10380

ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta    10440

atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg    10500

gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt    10560

tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg    10620

cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg    10680

taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc    10740

ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa    10800

ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac    10860

cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt    10920

ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg    10980

gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa    11040

gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata    11100

aacaaatag                                                            11109


<210>  3
<211>  597
<212>  DNA
<213>  Flavovacterium

<400>  3
cagctggtga agtccgagct ggaggagaag aaatccgagt tgaggcacaa gctgaagtac       60

gtgccccacg agtacatcga gctgatcgag atcgcccgga acagcaccca ggaccgtatc      120

ctggagatga aggtgatgga gttcttcatg aaggtgtacg gctacagggg caagcacctg      180

ggcggctcca ggaagcccga cggcgccatc tacaccgtgg gctcccccat cgactacggc      240

gtgatcgtgg acaccaaggc ctactccggc ggctacaacc tgcccatcgg ccaggccgac      300

gaaatgcaga ggtacgtgga ggagaaccag accaggaaca agcacatcaa ccccaacgag      360

tggtggaagg tgtacccctc cagcgtgacc gagttcaagt tcctgttcgt gtccggccac      420

ttcaagggca actacaaggc ccagctgacc aggctgaacc acatcaccaa ctgcaacggc      480

gccgtgctgt ccgtggagga gctcctgatc ggcggcgaga tgatcaaggc cggcaccctg      540

accctggagg aggtgaggag gaagttcaac aacggcgaga tcaacttcgc ggccgac         597


<210>  4
<211>  15
<212>  DNA
<213>  Artificial

<220>
<223>  DNA target site of the left-hand TALE of Figure 1B

<400>  4
gtgatccccc cagca                                                        15


<210>  5
<211>  15
<212>  DNA
<213>  Artificial

<220>
<223>  Sequence complementary to the DNA target site of SEQ ID NO: 4

<400>  5
tgctgggggg atcac                                                        15


<210>  6
<211>  5
<212>  DNA
<213>  Artificial

<220>
<223>  portion of the DNA target site of SEQ ID NO: 4 that is the 
       sequence of the 5' end of the DNA tandem repeat

<400>  6
cagca                                                                    5


<210>  7
<211>  10
<212>  DNA
<213>  Artificial

<220>
<223>  portion of the DNA target site of SEQ ID NO: 4 that is the gene 
       sequence that is immediately adjacent to the 5' end of the tandem
       repeat (outside of the tandem repeat sequence)

<400>  7
gtgatccccc                                                              10


<210>  8
<211>  20
<212>  DNA
<213>  Artificial

<220>
<223>  spacer of Figure 1B

<400>  8
gcagcagcag cagcagcagc                                                   20


<210>  9
<211>  20
<212>  DNA
<213>  Artificial

<220>
<223>  sequence of the spacer on the complementary strand

<400>  9
gctgctgctg ctgctgctgc                                                   20


<210>  10
<211>  15
<212>  DNA
<213>  Artificial

<220>
<223>  DNA target site of the right-hand TALE of Figure 1B

<400>  10
gctgctgctg ctgct                                                        15


<210>  11
<211>  15
<212>  DNA
<213>  Artificial

<220>
<223>  Sequence complementary to the DNA target site of SEQ ID NO: 10

<400>  11
agcagcagca gcagc                                                        15


<210>  12
<211>  65
<212>  DNA
<213>  Artificial

<220>
<223>  Split left TALE DNA-binding domain of Figure 1B

<400>  12
tcgctgcagg tcggcctcag cctggccgaa agaaagaaat ggtctgtgat ccccccagca       60

gcagc                                                                   65


<210>  13
<211>  65
<212>  DNA
<213>  Artificial

<220>
<223>  Sequence complementary to SEQ ID NO: 12

<400>  13
gctgctgctg gtccagccgg agtcggaccg gctttctttc tttaccagac actagggggc       60

agcga                                                                   65


<210>  14
<211>  27
<212>  DNA
<213>  Artificial

<220>
<223>  CTG repeat

<400>  14
ctgctgctgc tgctgctgct gctgctg                                           27


<210>  15
<211>  65
<212>  DNA
<213>  Artificial

<220>
<223>  DNA comprising a DNA direct tandem repeat consisting of 9 copies 
       of the unit CTG

<400>  15
tagccgggaa tgctgctgct gctgctgctg ctgctgctgg ggggatcaca gaccatttct       60

ttctt                                                                   65


<210>  16
<211>  68
<212>  DNA
<213>  Artificial

<220>
<223>  DNA comprising a DNA direct tandem repeat consisting of 9 copies 
       of the unit CTG

<400>  16
tagccgggaa tgctgctgct gctgctgctg ctgctgctgg ggggatcaca tacttttttt       60

ttctttcg                                                                68


<210>  17
<211>  19
<212>  DNA
<213>  Artificial

<220>
<223>  mutation detected in yeast

<400>  17
aaaaaaaaaa aaaaaaaaa                                                    19


<210>  18
<211>  24
<212>  DNA
<213>  Artificial

<220>
<223>  mutation detected in yeast

<400>  18
aaaaaaaaaa aaaaaaaaaa aaaa                                              24


<210>  19
<211>  19
<212>  DNA
<213>  Artificial

<220>
<223>  mutation detected in yeast

<400>  19
tttttttttt ttttttttt                                                    19


<210>  20
<211>  13
<212>  DNA
<213>  Artificial

<220>
<223>  mutation detected in yeast

<400>  20
tttttttttt ttt                                                          13


<210>  21
<211>  15
<212>  DNA
<213>  Artificial

<220>
<223>  mutation detected in yeast

<400>  21
aagaaaaaaa aaaaa                                                        15


<210>  22
<211>  5
<212>  PRT
<213>  Artificial

<220>
<223>  linker

<400>  22

Gln Gly Pro Ser Gly 
1               5   


<210>  23
<211>  34
<212>  DNA
<213>  Artificial

<220>
<223>  DNA comprising a DNA direct tandem repeat consisting of 8 copies 
       of the unit CAG

<400>  23
gtgatccccc cagcagcagc agcagcagca gcag                                   34


<210>  24
<211>  270
<212>  PRT
<213>  Artificial

<220>
<223>  TALE tandem repeat consisting of 8 copies of the unit of SEQ ID 
       NO: 25 (the RVDs being HD, NG, NI, NN, NS, N*, HG and H* 
       respectively)

<400>  24

Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys 
1               5                   10                  15      


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 
            20                  25                  30          


His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Gly Gly 
        35                  40                  45              


Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys 
    50                  55                  60                  


Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn 
65                  70                  75                  80  


Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 
                85                  90                  95      


Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala 
            100                 105                 110         


Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 
        115                 120                 125             


Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala 
    130                 135                 140                 


Ile Ala Ser Asn Ser Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 
145                 150                 155                 160 


Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val 
                165                 170                 175     


Val Ala Ile Ala Ser Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln 
            180                 185                 190         


Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln 
        195                 200                 205             


Val Val Ala Ile Ala Ser His Gly Gly Gly Lys Gln Ala Leu Glu Thr 
    210                 215                 220                 


Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro 
225                 230                 235                 240 


Glu Gln Val Val Ala Ile Ala Ser His Gly Gly Lys Gln Ala Leu Glu 
                245                 250                 255     


Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly 
            260                 265                 270 


<210>  25
<211>  34
<212>  PRT
<213>  Artificial

<220>
<223>  TAL effector tandem repeat unit [XX is selected from the group 
       consisting of HD, NG, NI, NN, NS, N*, HG, H*, IG, HA, ND, NK, HI,
       HN, NA, SN and YG (the symbol * denotes that the second X is 
       missing)]


<220>
<221>  MISC_FEATURE
<222>  (12)..(13)
<223>  XX is the RVD of the TAL effector tandem repat unit; XX selected 
       from the group consisting of HD, NG, NI, NN, NS, N*, HG, H*, IG, 
       HA, ND, NK, HI, HN, NA, SN and YG (the symbol * denotes that the 
       second X is missing)

<400>  25

Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Xaa Xaa Gly Gly Lys 
1               5                   10                  15      


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 
            20                  25                  30          


His Gly 
        


<210>  26
<211>  34
<212>  PRT
<213>  Artificial

<220>
<223>  TAL effector tandem repeat unit [XX is selected from the group 
       consisting of HD, NG, NI, NN, NS, N*, HG, H*, IG, HA, ND, NK, HI,
       HN, NA, SN and YG (the symbol * denotes that the second X is 
       missing)]


<220>
<221>  MISC_FEATURE
<222>  (12)..(13)
<223>  XX is the RVD of the TAL effector tandem repeat unit; XX is 
       selected from the group consisting of HD, NG, NI, NN, NS, N*, HG,
       H*, IG, HA, ND, NK, HI, HN, NA, SN and YG (the symbol * denotes 
       that the second X is missing)]

<400>  26

Leu Thr Pro Asp Gln Val Val Ala Ile Ala Ser Xaa Xaa Gly Gly Lys 
1               5                   10                  15      


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Asp 
            20                  25                  30          


His Gly 
        


<210>  27
<211>  39
<212>  DNA
<213>  Artificial

<220>
<223>  CAG repeat

<400>  27
cagcagcagc agcagcagca gcagcagcag cagcagcag                              39


<210>  28
<211>  87
<212>  DNA
<213>  Artificial

<220>
<223>  CAG repeat

<400>  28
cagcagcagc agcagcagca gcagcagcag cagcagcagc agcagcagca gcagcagcag       60

cagcagcagc agcagcagca gcagcag                                           87


<210>  29
<211>  45
<212>  DNA
<213>  Artificial

<220>
<223>  CAG repeat

<400>  29
cagcagcagc agcagcagca gcagcagcag cagcagcagc agcag                       45


<210>  30
<211>  9
<212>  DNA
<213>  Artificial

<220>
<223>  CAG repeat

<400>  30
cagcagcag                                                                9


<210>  31
<211>  366
<212>  DNA
<213>  Artificial

<220>
<223>  CTG repeat

<400>  31
ctgctgctgc tgctgctgct gctgctgctg ctgctgctgc tgctgctgct gctgctgctg       60

ctgctgctgc tgctgctgct gctgctgctg ctgctgctgc tgctgctgct gctgctgctg      120

ctgctgctgc tgctgctgct gctgctgctg ctgctgctgc tgctgctgct gctgctgctg      180

ctgctgctgc tgctgctgct gctgctgctg ctgctgctgc tgctgctgct gctgctgctg      240

ctgctgctgc tgctgctgct gctgctgctg ctgctgctgc tgctgctgct gctgctgctg      300

ctgctgctgc tgctgctgct gctgctgctg ctgctgctgc tgctgctgct gctgctgctg      360

ctgctg                                                                 366


<210>  32
<211>  216
<212>  DNA
<213>  Artificial

<220>
<223>  CTG repeat

<400>  32
ctgctgctgc tgctgctgct gctgctgctg ctgctgctgc tgctgctgct gctgctgctg       60

ctgctgctgc tgctgctgct gctgctgctg ctgctgctgc tgctgctgct gctgctgctg      120

ctgctgctgc tgctgctgct gctgctgctg ctgctgctgc tgctgctgct gctgctgctg      180

ctgctgctgc tgctgctgct gctgctgctg ctgctg                                216


<210>  33
<211>  96
<212>  DNA
<213>  Artificial

<220>
<223>  CTG repeat

<400>  33
ctgctgctgc tgctgctgct gctgctgctg ctgctgctgc tgctgctgct gctgctgctg       60

ctgctgctgc tgctgctgct gctgctgctg ctgctg                                 96


<210>  34
<211>  6
<212>  DNA
<213>  Artificial

<220>
<223>  CTG repeat

<400>  34
ctgctg                                                                   6


<210>  35
<211>  6
<212>  DNA
<213>  Artificial

<220>
<223>  CTG repeat

<400>  35
cagcag                                                                   6


<210>  36
<211>  4
<212>  DNA
<213>  Artificial

<220>
<223>  CTG repeat

<400>  36
gcag                                                                     4


<210>  37
<211>  366
<212>  DNA
<213>  Artificial

<220>
<223>  GAL10 enhancer

<400>  37
gatcaaaaat catcgcttcg ctgattaatt accccagaaa taaggctaaa aaactaatcg       60

cattatcatc ctatggttgt taatttgatt cgttcatttg aaggtttgtg gggccaggtt      120

actgccaatt tttcctcttc ataaccataa aagctagtat tgtagaatct ttattgttcg      180

gagcagtgcg gcgcgaggca catctgcgtt tcaggaacgc gaccggtgaa gacgaggacg      240

cacggaggag agtcttcctt cggagggctg tcacccgctc ggcggcttct aatccgtact      300

tcaatatagc aatgagcagt taagcgtatt actgaaagtt ccaaagagaa ggttttttta      360

ggctaa                                                                 366


<210>  38
<211>  240
<212>  DNA
<213>  Artificial

<220>
<223>  CYC1 promoter

<400>  38
tcgacctcga gcagatccgc caggcgtgta tatagcgtgg atggccaggc aactttagtg       60

ctgacacata caggcatata tatatgtgtg cgacgacaca tgatcatatg gcatgcatgt      120

gctctgtatg tatataaaac tcttgttttc ttcttttctc taaatattct ttccttatac      180

attaggtcct ttgtagcata aattactata cttctataga cacgcaaaca caaatacaca      240


<210>  39
<211>  2829
<212>  DNA
<213>  Artificial

<220>
<223>  sequence coding for the TALEN arm that recognizes the DNA target 
       site of SEQ ID NO: 10

<400>  39
atgggcgatc ctaaaaagaa acgtaaggtc atcgataagg agaccgccgc tgccaagttc       60

gagagacagc acatggacag catcgatatc gccgatctac gcacgctcgg ctacagccag      120

cagcaacagg agaagatcaa accgaaggtt cgttcgacag tggcgcagca ccacgaggca      180

ctggtcggcc acgggtttac acacgcgcac atcgttgcgt taagccaaca cccggcagcg      240

ttagggaccg tcgctgtcaa gtatcaggac atgatcgcag cgttgccaga ggcgacacac      300

gaagcgatcg ttggcgtcgg caaacagtgg tccggcgcac gcgctctgga ggccttgctc      360

acggtggcgg gagagttgag aggtccaccg ttacagttgg acacaggcca acttctcaag      420

attgcaaaac gtggcggcgt gaccgcagtg gaggcagtgc atgcatggcg caatgcactg      480

acgggtgccc cgctcaactt gaccccccag caggtggtgg ccatcgccag caataatggt      540

ggcaagcagg cgctggagac ggtccagcgg ctgttgccgg tgctgtgcca ggcccacggc      600

ttgaccccgg agcaggtggt ggccatcgcc agccacgatg gcggcaagca ggcgctggag      660

acggtccagc ggctgttgcc ggtgctgtgc caggcccacg gcttgacccc ccagcaggtg      720

gtggccatcg ccagcaatgg cggtggcaag caggcgctgg agacggtcca gcggctgttg      780

ccggtgctgt gccaggccca cggcttgacc ccccagcagg tggtggccat cgccagcaat      840

aatggtggca agcaggcgct ggagacggtc cagcggctgt tgccggtgct gtgccaggcc      900

cacggcttga ccccggagca ggtggtggcc atcgccagcc acgatggcgg caagcaggcg      960

ctggagacgg tccagcggct gttgccggtg ctgtgccagg cccacggctt gaccccccag     1020

caggtggtgg ccatcgccag caatggcggt ggcaagcagg cgctggagac ggtccagcgg     1080

ctgttgccgg tgctgtgcca ggcccacggc ttgacccccc agcaggtggt ggccatcgcc     1140

agcaataatg gtggcaagca ggcgctggag acggtccagc ggctgttgcc ggtgctgtgc     1200

caggcccacg gcttgacccc ggagcaggtg gtggccatcg ccagccacga tggcggcaag     1260

caggcgctgg agacggtcca gcggctgttg ccggtgctgt gccaggccca cggcttgacc     1320

ccccagcagg tggtggccat cgccagcaat ggcggtggca agcaggcgct ggagacggtc     1380

cagcggctgt tgccggtgct gtgccaggcc cacggcttga ccccccagca ggtggtggcc     1440

atcgccagca ataatggtgg caagcaggcg ctggagacgg tccagcggct gttgccggtg     1500

ctgtgccagg cccacggctt gaccccggag caggtggtgg ccatcgccag ccacgatggc     1560

ggcaagcagg cgctggagac ggtccagcgg ctgttgccgg tgctgtgcca ggcccacggc     1620

ttgacccccc agcaggtggt ggccatcgcc agcaatggcg gtggcaagca ggcgctggag     1680

acggtccagc ggctgttgcc ggtgctgtgc caggcccacg gcttgacccc ccagcaggtg     1740

gtggccatcg ccagcaataa tggtggcaag caggcgctgg agacggtcca gcggctgttg     1800

ccggtgctgt gccaggccca cggcttgacc ccggagcagg tggtggccat cgccagccac     1860

gatggcggca agcaggcgct ggagacggtc cagcggctgt tgccggtgct gtgccaggcc     1920

cacggcttga ccccccagca ggtggtggcc atcgccagca atggcggtgg caagcaggcg     1980

ctggagacgg tccagcggct gttgccggtg ctgtgccagg cccacggctt gacccctcag     2040

caggtggtgg ccatcgccag caatggcggc ggcaggccgg cgctggagag cattgttgcc     2100

cagttatctc gccctgatcc ggcgttggcc gcgttgacca acgaccacct cgtcgccttg     2160

gcctgcctcg gcgggcgtcc tgcgctggat gcagtgaaaa agggattggg ggatcctatc     2220

agccgttccc agctggtgaa gtccgagctg gaggagaaga aatccgagtt gaggcacaag     2280

ctgaagtacg tgccccacga gtacatcgag ctgatcgaga tcgcccggaa cagcacccag     2340

gaccgtatcc tggagatgaa ggtgatggag ttcttcatga aggtgtacgg ctacaggggc     2400

aagcacctgg gcggctccag gaagcccgac ggcgccatct acaccgtggg ctcccccatc     2460

gactacggcg tgatcgtgga caccaaggcc tactccggcg gctacaacct gcccatcggc     2520

caggccgacg aaatgcagag gtacgtggag gagaaccaga ccaggaacaa gcacatcaac     2580

cccaacgagt ggtggaaggt gtacccctcc agcgtgaccg agttcaagtt cctgttcgtg     2640

tccggccact tcaagggcaa ctacaaggcc cagctgacca ggctgaacca catcaccaac     2700

tgcaacggcg ccgtgctgtc cgtggaggag ctcctgatcg gcggcgagat gatcaaggcc     2760

ggcaccctga ccctggagga ggtgaggagg aagttcaaca acggcgagat caacttcgcg     2820

gccgactga                                                             2829


<210>  40
<211>  320
<212>  DNA
<213>  Artificial

<220>
<223>  ADH1 terminator

<400>  40
tattgaccac acctctaccg gcatgcaagc ttggcgtaat catggtcata gctgtttcct       60

gtgtgaaatt gttatccgct cacaattcca cacaacatac gagccggaag cataaagtgt      120

aaagcctggg gtgcctaatg agtgagctaa ctcacattaa ttgcgttgcg ctcactgccc      180

gctttccagt cgggaaacct gtcgtgccag cagatctgtt tagcttgcct cgtccccgcc      240

gggtcacccg gccagcgaca tggaggccca gaataccctc cttgacagtc ttgacgtgcg      300

cagctcaggg gcatgatgtg                                                  320


<210>  41
<211>  383
<212>  DNA
<213>  Artificial

<220>
<223>  TEF promoter

<400>  41
tgaggttctt ctttcatata cttcctttta aaatcttgct aggatacagt tctcacatca       60

catccgaaca taaacaacca tgcatgggta aggaaaagac tcacgtttcg aggccgcgat      120

taaattccaa catggatgct gatttatatg ggtataaatg ggctcgcgat aatgtcgggc      180

aatcaggtgc gacaatctat cgattgtatg ggaagcccga tgcgccagag ttgtttctga      240

aacatggcaa aggtagcgtt gccaatgatg ttacagatga gatggtcaga ctaaactggc      300

tgacggaatt tatgcctctt ccgaccatca agcattttat ccgtactcct gatgatgcat      360

ggttactcac cactgcgatc ccc                                              383


<210>  42
<211>  807
<212>  DNA
<213>  Artificial

<220>
<223>  sequence coding for the KANMX selection marker

<400>  42
ggcaaaacag cattccaggt attagaagaa tatcctgatt caggtgaaaa tattgttgat       60

gcgctggcag tgttcctgcg ccggttgcat tcgattcctg tttgtaattg tccttttaac      120

agcgatcgcg tatttcgcct cgctcaggcg caatcacgaa tgaataacgg tttggttgat      180

gcgagtgatt ttgatgacga gcgtaatggc tggcctgttg aacaagtctg gaaagaaatg      240

cataagcttt tgccattctc accggattca gtcgtcactc atggtgattt ctcacttgat      300

aaccttattt ttgacgaggg gaaattaata ggttgtattg atgttggacg agtcggaatc      360

gcagaccgat accaggatct tgccatccta tggaactgcc tcggtgagtt ttctccttca      420

ttacagaaac ggctttttca aaaatatggt attgataatc ctgatatgaa taaattgcag      480

tttcatttga tgctcgatga gtttttctaa tcagtactga caataaaaag attcttgttt      540

tcaagaactt gtcatttgta tagttttttt atattgtagt tgttctattt taatcaaatg      600

ttagcgtgat ttatattttt tttcgcctcg acatcatctg cccagatgcg aagttaagtg      660

cgcagaaagt aatatcatgc gtcaatcgta tgtgaatgct ggtcgctata ctgctgtcga      720

ttcgatacta acgccgccat ccagtgtcga aaacgagctc gaattcatcg atgatatcag      780

atccactagt ggcctatgcg accgcgg                                          807


<210>  43
<211>  213
<212>  DNA
<213>  Artificial

<220>
<223>  TEF terminator

<400>  43
atctgccggt ctccctatag tgagtcgtat taatttcgat aagccaggtt aacctgcatt       60

aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct tccgcttcct      120

cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gcatcgatga      180

attccacgga ctatagacta tactagtata ctc                                   213


<210>  44
<211>  1345
<212>  DNA
<213>  Artificial

<220>
<223>  2-Micron replication origin

<400>  44
gctatttttc taacaaagca tcttagatta ctttttttct cctttgtgcg ctctataatg       60

cagtctcttg ataacttttt gcactgtagg tccgttaagg ttagaagaag gctactttgg      120

tgtctatttt ctcttccata aaaaaagcct gactccactt cccgcgttta ctgattacta      180

gcgaagctgc gggtgcattt tttcaagata aaggcatccc cgattatatt ctataccgat      240

gtggattgcg catactttgt gaacagaaag tgatagcgtt gatgattctt cattggtcag      300

aaaattatga acggtttctt ctattttgtc tctatatact acgtatagga aatgtttaca      360

ttttcgtatt gttttcgatt cactctatga atagttctta ctacaatttt tttgtctaaa      420

gagtaatact agagataaac ataaaaaatg tagaggtcga gtttagatgc aagttcaagg      480

agcgaaaggt ggatgggtag gttatatagg gatatagcac agagatatat agcaaagaga      540

tacttttgag caatgtttgt ggaagcggta ttcgcaatat tttagtagct cgttacagtc      600

cggtgcgttt ttggtttttt gaaagtgcgt cttcagagcg cttttggttt tcaaaagcgc      660

tctgaagttc ctatactttc tagagaatag gaacttcgga ataggaactt caaagcgttt      720

ccgaaaacga gcgcttccga aaatgcaacg cgagctgcgc acatacagct cactgttcac      780

gtcgcaccta tatctgcgtg ttgcctgtat atatatatac atgagaagaa cggcatagtg      840

cgtgtttatg cttaaatgcg tacttatatg cgtctattta tgtaggatga aaggtagtct      900

agtacctcct gtgatattat cccattccat gcggggtatc gtatgcttcc ttcagcacta      960

ccctttagct gttctatatg ctgccactcc tcaattggat tagtctcatc cttcaatgct     1020

atcatttcct ttgatattgg atcatatgca tagtaccgag aaactagtgc gaagtagtga     1080

tcaggtattg ctgttatctg atgagtatac gttgtcctgg ccacggcaga agcacgctta     1140

tcgctccaat ttcccacaac attagtcaac tccgttaggc ccttcattga aagaaatgag     1200

gtcatcaaat gtcttccaat gtgagatttt gggccatttt ttatagcaaa gattgaataa     1260

ggcgcatttt tcttcaaagc tttattgtac gatctgacta agttatcttt taataattgg     1320

tattcctgtt tattgcttga agaat                                           1345


<210>  45
<211>  1530
<212>  DNA
<213>  Artificial

<220>
<223>  sequence coding for the TAL effector tandem repeat of the TALEN 
       arm that binds to the DNA target site of SEQ ID NO: 10 (15 
       adjacent units of 34 amino acids)

<400>  45
ttgacccccc agcaggtggt ggccatcgcc agcaataatg gtggcaagca ggcgctggag       60

acggtccagc ggctgttgcc ggtgctgtgc caggcccacg gcttgacccc ggagcaggtg      120

gtggccatcg ccagccacga tggcggcaag caggcgctgg agacggtcca gcggctgttg      180

ccggtgctgt gccaggccca cggcttgacc ccccagcagg tggtggccat cgccagcaat      240

ggcggtggca agcaggcgct ggagacggtc cagcggctgt tgccggtgct gtgccaggcc      300

cacggcttga ccccccagca ggtggtggcc atcgccagca ataatggtgg caagcaggcg      360

ctggagacgg tccagcggct gttgccggtg ctgtgccagg cccacggctt gaccccggag      420

caggtggtgg ccatcgccag ccacgatggc ggcaagcagg cgctggagac ggtccagcgg      480

ctgttgccgg tgctgtgcca ggcccacggc ttgacccccc agcaggtggt ggccatcgcc      540

agcaatggcg gtggcaagca ggcgctggag acggtccagc ggctgttgcc ggtgctgtgc      600

caggcccacg gcttgacccc ccagcaggtg gtggccatcg ccagcaataa tggtggcaag      660

caggcgctgg agacggtcca gcggctgttg ccggtgctgt gccaggccca cggcttgacc      720

ccggagcagg tggtggccat cgccagccac gatggcggca agcaggcgct ggagacggtc      780

cagcggctgt tgccggtgct gtgccaggcc cacggcttga ccccccagca ggtggtggcc      840

atcgccagca atggcggtgg caagcaggcg ctggagacgg tccagcggct gttgccggtg      900

ctgtgccagg cccacggctt gaccccccag caggtggtgg ccatcgccag caataatggt      960

ggcaagcagg cgctggagac ggtccagcgg ctgttgccgg tgctgtgcca ggcccacggc     1020

ttgaccccgg agcaggtggt ggccatcgcc agccacgatg gcggcaagca ggcgctggag     1080

acggtccagc ggctgttgcc ggtgctgtgc caggcccacg gcttgacccc ccagcaggtg     1140

gtggccatcg ccagcaatgg cggtggcaag caggcgctgg agacggtcca gcggctgttg     1200

ccggtgctgt gccaggccca cggcttgacc ccccagcagg tggtggccat cgccagcaat     1260

aatggtggca agcaggcgct ggagacggtc cagcggctgt tgccggtgct gtgccaggcc     1320

cacggcttga ccccggagca ggtggtggcc atcgccagcc acgatggcgg caagcaggcg     1380

ctggagacgg tccagcggct gttgccggtg ctgtgccagg cccacggctt gaccccccag     1440

caggtggtgg ccatcgccag caatggcggt ggcaagcagg cgctggagac ggtccagcgg     1500

ctgttgccgg tgctgtgcca ggcccacggc                                      1530


<210>  46
<211>  34
<212>  PRT
<213>  Artificial

<220>
<223>  TAL effector tandem repeat unit [XX is selected from the group 
       consisting of HD, NG, NI, NN, NS, N*, HG, H*, IG, HA, ND, NK, HI,
       HN, NA, SN and YG (the symbol * denotes that the second X is 
       missing)]


<220>
<221>  MISC_FEATURE
<222>  (12)..(13)
<223>  XX is the RVD of the TAL effector tandem repeat unit; XX is 
       selected from the group consisting of HD, NG, NI, NN, NS, N*, HG,
       H*, IG, HA, ND, NK, HI, HN, NA, SN and YG (the symbol * denotes 
       that the second X is missing)

<400>  46

Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Xaa Xaa Gly Gly Lys 
1               5                   10                  15      


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 
            20                  25                  30          


His Gly 
        


<210>  47
<211>  60
<212>  DNA
<213>  Artificial

<220>
<223>  (non-specific) C-terminal truncated unit of 20 amino acids of the
       TALEN arm that binds to the DNA target site of SEQ ID NO: 10

<400>  47
ttgacccctc agcaggtggt ggccatcgcc agcaatggcg gcggcaggcc ggcgctggag       60


<210>  48
<211>  20
<212>  PRT
<213>  Artificial

<220>
<223>  (non-specific) C-terminal truncated unit of 20 amino acids of the
       TALEN arm that binds to the DNA target site of SEQ ID NO: 10

<400>  48

Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Arg 
1               5                   10                  15      


Pro Ala Leu Glu 
            20  


<210>  49
<211>  199
<212>  PRT
<213>  Artificial

<220>
<223>  FokI monomer

<400>  49

Gln Leu Val Lys Ser Glu Leu Glu Glu Lys Lys Ser Glu Leu Arg His 
1               5                   10                  15      


Lys Leu Lys Tyr Val Pro His Glu Tyr Ile Glu Leu Ile Glu Ile Ala 
            20                  25                  30          


Arg Asn Ser Thr Gln Asp Arg Ile Leu Glu Met Lys Val Met Glu Phe 
        35                  40                  45              


Phe Met Lys Val Tyr Gly Tyr Arg Gly Lys His Leu Gly Gly Ser Arg 
    50                  55                  60                  


Lys Pro Asp Gly Ala Ile Tyr Thr Val Gly Ser Pro Ile Asp Tyr Gly 
65                  70                  75                  80  


Val Ile Val Asp Thr Lys Ala Tyr Ser Gly Gly Tyr Asn Leu Pro Ile 
                85                  90                  95      


Gly Gln Ala Asp Glu Met Gln Arg Tyr Val Glu Glu Asn Gln Thr Arg 
            100                 105                 110         


Asn Lys His Ile Asn Pro Asn Glu Trp Trp Lys Val Tyr Pro Ser Ser 
        115                 120                 125             


Val Thr Glu Phe Lys Phe Leu Phe Val Ser Gly His Phe Lys Gly Asn 
    130                 135                 140                 


Tyr Lys Ala Gln Leu Thr Arg Leu Asn His Ile Thr Asn Cys Asn Gly 
145                 150                 155                 160 


Ala Val Leu Ser Val Glu Glu Leu Leu Ile Gly Gly Glu Met Ile Lys 
                165                 170                 175     


Ala Gly Thr Leu Thr Leu Glu Glu Val Arg Arg Lys Phe Asn Asn Gly 
            180                 185                 190         


Glu Ile Asn Phe Ala Ala Asp 
        195                 


<210>  50
<211>  2811
<212>  DNA
<213>  Artificial

<220>
<223>  sequence coding for the TALEN arm that binds to the DNA target 
       site of SEQ ID NO: 4

<400>  50
atgggcgatc ctaaaaagaa acgtaaggtc atcgattacc catacgatgt tccagattac       60

gctatcgata tcgccgatct acgcacgctc ggctacagcc agcagcaaca ggagaagatc      120

aaaccgaagg ttcgttcgac agtggcgcag caccacgagg cactggtcgg ccacgggttt      180

acacacgcgc acatcgttgc gttaagccaa cacccggcag cgttagggac cgtcgctgtc      240

aagtatcagg acatgatcgc agcgttgcca gaggcgacac acgaagcgat cgttggcgtc      300

ggcaaacagt ggtccggcgc acgcgctctg gaggccttgc tcacggtggc gggagagttg      360

agaggtccac cgttacagtt ggacacaggc caacttctca agattgcaaa acgtggcggc      420

gtgaccgcag tggaggcagt gcatgcatgg cgcaatgcac tgacgggtgc cccgctcaac      480

ttgacccccc agcaggtggt ggccatcgcc agcaataatg gtggcaagca ggcgctggag      540

acggtccagc ggctgttgcc ggtgctgtgc caggcccacg gcttgacccc ccagcaggtg      600

gtggccatcg ccagcaatgg cggtggcaag caggcgctgg agacggtcca gcggctgttg      660

ccggtgctgt gccaggccca cggcttgacc ccccagcagg tggtggccat cgccagcaat      720

aatggtggca agcaggcgct ggagacggtc cagcggctgt tgccggtgct gtgccaggcc      780

cacggcttga ccccggagca ggtggtggcc atcgccagca atattggtgg caagcaggcg      840

ctggagacgg tgcaggcgct gttgccggtg ctgtgccagg cccacggctt gaccccccag      900

caggtggtgg ccatcgccag caatggcggt ggcaagcagg cgctggagac ggtccagcgg      960

ctgttgccgg tgctgtgcca ggcccacggc ttgaccccgg agcaggtggt ggccatcgcc     1020

agccacgatg gcggcaagca ggcgctggag acggtccagc ggctgttgcc ggtgctgtgc     1080

caggcccacg gcttgacccc ggagcaggtg gtggccatcg ccagccacga tggcggcaag     1140

caggcgctgg agacggtcca gcggctgttg ccggtgctgt gccaggccca cggcttgacc     1200

ccggagcagg tggtggccat cgccagccac gatggcggca agcaggcgct ggagacggtc     1260

cagcggctgt tgccggtgct gtgccaggcc cacggcttga ccccggagca ggtggtggcc     1320

atcgccagcc acgatggcgg caagcaggcg ctggagacgg tccagcggct gttgccggtg     1380

ctgtgccagg cccacggctt gaccccggag caggtggtgg ccatcgccag ccacgatggc     1440

ggcaagcagg cgctggagac ggtccagcgg ctgttgccgg tgctgtgcca ggcccacggc     1500

ttgaccccgg agcaggtggt ggccatcgcc agccacgatg gcggcaagca ggcgctggag     1560

acggtccagc ggctgttgcc ggtgctgtgc caggcccacg gcttgacccc ggagcaggtg     1620

gtggccatcg ccagcaatat tggtggcaag caggcgctgg agacggtgca ggcgctgttg     1680

ccggtgctgt gccaggccca cggcttgacc ccccagcagg tggtggccat cgccagcaat     1740

aatggtggca agcaggcgct ggagacggtc cagcggctgt tgccggtgct gtgccaggcc     1800

cacggcttga ccccggagca ggtggtggcc atcgccagcc acgatggcgg caagcaggcg     1860

ctggagacgg tccagcggct gttgccggtg ctgtgccagg cccacggctt gaccccggag     1920

caggtggtgg ccatcgccag caatattggt ggcaagcagg cgctggagac ggtgcaggcg     1980

ctgttgccgg tgctgtgcca ggcccacggc ttgacccctc agcaggtggt ggccatcgcc     2040

agcaatggcg gcggcaggcc ggcgctggag agcattgttg cccagttatc tcgccctgat     2100

ccggcgttgg ccgcgttgac caacgaccac ctcgtcgcct tggcctgcct cggcgggcgt     2160

cctgcgctgg atgcagtgaa aaagggattg ggggatccta tcagccgttc ccagctggtg     2220

aagtccgagc tggaggagaa gaaatccgag ttgaggcaca agctgaagta cgtgccccac     2280

gagtacatcg agctgatcga gatcgcccgg aacagcaccc aggaccgtat cctggagatg     2340

aaggtgatgg agttcttcat gaaggtgtac ggctacaggg gcaagcacct gggcggctcc     2400

aggaagcccg acggcgccat ctacaccgtg ggctccccca tcgactacgg cgtgatcgtg     2460

gacaccaagg cctactccgg cggctacaac ctgcccatcg gccaggccga cgaaatgcag     2520

aggtacgtgg aggagaacca gaccaggaac aagcacatca accccaacga gtggtggaag     2580

gtgtacccct ccagcgtgac cgagttcaag ttcctgttcg tgtccggcca cttcaagggc     2640

aactacaagg cccagctgac caggctgaac cacatcacca actgcaacgg cgccgtgctg     2700

tccgtggagg agctcctgat cggcggcgag atgatcaagg ccggcaccct gaccctggag     2760

gaggtgagga ggaagttcaa caacggcgag atcaacttcg cggccgactg a              2811


<210>  51
<211>  320
<212>  DNA
<213>  Artificial

<220>
<223>  ADH1 terminator

<400>  51
tttggacttc ttcgccagag gtttggtcaa gtctccaatc aaggttgtcg gcttgtctac       60

cttgccagaa atttacgaaa agatggaaaa gggtcaaatc gttggtagat acgttgttga      120

cacttctaaa taagcgaatt tcttatgatt tatgattttt attattaaat aagttataaa      180

aaaaataagt gtatacaaat tttaaagtga ctcttaggtt ttaaaacgaa aattcttatt      240

cttgagtaac tctttcctgt aggtcaggtt gctttctcag gtatagcatg aggtcgctct      300

tattgaccac acctctaccg                                                  320


<210>  52
<211>  1095
<212>  DNA
<213>  Artificial

<220>
<223>  sequence coding for the LEU2 selection marker

<400>  52
atgtctgccc ctaagaagat cgtcgttttg ccaggtgacc acgttggtca agaaatcaca       60

gccgaagcca ttaaggttct taaagctatt tctgatgttc gttccaatgt caagttcgat      120

ttcgaaaatc atttaattgg tggtgctgct atcgatgcta caggtgtccc acttccagat      180

gaggcgctgg aagcctccaa gaaggttgat gccgttttgt taggtgctgt gggtggtcct      240

aaatggggta ccggtagtgt tagacctgaa caaggtttac taaaaatccg taaagaactt      300

caattgtacg ccaacttaag accatgtaac tttgcatccg actctctttt agacttatct      360

ccaatcaagc cacaatttgc taaaggtact gacttcgttg ttgtcagaga attagtggga      420

ggtatttact ttggtaagag aaaggaagac gatggtgatg gtgtcgcttg ggatagtgaa      480

caatacaccg ttccagaagt gcaaagaatc acaagaatgg ccgctttcat ggccctacaa      540

catgagccac cattgcctat ttggtccttg gataaagcta atgttttggc ctcttcaaga      600

ttatggagaa aaactgtgga ggaaaccatc aagaacgaat tccctacatt gaaggttcaa      660

catcaattga ttgattctgc cgccatgatc ctagttaaga acccaaccca cctaaatggt      720

attataatca ccagcaacat gtttggtgat atcatctccg atgaagcctc cgttatccca      780

ggttccttgg gtttgttgcc atctgcgtcc ttggcctctt tgccagacaa gaacaccgca      840

tttggtttgt acgaaccatg ccacggttct gctccagatt tgccaaagaa taaggtcaac      900

cctatcgcca ctatcttgtc tgctgcaatg atgttgaaat tgtcattgaa cttgcctgaa      960

gaaggtaagg ccattgaaga tgcagttaaa aaggttttgg atgcaggtat cagaactggt     1020

gatttaggtg gttccaacag taccacggaa gtcggtgatg ctgtcgccga agaagttaag     1080

aaaatccttg cttaa                                                      1095


<210>  53
<211>  1345
<212>  DNA
<213>  Artificial

<220>
<223>  2-Micron replication origin

<400>  53
aacgaagcat ctgtgcttca ttttgtagaa caaaaatgca acgcgagagc gctaattttt       60

caaacaaaga atctgagctg catttttaca gaacagaaat gcaacgcgaa agcgctattt      120

taccaacgaa gaatctgtgc ttcatttttg taaaacaaaa atgcaacgcg agagcgctaa      180

tttttcaaac aaagaatctg agctgcattt ttacagaaca gaaatgcaac gcgagagcgc      240

tattttacca acaaagaatc tatacttctt ttttgttcta caaaaatgca tcccgagagc      300

gctatttttc taacaaagca tcttagatta ctttttttct cctttgtgcg ctctataatg      360

cagtctcttg ataacttttt gcactgtagg tccgttaagg ttagaagaag gctactttgg      420

tgtctatttt ctcttccata aaaaaagcct gactccactt cccgcgttta ctgattacta      480

gcgaagctgc gggtgcattt tttcaagata aaggcatccc cgattatatt ctataccgat      540

gtggattgcg catactttgt gaacagaaag tgatagcgtt gatgattctt cattggtcag      600

aaaattatga acggtttctt ctattttgtc tctatatact acgtatagga aatgtttaca      660

ttttcgtatt gttttcgatt cactctatga atagttctta ctacaatttt tttgtctaaa      720

gagtaatact agagataaac ataaaaaatg tagaggtcga gtttagatgc aagttcaagg      780

agcgaaaggt ggatgggtag gttatatagg gatatagcac agagatatat agcaaagaga      840

tacttttgag caatgtttgt ggaagcggta ttcgcaatat tttagtagct cgttacagtc      900

cggtgcgttt ttggtttttt gaaagtgcgt cttcagagcg cttttggttt tcaaaagcgc      960

tctgaagttc ctatactttc tagagaatag gaacttcgga ataggaactt caaagcgttt     1020

ccgaaaacga gcgcttccga aaatgcaacg cgagctgcgc acatacagct cactgttcac     1080

gtcgcaccta tatctgcgtg ttgcctgtat atatatatac atgagaagaa cggcatagtg     1140

cgtgtttatg cttaaatgcg tacttatatg cgtctattta tgtaggatga aaggtagtct     1200

agtacctcct gtgatattat cccattccat gcggggtatc gtatgcttcc ttcagcacta     1260

ccctttagct gttctatatg ctgccactcc tcaattggat tagtctcatc cttcaatgct     1320

atcatttcct ttgatattgg atcat                                           1345


<210>  54
<211>  1530
<212>  DNA
<213>  Artificial

<220>
<223>  sequence coding for the TAL effector tandem repeat of the TALEN 
       arm that binds to the DNA target site of SEQ ID NO: 4 (15 
       adjacent units of 34 amino acids)

<400>  54
ttgacccccc agcaggtggt ggccatcgcc agcaataatg gtggcaagca ggcgctggag       60

acggtccagc ggctgttgcc ggtgctgtgc caggcccacg gcttgacccc ccagcaggtg      120

gtggccatcg ccagcaatgg cggtggcaag caggcgctgg agacggtcca gcggctgttg      180

ccggtgctgt gccaggccca cggcttgacc ccccagcagg tggtggccat cgccagcaat      240

aatggtggca agcaggcgct ggagacggtc cagcggctgt tgccggtgct gtgccaggcc      300

cacggcttga ccccggagca ggtggtggcc atcgccagca atattggtgg caagcaggcg      360

ctggagacgg tgcaggcgct gttgccggtg ctgtgccagg cccacggctt gaccccccag      420

caggtggtgg ccatcgccag caatggcggt ggcaagcagg cgctggagac ggtccagcgg      480

ctgttgccgg tgctgtgcca ggcccacggc ttgaccccgg agcaggtggt ggccatcgcc      540

agccacgatg gcggcaagca ggcgctggag acggtccagc ggctgttgcc ggtgctgtgc      600

caggcccacg gcttgacccc ggagcaggtg gtggccatcg ccagccacga tggcggcaag      660

caggcgctgg agacggtcca gcggctgttg ccggtgctgt gccaggccca cggcttgacc      720

ccggagcagg tggtggccat cgccagccac gatggcggca agcaggcgct ggagacggtc      780

cagcggctgt tgccggtgct gtgccaggcc cacggcttga ccccggagca ggtggtggcc      840

atcgccagcc acgatggcgg caagcaggcg ctggagacgg tccagcggct gttgccggtg      900

ctgtgccagg cccacggctt gaccccggag caggtggtgg ccatcgccag ccacgatggc      960

ggcaagcagg cgctggagac ggtccagcgg ctgttgccgg tgctgtgcca ggcccacggc     1020

ttgaccccgg agcaggtggt ggccatcgcc agccacgatg gcggcaagca ggcgctggag     1080

acggtccagc ggctgttgcc ggtgctgtgc caggcccacg gcttgacccc ggagcaggtg     1140

gtggccatcg ccagcaatat tggtggcaag caggcgctgg agacggtgca ggcgctgttg     1200

ccggtgctgt gccaggccca cggcttgacc ccccagcagg tggtggccat cgccagcaat     1260

aatggtggca agcaggcgct ggagacggtc cagcggctgt tgccggtgct gtgccaggcc     1320

cacggcttga ccccggagca ggtggtggcc atcgccagcc acgatggcgg caagcaggcg     1380

ctggagacgg tccagcggct gttgccggtg ctgtgccagg cccacggctt gaccccggag     1440

caggtggtgg ccatcgccag caatattggt ggcaagcagg cgctggagac ggtgcaggcg     1500

ctgttgccgg tgctgtgcca ggcccacggc                                      1530


<210>  55
<211>  34
<212>  PRT
<213>  Artificial

<220>
<223>  TAL effector tandem repeat unit [XX is selected from the group 
       consisting of HD, NG, NI, NN, NS, N*, HG, H*, IG, HA, ND, NK, HI,
       HN, NA, SN and YG (the symbol * denotes that the second X is 
       missing)]


<220>
<221>  MISC_FEATURE
<222>  (12)..(13)
<223>  XX is the RVD of the TAL effector tandem repeat unit; XX is 
       selected from the group consisting of HD, NG, NI, NN, NS, N*, HG,
       H*, IG, HA, ND, NK, HI, HN, NA, SN and YG (the symbol * denotes 
       that the second X is missing)

<400>  55

Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Xaa Xaa Gly Gly Lys 
1               5                   10                  15      


Gln Ala Leu Glu Thr Val Gln Ala Leu Leu Pro Val Leu Cys Gln Ala 
            20                  25                  30          


His Gly 
        


<210>  56
<211>  14
<212>  PRT
<213>  Artificial

<220>
<223>  (non-specific) C-terminal truncated unit of 14 amino acids of the
       TALEN arm that binds to the DNA target site of SEQ ID NO: 10

<400>  56

Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Gly Gly 
1               5                   10                  


