                         SEQUENCE LISTING

<110>  BASF Plant Science Company GmbH
 
<120>  Optimized Endonucleases and Uses Thereof

<130>  PF 71110

<150>  09177375.4
<151>  2009-11-27

<150>  61/264715
<151>  2009-11-27

<150>  10170199.3
<151>  2010-07-20

<150>  61/365836
<151>  2010-07-20

<160>  30    

<170>  PatentIn version 3.5

<210>  1
<211>  235
<212>  PRT
<213>  Saccharomyces cerevisiae

<400>  1

Met Lys Asn Ile Lys Lys Asn Gln Val Met Asn Leu Gly Pro Asn Ser 
1               5                   10                  15      


Lys Leu Leu Lys Glu Tyr Lys Ser Gln Leu Ile Glu Leu Asn Ile Glu 
            20                  25                  30          


Gln Phe Glu Ala Gly Ile Gly Leu Ile Leu Gly Asp Ala Tyr Ile Arg 
        35                  40                  45              


Ser Arg Asp Glu Gly Lys Thr Tyr Cys Met Gln Phe Glu Trp Lys Asn 
    50                  55                  60                  


Lys Ala Tyr Met Asp His Val Cys Leu Leu Tyr Asp Gln Trp Val Leu 
65                  70                  75                  80  


Ser Pro Pro His Lys Lys Glu Arg Val Asn His Leu Gly Asn Leu Val 
                85                  90                  95      


Ile Thr Trp Gly Ala Gln Thr Phe Lys His Gln Ala Phe Asn Lys Leu 
            100                 105                 110         


Ala Asn Leu Phe Ile Val Asn Asn Lys Lys Thr Ile Pro Asn Asn Leu 
        115                 120                 125             


Val Glu Asn Tyr Leu Thr Pro Met Ser Leu Ala Tyr Trp Phe Met Asp 
    130                 135                 140                 


Asp Gly Gly Lys Trp Asp Tyr Asn Lys Asn Ser Thr Asn Lys Ser Ile 
145                 150                 155                 160 


Val Leu Asn Thr Gln Ser Phe Thr Phe Glu Glu Val Glu Tyr Leu Val 
                165                 170                 175     


Lys Gly Leu Arg Asn Lys Phe Gln Leu Asn Cys Tyr Val Lys Ile Asn 
            180                 185                 190         


Lys Asn Lys Pro Ile Ile Tyr Ile Asp Ser Met Ser Tyr Leu Ile Phe 
        195                 200                 205             


Tyr Asn Leu Ile Lys Pro Tyr Leu Ile Pro Gln Met Met Tyr Lys Leu 
    210                 215                 220                 


Pro Asn Thr Ile Ser Ser Glu Thr Phe Leu Lys 
225                 230                 235 


<210>  2
<211>  236
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  derived from I-SceI sequence

<400>  2

Met Gly Lys Asn Ile Lys Lys Asn Gln Val Met Asn Leu Gly Pro Asn 
1               5                   10                  15      


Ser Lys Leu Leu Lys Glu Tyr Lys Ser Gln Leu Ile Glu Leu Asn Ile 
            20                  25                  30          


Glu Gln Phe Glu Ala Gly Ile Gly Leu Ile Leu Gly Asp Ala Tyr Ile 
        35                  40                  45              


Arg Ser Arg Asp Glu Gly Lys Thr Tyr Cys Met Gln Phe Glu Trp Lys 
    50                  55                  60                  


Asn Lys Ala Tyr Met Asp His Val Cys Leu Leu Tyr Asp Gln Trp Val 
65                  70                  75                  80  


Leu Ser Pro Pro His Lys Lys Glu Arg Val Asn His Leu Gly Asn Leu 
                85                  90                  95      


Val Ile Thr Trp Gly Ala Gln Thr Phe Lys His Gln Ala Phe Asn Lys 
            100                 105                 110         


Leu Ala Asn Leu Phe Ile Val Asn Asn Lys Lys Thr Ile Pro Asn Asn 
        115                 120                 125             


Leu Val Glu Asn Tyr Leu Thr Pro Met Ser Leu Ala Tyr Trp Phe Met 
    130                 135                 140                 


Asp Asp Gly Gly Lys Trp Asp Tyr Asn Lys Asn Ser Thr Asn Lys Ser 
145                 150                 155                 160 


Ile Val Leu Asn Thr Gln Ser Phe Thr Phe Glu Glu Val Glu Tyr Leu 
                165                 170                 175     


Val Lys Gly Leu Arg Asn Lys Phe Gln Leu Asn Cys Tyr Val Lys Ile 
            180                 185                 190         


Asn Lys Asn Lys Pro Ile Ile Tyr Ile Asp Ser Met Ser Tyr Leu Ile 
        195                 200                 205             


Phe Tyr Asn Leu Ile Lys Pro Tyr Leu Ile Pro Gln Met Met Tyr Lys 
    210                 215                 220                 


Leu Pro Asn Thr Ile Ser Ser Glu Thr Phe Leu Lys 
225                 230                 235     


<210>  3
<211>  227
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  derived from I-SceI sequence

<400>  3

Met Gly Lys Asn Ile Lys Lys Asn Gln Val Met Asn Leu Gly Pro Asn 
1               5                   10                  15      


Ser Lys Leu Leu Lys Glu Tyr Lys Ser Gln Leu Ile Glu Leu Asn Ile 
            20                  25                  30          


Glu Gln Phe Glu Ala Gly Ile Gly Leu Ile Leu Gly Asp Ala Tyr Ile 
        35                  40                  45              


Arg Ser Arg Asp Glu Gly Lys Thr Tyr Cys Met Gln Phe Glu Trp Lys 
    50                  55                  60                  


Asn Lys Ala Tyr Met Asp His Val Cys Leu Leu Tyr Asp Gln Trp Val 
65                  70                  75                  80  


Leu Ser Pro Pro His Lys Lys Glu Arg Val Asn His Leu Gly Asn Leu 
                85                  90                  95      


Val Ile Thr Trp Gly Ala Gln Thr Phe Lys His Gln Ala Phe Asn Lys 
            100                 105                 110         


Leu Ala Asn Leu Phe Ile Val Asn Asn Lys Lys Thr Ile Pro Asn Asn 
        115                 120                 125             


Leu Val Glu Asn Tyr Leu Thr Pro Met Ser Leu Ala Tyr Trp Phe Met 
    130                 135                 140                 


Asp Asp Gly Gly Lys Trp Asp Tyr Asn Lys Asn Ser Thr Asn Lys Ser 
145                 150                 155                 160 


Ile Val Leu Asn Thr Gln Ser Phe Thr Phe Glu Glu Val Glu Tyr Leu 
                165                 170                 175     


Val Lys Gly Leu Arg Asn Lys Phe Gln Leu Asn Cys Tyr Val Lys Ile 
            180                 185                 190         


Asn Lys Asn Lys Pro Ile Ile Tyr Ile Asp Ser Met Ser Tyr Leu Ile 
        195                 200                 205             


Phe Tyr Asn Leu Ile Lys Pro Tyr Leu Ile Pro Gln Met Met Tyr Lys 
    210                 215                 220                 


Leu Pro Asn 
225         


<210>  4
<211>  7
<212>  PRT
<213>  SV40

<400>  4

Pro Lys Lys Lys Arg Lys Val 
1               5           


<210>  5
<211>  234
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  derived from I-SceI sequence ; S. cerevisiae

<400>  5

Met Gly Pro Lys Lys Lys Arg Lys Val Lys Asn Ile Lys Lys Asn Gln 
1               5                   10                  15      


Val Met Asn Leu Gly Pro Asn Ser Lys Leu Leu Lys Glu Tyr Lys Ser 
            20                  25                  30          


Gln Leu Ile Glu Leu Asn Ile Glu Gln Phe Glu Ala Gly Ile Gly Leu 
        35                  40                  45              


Ile Leu Gly Asp Ala Tyr Ile Arg Ser Arg Asp Glu Gly Lys Thr Tyr 
    50                  55                  60                  


Cys Met Gln Phe Glu Trp Lys Asn Lys Ala Tyr Met Asp His Val Cys 
65                  70                  75                  80  


Leu Leu Tyr Asp Gln Trp Val Leu Ser Pro Pro His Lys Lys Glu Arg 
                85                  90                  95      


Val Asn His Leu Gly Asn Leu Val Ile Thr Trp Gly Ala Gln Thr Phe 
            100                 105                 110         


Lys His Gln Ala Phe Asn Lys Leu Ala Asn Leu Phe Ile Val Asn Asn 
        115                 120                 125             


Lys Lys Thr Ile Pro Asn Asn Leu Val Glu Asn Tyr Leu Thr Pro Met 
    130                 135                 140                 


Ser Leu Ala Tyr Trp Phe Met Asp Asp Gly Gly Lys Trp Asp Tyr Asn 
145                 150                 155                 160 


Lys Asn Ser Thr Asn Lys Ser Ile Val Leu Asn Thr Gln Ser Phe Thr 
                165                 170                 175     


Phe Glu Glu Val Glu Tyr Leu Val Lys Gly Leu Arg Asn Lys Phe Gln 
            180                 185                 190         


Leu Asn Cys Tyr Val Lys Ile Asn Lys Asn Lys Pro Ile Ile Tyr Ile 
        195                 200                 205             


Asp Ser Met Ser Tyr Leu Ile Phe Tyr Asn Leu Ile Lys Pro Tyr Leu 
    210                 215                 220                 


Ile Pro Gln Met Met Tyr Lys Leu Pro Asn 
225                 230                 


<210>  6
<211>  18
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  chimeric recognition site

<400>  6
tagggataac agggtaat                                                     18


<210>  7
<211>  4065
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  plasmid


<220>
<221>  misc_feature
<222>  (3)..(12)
<223>  n is a, c, g, or t

<400>  7
ccnnnnnnnn nngaattcga agcttgggcc cgaacaaaaa ctcatctcag aagaggatct       60

gaatagcgcc gtcgaccatc atcatcatca tcattgagtt taaacggtct ccagcttggc      120

tgttttggcg gatgagagaa gattttcagc ctgatacaga ttaaatcaga acgcagaagc      180

ggtctgataa aacagaattt gcctggcggc agtagcgcgg tggtcccacc tgaccccatg      240

ccgaactcag aagtgaaacg ccgtagcgcc gatggtagtg tggggtctcc ccatgcgaga      300

gtagggaact gccaggcatc aaataaaacg aaaggctcag tcgaaagact gggcctttcg      360

ttttatctgt tgtttgtcgg tgaacgctct cctgagtagg acaaatccgc cgggagcgga      420

tttgaacgtt gcgaagcaac ggcccggagg gtggcgggca ggacgcccgc cataaactgc      480

caggcatcaa attaagcaga aggccatcct gacggatggc ctttttgcgt ttctacaaac      540

tcttttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct      600

gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat ttccgtgtcg      660

cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg      720

tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc gaactggatc      780

tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca atgatgagca      840

cttttaaagt tctgctatgt ggcgcggtat tatcccgtgt tgacgccggg caagagcaac      900

tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca gtcacagaaa      960

agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata accatgagtg     1020

ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag ctaaccgctt     1080

ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg gagctgaatg     1140

aagccatacc aaacgacgag cgtgacacca cgatgcctgt agcaatggca acaacgttgc     1200

gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta atagactgga     1260

tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct ggctggttta     1320

ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca gcactggggc     1380

cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag gcaactatgg     1440

atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat tggtaactgt     1500

cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt taatttaaaa     1560

ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt     1620

cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt     1680

ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt     1740

tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga     1800

taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag     1860

caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata     1920

agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg     1980

gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga     2040

gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca     2100

ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa     2160

acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt     2220

tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac     2280

ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta tcccctgatt     2340

ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc agccgaacga     2400

ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg tattttctcc     2460

ttacgcatct gtgcggtatt tcacaccgca tatggtgcac tctcagtaca atctgctctg     2520

atgccgcata gttaagccag tatacactcc gctatcgcta cgtgactggg tcatggctgc     2580

gccccgacac ccgccaacac ccgctgacgc gccctgacgg gcttgtctgc tcccggcatc     2640

cgcttacaga caagctgtga ccgtctccgg gagctgcatg tgtcagaggt tttcaccgtc     2700

atcaccgaaa cgcgcgaggc agcagatcaa ttcgcgcgcg aaggcgaagc ggcatgcata     2760

atgtgcctgt caaatggacg aagcagggat tctgcaaacc ctatgctact ccgtcaagcc     2820

gtcaattgtc tgattcgtta ccaattatga caacttgacg gctacatcat tcactttttc     2880

ttcacaaccg gcacggaact cgctcgggct ggccccggtg cattttttaa atacccgcga     2940

gaaatagagt tgatcgtcaa aaccaacatt gcgaccgacg gtggcgatag gcatccgggt     3000

ggtgctcaaa agcagcttcg cctggctgat acgttggtcc tcgcgccagc ttaagacgct     3060

aatccctaac tgctggcgga aaagatgtga cagacgcgac ggcgacaagc aaacatgctg     3120

tgcgacgctg gcgatatcaa aattgctgtc tgccaggtga tcgctgatgt actgacaagc     3180

ctcgcgtacc cgattatcca tcggtggatg gagcgactcg ttaatcgctt ccatgcgccg     3240

cagtaacaat tgctcaagca gatttatcgc cagcagctcc gaatagcgcc cttccccttg     3300

cccggcgtta atgatttgcc caaacaggtc gctgaaatgc ggctggtgcg cttcatccgg     3360

gcgaaagaac cccgtattgg caaatattga cggccagtta agccattcat gccagtaggc     3420

gcgcggacga aagtaaaccc actggtgata ccattcgcga gcctccggat gacgaccgta     3480

gtgatgaatc tctcctggcg ggaacagcaa aatatcaccc ggtcggcaaa caaattctcg     3540

tccctgattt ttcaccaccc cctgaccgcg aatggtgaga ttgagaatat aacctttcat     3600

tcccagcggt cggtcgataa aaaaatcgag ataaccgttg gcctcaatcg gcgttaaacc     3660

cgccaccaga tgggcattaa acgagtatcc cggcagcagg ggatcatttt gcgcttcagc     3720

catacttttc atactcccgc cattcagaga agaaaccaat tgtccatatt gcatcagaca     3780

ttgccgtcac tgcgtctttt actggctctt ctcgctaacc aaaccggtaa ccccgcttat     3840

taaaagcatt ctgtaacaaa gcgggaccaa agccatgaca aaaacgcgta acaaaagtgt     3900

ctataatcac ggcagaaaag tccacattga ttatttgcac ggcgtcacac tttgctatgc     3960

catagcattt ttatccataa gattagcgga tcctacctga cgctttttat cgcaactctc     4020

tactgtttct ccatacccgt tttttgggct aacaggagga attaa                     4065


<210>  8
<211>  711
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Insert of VC-SAH40-4

<400>  8
atgggtaaga acattaagaa gaaccaggtg atgaacctgg gccctaactc taagctgctt       60

aaggaataca agtctcagct gattgagctg aacattgagc agttcgaggc tggcataggc      120

ctgattctgg gcgatgctta cattaggtct agggatgagg gcaagaccta ctgcatgcag      180

ttcgagtgga agaacaaggc ttacatggat cacgtgtgcc tgctgtacga tcagtgggtg      240

ctgtctcctc ctcacaagaa ggagagggtg aaccacttgg gaaacctggt gattacctgg      300

ggcgctcaaa ccttcaagca ccaggctttc aacaagctgg ctaacctgtt cattgtgaac      360

aacaagaaga ccattcctaa caacctggtg gagaactacc tgacccctat gtctctggct      420

tactggttca tggatgatgg cggcaagtgg gattacaaca agaactctac caacaagtct      480

attgtgctga acacccagtc tttcaccttc gaggaggtgg aatacctggt gaagggcctg      540

aggaacaagt tccagctgaa ctgctacgtg aagattaaca agaacaagcc tattatttac      600

attgattcta tgtcttacct gattttctac aacctgatta agccttacct gattcctcag      660

atgatgtaca agctgcctaa caccatctct tctgagacct tcctgaagtg a               711


<210>  9
<211>  4905
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Construct II


<220>
<221>  misc_feature
<222>  (1218)..(1227)
<223>  n is a, c, g, or t

<400>  9
agcgctggca gtccttgcca ttgccgggat cggggcagta acgggatggg cgatcagccc       60

gagcgcgacg cccggaagca ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca      120

ggtgccgggc agtgagggcg gcggcctggg tggcggcctg cccttcactt cggccgtcgg      180

ggcattcacg gacttcatgg cggggccggc aatttttacc ttgggcattc ttggcatagt      240

ggtcgcgggt gccgtgctcg tgttcggggg tgcgataaac ccagcgaacc atttgaggtg      300

ataggtaaga ttataccgag gtatgaaaac gagaattgga cctttacaga attactctat      360

gaagcgccat atttaaaaag ctaccaagac gaagaggatg aagaggatga ggaggcagat      420

tgccttgaat atattgacaa tactgataag ataatatatc ttttatatag aagatatcgc      480

cgtatgtaag gatttcaggg ggcaaggcat aggcagcgcg cttatcaata tatctataga      540

atgggcaaag cataaaaact tgcatggact aatgcttgaa acccaggaca ataaccttat      600

agcttgtaaa ttctatcata attgggtaat gactccaact tattgatagt gttttatgtt      660

cagataatgc ccgatgactt tgtcatgcag ctccaccgat tttgagaacg acagcgactt      720

ccgtcccagc cgtgccaggt gctgcctcag attcaggtta tgccgctcaa ttcgctgcgt      780

atatcgcttg ctgattacgt gcagctttcc cttcaggcgg gattcataca gcggccagcc      840

atccgtcatc catatcacca cgtcaaaggg tgacagcagg ctcataagac gccccagcgt      900

cgccatagtg cgttcaccga atacgtgcgc aacaaccgtc ttccggagac tgtcatacgc      960

gtaaaacagc cagcgctggc gcgatttagc cccgacatag ccccactgtt cgtccatttc     1020

cgcgcagacg atgacgtcac tgcccggctg tatgcgcgag gttaccgact gcggcctgag     1080

ttttttaagt gacgtaaaat cgtgttgagg ccaacgccca taatgcgggc tgttgcccgg     1140

catccaacgc cattcatggc catatcaatg attttctggt gcgtaccggg ttgagaagcg     1200

gtgtaagtga actgcagnnn nnnnnnnaag cttgactctc ttaagggagc gtcgagtacg     1260

cgcccgggga gcccaagggc acgccctggc acccgaagct ctagtatcaa atttggcaca     1320

aaaagcaaaa ttaaaatact gataattgcc aacacaatta acatctcaat caaggtaaat     1380

gctttttgct ttttttgcca aagctatctt ccgtgatcag agctccagct tttgttccct     1440

ttagtgaggg ttaattgcgc gcttggcgta atcatggtca tagctgtttc ctgtgtgaaa     1500

ttgttatccg ctcacaattc cacacaacat acgagccgga agcataaagt gtaaagcctg     1560

gggtgcctaa tgagtgagct aactcacatt aattgcgttg cgctcactgc ccgctttcca     1620

gtcgggaaac ctgtcgtgcc agctgataga cacagaagcc actggagcac ctcaaaaaca     1680

ccatcataca ctaaatcagt aagttggcag catcacccat aattgtggtt tcaaaatcgg     1740

ctccgtcgat actatgttat acgccaactt tgaaaacaac tttgaaaaag ctgttttctg     1800

gtatttaagg ttttagaatg caaggaacag tgaattggag ttcgtcttgt tataattagc     1860

ttcttggggt atctttaaat actgtagaaa agaggaagga aataataaat ggctaaaatg     1920

agaatatcac cggaattgaa aaaactgatc gaaaaatacc gctgcgtaaa agatacggaa     1980

ggaatgtctc ctgctaaggt atataagctg gtgggagaaa atgaaaacct atatttaaaa     2040

atgacggaca gccggtataa agggaccacc tatgatgtgg aacgggaaaa ggacatgatg     2100

ctatggctgg aaggaaagct gcctgttcca aaggtcctgc actttgaacg gcatgatggc     2160

tggagcaatc tgctcatgag tgaggccgat ggcgtccttt gctcggaaga gtatgaagat     2220

gaacaaagcc ctgaaaagat tatcgagctg tatgcggagt gcatcaggct ctttcactcc     2280

atcgacatat cggattgtcc ctatacgaat agcttagaca gccgcttagc cgaattggat     2340

tacttactga ataacgatct ggccgatgtg gattgcgaaa actgggaaga agacactcca     2400

tttaaagatc cgcgcgagct gtatgatttt ttaaagacgg aaaagcccga agaggaactt     2460

gtcttttccc acggcgacct gggagacagc aacatctttg tgaaagatgg caaagtaagt     2520

ggctttattg atcttgggag aagcggcagg gcggacaagt ggtatgacat tgccttctgc     2580

gtccggtcga tcagggagga tatcggggaa gaacagtatg tcgagctatt ttttgactta     2640

ctggggatca agcctgattg ggagaaaata aaatattata ttttactgga tgaattgttt     2700

tagtacctag atgtggcgca acgatgccgg cgacaagcag gagcgcaccg acttcttccg     2760

catcaagtgt tttggctctc aggccgaggc ccacggcaag tatttgggca aggggtcgct     2820

ggtattcgtg cagggcaaga ttcggaatac caagtacgag aaggacggcc agacggtcta     2880

cgggaccgac ttcattgccg ataaggtgga ttatctggac accaaggcac caggcgggtc     2940

aaatcaggaa taagggcaca ttgccccggc gtgagtcggg gcaatcccgc aaggagggtg     3000

aatgaatcgg acgtttgacc ggaaggcata caggcaagaa ctgatcgacg cggggttttc     3060

cgccgaggat gccgaaacca tcgcaagccg caccgtcatg cgtgcgcccc gcgaaacctt     3120

ccagtccgtc ggctcgatgg tccagcaagc tacggccaag atcgagcgcg acagcgtgca     3180

actggctccc cctgccctgc ccgcgccatc ggccgccgtg gagcgttcgc gtcgtctcga     3240

acaggaggcg gcaggtttgg cgaagtcgat gaccatcgac acgcgaggaa ctatgacgac     3300

caagaagcga aaaaccgccg gcgaggacct ggcaaaacag gtcagcgagg ccaagcaggc     3360

cgcgttgctg aaacacacga agcagcagat caaggaaatg cagctttcct tgttcgatat     3420

tgcgccgtgg ccggacacga tgcgagcgat gccaaacgac acggcccgct ctgccctgtt     3480

caccacgcgc aacaagaaaa tcccgcgcga ggcgctgcaa aacaaggtca ttttccacgt     3540

caacaaggac gtgaagatca cctacaccgg cgtcgagctg cgggccgacg atgacgaact     3600

ggtgtggcag caggtgttgg agtacgcgaa gcgcacccct atcggcgagc cgatcacctt     3660

cacgttctac gagctttgcc aggacctggg ctggtcgatc aatggccggt attacacgaa     3720

ggccgaggaa tgcctgtcgc gcctacaggc gacggcgatg ggcttcacgt ccgaccgcgt     3780

tgggcacctg gaatcggtgt cgctgctgca ccgcttccgc gtcctggacc gtggcaagaa     3840

aacgtcccgt tgccaggtcc tgatcgacga ggaaatcgtc gtgctgtttg ctggcgacca     3900

ctacacgaaa ttcatatggg agaagtaccg caagctgtcg ccgacggccc gacggatgtt     3960

cgactatttc agctcgcacc gggagccgta cccgctcaag ctggaaacct tccgcctcat     4020

gtgcggatcg gattccaccc gcgtgaagaa gtggcgcgag caggtcggcg aagcctgcga     4080

agagttgcga ggcagcggcc tggtggaaca cgcctgggtc aatgatgacc tggtgcattg     4140

caaacgctag ggccttgtgg ggtcagttcc ggctgggggt tcagcagcca gcgctttact     4200

ctagtgacgc tcaccgggct ggttgccctc gccgctgggc tggcggccgt ctatggccct     4260

gcaaacgcgc cagaaacgcc gtcgaagccg tgtgcgagac accgcggccg ccggcgttgt     4320

ggatacctcg cggaaaactt ggccctcact gacagatgag gggcggacgt tgacacttga     4380

ggggccgact cacccggcgc ggcgttgaca gatgaggggc aggctcgatt tcggccggcg     4440

acgtggagct ggccagcctc gcaaatcggc gaaaacgcct gattttacgc gagtttccca     4500

cagatgatgt ggacaagcct ggggataagt gccctgcggt attgacactt gaggggcgcg     4560

actactgaca gatgaggggc gcgatccttg acacttgagg ggcagagtgc tgacagatga     4620

ggggcgcacc tattgacatt tgaggggctg tccacaggca gaaaatccag catttgcaag     4680

ggtttccgcc cgtttttcgg ccaccgctaa cctgtctttt aacctgcttt taaaccaata     4740

tttataaacc ttgtttttaa ccagggctgc gccctgtgcg cgtgaccgcg cacgccgaag     4800

gggggtgccc ccccttctcg aaccctcccg gcccgctaac gcgggcctcc catcccccca     4860

ggggctgcgc ccctcggccg cgaacggcct caccccaaaa atggc                     4905


<210>  10
<211>  260
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Insert of VC-SAH6-1

<400>  10
ttgccatgtt ttacggcagt gagagcagag atagcgctga tgtccggcgg tgcttttgcc       60

gttacgcacc accccgtcag tagctgaaca ggagggacag ctggcgaaag ggggatgtgc      120

tgcaaggcga ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac      180

ggccagtgag cgcgcgtaat acgactcact atagggcgaa ttgggtactc gagtacgcta      240

gggataacag ggtaatatag                                                  260


<210>  11
<211>  4580
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  VC-SAH7-1

<400>  11
ctagtgacgc tcaccgggct ggttgccctc gccgctgggc tggcggccgt ctatggccct       60

gcaaacgcgc cagaaacgcc gtcgaagccg tgtgcgagac accgcggccg ccggcgttgt      120

ggatacctcg cggaaaactt ggccctcact gacagatgag gggcggacgt tgacacttga      180

ggggccgact cacccggcgc ggcgttgaca gatgaggggc aggctcgatt tcggccggcg      240

acgtggagct ggccagcctc gcaaatcggc gaaaacgcct gattttacgc gagtttccca      300

cagatgatgt ggacaagcct ggggataagt gccctgcggt attgacactt gaggggcgcg      360

actactgaca gatgaggggc gcgatccttg acacttgagg ggcagagtgc tgacagatga      420

ggggcgcacc tattgacatt tgaggggctg tccacaggca gaaaatccag catttgcaag      480

ggtttccgcc cgtttttcgg ccaccgctaa cctgtctttt aacctgcttt taaaccaata      540

tttataaacc ttgtttttaa ccagggctgc gccctgtgcg cgtgaccgcg cacgccgaag      600

gggggtgccc ccccttctcg aaccctcccg gcccgctaac gcgggcctcc catcccccca      660

ggggctgcgc ccctcggccg cgaacggcct caccccaaaa atggcagcgc tggcagtcct      720

tgccattgcc gggatcgggg cagtaacggg atgggcgatc agcccgagcg cgacgcccgg      780

aagcattgac gtgccgcagg tgctggcatc gacattcagc gaccaggtgc cgggcagtga      840

gggcggcggc ctgggtggcg gcctgccctt cacttcggcc gtcggggcat tcacggactt      900

catggcgggg ccggcaattt ttaccttggg cattcttggc atagtggtcg cgggtgccgt      960

gctcgtgttc gggggtgcga taaacccagc gaaccatttg aggtgatagg taagattata     1020

ccgaggtatg aaaacgagaa ttggaccttt acagaattac tctatgaagc gccatattta     1080

aaaagctacc aagacgaaga ggatgaagag gatgaggagg cagattgcct tgaatatatt     1140

gacaatactg ataagataat atatctttta tatagaagat atcgccgtat gtaaggattt     1200

cagggggcaa ggcataggca gcgcgcttat caatatatct atagaatggg caaagcataa     1260

aaacttgcat ggactaatgc ttgaaaccca ggacaataac cttatagctt gtaaattcta     1320

tcataattgg gtaatgactc caacttattg atagtgtttt atgttcagat aatgcccgat     1380

gactttgtca tgcagctcca ccgattttga gaacgacagc gacttccgtc ccagccgtgc     1440

caggtgctgc ctcagattca ggttatgccg ctcaattcgc tgcgtatatc gcttgctgat     1500

tacgtgcagc tttcccttca ggcgggattc atacagcggc cagccatccg tcatccatat     1560

caccacgtca aagggtgaca gcaggctcat aagacgcccc agcgtcgcca tagtgcgttc     1620

accgaatacg tgcgcaacaa ccgtcttccg gagactgtca tacgcgtaaa acagccagcg     1680

ctggcgcgat ttagccccga catagcccca ctgttcgtcc atttccgcgc agacgatgac     1740

gtcactgccc ggctgtatgc gcgaggttac cgactgcggc ctgagttttt taagtgacgt     1800

aaaatcgtgt tgaggccaac gcccataatg cgggctgttg cccggcatcc aacgccattc     1860

atggccatat caatgatttt ctggtgcgta ccgggttgag aagcggtgta agtgaactgc     1920

agttgccatg ttttacggca gtgagagcag agatagcgct gatgtccggc ggtgcttttg     1980

ccgttacgca ccaccccgtc agtagctgaa caggagggac agctgataga cacagaagcc     2040

actggagcac ctcaaaaaca ccatcataca ctaaatcagt aagttggcag catcacccat     2100

aattgtggtt tcaaaatcgg ctccgtcgat actatgttat acgccaactt tgaaaacaac     2160

tttgaaaaag ctgttttctg gtatttaagg ttttagaatg caaggaacag tgaattggag     2220

ttcgtcttgt tataattagc ttcttggggt atctttaaat actgtagaaa agaggaagga     2280

aataataaat ggctaaaatg agaatatcac cggaattgaa aaaactgatc gaaaaatacc     2340

gctgcgtaaa agatacggaa ggaatgtctc ctgctaaggt atataagctg gtgggagaaa     2400

atgaaaacct atatttaaaa atgacggaca gccggtataa agggaccacc tatgatgtgg     2460

aacgggaaaa ggacatgatg ctatggctgg aaggaaagct gcctgttcca aaggtcctgc     2520

actttgaacg gcatgatggc tggagcaatc tgctcatgag tgaggccgat ggcgtccttt     2580

gctcggaaga gtatgaagat gaacaaagcc ctgaaaagat tatcgagctg tatgcggagt     2640

gcatcaggct ctttcactcc atcgacatat cggattgtcc ctatacgaat agcttagaca     2700

gccgcttagc cgaattggat tacttactga ataacgatct ggccgatgtg gattgcgaaa     2760

actgggaaga agacactcca tttaaagatc cgcgcgagct gtatgatttt ttaaagacgg     2820

aaaagcccga agaggaactt gtcttttccc acggcgacct gggagacagc aacatctttg     2880

tgaaagatgg caaagtaagt ggctttattg atcttgggag aagcggcagg gcggacaagt     2940

ggtatgacat tgccttctgc gtccggtcga tcagggagga tatcggggaa gaacagtatg     3000

tcgagctatt ttttgactta ctggggatca agcctgattg ggagaaaata aaatattata     3060

ttttactgga tgaattgttt tagtacctag atgtggcgca acgatgccgg cgacaagcag     3120

gagcgcaccg acttcttccg catcaagtgt tttggctctc aggccgaggc ccacggcaag     3180

tatttgggca aggggtcgct ggtattcgtg cagggcaaga ttcggaatac caagtacgag     3240

aaggacggcc agacggtcta cgggaccgac ttcattgccg ataaggtgga ttatctggac     3300

accaaggcac caggcgggtc aaatcaggaa taagggcaca ttgccccggc gtgagtcggg     3360

gcaatcccgc aaggagggtg aatgaatcgg acgtttgacc ggaaggcata caggcaagaa     3420

ctgatcgacg cggggttttc cgccgaggat gccgaaacca tcgcaagccg caccgtcatg     3480

cgtgcgcccc gcgaaacctt ccagtccgtc ggctcgatgg tccagcaagc tacggccaag     3540

atcgagcgcg acagcgtgca actggctccc cctgccctgc ccgcgccatc ggccgccgtg     3600

gagcgttcgc gtcgtctcga acaggaggcg gcaggtttgg cgaagtcgat gaccatcgac     3660

acgcgaggaa ctatgacgac caagaagcga aaaaccgccg gcgaggacct ggcaaaacag     3720

gtcagcgagg ccaagcaggc cgcgttgctg aaacacacga agcagcagat caaggaaatg     3780

cagctttcct tgttcgatat tgcgccgtgg ccggacacga tgcgagcgat gccaaacgac     3840

acggcccgct ctgccctgtt caccacgcgc aacaagaaaa tcccgcgcga ggcgctgcaa     3900

aacaaggtca ttttccacgt caacaaggac gtgaagatca cctacaccgg cgtcgagctg     3960

cgggccgacg atgacgaact ggtgtggcag caggtgttgg agtacgcgaa gcgcacccct     4020

atcggcgagc cgatcacctt cacgttctac gagctttgcc aggacctggg ctggtcgatc     4080

aatggccggt attacacgaa ggccgaggaa tgcctgtcgc gcctacaggc gacggcgatg     4140

ggcttcacgt ccgaccgcgt tgggcacctg gaatcggtgt cgctgctgca ccgcttccgc     4200

gtcctggacc gtggcaagaa aacgtcccgt tgccaggtcc tgatcgacga ggaaatcgtc     4260

gtgctgtttg ctggcgacca ctacacgaaa ttcatatggg agaagtaccg caagctgtcg     4320

ccgacggccc gacggatgtt cgactatttc agctcgcacc gggagccgta cccgctcaag     4380

ctggaaacct tccgcctcat gtgcggatcg gattccaccc gcgtgaagaa gtggcgcgag     4440

caggtcggcg aagcctgcga agagttgcga ggcagcggcc tggtggaaca cgcctgggtc     4500

aatgatgacc tggtgcattg caaacgctag ggccttgtgg ggtcagttcc ggctgggggt     4560

tcagcagcca gcgctttact                                                 4580


<210>  12
<211>  5221
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Construct III


<220>
<221>  misc_feature
<222>  (1557)..(1566)
<223>  n is a, c, g, or t

<400>  12
agcgctggca gtccttgcca ttgccgggat cggggcagta acgggatggg cgatcagccc       60

gagcgcgacg cccggaagca ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca      120

ggtgccgggc agtgagggcg gcggcctggg tggcggcctg cccttcactt cggccgtcgg      180

ggcattcacg gacttcatgg cggggccggc aatttttacc ttgggcattc ttggcatagt      240

ggtcgcgggt gccgtgctcg tgttcggggg tgcgataaac ccagcgaacc atttgaggtg      300

ataggtaaga ttataccgag gtatgaaaac gagaattgga cctttacaga attactctat      360

gaagcgccat atttaaaaag ctaccaagac gaagaggatg aagaggatga ggaggcagat      420

tgccttgaat atattgacaa tactgataag ataatatatc ttttatatag aagatatcgc      480

cgtatgtaag gatttcaggg ggcaaggcat aggcagcgcg cttatcaata tatctataga      540

atgggcaaag cataaaaact tgcatggact aatgcttgaa acccaggaca ataaccttat      600

agcttgtaaa ttctatcata attgggtaat gactccaact tattgatagt gttttatgtt      660

cagataatgc ccgatgactt tgtcatgcag ctccaccgat tttgagaacg acagcgactt      720

ccgtcccagc cgtgccaggt gctgcctcag attcaggtta tgccgctcaa ttcgctgcgt      780

atatcgcttg ctgattacgt gcagctttcc cttcaggcgg gattcataca gcggccagcc      840

atccgtcatc catatcacca cgtcaaaggg tgacagcagg ctcataagac gccccagcgt      900

cgccatagtg cgttcaccga atacgtgcgc aacaaccgtc ttccggagac tgtcatacgc      960

gtggttacag tcttgcgcga catgcgtcac cacggtgata tcgtccaccc aggtgttcgg     1020

cgtggtgtag agcattacgc tgcgatggat tccggcatag ttaaagaaat catggaagta     1080

agactgcttt ttcttgccgt tttcgtcggt aatcaccatt cccggcggga tagtctgcca     1140

gttcagttcg ttgttcacac aaacggtgat acgtacactt ttcccggcaa taacatacgg     1200

cgtgacatcg gcttcaaatg gcgtatagcc gccctgatgc tccatcactt cctgattatt     1260

gacccacact ttgccgtaat gagtgaccgc atcgaaacgc agcacgatac gctggcctgc     1320

ccaacctttc ggtataaaga cttcgcgctg ataccagacg ttgcccgcat aattacgaat     1380

atctgcatcg gcgaactgat cgttaaaact gcctggcaca gcaattgccc ggctttcttg     1440

taacgcgctt tcccaccaac gctgatcaat tccacagttt tcgcggtcca gactgaatgc     1500

ccacaggccg tcgagttttt tgatttcacg ggttggggtt tctacaggac tctagannnn     1560

nnnnnngcgg ccgctggcac cacctgccag tcaacagacg cgtaaaacag ccagcgctgg     1620

cgcgatttag ccccgacata gccccactgt tcgtccattt ccgcgcagac gatgacgtca     1680

ctgcccggct gtatgcgcga ggttaccgac tgcggcctga gttttttaag tgacgtaaaa     1740

tcgtgttgag gccaacgccc ataatgcggg ctgttgcccg gcatccaacg ccattcatgg     1800

ccatatcaat gattttctgg tgcgtaccgg gttgagaagc ggtgtaagtg aactgcagtt     1860

gccatgtttt acggcagtga gagcagagat agcgctgatg tccggcggtg cttttgccgt     1920

tacgcaccac cccgtcagta gctgaacagg agggacagct gatagacaca gaagccactg     1980

gagcacctca aaaacaccat catacactaa atcagtaagt tggcagcatc acccataatt     2040

gtggtttcaa aatcggctcc gtcgatacta tgttatacgc caactttgaa aacaactttg     2100

aaaaagctgt tttctggtat ttaaggtttt agaatgcaag gaacagtgaa ttggagttcg     2160

tcttgttata attagcttct tggggtatct ttaaatactg tagaaaagag gaaggaaata     2220

ataaatggct aaaatgagaa tatcaccgga attgaaaaaa ctgatcgaaa aataccgctg     2280

cgtaaaagat acggaaggaa tgtctcctgc taaggtatat aagctggtgg gagaaaatga     2340

aaacctatat ttaaaaatga cggacagccg gtataaaggg accacctatg atgtggaacg     2400

ggaaaaggac atgatgctat ggctggaagg aaagctgcct gttccaaagg tcctgcactt     2460

tgaacggcat gatggctgga gcaatctgct catgagtgag gccgatggcg tcctttgctc     2520

ggaagagtat gaagatgaac aaagccctga aaagattatc gagctgtatg cggagtgcat     2580

caggctcttt cactccatcg acatatcgga ttgtccctat acgaatagct tagacagccg     2640

cttagccgaa ttggattact tactgaataa cgatctggcc gatgtggatt gcgaaaactg     2700

ggaagaagac actccattta aagatccgcg cgagctgtat gattttttaa agacggaaaa     2760

gcccgaagag gaacttgtct tttcccacgg cgacctggga gacagcaaca tctttgtgaa     2820

agatggcaaa gtaagtggct ttattgatct tgggagaagc ggcagggcgg acaagtggta     2880

tgacattgcc ttctgcgtcc ggtcgatcag ggaggatatc ggggaagaac agtatgtcga     2940

gctatttttt gacttactgg ggatcaagcc tgattgggag aaaataaaat attatatttt     3000

actggatgaa ttgttttagt acctagatgt ggcgcaacga tgccggcgac aagcaggagc     3060

gcaccgactt cttccgcatc aagtgttttg gctctcaggc cgaggcccac ggcaagtatt     3120

tgggcaaggg gtcgctggta ttcgtgcagg gcaagattcg gaataccaag tacgagaagg     3180

acggccagac ggtctacggg accgacttca ttgccgataa ggtggattat ctggacacca     3240

aggcaccagg cgggtcaaat caggaataag ggcacattgc cccggcgtga gtcggggcaa     3300

tcccgcaagg agggtgaatg aatcggacgt ttgaccggaa ggcatacagg caagaactga     3360

tcgacgcggg gttttccgcc gaggatgccg aaaccatcgc aagccgcacc gtcatgcgtg     3420

cgccccgcga aaccttccag tccgtcggct cgatggtcca gcaagctacg gccaagatcg     3480

agcgcgacag cgtgcaactg gctccccctg ccctgcccgc gccatcggcc gccgtggagc     3540

gttcgcgtcg tctcgaacag gaggcggcag gtttggcgaa gtcgatgacc atcgacacgc     3600

gaggaactat gacgaccaag aagcgaaaaa ccgccggcga ggacctggca aaacaggtca     3660

gcgaggccaa gcaggccgcg ttgctgaaac acacgaagca gcagatcaag gaaatgcagc     3720

tttccttgtt cgatattgcg ccgtggccgg acacgatgcg agcgatgcca aacgacacgg     3780

cccgctctgc cctgttcacc acgcgcaaca agaaaatccc gcgcgaggcg ctgcaaaaca     3840

aggtcatttt ccacgtcaac aaggacgtga agatcaccta caccggcgtc gagctgcggg     3900

ccgacgatga cgaactggtg tggcagcagg tgttggagta cgcgaagcgc acccctatcg     3960

gcgagccgat caccttcacg ttctacgagc tttgccagga cctgggctgg tcgatcaatg     4020

gccggtatta cacgaaggcc gaggaatgcc tgtcgcgcct acaggcgacg gcgatgggct     4080

tcacgtccga ccgcgttggg cacctggaat cggtgtcgct gctgcaccgc ttccgcgtcc     4140

tggaccgtgg caagaaaacg tcccgttgcc aggtcctgat cgacgaggaa atcgtcgtgc     4200

tgtttgctgg cgaccactac acgaaattca tatgggagaa gtaccgcaag ctgtcgccga     4260

cggcccgacg gatgttcgac tatttcagct cgcaccggga gccgtacccg ctcaagctgg     4320

aaaccttccg cctcatgtgc ggatcggatt ccacccgcgt gaagaagtgg cgcgagcagg     4380

tcggcgaagc ctgcgaagag ttgcgaggca gcggcctggt ggaacacgcc tgggtcaatg     4440

atgacctggt gcattgcaaa cgctagggcc ttgtggggtc agttccggct gggggttcag     4500

cagccagcgc tttactctag tgacgctcac cgggctggtt gccctcgccg ctgggctggc     4560

ggccgtctat ggccctgcaa acgcgccaga aacgccgtcg aagccgtgtg cgagacaccg     4620

cggccgccgg cgttgtggat acctcgcgga aaacttggcc ctcactgaca gatgaggggc     4680

ggacgttgac acttgagggg ccgactcacc cggcgcggcg ttgacagatg aggggcaggc     4740

tcgatttcgg ccggcgacgt ggagctggcc agcctcgcaa atcggcgaaa acgcctgatt     4800

ttacgcgagt ttcccacaga tgatgtggac aagcctgggg ataagtgccc tgcggtattg     4860

acacttgagg ggcgcgacta ctgacagatg aggggcgcga tccttgacac ttgaggggca     4920

gagtgctgac agatgagggg cgcacctatt gacatttgag gggctgtcca caggcagaaa     4980

atccagcatt tgcaagggtt tccgcccgtt tttcggccac cgctaacctg tcttttaacc     5040

tgcttttaaa ccaatattta taaaccttgt ttttaaccag ggctgcgccc tgtgcgcgtg     5100

accgcgcacg ccgaaggggg gtgccccccc ttctcgaacc ctcccggccc gctaacgcgg     5160

gcctcccatc cccccagggg ctgcgcccct cggccgcgaa cggcctcacc ccaaaaatgg     5220

c                                                                     5221


<210>  13
<211>  8885
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Construct IV


<220>
<221>  misc_feature
<222>  (3)..(12)
<223>  n is a, c, g, or t

<400>  13
ccnnnnnnnn nnttaattaa cgaagagcaa gagctcgaat ttccccgatc gttcaaacat       60

ttggcaataa agtttcttaa gattgaatcc tgttgccggt cttgcgatga ttatcatata      120

atttctgttg aattacgtta agcatgtaat aattaacatg taatgcatga cgttatttat      180

gagatgggtt tttatgatta gagtcccgca attatacatt taatacgcga tagaaaacaa      240

aatatagcgc gcaaactagg ataaattatc gcgcgcggtg tcatctatgt tactagatcg      300

ggaattggca tgcaagcttg gcactggccg tcgttttaca acgtcgtgac tgggaaaacc      360

ctggcgttac ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata      420

gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgaatgct      480

agagcagctt gagcttggat cagattgtcg tttcccgcct tcagtttaaa ctatcagtgt      540

ttgacaggat atattggcgg gtaaacctaa gagaaaagag cgtttattag aataatcgga      600

tatttaaaag ggcgtgaaaa ggtttatccg ttcgtccatt tgtatgtgca tgccaaccac      660

agggttcccc tcgggatcaa agtactttga tccaacccct ccgctgctat agtgcagtcg      720

gcttctgacg ttcagtgcag ccgtcttctg aaaacgacat gtcgcacaag tcctaagtta      780

cgcgacaggc tgccgccctg cccttttcct ggcgttttct tgtcgcgtgt tttagtcgca      840

taaagtagaa tacttgcgac tagaaccgga gacattacgc catgaacaag agcgccgccg      900

ctggcctgct gggctatgcc cgcgtcagca ccgacgacca ggacttgacc aaccaacggg      960

ccgaactgca cgcggccggc tgcaccaagc tgttttccga gaagatcacc ggcaccaggc     1020

gcgaccgccc ggagctggcc aggatgcttg accacctacg ccctggcgac gttgtgacag     1080

tgaccaggct agaccgcctg gcccgcagca cccgcgacct actggacatt gccgagcgca     1140

tccaggaggc cggcgcgggc ctgcgtagcc tggcagagcc gtgggccgac accaccacgc     1200

cggccggccg catggtgttg accgtgttcg ccggcattgc cgagttcgag cgttccctaa     1260

tcatcgaccg cacccggagc gggcgcgagg ccgccaaggc ccgaggcgtg aagtttggcc     1320

cccgccctac cctcaccccg gcacagatcg cgcacgcccg cgagctgatc gaccaggaag     1380

gccgcaccgt gaaagaggcg gctgcactgc ttggcgtgca tcgctcgacc ctgtaccgcg     1440

cacttgagcg cagcgaggaa gtgacgccca ccgaggccag gcggcgcggt gccttccgtg     1500

aggacgcatt gaccgaggcc gacgccctgg cggccgccga gaatgaacgc caagaggaac     1560

aagcatgaaa ccgcaccagg acggccagga cgaaccgttt ttcattaccg aagagatcga     1620

ggcggagatg atcgcggccg ggtacgtgtt cgagccgccc gcgcacgtct caaccgtgcg     1680

gctgcatgaa atcctggccg gtttgtctga tgccaagctg gcggcctggc cggccagctt     1740

ggccgctgaa gaaaccgagc gccgccgtct aaaaaggtga tgtgtatttg agtaaaacag     1800

cttgcgtcat gcggtcgctg cgtatatgat gcgatgagta aataaacaaa tacgcaaggg     1860

gaacgcatga aggttatcgc tgtacttaac cagaaaggcg ggtcaggcaa gacgaccatc     1920

gcaacccatc tagcccgcgc cctgcaactc gccggggccg atgttctgtt agtcgattcc     1980

gatccccagg gcagtgcccg cgattgggcg gccgtgcggg aagatcaacc gctaaccgtt     2040

gtcggcatcg accgcccgac gattgaccgc gacgtgaagg ccatcggccg gcgcgacttc     2100

gtagtgatcg acggagcgcc ccaggcggcg gacttggctg tgtccgcgat caaggcagcc     2160

gacttcgtgc tgattccggt gcagccaagc ccttacgaca tatgggccac cgccgacctg     2220

gtggagctgg ttaagcagcg cattgaggtc acggatggaa ggctacaagc ggcctttgtc     2280

gtgtcgcggg cgatcaaagg cacgcgcatc ggcggtgagg ttgccgaggc gctggccggg     2340

tacgagctgc ccattcttga gtcccgtatc acgcagcgcg tgagctaccc aggcactgcc     2400

gccgccggca caaccgttct tgaatcagaa cccgagggcg acgctgcccg cgaggtccag     2460

gcgctggccg ctgaaattaa atcaaaactc atttgagtta atgaggtaaa gagaaaatga     2520

gcaaaagcac aaacacgcta agtgccggcc gtccgagcgc acgcagcagc aaggctgcaa     2580

cgttggccag cctggcagac acgccagcca tgaagcgggt caactttcag ttgccggcgg     2640

aggatcacac caagctgaag atgtacgcgg tacgccaagg caagaccatt accgagctgc     2700

tatctgaata catcgcgcag ctaccagagt aaatgagcaa atgaataaat gagtagatga     2760

attttagcgg ctaaaggagg cggcatggaa aatcaagaac aaccaggcac cgacgccgtg     2820

gaatgcccca tgtgtggagg aacgggcggt tggccaggcg taagcggctg ggttgcctgc     2880

cggccctgca atggcactgg aacccccaag cccgaggaat cggcgtgagc ggtcgcaaac     2940

catccggccc ggtacaaatc ggcgcggcgc tgggtgatga cctggtggag aagttgaagg     3000

ccgcgcaggc cgcccagcgg caacgcatcg aggcagaagc acgccccggt gaatcgtggc     3060

aagcggccgc tgatcgaatc cgcaaagaat cccggcaacc gccggcagcc ggtgcgccgt     3120

cgattaggaa gccgcccaag ggcgacgagc aaccagattt tttcgttccg atgctctatg     3180

acgtgggcac ccgcgatagt cgcagcatca tggacgtggc cgttttccgt ctgtcgaagc     3240

gtgaccgacg agctggcgag gtgatccgct acgagcttcc agacgggcac gtagaggttt     3300

ccgcagggcc ggccggcatg gccagtgtgt gggattacga cctggtactg atggcggttt     3360

cccatctaac cgaatccatg aaccgatacc gggaagggaa gggagacaag cccggccgcg     3420

tgttccgtcc acacgttgcg gacgtactca agttctgccg gcgagccgat ggcggaaagc     3480

agaaagacga cctggtagaa acctgcattc ggttaaacac cacgcacgtt gccatgcagc     3540

gtacgaagaa ggccaagaac ggccgcctgg tgacggtatc cgagggtgaa gccttgatta     3600

gccgctacaa gatcgtaaag agcgaaaccg ggcggccgga gtacatcgag atcgagctag     3660

ctgattggat gtaccgcgag atcacagaag gcaagaaccc ggacgtgctg acggttcacc     3720

ccgattactt tttgatcgat cccggcatcg gccgttttct ctaccgcctg gcacgccgcg     3780

ccgcaggcaa ggcagaagcc agatggttgt tcaagacgat ctacgaacgc agtggcagcg     3840

ccggagagtt caagaagttc tgtttcaccg tgcgcaagct gatcgggtca aatgacctgc     3900

cggagtacga tttgaaggag gaggcggggc aggctggccc gatcctagtc atgcgctacc     3960

gcaacctgat cgagggcgaa gcatccgccg gttcctaatg tacggagcag atgctagggc     4020

aaattgccct agcaggggaa aaaggtcgaa aaggtctctt tcctgtggat agcacgtaca     4080

ttgggaaccc aaagccgtac attgggaacc ggaacccgta cattgggaac ccaaagccgt     4140

acattgggaa ccggtcacac atgtaagtga ctgatataaa agagaaaaaa ggcgattttt     4200

ccgcctaaaa ctctttaaaa cttattaaaa ctcttaaaac ccgcctggcc tgtgcataac     4260

tgtctggcca gcgcacagcc gaagagctgc aaaaagcgcc tacccttcgg tcgctgcgct     4320

ccctacgccc cgccgcttcg cgtcggccta tcgcggccgc tggccgctca aaaatggctg     4380

gcctacggcc aggcaatcta ccagggcgcg gacaagccgc gccgtcgcca ctcgaccgcc     4440

ggcgcccaca tcaaggcacc ctgcctcgcg cgtttcggtg atgacggtga aaacctctga     4500

cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg gagcagacaa     4560

gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gcgcagccat gacccagtca     4620

cgtagcgata gcggagtgta tactggctta actatgcggc atcagagcag attgtactga     4680

gagtgcacca tatgcggtgt gaaataccgc acagatgcgt aaggagaaaa taccgcatca     4740

ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag     4800

cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag     4860

gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc     4920

tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc     4980

agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc     5040

tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt     5100

cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg     5160

ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat     5220

ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag     5280

ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt     5340

ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct ctgctgaagc     5400

cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta     5460

gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag     5520

atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga     5580

ttttggtcat gcattctagg tactaaaaca attcatccag taaaatataa tattttattt     5640

tctcccaatc aggcttgatc cccagtaagt caaaaaatag ctcgacatac tgttcttccc     5700

cgatatcctc cctgatcgac cggacgcaga aggcaatgtc ataccacttg tccgccctgc     5760

cgcttctccc aagatcaata aagccactta ctttgccatc tttcacaaag atgttgctgt     5820

ctcccaggtc gccgtgggaa aagacaagtt cctcttcggg cttttccgtc tttaaaaaat     5880

catacagctc gcgcggatct ttaaatggag tgtcttcttc ccagttttcg caatccacat     5940

cggccagatc gttattcagt aagtaatcca attcggctaa gcggctgtct aagctattcg     6000

tatagggaca atccgatatg tcgatggagt gaaagagcct gatgcactcc gcatacagct     6060

cgataatctt ttcagggctt tgttcatctt catactcttc cgagcaaagg acgccatcgg     6120

cctcactcat gagcagattg ctccagccat catgccgttc aaagtgcagg acctttggaa     6180

caggcagctt tccttccagc catagcatca tgtccttttc ccgttccaca tcataggtgg     6240

tccctttata ccggctgtcc gtcattttta aatataggtt ttcattttct cccaccagct     6300

tatatacctt agcaggagac attccttccg tatcttttac gcagcggtat ttttcgatca     6360

gttttttcaa ttccggtgat attctcattt tagccattta ttatttcctt cctcttttct     6420

acagtattta aagatacccc aagaagctaa ttataacaag acgaactcca attcactgtt     6480

ccttgcattc taaaacctta aataccagaa aacagctttt tcaaagttgt tttcaaagtt     6540

ggcgtataac atagtatcga cggagccgat tttgaaaccg cggtgatcac aggcagcaac     6600

gctctgtcat cgttacaatc aacatgctac cctccgcgag atcatccgtg tttcaaaccc     6660

ggcagcttag ttgccgttct tccgaatagc atcggtaaca tgagcaaagt ctgccgcctt     6720

acaacggctc tcccgctgac gccgtcccgg actgatgggc tgcctgtatc gagtggtgat     6780

tttgtgccga gctgccggtc ggggagctgt tggctggctg gtggcaggat atattgtggt     6840

gtaaacaaat tgacgcttag acaacttaat aacacattgc ggacgttttt aatgtactga     6900

attaacgccg aattaagctt ggacaatcag taaattgaac ggagaatatt attcataaaa     6960

atacgatagt aacgggtgat atattcatta gaatgaaccg aaaccggcgg taaggatctg     7020

agctacacat gctcaggttt tttacaacgt gcacaacaga attgaaagca aatatcatgc     7080

gatcataggc gtctcgcata tctcattaaa gcagggcatg ccggtcgagt caaatctcgg     7140

tgacgggcag gaccggacgg ggcggtaccg gcaggctgaa gtccagctgc cagaaaccca     7200

cgtcatgcca gttcccgtgc ttgaagccgg ccgcccgcag catgccgcgg ggggcatatc     7260

cgagcgcctc gtgcatgcgc acgctcgggt cgttgggcag cccgatgaca gcgaccacgc     7320

tcttgaagcc ctgtgcctcc agggacttca gcaggtgggt gtagagcgtg gagcccagtc     7380

ccgtccgctg gtggcggggg gagacgtaca cggtcgactc ggccgtccag tcgtaggcgt     7440

tgcgtgcctt ccaggggccc gcgtaggcga tgccggcgac ctcgccgtcc acctcggcga     7500

cgagccaggg atagcgctcc cgcagacgga cgaggtcgtc cgtccactcc tgcggttcct     7560

gcggctcggt acggaagttg accgtgcttg tctcgatgta gtggttgacg atggtgcaga     7620

ccgccggcat gtccgcctcg gtggcacggc ggatgtcggc cgggcgtcgt tctgggctca     7680

tggtagactc gacggatcca cgtgtggaag atatgaattt ttttgagaaa ctagataaga     7740

ttaatgaata tcggtgtttt ggttttttct tgtggccgtc tttgtttata ttgagatttt     7800

tcaaatcagt gcgcaagacg tgacgtaagt atccgagtca gtttttattt ttctactaat     7860

ttggtcgaag ctttgggcgg atcctctaga attcgaatcc aaaaattacg gatatgaata     7920

taggcatatc cgtatccgaa ttatccgttt gacagctagc aacgattgta caattgcttc     7980

tttaaaaaag gaagaaagaa agaaagaaaa gaatcaacat cagcgttaac aaacggcccc     8040

gttacggccc aaacggtcat atagagtaac ggcgttaagc gttgaaagac tcctatcgaa     8100

atacgtaacc gcaaacgtgt catagtcaga tcccctcttc cttcaccgcc tcaaacacaa     8160

aaataatctt ctacagccta tatatacaac ccccccttct atctctcctt tctcacaatt     8220

catcatcttt ctttctctac ccccaatttt aagaaatcct ctcttctcct cttcattttc     8280

aaggtaaatc tctctctctc tctctctctc tgttattcct tgttttaatt aggtatgtat     8340

tattgctagt ttgttaatct gcttatctta tgtatgcctt atgtgaatat ctttatcttg     8400

ttcatctcat ccgtttagaa gctataaatt tgttgatttg actgtgtatc tacacgtggt     8460

tatgtttata tctaatcaga tatgaatttc ttcatattgt tgcgtttgtg tgtaccaatc     8520

cgaaatcgtt gatttttttc atttaatcgt gtagctaatt gtacgtatac atatggatct     8580

acgtatcaat tgttcatctg tttgtgtttg tatgtataca gatctgaaaa catcacttct     8640

ctcatctgat tgtgttgtta catacataga tatagatctg ttatatcatt ttttttatta     8700

attgtgtata tatatatgtg catagatctg gattacatga ttgtgattat ttacatgatt     8760

ttgttattta cgtatgtata tatgtagatc tggacttttt ggagttgttg acttgattgt     8820

atttgtgtgt gtatatgtgt gttctgatct tgatatgtta tgtatgtgca gcccgggttg     8880

ctctt                                                                 8885


<210>  14
<211>  10934
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Construct V


<220>
<221>  misc_feature
<222>  (612)..(621)
<223>  n is a, c, g, or t

<400>  14
gtagaaaccc caacccgtga aatcaaaaaa ctcgacggcc tgtgggcatt cagtctggat       60

cgcgaaaact gtggaattga tcagcgttgg tgggaaagcg cgttacaaga aagccgggca      120

attgctgtgc caggcagttt taacgatcag ttcgccgatg cagatattcg taattatgcg      180

ggcaacgtct ggtatcagcg cgaagtcttt ataccgaaag gttgggcagg ccagcgtatc      240

gtgctgcgtt tcgatgcggt cactcattac ggcaaagtgt gggtcaataa tcaggaagtg      300

atggagcatc agggcggcta tacgccattt gaagccgatg tcacgccgta tgttattgcc      360

gggaaaagtg tacgtatcac cgtttgtgtg aacaacgaac tgaactggca gactatcccg      420

ccgggaatgg tgattaccga cgaaaacggc aagaaaaagc agtcttactt ccatgatttc      480

tttaactatg ccggaatcca tcgcagcgta atgctctaca ccacgccgaa cacctgggtg      540

gacgatatca ccgtggtgac gcatgtcgcg caagactgta accacgcgtc tgttgactgg      600

caggtggtgc cnnnnnnnnn nctagagtcc tgtagaaacc ccaacccgtg aaatcaaaaa      660

actcgacggc ctgtgggcat tcagtctgga ccgcgaaaac tgtggaattg atcagcgttg      720

gtgggaaagc gcgttacaag aaagccgggc aattgctgtg ccaggcagtt ttaacgatca      780

gttcgccgat gcagatattc gtaattatgc gggcaacgtc tggtatcagc gcgaagtctt      840

tataccgaaa ggttgggcag gccagcgtat cgtgctgcgt ttcgatgcgg tcactcatta      900

cggcaaagtg tgggtcaata atcaggaagt gatggagcat cagggcggct atacgccatt      960

tgaagccgat gtcacgccgt atgttattgc cgggaaaagt gtacgtatca ccgtttgtgt     1020

gaacaacgaa ctgaactggc agactatccc gccgggaatg gtgattaccg acgaaaacgg     1080

caagaaaaag cagtcttact tccatgattt ctttaactat gccggaatcc atcgcagcgt     1140

aatgctctac accacgccga acacctgggt ggacgatatc accgtggtga cgcatgtcgc     1200

gcaagactgt aaccacgcgt ctgttgactg gcaggtggtg gccaatggtg atgtcagcgt     1260

tgaactgcgt gatgcggatc aacaggtggt tgcaactgga caaggcacta gcgggacttt     1320

gcaagtggtg aatccgcacc tctggcaacc gggtgaaggt tatctctatg aactgtgcgt     1380

cacagccaaa agccagacag agtgtgatat ctacccgctt cgcgtcggca tccggtcagt     1440

ggcagtgaag ggcgaacagt tcctgattaa ccacaaaccg ttctacttta ctggctttgg     1500

tcgtcatgaa gatgcggact tgcgtggcaa aggattcgat aacgtgctga tggtgcacga     1560

ccacgcatta atggactgga ttggggccaa ctcctaccgt acctcgcatt acccttacgc     1620

tgaagagatg ctcgactggg cagatgaaca tggcatcgtg gtgattgatg aaactgctgc     1680

tgtcggcttt aacctctctt taggcattgg tttcgaagcg ggcaacaagc cgaaagaact     1740

gtacagcgaa gaggcagtca acggggaaac tcagcaagcg cacttacagg cgattaaaga     1800

gctgatagcg cgtgacaaaa accacccaag cgtggtgatg tggagtattg ccaacgaacc     1860

ggatacccgt ccgcaaggtg cacgggaata tttcgcgcca ctggcggaag caacgcgtaa     1920

actcgacccg acgcgtccga tcacctgcgt caatgtaatg ttctgcgacg ctcacaccga     1980

taccatcagc gatctctttg atgtgctgtg cctgaaccgt tattacggat ggtatgtcca     2040

aagcggcgat ttggaagcgg cagagaaggt actggaaaaa gaacttctgg cctggcagga     2100

gaaactgcat cagccgatta tcatcaccga atacggcgtg gatacgttag ccgggctgca     2160

ctcaatgtac accgacatgt ggagtgaaga gtatcagtgt gcatggctgg atatgtatca     2220

ccgcgtcttt gatcgcgtca gcgccgtcgt cggtgaacag gtatggaatt tcgccgattt     2280

tgcgacctcg caaggcatat tgcgcgttgg cggtaacaag aaagggatct tcactcgcga     2340

ccgcaaaccg aagtcggcgg cttttctgct gcaaaaacgc tggactggca tgaacttcgg     2400

tgaaaaaccg cagcagggag gcaaacaatg aatcaacaac tctcctggcg caccatcgtc     2460

ggctacagcc tcgggaattg ctaccgagct cgaatttccc cgatcgttca aacatttggc     2520

aataaagttt cttaagattg aatcctgttg ccggacttgc gatgattatc atataatttc     2580

tgttgaatta cgttaagcat gtaataatta acatgtaatg catgacgtta tttatgagat     2640

gggtttttat gattagagtc ccgcaattat acatttaata cgcgatagaa aacaaaatat     2700

agcgcgcaaa ctaggataaa ttatcgcgcg cggtgtcatc tatgttacta gatcggaata     2760

agcttggcgt aatcatggtc atagctgttt cctactagat ctgattgtcg tttcccgcct     2820

tcagtttaaa ctatcagtgt ttgacaggat atattggcgg gtaaacctaa gagaaaagag     2880

cgtttattag aataatcgga tatttaaaag ggcgtgaaaa ggtttatccg ttcgtccatt     2940

tgtatgtcca tggaacgcag tggcggtttt catggcttgt tatgactgtt tttttggggt     3000

acagtctatg cctcgggcat ccaagcagca agcgcgttac gccgtgggtc gatgtttgat     3060

gttatggagc agcaacgatg ttacgcagca gggcagtcgc cctaaaacaa agttaaacat     3120

catgggggaa gcggtgatcg ccgaagtatc gactcaacta tcagaggtag ttggcgtcat     3180

cgagcgccat ctcgaaccga cgttgctggc cgtacatttg tacggctccg cagtggatgg     3240

cggcctgaag ccacacagtg atattgattt gctggttacg gtgaccgtaa ggcttgatga     3300

aacaacgcgg cgagctttga tcaacgacct tttggaaact tcggcttccc ctggagagag     3360

cgagattctc cgcgctgtag aagtcaccat tgttgtgcac gacgacatca ttccgtggcg     3420

ttatccagct aagcgcgaac tgcaatttgg agaatggcag cgcaatgaca ttcttgcagg     3480

tatcttcgag ccagccacga tcgacattga tctggctatc ttgctgacaa aagcaagaga     3540

acatagcgtt gccttggtag gtccagcggc ggaggaactc tttgatccgg ttcctgaaca     3600

ggatctattt gaggcgctaa atgaaacctt aacgctatgg aactcgccgc ccgactgggc     3660

tggcgatgag cgaaatgtag tgcttacgtt gtcccgcatt tggtacagcg cagtaaccgg     3720

caaaatcgcg ccgaaggatg tcgctgccga ctgggcaatg gagcgcctgc cggcccagta     3780

tcagcccgtc atacttgaag ctagacaggc ttatcttgga caagaagaag atcgcttggc     3840

ctcgcgcgca gatcagttgg aagaatttgt ccactacgtg aaaggcgaga tcaccaaggt     3900

agtcggcaaa taatgtctag ctagaaattc gttcaagccg acgccgcttc gcggcgcggc     3960

ttaactcaag cgttagatgc actaagcaca taattgctca cagccaaact atcaggtcaa     4020

gtctgctttt attattttta agcgtgcata ataagcccta cacaaattgg gagatatatc     4080

atgcatgacc aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag accccgtaga     4140

aaagatcaaa ggatcttctt gagatccttt ttttctgcgc gtaatctgct gcttgcaaac     4200

aaaaaaacca ccgctaccag cggtggtttg tttgccggat caagagctac caactctttt     4260

tccgaaggta actggcttca gcagagcgca gataccaaat actgtccttc tagtgtagcc     4320

gtagttaggc caccacttca agaactctgt agcaccgcct acatacctcg ctctgctaat     4380

cctgttacca gtggctgctg ccagtggcga taagtcgtgt cttaccgggt tggactcaag     4440

acgatagtta ccggataagg cgcagcggtc gggctgaacg gggggttcgt gcacacagcc     4500

cagcttggag cgaacgacct acaccgaact gagataccta cagcgtgagc tatgagaaag     4560

cgccacgctt cccgaaggga gaaaggcgga caggtatccg gtaagcggca gggtcggaac     4620

aggagagcgc acgagggagc ttccaggggg aaacgcctgg tatctttata gtcctgtcgg     4680

gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct     4740

atggaaaaac gccagcaacg cggccttttt acggttcctg gccttttgct ggccttttgc     4800

tcacatgttc tttcctgcgt tatcccctga ttctgtggat aaccgtatta ccgcctttga     4860

gtgagctgat accgctcgcc gcagccgaac gaccgagcgc agcgagtcag tgagcgagga     4920

agcggaagag cgcctgatgc ggtattttct ccttacgcat ctgtgcggta tttcacaccg     4980

catatggtgc actctcagta caatctgctc tgatgccgca tagttaagcc agtatacact     5040

ccgctatcgc tacgtgactg ggtcatggct gcgccccgac acccgccaac acccgctgac     5100

gcgccctgac gggcttgtct gctcccggca tccgcttaca gacaagctgt gaccgtctcc     5160

gggagctgca tgtgtcagag gttttcaccg tcatcaccga aacgcgcgag gcagggtgcc     5220

ttgatgtggg cgccggcggt cgagtggcga cggcgcggct tgtccgcgcc ctggtagatt     5280

gcctggccgt aggccagcca tttttgagcg gccagcggcc gcgataggcc gacgcgaagc     5340

ggcggggcgt agggagcgca gcgaccgaag ggtaggcgct ttttgcagct cttcggctgt     5400

gcgctggcca gacagttatg cacaggccag gcgggtttta agagttttaa taagttttaa     5460

agagttttag gcggaaaaat cgcctttttt ctcttttata tcagtcactt acatgtgtga     5520

ccggttccca atgtacggct ttgggttccc aatgtacggg ttccggttcc caatgtacgg     5580

ctttgggttc ccaatgtacg tgctatccac aggaaagaga ccttttcgac ctttttcccc     5640

tgctagggca atttgcccta gcatctgctc cgtacattag gaaccggcgg atgcttcgcc     5700

ctcgatcagg ttgcggtagc gcatgactag gatcgggcca gcctgccccg cctcctcctt     5760

caaatcgtac tccggcaggt catttgaccc gatcagcttg cgcacggtga aacagaactt     5820

cttgaactct ccggcgctgc cactgcgttc gtagatcgtc ttgaacaacc atctggcttc     5880

tgccttgcct gcggcgcggc gtgccaggcg gtagagaaaa cggccgatgc cgggatcgat     5940

caaaaagtaa tcggggtgaa ccgtcagcac gtccgggttc ttgccttctg tgatctcgcg     6000

gtacatccaa tcagctagct cgatctcgat gtactccggc cgcccggttt cgctctttac     6060

gatcttgtag cggctaatca aggcttcacc ctcggatacc gtcaccaggc ggccgttctt     6120

ggccttcttc gtacgctgca tggcaacgtg cgtggtgttt aaccgaatgc aggtttctac     6180

caggtcgtct ttctgctttc cgccatcggc tcgccggcag aacttgagta cgtccgcaac     6240

gtgtggacgg aacacgcggc cgggcttgtc tcccttccct tcccggtatc ggttcatgga     6300

ttcggttaga tgggaaaccg ccatcagtac caggtcgtaa tcccacacac tggccatgcc     6360

ggccggccct gcggaaacct ctacgtgccc gtctggaagc tcgtagcgga tcacctcgcc     6420

agctcgtcgg tcacgcttcg acagacggaa aacggccacg tccatgatgc tgcgactatc     6480

gcgggtgccc acgtcataga gcatcggaac gaaaaaatct ggttgctcgt cgcccttggg     6540

cggcttccta atcgacggcg caccggctgc cggcggttgc cgggattctt tgcggattcg     6600

atcagcggcc gcttgccacg attcaccggg gcgtgcttct gcctcgatgc gttgccgctg     6660

ggcggcctgc gcggccttca acttctccac caggtcatca cccagcgccg cgccgatttg     6720

taccgggccg gatggtttgc gaccgctcac gccgattcct cgggcttggg ggttccagtg     6780

ccattgcagg gccggcagac aacccagccg cttacgcctg gccaaccgcc cgttcctcca     6840

cacatggggc attccacggc gtcggtgcct ggttgttctt gattttccat gccgcctcct     6900

ttagccgcta aaattcatct actcatttat tcatttgctc atttactctg gtagctgcgc     6960

gatgtattca gatagcagct cggtaatggt cttgccttgg cgtaccgcgt acatcttcag     7020

cttggtgtga tcctccgccg gcaactgaaa gttgacccgc ttcatggctg gcgtgtctgc     7080

caggctggcc aacgttgcag ccttgctgct gcgtgcgctc ggacggccgg cacttagcgt     7140

gtttgtgctt ttgctcattt tctctttacc tcattaactc aaatgagttt tgatttaatt     7200

tcagcggcca gcgcctggac ctcgcgggca gcgtcgccct cgggttctga ttcaagaacg     7260

gttgtgccgg cggcggcagt gcctgggtag ctcacgcgct gcgtgatacg ggactcaaga     7320

atgggcagct cgtacccggc cagcgcctcg gcaacctcac cgccgatgcg cgtgcctttg     7380

atcgcccgcg acacgacaaa ggccgcttgt agccttccat ccgtgacctc aatgcgctgc     7440

ttaaccagct ccaccaggtc ggcggtggcc catatgtcgt aagggcttgg ctgcaccgga     7500

atcagcacga agtcggctgc cttgatcgcg gacacagcca agtccgccgc ctggggcgct     7560

ccgtcgatca ctacgaagtc gcgccggccg atggccttca cgtcgcggtc aatcgtcggg     7620

cggtcgatgc cgacaacggt tagcggttga tcttcccgca cggccgccca atcgcgggca     7680

ctgccctggg gatcggaatc gactaacaga acatcggccc cggcgagttg cagggcgcgg     7740

gctagatggg ttgcgatggt cgtcttgcct gacccgcctt tctggttaag tacagcgata     7800

accttcatgc gttccccttg cgtatttgtt tatttactca tcgcatcata tacgcagcga     7860

ccgcatgacg caagctgttt tactcaaata cacatcacct ttttagacgg cggcgctcgg     7920

tttcttcagc ggccaagctg gccggccagg ccgccagctt ggcatcagac aaaccggcca     7980

ggatttcatg cagccgcacg gttgagacgt gcgcgggcgg ctcgaacacg tacccggccg     8040

cgatcatctc cgcctcgatc tcttcggtaa tgaaaaacgg ttcgtcctgg ccgtcctggt     8100

gcggtttcat gcttgttcct cttggcgttc attctcggcg gccgccaggg cgtcggcctc     8160

ggtcaatgcg tcctcacgga aggcaccgcg ccgcctggcc tcggtgggcg tcacttcctc     8220

gctgcgctca agtgcgcggt acagggtcga gcgatgcacg ccaagcagtg cagccgcctc     8280

tttcacggtg cggccttcct ggtcgatcag ctcgcgggcg tgcgcgatct gtgccggggt     8340

gagggtaggg cgggggccaa acttcacgcc tcgggccttg gcggcctcgc gcccgctccg     8400

ggtgcggtcg atgattaggg aacgctcgaa ctcggcaatg ccggcgaaca cggtcaacac     8460

catgcggccg gccggcgtgg tggtaacgcg tggtgatttt gtgccgagct gccggtcggg     8520

gagctgttgg ctggctggtg gcaggatata ttgtggtgta aacaaattga cgcttagaca     8580

acttaataac acattgcgga cgtctttaat gtactgaatt aacatccgtt tgatacttgt     8640

ctaaaattgg ctgatttcga gtgcatctat gcataaaaac aatctaatga caattattac     8700

caagcaggat cctgtcaaac actgatagtt taaactgaag gcgggaaacg acaatctgat     8760

catgagcgga gaattaaggg agtcacgtta tgacccccgc cgatgacgcg ggacaagccg     8820

ttttacgttt ggaactgaca gaaccgcaac gttgaaggag ccactcagcc gcgggtttct     8880

ggagtttaat gagctaagca catacgtcag aaaccattat tgcgcgttca aaagtcgcct     8940

aaggtcacta tcagctagca aatatttctt gtcaaaaatg ctccactgac gttccataaa     9000

ttcccctcgg tatccaatta gagtctcata ttcactctca atccaaataa tctgcaccgg     9060

atctggatcg tttcgcatga ttgaacaaga tggattgcac gcaggttctc cggccgcttg     9120

ggtggagagg ctattcggct atgactgggc acaacagaca atcggctgct ctgatgccgc     9180

cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg acctgtccgg     9240

tgccctgaat gaactgcagg acgaggcagc gcggctatcg tggctggcca cgacgggcgt     9300

tccttgcgca gctgtgctcg acgttgtcac tgaagcggga agggactggc tgctattggg     9360

cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct cctgccgaga aagtatccat     9420

catggctgat gcaatgcggc ggctgcatac gcttgatccg gctacctgcc cattcgacca     9480

ccaagcgaaa catcgcatcg agcgagcacg tactcggatg gaagccggtc ttgtcgatca     9540

ggatgatctg gacgaagagc atcaggggct cgcgccagcc gaactgttcg ccaggctcaa     9600

ggcgcgcatg cccgacggcg aggatctcgt cgtgacccat ggcgatgcct gcttgccgaa     9660

tatcatggtg gaaaatggcc gcttttctgg attcatcgac tgtggccggc tgggtgtggc     9720

ggaccgctat caggacatag cgttggctac ccgtgatatt gctgaagagc ttggcggcga     9780

atgggctgac cgcttcctcg tgctttacgg tatcgccgct cccgattcgc agcgcatcgc     9840

cttctatcgc cttcttgacg agttcttctg agcgggaccc aagctctaga tcttgctgcg     9900

ttcggatatt ttcgtggagt tcccgccaca gacccggatg atccccgatc gttcaaacat     9960

ttggcaataa agtttcttaa gattgaatcc tgttgccggt cttgcgatga ttatcatata    10020

atttctgttg aattacgtta agcatgtaat aattaacatg taatgcatga cgttatttat    10080

gagatgggtt tttatgatta gagtcccgca attatacatt taatacgcga tagaaaacaa    10140

aatatagcgc gcaaactagg ataaattatc gcgcgcggtg tcatctatgt tactagatcg    10200

ggcctcctgt caagctctga gtcgttgtaa aacgacggcc agtgaattga gctcggtacc    10260

gagtcaaaga ttcaaataga ggacctaaca gaactcgccg taaagactgg cgaacagttc    10320

atacagagtc tcttacgact caatgacaag aagaaaatct tcgtcaacat ggtggagcac    10380

gacacgcttg tctactccaa aaatatcaaa gatacagtct cagaagacca aagggcaatt    10440

gagacttttc aacaaagggt aatatccgga aacctcctcg gattccattg cccagctatc    10500

tgtcacttta ttgtgaagat agtggaaaag gaaggtggct cctacaaatg ccatcattgc    10560

gataaaggaa aggccatcgt tgaagatgcc tctgccgaca gtggtcccaa agatggaccc    10620

ccacccacga ggagcatcgt ggaaaaagaa gacgttccaa ccacgtcttc aaagcaagtg    10680

gattgatgtg atatctccac tgacgtaagg gatgacgcac aatcccacta tccttcgcaa    10740

gacccttcct ctatataagg aagttcattt catttggaga ggacagggta cgtacctaga    10800

atacaaagaa gaggaagaag aaacctctac agaagaaagt gatggatccc cgggatcatc    10860

tacttctgaa gactcagact cagactaagc aggtgacgaa cgtcaccaat cccaattcga    10920

tctacatccg tcct                                                      10934


<210>  15
<211>  235
<212>  PRT
<213>  Saccharomyces cerevisiae

<400>  15

Met Lys Asn Ile Lys Lys Asn Gln Val Met Asn Thr Gly Pro Asn Ser 
1               5                   10                  15      


Lys Leu Leu Lys Glu Tyr Lys Ser Gln Leu Ile Glu Leu Asn Ile Glu 
            20                  25                  30          


Gln Phe Glu Ala Gly Ile Gly Leu Ile Leu Gly Asp Ala Tyr Ile Arg 
        35                  40                  45              


Ser Arg Asp Glu Gly Lys Thr Tyr Cys Met Gln Phe Glu Trp Lys Asn 
    50                  55                  60                  


Lys Ala Tyr Met Asp His Val Cys Leu Leu Tyr Asp Gln Trp Val Leu 
65                  70                  75                  80  


Ser Pro Pro His Lys Lys Glu Arg Val Asn His Leu Gly Asn Leu Val 
                85                  90                  95      


Ile Thr Trp Gly Ala Gln Thr Phe Lys His Gln Ala Phe Asn Lys Leu 
            100                 105                 110         


Ala Asn Leu Phe Ile Val Asn Asn Lys Lys Thr Ile Pro Asn Asn Leu 
        115                 120                 125             


Val Glu Asn Tyr Leu Thr Pro Met Ser Thr Ala Tyr Trp Phe Met Asp 
    130                 135                 140                 


Asp Gly Gly Lys Trp Asp Tyr Asn Lys Asn Ser Thr Asn Lys Ser Ile 
145                 150                 155                 160 


Val Leu Asn Thr Gln Ser Phe Thr Phe Glu Glu Val Glu Tyr Leu Val 
                165                 170                 175     


Lys Gly Leu Arg Asn Lys Phe Gln Leu Asn Cys Tyr Val Lys Ile Asn 
            180                 185                 190         


Lys Asn Lys Pro Ile Ile Tyr Ile Asp Ser Met Ser Tyr Thr Ile Phe 
        195                 200                 205             


Tyr Asn Leu Ile Lys Pro Tyr Leu Ile Pro Gln Met Met Tyr Lys Thr 
    210                 215                 220                 


Pro Asn Thr Ile Ser Ser Glu Thr Phe Leu Lys 
225                 230                 235 


<210>  16
<211>  238
<212>  PRT
<213>  Zygosaccharomyces bisporus

<400>  16

Met Lys Phe Ile Lys Lys Glu Gln Ile Lys Asn Leu Gly Pro Asn Ser 
1               5                   10                  15      


Lys Leu Leu Lys Gln Tyr Lys Ser Gln Leu Thr Asn Leu Thr Ser Glu 
            20                  25                  30          


Gln Leu Glu Ile Gly Val Gly Leu Leu Leu Gly Asp Ala Tyr Ile Arg 
        35                  40                  45              


Ser Arg Asp Asn Gly Lys Thr Asn Cys Ile Gln Phe Glu Trp Lys Asn 
    50                  55                  60                  


Lys Ala Tyr Ile Asp His Ile Cys Leu Lys Phe Asp Glu Trp Val Leu 
65                  70                  75                  80  


Ser Pro Pro His Lys Lys Met Arg Ile Asn His Leu Gly Asn Glu Val 
                85                  90                  95      


Ile Thr Trp Gly Ala Gln Thr Phe Lys His Glu Ala Phe Asn Glu Leu 
            100                 105                 110         


Ser Lys Leu Phe Ile Ile Asn Asn Lys Lys His Ile Ile Asn Asn Leu 
        115                 120                 125             


Ile Glu Asp Tyr Val Thr Pro Lys Ser Leu Ala Tyr Trp Phe Met Asp 
    130                 135                 140                 


Asp Gly Gly Lys Trp Asp Tyr Asn Lys Gly Ser Met Asn Lys Ser Ile 
145                 150                 155                 160 


Val Leu Asn Thr Gln Cys Phe Thr Ile Asp Glu Val Asn Ser Leu Ile 
                165                 170                 175     


Asn Gly Leu Asn Thr Lys Phe Lys Leu Asn Cys Ser Met Lys Phe Asn 
            180                 185                 190         


Lys Asn Lys Pro Ile Ile Tyr Ile Pro His Asn Ser Tyr Asn Ile Tyr 
        195                 200                 205             


Tyr Glu Leu Ile Ser Pro Tyr Ile Ile Thr Glu Met Arg Tyr Lys Leu 
    210                 215                 220                 


Pro Ser Tyr Glu Gly Thr Ser Lys Asp Tyr Asn Lys Ile His 
225                 230                 235             


<210>  17
<211>  228
<212>  PRT
<213>  Lachancea thermotolerans

<400>  17

Met Thr Met Lys Tyr Ile Thr Lys Gln Gln Ile Lys Asn Leu Gly Pro 
1               5                   10                  15      


Asn Ser Lys Leu Leu Lys Gln Tyr Lys Ala Gln Leu Thr Arg Leu Thr 
            20                  25                  30          


Thr Val Gln Leu Glu Ala Gly Val Gly Leu Ile Leu Gly Asp Ala Tyr 
        35                  40                  45              


Ile Arg Ser Arg Asp Glu Gly Lys Thr Tyr Cys Met Gln Phe Glu Trp 
    50                  55                  60                  


Lys Asn Glu Ala Tyr Ile Asn His Val Cys Lys Leu Tyr Asp Glu Trp 
65                  70                  75                  80  


Val Leu Ser Ser Pro His Lys Lys Val Arg Thr Asn His Leu Gly Asn 
                85                  90                  95      


Glu Val Val Thr Trp Gly Ala Gln Thr Phe Lys His Lys Ala Phe Asn 
            100                 105                 110         


Glu Leu Ala Glu Leu Phe Ile Ile Asn Asn Asn Lys His Ile Asn Pro 
        115                 120                 125             


Asp Leu Val Asn Gln Tyr Ile Thr Pro Arg Ser Leu Ala Tyr Trp Phe 
    130                 135                 140                 


Met Asp Asp Gly Gly Lys Trp Asp Tyr Asn Thr Asn Ser Asn Asn Lys 
145                 150                 155                 160 


Ser Ile Val Leu Asn Thr Gln Gly Phe Ser Ile Gln Glu Val Gln Tyr 
                165                 170                 175     


Leu Ile Asp Gly Leu Asn Ile Lys Phe Asn Leu Asn Cys Ile Met Lys 
            180                 185                 190         


Phe Asn Lys Asn Lys Pro Ile Ile Phe Ile Pro Ser Asp Asn Tyr Lys 
        195                 200                 205             


His Tyr Tyr Asp Leu Ile Ile Pro Tyr Ile Ile Pro Glu Met Lys Tyr 
    210                 215                 220                 


Lys Leu Pro Thr 
225             


<210>  18
<211>  230
<212>  PRT
<213>  Pichia canadensis

<400>  18

Met Lys Lys Gln Ile Ile Asn Lys Lys Asp Leu Leu Gly Leu Gly Pro 
1               5                   10                  15      


Asn Ser Lys Leu Ile Lys Asp Tyr Lys Lys Gln Trp Thr Thr Leu Ser 
            20                  25                  30          


Lys Ile Gln Glu Glu Thr Leu Ile Gly Asn Ile Leu Gly Asp Val Tyr 
        35                  40                  45              


Ile Lys Lys Leu Lys Arg Asn Lys His Phe Leu Leu Gln Phe Glu Trp 
    50                  55                  60                  


Lys Asn Lys Ala Tyr Ile Glu His Ile Val Arg Val Phe Asp Glu Tyr 
65                  70                  75                  80  


Val Ile Ser Pro Pro Thr Leu Tyr Glu Arg Lys Asn His Leu Gly Asn 
                85                  90                  95      


Lys Val Ile Thr Trp Arg Ala Gln Thr Phe Glu His Lys Ala Phe Asp 
            100                 105                 110         


Lys Leu Gly Tyr Tyr Phe Met Glu Asn His Lys Lys Ile Ile Lys Pro 
        115                 120                 125             


Asp Leu Val Leu Asn Tyr Ile Thr Glu Arg Ser Leu Ala Tyr Trp Phe 
    130                 135                 140                 


Met Asp Asp Gly Gly Lys Trp Asp Tyr Asn Lys Lys Thr Lys Asn Lys 
145                 150                 155                 160 


Ser Leu Val Leu His Thr Gln Gly Phe Lys Lys Glu Glu Val Glu Ile 
                165                 170                 175     


Leu Ile Asn Asp Leu Asn Ile Lys Phe Asn Leu Asn Cys Ser Ile Lys 
            180                 185                 190         


Phe Asn Lys Asn Lys Pro Ile Ile Tyr Ile Pro Asn Lys Asp Tyr Glu 
        195                 200                 205             


Leu Phe Tyr Asn Leu Val Asn Pro Tyr Ile Ile Pro Glu Met Lys Tyr 
    210                 215                 220                 


Lys Leu Leu Phe Asn Val 
225                 230 


<210>  19
<211>  34
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  TALL Repeat 34

<400>  19

Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys 
1               5                   10                  15      


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 
            20                  25                  30          


His Gly 
        


<210>  20
<211>  35
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Tall repeat 35

<400>  20

Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys 
1               5                   10                  15      


Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala 
            20                  25                  30          


Pro His Asp 
        35  


<210>  21
<211>  693
<212>  DNA
<213>  Artificial Sequence

<220>
<223>   N terminal shortened I-SceI

<400>  21
atgggacagg tgatgaacct gggccctaac tctaagctgc ttaaggaata caagtctcag       60

ctgattgagc tgaacattga gcagttcgag gctggcatag gcctgattct gggcgatgct      120

tacattaggt ctagggatga gggcaagacc tactgcatgc agttcgagtg gaagaacaag      180

gcttacatgg atcacgtgtg cctgctgtac gatcagtggg tgctgtctcc tcctcacaag      240

aaggagaggg tgaaccactt gggaaacctg gtgattacct ggggcgctca aaccttcaag      300

caccaggctt tcaacaagct ggctaacctg ttcattgtga acaacaagaa gaccattcct      360

aacaacctgg tggagaacta cctgacccct atgtctctgg cttactggtt catggatgat      420

ggcggcaagt gggattacaa caagaactct accaacaagt ctattgtgct gaacacccag      480

tctttcacct tcgaggaggt ggaatacctg gtgaagggcc tgaggaacaa gttccagctg      540

aactgctacg tgaagattaa caagaacaag cctattattt acattgattc tatgtcttac      600

ctgattttct acaacctgat taagccttac ctgattcctc agatgatgta caagctgcct      660

aacaccatct cttctgagac cttcctgaag tga                                   693


<210>  22
<211>  666
<212>  DNA
<213>  Artificial Sequence

<220>
<223>   N- and C- terminal shortened I-SceI

<400>  22
atgggacagg tgatgaacct gggccctaac tctaagctgc ttaaggaata caagtctcag       60

ctgattgagc tgaacattga gcagttcgag gctggcatag gcctgattct gggcgatgct      120

tacattaggt ctagggatga gggcaagacc tactgcatgc agttcgagtg gaagaacaag      180

gcttacatgg atcacgtgtg cctgctgtac gatcagtggg tgctgtctcc tcctcacaag      240

aaggagaggg tgaaccactt gggaaacctg gtgattacct ggggcgctca aaccttcaag      300

caccaggctt tcaacaagct ggctaacctg ttcattgtga acaacaagaa gaccattcct      360

aacaacctgg tggagaacta cctgacccct atgtctctgg cttactggtt catggatgat      420

ggcggcaagt gggattacaa caagaactct accaacaagt ctattgtgct gaacacccag      480

tctttcacct tcgaggaggt ggaatacctg gtgaagggcc tgaggaacaa gttccagctg      540

aactgctacg tgaagattaa caagaacaag cctattattt acattgattc tatgtcttac      600

ctgattttct acaacctgat taagccttac ctgattcctc agatgatgta caagctgcct      660

aactga                                                                 666


<210>  23
<211>  8411
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid sequence

<400>  23
ttcttttgtt tatggttgtc tgtcagcatt tgacttgcag tttcatgctc atagtcatat       60

acgttattct aggctttttt gaatatctta ttactttttt cgtaatacaa ttttataatt      120

ttatcaaagt tatacaacta taactaaaat tagggttttc tacaaaacaa aaaaatcttc      180

taattttttt tgttgtagcc agtttactcg taagttacaa aaaaatacaa atgaacccac      240

atgtattatg cgtttaacta ggattaccat gtactttcat gtactcaatt caccctatac      300

tctttttttt tttttttcta gttccaccca atctataaaa ttctgtccat ttgaccaaat      360

tcaattaatt tctgtaattg cgatttaaaa ttaatattac atgttcacta tttctcgatt      420

tgagggaacc cgagtttaaa tatgataaaa atgttgaccc atcactacaa atatgttata      480

gtttatactt aatagtggtg tttttgggga taattgatga attaagtaaa catgattctt      540

cttatgaagt tgattgagtg attattgtat gtaaacctat gtgattgatg ttattggttg      600

attgagtgat tattgtatta gtatgtaagc aaagatgatt gttcttatga ggtaatttgt      660

tactcattca tccttttgca tatgagaaat tgtgttagcg tacgcaaaac aatagagaac      720

ataaaagata tgtgtattta tttaaggtga cttttgttaa tgatattgta gtatctatac      780

atttatatat aacttgttga atttgagtat aagctatcag gatccggggg atcctctaga      840

gtcgaggtac ccaacttttc tatacaaagt tgatagcttg gcgtaatcga tagcttggcg      900

taatcatggt catagctgtt tcctactaga tctgattgtc gtttcccgcc ttcagtttaa      960

actatcagtg tttgacagga tatattggcg ggtaaaccta agagaaaaga gcgtttatta     1020

gaataatcgg atatttaaaa gggcgtgaaa aggtttatcc gttcgtccat ttgtatgtcc     1080

atggaacgca gtggcggttt tcatggcttg ttatgactgt ttttttgggg tacagtctat     1140

gcctcgggca tccaagcagc aagcgcgtta cgccgtgggt cgatgtttga tgttatggag     1200

cagcaacgat gttacgcagc agggcagtcg ccctaaaaca aagttaaaca tcatggggga     1260

agcggtgatc gccgaagtat cgactcaact atcagaggta gttggcgtca tcgagcgcca     1320

tctcgaaccg acgttgctgg ccgtacattt gtacggctcc gcagtggatg gcggcctgaa     1380

gccacacagt gatattgatt tgctggttac ggtgaccgta aggcttgatg aaacaacgcg     1440

gcgagctttg atcaacgacc ttttggaaac ttcggcttcc cctggagaga gcgagattct     1500

ccgcgctgta gaagtcacca ttgttgtgca cgacgacatc attccgtggc gttatccagc     1560

taagcgcgaa ctgcaatttg gagaatggca gcgcaatgac attcttgcag gtatcttcga     1620

gccagccacg atcgacattg atctggctat cttgctgaca aaagcaagag aacatagcgt     1680

tgccttggta ggtccagcgg cggaggaact ctttgatccg gttcctgaac aggatctatt     1740

tgaggcgcta aatgaaacct taacgctatg gaactcgccg cccgactggg ctggcgatga     1800

gcgaaatgta gtgcttacgt tgtcccgcat ttggtacagc gcagtaaccg gcaaaatcgc     1860

gccgaaggat gtcgctgccg actgggcaat ggagcgcctg ccggcccagt atcagcccgt     1920

catacttgaa gctagacagg cttatcttgg acaagaagaa gatcgcttgg cctcgcgcgc     1980

agatcagttg gaagaatttg tccactacgt gaaaggcgag atcaccaagg tagtcggcaa     2040

ataatgtcta gctagaaatt cgttcaagcc gacgccgctt cgcggcgcgg cttaactcaa     2100

gcgttagatg cactaagcac ataattgctc acagccaaac tatcaggtca agtctgcttt     2160

tattattttt aagcgtgcat aataagccct acacaaattg ggagatatat catgcatgac     2220

caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa     2280

aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc     2340

accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt     2400

aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg     2460

ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc     2520

agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt     2580

accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga     2640

gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct     2700

tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg     2760

cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca     2820

cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa     2880

cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt     2940

ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga     3000

taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga     3060

gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatggtg     3120

cactctcagt acaatctgct ctgatgccgc atagttaagc cagtatacac tccgctatcg     3180

ctacgtgact gggtcatggc tgcgccccga cacccgccaa cacccgctga cgcgccctga     3240

cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc cgggagctgc     3300

atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga ggcagggtgc cttgatgtgg     3360

gcgccggcgg tcgagtggcg acggcgcggc ttgtccgcgc cctggtagat tgcctggccg     3420

taggccagcc atttttgagc ggccagcggc cgcgataggc cgacgcgaag cggcggggcg     3480

tagggagcgc agcgaccgaa gggtaggcgc tttttgcagc tcttcggctg tgcgctggcc     3540

agacagttat gcacaggcca ggcgggtttt aagagtttta ataagtttta aagagtttta     3600

ggcggaaaaa tcgccttttt tctcttttat atcagtcact tacatgtgtg accggttccc     3660

aatgtacggc tttgggttcc caatgtacgg gttccggttc ccaatgtacg gctttgggtt     3720

cccaatgtac gtgctatcca caggaaagag accttttcga cctttttccc ctgctagggc     3780

aatttgccct agcatctgct ccgtacatta ggaaccggcg gatgcttcgc cctcgatcag     3840

gttgcggtag cgcatgacta ggatcgggcc agcctgcccc gcctcctcct tcaaatcgta     3900

ctccggcagg tcatttgacc cgatcagctt gcgcacggtg aaacagaact tcttgaactc     3960

tccggcgctg ccactgcgtt cgtagatcgt cttgaacaac catctggctt ctgccttgcc     4020

tgcggcgcgg cgtgccaggc ggtagagaaa acggccgatg ccgggatcga tcaaaaagta     4080

atcggggtga accgtcagca cgtccgggtt cttgccttct gtgatctcgc ggtacatcca     4140

atcagctagc tcgatctcga tgtactccgg ccgcccggtt tcgctcttta cgatcttgta     4200

gcggctaatc aaggcttcac cctcggatac cgtcaccagg cggccgttct tggccttctt     4260

cgtacgctgc atggcaacgt gcgtggtgtt taaccgaatg caggtttcta ccaggtcgtc     4320

tttctgcttt ccgccatcgg ctcgccggca gaacttgagt acgtccgcaa cgtgtggacg     4380

gaacacgcgg ccgggcttgt ctcccttccc ttcccggtat cggttcatgg attcggttag     4440

atgggaaacc gccatcagta ccaggtcgta atcccacaca ctggccatgc cggccggccc     4500

tgcggaaacc tctacgtgcc cgtctggaag ctcgtagcgg atcacctcgc cagctcgtcg     4560

gtcacgcttc gacagacgga aaacggccac gtccatgatg ctgcgactat cgcgggtgcc     4620

cacgtcatag agcatcggaa cgaaaaaatc tggttgctcg tcgcccttgg gcggcttcct     4680

aatcgacggc gcaccggctg ccggcggttg ccgggattct ttgcggattc gatcagcggc     4740

cgcttgccac gattcaccgg ggcgtgcttc tgcctcgatg cgttgccgct gggcggcctg     4800

cgcggccttc aacttctcca ccaggtcatc acccagcgcc gcgccgattt gtaccgggcc     4860

ggatggtttg cgaccgctca cgccgattcc tcgggcttgg gggttccagt gccattgcag     4920

ggccggcaga caacccagcc gcttacgcct ggccaaccgc ccgttcctcc acacatgggg     4980

cattccacgg cgtcggtgcc tggttgttct tgattttcca tgccgcctcc tttagccgct     5040

aaaattcatc tactcattta ttcatttgct catttactct ggtagctgcg cgatgtattc     5100

agatagcagc tcggtaatgg tcttgccttg gcgtaccgcg tacatcttca gcttggtgtg     5160

atcctccgcc ggcaactgaa agttgacccg cttcatggct ggcgtgtctg ccaggctggc     5220

caacgttgca gccttgctgc tgcgtgcgct cggacggccg gcacttagcg tgtttgtgct     5280

tttgctcatt ttctctttac ctcattaact caaatgagtt ttgatttaat ttcagcggcc     5340

agcgcctgga cctcgcgggc agcgtcgccc tcgggttctg attcaagaac ggttgtgccg     5400

gcggcggcag tgcctgggta gctcacgcgc tgcgtgatac gggactcaag aatgggcagc     5460

tcgtacccgg ccagcgcctc ggcaacctca ccgccgatgc gcgtgccttt gatcgcccgc     5520

gacacgacaa aggccgcttg tagccttcca tccgtgacct caatgcgctg cttaaccagc     5580

tccaccaggt cggcggtggc ccatatgtcg taagggcttg gctgcaccgg aatcagcacg     5640

aagtcggctg ccttgatcgc ggacacagcc aagtccgccg cctggggcgc tccgtcgatc     5700

actacgaagt cgcgccggcc gatggccttc acgtcgcggt caatcgtcgg gcggtcgatg     5760

ccgacaacgg ttagcggttg atcttcccgc acggccgccc aatcgcgggc actgccctgg     5820

ggatcggaat cgactaacag aacatcggcc ccggcgagtt gcagggcgcg ggctagatgg     5880

gttgcgatgg tcgtcttgcc tgacccgcct ttctggttaa gtacagcgat aaccttcatg     5940

cgttcccctt gcgtatttgt ttatttactc atcgcatcat atacgcagcg accgcatgac     6000

gcaagctgtt ttactcaaat acacatcacc tttttagacg gcggcgctcg gtttcttcag     6060

cggccaagct ggccggccag gccgccagct tggcatcaga caaaccggcc aggatttcat     6120

gcagccgcac ggttgagacg tgcgcgggcg gctcgaacac gtacccggcc gcgatcatct     6180

ccgcctcgat ctcttcggta atgaaaaacg gttcgtcctg gccgtcctgg tgcggtttca     6240

tgcttgttcc tcttggcgtt cattctcggc ggccgccagg gcgtcggcct cggtcaatgc     6300

gtcctcacgg aaggcaccgc gccgcctggc ctcggtgggc gtcacttcct cgctgcgctc     6360

aagtgcgcgg tacagggtcg agcgatgcac gccaagcagt gcagccgcct ctttcacggt     6420

gcggccttcc tggtcgatca gctcgcgggc gtgcgcgatc tgtgccgggg tgagggtagg     6480

gcgggggcca aacttcacgc ctcgggcctt ggcggcctcg cgcccgctcc gggtgcggtc     6540

gatgattagg gaacgctcga actcggcaat gccggcgaac acggtcaaca ccatgcggcc     6600

ggccggcgtg gtggtaacgc gtggtgattt tgtgccgagc tgccggtcgg ggagctgttg     6660

gctggctggt ggcaggatat attgtggtgt aaacaaattg acgcttagac aacttaataa     6720

cacattgcgg acgtctttaa tgtactgaat taacatccgt ttgatacttg tctaaaattg     6780

gctgatttcg agtgcatcta tgcataaaaa caatctaatg acaattatta ccaagcagag     6840

cttgacagga ggcccgatct agtaacatag atgacaccgc gcgcgataat ttatcctagt     6900

ttgcgcgcta tattttgttt tctatcgcgt attaaatgta taattgcggg actctaatca     6960

taaaaaccca tctcataaat aacgtcatgc attacatgtt aattattaca tgcttaacgt     7020

aattcaacag aaattatatg ataatcatcg caagaccggc aacaggattc aatcttaaga     7080

aactttattg ccaaatgttt gaacgatcgg ggatcatccg ggtctgtggc gggaactcca     7140

cgaaaatatc cgaacgcagc aagatctaga gcttgggtcc cgctcagaag aactcgtcaa     7200

gaaggcgata gaaggcgatg cgctgcgaat cgggagcggc gataccgtaa agcacgagga     7260

agcggtcagc ccattcgccg ccaagctctt cagcaatatc acgggtagcc aacgctatgt     7320

cctgatagcg gtccgccaca cccagccggc cacagtcgat gaatccagaa aagcggccat     7380

tttccaccat gatattcggc aagcaggcat cgccatgggt cacgacgaga tcctcgccgt     7440

cgggcatgcg cgccttgagc ctggcgaaca gttcggctgg cgcgagcccc tgatgctctt     7500

cgtccagatc atcctgatcg acaagaccgg cttccatccg agtacgtgct cgctcgatgc     7560

gatgtttcgc ttggtggtcg aatgggcagg tagccggatc aagcgtatgc agccgccgca     7620

ttgcatcagc catgatggat actttctcgg caggagcaag gtgagatgac aggagatcct     7680

gccccggcac ttcgcccaat agcagccagt cccttcccgc ttcagtgaca acgtcgagca     7740

cagctgcgca aggaacgccc gtcgtggcca gccacgatag ccgcgctgcc tcgtcctgca     7800

gttcattcag ggcaccggac aggtcggtct tgacaaaaag aaccgggcgc ccctgcgctg     7860

acagccggaa cacggcggca tcagagcagc cgattgtctg ttgtgcccag tcatagccga     7920

atagcctctc cacccaagcg gccggagaac ctgcgtgcaa tccatcttgt tcaatcatgc     7980

gaaacgatcc agatccggtg cagattattt ggattgagag tgaatatgag actctaattg     8040

gataccgagg ggaatttatg gaacgtcagt ggagcatttt tgacaagaaa tatttgctag     8100

ctgatagtga ccttaggcga cttttgaacg cgcaataatg gtttctgacg tatgtgctta     8160

gctcattaaa ctccagaaac ccgcggctga gtggctcctt caacgttgcg gttctgtcag     8220

ttccaaacgt aaaacggctt gtcccgcgtc atcggcgggg gtcataacgt gactccctta     8280

attctccgct catgatcaga ttgtcgtttc ccgccttcag tttaaactat cagtgtttga     8340

caggatcctg agtcgttgta aaacgacggc cagtgaatta tccggccagt gaattatcaa     8400

ctatgtataa t                                                          8411


<210>  24
<211>  10765
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid sequence

<400>  24
cgcagtggcg gttttcatgg cttgttatga ctgttttttt ggggtacagt ctatgcctcg       60

ggcatccaag cagcaagcgc gttacgccgt gggtcgatgt ttgatgttat ggagcagcaa      120

cgatgttacg cagcagggca gtcgccctaa aacaaagtta aacatcatgg gggaagcggt      180

gatcgccgaa gtatcgactc aactatcaga ggtagttggc gtcatcgagc gccatctcga      240

accgacgttg ctggccgtac atttgtacgg ctccgcagtg gatggcggcc tgaagccaca      300

cagtgatatt gatttgctgg ttacggtgac cgtaaggctt gatgaaacaa cgcggcgagc      360

tttgatcaac gaccttttgg aaacttcggc ttcccctgga gagagcgaga ttctccgcgc      420

tgtagaagtc accattgttg tgcacgacga catcattccg tggcgttatc cagctaagcg      480

cgaactgcaa tttggagaat ggcagcgcaa tgacattctt gcaggtatct tcgagccagc      540

cacgatcgac attgatctgg ctatcttgct gacaaaagca agagaacata gcgttgcctt      600

ggtaggtcca gcggcggagg aactctttga tccggttcct gaacaggatc tatttgaggc      660

gctaaatgaa accttaacgc tatggaactc gccgcccgac tgggctggcg atgagcgaaa      720

tgtagtgctt acgttgtccc gcatttggta cagcgcagta accggcaaaa tcgcgccgaa      780

ggatgtcgct gccgactggg caatggagcg cctgccggcc cagtatcagc ccgtcatact      840

tgaagctaga caggcttatc ttggacaaga agaagatcgc ttggcctcgc gcgcagatca      900

gttggaagaa tttgtccact acgtgaaagg cgagatcacc aaggtagtcg gcaaataatg      960

tctagctaga aattcgttca agccgacgcc gcttcgcggc gcggcttaac tcaagcgtta     1020

gatgcactaa gcacataatt gctcacagcc aaactatcag gtcaagtctg cttttattat     1080

ttttaagcgt gcataataag ccctacacaa attgggagat atatcatgca tgaccaaaat     1140

cccttaacgt gagttttcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc     1200

ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct     1260

accagcggtg gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg     1320

cttcagcaga gcgcagatac caaatactgt ccttctagtg tagccgtagt taggccacca     1380

cttcaagaac tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc     1440

tgctgccagt ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga     1500

taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac     1560

gacctacacc gaactgagat acctacagcg tgagctatga gaaagcgcca cgcttcccga     1620

agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag     1680

ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg     1740

acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag     1800

caacgcggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca tgttctttcc     1860

tgcgttatcc cctgattctg tggataaccg tattaccgcc tttgagtgag ctgataccgc     1920

tcgccgcagc cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg aagagcgcct     1980

gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat ggtgcactct     2040

cagtacaatc tgctctgatg ccgcatagtt aagccagtat acactccgct atcgctacgt     2100

gactgggtca tggctgcgcc ccgacacccg ccaacacccg ctgacgcgcc ctgacgggct     2160

tgtctgctcc cggcatccgc ttacagacaa gctgtgaccg tctccgggag ctgcatgtgt     2220

cagaggtttt caccgtcatc accgaaacgc gcgaggcagg gtgccttgat gtgggcgccg     2280

gcggtcgagt ggcgacggcg cggcttgtcc gcgccctggt agattgcctg gccgtaggcc     2340

agccattttt gagcggccag cggccgcgat aggccgacgc gaagcggcgg ggcgtaggga     2400

gcgcagcgac cgaagggtag gcgctttttg cagctcttcg gctgtgcgct ggccagacag     2460

ttatgcacag gccaggcggg ttttaagagt tttaataagt tttaaagagt tttaggcgga     2520

aaaatcgcct tttttctctt ttatatcagt cacttacatg tgtgaccggt tcccaatgta     2580

cggctttggg ttcccaatgt acgggttccg gttcccaatg tacggctttg ggttcccaat     2640

gtacgtgcta tccacaggaa agagaccttt tcgacctttt tcccctgcta gggcaatttg     2700

ccctagcatc tgctccgtac attaggaacc ggcggatgct tcgccctcga tcaggttgcg     2760

gtagcgcatg actaggatcg ggccagcctg ccccgcctcc tccttcaaat cgtactccgg     2820

caggtcattt gacccgatca gcttgcgcac ggtgaaacag aacttcttga actctccggc     2880

gctgccactg cgttcgtaga tcgtcttgaa caaccatctg gcttctgcct tgcctgcggc     2940

gcggcgtgcc aggcggtaga gaaaacggcc gatgccggga tcgatcaaaa agtaatcggg     3000

gtgaaccgtc agcacgtccg ggttcttgcc ttctgtgatc tcgcggtaca tccaatcagc     3060

tagctcgatc tcgatgtact ccggccgccc ggtttcgctc tttacgatct tgtagcggct     3120

aatcaaggct tcaccctcgg ataccgtcac caggcggccg ttcttggcct tcttcgtacg     3180

ctgcatggca acgtgcgtgg tgtttaaccg aatgcaggtt tctaccaggt cgtctttctg     3240

ctttccgcca tcggctcgcc ggcagaactt gagtacgtcc gcaacgtgtg gacggaacac     3300

gcggccgggc ttgtctccct tcccttcccg gtatcggttc atggattcgg ttagatggga     3360

aaccgccatc agtaccaggt cgtaatccca cacactggcc atgccggccg gccctgcgga     3420

aacctctacg tgcccgtctg gaagctcgta gcggatcacc tcgccagctc gtcggtcacg     3480

cttcgacaga cggaaaacgg ccacgtccat gatgctgcga ctatcgcggg tgcccacgtc     3540

atagagcatc ggaacgaaaa aatctggttg ctcgtcgccc ttgggcggct tcctaatcga     3600

cggcgcaccg gctgccggcg gttgccggga ttctttgcgg attcgatcag cggccgcttg     3660

ccacgattca ccggggcgtg cttctgcctc gatgcgttgc cgctgggcgg cctgcgcggc     3720

cttcaacttc tccaccaggt catcacccag cgccgcgccg atttgtaccg ggccggatgg     3780

tttgcgaccg ctcacgccga ttcctcgggc ttgggggttc cagtgccatt gcagggccgg     3840

cagacaaccc agccgcttac gcctggccaa ccgcccgttc ctccacacat ggggcattcc     3900

acggcgtcgg tgcctggttg ttcttgattt tccatgccgc ctcctttagc cgctaaaatt     3960

catctactca tttattcatt tgctcattta ctctggtagc tgcgcgatgt attcagatag     4020

cagctcggta atggtcttgc cttggcgtac cgcgtacatc ttcagcttgg tgtgatcctc     4080

cgccggcaac tgaaagttga cccgcttcat ggctggcgtg tctgccaggc tggccaacgt     4140

tgcagccttg ctgctgcgtg cgctcggacg gccggcactt agcgtgtttg tgcttttgct     4200

cattttctct ttacctcatt aactcaaatg agttttgatt taatttcagc ggccagcgcc     4260

tggacctcgc gggcagcgtc gccctcgggt tctgattcaa gaacggttgt gccggcggcg     4320

gcagtgcctg ggtagctcac gcgctgcgtg atacgggact caagaatggg cagctcgtac     4380

ccggccagcg cctcggcaac ctcaccgccg atgcgcgtgc ctttgatcgc ccgcgacacg     4440

acaaaggccg cttgtagcct tccatccgtg acctcaatgc gctgcttaac cagctccacc     4500

aggtcggcgg tggcccatat gtcgtaaggg cttggctgca ccggaatcag cacgaagtcg     4560

gctgccttga tcgcggacac agccaagtcc gccgcctggg gcgctccgtc gatcactacg     4620

aagtcgcgcc ggccgatggc cttcacgtcg cggtcaatcg tcgggcggtc gatgccgaca     4680

acggttagcg gttgatcttc ccgcacggcc gcccaatcgc gggcactgcc ctggggatcg     4740

gaatcgacta acagaacatc ggccccggcg agttgcaggg cgcgggctag atgggttgcg     4800

atggtcgtct tgcctgaccc gcctttctgg ttaagtacag cgataacctt catgcgttcc     4860

ccttgcgtat ttgtttattt actcatcgca tcatatacgc agcgaccgca tgacgcaagc     4920

tgttttactc aaatacacat caccttttta gacggcggcg ctcggtttct tcagcggcca     4980

agctggccgg ccaggccgcc agcttggcat cagacaaacc ggccaggatt tcatgcagcc     5040

gcacggttga gacgtgcgcg ggcggctcga acacgtaccc ggccgcgatc atctccgcct     5100

cgatctcttc ggtaatgaaa aacggttcgt cctggccgtc ctggtgcggt ttcatgcttg     5160

ttcctcttgg cgttcattct cggcggccgc cagggcgtcg gcctcggtca atgcgtcctc     5220

acggaaggca ccgcgccgcc tggcctcggt gggcgtcact tcctcgctgc gctcaagtgc     5280

gcggtacagg gtcgagcgat gcacgccaag cagtgcagcc gcctctttca cggtgcggcc     5340

ttcctggtcg atcagctcgc gggcgtgcgc gatctgtgcc ggggtgaggg tagggcgggg     5400

gccaaacttc acgcctcggg ccttggcggc ctcgcgcccg ctccgggtgc ggtcgatgat     5460

tagggaacgc tcgaactcgg caatgccggc gaacacggtc aacaccatgc ggccggccgg     5520

cgtggtggta acgcgtggtg attttgtgcc gagctgccgg tcggggagct gttggctggc     5580

tggtggcagg atatattgtg gtgtaaacaa attgacgctt agacaactta ataacacatt     5640

gcggacgtct ttaatgtact gaattaacat ccgtttgata cttgtctaaa attggctgat     5700

ttcgagtgca tctatgcata aaaacaatct aatgacaatt attaccaagc aggatcctgt     5760

caaacactga tagtttaaac tgaaggcggg aaacgacaat ctgatcatga gcggagaatt     5820

aagggagtca cgttatgacc cccgccgatg acgcgggaca agccgtttta cgtttggaac     5880

tgacagaacc gcaacgttga aggagccact cagccgcggg tttctggagt ttaatgagct     5940

aagcacatac gtcagaaacc attattgcgc gttcaaaagt cgcctaaggt cactatcagc     6000

tagcaaatat ttcttgtcaa aaatgctcca ctgacgttcc ataaattccc ctcggtatcc     6060

aattagagtc tcatattcac tctcaatcca aataatctcg acatgtctcc ggagaggaga     6120

ccagttgaga ttaggccagc tacagcagcc gatatggccg cggtttgtga catcgttaac     6180

cattacattg agacgtctac agtgaacttt aggacagagc cacaaacacc acaagagtgg     6240

attgatgacc tagagaggtt gcaagataga tacccttggt tggttgctga ggttgagggt     6300

gttgtggctg gtattgctta cgctgggccc tggaaggcta ggaacgctta cgattggaca     6360

gttgagagta ctgtttacgt gtcacatagg catcaaaggt tgggcctagg atctacattg     6420

tacacacatt tgcttaagtc tatggaggcg caaggtttta agtctgtggt tgctgttata     6480

ggccttccaa acgatccatc tgttaggttg catgaggctt tgggatacac agcgcggggt     6540

acattgcgcg cggctggata caagcatggt ggatggcatg atgttggttt ttggcaaagg     6600

gattttgagt tgccagctcc tccaaggcca gttaggccag ttacccagat ctgagtcgat     6660

cgaccgatct tgctgcgttc ggatattttc gtggagttcc cgccacagac ccggatgatc     6720

cccgatcgtt caaacatttg gcaataaagt ttcttaagat tgaatcctgt tgccggtctt     6780

gcgatgatta tcatataatt tctgttgaat tacgttaagc atgtaataat taacatgtaa     6840

tgcatgacgt tatttatgag atgggttttt atgattagag tcccgcaatt atacatttaa     6900

tacgcgatag aaaacaaaat atagcgcgca aactaggata aattatcgcg cgcggtgtca     6960

tctatgttac tagatcgggc ctcctgtcaa gctggctgag tcgttgtaaa acgacggcca     7020

gtgaattcga gctcggtacc gagtcaaaga ttcaaataga ggacctaaca gaactcgccg     7080

taaagactgg cgaacagttc atacagagtc tcttacgact caatgacaag aagaaaatct     7140

tcgtcaacat ggtggagcac gacacgcttg tctactccaa aaatatcaaa gatacagtct     7200

cagaagacca aagggcaatt gagacttttc aacaaagggt aatatccgga aacctcctcg     7260

gattccattg cccagctatc tgtcacttta ttgtgaagat agtggaaaag gaaggtggct     7320

cctacaaatg ccatcattgc gataaaggaa aggccatcgt tgaagatgcc tctgccgaca     7380

gtggtcccaa agatggaccc ccacccacga ggagcatcgt ggaaaaagaa gacgttccaa     7440

ccacgtcttc aaagcaagtg gattgatgtg atatctccac tgacgtaagg gatgacgcac     7500

aatcccacta tccttcgcaa gacccttcct ctatataagg aagttcattt catttggaga     7560

ggacagggta cgtacctaga atacaaagaa gaggaagaag aaacctctac agaagaaagt     7620

gatggatccc cgggatcatc tacttctgaa gactcagact cagactaagc aggtgacgaa     7680

cgtcaccaat cccaattcga tctacatccg tcctgtagaa accccaaccc gtgaaatcaa     7740

aaaactcgac ggcctgtggg cattcagtct ggatcgcgaa aactgtggaa ttgatcagcg     7800

ttggtgggaa agcgcgttac aagaaagccg ggcaattgct gtgccaggca gttttaacga     7860

tcagttcgcc gatgcagata ttcgtaatta tgcgggcaac gtctggtatc agcgcgaagt     7920

ctttataccg aaaggttggg caggccagcg tatcgtgctg cgtttcgatg cggtcactca     7980

ttacggcaaa gtgtgggtca ataatcagga agtgatggag catcagggcg gctatacgcc     8040

atttgaagcc gatgtcacgc cgtatgttat tgccgggaaa agtgtacgta tcaccgtttg     8100

tgtgaacaac gaactgaact ggcagactat cccgccggga atggtgatta ccgacgaaaa     8160

cggcaagaaa aagcagtctt acttccatga tttctttaac tatgccggaa tccatcgcag     8220

cgtaatgctc tacaccacgc cgaacacctg ggtggacgat atcaccgtgg tgacgcatgt     8280

cgcgcaagac tgtaaccacg cgtctgttga ctggcaggtg gtgccagcgg ccgcctaggg     8340

ataacagggt aatagtctag tccgaaaacg ccgtgagaca tattggttac gatcctaagg     8400

tagcgaaatt cacccggtaa ctctgtgcca gctagagtcc tgtagaaacc ccaacccgtg     8460

aaatcaaaaa actcgacggc ctgtgggcat tcagtctgga ccgcgaaaac tgtggaattg     8520

atcagcgttg gtgggaaagc gcgttacaag aaagccgggc aattgctgtg ccaggcagtt     8580

ttaacgatca gttcgccgat gcagatattc gtaattatgc gggcaacgtc tggtatcagc     8640

gcgaagtctt tataccgaaa ggttgggcag gccagcgtat cgtgctgcgt ttcgatgcgg     8700

tcactcatta cggcaaagtg tgggtcaata atcaggaagt gatggagcat cagggcggct     8760

atacgccatt tgaagccgat gtcacgccgt atgttattgc cgggaaaagt gtacgtatca     8820

ccgtttgtgt gaacaacgaa ctgaactggc agactatccc gccgggaatg gtgattaccg     8880

acgaaaacgg caagaaaaag cagtcttact tccatgattt ctttaactat gccggaatcc     8940

atcgcagcgt aatgctctac accacgccga acacctgggt ggacgatatc accgtggtga     9000

cgcatgtcgc gcaagactgt aaccacgcgt ctgttgactg gcaggtggtg gccaatggtg     9060

atgtcagcgt tgaactgcgt gatgcggatc aacaggtggt tgcaactgga caaggcacta     9120

gcgggacttt gcaagtggtg aatccgcacc tctggcaacc gggtgaaggt tatctctatg     9180

aactgtgcgt cacagccaaa agccagacag agtgtgatat ctacccgctt cgcgtcggca     9240

tccggtcagt ggcagtgaag ggcgaacagt tcctgattaa ccacaaaccg ttctacttta     9300

ctggctttgg tcgtcatgaa gatgcggact tgcgtggcaa aggattcgat aacgtgctga     9360

tggtgcacga ccacgcatta atggactgga ttggggccaa ctcctaccgt acctcgcatt     9420

acccttacgc tgaagagatg ctcgactggg cagatgaaca tggcatcgtg gtgattgatg     9480

aaactgctgc tgtcggcttt aacctctctt taggcattgg tttcgaagcg ggcaacaagc     9540

cgaaagaact gtacagcgaa gaggcagtca acggggaaac tcagcaagcg cacttacagg     9600

cgattaaaga gctgatagcg cgtgacaaaa accacccaag cgtggtgatg tggagtattg     9660

ccaacgaacc ggatacccgt ccgcaaggtg cacgggaata tttcgcgcca ctggcggaag     9720

caacgcgtaa actcgacccg acgcgtccga tcacctgcgt caatgtaatg ttctgcgacg     9780

ctcacaccga taccatcagc gatctctttg atgtgctgtg cctgaaccgt tattacggat     9840

ggtatgtcca aagcggcgat ttggaagcgg cagagaaggt actggaaaaa gaacttctgg     9900

cctggcagga gaaactgcat cagccgatta tcatcaccga atacggcgtg gatacgttag     9960

ccgggctgca ctcaatgtac accgacatgt ggagtgaaga gtatcagtgt gcatggctgg    10020

atatgtatca ccgcgtcttt gatcgcgtca gcgccgtcgt cggtgaacag gtatggaatt    10080

tcgccgattt tgcgacctcg caaggcatat tgcgcgttgg cggtaacaag aaagggatct    10140

tcactcgcga ccgcaaaccg aagtcggcgg cttttctgct gcaaaaacgc tggactggca    10200

tgaacttcgg tgaaaaaccg cagcagggag gcaaacaatg aatcaacaac tctcctggcg    10260

caccatcgtc ggctacagcc tcgggaattg ctaccgagct cgaatttccc cgatcgttca    10320

aacatttggc aataaagttt cttaagattg aatcctgttg ccggacttgc gatgattatc    10380

atataatttc tgttgaatta cgttaagcat gtaataatta acatgtaatg catgacgtta    10440

tttatgagat gggtttttat gattagagtc ccgcaattat acatttaata cgcgatagaa    10500

aacaaaatat agcgcgcaaa ctaggataaa ttatcgcgcg cggtgtcatc tatgttacta    10560

gatcggaata agcttggcgt aatcatggtc atagctgttt cctactagat ctgattgtcg    10620

tttcccgcct tcagtttaaa ctatcagtgt ttgacaggat atattggcgg gtaaacctaa    10680

gagaaaagag cgtttattag aataatcgga tatttaaaag ggcgtgaaaa ggtttatccg    10740

ttcgtccatt tgtatgtcca tggaa                                          10765


<210>  25
<211>  15
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  sequence motif of I-SceI

<400>  25

His Val Cys Leu Leu Tyr Asp Gln Trp Val Leu Ser Pro Pro His 
1               5                   10                  15  


<210>  26
<211>  11
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  sequence motif of I-SceI

<400>  26

Leu Ala Tyr Trp Phe Met Asp Asp Gly Gly Lys 
1               5                   10      


<210>  27
<211>  27
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  sequence motif of I-SceI

<400>  27

Lys Thr Ile Pro Asn Asn Leu Val Glu Asn Tyr Leu Thr Pro Met Ser 
1               5                   10                  15      


Leu Ala Tyr Trp Phe Met Asp Asp Gly Gly Lys 
            20                  25          


<210>  28
<211>  19
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  sequence motif of I-SceI

<400>  28

Lys Pro Ile Ile Tyr Ile Asp Ser Met Ser Tyr Leu Ile Phe Tyr Asn 
1               5                   10                  15      


Leu Ile Lys 
            


<210>  29
<211>  13
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  sequence motif of I-SceI

<400>  29

Lys Leu Pro Asn Thr Ile Ser Ser Glu Thr Phe Leu Lys 
1               5                   10              


<210>  30
<211>  238
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  I-SceI having a 5 amino acid deletion at C-terminus

<400>  30

Met Gly Pro Lys Lys Lys Arg Lys Val Lys Asn Ile Lys Lys Asn Gln 
1               5                   10                  15      


Val Met Asn Leu Gly Pro Asn Ser Lys Leu Leu Lys Glu Tyr Lys Ser 
            20                  25                  30          


Gln Leu Ile Glu Leu Asn Ile Glu Gln Phe Glu Ala Gly Ile Gly Leu 
        35                  40                  45              


Ile Leu Gly Asp Ala Tyr Ile Arg Ser Arg Asp Glu Gly Lys Thr Tyr 
    50                  55                  60                  


Cys Met Gln Phe Glu Trp Lys Asn Lys Ala Tyr Met Asp His Val Cys 
65                  70                  75                  80  


Leu Leu Tyr Asp Gln Trp Val Leu Ser Pro Pro His Lys Lys Glu Arg 
                85                  90                  95      


Val Asn His Leu Gly Asn Leu Val Ile Thr Trp Gly Ala Gln Thr Phe 
            100                 105                 110         


Lys His Gln Ala Phe Asn Lys Leu Ala Asn Leu Phe Ile Val Asn Asn 
        115                 120                 125             


Lys Lys Thr Ile Pro Asn Asn Leu Val Glu Asn Tyr Leu Thr Pro Met 
    130                 135                 140                 


Ser Leu Ala Tyr Trp Phe Met Asp Asp Gly Gly Lys Trp Asp Tyr Asn 
145                 150                 155                 160 


Lys Asn Ser Thr Asn Lys Ser Ile Val Leu Asn Thr Gln Ser Phe Thr 
                165                 170                 175     


Phe Glu Glu Val Glu Tyr Leu Val Lys Gly Leu Arg Asn Lys Phe Gln 
            180                 185                 190         


Leu Asn Cys Tyr Val Lys Ile Asn Lys Asn Lys Pro Ile Ile Tyr Ile 
        195                 200                 205             


Asp Ser Met Ser Tyr Leu Ile Phe Tyr Asn Leu Ile Lys Pro Tyr Leu 
    210                 215                 220                 


Ile Pro Gln Met Met Tyr Lys Leu Pro Asn Thr Ile Ser Ser 
225                 230                 235             


