                         SUBSTITUTE SEQUENCE LISTING

<110>  Agrivida, Inc.
       Shen, Binzhang
       Apgar, James
       Bougri, Oleg
       Raab, R. Michael
 
<120>  CELLULOSIC PROCESSING TRAIT DEVELOPMENT USING A THERMOREGULATED, 
       INTEIN-MODIFIED XYLANASE

<130>  AGR-PT013WO

<150>  61/377,759
<151>  2010-08-27

<160>  74    

<170>  PatentIn version 3.5

<210>  1
<211>  2277
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, S158-39

<400>  1
caaacaagca ttactctgac atccaacgca tccggtacgt ttgacggtta ctattacgaa       60

ctctggaagg atactggcaa tacaacaatg acggtctaca ctcaaggtcg cttttcctgc      120

cagtggtcga acatcaataa cgcgttgttt aggaccggga agaaatacaa ccagaattgg      180

cagtctcttg gcacaatccg gatcacgtac tctgcgactt acaacccaaa cgggaactcc      240

tacttgtgta tctatggctg gtctaccaac ccattggtcg agttctacat cgttgagtcc      300

tgggggaact ggagaccgcc tggtgccacg tccctgggcc aagtgacaat cgatggcggg      360

acctacgaca tctataggac gacacgcgtc aaccagcctt gcctggccga gggctcgctc      420

gtcttggacg cggctaccgg gcagagggtc cctatcgaaa aggtgcgtcc ggggatggaa      480

gttttctcct tgggacctga ttacagactg tatcgggtgc ccgttttgga ggtccttgag      540

agcggggttg gggaagttgt gcgcctcaga actcggtcag ggagaacgct ggtgttgaca      600

ccagatcacc cgcttttgac ccccgaaggt tggaaacctc tttgtgacct cccgcttgga      660

actccaattg cagtccccgc agaactgcct gtggcgggcc acttggcccc acctgaagaa      720

cgtgttacgc tcctggctct tctgttgggg gatgggaaca caaagctgtc gggtcggaga      780

ggtacacgtc ctaatgcctt cttctacagc aaagaccccg aattgctcgc ggcttatcgc      840

cggtgtgcag aagccttggg tgcaaaggtg aaagcatacg tccacccgac tacgggggtg      900

gttacactcg caaccctcgc tccacgtcct ggagctcaag atcctgtcaa acgcctcgtt      960

gtcgaggcgg gaatggttgc taaagccgaa gagaagaggg tcccggagga ggtgtttcgt     1020

taccggcgtg aggcgttggc ccttttcttg ggccgtttgt cctcgacaga cggctctgtt     1080

gaaaggaaga ggatctctta ttcaagtgcc agtttgggac tggcccagga tgtcgcacat     1140

ctcttgctgc gccttggaat tacatctcaa ctccgttcga gagggccacg ggctcacgag     1200

gttcttatat cgggccgcga ggatattttg cggtttgctg aacttatcgg accctacctc     1260

ttgggggcca agagggagag acttgcagcg ctggaagctg aggcccgcag gcgtttgcct     1320

ggacagggat ggcacttgcg gcttgttctt cctgccgtgg cgtacagagt gagcgaggct     1380

aaaaggcgct cgggattttc gtggagtgaa gccggtcagc gcgtcgcagt tgcgggatcg     1440

tgtttgtcat ctggactcaa cctcaaattg cccagacgct acctttctcg gcaccggttg     1500

tcgctgctcg gtgaggcttt tgccgaccct gggctggaag cgctcgcgga aggccaagtg     1560

ctctgggacc ctattgttgc tgtcgaaccg gccggtaagg cgagaacatt cgacttgcgc     1620

gttccaccct ttgcaaactt cgtgagcgag gacctggtgg tgcataactc cattgtgggg     1680

acagccacgt tcgatcagta ctggagcgtg cgcacctcta agcggacttc aggaacagtg     1740

accgtgaccg atcacttccg cgcctgggcg aaccggggcc tgaacctcgg cacaatagac     1800

caaattacat tgtgcgtgga gggttaccaa agctctggat cagccaacat cacccagaac     1860

accttctctc agggctcttc ttccggcagt tcgggtggct catccggctc cacaacgact     1920

actcgcatcg agtgtgagaa catgtccttg tccggaccct acgttagcag gatcaccaat     1980

ccctttaatg gtattgcgct gtacgccaac ggagacacag cccgcgctac cgttaacttc     2040

cccgcaagtc gcaactacaa tttccgcctg cggggttgcg gcaacaacaa taatcttgcc     2100

cgtgtggacc tgaggatcga cggacggacc gtcgggacct tttattacca gggcacatac     2160

ccctgggagg ccccaattga caatgtttat gtcagtgcgg ggagtcatac agtcgaaatc     2220

actgttactg cggataacgg cacatgggac gtgtatgccg actacctggt gatacag        2277


<210>  2
<211>  759
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, S158-39

<400>  2

Gln Thr Ser Ile Thr Leu Thr Ser Asn Ala Ser Gly Thr Phe Asp Gly 
1               5                   10                  15      


Tyr Tyr Tyr Glu Leu Trp Lys Asp Thr Gly Asn Thr Thr Met Thr Val 
            20                  25                  30          


Tyr Thr Gln Gly Arg Phe Ser Cys Gln Trp Ser Asn Ile Asn Asn Ala 
        35                  40                  45              


Leu Phe Arg Thr Gly Lys Lys Tyr Asn Gln Asn Trp Gln Ser Leu Gly 
    50                  55                  60                  


Thr Ile Arg Ile Thr Tyr Ser Ala Thr Tyr Asn Pro Asn Gly Asn Ser 
65                  70                  75                  80  


Tyr Leu Cys Ile Tyr Gly Trp Ser Thr Asn Pro Leu Val Glu Phe Tyr 
                85                  90                  95      


Ile Val Glu Ser Trp Gly Asn Trp Arg Pro Pro Gly Ala Thr Ser Leu 
            100                 105                 110         


Gly Gln Val Thr Ile Asp Gly Gly Thr Tyr Asp Ile Tyr Arg Thr Thr 
        115                 120                 125             


Arg Val Asn Gln Pro Cys Leu Ala Glu Gly Ser Leu Val Leu Asp Ala 
    130                 135                 140                 


Ala Thr Gly Gln Arg Val Pro Ile Glu Lys Val Arg Pro Gly Met Glu 
145                 150                 155                 160 


Val Phe Ser Leu Gly Pro Asp Tyr Arg Leu Tyr Arg Val Pro Val Leu 
                165                 170                 175     


Glu Val Leu Glu Ser Gly Val Gly Glu Val Val Arg Leu Arg Thr Arg 
            180                 185                 190         


Ser Gly Arg Thr Leu Val Leu Thr Pro Asp His Pro Leu Leu Thr Pro 
        195                 200                 205             


Glu Gly Trp Lys Pro Leu Cys Asp Leu Pro Leu Gly Thr Pro Ile Ala 
    210                 215                 220                 


Val Pro Ala Glu Leu Pro Val Ala Gly His Leu Ala Pro Pro Glu Glu 
225                 230                 235                 240 


Arg Val Thr Leu Leu Ala Leu Leu Leu Gly Asp Gly Asn Thr Lys Leu 
                245                 250                 255     


Ser Gly Arg Arg Gly Thr Arg Pro Asn Ala Phe Phe Tyr Ser Lys Asp 
            260                 265                 270         


Pro Glu Leu Leu Ala Ala Tyr Arg Arg Cys Ala Glu Ala Leu Gly Ala 
        275                 280                 285             


Lys Val Lys Ala Tyr Val His Pro Thr Thr Gly Val Val Thr Leu Ala 
    290                 295                 300                 


Thr Leu Ala Pro Arg Pro Gly Ala Gln Asp Pro Val Lys Arg Leu Val 
305                 310                 315                 320 


Val Glu Ala Gly Met Val Ala Lys Ala Glu Glu Lys Arg Val Pro Glu 
                325                 330                 335     


Glu Val Phe Arg Tyr Arg Arg Glu Ala Leu Ala Leu Phe Leu Gly Arg 
            340                 345                 350         


Leu Ser Ser Thr Asp Gly Ser Val Glu Arg Lys Arg Ile Ser Tyr Ser 
        355                 360                 365             


Ser Ala Ser Leu Gly Leu Ala Gln Asp Val Ala His Leu Leu Leu Arg 
    370                 375                 380                 


Leu Gly Ile Thr Ser Gln Leu Arg Ser Arg Gly Pro Arg Ala His Glu 
385                 390                 395                 400 


Val Leu Ile Ser Gly Arg Glu Asp Ile Leu Arg Phe Ala Glu Leu Ile 
                405                 410                 415     


Gly Pro Tyr Leu Leu Gly Ala Lys Arg Glu Arg Leu Ala Ala Leu Glu 
            420                 425                 430         


Ala Glu Ala Arg Arg Arg Leu Pro Gly Gln Gly Trp His Leu Arg Leu 
        435                 440                 445             


Val Leu Pro Ala Val Ala Tyr Arg Val Ser Glu Ala Lys Arg Arg Ser 
    450                 455                 460                 


Gly Phe Ser Trp Ser Glu Ala Gly Gln Arg Val Ala Val Ala Gly Ser 
465                 470                 475                 480 


Cys Leu Ser Ser Gly Leu Asn Leu Lys Leu Pro Arg Arg Tyr Leu Ser 
                485                 490                 495     


Arg His Arg Leu Ser Leu Leu Gly Glu Ala Phe Ala Asp Pro Gly Leu 
            500                 505                 510         


Glu Ala Leu Ala Glu Gly Gln Val Leu Trp Asp Pro Ile Val Ala Val 
        515                 520                 525             


Glu Pro Ala Gly Lys Ala Arg Thr Phe Asp Leu Arg Val Pro Pro Phe 
    530                 535                 540                 


Ala Asn Phe Val Ser Glu Asp Leu Val Val His Asn Ser Ile Val Gly 
545                 550                 555                 560 


Thr Ala Thr Phe Asp Gln Tyr Trp Ser Val Arg Thr Ser Lys Arg Thr 
                565                 570                 575     


Ser Gly Thr Val Thr Val Thr Asp His Phe Arg Ala Trp Ala Asn Arg 
            580                 585                 590         


Gly Leu Asn Leu Gly Thr Ile Asp Gln Ile Thr Leu Cys Val Glu Gly 
        595                 600                 605             


Tyr Gln Ser Ser Gly Ser Ala Asn Ile Thr Gln Asn Thr Phe Ser Gln 
    610                 615                 620                 


Gly Ser Ser Ser Gly Ser Ser Gly Gly Ser Ser Gly Ser Thr Thr Thr 
625                 630                 635                 640 


Thr Arg Ile Glu Cys Glu Asn Met Ser Leu Ser Gly Pro Tyr Val Ser 
                645                 650                 655     


Arg Ile Thr Asn Pro Phe Asn Gly Ile Ala Leu Tyr Ala Asn Gly Asp 
            660                 665                 670         


Thr Ala Arg Ala Thr Val Asn Phe Pro Ala Ser Arg Asn Tyr Asn Phe 
        675                 680                 685             


Arg Leu Arg Gly Cys Gly Asn Asn Asn Asn Leu Ala Arg Val Asp Leu 
    690                 695                 700                 


Arg Ile Asp Gly Arg Thr Val Gly Thr Phe Tyr Tyr Gln Gly Thr Tyr 
705                 710                 715                 720 


Pro Trp Glu Ala Pro Ile Asp Asn Val Tyr Val Ser Ala Gly Ser His 
                725                 730                 735     


Thr Val Glu Ile Thr Val Thr Ala Asp Asn Gly Thr Trp Asp Val Tyr 
            740                 745                 750         


Ala Asp Tyr Leu Val Ile Gln 
        755                 


<210>  3
<211>  2277
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, S158-21

<400>  3
caaacaagca ttactctgac atccaacgca tccggtacgt ttgacggtta ctattacgaa       60

ctctggaagg atactggcaa tacaacaatg acggtctaca ctcaaggtcg cttttcctgc      120

cagtggtcga acatcaataa cgcgttgttt aggaccggga agaaatacaa ccagaattgg      180

cagtctcttg gcacaatccg gatcacgtac tctgcgactt acaacccaaa cgggaactcc      240

tacttgtgta tctatggctg gtctaccaac ccattggtcg agttctacat cgttgagtcc      300

tgggggaact ggagaccgcc tggtgccacg tccctgggcc aagtgacaat cgatggcggg      360

acctacgaca tctataggac gacacgcgtc aaccagcctt gcctggccga gggctcgctc      420

gtcttggacg cggctaccgg gcagagggtc cctatcgaaa aggtgcgtcc ggggatggaa      480

gttttctcct tgggacctga ttacagactg tatcgggtgc ccgttttgga ggtccttgag      540

agcggggtta gggaagttgt gcgcctcaga actcggtcag ggagaacgct ggtgttgaca      600

ccagatcacc cgcttttgac ccccgaaggt tggaaacctc tttgtgacct cccgcttgga      660

actccaattg cagtccccgc agaactgcct gtggcgggcc acttggcccc acctgaagaa      720

cgtgttacgc tcctggctct tctgttgggg gatgggaaca caaagctgtc gggtcggaga      780

ggtacacgtc ctaatgcctt cttctacagc aaagaccccg aattgctcgc ggcttatcgc      840

cggtgtggag aagccttggg tgcaaaggtg aaagcatacg tccacccgac tacgggggtg      900

gttacactcg caaccctcgc tccacgtcct ggagctcaag atcctgtcaa acgcctcgtt      960

gtcgaggcgg gaatggttgc taaagccgaa gagaagaggg tcccggagga ggtgtttcgt     1020

taccggcgtg aggcgttggc ccttttcttg ggccgtttgt tctcgacaga cggctctgtt     1080

gaaaagaaga ggatctctta ttcaagtgcc agtttgggac tggcccagga tgtcgcacat     1140

ctcttgctgc gccttggaat tacatctcaa ctccgttcga gagggccacg ggctcacgag     1200

gttcttatat cgggccgcga ggatattttg cggtttgctg aacttatcgg accctacctc     1260

ttgggggcca agagggagag acttgcagcg ctggaagctg aggcccgcag gcgtttgcct     1320

ggacagggat ggcacttgcg gcttgttctt cctgccgtgg cgtacagagt gagcgaggct     1380

aaaaggcgct cgggattttc gtggagtgaa gccggtcggc gcgtcgcagt tgcgggatcg     1440

tgtttgtcat ctggactcaa cctcaaattg cccagacgct acctttctca gcaccggttg     1500

tcgctgctcg gtgaggcttt tgccgaccct gggctggaag cgctcgcgga aggccaagtg     1560

ctctgggacc ctattgttgc tgtcgaaccg gccggtaagg cgagaacatt cgacttgcgc     1620

gttccaccct ttgcaaactt cgtgagcgag gacctggtgg tgcataactc cattgtgggg     1680

acagccacgt tcgatcagta ctggagcgtg cgcacctcta agcggacttc aggaacagtg     1740

accgtgaccg atcacttccg cgcctgggcg aaccggggcc tgaacctcgg cacaatagac     1800

caaattacat tgtgcgtgga gggttaccaa agctctggat cagccaacat cacccagaac     1860

accttctctc agggctcttc ttccggcagt tcgggtggct catccggctc cacaacgact     1920

actcgcatcg agtgtgagaa catgtccttg tccggaccct acgttagcag gatcaccaat     1980

ccctttaatg gtattgcgct gtacgccaac ggagacacag cccgcgctac cgttaacttc     2040

cccgcaagtc gcaactacaa tttccgcctg cggggttgcg gcaacaacaa taatcttgcc     2100

cgtgtggacc tgaggatcga cggacggacc gtcgggacct tttattacca gggcacatac     2160

ccctgggagg ccccaattga caatgtttat gtcagtgcgg ggagtcatac agtcgaaatc     2220

actgttactg cggataacgg cacatgggac gtgtatgccg actacctggt gatacag        2277


<210>  4
<211>  759
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, S158-21

<400>  4

Gln Thr Ser Ile Thr Leu Thr Ser Asn Ala Ser Gly Thr Phe Asp Gly 
1               5                   10                  15      


Tyr Tyr Tyr Glu Leu Trp Lys Asp Thr Gly Asn Thr Thr Met Thr Val 
            20                  25                  30          


Tyr Thr Gln Gly Arg Phe Ser Cys Gln Trp Ser Asn Ile Asn Asn Ala 
        35                  40                  45              


Leu Phe Arg Thr Gly Lys Lys Tyr Asn Gln Asn Trp Gln Ser Leu Gly 
    50                  55                  60                  


Thr Ile Arg Ile Thr Tyr Ser Ala Thr Tyr Asn Pro Asn Gly Asn Ser 
65                  70                  75                  80  


Tyr Leu Cys Ile Tyr Gly Trp Ser Thr Asn Pro Leu Val Glu Phe Tyr 
                85                  90                  95      


Ile Val Glu Ser Trp Gly Asn Trp Arg Pro Pro Gly Ala Thr Ser Leu 
            100                 105                 110         


Gly Gln Val Thr Ile Asp Gly Gly Thr Tyr Asp Ile Tyr Arg Thr Thr 
        115                 120                 125             


Arg Val Asn Gln Pro Cys Leu Ala Glu Gly Ser Leu Val Leu Asp Ala 
    130                 135                 140                 


Ala Thr Gly Gln Arg Val Pro Ile Glu Lys Val Arg Pro Gly Met Glu 
145                 150                 155                 160 


Val Phe Ser Leu Gly Pro Asp Tyr Arg Leu Tyr Arg Val Pro Val Leu 
                165                 170                 175     


Glu Val Leu Glu Ser Gly Val Arg Glu Val Val Arg Leu Arg Thr Arg 
            180                 185                 190         


Ser Gly Arg Thr Leu Val Leu Thr Pro Asp His Pro Leu Leu Thr Pro 
        195                 200                 205             


Glu Gly Trp Lys Pro Leu Cys Asp Leu Pro Leu Gly Thr Pro Ile Ala 
    210                 215                 220                 


Val Pro Ala Glu Leu Pro Val Ala Gly His Leu Ala Pro Pro Glu Glu 
225                 230                 235                 240 


Arg Val Thr Leu Leu Ala Leu Leu Leu Gly Asp Gly Asn Thr Lys Leu 
                245                 250                 255     


Ser Gly Arg Arg Gly Thr Arg Pro Asn Ala Phe Phe Tyr Ser Lys Asp 
            260                 265                 270         


Pro Glu Leu Leu Ala Ala Tyr Arg Arg Cys Gly Glu Ala Leu Gly Ala 
        275                 280                 285             


Lys Val Lys Ala Tyr Val His Pro Thr Thr Gly Val Val Thr Leu Ala 
    290                 295                 300                 


Thr Leu Ala Pro Arg Pro Gly Ala Gln Asp Pro Val Lys Arg Leu Val 
305                 310                 315                 320 


Val Glu Ala Gly Met Val Ala Lys Ala Glu Glu Lys Arg Val Pro Glu 
                325                 330                 335     


Glu Val Phe Arg Tyr Arg Arg Glu Ala Leu Ala Leu Phe Leu Gly Arg 
            340                 345                 350         


Leu Phe Ser Thr Asp Gly Ser Val Glu Lys Lys Arg Ile Ser Tyr Ser 
        355                 360                 365             


Ser Ala Ser Leu Gly Leu Ala Gln Asp Val Ala His Leu Leu Leu Arg 
    370                 375                 380                 


Leu Gly Ile Thr Ser Gln Leu Arg Ser Arg Gly Pro Arg Ala His Glu 
385                 390                 395                 400 


Val Leu Ile Ser Gly Arg Glu Asp Ile Leu Arg Phe Ala Glu Leu Ile 
                405                 410                 415     


Gly Pro Tyr Leu Leu Gly Ala Lys Arg Glu Arg Leu Ala Ala Leu Glu 
            420                 425                 430         


Ala Glu Ala Arg Arg Arg Leu Pro Gly Gln Gly Trp His Leu Arg Leu 
        435                 440                 445             


Val Leu Pro Ala Val Ala Tyr Arg Val Ser Glu Ala Lys Arg Arg Ser 
    450                 455                 460                 


Gly Phe Ser Trp Ser Glu Ala Gly Arg Arg Val Ala Val Ala Gly Ser 
465                 470                 475                 480 


Cys Leu Ser Ser Gly Leu Asn Leu Lys Leu Pro Arg Arg Tyr Leu Ser 
                485                 490                 495     


Gln His Arg Leu Ser Leu Leu Gly Glu Ala Phe Ala Asp Pro Gly Leu 
            500                 505                 510         


Glu Ala Leu Ala Glu Gly Gln Val Leu Trp Asp Pro Ile Val Ala Val 
        515                 520                 525             


Glu Pro Ala Gly Lys Ala Arg Thr Phe Asp Leu Arg Val Pro Pro Phe 
    530                 535                 540                 


Ala Asn Phe Val Ser Glu Asp Leu Val Val His Asn Ser Ile Val Gly 
545                 550                 555                 560 


Thr Ala Thr Phe Asp Gln Tyr Trp Ser Val Arg Thr Ser Lys Arg Thr 
                565                 570                 575     


Ser Gly Thr Val Thr Val Thr Asp His Phe Arg Ala Trp Ala Asn Arg 
            580                 585                 590         


Gly Leu Asn Leu Gly Thr Ile Asp Gln Ile Thr Leu Cys Val Glu Gly 
        595                 600                 605             


Tyr Gln Ser Ser Gly Ser Ala Asn Ile Thr Gln Asn Thr Phe Ser Gln 
    610                 615                 620                 


Gly Ser Ser Ser Gly Ser Ser Gly Gly Ser Ser Gly Ser Thr Thr Thr 
625                 630                 635                 640 


Thr Arg Ile Glu Cys Glu Asn Met Ser Leu Ser Gly Pro Tyr Val Ser 
                645                 650                 655     


Arg Ile Thr Asn Pro Phe Asn Gly Ile Ala Leu Tyr Ala Asn Gly Asp 
            660                 665                 670         


Thr Ala Arg Ala Thr Val Asn Phe Pro Ala Ser Arg Asn Tyr Asn Phe 
        675                 680                 685             


Arg Leu Arg Gly Cys Gly Asn Asn Asn Asn Leu Ala Arg Val Asp Leu 
    690                 695                 700                 


Arg Ile Asp Gly Arg Thr Val Gly Thr Phe Tyr Tyr Gln Gly Thr Tyr 
705                 710                 715                 720 


Pro Trp Glu Ala Pro Ile Asp Asn Val Tyr Val Ser Ala Gly Ser His 
                725                 730                 735     


Thr Val Glu Ile Thr Val Thr Ala Asp Asn Gly Thr Trp Asp Val Tyr 
            740                 745                 750         


Ala Asp Tyr Leu Val Ile Gln 
        755                 


<210>  5
<211>  2280
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, T134-180

<400>  5
caaacaagca ttactctgac atccaacgca tccggtacgt ttgacggtta ctattacgaa       60

ctctggaagg atactggcaa tacaacaatg acggtctaca ctcaaggtcg cttttcctgc      120

cagtggtcga acatcaataa cgcgttgttt aggaccggga agaaatacaa ccagaattgg      180

cagtctcttg gcacaatccg gatcacgtac tctgcgactt acaacccaaa cgggaactcc      240

tacttgtgta tctatggctg gtctaccaac ccattggtcg agttctacat cgttgagtcc      300

tgggggaact ggagaccgcc tggtgcctgc ctggccgagg gctcgctcgt cttggacgcg      360

gctaccgggc agagggtccc tatcgaaaag gtgcgtccgg ggatggaagt tttctccttg      420

ggacctgatt acagactgta tcgggtgccc gttttggagg tccttgagag cggggttagg      480

gaagttgtgc gcctcagaac tcggtcaggg agaacgctgg tgttgacacc agatcacccg      540

cttttgaccc ccgaaggttg gaaacctctt tgtgacctcc cgcttggaac tccaattgca      600

gtccccgcag aactgcctgt ggcgggccac ttggccccac ctgaagaacg tgttacgctc      660

ctggctcttc tgttggggga tgggaacaca aagctgtcgg gtcggagagg tacacgtcct      720

aatgccttct tctacagcaa agaccccgaa ttgctcgcgg cttatcgccg gtgtgcagaa      780

gccttgggtg caaaggtgaa agcatacgtc cacccgacta cgggggtggt tacactcgca      840

accctcgctc cacgtcctgg agctcaagat cctgtcaaac gcctcgttgt cgaggcggga      900

atggttgcta aagccgaaga gaagagggtc ccggaggagg tgtttcgtta ccggcgtgag      960

gcgttggccc ttttcttggg ccgtttgttc tcgacagacg gctctgttga aaagaagagg     1020

atctcttatt caagtgccag tttgggactg gcccaggatg tcgcacatct cttgctgcgc     1080

cttggaatta catctcaact ccgttcgaga gggccacggg ctcacgaggt tcttatatcg     1140

ggccgcgagg atattttgcg gtttgctgaa cttatcggac cctacctctt gggggccaag     1200

agggagagac ttgcagcgct ggaagctgag gcccgcaggc gtttgcctgg acagggatgg     1260

cacttgcggc ttgttcttcc tgccgtggcg tacagagtga gcgaggctaa aaggcgctcg     1320

ggattttcgt ggagtgaagc cggtcggcgc gtcgcagttg cgggatcgtg tttgtcatct     1380

ggactcaacc tcaaattgcc cagacgctac ctttctcggc accggttgtc gctgctcggt     1440

gaggcttttg ccgaccctgg gctggaagcg ctcgcggaag gccaagtgct ctgggaccct     1500

attgttgctg tcgaaccggc cggtaaggcg agaacattcg acttgcgcgt tccacccttt     1560

gcaaacttcg tgagcgagga cctggtggtg cataacacgt cccccatggg ccaagtgaca     1620

atcgatggcg ggacctacga catctatagg acgacacgcg tcaaccagcc ttccattgtg     1680

gggacagcca cgttcgatca gtactggagc gtgcgcacct ctaagcggac ttcaggaaca     1740

gtgaccgtga ccgatcactt ccgcgcctgg gcgaaccggg gcctgaacct cggcacaata     1800

gaccaaatta cattgtgcgt ggagggttac caaagctctg gatcagccaa catcacccag     1860

aacaccttct ctcagggctc ttcttccggc agttcgggtg gctcatccgg ctccacaacg     1920

actactcgca tcgagtgtga gaacatgtcc ttgtccggac cctacgttag caggatcacc     1980

aatcccttta atggtattgc gctgtacgcc aacggagaca cagcccgcgc taccgttaac     2040

ttccccgcaa gtcgcaacta caatttccgc ctgcggggtt gcggcaacaa caataatctt     2100

gcccgtgtgg acctgaggat cgacggacgg accgtcggga ccttttatta ccagggcaca     2160

tacccctggg aggccccaat tgacaatgtt tatgtcagtg cggggagtca tacagtcgaa     2220

atcactgtta ctgcggataa cggcacatgg gacgtgtatg ccgactacct ggtgatacag     2280


<210>  6
<211>  760
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, T134-180

<400>  6

Gln Thr Ser Ile Thr Leu Thr Ser Asn Ala Ser Gly Thr Phe Asp Gly 
1               5                   10                  15      


Tyr Tyr Tyr Glu Leu Trp Lys Asp Thr Gly Asn Thr Thr Met Thr Val 
            20                  25                  30          


Tyr Thr Gln Gly Arg Phe Ser Cys Gln Trp Ser Asn Ile Asn Asn Ala 
        35                  40                  45              


Leu Phe Arg Thr Gly Lys Lys Tyr Asn Gln Asn Trp Gln Ser Leu Gly 
    50                  55                  60                  


Thr Ile Arg Ile Thr Tyr Ser Ala Thr Tyr Asn Pro Asn Gly Asn Ser 
65                  70                  75                  80  


Tyr Leu Cys Ile Tyr Gly Trp Ser Thr Asn Pro Leu Val Glu Phe Tyr 
                85                  90                  95      


Ile Val Glu Ser Trp Gly Asn Trp Arg Pro Pro Gly Ala Cys Leu Ala 
            100                 105                 110         


Glu Gly Ser Leu Val Leu Asp Ala Ala Thr Gly Gln Arg Val Pro Ile 
        115                 120                 125             


Glu Lys Val Arg Pro Gly Met Glu Val Phe Ser Leu Gly Pro Asp Tyr 
    130                 135                 140                 


Arg Leu Tyr Arg Val Pro Val Leu Glu Val Leu Glu Ser Gly Val Arg 
145                 150                 155                 160 


Glu Val Val Arg Leu Arg Thr Arg Ser Gly Arg Thr Leu Val Leu Thr 
                165                 170                 175     


Pro Asp His Pro Leu Leu Thr Pro Glu Gly Trp Lys Pro Leu Cys Asp 
            180                 185                 190         


Leu Pro Leu Gly Thr Pro Ile Ala Val Pro Ala Glu Leu Pro Val Ala 
        195                 200                 205             


Gly His Leu Ala Pro Pro Glu Glu Arg Val Thr Leu Leu Ala Leu Leu 
    210                 215                 220                 


Leu Gly Asp Gly Asn Thr Lys Leu Ser Gly Arg Arg Gly Thr Arg Pro 
225                 230                 235                 240 


Asn Ala Phe Phe Tyr Ser Lys Asp Pro Glu Leu Leu Ala Ala Tyr Arg 
                245                 250                 255     


Arg Cys Ala Glu Ala Leu Gly Ala Lys Val Lys Ala Tyr Val His Pro 
            260                 265                 270         


Thr Thr Gly Val Val Thr Leu Ala Thr Leu Ala Pro Arg Pro Gly Ala 
        275                 280                 285             


Gln Asp Pro Val Lys Arg Leu Val Val Glu Ala Gly Met Val Ala Lys 
    290                 295                 300                 


Ala Glu Glu Lys Arg Val Pro Glu Glu Val Phe Arg Tyr Arg Arg Glu 
305                 310                 315                 320 


Ala Leu Ala Leu Phe Leu Gly Arg Leu Phe Ser Thr Asp Gly Ser Val 
                325                 330                 335     


Glu Lys Lys Arg Ile Ser Tyr Ser Ser Ala Ser Leu Gly Leu Ala Gln 
            340                 345                 350         


Asp Val Ala His Leu Leu Leu Arg Leu Gly Ile Thr Ser Gln Leu Arg 
        355                 360                 365             


Ser Arg Gly Pro Arg Ala His Glu Val Leu Ile Ser Gly Arg Glu Asp 
    370                 375                 380                 


Ile Leu Arg Phe Ala Glu Leu Ile Gly Pro Tyr Leu Leu Gly Ala Lys 
385                 390                 395                 400 


Arg Glu Arg Leu Ala Ala Leu Glu Ala Glu Ala Arg Arg Arg Leu Pro 
                405                 410                 415     


Gly Gln Gly Trp His Leu Arg Leu Val Leu Pro Ala Val Ala Tyr Arg 
            420                 425                 430         


Val Ser Glu Ala Lys Arg Arg Ser Gly Phe Ser Trp Ser Glu Ala Gly 
        435                 440                 445             


Arg Arg Val Ala Val Ala Gly Ser Cys Leu Ser Ser Gly Leu Asn Leu 
    450                 455                 460                 


Lys Leu Pro Arg Arg Tyr Leu Ser Arg His Arg Leu Ser Leu Leu Gly 
465                 470                 475                 480 


Glu Ala Phe Ala Asp Pro Gly Leu Glu Ala Leu Ala Glu Gly Gln Val 
                485                 490                 495     


Leu Trp Asp Pro Ile Val Ala Val Glu Pro Ala Gly Lys Ala Arg Thr 
            500                 505                 510         


Phe Asp Leu Arg Val Pro Pro Phe Ala Asn Phe Val Ser Glu Asp Leu 
        515                 520                 525             


Val Val His Asn Thr Ser Pro Met Gly Gln Val Thr Ile Asp Gly Gly 
    530                 535                 540                 


Thr Tyr Asp Ile Tyr Arg Thr Thr Arg Val Asn Gln Pro Ser Ile Val 
545                 550                 555                 560 


Gly Thr Ala Thr Phe Asp Gln Tyr Trp Ser Val Arg Thr Ser Lys Arg 
                565                 570                 575     


Thr Ser Gly Thr Val Thr Val Thr Asp His Phe Arg Ala Trp Ala Asn 
            580                 585                 590         


Arg Gly Leu Asn Leu Gly Thr Ile Asp Gln Ile Thr Leu Cys Val Glu 
        595                 600                 605             


Gly Tyr Gln Ser Ser Gly Ser Ala Asn Ile Thr Gln Asn Thr Phe Ser 
    610                 615                 620                 


Gln Gly Ser Ser Ser Gly Ser Ser Gly Gly Ser Ser Gly Ser Thr Thr 
625                 630                 635                 640 


Thr Thr Arg Ile Glu Cys Glu Asn Met Ser Leu Ser Gly Pro Tyr Val 
                645                 650                 655     


Ser Arg Ile Thr Asn Pro Phe Asn Gly Ile Ala Leu Tyr Ala Asn Gly 
            660                 665                 670         


Asp Thr Ala Arg Ala Thr Val Asn Phe Pro Ala Ser Arg Asn Tyr Asn 
        675                 680                 685             


Phe Arg Leu Arg Gly Cys Gly Asn Asn Asn Asn Leu Ala Arg Val Asp 
    690                 695                 700                 


Leu Arg Ile Asp Gly Arg Thr Val Gly Thr Phe Tyr Tyr Gln Gly Thr 
705                 710                 715                 720 


Tyr Pro Trp Glu Ala Pro Ile Asp Asn Val Tyr Val Ser Ala Gly Ser 
                725                 730                 735     


His Thr Val Glu Ile Thr Val Thr Ala Asp Asn Gly Thr Trp Asp Val 
            740                 745                 750         


Tyr Ala Asp Tyr Leu Val Ile Gln 
        755                 760 


<210>  7
<211>  2280
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, T134-100-165

<400>  7
caaacaagca ttactctgac atccaacgca tccggtacgt ttgacggtta ctattacgaa       60

ctctggaagg atactggcaa tacaacaatg acggtctaca ctcaaggtcg cttttcctgc      120

cagtggtcga acatcaataa cgcgttgttt aggaccggga agaaatacaa ccagaattgg      180

cagtctcttg gcacaatccg gatcacgtac tctgcgactt acaacccaaa cgggaactcc      240

tacttgtgta tctatggctg gtctaccaac ccattggtcg agttctacat cgttgagtcc      300

tgggggaact ggagaccgcc tggtgcctgc ctggccgagg gctcgctcgt cttggacgcg      360

gctaccgggc agagggtccc tatcgaaaag gtgcgtccgg ggatggaagt tttctccttg      420

ggacctgatt acagactgta tcgggtgccc gttttggagg tccttgagag cggggttagg      480

gaagttgtgc gcctcagaac tcggtcaggg agaacgctgg tgttgacacc agatcacccg      540

cttttgaccc ccgaaggttg gaaacctctt tgtgacctcc cgcttggaac tccaattgca      600

gtccccgcag aactgcctgt ggcgggccac ttggccccac ctgaagaacg tgttacgctc      660

ctggctcttc tgttggggga tgggaacaca aagctgtcgg gtcggagagg tacacgtcct      720

aatgccttct tctacagcaa aaaccccgaa ttgctcgcgg cttatcgccg gtgtgcagaa      780

gccttgggtg caaaggtgaa agcatacgtc cacccgacta cgggggtggt tacactcgca      840

accctcgctc cacgtcctgg agctcaagat cctgtcaaac gcctcgttgt cgaggcggga      900

atggttgcta aagccgaaga gaagagggtc ccggaggagg tgtttcgtta ccggcgtgag      960

gcgttggccc ttttcttggg ccgtttgttc tcgacagacg gctctgttga aaagaagagg     1020

atctcttatt caagtgccag tttgggactg gcccaggatg tcgcacatct cttgctgcgc     1080

cttggaatta catctcaact ccgttcgaga gggccacggg ctcacgaggt tcttatatcg     1140

ggccgcgagg atattttgcg gtttgctgaa cttatcggac cctacctctt gggggccaag     1200

agggagagac ttgcagcgct ggaagctgag gcccgcaggc gtttgcctgg acagggatgg     1260

cacttgcggc ttgttcttcc tgccgtggcg tacagagtga gcgaggctaa aaggcgctcg     1320

ggattttcgt ggagtgaagc cggtcggcgc gtcgcagttg cgggatcgtg tttgtcatct     1380

ggactcaacc tcaaattgcc cagacgctac ctttctcggc accggttgtc gctgctcggt     1440

gaggcttttg ccgaccctgg gctggaagcg ctcgcggaag gccaagtgct ctgggaccct     1500

attgttgctg tcgaaccggc cggtaaggcg agaacattcg acttgcgcgt tccacccttt     1560

gcaaacttcg tgagcgagga cctggtggtg cataacaccg tccccctggg ccaagtgaca     1620

atcgatggcg ggacctacga catctatagg acgacacgcg tcaaccagcc ttccattgtg     1680

gggacagcca cgttcgatca gtactggagc gtgcgcacct ctaagcggac ttcaggaaca     1740

gtgaccgtga ccgatcactt ccgcgcctgg gcgaaccggg gcctgaacct cggcacaata     1800

gaccaaatta cattgtgcgt ggagggttac caaagctctg gatcagccaa catcacccag     1860

aacaccttct ctcagggctc ttcttccggc agttcgggtg gctcatccgg ctccacaacg     1920

actactcgca tcgagtgtga gaacatgtcc ttgtccggac cctacgttag caggatcacc     1980

aatcccttta atggtattgc gctgtacgcc aacggagaca cagcccgcgc taccgttaac     2040

ttccccgcaa gtcgcaacta caatttccgc ctgcggggtt gcggcaacaa caataatctt     2100

gcccgtgtgg acctgaggat cgacggacgg accgtcggga ccttttatta ccagggcaca     2160

tacccctggg aggccccaat tgacaatgtt tatgtcagtg cggggagtca tacagtcgaa     2220

atcactgtta gtgcggataa cggcacatgg gacgtgtatg ccgactacct ggtgatacag     2280


<210>  8
<211>  760
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, T134-100-165

<400>  8

Gln Thr Ser Ile Thr Leu Thr Ser Asn Ala Ser Gly Thr Phe Asp Gly 
1               5                   10                  15      


Tyr Tyr Tyr Glu Leu Trp Lys Asp Thr Gly Asn Thr Thr Met Thr Val 
            20                  25                  30          


Tyr Thr Gln Gly Arg Phe Ser Cys Gln Trp Ser Asn Ile Asn Asn Ala 
        35                  40                  45              


Leu Phe Arg Thr Gly Lys Lys Tyr Asn Gln Asn Trp Gln Ser Leu Gly 
    50                  55                  60                  


Thr Ile Arg Ile Thr Tyr Ser Ala Thr Tyr Asn Pro Asn Gly Asn Ser 
65                  70                  75                  80  


Tyr Leu Cys Ile Tyr Gly Trp Ser Thr Asn Pro Leu Val Glu Phe Tyr 
                85                  90                  95      


Ile Val Glu Ser Trp Gly Asn Trp Arg Pro Pro Gly Ala Cys Leu Ala 
            100                 105                 110         


Glu Gly Ser Leu Val Leu Asp Ala Ala Thr Gly Gln Arg Val Pro Ile 
        115                 120                 125             


Glu Lys Val Arg Pro Gly Met Glu Val Phe Ser Leu Gly Pro Asp Tyr 
    130                 135                 140                 


Arg Leu Tyr Arg Val Pro Val Leu Glu Val Leu Glu Ser Gly Val Arg 
145                 150                 155                 160 


Glu Val Val Arg Leu Arg Thr Arg Ser Gly Arg Thr Leu Val Leu Thr 
                165                 170                 175     


Pro Asp His Pro Leu Leu Thr Pro Glu Gly Trp Lys Pro Leu Cys Asp 
            180                 185                 190         


Leu Pro Leu Gly Thr Pro Ile Ala Val Pro Ala Glu Leu Pro Val Ala 
        195                 200                 205             


Gly His Leu Ala Pro Pro Glu Glu Arg Val Thr Leu Leu Ala Leu Leu 
    210                 215                 220                 


Leu Gly Asp Gly Asn Thr Lys Leu Ser Gly Arg Arg Gly Thr Arg Pro 
225                 230                 235                 240 


Asn Ala Phe Phe Tyr Ser Lys Asn Pro Glu Leu Leu Ala Ala Tyr Arg 
                245                 250                 255     


Arg Cys Ala Glu Ala Leu Gly Ala Lys Val Lys Ala Tyr Val His Pro 
            260                 265                 270         


Thr Thr Gly Val Val Thr Leu Ala Thr Leu Ala Pro Arg Pro Gly Ala 
        275                 280                 285             


Gln Asp Pro Val Lys Arg Leu Val Val Glu Ala Gly Met Val Ala Lys 
    290                 295                 300                 


Ala Glu Glu Lys Arg Val Pro Glu Glu Val Phe Arg Tyr Arg Arg Glu 
305                 310                 315                 320 


Ala Leu Ala Leu Phe Leu Gly Arg Leu Phe Ser Thr Asp Gly Ser Val 
                325                 330                 335     


Glu Lys Lys Arg Ile Ser Tyr Ser Ser Ala Ser Leu Gly Leu Ala Gln 
            340                 345                 350         


Asp Val Ala His Leu Leu Leu Arg Leu Gly Ile Thr Ser Gln Leu Arg 
        355                 360                 365             


Ser Arg Gly Pro Arg Ala His Glu Val Leu Ile Ser Gly Arg Glu Asp 
    370                 375                 380                 


Ile Leu Arg Phe Ala Glu Leu Ile Gly Pro Tyr Leu Leu Gly Ala Lys 
385                 390                 395                 400 


Arg Glu Arg Leu Ala Ala Leu Glu Ala Glu Ala Arg Arg Arg Leu Pro 
                405                 410                 415     


Gly Gln Gly Trp His Leu Arg Leu Val Leu Pro Ala Val Ala Tyr Arg 
            420                 425                 430         


Val Ser Glu Ala Lys Arg Arg Ser Gly Phe Ser Trp Ser Glu Ala Gly 
        435                 440                 445             


Arg Arg Val Ala Val Ala Gly Ser Cys Leu Ser Ser Gly Leu Asn Leu 
    450                 455                 460                 


Lys Leu Pro Arg Arg Tyr Leu Ser Arg His Arg Leu Ser Leu Leu Gly 
465                 470                 475                 480 


Glu Ala Phe Ala Asp Pro Gly Leu Glu Ala Leu Ala Glu Gly Gln Val 
                485                 490                 495     


Leu Trp Asp Pro Ile Val Ala Val Glu Pro Ala Gly Lys Ala Arg Thr 
            500                 505                 510         


Phe Asp Leu Arg Val Pro Pro Phe Ala Asn Phe Val Ser Glu Asp Leu 
        515                 520                 525             


Val Val His Asn Thr Val Pro Leu Gly Gln Val Thr Ile Asp Gly Gly 
    530                 535                 540                 


Thr Tyr Asp Ile Tyr Arg Thr Thr Arg Val Asn Gln Pro Ser Ile Val 
545                 550                 555                 560 


Gly Thr Ala Thr Phe Asp Gln Tyr Trp Ser Val Arg Thr Ser Lys Arg 
                565                 570                 575     


Thr Ser Gly Thr Val Thr Val Thr Asp His Phe Arg Ala Trp Ala Asn 
            580                 585                 590         


Arg Gly Leu Asn Leu Gly Thr Ile Asp Gln Ile Thr Leu Cys Val Glu 
        595                 600                 605             


Gly Tyr Gln Ser Ser Gly Ser Ala Asn Ile Thr Gln Asn Thr Phe Ser 
    610                 615                 620                 


Gln Gly Ser Ser Ser Gly Ser Ser Gly Gly Ser Ser Gly Ser Thr Thr 
625                 630                 635                 640 


Thr Thr Arg Ile Glu Cys Glu Asn Met Ser Leu Ser Gly Pro Tyr Val 
                645                 650                 655     


Ser Arg Ile Thr Asn Pro Phe Asn Gly Ile Ala Leu Tyr Ala Asn Gly 
            660                 665                 670         


Asp Thr Ala Arg Ala Thr Val Asn Phe Pro Ala Ser Arg Asn Tyr Asn 
        675                 680                 685             


Phe Arg Leu Arg Gly Cys Gly Asn Asn Asn Asn Leu Ala Arg Val Asp 
    690                 695                 700                 


Leu Arg Ile Asp Gly Arg Thr Val Gly Thr Phe Tyr Tyr Gln Gly Thr 
705                 710                 715                 720 


Tyr Pro Trp Glu Ala Pro Ile Asp Asn Val Tyr Val Ser Ala Gly Ser 
                725                 730                 735     


His Thr Val Glu Ile Thr Val Ser Ala Asp Asn Gly Thr Trp Asp Val 
            740                 745                 750         


Tyr Ala Asp Tyr Leu Val Ile Gln 
        755                 760 


<210>  9
<211>  2280
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, T134-10068

<400>  9
caaacaagca ttactctgac atccaacgca tccggtacgt ttgacggtta ctattacgaa       60

ctctggaagg atactggcaa tacaacaatg acggtctaca ctcaaggtcg cttttcctgc      120

cagtggtcga acatcaataa cgcgttgttt aggaccggga agaaatacaa ccagaattgg      180

cagtctcttg gcacaatccg gatcacgtac tctgcgactt acaacccaaa cgggaactcc      240

tacttgtgta tctatggctg gtctaccaac ccattggtcg agttctacat cgttgagtcc      300

tgggggaact ggagaccgcc tggtgcctgc ctggccgagg gctcgctcgt cttggacgcg      360

gctaccgggc agagggtccc tatcgaaaag gtgcgtccgg ggatggaagt tttctccctg      420

ggacctgatt acagactgta tcgggtgccc gttttggagg tccttgagag cggggttagg      480

gaagttgtgc gcctcagaac tcggtcagag agaacgctgg tgttgacacc agatcacccg      540

cttttgaccc ccgaaggttg gaaacctctt tgtgacctcc cgcttggaac tccaattgca      600

gtccccgcag aactgcctgt ggcgggccac ttggccccac ctgaagaacg tgttacgctc      660

ctggctcttc tgttggggga tgggaacaca aagctgtcgg gtcggagagg tacacgtcct      720

aatgccttct tccacagcaa agaccccgaa ttgctcgcgg cttatcgccg gtgtgcagaa      780

gccttgggtg caaaggtgaa agcatacgtc cacccgacta cgggggtggt tacactcgca      840

accctcgccc cacgtcctgg agctcaagat cctgtcaaac gcctcgttgt cgaggcggga      900

atggttgcta aagccgaaga gaagagggtc ccggaggagg tgtttcgtta ccggcgtgag      960

gcgttggccc ttttcttggg ccgtttgttc tcgacagacg gctctgttga aaagaagagg     1020

atctcttatt caagtgccag tttggggctg gcccaggatg tcgcacatct cttgctgcgc     1080

cttggaatta catctcaact ccgttcgaga gggccacggg ctcacgaggt tcttatatcg     1140

ggccgcgagg atattttgcg gtttgctgaa cttatcggac cctacctctt gggggccaag     1200

agggagagac ttgcagcgct ggaagctgag gcccgcaggc gtttgcctgt acagggatgg     1260

cactcgcggc ttgttcttcc tgccgtggcg tacagagtga gcgaggctaa aaggcgctcg     1320

ggattttcgt ggagtgaagc cggtcggcgc gtcgcagttg cgggatcgtg tttgtcatct     1380

ggactcaacc tcaaattgcc cagacgctac ctttctcggc accggttgtc gctgctcggt     1440

gaggcttttg ccgaccctgg gctggaagcg ctcgcggaag gccaagtgct ctgggaccct     1500

attgttgctg tcgaaccggc cggtaaggcg agaacattcg acttgcgcgt tccacccttt     1560

gcaaacttcg tgagcgagga cctggtggtg cataacaccg tccccctggg ccaagtgaca     1620

atcgatggcg ggacctacga catctatagg acgacacgcg tcaaccagcc ttccattgtg     1680

gggacagcca cgttcgatca gtactggagc gtgcgcacct ctaagcggac ttcaggaaca     1740

gtgaccgtga ccgatcactt ccgcgcctgg gcgaaccggg gcctgaacct cggcacaata     1800

gaccaaatta cattgtgcgt ggagggttac caaagctctg gatcagccaa catcacccag     1860

aacaccttct ctcagggctc ttcttccggc agttcgggtg gctcatccgg ctccacaacg     1920

actactcgca tcgagtgtga gaacatgtcc ttgtccggac cctacgttag caggatcacc     1980

aatcccttta atggtattgc gctgtacgcc aacggagaca cagcccgcgc taccgttaac     2040

ttccccgcaa gtcgcaacta caatttccgc ctgcggggtt gcggcaacaa caataatctt     2100

gcccgtgtgg acctgaggat cgacggacgg accgtcggga ccttttatta ccagggcaca     2160

tacccctggg aggccccaat tgacaatgtt tatgtcagtg cggggagtca tacagtcgaa     2220

atcactgtta ctgcggataa cggcacatgg gacgtgtatg ccgactacct ggtgatacag     2280


<210>  10
<211>  760
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, T134-10068

<400>  10

Gln Thr Ser Ile Thr Leu Thr Ser Asn Ala Ser Gly Thr Phe Asp Gly 
1               5                   10                  15      


Tyr Tyr Tyr Glu Leu Trp Lys Asp Thr Gly Asn Thr Thr Met Thr Val 
            20                  25                  30          


Tyr Thr Gln Gly Arg Phe Ser Cys Gln Trp Ser Asn Ile Asn Asn Ala 
        35                  40                  45              


Leu Phe Arg Thr Gly Lys Lys Tyr Asn Gln Asn Trp Gln Ser Leu Gly 
    50                  55                  60                  


Thr Ile Arg Ile Thr Tyr Ser Ala Thr Tyr Asn Pro Asn Gly Asn Ser 
65                  70                  75                  80  


Tyr Leu Cys Ile Tyr Gly Trp Ser Thr Asn Pro Leu Val Glu Phe Tyr 
                85                  90                  95      


Ile Val Glu Ser Trp Gly Asn Trp Arg Pro Pro Gly Ala Cys Leu Ala 
            100                 105                 110         


Glu Gly Ser Leu Val Leu Asp Ala Ala Thr Gly Gln Arg Val Pro Ile 
        115                 120                 125             


Glu Lys Val Arg Pro Gly Met Glu Val Phe Ser Leu Gly Pro Asp Tyr 
    130                 135                 140                 


Arg Leu Tyr Arg Val Pro Val Leu Glu Val Leu Glu Ser Gly Val Arg 
145                 150                 155                 160 


Glu Val Val Arg Leu Arg Thr Arg Ser Glu Arg Thr Leu Val Leu Thr 
                165                 170                 175     


Pro Asp His Pro Leu Leu Thr Pro Glu Gly Trp Lys Pro Leu Cys Asp 
            180                 185                 190         


Leu Pro Leu Gly Thr Pro Ile Ala Val Pro Ala Glu Leu Pro Val Ala 
        195                 200                 205             


Gly His Leu Ala Pro Pro Glu Glu Arg Val Thr Leu Leu Ala Leu Leu 
    210                 215                 220                 


Leu Gly Asp Gly Asn Thr Lys Leu Ser Gly Arg Arg Gly Thr Arg Pro 
225                 230                 235                 240 


Asn Ala Phe Phe His Ser Lys Asp Pro Glu Leu Leu Ala Ala Tyr Arg 
                245                 250                 255     


Arg Cys Ala Glu Ala Leu Gly Ala Lys Val Lys Ala Tyr Val His Pro 
            260                 265                 270         


Thr Thr Gly Val Val Thr Leu Ala Thr Leu Ala Pro Arg Pro Gly Ala 
        275                 280                 285             


Gln Asp Pro Val Lys Arg Leu Val Val Glu Ala Gly Met Val Ala Lys 
    290                 295                 300                 


Ala Glu Glu Lys Arg Val Pro Glu Glu Val Phe Arg Tyr Arg Arg Glu 
305                 310                 315                 320 


Ala Leu Ala Leu Phe Leu Gly Arg Leu Phe Ser Thr Asp Gly Ser Val 
                325                 330                 335     


Glu Lys Lys Arg Ile Ser Tyr Ser Ser Ala Ser Leu Gly Leu Ala Gln 
            340                 345                 350         


Asp Val Ala His Leu Leu Leu Arg Leu Gly Ile Thr Ser Gln Leu Arg 
        355                 360                 365             


Ser Arg Gly Pro Arg Ala His Glu Val Leu Ile Ser Gly Arg Glu Asp 
    370                 375                 380                 


Ile Leu Arg Phe Ala Glu Leu Ile Gly Pro Tyr Leu Leu Gly Ala Lys 
385                 390                 395                 400 


Arg Glu Arg Leu Ala Ala Leu Glu Ala Glu Ala Arg Arg Arg Leu Pro 
                405                 410                 415     


Val Gln Gly Trp His Ser Arg Leu Val Leu Pro Ala Val Ala Tyr Arg 
            420                 425                 430         


Val Ser Glu Ala Lys Arg Arg Ser Gly Phe Ser Trp Ser Glu Ala Gly 
        435                 440                 445             


Arg Arg Val Ala Val Ala Gly Ser Cys Leu Ser Ser Gly Leu Asn Leu 
    450                 455                 460                 


Lys Leu Pro Arg Arg Tyr Leu Ser Arg His Arg Leu Ser Leu Leu Gly 
465                 470                 475                 480 


Glu Ala Phe Ala Asp Pro Gly Leu Glu Ala Leu Ala Glu Gly Gln Val 
                485                 490                 495     


Leu Trp Asp Pro Ile Val Ala Val Glu Pro Ala Gly Lys Ala Arg Thr 
            500                 505                 510         


Phe Asp Leu Arg Val Pro Pro Phe Ala Asn Phe Val Ser Glu Asp Leu 
        515                 520                 525             


Val Val His Asn Thr Val Pro Leu Gly Gln Val Thr Ile Asp Gly Gly 
    530                 535                 540                 


Thr Tyr Asp Ile Tyr Arg Thr Thr Arg Val Asn Gln Pro Ser Ile Val 
545                 550                 555                 560 


Gly Thr Ala Thr Phe Asp Gln Tyr Trp Ser Val Arg Thr Ser Lys Arg 
                565                 570                 575     


Thr Ser Gly Thr Val Thr Val Thr Asp His Phe Arg Ala Trp Ala Asn 
            580                 585                 590         


Arg Gly Leu Asn Leu Gly Thr Ile Asp Gln Ile Thr Leu Cys Val Glu 
        595                 600                 605             


Gly Tyr Gln Ser Ser Gly Ser Ala Asn Ile Thr Gln Asn Thr Phe Ser 
    610                 615                 620                 


Gln Gly Ser Ser Ser Gly Ser Ser Gly Gly Ser Ser Gly Ser Thr Thr 
625                 630                 635                 640 


Thr Thr Arg Ile Glu Cys Glu Asn Met Ser Leu Ser Gly Pro Tyr Val 
                645                 650                 655     


Ser Arg Ile Thr Asn Pro Phe Asn Gly Ile Ala Leu Tyr Ala Asn Gly 
            660                 665                 670         


Asp Thr Ala Arg Ala Thr Val Asn Phe Pro Ala Ser Arg Asn Tyr Asn 
        675                 680                 685             


Phe Arg Leu Arg Gly Cys Gly Asn Asn Asn Asn Leu Ala Arg Val Asp 
    690                 695                 700                 


Leu Arg Ile Asp Gly Arg Thr Val Gly Thr Phe Tyr Tyr Gln Gly Thr 
705                 710                 715                 720 


Tyr Pro Trp Glu Ala Pro Ile Asp Asn Val Tyr Val Ser Ala Gly Ser 
                725                 730                 735     


His Thr Val Glu Ile Thr Val Thr Ala Asp Asn Gly Thr Trp Asp Val 
            740                 745                 750         


Tyr Ala Asp Tyr Leu Val Ile Gln 
        755                 760 


<210>  11
<211>  2280
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, T134-10039

<400>  11
caaacaagca ttactctgac atccaacgca tccggtacgt ttgacggtta ctattacgaa       60

ctctggaagg atactggcaa tacaacaatg acggtctaca ctcaaggtcg cttttcctgc      120

cagtggtcga acatcaataa cgcgttgttt aggaccggga agaaatacaa ccagaattgg      180

cagtctcttg gcacaatccg gatcacgtac tctgcgactt acaacccaaa cgggaactcc      240

tacttgtgta tctatggctg gtctaccaac ccattggtcg agttctacat cgttgagtcc      300

tgggggaact ggagaccgcc tggtgcctgc ctggccgagg gctcgctcgt cttggacgcg      360

gctaccgggc agagggtccc tatcgaaaag gtgcgtccgg ggatggaagt tttctccttg      420

ggacctgatt acagactgta tcgggtgccc gttttggagg tccttgagag cggggttagg      480

gaagttgtgc gcctcagaac tcggtcaggg agaacgctgg tgttgacacc agatcacccg      540

cttttgaccc ccgaaggttg gaaacctctt tgtgacctcc cgcttggaac tccaattgca      600

gtccccgcag aactgcctgt ggcgggccac ttggccccac ctgaagaacg tgttacgctc      660

ctggctcttc tgttggggga tgggaacaca aagctgtcgg gtcggagagg tacacgtcct      720

aatgccttct tctacagcaa agaccccgaa ttgctcgcgg cttatcgccg gtgtgcagaa      780

gccttgggtg caaaggtgaa agcatacgtc cacccgacta cgggggtggt tacactcgca      840

accctcgctc cacgtcctgg agctcaagat cctgtcaaac gcctcgttgt cgaggcggga      900

atggttgcta aagccgaaga gaagagggtc ccggaggagg tgtttcgtta ccggcgtgag      960

gcgttggccc ttttcttggg ccgtttgttc tcgacagacg gctctgttga aaagaagagg     1020

atctcttatt caagtgccag tttgggactg gcccaggatg tcgcacatct cttgctgcgc     1080

cttggaatta catctcaact ccgttcgaga gggccacggg ctcacgaggt tcttatatcg     1140

ggccgcgagg atattttgcg gtttgctgaa cttatcggac cctacctctt gggggccaag     1200

agggagagac ttgcagcgct ggaagctgag gcccgcaggc gtttgcctgg acagggatgg     1260

cacttgcggc ttgttcttcc tgccgtggcg tacagagtga gcgaggctaa aaggcgctcg     1320

ggattttcgt ggagtgaagc cggtcggcgc gtcgcagttg cgggatcgtg tttgtcatct     1380

ggactcaacc tcaaattgcc cagacgctac ctttctcggc accggttgtc gctgctcggt     1440

gaggcttttg ccgaccctgg gctggaagcg ctcgcggaag gcctagtgct ctgggaccct     1500

attgttgctg tcgaaccggc cggtaaggcg agaacattcg acttgcgcgt tccacccttt     1560

gcaaacttcg tgagcgagga cctggtggtg cataacaccg tccccctggg ccaagtgaca     1620

atcgatggcg ggacctacga catctatagg acgacacgcg tcaaccagcc ttccattgtg     1680

gggacagcca cgttcgatca gtactggagc gtgcgcacct ctaagcggac ttcaggaaca     1740

gtgaccgtga ccgatcactt ccgcgcctgg gcgaaccggg gcctgaacct cggcacaata     1800

gaccaaatta cattgtgcgt ggagggttac caaagctctg gatcagccaa catcacccag     1860

aacaccttct ctcagggctc ttcttccggc agttcgggtg gctcatccgg ctccacaacg     1920

actactcgca tcgagtgtga gaacatgtcc ttgtccggac cctacgttag caggatcacc     1980

aatcccttta atggtattgc gctgtacgcc aacggagaca cagcccgcgc taccgttaac     2040

ttccccgcaa gtcgcaacta caatttccgc ctgcggggtt gcggcaacaa caataatctt     2100

gcccgtgtgg acctgaggat cgacggacgg accgtcggga ccttttatta ccagggcaca     2160

tacccctggg aggccccaat tgacaatgtt tatgtcagtg cggggagtca tacagtcgaa     2220

atcactgtta ctgcggataa cggcacatgg gacgtgtatg ccgactacct ggtgatacag     2280


<210>  12
<211>  760
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, T134-10039

<400>  12

Gln Thr Ser Ile Thr Leu Thr Ser Asn Ala Ser Gly Thr Phe Asp Gly 
1               5                   10                  15      


Tyr Tyr Tyr Glu Leu Trp Lys Asp Thr Gly Asn Thr Thr Met Thr Val 
            20                  25                  30          


Tyr Thr Gln Gly Arg Phe Ser Cys Gln Trp Ser Asn Ile Asn Asn Ala 
        35                  40                  45              


Leu Phe Arg Thr Gly Lys Lys Tyr Asn Gln Asn Trp Gln Ser Leu Gly 
    50                  55                  60                  


Thr Ile Arg Ile Thr Tyr Ser Ala Thr Tyr Asn Pro Asn Gly Asn Ser 
65                  70                  75                  80  


Tyr Leu Cys Ile Tyr Gly Trp Ser Thr Asn Pro Leu Val Glu Phe Tyr 
                85                  90                  95      


Ile Val Glu Ser Trp Gly Asn Trp Arg Pro Pro Gly Ala Cys Leu Ala 
            100                 105                 110         


Glu Gly Ser Leu Val Leu Asp Ala Ala Thr Gly Gln Arg Val Pro Ile 
        115                 120                 125             


Glu Lys Val Arg Pro Gly Met Glu Val Phe Ser Leu Gly Pro Asp Tyr 
    130                 135                 140                 


Arg Leu Tyr Arg Val Pro Val Leu Glu Val Leu Glu Ser Gly Val Arg 
145                 150                 155                 160 


Glu Val Val Arg Leu Arg Thr Arg Ser Gly Arg Thr Leu Val Leu Thr 
                165                 170                 175     


Pro Asp His Pro Leu Leu Thr Pro Glu Gly Trp Lys Pro Leu Cys Asp 
            180                 185                 190         


Leu Pro Leu Gly Thr Pro Ile Ala Val Pro Ala Glu Leu Pro Val Ala 
        195                 200                 205             


Gly His Leu Ala Pro Pro Glu Glu Arg Val Thr Leu Leu Ala Leu Leu 
    210                 215                 220                 


Leu Gly Asp Gly Asn Thr Lys Leu Ser Gly Arg Arg Gly Thr Arg Pro 
225                 230                 235                 240 


Asn Ala Phe Phe Tyr Ser Lys Asp Pro Glu Leu Leu Ala Ala Tyr Arg 
                245                 250                 255     


Arg Cys Ala Glu Ala Leu Gly Ala Lys Val Lys Ala Tyr Val His Pro 
            260                 265                 270         


Thr Thr Gly Val Val Thr Leu Ala Thr Leu Ala Pro Arg Pro Gly Ala 
        275                 280                 285             


Gln Asp Pro Val Lys Arg Leu Val Val Glu Ala Gly Met Val Ala Lys 
    290                 295                 300                 


Ala Glu Glu Lys Arg Val Pro Glu Glu Val Phe Arg Tyr Arg Arg Glu 
305                 310                 315                 320 


Ala Leu Ala Leu Phe Leu Gly Arg Leu Phe Ser Thr Asp Gly Ser Val 
                325                 330                 335     


Glu Lys Lys Arg Ile Ser Tyr Ser Ser Ala Ser Leu Gly Leu Ala Gln 
            340                 345                 350         


Asp Val Ala His Leu Leu Leu Arg Leu Gly Ile Thr Ser Gln Leu Arg 
        355                 360                 365             


Ser Arg Gly Pro Arg Ala His Glu Val Leu Ile Ser Gly Arg Glu Asp 
    370                 375                 380                 


Ile Leu Arg Phe Ala Glu Leu Ile Gly Pro Tyr Leu Leu Gly Ala Lys 
385                 390                 395                 400 


Arg Glu Arg Leu Ala Ala Leu Glu Ala Glu Ala Arg Arg Arg Leu Pro 
                405                 410                 415     


Gly Gln Gly Trp His Leu Arg Leu Val Leu Pro Ala Val Ala Tyr Arg 
            420                 425                 430         


Val Ser Glu Ala Lys Arg Arg Ser Gly Phe Ser Trp Ser Glu Ala Gly 
        435                 440                 445             


Arg Arg Val Ala Val Ala Gly Ser Cys Leu Ser Ser Gly Leu Asn Leu 
    450                 455                 460                 


Lys Leu Pro Arg Arg Tyr Leu Ser Arg His Arg Leu Ser Leu Leu Gly 
465                 470                 475                 480 


Glu Ala Phe Ala Asp Pro Gly Leu Glu Ala Leu Ala Glu Gly Leu Val 
                485                 490                 495     


Leu Trp Asp Pro Ile Val Ala Val Glu Pro Ala Gly Lys Ala Arg Thr 
            500                 505                 510         


Phe Asp Leu Arg Val Pro Pro Phe Ala Asn Phe Val Ser Glu Asp Leu 
        515                 520                 525             


Val Val His Asn Thr Val Pro Leu Gly Gln Val Thr Ile Asp Gly Gly 
    530                 535                 540                 


Thr Tyr Asp Ile Tyr Arg Thr Thr Arg Val Asn Gln Pro Ser Ile Val 
545                 550                 555                 560 


Gly Thr Ala Thr Phe Asp Gln Tyr Trp Ser Val Arg Thr Ser Lys Arg 
                565                 570                 575     


Thr Ser Gly Thr Val Thr Val Thr Asp His Phe Arg Ala Trp Ala Asn 
            580                 585                 590         


Arg Gly Leu Asn Leu Gly Thr Ile Asp Gln Ile Thr Leu Cys Val Glu 
        595                 600                 605             


Gly Tyr Gln Ser Ser Gly Ser Ala Asn Ile Thr Gln Asn Thr Phe Ser 
    610                 615                 620                 


Gln Gly Ser Ser Ser Gly Ser Ser Gly Gly Ser Ser Gly Ser Thr Thr 
625                 630                 635                 640 


Thr Thr Arg Ile Glu Cys Glu Asn Met Ser Leu Ser Gly Pro Tyr Val 
                645                 650                 655     


Ser Arg Ile Thr Asn Pro Phe Asn Gly Ile Ala Leu Tyr Ala Asn Gly 
            660                 665                 670         


Asp Thr Ala Arg Ala Thr Val Asn Phe Pro Ala Ser Arg Asn Tyr Asn 
        675                 680                 685             


Phe Arg Leu Arg Gly Cys Gly Asn Asn Asn Asn Leu Ala Arg Val Asp 
    690                 695                 700                 


Leu Arg Ile Asp Gly Arg Thr Val Gly Thr Phe Tyr Tyr Gln Gly Thr 
705                 710                 715                 720 


Tyr Pro Trp Glu Ala Pro Ile Asp Asn Val Tyr Val Ser Ala Gly Ser 
                725                 730                 735     


His Thr Val Glu Ile Thr Val Thr Ala Asp Asn Gly Thr Trp Asp Val 
            740                 745                 750         


Tyr Ala Asp Tyr Leu Val Ile Gln 
        755                 760 


<210>  13
<211>  2280
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, T134-100

<400>  13
caaacaagca ttactctgac atccaacgca tccggtacgt ttgacggtta ctattacgaa       60

ctctggaagg atactggcaa tacaacaatg acggtctaca ctcaaggtcg cttttcctgc      120

cagtggtcga acatcaataa cgcgttgttt aggaccggga agaaatacaa ccagaattgg      180

cagtctcttg gcacaatccg gatcacgtac tctgcgactt acaacccaaa cgggaactcc      240

tacttgtgta tctatggctg gtctaccaac ccattggtcg agttctacat cgttgagtcc      300

tgggggaact ggagaccgcc tggtgcctgc ctggccgagg gctcgctcgt cttggacgcg      360

gctaccgggc agagggtccc tatcgaaaag gtgcgtccgg ggatggaagt tttctccttg      420

ggacctgatt acagactgta tcgggtgccc gttttggagg tccttgagag cggggttagg      480

gaagttgtgc gcctcagaac tcggtcaggg agaacgctgg tgttgacacc agatcacccg      540

cttttgaccc ccgaaggttg gaaacctctt tgtgacctcc cgcttggaac tccaattgca      600

gtccccgcag aactgcctgt ggcgggccac ttggccccac ctgaagaacg tgttacgctc      660

ctggctcttc tgttggggga tgggaacaca aagctgtcgg gtcggagagg tacacgtcct      720

aatgccttct tctacagcaa agaccccgaa ttgctcgcgg cttatcgccg gtgtgcagaa      780

gccttgggtg caaaggtgaa agcatacgtc cacccgacta cgggggtggt tacactcgca      840

accctcgctc cacgtcctgg agctcaagat cctgtcaaac gcctcgttgt cgaggcggga      900

atggttgcta aagccgaaga gaagagggtc ccggaggagg tgtttcgtta ccggcgtgag      960

gcgttggccc ttttcttggg ccgtttgttc tcgacagacg gctctgttga aaagaagagg     1020

atctcttatt caagtgccag tttgggactg gcccaggatg tcgcacatct cttgctgcgc     1080

cttggaatta catctcaact ccgttcgaga gggccacggg ctcacgaggt tcttatatcg     1140

ggccgcgagg atattttgcg gtttgctgaa cttatcggac cctacctctt gggggccaag     1200

agggagagac ttgcagcgct ggaagctgag gcccgcaggc gtttgcctgg acagggatgg     1260

cacttgcggc ttgttcttcc tgccgtggcg tacagagtga gcgaggctaa aaggcgctcg     1320

ggattttcgt ggagtgaagc cggtcggcgc gtcgcagttg cgggatcgtg tttgtcatct     1380

ggactcaacc tcaaattgcc cagacgctac ctttctcggc accggttgtc gctgctcggt     1440

gaggcttttg ccgaccctgg gctggaagcg ctcgcggaag gccaagtgct ctgggaccct     1500

attgttgctg tcgaaccggc cggtaaggcg agaacattcg acttgcgcgt tccacccttt     1560

gcaaacttcg tgagcgagga cctggtggtg cataacaccg tccccctggg ccaagtgaca     1620

atcgatggcg ggacctacga catctatagg acgacacgcg tcaaccagcc ttccattgtg     1680

gggacagcca cgttcgatca gtactggagc gtgcgcacct ctaagcggac ttcaggaaca     1740

gtgaccgtga ccgatcactt ccgcgcctgg gcgaaccggg gcctgaacct cggcacaata     1800

gaccaaatta cattgtgcgt ggagggttac caaagctctg gatcagccaa catcacccag     1860

aacaccttct ctcagggctc ttcttccggc agttcgggtg gctcatccgg ctccacaacg     1920

actactcgca tcgagtgtga gaacatgtcc ttgtccggac cctacgttag caggatcacc     1980

aatcccttta atggtattgc gctgtacgcc aacggagaca cagcccgcgc taccgttaac     2040

ttccccgcaa gtcgcaacta caatttccgc ctgcggggtt gcggcaacaa caataatctt     2100

gcccgtgtgg acctgaggat cgacggacgg accgtcggga ccttttatta ccagggcaca     2160

tacccctggg aggccccaat tgacaatgtt tatgtcagtg cggggagtca tacagtcgaa     2220

atcactgtta ctgcggataa cggcacatgg gacgtgtatg ccgactacct ggtgatacag     2280


<210>  14
<211>  760
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, T134-100

<400>  14

Gln Thr Ser Ile Thr Leu Thr Ser Asn Ala Ser Gly Thr Phe Asp Gly 
1               5                   10                  15      


Tyr Tyr Tyr Glu Leu Trp Lys Asp Thr Gly Asn Thr Thr Met Thr Val 
            20                  25                  30          


Tyr Thr Gln Gly Arg Phe Ser Cys Gln Trp Ser Asn Ile Asn Asn Ala 
        35                  40                  45              


Leu Phe Arg Thr Gly Lys Lys Tyr Asn Gln Asn Trp Gln Ser Leu Gly 
    50                  55                  60                  


Thr Ile Arg Ile Thr Tyr Ser Ala Thr Tyr Asn Pro Asn Gly Asn Ser 
65                  70                  75                  80  


Tyr Leu Cys Ile Tyr Gly Trp Ser Thr Asn Pro Leu Val Glu Phe Tyr 
                85                  90                  95      


Ile Val Glu Ser Trp Gly Asn Trp Arg Pro Pro Gly Ala Cys Leu Ala 
            100                 105                 110         


Glu Gly Ser Leu Val Leu Asp Ala Ala Thr Gly Gln Arg Val Pro Ile 
        115                 120                 125             


Glu Lys Val Arg Pro Gly Met Glu Val Phe Ser Leu Gly Pro Asp Tyr 
    130                 135                 140                 


Arg Leu Tyr Arg Val Pro Val Leu Glu Val Leu Glu Ser Gly Val Arg 
145                 150                 155                 160 


Glu Val Val Arg Leu Arg Thr Arg Ser Gly Arg Thr Leu Val Leu Thr 
                165                 170                 175     


Pro Asp His Pro Leu Leu Thr Pro Glu Gly Trp Lys Pro Leu Cys Asp 
            180                 185                 190         


Leu Pro Leu Gly Thr Pro Ile Ala Val Pro Ala Glu Leu Pro Val Ala 
        195                 200                 205             


Gly His Leu Ala Pro Pro Glu Glu Arg Val Thr Leu Leu Ala Leu Leu 
    210                 215                 220                 


Leu Gly Asp Gly Asn Thr Lys Leu Ser Gly Arg Arg Gly Thr Arg Pro 
225                 230                 235                 240 


Asn Ala Phe Phe Tyr Ser Lys Asp Pro Glu Leu Leu Ala Ala Tyr Arg 
                245                 250                 255     


Arg Cys Ala Glu Ala Leu Gly Ala Lys Val Lys Ala Tyr Val His Pro 
            260                 265                 270         


Thr Thr Gly Val Val Thr Leu Ala Thr Leu Ala Pro Arg Pro Gly Ala 
        275                 280                 285             


Gln Asp Pro Val Lys Arg Leu Val Val Glu Ala Gly Met Val Ala Lys 
    290                 295                 300                 


Ala Glu Glu Lys Arg Val Pro Glu Glu Val Phe Arg Tyr Arg Arg Glu 
305                 310                 315                 320 


Ala Leu Ala Leu Phe Leu Gly Arg Leu Phe Ser Thr Asp Gly Ser Val 
                325                 330                 335     


Glu Lys Lys Arg Ile Ser Tyr Ser Ser Ala Ser Leu Gly Leu Ala Gln 
            340                 345                 350         


Asp Val Ala His Leu Leu Leu Arg Leu Gly Ile Thr Ser Gln Leu Arg 
        355                 360                 365             


Ser Arg Gly Pro Arg Ala His Glu Val Leu Ile Ser Gly Arg Glu Asp 
    370                 375                 380                 


Ile Leu Arg Phe Ala Glu Leu Ile Gly Pro Tyr Leu Leu Gly Ala Lys 
385                 390                 395                 400 


Arg Glu Arg Leu Ala Ala Leu Glu Ala Glu Ala Arg Arg Arg Leu Pro 
                405                 410                 415     


Gly Gln Gly Trp His Leu Arg Leu Val Leu Pro Ala Val Ala Tyr Arg 
            420                 425                 430         


Val Ser Glu Ala Lys Arg Arg Ser Gly Phe Ser Trp Ser Glu Ala Gly 
        435                 440                 445             


Arg Arg Val Ala Val Ala Gly Ser Cys Leu Ser Ser Gly Leu Asn Leu 
    450                 455                 460                 


Lys Leu Pro Arg Arg Tyr Leu Ser Arg His Arg Leu Ser Leu Leu Gly 
465                 470                 475                 480 


Glu Ala Phe Ala Asp Pro Gly Leu Glu Ala Leu Ala Glu Gly Gln Val 
                485                 490                 495     


Leu Trp Asp Pro Ile Val Ala Val Glu Pro Ala Gly Lys Ala Arg Thr 
            500                 505                 510         


Phe Asp Leu Arg Val Pro Pro Phe Ala Asn Phe Val Ser Glu Asp Leu 
        515                 520                 525             


Val Val His Asn Thr Val Pro Leu Gly Gln Val Thr Ile Asp Gly Gly 
    530                 535                 540                 


Thr Tyr Asp Ile Tyr Arg Thr Thr Arg Val Asn Gln Pro Ser Ile Val 
545                 550                 555                 560 


Gly Thr Ala Thr Phe Asp Gln Tyr Trp Ser Val Arg Thr Ser Lys Arg 
                565                 570                 575     


Thr Ser Gly Thr Val Thr Val Thr Asp His Phe Arg Ala Trp Ala Asn 
            580                 585                 590         


Arg Gly Leu Asn Leu Gly Thr Ile Asp Gln Ile Thr Leu Cys Val Glu 
        595                 600                 605             


Gly Tyr Gln Ser Ser Gly Ser Ala Asn Ile Thr Gln Asn Thr Phe Ser 
    610                 615                 620                 


Gln Gly Ser Ser Ser Gly Ser Ser Gly Gly Ser Ser Gly Ser Thr Thr 
625                 630                 635                 640 


Thr Thr Arg Ile Glu Cys Glu Asn Met Ser Leu Ser Gly Pro Tyr Val 
                645                 650                 655     


Ser Arg Ile Thr Asn Pro Phe Asn Gly Ile Ala Leu Tyr Ala Asn Gly 
            660                 665                 670         


Asp Thr Ala Arg Ala Thr Val Asn Phe Pro Ala Ser Arg Asn Tyr Asn 
        675                 680                 685             


Phe Arg Leu Arg Gly Cys Gly Asn Asn Asn Asn Leu Ala Arg Val Asp 
    690                 695                 700                 


Leu Arg Ile Asp Gly Arg Thr Val Gly Thr Phe Tyr Tyr Gln Gly Thr 
705                 710                 715                 720 


Tyr Pro Trp Glu Ala Pro Ile Asp Asn Val Tyr Val Ser Ala Gly Ser 
                725                 730                 735     


His Thr Val Glu Ile Thr Val Thr Ala Asp Asn Gly Thr Trp Asp Val 
            740                 745                 750         


Tyr Ala Asp Tyr Leu Val Ile Gln 
        755                 760 


<210>  15
<211>  2280
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, S158-30-m79-110

<400>  15
caaacaagca ttactctgac atccaacgca tccggtacgt ttgacggtta caattacgaa       60

ctctggaagg atactggcaa tacaacaatg acggtctaca ctcaaggtcg cttttcctgc      120

cagtggtcga acatcaataa cgcgttgttt aggaccggga agaaatacaa ccagaattgg      180

cagtctcttg gcacaatccg gatcacgtac tctgcgactt acaacccaaa cgggaactcc      240

tacttgtgta tctatggctg gtctaccaac ccattggtcg agttctacat cgttgagtcc      300

tgggggaact ggagaccgcc tggtgccacg tccctgggcc aagtgacaat cgatggcggg      360

acctacgaca tctataggac gacacgcgtc aaccagcctt gcctggccga gggctcgctc      420

gtcttggacg cggctaccgg gcagagggtc cctatcgaaa aggtgcgtcc ggggatggaa      480

gttttctcct tgggacctga ttacagactg tatcaggtgc ccgttttgga ggtccttgag      540

agcggggttg gggaagttgt gcgcctcaga actcggtcag ggagaacgct ggtgttgaca      600

ccagatcacc cgcttttgac ccccgaaggt tggaaacctc tttgtgacct cccgcttgga      660

actccaattg cagtccccgc agaactgcct gtggcgggcc acttggcccc acctgaagaa      720

cgtgttacgc ccctggctct tctgttgggg gatgggaaca caaagctgtc gggtcggaga      780

ggtacacgtc ctaatgcctt cttctactgc aaagaccccg aattgctcgc ggcttatcgc      840

cggtgtgcag aagccttggg tgcaaaggtg aaagcatacg tccacccgac tacgggggtg      900

gttacactcg caaccctcgc tccacgtcct ggagctcaag atcctgtcaa acgcctcgtt      960

gtcgaggcgg gaatggttgc taaagccgaa gagaagaggg tcccggagga ggtgttccgt     1020

taccggcgtg aggcgttggc ccttttcttg ggccgtttgt tctcgacaga cggctctgtt     1080

gaaaagaaga ggatctctta ttcaagtgcc agtttgggac tggcccagga tgtcgcacat     1140

ctcttgctgc gccttggaat tacatctcaa ctccgttcga gagggccacg ggctcacgag     1200

gttcttatat cgggccgcga ggatattttg cggtttgctg aacttatcgg accctacctc     1260

ttgggggcca agagggagag acttgcagcg ctggaagctg aggcccgcag gcgtttgcct     1320

ggacagggat ggcacttgcg gcttgttctt cctgccgtgg cgtacagagt gagcgaggct     1380

aaaaggcgct cgggattttc gtggagtgaa gccggtcggc gcgtcgcagt tgcgggatcg     1440

tgtttgtcat ctggactcaa cctcaaattg cccagacgct acctttctcg gcaccggttg     1500

tcgatgctcg gtgaggcttt tgccgaccct gggctggaag cgctcgcgga aggccaagtg     1560

ctctgggacc ctattgttgc tgtcgaaccg gccggtaagg cgagaacatt cgacttgcgc     1620

gttccaccct ttgcaaactt cgcgagcgag gacctggtgg tgcataactc cattgtgggg     1680

acagccacgt tcgatcagta ctggagcgtg cgcacctcta agcggacttc aggaacagtg     1740

accgtgaccg atcacttccg cgcctgggcg aaccggggcc tgaacctcgg cacaatagac     1800

caaattacat tgtgcgtgga gggttaccaa agctctggat cagccaacat cacccagaac     1860

accttctctc agggctcttc ttccggcagt tcgggtggct catccggctc cacaacgact     1920

actcgcatcg agtgtgagaa catgtccttg tccggaccct acgttagcag gatcaccaat     1980

ccctttaatg gtattgcgct gtacgccaac ggagacacag cccgcgctac cgttaacttc     2040

cccgcaagtc gcaactacaa tttccgcctg cggggttgcg gcaacaacaa taatcttgcc     2100

cgtgtggacc tgaggatcga cggacggacc gtcgggacct tttattacca gggcacatac     2160

ccctgggagg ccccaattga caatgtttat gtcagtgcgg ggagtcatac agtcgaaatc     2220

actgttactg cggataacgg cacatgggac gtgtatgccg actacctggt gatacagtga     2280


<210>  16
<211>  759
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, S158-30-m79-110

<400>  16

Gln Thr Ser Ile Thr Leu Thr Ser Asn Ala Ser Gly Thr Phe Asp Gly 
1               5                   10                  15      


Tyr Asn Tyr Glu Leu Trp Lys Asp Thr Gly Asn Thr Thr Met Thr Val 
            20                  25                  30          


Tyr Thr Gln Gly Arg Phe Ser Cys Gln Trp Ser Asn Ile Asn Asn Ala 
        35                  40                  45              


Leu Phe Arg Thr Gly Lys Lys Tyr Asn Gln Asn Trp Gln Ser Leu Gly 
    50                  55                  60                  


Thr Ile Arg Ile Thr Tyr Ser Ala Thr Tyr Asn Pro Asn Gly Asn Ser 
65                  70                  75                  80  


Tyr Leu Cys Ile Tyr Gly Trp Ser Thr Asn Pro Leu Val Glu Phe Tyr 
                85                  90                  95      


Ile Val Glu Ser Trp Gly Asn Trp Arg Pro Pro Gly Ala Thr Ser Leu 
            100                 105                 110         


Gly Gln Val Thr Ile Asp Gly Gly Thr Tyr Asp Ile Tyr Arg Thr Thr 
        115                 120                 125             


Arg Val Asn Gln Pro Cys Leu Ala Glu Gly Ser Leu Val Leu Asp Ala 
    130                 135                 140                 


Ala Thr Gly Gln Arg Val Pro Ile Glu Lys Val Arg Pro Gly Met Glu 
145                 150                 155                 160 


Val Phe Ser Leu Gly Pro Asp Tyr Arg Leu Tyr Gln Val Pro Val Leu 
                165                 170                 175     


Glu Val Leu Glu Ser Gly Val Gly Glu Val Val Arg Leu Arg Thr Arg 
            180                 185                 190         


Ser Gly Arg Thr Leu Val Leu Thr Pro Asp His Pro Leu Leu Thr Pro 
        195                 200                 205             


Glu Gly Trp Lys Pro Leu Cys Asp Leu Pro Leu Gly Thr Pro Ile Ala 
    210                 215                 220                 


Val Pro Ala Glu Leu Pro Val Ala Gly His Leu Ala Pro Pro Glu Glu 
225                 230                 235                 240 


Arg Val Thr Pro Leu Ala Leu Leu Leu Gly Asp Gly Asn Thr Lys Leu 
                245                 250                 255     


Ser Gly Arg Arg Gly Thr Arg Pro Asn Ala Phe Phe Tyr Cys Lys Asp 
            260                 265                 270         


Pro Glu Leu Leu Ala Ala Tyr Arg Arg Cys Ala Glu Ala Leu Gly Ala 
        275                 280                 285             


Lys Val Lys Ala Tyr Val His Pro Thr Thr Gly Val Val Thr Leu Ala 
    290                 295                 300                 


Thr Leu Ala Pro Arg Pro Gly Ala Gln Asp Pro Val Lys Arg Leu Val 
305                 310                 315                 320 


Val Glu Ala Gly Met Val Ala Lys Ala Glu Glu Lys Arg Val Pro Glu 
                325                 330                 335     


Glu Val Phe Arg Tyr Arg Arg Glu Ala Leu Ala Leu Phe Leu Gly Arg 
            340                 345                 350         


Leu Phe Ser Thr Asp Gly Ser Val Glu Lys Lys Arg Ile Ser Tyr Ser 
        355                 360                 365             


Ser Ala Ser Leu Gly Leu Ala Gln Asp Val Ala His Leu Leu Leu Arg 
    370                 375                 380                 


Leu Gly Ile Thr Ser Gln Leu Arg Ser Arg Gly Pro Arg Ala His Glu 
385                 390                 395                 400 


Val Leu Ile Ser Gly Arg Glu Asp Ile Leu Arg Phe Ala Glu Leu Ile 
                405                 410                 415     


Gly Pro Tyr Leu Leu Gly Ala Lys Arg Glu Arg Leu Ala Ala Leu Glu 
            420                 425                 430         


Ala Glu Ala Arg Arg Arg Leu Pro Gly Gln Gly Trp His Leu Arg Leu 
        435                 440                 445             


Val Leu Pro Ala Val Ala Tyr Arg Val Ser Glu Ala Lys Arg Arg Ser 
    450                 455                 460                 


Gly Phe Ser Trp Ser Glu Ala Gly Arg Arg Val Ala Val Ala Gly Ser 
465                 470                 475                 480 


Cys Leu Ser Ser Gly Leu Asn Leu Lys Leu Pro Arg Arg Tyr Leu Ser 
                485                 490                 495     


Arg His Arg Leu Ser Met Leu Gly Glu Ala Phe Ala Asp Pro Gly Leu 
            500                 505                 510         


Glu Ala Leu Ala Glu Gly Gln Val Leu Trp Asp Pro Ile Val Ala Val 
        515                 520                 525             


Glu Pro Ala Gly Lys Ala Arg Thr Phe Asp Leu Arg Val Pro Pro Phe 
    530                 535                 540                 


Ala Asn Phe Ala Ser Glu Asp Leu Val Val His Asn Ser Ile Val Gly 
545                 550                 555                 560 


Thr Ala Thr Phe Asp Gln Tyr Trp Ser Val Arg Thr Ser Lys Arg Thr 
                565                 570                 575     


Ser Gly Thr Val Thr Val Thr Asp His Phe Arg Ala Trp Ala Asn Arg 
            580                 585                 590         


Gly Leu Asn Leu Gly Thr Ile Asp Gln Ile Thr Leu Cys Val Glu Gly 
        595                 600                 605             


Tyr Gln Ser Ser Gly Ser Ala Asn Ile Thr Gln Asn Thr Phe Ser Gln 
    610                 615                 620                 


Gly Ser Ser Ser Gly Ser Ser Gly Gly Ser Ser Gly Ser Thr Thr Thr 
625                 630                 635                 640 


Thr Arg Ile Glu Cys Glu Asn Met Ser Leu Ser Gly Pro Tyr Val Ser 
                645                 650                 655     


Arg Ile Thr Asn Pro Phe Asn Gly Ile Ala Leu Tyr Ala Asn Gly Asp 
            660                 665                 670         


Thr Ala Arg Ala Thr Val Asn Phe Pro Ala Ser Arg Asn Tyr Asn Phe 
        675                 680                 685             


Arg Leu Arg Gly Cys Gly Asn Asn Asn Asn Leu Ala Arg Val Asp Leu 
    690                 695                 700                 


Arg Ile Asp Gly Arg Thr Val Gly Thr Phe Tyr Tyr Gln Gly Thr Tyr 
705                 710                 715                 720 


Pro Trp Glu Ala Pro Ile Asp Asn Val Tyr Val Ser Ala Gly Ser His 
                725                 730                 735     


Thr Val Glu Ile Thr Val Thr Ala Asp Asn Gly Thr Trp Asp Val Tyr 
            740                 745                 750         


Ala Asp Tyr Leu Val Ile Gln 
        755                 


<210>  17
<211>  759
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, Modified Protein

<400>  17

Gln Thr Ser Ile Thr Leu Thr Ser Asn Ala Ser Gly Thr Phe Asp Gly 
1               5                   10                  15      


Tyr Asn Tyr Glu Leu Trp Lys Asp Thr Gly Asn Thr Thr Met Thr Val 
            20                  25                  30          


Tyr Thr Gln Gly Arg Phe Ser Cys Gln Trp Ser Asn Ile Asn Asn Ala 
        35                  40                  45              


Leu Phe Arg Thr Gly Lys Lys Tyr Asn Gln Asn Trp Gln Ser Leu Gly 
    50                  55                  60                  


Thr Ile Arg Ile Thr Tyr Ser Ala Thr Tyr Asn Pro Asn Gly Asn Ser 
65                  70                  75                  80  


Tyr Leu Cys Ile Tyr Gly Trp Ser Thr Asn Pro Leu Val Glu Phe Tyr 
                85                  90                  95      


Ile Val Glu Ser Trp Gly Asn Trp Arg Pro Pro Gly Ala Thr Ser Leu 
            100                 105                 110         


Gly Gln Val Thr Ile Asp Gly Gly Thr Tyr Asp Ile Tyr Arg Thr Thr 
        115                 120                 125             


Arg Val Asn Gln Pro Cys Leu Ala Glu Gly Ser Leu Val Leu Asp Ala 
    130                 135                 140                 


Ala Thr Gly Gln Arg Val Pro Ile Glu Lys Val Arg Pro Gly Met Glu 
145                 150                 155                 160 


Val Phe Ser Leu Gly Pro Asp Tyr Arg Leu Tyr Gln Val Pro Val Leu 
                165                 170                 175     


Glu Val Leu Glu Ser Gly Val Gly Glu Val Val Arg Leu Arg Thr Arg 
            180                 185                 190         


Ser Gly Arg Thr Leu Val Leu Thr Pro Asp His Pro Leu Leu Thr Pro 
        195                 200                 205             


Glu Gly Trp Lys Pro Leu Cys Asp Leu Pro Leu Gly Thr Pro Ile Ala 
    210                 215                 220                 


Val Pro Ala Glu Leu Pro Val Ala Gly His Leu Ala Pro Pro Glu Glu 
225                 230                 235                 240 


Arg Val Thr Pro Leu Ala Leu Leu Leu Gly Asp Gly Asn Thr Lys Leu 
                245                 250                 255     


Ser Gly Arg Arg Gly Thr Arg Pro Asn Ala Phe Phe Tyr Cys Lys Asp 
            260                 265                 270         


Pro Glu Leu Leu Ala Ala Tyr Arg Arg Cys Ala Glu Ala Leu Gly Ala 
        275                 280                 285             


Lys Val Lys Ala Tyr Val His Pro Thr Thr Gly Val Val Thr Leu Ala 
    290                 295                 300                 


Thr Leu Ala Pro Arg Pro Gly Ala Gln Asp Pro Val Lys Arg Leu Val 
305                 310                 315                 320 


Val Glu Ala Gly Met Val Ala Lys Ala Glu Glu Lys Arg Val Pro Glu 
                325                 330                 335     


Glu Val Phe Arg Tyr Arg Arg Glu Ala Leu Ala Leu Phe Leu Gly Arg 
            340                 345                 350         


Leu Phe Ser Thr Asp Gly Ser Val Glu Lys Lys Arg Ile Ser Tyr Ser 
        355                 360                 365             


Ser Ala Ser Leu Gly Leu Ala Gln Asp Val Ala His Leu Leu Leu Arg 
    370                 375                 380                 


Leu Gly Ile Thr Ser Gln Leu Arg Ser Arg Gly Pro Arg Ala His Glu 
385                 390                 395                 400 


Val Leu Ile Ser Gly Arg Glu Asp Ile Leu Arg Phe Ala Glu Leu Ile 
                405                 410                 415     


Gly Pro Tyr Leu Leu Gly Ala Lys Arg Glu Arg Leu Ala Ala Leu Glu 
            420                 425                 430         


Ala Glu Ala Arg Arg Arg Leu Pro Gly Gln Gly Trp His Leu Arg Leu 
        435                 440                 445             


Val Leu Pro Ala Val Ala Tyr Arg Val Ser Glu Ala Lys Arg Arg Ser 
    450                 455                 460                 


Gly Phe Ser Trp Ser Glu Ala Gly Arg Arg Val Ala Val Ala Gly Ser 
465                 470                 475                 480 


Cys Leu Ser Ser Gly Leu Asn Leu Lys Leu Pro Arg Arg Tyr Leu Ser 
                485                 490                 495     


Arg His Arg Leu Ser Met Leu Gly Glu Ala Phe Ala Asp Pro Gly Leu 
            500                 505                 510         


Glu Ala Leu Ala Glu Gly Gln Val Leu Trp Asp Pro Ile Val Ala Val 
        515                 520                 525             


Glu Pro Ala Gly Lys Ala Arg Thr Phe Asp Leu Arg Val Pro Pro Phe 
    530                 535                 540                 


Ala Asn Phe Ala Ser Glu Asp Leu Val Val His Asn Ser Ile Val Gly 
545                 550                 555                 560 


Thr Ala Thr Phe Asp Gln Tyr Trp Ser Val Arg Thr Ser Lys Arg Thr 
                565                 570                 575     


Ser Gly Thr Val Thr Val Thr Asp His Phe Arg Ala Trp Ala Asn Arg 
            580                 585                 590         


Gly Leu Asn Leu Gly Thr Ile Asp Gln Ile Thr Leu Cys Val Glu Gly 
        595                 600                 605             


Tyr Gln Ser Ser Gly Ser Ala Asn Ile Thr Gln Asn Thr Phe Ser Gln 
    610                 615                 620                 


Gly Ser Ser Ser Gly Ser Ser Gly Gly Ser Ser Gly Ser Thr Thr Thr 
625                 630                 635                 640 


Thr Arg Ile Glu Cys Glu Asn Met Ser Leu Ser Gly Pro Tyr Val Ser 
                645                 650                 655     


Arg Ile Thr Asn Pro Phe Asn Gly Ile Ala Leu Tyr Ala Asn Gly Asp 
            660                 665                 670         


Thr Ala Arg Ala Thr Val Asn Phe Pro Ala Ser Arg Asn Tyr Asn Phe 
        675                 680                 685             


Arg Leu Arg Gly Cys Gly Asn Asn Asn Asn Leu Ala Arg Val Asp Leu 
    690                 695                 700                 


Arg Ile Asp Gly Arg Thr Val Gly Thr Phe Tyr Tyr Gln Gly Thr Tyr 
705                 710                 715                 720 


Pro Trp Glu Ala Pro Ile Asp Asn Val Tyr Val Ser Ala Gly Ser His 
                725                 730                 735     


Thr Val Glu Ile Thr Val Thr Ala Asp Asn Gly Thr Trp Asp Val Tyr 
            740                 745                 750         


Ala Asp Tyr Leu Val Ile Gln 
        755                 


<210>  18
<211>  2280
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, Modified Protein Coding Seq.

<400>  18
caaacaagca ttactctgac atccaacgca tccggtacgt ttgacggtta caattacgaa       60

ctctggaagg atactggcaa tacaacaatg acggtctaca ctcaaggtcg cttttcctgc      120

cagtggtcga acatcaataa cgcgttgttt aggaccggga agaaatacaa ccagaattgg      180

cagtctcttg gcacaatccg gatcacgtac tctgcgactt acaacccaaa cgggaactcc      240

tacttgtgta tctatggctg gtctaccaac ccattggtcg agttctacat cgttgagtcc      300

tgggggaact ggagaccgcc tggtgccacg tccctgggcc aagtgacaat cgatggcggg      360

acctacgaca tctataggac gacacgcgtc aaccagcctt gcctggccga gggctcgctc      420

gtcttggacg cggctaccgg gcagagggtc cctatcgaaa aggtgcgtcc ggggatggaa      480

gttttctcct tgggacctga ttacagactg tatcaggtgc ccgttttgga ggtccttgag      540

agcggggttg gggaagttgt gcgcctcaga actcggtcag ggagaacgct ggtgttgaca      600

ccagatcacc cgcttttgac ccccgaaggt tggaaacctc tttgtgacct cccgcttgga      660

actccaattg cagtccccgc agaactgcct gtggcgggcc acttggcccc acctgaagaa      720

cgtgttacgc ccctggctct tctgttgggg gatgggaaca caaagctgtc gggtcggaga      780

ggtacacgtc ctaatgcctt cttctactgc aaagaccccg aattgctcgc ggcttatcgc      840

cggtgtgcag aagccttggg tgcaaaggtg aaagcatacg tccacccgac tacgggggtg      900

gttacactcg caaccctcgc tccacgtcct ggagctcaag atcctgtcaa acgcctcgtt      960

gtcgaggcgg gaatggttgc taaagccgaa gagaagaggg tcccggagga ggtgttccgt     1020

taccggcgtg aggcgttggc ccttttcttg ggccgtttgt tctcgacaga cggctctgtt     1080

gaaaagaaga ggatctctta ttcaagtgcc agtttgggac tggcccagga tgtcgcacat     1140

ctcttgctgc gccttggaat tacatctcaa ctccgttcga gagggccacg ggctcacgag     1200

gttcttatat cgggccgcga ggatattttg cggtttgctg aacttatcgg accctacctc     1260

ttgggggcca agagggagag acttgcagcg ctggaagctg aggcccgcag gcgtttgcct     1320

ggacagggat ggcacttgcg gcttgttctt cctgccgtgg cgtacagagt gagcgaggct     1380

aaaaggcgct cgggattttc gtggagtgaa gccggtcggc gcgtcgcagt tgcgggatcg     1440

tgtttgtcat ctggactcaa cctcaaattg cccagacgct acctttctcg gcaccggttg     1500

tcgatgctcg gtgaggcttt tgccgaccct gggctggaag cgctcgcgga aggccaagtg     1560

ctctgggacc ctattgttgc tgtcgaaccg gccggtaagg cgagaacatt cgacttgcgc     1620

gttccaccct ttgcaaactt cgcgagcgag gacctggtgg tgcataactc cattgtgggg     1680

acagccacgt tcgatcagta ctggagcgtg cgcacctcta agcggacttc aggaacagtg     1740

accgtgaccg atcacttccg cgcctgggcg aaccggggcc tgaacctcgg cacaatagac     1800

caaattacat tgtgcgtgga gggttaccaa agctctggat cagccaacat cacccagaac     1860

accttctctc agggctcttc ttccggcagt tcgggtggct catccggctc cacaacgact     1920

actcgcatcg agtgtgagaa catgtccttg tccggaccct acgttagcag gatcaccaat     1980

ccctttaatg gtattgcgct gtacgccaac ggagacacag cccgcgctac cgttaacttc     2040

cccgcaagtc gcaactacaa tttccgcctg cggggttgcg gcaacaacaa taatcttgcc     2100

cgtgtggacc tgaggatcga cggacggacc gtcgggacct tttattacca gggcacatac     2160

ccctgggagg ccccaattga caatgtttat gtcagtgcgg ggagtcatac agtcgaaatc     2220

actgttactg cggataacgg cacatgggac gtgtatgccg actacctggt gatacagtga     2280


<210>  19
<211>  325
<212>  PRT
<213>  Dictyoglomus thermophilum

<400>  19

Gln Thr Ser Ile Thr Leu Thr Ser Asn Ala Ser Gly Thr Phe Asp Gly 
1               5                   10                  15      


Tyr Tyr Tyr Glu Leu Trp Lys Asp Thr Gly Asn Thr Thr Met Thr Val 
            20                  25                  30          


Tyr Thr Gln Gly Arg Phe Ser Cys Gln Trp Ser Asn Ile Asn Asn Ala 
        35                  40                  45              


Leu Phe Arg Thr Gly Lys Lys Tyr Asn Gln Asn Trp Gln Ser Leu Gly 
    50                  55                  60                  


Thr Ile Arg Ile Thr Tyr Ser Ala Thr Tyr Asn Pro Asn Gly Asn Ser 
65                  70                  75                  80  


Tyr Leu Cys Ile Tyr Gly Trp Ser Thr Asn Pro Leu Val Glu Phe Tyr 
                85                  90                  95      


Ile Val Glu Ser Trp Gly Asn Trp Arg Pro Pro Gly Ala Thr Ser Leu 
            100                 105                 110         


Gly Gln Val Thr Ile Asp Gly Gly Thr Tyr Ser Ile Val Gly Thr Ala 
        115                 120                 125             


Thr Phe Asp Gln Tyr Trp Ser Val Arg Thr Ser Lys Arg Thr Ser Gly 
    130                 135                 140                 


Thr Val Thr Val Thr Asp His Phe Arg Ala Trp Ala Asn Arg Gly Leu 
145                 150                 155                 160 


Asn Leu Gly Thr Ile Asp Gln Ile Thr Leu Cys Val Glu Gly Tyr Gln 
                165                 170                 175     


Ser Ser Gly Ser Ala Asn Ile Thr Gln Asn Thr Phe Ser Gln Gly Ser 
            180                 185                 190         


Ser Ser Gly Ser Ser Gly Gly Ser Ser Gly Ser Thr Thr Thr Thr Arg 
        195                 200                 205             


Ile Glu Cys Glu Asn Met Ser Leu Ser Gly Pro Tyr Val Ser Arg Ile 
    210                 215                 220                 


Thr Asn Pro Phe Asn Gly Ile Ala Leu Tyr Ala Asn Gly Asp Thr Ala 
225                 230                 235                 240 


Arg Ala Thr Val Asn Phe Pro Ala Ser Arg Asn Tyr Asn Phe Arg Leu 
                245                 250                 255     


Arg Gly Cys Gly Asn Asn Asn Asn Leu Ala Arg Val Asp Leu Arg Ile 
            260                 265                 270         


Asp Gly Arg Thr Val Gly Thr Phe Tyr Tyr Gln Gly Thr Tyr Pro Trp 
        275                 280                 285             


Glu Ala Pro Ile Asp Asn Val Tyr Val Ser Ala Gly Ser His Thr Val 
    290                 295                 300                 


Glu Ile Thr Val Thr Ala Asp Asn Gly Thr Trp Asp Val Tyr Ala Asp 
305                 310                 315                 320 


Tyr Leu Val Ile Gln 
                325 


<210>  20
<211>  1014
<212>  DNA
<213>  Dictyoglomus thermophilum

<400>  20
atgcaaacaa gcattactct gacatccaac gcatccggta cgtttgacgg ttactattac       60

gaactctgga aggatactgg caatacaaca atgacggtct acactcaagg tcgcttttcc      120

tgccagtggt cgaacatcaa taacgcgttg tttaggaccg ggaagaaata caaccagaat      180

tggcagtctc ttggcacaat ccggatcacg tactctgcga cttacaaccc aaacgggaac      240

tcctacttgt gtatctatgg ctggtctacc aacccattgg tcgagttcta catcgttgag      300

tcctggggga actggagacc gcctggtgcc acgtccctgg gccaagtgac aatcgatggc      360

gggacctacg acatctatag gacgacacgc gtcaaccagc cttccattgt ggggacagcc      420

acgttcgatc agtactggag cgtgcgcacc tctaagcgga cttcaggaac agtgaccgtg      480

accgatcact tccgcgcctg ggcgaaccgg ggcctgaacc tcggcacaat agaccaaatt      540

acattgtgcg tggagggtta ccaaagctct ggatcagcca acatcaccca gaacaccttc      600

tctcagggct cttcttccgg cagttcgggt ggctcatccg gctccacaac gactactcgc      660

atcgagtgtg agaacatgtc cttgtccgga ccctacgtta gcaggatcac caatcccttt      720

aatggtattg cgctgtacgc caacggagac acagcccgcg ctaccgttaa cttccccgca      780

agtcgcaact acaatttccg cctgcggggt tgcggcaaca acaataatct tgcccgtgtg      840

gacctgagga tcgacggacg gaccgtcggg accttttatt accagggcac atacccctgg      900

gaggccccaa ttgacaatgt ttatgtcagt gcggggagtc atacagtcga aatcactgtt      960

actgcggata acggcacatg ggacgtgtat gccgactacc tggtgataca gtga           1014


<210>  21
<211>  759
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, T134-195

<400>  21

Gln Thr Ser Ile Thr Leu Thr Ser Asn Ala Ser Gly Thr Phe Asp Gly 
1               5                   10                  15      


Tyr Tyr Tyr Glu Leu Trp Lys Asp Thr Gly Asn Thr Thr Met Thr Val 
            20                  25                  30          


Tyr Thr Gln Gly Arg Phe Ser Cys Gln Trp Ser Asn Ile Asn Asn Ala 
        35                  40                  45              


Leu Phe Arg Thr Gly Lys Lys Tyr Asn Gln Asn Trp Gln Ser Leu Gly 
    50                  55                  60                  


Thr Ile Arg Ile Thr Tyr Ser Ala Thr Tyr Asn Pro Asn Gly Asn Ser 
65                  70                  75                  80  


Tyr Leu Cys Ile Tyr Gly Trp Ser Thr Asn Pro Leu Val Glu Phe Tyr 
                85                  90                  95      


Ile Val Glu Ser Trp Gly Asn Trp Arg Pro Pro Gly Ala Cys Leu Ala 
            100                 105                 110         


Glu Gly Ser Leu Val Leu Asp Ala Ala Thr Gly Gln Arg Val Pro Ile 
        115                 120                 125             


Glu Lys Val Arg Pro Gly Met Glu Val Phe Ser Leu Gly Pro Asp Tyr 
    130                 135                 140                 


Arg Leu Tyr Arg Val Pro Val Leu Glu Val Leu Glu Ser Gly Val Arg 
145                 150                 155                 160 


Glu Val Val Arg Leu Arg Thr Arg Ser Gly Arg Thr Leu Val Leu Thr 
                165                 170                 175     


Pro Asp His Pro Leu Leu Thr Pro Glu Gly Trp Lys Pro Leu Cys Asp 
            180                 185                 190         


Leu Pro Leu Gly Thr Pro Ile Ala Val Pro Ala Glu Leu Pro Val Ala 
        195                 200                 205             


Gly His Leu Ala Pro Pro Glu Glu Arg Val Thr Leu Leu Ala Leu Leu 
    210                 215                 220                 


Leu Gly Asp Gly Asn Thr Lys Leu Ser Gly Arg Arg Gly Thr Arg Pro 
225                 230                 235                 240 


Asn Ala Ser Phe Tyr Ser Lys Asp Pro Glu Leu Leu Ala Ala Tyr Arg 
                245                 250                 255     


Arg Cys Ala Glu Ala Leu Gly Ala Lys Val Lys Ala Tyr Val His Pro 
            260                 265                 270         


Thr Thr Gly Val Val Thr Leu Ala Thr Leu Ala Pro Arg Pro Gly Ala 
        275                 280                 285             


Gln Asp Pro Val Lys Arg Leu Val Val Glu Ala Gly Met Val Ala Lys 
    290                 295                 300                 


Ala Glu Glu Lys Arg Val Pro Glu Glu Val Phe Arg Tyr Arg Arg Glu 
305                 310                 315                 320 


Ala Leu Ala Leu Phe Leu Gly Arg Leu Phe Ser Thr Asp Gly Ser Val 
                325                 330                 335     


Glu Lys Lys Arg Ile Ser Tyr Ser Ser Ala Ser Leu Gly Leu Ala Gln 
            340                 345                 350         


Asp Val Ala His Leu Leu Leu Arg Leu Gly Ile Arg Ser Gln Leu Arg 
        355                 360                 365             


Ser Arg Gly Pro Arg Ala His Glu Val Leu Ile Ser Gly Arg Glu Asp 
    370                 375                 380                 


Ile Leu Arg Phe Ala Glu Leu Ile Gly Pro Tyr Leu Leu Gly Ala Lys 
385                 390                 395                 400 


Arg Glu Arg Leu Ala Ala Leu Glu Ala Glu Ala Arg Arg Arg Leu Pro 
                405                 410                 415     


Gly Gln Gly Trp His Leu Arg Leu Val Leu Pro Ala Val Ala Tyr Arg 
            420                 425                 430         


Val Ser Glu Ala Lys Arg Arg Ser Gly Phe Ser Trp Ser Glu Ala Gly 
        435                 440                 445             


Arg Arg Val Ala Val Ala Gly Ser Cys Leu Ser Ser Gly Leu Asn Leu 
    450                 455                 460                 


Lys Leu Pro Arg Arg Tyr Leu Ser Arg His Arg Leu Ser Leu Leu Gly 
465                 470                 475                 480 


Glu Ala Phe Ala Asp Pro Gly Leu Glu Ala Leu Ala Glu Gly Gln Val 
                485                 490                 495     


Leu Trp Asp Pro Ile Val Ala Val Glu Pro Ala Gly Lys Ala Arg Thr 
            500                 505                 510         


Phe Asp Leu Arg Val Pro Pro Phe Ala Asn Phe Val Ser Glu Asp Leu 
        515                 520                 525             


Val Val His Asn Thr Ser Leu Gly Gln Val Thr Ile Asp Gly Gly Thr 
    530                 535                 540                 


Tyr Asp Ile Tyr Arg Thr Thr Arg Val Asn Gln Pro Ser Ile Val Gly 
545                 550                 555                 560 


Thr Ala Thr Phe Asp Gln Tyr Trp Ser Val Arg Thr Ser Lys Arg Thr 
                565                 570                 575     


Ser Gly Thr Val Thr Val Thr Asp His Phe Arg Ala Trp Ala Asn Arg 
            580                 585                 590         


Gly Leu Asn Leu Gly Thr Ile Asp Gln Ile Thr Leu Cys Val Glu Gly 
        595                 600                 605             


Tyr Gln Ser Ser Gly Ser Ala Asn Ile Thr Gln Asn Thr Phe Ser Gln 
    610                 615                 620                 


Gly Ser Ser Ser Gly Ser Ser Gly Gly Ser Ser Gly Ser Thr Thr Thr 
625                 630                 635                 640 


Thr Arg Ile Glu Cys Glu Asn Met Ser Leu Ser Gly Pro Tyr Val Ser 
                645                 650                 655     


Arg Ile Thr Asn Pro Phe Asn Gly Ile Ala Leu Tyr Ala Asn Gly Asp 
            660                 665                 670         


Thr Ala Arg Ala Thr Val Asn Phe Pro Ala Ser Arg Asn Tyr Asn Phe 
        675                 680                 685             


Arg Leu Arg Gly Cys Gly Asn Asn Asn Asn Leu Ala Arg Val Asp Leu 
    690                 695                 700                 


Arg Ile Asp Gly Arg Thr Val Gly Thr Phe Tyr Tyr Gln Gly Thr Tyr 
705                 710                 715                 720 


Pro Trp Glu Ala Pro Ile Asp Asn Val Tyr Val Ser Ala Gly Ser His 
                725                 730                 735     


Thr Val Glu Ile Thr Val Thr Ala Asp Asn Gly Thr Trp Asp Val Tyr 
            740                 745                 750         


Ala Asp Tyr Leu Val Ile Gln 
        755                 


<210>  22
<211>  423
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, Tth-S158-39 Intein Sequence

<400>  22

Cys Leu Ala Glu Gly Ser Leu Val Leu Asp Ala Ala Thr Gly Gln Arg 
1               5                   10                  15      


Val Pro Ile Glu Lys Val Arg Pro Gly Met Glu Val Phe Ser Leu Gly 
            20                  25                  30          


Pro Asp Tyr Arg Leu Tyr Arg Val Pro Val Leu Glu Val Leu Glu Ser 
        35                  40                  45              


Gly Val Gly Glu Val Val Arg Leu Arg Thr Arg Ser Gly Arg Thr Leu 
    50                  55                  60                  


Val Leu Thr Pro Asp His Pro Leu Leu Thr Pro Glu Gly Trp Lys Pro 
65                  70                  75                  80  


Leu Cys Asp Leu Pro Leu Gly Thr Pro Ile Ala Val Pro Ala Glu Leu 
                85                  90                  95      


Pro Val Ala Gly His Leu Ala Pro Pro Glu Glu Arg Val Thr Leu Leu 
            100                 105                 110         


Ala Leu Leu Leu Gly Asp Gly Asn Thr Lys Leu Ser Gly Arg Arg Gly 
        115                 120                 125             


Thr Arg Pro Asn Ala Phe Phe Tyr Ser Lys Asp Pro Glu Leu Leu Ala 
    130                 135                 140                 


Ala Tyr Arg Arg Cys Ala Glu Ala Leu Gly Ala Lys Val Lys Ala Tyr 
145                 150                 155                 160 


Val His Pro Thr Thr Gly Val Val Thr Leu Ala Thr Leu Ala Pro Arg 
                165                 170                 175     


Pro Gly Ala Gln Asp Pro Val Lys Arg Leu Val Val Glu Ala Gly Met 
            180                 185                 190         


Val Ala Lys Ala Glu Glu Lys Arg Val Pro Glu Glu Val Phe Arg Tyr 
        195                 200                 205             


Arg Arg Glu Ala Leu Ala Leu Phe Leu Gly Arg Leu Ser Ser Thr Asp 
    210                 215                 220                 


Gly Ser Val Glu Arg Lys Arg Ile Ser Tyr Ser Ser Ala Ser Leu Gly 
225                 230                 235                 240 


Leu Ala Gln Asp Val Ala His Leu Leu Leu Arg Leu Gly Ile Thr Ser 
                245                 250                 255     


Gln Leu Arg Ser Arg Gly Pro Arg Ala His Glu Val Leu Ile Ser Gly 
            260                 265                 270         


Arg Glu Asp Ile Leu Arg Phe Ala Glu Leu Ile Gly Pro Tyr Leu Leu 
        275                 280                 285             


Gly Ala Lys Arg Glu Arg Leu Ala Ala Leu Glu Ala Glu Ala Arg Arg 
    290                 295                 300                 


Arg Leu Pro Gly Gln Gly Trp His Leu Arg Leu Val Leu Pro Ala Val 
305                 310                 315                 320 


Ala Tyr Arg Val Ser Glu Ala Lys Arg Arg Ser Gly Phe Ser Trp Ser 
                325                 330                 335     


Glu Ala Gly Gln Arg Val Ala Val Ala Gly Ser Cys Leu Ser Ser Gly 
            340                 345                 350         


Leu Asn Leu Lys Leu Pro Arg Arg Tyr Leu Ser Arg His Arg Leu Ser 
        355                 360                 365             


Leu Leu Gly Glu Ala Phe Ala Asp Pro Gly Leu Glu Ala Leu Ala Glu 
    370                 375                 380                 


Gly Gln Val Leu Trp Asp Pro Ile Val Ala Val Glu Pro Ala Gly Lys 
385                 390                 395                 400 


Ala Arg Thr Phe Asp Leu Arg Val Pro Pro Phe Ala Asn Phe Val Ser 
                405                 410                 415     


Glu Asp Leu Val Val His Asn 
            420             


<210>  23
<211>  423
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, Tth-T134-195 Intein Sequence

<400>  23

Cys Leu Ala Glu Gly Ser Leu Val Leu Asp Ala Ala Thr Gly Gln Arg 
1               5                   10                  15      


Val Pro Ile Glu Lys Val Arg Pro Gly Met Glu Val Phe Ser Leu Gly 
            20                  25                  30          


Pro Asp Tyr Arg Leu Tyr Arg Val Pro Val Leu Glu Val Leu Glu Ser 
        35                  40                  45              


Gly Val Arg Glu Val Val Arg Leu Arg Thr Arg Ser Gly Arg Thr Leu 
    50                  55                  60                  


Val Leu Thr Pro Asp His Pro Leu Leu Thr Pro Glu Gly Trp Lys Pro 
65                  70                  75                  80  


Leu Cys Asp Leu Pro Leu Gly Thr Pro Ile Ala Val Pro Ala Glu Leu 
                85                  90                  95      


Pro Val Ala Gly His Leu Ala Pro Pro Glu Glu Arg Val Thr Leu Leu 
            100                 105                 110         


Ala Leu Leu Leu Gly Asp Gly Asn Thr Lys Leu Ser Gly Arg Arg Gly 
        115                 120                 125             


Thr Arg Pro Asn Ala Ser Phe Tyr Ser Lys Asp Pro Glu Leu Leu Ala 
    130                 135                 140                 


Ala Tyr Arg Arg Cys Ala Glu Ala Leu Gly Ala Lys Val Lys Ala Tyr 
145                 150                 155                 160 


Val His Pro Thr Thr Gly Val Val Thr Leu Ala Thr Leu Ala Pro Arg 
                165                 170                 175     


Pro Gly Ala Gln Asp Pro Val Lys Arg Leu Val Val Glu Ala Gly Met 
            180                 185                 190         


Val Ala Lys Ala Glu Glu Lys Arg Val Pro Glu Glu Val Phe Arg Tyr 
        195                 200                 205             


Arg Arg Glu Ala Leu Ala Leu Phe Leu Gly Arg Leu Phe Ser Thr Asp 
    210                 215                 220                 


Gly Ser Val Glu Lys Lys Arg Ile Ser Tyr Ser Ser Ala Ser Leu Gly 
225                 230                 235                 240 


Leu Ala Gln Asp Val Ala His Leu Leu Leu Arg Leu Gly Ile Arg Ser 
                245                 250                 255     


Gln Leu Arg Ser Arg Gly Pro Arg Ala His Glu Val Leu Ile Ser Gly 
            260                 265                 270         


Arg Glu Asp Ile Leu Arg Phe Ala Glu Leu Ile Gly Pro Tyr Leu Leu 
        275                 280                 285             


Gly Ala Lys Arg Glu Arg Leu Ala Ala Leu Glu Ala Glu Ala Arg Arg 
    290                 295                 300                 


Arg Leu Pro Gly Gln Gly Trp His Leu Arg Leu Val Leu Pro Ala Val 
305                 310                 315                 320 


Ala Tyr Arg Val Ser Glu Ala Lys Arg Arg Ser Gly Phe Ser Trp Ser 
                325                 330                 335     


Glu Ala Gly Arg Arg Val Ala Val Ala Gly Ser Cys Leu Ser Ser Gly 
            340                 345                 350         


Leu Asn Leu Lys Leu Pro Arg Arg Tyr Leu Ser Arg His Arg Leu Ser 
        355                 360                 365             


Leu Leu Gly Glu Ala Phe Ala Asp Pro Gly Leu Glu Ala Leu Ala Glu 
    370                 375                 380                 


Gly Gln Val Leu Trp Asp Pro Ile Val Ala Val Glu Pro Ala Gly Lys 
385                 390                 395                 400 


Ala Arg Thr Phe Asp Leu Arg Val Pro Pro Phe Ala Asn Phe Val Ser 
                405                 410                 415     


Glu Asp Leu Val Val His Asn 
            420             


<210>  24
<211>  423
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, Tth-S158-21 Intein Sequence

<400>  24

Cys Leu Ala Glu Gly Ser Leu Val Leu Asp Ala Ala Thr Gly Gln Arg 
1               5                   10                  15      


Val Pro Ile Glu Lys Val Arg Pro Gly Met Glu Val Phe Ser Leu Gly 
            20                  25                  30          


Pro Asp Tyr Arg Leu Tyr Arg Val Pro Val Leu Glu Val Leu Glu Ser 
        35                  40                  45              


Gly Val Arg Glu Val Val Arg Leu Arg Thr Arg Ser Gly Arg Thr Leu 
    50                  55                  60                  


Val Leu Thr Pro Asp His Pro Leu Leu Thr Pro Glu Gly Trp Lys Pro 
65                  70                  75                  80  


Leu Cys Asp Leu Pro Leu Gly Thr Pro Ile Ala Val Pro Ala Glu Leu 
                85                  90                  95      


Pro Val Ala Gly His Leu Ala Pro Pro Glu Glu Arg Val Thr Leu Leu 
            100                 105                 110         


Ala Leu Leu Leu Gly Asp Gly Asn Thr Lys Leu Ser Gly Arg Arg Gly 
        115                 120                 125             


Thr Arg Pro Asn Ala Phe Phe Tyr Ser Lys Asp Pro Glu Leu Leu Ala 
    130                 135                 140                 


Ala Tyr Arg Arg Cys Gly Glu Ala Leu Gly Ala Lys Val Lys Ala Tyr 
145                 150                 155                 160 


Val His Pro Thr Thr Gly Val Val Thr Leu Ala Thr Leu Ala Pro Arg 
                165                 170                 175     


Pro Gly Ala Gln Asp Pro Val Lys Arg Leu Val Val Glu Ala Gly Met 
            180                 185                 190         


Val Ala Lys Ala Glu Glu Lys Arg Val Pro Glu Glu Val Phe Arg Tyr 
        195                 200                 205             


Arg Arg Glu Ala Leu Ala Leu Phe Leu Gly Arg Leu Phe Ser Thr Asp 
    210                 215                 220                 


Gly Ser Val Glu Lys Lys Arg Ile Ser Tyr Ser Ser Ala Ser Leu Gly 
225                 230                 235                 240 


Leu Ala Gln Asp Val Ala His Leu Leu Leu Arg Leu Gly Ile Thr Ser 
                245                 250                 255     


Gln Leu Arg Ser Arg Gly Pro Arg Ala His Glu Val Leu Ile Ser Gly 
            260                 265                 270         


Arg Glu Asp Ile Leu Arg Phe Ala Glu Leu Ile Gly Pro Tyr Leu Leu 
        275                 280                 285             


Gly Ala Lys Arg Glu Arg Leu Ala Ala Leu Glu Ala Glu Ala Arg Arg 
    290                 295                 300                 


Arg Leu Pro Gly Gln Gly Trp His Leu Arg Leu Val Leu Pro Ala Val 
305                 310                 315                 320 


Ala Tyr Arg Val Ser Glu Ala Lys Arg Arg Ser Gly Phe Ser Trp Ser 
                325                 330                 335     


Glu Ala Gly Arg Arg Val Ala Val Ala Gly Ser Cys Leu Ser Ser Gly 
            340                 345                 350         


Leu Asn Leu Lys Leu Pro Arg Arg Tyr Leu Ser Gln His Arg Leu Ser 
        355                 360                 365             


Leu Leu Gly Glu Ala Phe Ala Asp Pro Gly Leu Glu Ala Leu Ala Glu 
    370                 375                 380                 


Gly Gln Val Leu Trp Asp Pro Ile Val Ala Val Glu Pro Ala Gly Lys 
385                 390                 395                 400 


Ala Arg Thr Phe Asp Leu Arg Val Pro Pro Phe Ala Asn Phe Val Ser 
                405                 410                 415     


Glu Asp Leu Val Val His Asn 
            420             


<210>  25
<211>  423
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, Tth T134-180 Intein Sequence

<400>  25

Cys Leu Ala Glu Gly Ser Leu Val Leu Asp Ala Ala Thr Gly Gln Arg 
1               5                   10                  15      


Val Pro Ile Glu Lys Val Arg Pro Gly Met Glu Val Phe Ser Leu Gly 
            20                  25                  30          


Pro Asp Tyr Arg Leu Tyr Arg Val Pro Val Leu Glu Val Leu Glu Ser 
        35                  40                  45              


Gly Val Arg Glu Val Val Arg Leu Arg Thr Arg Ser Gly Arg Thr Leu 
    50                  55                  60                  


Val Leu Thr Pro Asp His Pro Leu Leu Thr Pro Glu Gly Trp Lys Pro 
65                  70                  75                  80  


Leu Cys Asp Leu Pro Leu Gly Thr Pro Ile Ala Val Pro Ala Glu Leu 
                85                  90                  95      


Pro Val Ala Gly His Leu Ala Pro Pro Glu Glu Arg Val Thr Leu Leu 
            100                 105                 110         


Ala Leu Leu Leu Gly Asp Gly Asn Thr Lys Leu Ser Gly Arg Arg Gly 
        115                 120                 125             


Thr Arg Pro Asn Ala Phe Phe Tyr Ser Lys Asp Pro Glu Leu Leu Ala 
    130                 135                 140                 


Ala Tyr Arg Arg Cys Ala Glu Ala Leu Gly Ala Lys Val Lys Ala Tyr 
145                 150                 155                 160 


Val His Pro Thr Thr Gly Val Val Thr Leu Ala Thr Leu Ala Pro Arg 
                165                 170                 175     


Pro Gly Ala Gln Asp Pro Val Lys Arg Leu Val Val Glu Ala Gly Met 
            180                 185                 190         


Val Ala Lys Ala Glu Glu Lys Arg Val Pro Glu Glu Val Phe Arg Tyr 
        195                 200                 205             


Arg Arg Glu Ala Leu Ala Leu Phe Leu Gly Arg Leu Phe Ser Thr Asp 
    210                 215                 220                 


Gly Ser Val Glu Lys Lys Arg Ile Ser Tyr Ser Ser Ala Ser Leu Gly 
225                 230                 235                 240 


Leu Ala Gln Asp Val Ala His Leu Leu Leu Arg Leu Gly Ile Thr Ser 
                245                 250                 255     


Gln Leu Arg Ser Arg Gly Pro Arg Ala His Glu Val Leu Ile Ser Gly 
            260                 265                 270         


Arg Glu Asp Ile Leu Arg Phe Ala Glu Leu Ile Gly Pro Tyr Leu Leu 
        275                 280                 285             


Gly Ala Lys Arg Glu Arg Leu Ala Ala Leu Glu Ala Glu Ala Arg Arg 
    290                 295                 300                 


Arg Leu Pro Gly Gln Gly Trp His Leu Arg Leu Val Leu Pro Ala Val 
305                 310                 315                 320 


Ala Tyr Arg Val Ser Glu Ala Lys Arg Arg Ser Gly Phe Ser Trp Ser 
                325                 330                 335     


Glu Ala Gly Arg Arg Val Ala Val Ala Gly Ser Cys Leu Ser Ser Gly 
            340                 345                 350         


Leu Asn Leu Lys Leu Pro Arg Arg Tyr Leu Ser Arg His Arg Leu Ser 
        355                 360                 365             


Leu Leu Gly Glu Ala Phe Ala Asp Pro Gly Leu Glu Ala Leu Ala Glu 
    370                 375                 380                 


Gly Gln Val Leu Trp Asp Pro Ile Val Ala Val Glu Pro Ala Gly Lys 
385                 390                 395                 400 


Ala Arg Thr Phe Asp Leu Arg Val Pro Pro Phe Ala Asn Phe Val Ser 
                405                 410                 415     


Glu Asp Leu Val Val His Asn 
            420             


<210>  26
<211>  423
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, Tth T134-100-65

<400>  26

Cys Leu Ala Glu Gly Ser Leu Val Leu Asp Ala Ala Thr Gly Gln Arg 
1               5                   10                  15      


Val Pro Ile Glu Lys Val Arg Pro Gly Met Glu Val Phe Ser Leu Gly 
            20                  25                  30          


Pro Asp Tyr Arg Leu Tyr Arg Val Pro Val Leu Glu Val Leu Glu Ser 
        35                  40                  45              


Gly Val Arg Glu Val Val Arg Leu Arg Thr Arg Ser Gly Arg Thr Leu 
    50                  55                  60                  


Val Leu Thr Pro Asp His Pro Leu Leu Thr Pro Glu Gly Trp Lys Pro 
65                  70                  75                  80  


Leu Cys Asp Leu Pro Leu Gly Thr Pro Ile Ala Val Pro Ala Glu Leu 
                85                  90                  95      


Pro Val Ala Gly His Leu Ala Pro Pro Glu Glu Arg Val Thr Leu Leu 
            100                 105                 110         


Ala Leu Leu Leu Gly Asp Gly Asn Thr Lys Leu Ser Gly Arg Arg Gly 
        115                 120                 125             


Thr Arg Pro Asn Ala Phe Phe Tyr Ser Lys Asn Pro Glu Leu Leu Ala 
    130                 135                 140                 


Ala Tyr Arg Arg Cys Ala Glu Ala Leu Gly Ala Lys Val Lys Ala Tyr 
145                 150                 155                 160 


Val His Pro Thr Thr Gly Val Val Thr Leu Ala Thr Leu Ala Pro Arg 
                165                 170                 175     


Pro Gly Ala Gln Asp Pro Val Lys Arg Leu Val Val Glu Ala Gly Met 
            180                 185                 190         


Val Ala Lys Ala Glu Glu Lys Arg Val Pro Glu Glu Val Phe Arg Tyr 
        195                 200                 205             


Arg Arg Glu Ala Leu Ala Leu Phe Leu Gly Arg Leu Phe Ser Thr Asp 
    210                 215                 220                 


Gly Ser Val Glu Lys Lys Arg Ile Ser Tyr Ser Ser Ala Ser Leu Gly 
225                 230                 235                 240 


Leu Ala Gln Asp Val Ala His Leu Leu Leu Arg Leu Gly Ile Thr Ser 
                245                 250                 255     


Gln Leu Arg Ser Arg Gly Pro Arg Ala His Glu Val Leu Ile Ser Gly 
            260                 265                 270         


Arg Glu Asp Ile Leu Arg Phe Ala Glu Leu Ile Gly Pro Tyr Leu Leu 
        275                 280                 285             


Gly Ala Lys Arg Glu Arg Leu Ala Ala Leu Glu Ala Glu Ala Arg Arg 
    290                 295                 300                 


Arg Leu Pro Gly Gln Gly Trp His Leu Arg Leu Val Leu Pro Ala Val 
305                 310                 315                 320 


Ala Tyr Arg Val Ser Glu Ala Lys Arg Arg Ser Gly Phe Ser Trp Ser 
                325                 330                 335     


Glu Ala Gly Arg Arg Val Ala Val Ala Gly Ser Cys Leu Ser Ser Gly 
            340                 345                 350         


Leu Asn Leu Lys Leu Pro Arg Arg Tyr Leu Ser Arg His Arg Leu Ser 
        355                 360                 365             


Leu Leu Gly Glu Ala Phe Ala Asp Pro Gly Leu Glu Ala Leu Ala Glu 
    370                 375                 380                 


Gly Gln Val Leu Trp Asp Pro Ile Val Ala Val Glu Pro Ala Gly Lys 
385                 390                 395                 400 


Ala Arg Thr Phe Asp Leu Arg Val Pro Pro Phe Ala Asn Phe Val Ser 
                405                 410                 415     


Glu Asp Leu Val Val His Asn 
            420             


<210>  27
<211>  423
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, Tth T134-100-68

<400>  27

Cys Leu Ala Glu Gly Ser Leu Val Leu Asp Ala Ala Thr Gly Gln Arg 
1               5                   10                  15      


Val Pro Ile Glu Lys Val Arg Pro Gly Met Glu Val Phe Ser Leu Gly 
            20                  25                  30          


Pro Asp Tyr Arg Leu Tyr Arg Val Pro Val Leu Glu Val Leu Glu Ser 
        35                  40                  45              


Gly Val Arg Glu Val Val Arg Leu Arg Thr Arg Ser Glu Arg Thr Leu 
    50                  55                  60                  


Val Leu Thr Pro Asp His Pro Leu Leu Thr Pro Glu Gly Trp Lys Pro 
65                  70                  75                  80  


Leu Cys Asp Leu Pro Leu Gly Thr Pro Ile Ala Val Pro Ala Glu Leu 
                85                  90                  95      


Pro Val Ala Gly His Leu Ala Pro Pro Glu Glu Arg Val Thr Leu Leu 
            100                 105                 110         


Ala Leu Leu Leu Gly Asp Gly Asn Thr Lys Leu Ser Gly Arg Arg Gly 
        115                 120                 125             


Thr Arg Pro Asn Ala Phe Phe His Ser Lys Asp Pro Glu Leu Leu Ala 
    130                 135                 140                 


Ala Tyr Arg Arg Cys Ala Glu Ala Leu Gly Ala Lys Val Lys Ala Tyr 
145                 150                 155                 160 


Val His Pro Thr Thr Gly Val Val Thr Leu Ala Thr Leu Ala Pro Arg 
                165                 170                 175     


Pro Gly Ala Gln Asp Pro Val Lys Arg Leu Val Val Glu Ala Gly Met 
            180                 185                 190         


Val Ala Lys Ala Glu Glu Lys Arg Val Pro Glu Glu Val Phe Arg Tyr 
        195                 200                 205             


Arg Arg Glu Ala Leu Ala Leu Phe Leu Gly Arg Leu Phe Ser Thr Asp 
    210                 215                 220                 


Gly Ser Val Glu Lys Lys Arg Ile Ser Tyr Ser Ser Ala Ser Leu Gly 
225                 230                 235                 240 


Leu Ala Gln Asp Val Ala His Leu Leu Leu Arg Leu Gly Ile Thr Ser 
                245                 250                 255     


Gln Leu Arg Ser Arg Gly Pro Arg Ala His Glu Val Leu Ile Ser Gly 
            260                 265                 270         


Arg Glu Asp Ile Leu Arg Phe Ala Glu Leu Ile Gly Pro Tyr Leu Leu 
        275                 280                 285             


Gly Ala Lys Arg Glu Arg Leu Ala Ala Leu Glu Ala Glu Ala Arg Arg 
    290                 295                 300                 


Arg Leu Pro Val Gln Gly Trp His Ser Arg Leu Val Leu Pro Ala Val 
305                 310                 315                 320 


Ala Tyr Arg Val Ser Glu Ala Lys Arg Arg Ser Gly Phe Ser Trp Ser 
                325                 330                 335     


Glu Ala Gly Arg Arg Val Ala Val Ala Gly Ser Cys Leu Ser Ser Gly 
            340                 345                 350         


Leu Asn Leu Lys Leu Pro Arg Arg Tyr Leu Ser Arg His Arg Leu Ser 
        355                 360                 365             


Leu Leu Gly Glu Ala Phe Ala Asp Pro Gly Leu Glu Ala Leu Ala Glu 
    370                 375                 380                 


Gly Gln Val Leu Trp Asp Pro Ile Val Ala Val Glu Pro Ala Gly Lys 
385                 390                 395                 400 


Ala Arg Thr Phe Asp Leu Arg Val Pro Pro Phe Ala Asn Phe Val Ser 
                405                 410                 415     


Glu Asp Leu Val Val His Asn 
            420             


<210>  28
<211>  423
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, Tth T134-100-39

<400>  28

Cys Leu Ala Glu Gly Ser Leu Val Leu Asp Ala Ala Thr Gly Gln Arg 
1               5                   10                  15      


Val Pro Ile Glu Lys Val Arg Pro Gly Met Glu Val Phe Ser Leu Gly 
            20                  25                  30          


Pro Asp Tyr Arg Leu Tyr Arg Val Pro Val Leu Glu Val Leu Glu Ser 
        35                  40                  45              


Gly Val Arg Glu Val Val Arg Leu Arg Thr Arg Ser Gly Arg Thr Leu 
    50                  55                  60                  


Val Leu Thr Pro Asp His Pro Leu Leu Thr Pro Glu Gly Trp Lys Pro 
65                  70                  75                  80  


Leu Cys Asp Leu Pro Leu Gly Thr Pro Ile Ala Val Pro Ala Glu Leu 
                85                  90                  95      


Pro Val Ala Gly His Leu Ala Pro Pro Glu Glu Arg Val Thr Leu Leu 
            100                 105                 110         


Ala Leu Leu Leu Gly Asp Gly Asn Thr Lys Leu Ser Gly Arg Arg Gly 
        115                 120                 125             


Thr Arg Pro Asn Ala Phe Phe Tyr Ser Lys Asp Pro Glu Leu Leu Ala 
    130                 135                 140                 


Ala Tyr Arg Arg Cys Ala Glu Ala Leu Gly Ala Lys Val Lys Ala Tyr 
145                 150                 155                 160 


Val His Pro Thr Thr Gly Val Val Thr Leu Ala Thr Leu Ala Pro Arg 
                165                 170                 175     


Pro Gly Ala Gln Asp Pro Val Lys Arg Leu Val Val Glu Ala Gly Met 
            180                 185                 190         


Val Ala Lys Ala Glu Glu Lys Arg Val Pro Glu Glu Val Phe Arg Tyr 
        195                 200                 205             


Arg Arg Glu Ala Leu Ala Leu Phe Leu Gly Arg Leu Phe Ser Thr Asp 
    210                 215                 220                 


Gly Ser Val Glu Lys Lys Arg Ile Ser Tyr Ser Ser Ala Ser Leu Gly 
225                 230                 235                 240 


Leu Ala Gln Asp Val Ala His Leu Leu Leu Arg Leu Gly Ile Thr Ser 
                245                 250                 255     


Gln Leu Arg Ser Arg Gly Pro Arg Ala His Glu Val Leu Ile Ser Gly 
            260                 265                 270         


Arg Glu Asp Ile Leu Arg Phe Ala Glu Leu Ile Gly Pro Tyr Leu Leu 
        275                 280                 285             


Gly Ala Lys Arg Glu Arg Leu Ala Ala Leu Glu Ala Glu Ala Arg Arg 
    290                 295                 300                 


Arg Leu Pro Gly Gln Gly Trp His Leu Arg Leu Val Leu Pro Ala Val 
305                 310                 315                 320 


Ala Tyr Arg Val Ser Glu Ala Lys Arg Arg Ser Gly Phe Ser Trp Ser 
                325                 330                 335     


Glu Ala Gly Arg Arg Val Ala Val Ala Gly Ser Cys Leu Ser Ser Gly 
            340                 345                 350         


Leu Asn Leu Lys Leu Pro Arg Arg Tyr Leu Ser Arg His Arg Leu Ser 
        355                 360                 365             


Leu Leu Gly Glu Ala Phe Ala Asp Pro Gly Leu Glu Ala Leu Ala Glu 
    370                 375                 380                 


Gly Leu Val Leu Trp Asp Pro Ile Val Ala Val Glu Pro Ala Gly Lys 
385                 390                 395                 400 


Ala Arg Thr Phe Asp Leu Arg Val Pro Pro Phe Ala Asn Phe Val Ser 
                405                 410                 415     


Glu Asp Leu Val Val His Asn 
            420             


<210>  29
<211>  423
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, Tth T134-100

<400>  29

Cys Leu Ala Glu Gly Ser Leu Val Leu Asp Ala Ala Thr Gly Gln Arg 
1               5                   10                  15      


Val Pro Ile Glu Lys Val Arg Pro Gly Met Glu Val Phe Ser Leu Gly 
            20                  25                  30          


Pro Asp Tyr Arg Leu Tyr Arg Val Pro Val Leu Glu Val Leu Glu Ser 
        35                  40                  45              


Gly Val Arg Glu Val Val Arg Leu Arg Thr Arg Ser Gly Arg Thr Leu 
    50                  55                  60                  


Val Leu Thr Pro Asp His Pro Leu Leu Thr Pro Glu Gly Trp Lys Pro 
65                  70                  75                  80  


Leu Cys Asp Leu Pro Leu Gly Thr Pro Ile Ala Val Pro Ala Glu Leu 
                85                  90                  95      


Pro Val Ala Gly His Leu Ala Pro Pro Glu Glu Arg Val Thr Leu Leu 
            100                 105                 110         


Ala Leu Leu Leu Gly Asp Gly Asn Thr Lys Leu Ser Gly Arg Arg Gly 
        115                 120                 125             


Thr Arg Pro Asn Ala Phe Phe Tyr Ser Lys Asp Pro Glu Leu Leu Ala 
    130                 135                 140                 


Ala Tyr Arg Arg Cys Ala Glu Ala Leu Gly Ala Lys Val Lys Ala Tyr 
145                 150                 155                 160 


Val His Pro Thr Thr Gly Val Val Thr Leu Ala Thr Leu Ala Pro Arg 
                165                 170                 175     


Pro Gly Ala Gln Asp Pro Val Lys Arg Leu Val Val Glu Ala Gly Met 
            180                 185                 190         


Val Ala Lys Ala Glu Glu Lys Arg Val Pro Glu Glu Val Phe Arg Tyr 
        195                 200                 205             


Arg Arg Glu Ala Leu Ala Leu Phe Leu Gly Arg Leu Phe Ser Thr Asp 
    210                 215                 220                 


Gly Ser Val Glu Lys Lys Arg Ile Ser Tyr Ser Ser Ala Ser Leu Gly 
225                 230                 235                 240 


Leu Ala Gln Asp Val Ala His Leu Leu Leu Arg Leu Gly Ile Thr Ser 
                245                 250                 255     


Gln Leu Arg Ser Arg Gly Pro Arg Ala His Glu Val Leu Ile Ser Gly 
            260                 265                 270         


Arg Glu Asp Ile Leu Arg Phe Ala Glu Leu Ile Gly Pro Tyr Leu Leu 
        275                 280                 285             


Gly Ala Lys Arg Glu Arg Leu Ala Ala Leu Glu Ala Glu Ala Arg Arg 
    290                 295                 300                 


Arg Leu Pro Gly Gln Gly Trp His Leu Arg Leu Val Leu Pro Ala Val 
305                 310                 315                 320 


Ala Tyr Arg Val Ser Glu Ala Lys Arg Arg Ser Gly Phe Ser Trp Ser 
                325                 330                 335     


Glu Ala Gly Arg Arg Val Ala Val Ala Gly Ser Cys Leu Ser Ser Gly 
            340                 345                 350         


Leu Asn Leu Lys Leu Pro Arg Arg Tyr Leu Ser Arg His Arg Leu Ser 
        355                 360                 365             


Leu Leu Gly Glu Ala Phe Ala Asp Pro Gly Leu Glu Ala Leu Ala Glu 
    370                 375                 380                 


Gly Gln Val Leu Trp Asp Pro Ile Val Ala Val Glu Pro Ala Gly Lys 
385                 390                 395                 400 


Ala Arg Thr Phe Asp Leu Arg Val Pro Pro Phe Ala Asn Phe Val Ser 
                405                 410                 415     


Glu Asp Leu Val Val His Asn 
            420             


<210>  30
<211>  423
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, Tth S158-30-m79-110

<400>  30

Cys Leu Ala Glu Gly Ser Leu Val Leu Asp Ala Ala Thr Gly Gln Arg 
1               5                   10                  15      


Val Pro Ile Glu Lys Val Arg Pro Gly Met Glu Val Phe Ser Leu Gly 
            20                  25                  30          


Pro Asp Tyr Arg Leu Tyr Gln Val Pro Val Leu Glu Val Leu Glu Ser 
        35                  40                  45              


Gly Val Gly Glu Val Val Arg Leu Arg Thr Arg Ser Gly Arg Thr Leu 
    50                  55                  60                  


Val Leu Thr Pro Asp His Pro Leu Leu Thr Pro Glu Gly Trp Lys Pro 
65                  70                  75                  80  


Leu Cys Asp Leu Pro Leu Gly Thr Pro Ile Ala Val Pro Ala Glu Leu 
                85                  90                  95      


Pro Val Ala Gly His Leu Ala Pro Pro Glu Glu Arg Val Thr Pro Leu 
            100                 105                 110         


Ala Leu Leu Leu Gly Asp Gly Asn Thr Lys Leu Ser Gly Arg Arg Gly 
        115                 120                 125             


Thr Arg Pro Asn Ala Phe Phe Tyr Cys Lys Asp Pro Glu Leu Leu Ala 
    130                 135                 140                 


Ala Tyr Arg Arg Cys Ala Glu Ala Leu Gly Ala Lys Val Lys Ala Tyr 
145                 150                 155                 160 


Val His Pro Thr Thr Gly Val Val Thr Leu Ala Thr Leu Ala Pro Arg 
                165                 170                 175     


Pro Gly Ala Gln Asp Pro Val Lys Arg Leu Val Val Glu Ala Gly Met 
            180                 185                 190         


Val Ala Lys Ala Glu Glu Lys Arg Val Pro Glu Glu Val Phe Arg Tyr 
        195                 200                 205             


Arg Arg Glu Ala Leu Ala Leu Phe Leu Gly Arg Leu Phe Ser Thr Asp 
    210                 215                 220                 


Gly Ser Val Glu Lys Lys Arg Ile Ser Tyr Ser Ser Ala Ser Leu Gly 
225                 230                 235                 240 


Leu Ala Gln Asp Val Ala His Leu Leu Leu Arg Leu Gly Ile Thr Ser 
                245                 250                 255     


Gln Leu Arg Ser Arg Gly Pro Arg Ala His Glu Val Leu Ile Ser Gly 
            260                 265                 270         


Arg Glu Asp Ile Leu Arg Phe Ala Glu Leu Ile Gly Pro Tyr Leu Leu 
        275                 280                 285             


Gly Ala Lys Arg Glu Arg Leu Ala Ala Leu Glu Ala Glu Ala Arg Arg 
    290                 295                 300                 


Arg Leu Pro Gly Gln Gly Trp His Leu Arg Leu Val Leu Pro Ala Val 
305                 310                 315                 320 


Ala Tyr Arg Val Ser Glu Ala Lys Arg Arg Ser Gly Phe Ser Trp Ser 
                325                 330                 335     


Glu Ala Gly Arg Arg Val Ala Val Ala Gly Ser Cys Leu Ser Ser Gly 
            340                 345                 350         


Leu Asn Leu Lys Leu Pro Arg Arg Tyr Leu Ser Arg His Arg Leu Ser 
        355                 360                 365             


Met Leu Gly Glu Ala Phe Ala Asp Pro Gly Leu Glu Ala Leu Ala Glu 
    370                 375                 380                 


Gly Gln Val Leu Trp Asp Pro Ile Val Ala Val Glu Pro Ala Gly Lys 
385                 390                 395                 400 


Ala Arg Thr Phe Asp Leu Arg Val Pro Pro Phe Ala Asn Phe Ala Ser 
                405                 410                 415     


Glu Asp Leu Val Val His Asn 
            420             


<210>  31
<211>  1014
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, XynB (P77853) maize codon-optimized sequence
       based on Dictyoglomus thermophilum sequence

<400>  31
atgcaaacaa gcattactct gacatccaac gcatccggta cgtttgacgg ttactattac       60

gaactctgga aggatactgg caatacaaca atgacggtct acactcaagg tcgcttttcc      120

tgccagtggt cgaacatcaa taacgcgttg tttaggaccg ggaagaaata caaccagaat      180

tggcagtctc ttggcacaat ccggatcacg tactctgcga cttacaaccc aaacgggaac      240

tcctacttgt gtatctatgg ctggtctacc aacccattgg tcgagttcta catcgttgag      300

tcctggggga actggagacc gcctggtgcc acgtccctgg gccaagtgac aatcgatggc      360

gggacctacg acatctatag gacgacacgc gtcaaccagc cttccattgt ggggacagcc      420

acgttcgatc agtactggag cgtgcgcacc tctaagcgga cttcaggaac agtgaccgtg      480

accgatcact tccgcgcctg ggcgaaccgg ggcctgaacc tcggcacaat agaccaaatt      540

acattgtgcg tggagggtta ccaaagctct ggatcagcca acatcaccca gaacaccttc      600

tctcagggct cttcttccgg cagttcgggt ggctcatccg gctccacaac gactactcgc      660

atcgagtgtg agaacatgtc cttgtccgga ccctacgtta gcaggatcac caatcccttt      720

aatggtattg cgctgtacgc caacggagac acagcccgcg ctaccgttaa cttccccgca      780

agtcgcaact acaatttccg cctgcggggt tgcggcaaca acaataatct tgcccgtgtg      840

gacctgagga tcgacggacg gaccgtcggg accttttatt accagggcac atacccctgg      900

gaggccccaa ttgacaatgt ttatgtcagt gcggggagtc atacagtcga aatcactgtt      960

actgcggata acggcacatg ggacgtgtat gccgactacc tggtgataca gtga           1014


<210>  32
<211>  1269
<212>  DNA
<213>  Thermus thermophilus

<400>  32
tgcctggccg agggctcgct cgtcttggac gcggctaccg ggcagagggt ccctatcgaa       60

aaggtgcgtc cggggatgga agttttctcc ttgggacctg attacagact gtatcgggtg      120

cccgttttgg aggtccttga gagcggggtt agggaagttg tgcgcctcag aactcggtca      180

gggagaacgc tggtgttgac accagatcac ccgcttttga cccccgaagg ttggaaacct      240

ctttgtgacc tcccgcttgg aactccaatt gcagtccccg cagaactgcc tgtggcgggc      300

cacttggccc cacctgaaga acgtgttacg ctcctggctc ttctgttggg ggatgggaac      360

acaaagctgt cgggtcggag aggtacacgt cctaatgcct tcttctacag caaagacccc      420

gaattgctcg cggcttatcg ccggtgtgca gaagccttgg gtgcaaaggt gaaagcatac      480

gtccacccga ctacgggggt ggttacactc gcaaccctcg ctccacgtcc tggagctcaa      540

gatcctgtca aacgcctcgt tgtcgaggcg ggaatggttg ctaaagccga agagaagagg      600

gtcccggagg aggtgtttcg ttaccggcgt gaggcgttgg cccttttctt gggccgtttg      660

ttctcgacag acggctctgt tgaaaagaag aggatctctt attcaagtgc cagtttggga      720

ctggcccagg atgtcgcaca tctcttgctg cgccttggaa ttacatctca actccgttcg      780

agagggccac gggctcacga ggttcttata tcgggccgcg aggatatttt gcggtttgct      840

gaacttatcg gaccctacct cttgggggcc aagagggaga gacttgcagc gctggaagct      900

gaggcccgca ggcgtttgcc tggacaggga tggcacttgc ggcttgttct tcctgccgtg      960

gcgtacagag tgagcgaggc taaaaggcgc tcgggatttt cgtggagtga agccggtcgg     1020

cgcgtcgcag ttgcgggatc gtgtttgtca tctggactca acctcaaatt gcccagacgc     1080

tacctttctc ggcaccggtt gtcgctgctc ggtgaggctt ttgccgaccc tgggctggaa     1140

gcgctcgcgg aaggccaagt gctctgggac cctattgttg ctgtcgaacc ggccggtaag     1200

gcgagaacat tcgacttgcg cgttccaccc tttgcaaact tcgtgagcga ggacctggtg     1260

gtgcataac                                                             1269


<210>  33
<211>  75
<212>  DNA
<213>  Hordeum vulgare

<400>  33
atggcgaaca aacatttgtc cctctccctc ttcctcgtcc tccttggcct gtcggccagc       60

ttggcctccg ggcaa                                                        75


<210>  34
<211>  423
<212>  PRT
<213>  Thermus thermophilus

<400>  34

Cys Leu Ala Glu Gly Ser Leu Val Leu Asp Ala Ala Thr Gly Gln Arg 
1               5                   10                  15      


Val Pro Ile Glu Lys Val Arg Pro Gly Met Glu Val Phe Ser Leu Gly 
            20                  25                  30          


Pro Asp Tyr Arg Leu Tyr Arg Val Pro Val Leu Glu Val Leu Glu Ser 
        35                  40                  45              


Gly Val Arg Glu Val Val Arg Leu Arg Thr Arg Ser Gly Arg Thr Leu 
    50                  55                  60                  


Val Leu Thr Pro Asp His Pro Leu Leu Thr Pro Glu Gly Trp Lys Pro 
65                  70                  75                  80  


Leu Cys Asp Leu Pro Leu Gly Thr Pro Ile Ala Val Pro Ala Glu Leu 
                85                  90                  95      


Pro Val Ala Gly His Leu Ala Pro Pro Glu Glu Arg Val Thr Leu Leu 
            100                 105                 110         


Ala Leu Leu Leu Gly Asp Gly Asn Thr Lys Leu Ser Gly Arg Arg Gly 
        115                 120                 125             


Thr Arg Pro Asn Ala Phe Phe Tyr Ser Lys Asp Pro Glu Leu Leu Ala 
    130                 135                 140                 


Ala Tyr Arg Arg Cys Ala Glu Ala Leu Gly Ala Lys Val Lys Ala Tyr 
145                 150                 155                 160 


Val His Pro Thr Thr Gly Val Val Thr Leu Ala Thr Leu Ala Pro Arg 
                165                 170                 175     


Pro Gly Ala Gln Asp Pro Val Lys Arg Leu Val Val Glu Ala Gly Met 
            180                 185                 190         


Val Ala Lys Ala Glu Glu Lys Arg Val Pro Glu Glu Val Phe Arg Tyr 
        195                 200                 205             


Arg Arg Glu Ala Leu Ala Leu Phe Leu Gly Arg Leu Phe Ser Thr Asp 
    210                 215                 220                 


Gly Ser Val Glu Lys Lys Arg Ile Ser Tyr Ser Ser Ala Ser Leu Gly 
225                 230                 235                 240 


Leu Ala Gln Asp Val Ala His Leu Leu Leu Arg Leu Gly Ile Thr Ser 
                245                 250                 255     


Gln Leu Arg Ser Arg Gly Pro Arg Ala His Glu Val Leu Ile Ser Gly 
            260                 265                 270         


Arg Glu Asp Ile Leu Arg Phe Ala Glu Leu Ile Gly Pro Tyr Leu Leu 
        275                 280                 285             


Gly Ala Lys Arg Glu Arg Leu Ala Ala Leu Glu Ala Glu Ala Arg Arg 
    290                 295                 300                 


Arg Leu Pro Gly Gln Gly Trp His Leu Arg Leu Val Leu Pro Ala Val 
305                 310                 315                 320 


Ala Tyr Arg Val Ser Glu Ala Lys Arg Arg Ser Gly Phe Ser Trp Ser 
                325                 330                 335     


Glu Ala Gly Arg Arg Val Ala Val Ala Gly Ser Cys Leu Ser Ser Gly 
            340                 345                 350         


Leu Asn Leu Lys Leu Pro Arg Arg Tyr Leu Ser Arg His Arg Leu Ser 
        355                 360                 365             


Leu Leu Gly Glu Ala Phe Ala Asp Pro Gly Leu Glu Ala Leu Ala Glu 
    370                 375                 380                 


Gly Gln Val Leu Trp Asp Pro Ile Val Ala Val Glu Pro Ala Gly Lys 
385                 390                 395                 400 


Ala Arg Thr Phe Asp Leu Arg Val Pro Pro Phe Ala Asn Phe Val Ser 
                405                 410                 415     


Glu Asp Leu Val Val His Asn 
            420             


<210>  35
<211>  2349
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, S158-19

<400>  35
atgttcctta agaaactgtc taagttgctg ctcgtcgtgc tccttgttgc cgtttacaca       60

caggtcaacg cgcaaacaag cattactctg acatccaacg catccggtac gtttgacggt      120

tactattacg aactctggaa ggatactggc aatacaacaa tgacggtcta cactcaaggt      180

cgcttttcct gccagtggtc gaacatcaat aacgcgttgt ttaggaccgg gaagaaatac      240

aaccagaatt ggcagtctct tggcacaatc cggatcacgt actctgcgac ttacaaccca      300

aacgggaact cctacttgtg tatctatggc tggtctacca acccattggt cgagttctac      360

atcgttgagt cctgggggaa ctggagaccg cctggtgcca cgtccctggg ccaagtgaca      420

atcgatggcg ggacctacga catctatagg acgacacgcg tcaaccagcc ttgcctggcc      480

gagggctcgc tcgtcttgga cgcggctacc gggcagaggg tccctatcga aaaggtgcgt      540

ccggggatgg aagttttctc cttgggacct gattacagac tgtatcgggt gcccgttttg      600

gaggtccttg agagcggggt tggggaagtt gtgcgcctca gaactcggtc agggagaacg      660

ctggtgttga caccagatca cccgcttttg acccccgaag gttggaaacc tctttgtgac      720

ctcccgcttg gaactccaat tgcagtcccc gcagaactgc ctgtggcggg ccacttggcc      780

ccacctgaag aacgtgttac gctcctggct cttctgttgg gggatgggaa cacaaagctg      840

tcgggtcgga gaggtacacg tcctattgcc ttcttctaca gcaaagaccc cgaattgctc      900

gcggcttatc gccggtgtgc agaagccttg ggtgcaaagg tgaaagcata cgtccacccg      960

actacggggg tggttacact cgcaaccctc gctccacgtc ctggagctca agatcctgtc     1020

aaacgcctcg ttgtcgaggc gggaatggtt gctaaagccg aagagaagag ggtcccggag     1080

gaggtgtttc gttaccggcg tgaggcgttg gcccttttct tgggccgttt gttctcgaca     1140

gacggctctg ttgaaaagaa gaggatctct tattcaagtg ccagtttggg actggcccag     1200

gatgtcgcac atctcttgct gcgccttgga attacatctc aactccgttc gagagggcca     1260

cgggctcacg aggttcttat atcgggccgc gaggatattt tgcggtttgc tgaacttatc     1320

ggaccctacc tcttgggggc caagagggag agacttgcag cgctggaagc tgaggcccgc     1380

aggcgtttgc ctggacaggg atggcacttg cggcttgttc ttcctgccgt ggcgtacaga     1440

gtgagcgagg ctaaaaggcg ctcgggattt tcgtggagtg aagccggtcg gcgcgtcgca     1500

gttgcgggat cgtgtttgtc atctggactc aacctcaaat tgcccagacg ctacctttct     1560

cggcaccggt tgtcgctgct cggtgaggct tttgccgacc ctgggctgga agcgctcgcg     1620

gaaggccaag tgctctggga ccctattgtt gctgtcgaac cggccggtaa ggcgagaaca     1680

ttcgacttgc gcgttccacc ctttgcaaac ttcgtgagcg aggacctggt ggtgcataac     1740

tccattgtgg ggacagccac gttcgatcag tactggagcg tgcgcacctc taagcggact     1800

tcaggaacag tgaccgtgac cgatcacttc cgcgcctggg cgaaccgggg cctgaacctc     1860

ggcacaatag accaaattac attgtgcgtg gagggttacc aaagctctgg atcagccaac     1920

atcacccaga acaccttctc tcagggctct tcttccggca gttcgggtgg ctcatccggc     1980

tccacaacga ctactcgcat cgagtgtgag aacatgtcct tgtccggacc ctacgttagc     2040

aggatcacca atccctttaa tggtattgcg ctgtacgcca acggagacac agcccgcgct     2100

accgttaact tccccgcaag tcgcaactac aatttccgcc tgcggggttg cggcaacaac     2160

aataatcttg cccgtgtgga cctgaggatc gacggacgga ccgtcgggac cttttattac     2220

cagggcacat acccctggga ggccccaatt gacaatgttt atgtcagtgc ggggagtcat     2280

acagtcgaaa tcactgttac tgcggataac ggcacatggg acgtgtatgc cgactacctg     2340

gtgatacag                                                             2349


<210>  36
<211>  2349
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, S158-3103

<400>  36
atgttcctta agaaactgtc taagttgctg ctcgtcgtgc tccttgttgc cgtttacaca       60

caggtcaacg cgcaaacaag cattactctg acatccaacg catccggtac gtttgacggt      120

tactattacg aactctggaa ggatactggc aatacaacaa tgacggtcta cactcaaggt      180

cgcttttcct gccagtggtc gaacatcaat aacgcgttgt ttaggaccgg gaagaaatac      240

aaccagaatt ggcagtctct tggcacaatc cggatcacgt actctgcgac ttacaaccca      300

aacgggaact cctacttgtg tatctatggc tggtctacca acccattggt cgagttctac      360

atcgttgagt cctgggggaa ctggagaccg cctggtgcca cgtccctggg ccaagtgaca      420

atcgatggcg ggacctacga catctatagg acgacacgcg tcaaccagcc ttgcctggcc      480

gagggctcgc tcgtcttgga cgcggctacc gggcagaggg tccctatcga aaaggtgcgt      540

ccggggatgg aagttttctc cttgggacct gattacagac tgtatcgggt gcccgttttg      600

gaggtccttg agagcggggt tggggaagtt gtgcgcctca gaactcggtc agggagaacg      660

ctggtgttga caccagatca cccgcttttg acccccgaag gttggaaacc tctttgtgac      720

ctcccgcttg gaactccaat tgcagtcccc gcagaactgc ctgtggcggg ccacttggcc      780

ccacctgaag aacgtgttac gctcctggct cttctgttgg gggatgggaa cacaaagctg      840

tcgggtcgga gaggtacacg tcctaatgcc ttcttctaca gcaaagaccc cgaattgctc      900

gcggcttatc gccggtgtgc agaagccttg ggtgcaaagg tgaaagcata cgtccacccg      960

actacggggg tggttacact cgcaaccctc gctccacgtc ctggagctca agatcctgtc     1020

aaacgcctcg ttgtcgaggc gggaatggtt gctaaagccg aagagaagag ggtcccggag     1080

gaggtgtttc gttaccggcg tgaggcgttg gcccttttct tgggccgttt gttctcgaca     1140

gacggctctg ttgaaaagaa gaggatctct tattcaagtg ccagtttggg actggcccag     1200

gatgtcgcac atctcttgct gcgccttgga attacatctc aactccgttc gagagggcca     1260

cgggctcacg aggttcttat atcgggccgc gaggatattt tgcggtttgc tgaacttatc     1320

ggaccctacc tcttgggggc caagagggag agacttgcag cgctggaagc tgaggcccgc     1380

aggcgtttgc ctggacaggg atggcacttg cggcttgttc ttcctgccgt ggcgtacaga     1440

gtgagcgagg ctaaaaggcg ctcgggattt tcgtggagtg aagccggtcg gcgcgtcgca     1500

gttgcgggat cgtgtttgtc atctggactc aacctcaaat tgcccagacg ctacctttct     1560

cggcaccggt tgtcgatgct cggtgaggct tttgccgacc ctgggctgga agcgctcgcg     1620

gaaggccaag tgctctggga ccctattgtt gctgtcgaac cggccggtaa ggcgagaaca     1680

ttcgacttgc gcgttccacc ctttgcaaac ttcgtgagcg aggacctggt ggtgcataac     1740

tccattgtgg ggacagccac gttcgatcag tactggagcg tgcgcacctc taagcggact     1800

tcaggaacag tgaccgtgac cgatcacttc cgcgcctggg cgaaccgggg cctgaacctc     1860

ggcacaatag accaaattac attgtgcgtg gagggttacc aaagctctgg atcagccaac     1920

atcacccaga acaccttctc tcagggctct tcttccggca gttcgggtgg ctcatccggc     1980

tccacaacga ctgctcgcat cgagtgtgag aacatgtcct tgtccggtcc ctacgttagc     2040

aggatcacca atccctttaa tggtattgcg ctgtacgcca acggagacac agcccgcgct     2100

accgttaact tccccgcaag tcgcaactac aatttccgcc tgcggggttg cggcaacaac     2160

aataatcttg cccgtgtgga cctgaggatc gacggacgga ccgtcgggac cttttattac     2220

cagggcacat acccctggga ggccccaatt gacaatgttt atgtcagtgc ggggagtcat     2280

acagtcgaaa tcactgttac tgcggataac ggcacatggg acgtgtatgc cgactacctg     2340

gtgatacag                                                             2349


<210>  37
<211>  2349
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, S158-3108

<400>  37
atgttcctta agaaactgtc taagttgctg ctcgtcgtgc tccttgttgc cgtttacaca       60

caggtcaacg cgcaaacaag cattactctg acatccaacg catccggtac gtttgacggt      120

tactattacg aactctggaa ggatactggc aatacaacaa tgacggtcta cactcaaggt      180

cgcttttcct gccagtggtc gaacatcaat aacgcgttgt ttaggaccgg gaagaaatac      240

aaccagaatt ggcagtctct tggcacaatc cggatcacgt actctgcgac ttacaaccca      300

aacgggaact cctacttgtg tatctatggc tggtctacca acccattggt cgagttctac      360

atcgttgagt cctgggggaa ctggagaccg cctggtgcca cgtccctggg ccaagtgaca      420

atcgatggcg ggacctacga catctatagg acgacacgcg tcaaccagcc ttgcctggcc      480

gagggctcgc tcgtcttgga cgcggctacc gggcagaggg tccctatcga aaaggtgcgt      540

ccggggatgg aagttttctc cttgggacct gattacagac tgtatcgggt gcccgttttg      600

gaggtccttg agagcggggt tggggaagtt gtgcgcctca gaactcggtc agggagaacg      660

ctggtgttga caccagatca cccgcttttg acccccgaag gttggaaacc tctttgtgac      720

ctcccgcttg gaactccaat tgcagtcccc gcagaactgc ctgtggcggg ccacttggcc      780

ccacctgaag aacgtgttac gctcctggct cttctgttgg gggatgggaa cacaaagctg      840

tcgggtcgga gaggtacacg tcctaatgcc ttcttctaca gcaaagaccc cgaattgctc      900

gcggcttatc gccggtgtgc agaagccttg ggtgcaaagg tgaaagcata cgtccacccg      960

actacggggg tggttacact cgcaaccctc gctccacgtc ctggagctca agatcctgtc     1020

aaacgcctcg ttgtcgaggc gggaatggtt gctaaagccg aagagaagag ggtcccggag     1080

gaggtgtttc gttaccggcg tgaggcgttg gcccttttct tgggccgttt gttctcgaca     1140

gacggctctg ttgaaaagaa gaggatctct tattcaagtg ccagtttggg actggcccag     1200

gatgtcgcac atctcttgct gcgccttgga attacatctc aactccgttc gagagggcca     1260

cgggctcaca aggttcttat atcgggccgc gaggatattt tgcggtttgc tgaacttatc     1320

ggaccctacc tcttgggggc caagagggag agacttgcag cgctggaagc tgaggcccgc     1380

aggcgtttgc ctggacaggg atggcacttg cggcttgttc ttcctgccgt ggcgtacaga     1440

gtgagcgagg ctaaaaggcg ctcgggattt tcgtggagtg aagccggtcg gcgcgtcgca     1500

gttgcgggat cgtgtttgtc atctggactc aacctcaaat tgcccagacg ctacctttct     1560

cggcaccggt tgtcgctgct cggtgaggct tttgccgacc ctgggctgga agcgctcgcg     1620

gaaggccaag tgctctggga ccctattgtt gctgtcgaac cggccggtaa ggcgagaaca     1680

ttcgacttgc gcgttccacc ctttgcaaac ttcgtgagcg aggacctggt ggtgcataac     1740

tccattgtgg ggacagccac gttcgatcag tactggagcg tgcgcacctc taagcggact     1800

tcaggaacag tgaccgtgac cgatcacttc cgcgcctggg cgaaccgggg cctgaacctc     1860

ggcacaatag accaaattac attgtgcgtg gagggttacc aaagctctgg atcagccaac     1920

atcacccaga acaccttctc tcagggctct tcttccggca gttcgggtgg ctcatccggc     1980

tccacaacga ctactcgcat cgagtgtgag aacatgtcct tgtccggacc ctacgttagc     2040

aggatcacca atccctttaa tggtattgcg ctgtacgcca acggagacac agcccgcgct     2100

accgttaact tccccgcaag tcgcaactac aatttccgcc tgcggggttg cggcaacaac     2160

aataatcttg cccgtgtgga cctgaggatc gacggacgga ccgtcgggac cttttattac     2220

cagggcacat acccctggga ggccccaatt gacaatgttt atgtcagtgc ggggagtcat     2280

acagtcgaaa tcactgttac tgcggataac ggcacatggg acgtgtatgc cgactacctg     2340

gtgatacag                                                             2349


<210>  38
<211>  2349
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, S158-30

<400>  38
atgttcctta agaaactgtc taagttgctg ctcgtcgtgc tccttgttgc cgtttacaca       60

caggtcaacg cgcaaacaag cattactctg acatccaacg catccggtac gtttgacggt      120

tactattacg aactctggaa ggatactggc aatacaacaa tgacggtcta cactcaaggt      180

cgcttttcct gccagtggtc gaacatcaat aacgcgttgt ttaggaccgg gaagaaatac      240

aaccagaatt ggcagtctct tggcacaatc cggatcacgt actctgcgac ttacaaccca      300

aacgggaact cctacttgtg tatctatggc tggtctacca acccattggt cgagttctac      360

atcgttgagt cctgggggaa ctggagaccg cctggtgcca cgtccctggg ccaagtgaca      420

atcgatggcg ggacctacga catctatagg acgacacgcg tcaaccagcc ttgcctggcc      480

gagggctcgc tcgtcttgga cgcggctacc gggcagaggg tccctatcga aaaggtgcgt      540

ccggggatgg aagttttctc cttgggacct gattacagac tgtatcgggt gcccgttttg      600

gaggtccttg agagcggggt tagggaagtt gtgcgcctca gaactcggtc agggagaacg      660

ctggtgttga caccagatca cccgcttttg acccccgaag gttggaaacc tctttgtgac      720

ctcccgcttg gaactccaat tgcagtcccc gcagaactgc ctgtggcggg ccacttggcc      780

ccacctgaag aacgtgttac gctcctggct cttctgttgg gggatgggaa cacaaagctg      840

tcgggtcgga gaggtacacg tcctaatgcc ttcttctaca gcaaagaccc cgaattgctc      900

gcggcttatc gccggtgtgc agaagccttg ggtgcaaagg tgaaagcata cgtccacccg      960

actacggggg tggttacact cgcaaccctc gctccacgtc ctggagctca agatcctgtc     1020

aaacgcctcg ttgtcgaggc gggaatggtt gctaaagccg aagagaagag ggtcccggag     1080

gaggtgtttc gttaccggcg tgaggcgttg gcccttttct tgggccgttt gttctcgaca     1140

gacggctctg ttgaaaagaa gaggatctct tattcaagtg ccagtttggg actggcccag     1200

gatgtcgcac atctcttgct gcgccttgga attacatctc aactccgttc gagagggcca     1260

cgggctcacg aggttcttat atcgggccgc gaggatattt tgcggtttgc tgaacttatc     1320

ggaccctacc tcttgggggc caagagggag agacttgcag cgctggaagc tgaggcccgc     1380

aggcgtttgc ctggacaggg atggcacttg cggcttgttc ttcctgccgt ggcgtacaga     1440

gtgagcgagg ctaaaaggcg ctcgggattt tcgtggagtg aagccggtcg gcgcgtcgca     1500

gttgcgggat cgtgtttgtc atctggactc aacctcaaat tgcccagacg ctacctttct     1560

cggcaccggt tgtcgatgct cggtgaggct tttgccgacc ctgggctgga agcgctcgcg     1620

gaaggccaag tgctctggga ccctattgtt gctgtcgaac cggccggtaa ggcgagaaca     1680

ttcgacttgc gcgttccacc ctttgcaaac ttcgtgagcg aggacctggt ggtgcataac     1740

tccattgtgg ggacagccac gttcgatcag tactggagcg tgcgcacctc taagcggact     1800

tcaggaacag tgaccgtgac cgatcacttc cgcgcctggg cgaaccgggg cctgaacctc     1860

ggcacaatag accaaattac attgtgcgtg gagggttacc aaagctctgg atcagccaac     1920

atcacccaga acaccttctc tcagggctct tcttccggca gttcgggtgg ctcatccggc     1980

tccacaacga ctactcgcat cgagtgtgag aacatgtcct tgtccggacc ctacgttagc     2040

aggatcacca atccctttaa tggtattgcg ctgtacgcca acggagacac agcccgcgct     2100

accgttaact tccccgcaag tcgcaactac aatttccgcc tgcggggttg cggcaacaac     2160

aataatcttg cccgtgtgga cctgaggatc gacggacgga ccgtcgggac cttttattac     2220

cagggcacat acccctggga ggccccaatt gacaatgttt atgtcagtgc ggggagtcat     2280

acagtcgaaa tcactgttac tgcggataac ggcacatggg acgtgtatgc cgactacctg     2340

gtgatacag                                                             2349


<210>  39
<211>  24
<212>  PRT
<213>  Dictyoglomus thermophilum

<400>  39

Met Phe Leu Lys Lys Leu Ser Lys Leu Leu Leu Val Val Leu Leu Val 
1               5                   10                  15      


Ala Val Tyr Thr Gln Val Asn Ala 
            20                  


<210>  40
<211>  433
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, Tth iXynB clone T134

<400>  40

Arg Pro Pro Gly Ala Cys Leu Ala Glu Gly Ser Leu Val Leu Asp Ala 
1               5                   10                  15      


Ala Thr Gly Gln Arg Val Pro Ile Glu Lys Val Arg Pro Gly Met Glu 
            20                  25                  30          


Val Phe Ser Leu Gly Pro Asp Tyr Arg Leu Tyr Arg Val Pro Val Leu 
        35                  40                  45              


Glu Val Leu Glu Ser Gly Val Arg Glu Val Val Arg Leu Arg Thr Arg 
    50                  55                  60                  


Ser Gly Arg Thr Leu Val Leu Thr Pro Asp His Pro Leu Leu Thr Pro 
65                  70                  75                  80  


Glu Gly Trp Lys Pro Leu Cys Asp Leu Pro Leu Gly Thr Pro Ile Ala 
                85                  90                  95      


Val Pro Ala Glu Leu Pro Val Ala Gly His Leu Ala Pro Pro Glu Glu 
            100                 105                 110         


Arg Val Thr Leu Leu Ala Leu Leu Leu Gly Asp Gly Asn Thr Lys Leu 
        115                 120                 125             


Ser Gly Arg Arg Gly Thr Arg Pro Asn Ala Phe Phe Tyr Ser Lys Asp 
    130                 135                 140                 


Pro Glu Leu Leu Ala Ala Tyr Arg Arg Cys Ala Glu Ala Leu Gly Ala 
145                 150                 155                 160 


Lys Val Lys Ala Tyr Val His Pro Thr Thr Gly Val Val Thr Leu Ala 
                165                 170                 175     


Thr Leu Ala Pro Arg Pro Gly Ala Gln Asp Pro Val Lys Arg Leu Val 
            180                 185                 190         


Val Glu Ala Gly Met Val Ala Lys Ala Glu Glu Lys Arg Val Pro Glu 
        195                 200                 205             


Glu Val Phe Arg Tyr Arg Arg Glu Ala Leu Ala Leu Phe Leu Gly Arg 
    210                 215                 220                 


Leu Phe Ser Thr Asp Gly Ser Val Glu Lys Lys Arg Ile Ser Tyr Ser 
225                 230                 235                 240 


Ser Ala Ser Leu Gly Leu Ala Gln Asp Val Ala His Leu Leu Leu Arg 
                245                 250                 255     


Leu Gly Ile Thr Ser Gln Leu Arg Ser Arg Gly Pro Arg Ala His Glu 
            260                 265                 270         


Val Leu Ile Ser Gly Arg Glu Asp Ile Leu Arg Phe Ala Glu Leu Ile 
        275                 280                 285             


Gly Pro Tyr Leu Leu Gly Ala Lys Arg Glu Arg Leu Ala Ala Leu Glu 
    290                 295                 300                 


Ala Glu Ala Arg Arg Arg Leu Pro Gly Gln Gly Trp His Leu Arg Leu 
305                 310                 315                 320 


Val Leu Pro Ala Val Ala Tyr Arg Val Ser Glu Ala Lys Arg Arg Ser 
                325                 330                 335     


Gly Phe Ser Trp Ser Glu Ala Gly Arg Arg Val Ala Val Ala Gly Ser 
            340                 345                 350         


Cys Leu Ser Ser Gly Leu Asn Leu Lys Leu Pro Arg Arg Tyr Leu Ser 
        355                 360                 365             


Arg His Arg Leu Ser Leu Leu Gly Glu Ala Phe Ala Asp Pro Gly Leu 
    370                 375                 380                 


Glu Ala Leu Ala Glu Gly Gln Val Leu Trp Asp Pro Ile Val Ala Val 
385                 390                 395                 400 


Glu Pro Ala Gly Lys Ala Arg Thr Phe Asp Leu Arg Val Pro Pro Phe 
                405                 410                 415     


Ala Asn Phe Val Ser Glu Asp Leu Val Val His Asn Thr Ser Leu Gly 
            420                 425                 430         


Gln 
    


<210>  41
<211>  433
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, Tth iXynB clone S158

<400>  41

Arg Val Asn Gln Pro Cys Leu Ala Glu Gly Ser Leu Val Leu Asp Ala 
1               5                   10                  15      


Ala Thr Gly Gln Arg Val Pro Ile Glu Lys Val Arg Pro Gly Met Glu 
            20                  25                  30          


Val Phe Ser Leu Gly Pro Asp Tyr Arg Leu Tyr Arg Val Pro Val Leu 
        35                  40                  45              


Glu Val Leu Glu Ser Gly Val Arg Glu Val Val Arg Leu Arg Thr Arg 
    50                  55                  60                  


Ser Gly Arg Thr Leu Val Leu Thr Pro Asp His Pro Leu Leu Thr Pro 
65                  70                  75                  80  


Glu Gly Trp Lys Pro Leu Cys Asp Leu Pro Leu Gly Thr Pro Ile Ala 
                85                  90                  95      


Val Pro Ala Glu Leu Pro Val Ala Gly His Leu Ala Pro Pro Glu Glu 
            100                 105                 110         


Arg Val Thr Leu Leu Ala Leu Leu Leu Gly Asp Gly Asn Thr Lys Leu 
        115                 120                 125             


Ser Gly Arg Arg Gly Thr Arg Pro Asn Ala Phe Phe Tyr Ser Lys Asp 
    130                 135                 140                 


Pro Glu Leu Leu Ala Ala Tyr Arg Arg Cys Ala Glu Ala Leu Gly Ala 
145                 150                 155                 160 


Lys Val Lys Ala Tyr Val His Pro Thr Thr Gly Val Val Thr Leu Ala 
                165                 170                 175     


Thr Leu Ala Pro Arg Pro Gly Ala Gln Asp Pro Val Lys Arg Leu Val 
            180                 185                 190         


Val Glu Ala Gly Met Val Ala Lys Ala Glu Glu Lys Arg Val Pro Glu 
        195                 200                 205             


Glu Val Phe Arg Tyr Arg Arg Glu Ala Leu Ala Leu Phe Leu Gly Arg 
    210                 215                 220                 


Leu Phe Ser Thr Asp Gly Ser Val Glu Lys Lys Arg Ile Ser Tyr Ser 
225                 230                 235                 240 


Ser Ala Ser Leu Gly Leu Ala Gln Asp Val Ala His Leu Leu Leu Arg 
                245                 250                 255     


Leu Gly Ile Thr Ser Gln Leu Arg Ser Arg Gly Pro Arg Ala His Glu 
            260                 265                 270         


Val Leu Ile Ser Gly Arg Glu Asp Ile Leu Arg Phe Ala Glu Leu Ile 
        275                 280                 285             


Gly Pro Tyr Leu Leu Gly Ala Lys Arg Glu Arg Leu Ala Ala Leu Glu 
    290                 295                 300                 


Ala Glu Ala Arg Arg Arg Leu Pro Gly Gln Gly Trp His Leu Arg Leu 
305                 310                 315                 320 


Val Leu Pro Ala Val Ala Tyr Arg Val Ser Glu Ala Lys Arg Arg Ser 
                325                 330                 335     


Gly Phe Ser Trp Ser Glu Ala Gly Arg Arg Val Ala Val Ala Gly Ser 
            340                 345                 350         


Cys Leu Ser Ser Gly Leu Asn Leu Lys Leu Pro Arg Arg Tyr Leu Ser 
        355                 360                 365             


Arg His Arg Leu Ser Leu Leu Gly Glu Ala Phe Ala Asp Pro Gly Leu 
    370                 375                 380                 


Glu Ala Leu Ala Glu Gly Gln Val Leu Trp Asp Pro Ile Val Ala Val 
385                 390                 395                 400 


Glu Pro Ala Gly Lys Ala Arg Thr Phe Asp Leu Arg Val Pro Pro Phe 
                405                 410                 415     


Ala Asn Phe Val Ser Glu Asp Leu Val Val His Asn Ser Ile Val Gly 
            420                 425                 430         


Thr 
    


<210>  42
<211>  57
<212>  PRT
<213>  Thermus thermophilus

<400>  42

Leu Ala Glu Gly Ser Leu Val Leu Asp Ala Ala Thr Gly Gln Arg Val 
1               5                   10                  15      


Pro Ile Glu Lys Val Arg Pro Gly Met Glu Val Phe Ser Leu Gly Pro 
            20                  25                  30          


Asp Tyr Arg Leu Tyr Arg Val Pro Val Leu Glu Val Leu Glu Ser Gly 
        35                  40                  45              


Val Arg Glu Val Val Arg Leu Arg Thr 
    50                  55          


<210>  43
<211>  4
<212>  PRT
<213>  Thermus thermophilus

<400>  43

Leu Ala Glu Gly 
1               


<210>  44
<211>  60
<212>  PRT
<213>  Mycobacterium tuberculosis

<400>  44

Leu Ala Glu Gly Thr Arg Ile Phe Asp Pro Val Thr Gly Thr Thr His 
1               5                   10                  15      


Arg Ile Glu Asp Val Val Asp Gly Arg Lys Pro Ile His Val Val Ala 
            20                  25                  30          


Ala Ala Lys Asp Gly Thr Leu His Ala Arg Pro Val Val Ser Trp Phe 
        35                  40                  45              


Asp Gln Gly Thr Arg Asp Val Ile Gly Leu Arg Ile 
    50                  55                  60  


<210>  45
<211>  60
<212>  PRT
<213>  Thermus thermophilus

<400>  45

Arg Ser Gly Arg Thr Leu Val Leu Thr Pro Asp His Pro Leu Leu Thr 
1               5                   10                  15      


Pro Glu Gly Trp Lys Pro Leu Cys Asp Leu Pro Leu Gly Thr Pro Ile 
            20                  25                  30          


Ala Val Pro Ala Glu Leu Pro Val Ala Gly His Leu Ala Pro Pro Glu 
        35                  40                  45              


Glu Arg Val Thr Leu Leu Ala Leu Leu Leu Gly Asp 
    50                  55                  60  


<210>  46
<211>  4
<212>  PRT
<213>  Thermus thermophilus

<400>  46

Thr Pro Asp His 
1               


<210>  47
<211>  60
<212>  PRT
<213>  Mycobacterium tuberculosis

<400>  47

Ala Gly Gly Ala Ile Leu Trp Ala Thr Pro Asp His Lys Val Leu Thr 
1               5                   10                  15      


Glu Tyr Gly Trp Arg Ala Ala Gly Glu Leu Arg Lys Gly Asp Arg Val 
            20                  25                  30          


Ala Gln Pro Arg Arg Phe Asp Gly Phe Gly Asp Ser Ala Pro Ile Pro 
        35                  40                  45              


Ala Arg Val Gln Ala Leu Ala Asp Ala Leu Asp Asp 
    50                  55                  60  


<210>  48
<211>  53
<212>  PRT
<213>  Thermus thermophilus

<400>  48

Leu Arg Leu Val Leu Pro Ala Val Ala Tyr Arg Val Ser Glu Ala Lys 
1               5                   10                  15      


Arg Arg Ser Gly Phe Ser Trp Ser Glu Ala Gly Arg Arg Val Ala Val 
            20                  25                  30          


Ala Gly Ser Cys Leu Ser Ser Gly Leu Asn Leu Lys Leu Pro Arg Arg 
        35                  40                  45              


Tyr Leu Ser Arg His 
    50              


<210>  49
<211>  52
<212>  PRT
<213>  Mycobacterium tuberculosis

<400>  49

Leu Arg Ile Ala Gly Gly Ala Ile Leu Trp Ala Thr Pro Asp His Lys 
1               5                   10                  15      


Val Leu Thr Glu Tyr Gly Trp Arg Ala Ala Gly Glu Leu Arg Lys Gly 
            20                  25                  30          


Asp Arg Val Ala Gln Pro Arg Arg Phe Asp Gly Phe Gly Asp Ser Ala 
        35                  40                  45              


Pro Ile Pro Ala 
    50          


<210>  50
<211>  57
<212>  PRT
<213>  Thermus thermophilus

<400>  50

Arg Leu Ser Leu Leu Gly Glu Ala Phe Ala Asp Pro Gly Leu Glu Ala 
1               5                   10                  15      


Leu Ala Glu Gly Gln Val Leu Trp Asp Pro Ile Val Ala Val Glu Pro 
            20                  25                  30          


Ala Gly Lys Ala Arg Thr Phe Asp Leu Arg Val Pro Pro Phe Ala Asn 
        35                  40                  45              


Phe Val Ser Glu Asp Leu Val Val His 
    50                  55          


<210>  51
<211>  6
<212>  PRT
<213>  Thermus thermophilus

<400>  51

Ala Arg Thr Phe Asp Leu 
1               5       


<210>  52
<211>  57
<212>  PRT
<213>  Mycobacterium tuberculosis

<400>  52

Arg Val Gln Ala Leu Ala Asp Ala Leu Asp Asp Lys Phe Leu His Asp 
1               5                   10                  15      


Met Leu Ala Glu Glu Leu Arg Tyr Ser Val Ile Arg Glu Val Leu Pro 
            20                  25                  30          


Thr Arg Arg Ala Arg Thr Phe Asp Leu Glu Val Glu Glu Leu His Thr 
        35                  40                  45              


Leu Val Ala Glu Gly Val Val Val His 
    50                  55          


<210>  53
<211>  46
<212>  PRT
<213>  Thermus thermophilus

<400>  53

Arg Ser Gly Arg Thr Leu Val Leu Thr Pro Asp His Pro Leu Leu Thr 
1               5                   10                  15      


Pro Glu Gly Trp Lys Pro Leu Cys Asp Leu Pro Leu Gly Thr Pro Ile 
            20                  25                  30          


Ala Val Pro Ala Glu Leu Pro Val Ala Gly His Leu Ala Asp 
        35                  40                  45      


<210>  54
<211>  46
<212>  PRT
<213>  Thermus thermophilus

<400>  54

Pro Gly Leu Glu Ala Leu Ala Glu Gly Gln Val Leu Trp Asp Pro Ile 
1               5                   10                  15      


Val Ala Val Glu Pro Ala Gly Lys Ala Arg Thr Phe Asp Leu Arg Val 
            20                  25                  30          


Pro Pro Phe Ala Asn Phe Val Ser Glu Asp Leu Val Val His 
        35                  40                  45      


<210>  55
<211>  46
<212>  PRT
<213>  Mycobacterium tuberculosis

<400>  55

Lys Phe Leu His Asp Met Leu Ala Glu Glu Leu Arg Tyr Ser Val Ile 
1               5                   10                  15      


Arg Glu Val Leu Pro Thr Arg Arg Ala Arg Thr Phe Asp Leu Glu Val 
            20                  25                  30          


Glu Glu Leu His Thr Leu Val Ala Glu Gly Val Val Val His 
        35                  40                  45      


<210>  56
<211>  9983
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, pAG2004

<400>  56
ccgatggccg agctgtggat gggcgcacat ccgaaaagca gttcacgagt gcagaatgcc       60

gccggagata tcgtttcact gcgtgatgtg attgagagtg ataaatcgac tctgctcgga      120

gaggccgttg ccaaacgctt tggcgaactg cctttcctgt tcaaagtatt atgcgcagca      180

cagccactct ccattcaggt tcatccaaac aaacacaatt ctgaaatcgg ttttgccaaa      240

gaaaatgccg caggtatccc gatggatgcc gccgagcgta actataaaga tcctaaccac      300

aagccggagc tggtttttgc gctgacgcct ttccttgcga tgaacgcgtt tcgtgaattt      360

tccgagattg tctccctact ccagccggtc gcaggtgcac atccggcgat tgctcacttt      420

ttacaacagc ctgatgccga acgtttaagc gaactgttcg ccagcctgtt gaatatgcag      480

ggtgaagaaa aatcccgcgc gctggcgatt ttaaaatcgg ccctcgatag ccagcagggt      540

gaaccgtggc aaacgattcg tttaatttct gaattttacc cggaagacag cggtctgttc      600

tccccgctat tgctgaatgt ggtgaaattg aaccctggcg aagcgatgtt cctgttcgct      660

gaaacaccgc acgcttacct gcaaggcgtg gcgctggaag tgatggcaaa ctccgataac      720

gtgctgcgtg cgggtctgac gcctaaatac attgatattc cggaactggt tgccaatgtg      780

aaattcgaag ccaaaccggc taaccagttg ttgacccagc cggtgaaaca aggtgcagaa      840

ctggacttcc cgattccagt ggatgatttt gccttctcgc tgcatgacct tagtgataaa      900

gaaaccacca ttagccagca gagtgccgcc attttgttct gcgtcgaagg cgatgcaacg      960

ttgtggaaag gttctcagca gttacagctt aaaccgggtg aatcagcgtt tattgccgcc     1020

aacgaatcac cggtgactgt caaaggccac ggccgtttag cgcgtgttta caacaagctg     1080

taagagctta ctgaaaaaat taacatctct tgctaagctg ggagctctag atccccgaat     1140

ttccccgatc gttcaaacat ttggcaataa agtttcttaa gattgaatcc tgttgccggt     1200

cttgcgatga ttatcatata atttctgttg aattacgtta agcatgtaat aattaacatg     1260

taatgcatga cgttatttat gagatgggtt tttatgatta gagtcccgca attatacatt     1320

taatacgcga tagaaaacaa aatatagcgc gcaaactagg ataaattatc gcgcgcggtg     1380

tcatctatgt tactagatcg ggaattggcg agctcgaatt aattcagtac attaaaaacg     1440

tccgcaatgt gttattaagt tgtctaagcg tcaatttgtt tacaccacaa tatatcctgc     1500

caccagccag ccaacagctc cccgaccggc agctcggcac aaaatcacca ctcgatacag     1560

gcagcccatc agtccgggac ggcgtcagcg ggagagccgt tgtaaggcgg cagactttgc     1620

tcatgttacc gatgctattc ggaagaacgg caactaagct gccgggtttg aaacacggat     1680

gatctcgcgg agggtagcat gttgattgta acgatgacag agcgttgctg cctgtgatca     1740

aatatcatct ccctcgcaga gatccgaatt atcagccttc ttattcattt ctcgcttaac     1800

cgtgacaggc tgtcgatctt gagaactatg ccgacataat aggaaatcgc tggataaagc     1860

cgctgaggaa gctgagtggc gctatttctt tagaagtgaa cgttgacgat cgtcgaccgt     1920

accccgatga attaattcgg acgtacgttc tgaacacagc tggatactta cttgggcgat     1980

tgtcatacat gacatcaaca atgtacccgt ttgtgtaacc gtctcttgga ggttcgtatg     2040

acactagtgg ttcccctcag cttgcgacta gatgttgagg cctaacattt tattagagag     2100

caggctagtt gcttagatac atgatcttca ggccgttatc tgtcagggca agcgaaaatt     2160

ggccatttat gacgaccaat gccccgcaga agctcccatc tttgccgcca tagacgccgc     2220

gccccccttt tggggtgtag aacatccttt tgccagatgt ggaaaagaag ttcgttgtcc     2280

cattgttggc aatgacgtag tagccggcga aagtgcgaga cccatttgcg ctatatataa     2340

gcctacgatt tccgttgcga ctattgtcgt aattggatga actattatcg tagttgctct     2400

cagagttgtc gtaatttgat ggactattgt cgtaattgct tatggagttg tcgtagttgc     2460

ttggagaaat gtcgtagttg gatggggagt agtcataggg aagacgagct tcatccacta     2520

aaacaattgg caggtcagca agtgcctgcc ccgatgccat cgcaagtacg aggcttagaa     2580

ccaccttcaa cagatcgcgc atagtcttcc ccagctctct aacgcttgag ttaagccgcg     2640

ccgcgaagcg gcgtcggctt gaacgaattg ttagacatta tttgccgact accttggtga     2700

tctcgccttt cacgtagtga acaaattctt ccaactgatc tgcgcgcgag gccaagcgat     2760

cttcttgtcc aagataagcc tgcctagctt caagtatgac gggctgatac tgggccggca     2820

ggcgctccat tgcccagtcg gcagcgacat ccttcggcgc gattttgccg gttactgcgc     2880

tgtaccaaat gcgggacaac gtaagcacta catttcgctc atcgccagcc cagtcgggcg     2940

gcgagttcca tagcgttaag gtttcattta gcgcctcaaa tagatcctgt tcaggaaccg     3000

gatcaaagag ttcctccgcc gctggaccta ccaaggcaac gctatgttct cttgcttttg     3060

tcagcaagat agccagatca atgtcgatcg tggctggctc gaagatacct gcaagaatgt     3120

cattgcgctg ccattctcca aattgcagtt cgcgcttagc tggataacgc cacggaatga     3180

tgtcgtcgtg cacaacaatg gtgacttcta cagcgcggag aatctcgctc tctccagggg     3240

aagccgaagt ttccaaaagg tcgttgatca aagctcgccg cgttgtttca tcaagcctta     3300

cggtcaccgt aaccagcaaa tcaatatcac tgtgtggctt caggccgcca tccactgcgg     3360

agccgtacaa atgtacggcc agcaacgtcg gttcgagatg gcgctcgatg acgccaacta     3420

cctctgatag ttgagtcgat acttcggcga tcaccgcttc cctcatgatg tttaactcct     3480

gaattaagcc gcgccgcgaa gcggtgtcgg cttgaatgaa ttgttaggcg tcatcctgtg     3540

ctcccgagaa ccagtaccag tacatcgctg tttcgttcga gacttgaggt ctagttttat     3600

acgtgaacag gtcaatgccg ccgagagtaa agccacattt tgcgtacaaa ttgcaggcag     3660

gtacattgtt cgtttgtgtc tctaatcgta tgccaaggag ctgtctgctt agtgcccact     3720

ttttcgcaaa ttcgatgaga ctgtgcgcga ctcctttgcc tcggtgcgtg tgcgacacaa     3780

caatgtgttc gatagaggct agatcgttcc atgttgagtt gagttcaatc ttcccgacaa     3840

gctcttggtc gatgaatgcg ccatagcaag cagagtcttc atcagagtca tcatccgaga     3900

tgtaatcctt ccggtagggg ctcacacttc tggtagatag ttcaaagcct tggtcggata     3960

ggtgcacatc gaacacttca cgaacaatga aatggttctc agcatccaat gtttccgcca     4020

cctgctcagg gatcaccgaa atcttcatat gacgcctaac gcctggcaca gcggatcgca     4080

aacctggcgc ggcttttggc acaaaaggcg tgacaggttt gcgaatccgt tgctgccact     4140

tgttaaccct tttgccagat ttggtaacta taatttatgt tagaggcgaa gtcttgggta     4200

aaaactggcc taaaattgct ggggatttca ggaaagtaaa catcaccttc cggctcgatg     4260

tctattgtag atatatgtag tgtatctact tgatcggggg atctgctgcc tcgcgcgttt     4320

cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca cagcttgtct     4380

gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg ttggcgggtg     4440

tcggggcgca gccatgaccc agtcacgtag cgatagcgga gtgtatactg gcttaactat     4500

gcggcatcag agcagattgt actgagagtg caccatatgc ggtgtgaaat accgcacaga     4560

tgcgtaagga gaaaataccg catcaggcgc tcttccgctt cctcgctcac tgactcgctg     4620

cgctcggtcg ttcggctgcg gcgagcggta tcagctcact caaaggcggt aatacggtta     4680

tccacagaat caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc     4740

aggaaccgta aaaaggccgc gttgctggcg tttttccata ggctccgccc ccctgacgag     4800

catcacaaaa atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagatac     4860

caggcgtttc cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc     4920

ggatacctgt ccgcctttct cccttcggga agcgtggcgc tttctcatag ctcacgctgt     4980

aggtatctca gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc     5040

gttcagcccg accgctgcgc cttatccggt aactatcgtc ttgagtccaa cccggtaaga     5100

cacgacttat cgccactggc agcagccact ggtaacagga ttagcagagc gaggtatgta     5160

ggcggtgcta cagagttctt gaagtggtgg cctaactacg gctacactag aaggacagta     5220

tttggtatct gcgctctgct gaagccagtt accttcggaa aaagagttgg tagctcttga     5280

tccggcaaac aaaccaccgc tggtagcggt ggtttttttg tttgcaagca gcagattacg     5340

cgcagaaaaa aaggatctca agaagatcct ttgatctttt ctacggggtc tgacgctcag     5400

tggaacgaaa actcacgtta agggattttg gtcatgagat tatcaaaaag gatcttcacc     5460

tagatccttt taaattaaaa atgaagtttt aaatcaatct aaagtatata tgagtaaact     5520

tggtctgaca gttaccaatg cttaatcagt gaggcaccta tctcagcgat ctgtctattt     5580

cgttcatcca tagttgcctg actccccgtc gtgtagataa ctacgatacg ggagggctta     5640

ccatctggcc ccagtgctgc aatgataccg cgagacccac gctcaccggc tccagattta     5700

tcagcaataa accagccagc cggaagggcc gagcgcagaa gtggtcctgc aactttatcc     5760

gcctccatcc agtctattaa ttgttgccgg gaagctagag taagtagttc gccagttaat     5820

agtttgcgca acgttgttgc cattgctgca gggggggggg ggggggggtt ccattgttca     5880

ttccacggac aaaaacagag aaaggaaacg acagaggcca aaaagctcgc tttcagcacc     5940

tgtcgtttcc tttcttttca gagggtattt taaataaaaa cattaagtta tgacgaagaa     6000

gaacggaaac gccttaaacc ggaaaatttt cataaatagc gaaaacccgc gaggtcgccg     6060

ccccgtaacc tgtcggatca ccggaaagga cccgtaaagt gataatgatt atcatctaca     6120

tatcacaacg tgcgtggagg ccatcaaacc acgtcaaata atcaattatg acgcaggtat     6180

cgtattaatt gatctgcatc aacttaacgt aaaaacaact tcagacaata caaatcagcg     6240

acactgaata cggggcaacc tcatgtcccc cccccccccc ccctgcaggc atcgtggtgt     6300

cacgctcgtc gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta     6360

catgatcccc catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca     6420

gaagtaagtt ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta     6480

ctgtcatgcc atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct     6540

gagaatagtg tatgcggcga ccgagttgct cttgcccggc gtcaacacgg gataataccg     6600

cgccacatag cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac     6660

tctcaaggat cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact     6720

gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa     6780

atgccgcaaa aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt     6840

ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat     6900

gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg     6960

acgtctaaga aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc     7020

cctttcgtct tcaagaattg gtcgacgatc ttgctgcgtt cggatatttt cgtggagttc     7080

ccgccacaga cccggattga aggcgagatc cagcaactcg cgccagatca tcctgtgacg     7140

gaactttggc gcgtgatgac tggccaggac gtcggccgaa agagcgacaa gcagatcacg     7200

cttttcgaca gcgtcggatt tgcgatcgag gatttttcgg cgctgcgcta cgtccgcgac     7260

cgcgttgagg gatcaagcca cagcagccca ctcgaccttc tagccgaccc agacgagcca     7320

agggatcttt ttggaatgct gctccgtcgt caggctttcc gacgtttggg tggttgaaca     7380

gaagtcatta tcgcacggaa tgccaagcac tcccgagggg aaccctgtgg ttggcatgca     7440

catacaaatg gacgaacgga taaacctttt cacgcccttt taaatatccg attattctaa     7500

taaacgctct tttctcttag gtttacccgc caatatatcc tgtcaaacac tgatagttta     7560

aactgaaggc gggaaacgac aacctgatca tgagcggaga attaagggag tcacgttatg     7620

acccccgccg atgacgcggg acaagccgtt ttacgtttgg aactgacaga accgcaacgt     7680

tgaaggagcc actcagctta attaagtcta actcgagtta ctggtacgta ccaaatccat     7740

ggaatcaagg taccatcaat cccgggtatt catcctaggt atccaagaat tcatactaaa     7800

gcttgcatgc ctgcaggtcg actctagtaa cggccgccag tgtgctggaa ttaattcggc     7860

ttgtcgacca cccaacccca tatcgacaga ggatgtgaag aacaggtaaa tcacgcagaa     7920

gaacccatct ctgatagcag ctatcgatta gaacaacgaa tccatattgg gtccgtggga     7980

aatacttact gcacaggaag ggggcgatct gacgaggccc cgccaccggc ctcgacccga     8040

ggccgaggcc gacgaagcgc cggcgagtac ggcgccgcgg cggcctctgc ccgtgccctc     8100

tgcgcgtggg agggagaggc cgcggtggtg ggggcgcgcg cgcgcgcgcg cgcagctggt     8160

gcggcggcgc gggggtcagc cgccgagccg gcggcgacgg aggagcaggg cggcgtggac     8220

gcgaacttcc gatcggttgg tcagagtgcg cgagttgggc ttagccaatt aggtctcaac     8280

aatctattgg gccgtaaaat tcatgggccc tggtttgtct aggcccaata tcccgttcat     8340

ttcagcccac aaatatttcc ccagaggatt attaaggccc acacgcagct tatagcagat     8400

caagtacgat gtttcctgat cgttggatcg gaaacgtacg gtcttgatca ggcatgccga     8460

cttcgtcaaa gagaggcggc atgacctgac gcggagttgg ttccgggcac cgtctggatg     8520

gtcgtaccgg gaccggacac gtgtcgcgcc tccaactaca tggacacgtg tggtgctgcc     8580

attgggccgt acgcgtggcg gtgaccgcac cggatgctgc ctcgcaccgc cttgcccacg     8640

ctttatatag agaggttttc tctccattaa tcgcatagcg agtcgaatcg accgaagggg     8700

agggggagcg aagctttgcg ttctctaatc gcctcgtcaa ggtaactaat caatcacctc     8760

gtcctaatcc tcgaatctct cgtggtgccc gtctaatctc gcgattttga tgctcgtggt     8820

ggaaagcgta ggaggatccc gtgcgagtta gtctcaatct ctcagggttt cgtgcgattt     8880

tagggtgatc cacctcttaa tcgagttacg gtttcgtgcg attttagggt aatcctctta     8940

atctctcatt gatttagggt ttcgtgagaa tcgaggtagg gatctgtgtt atttatatcg     9000

atctaataga tggattggtt ttgagattgt tctgtcagat ggggattgtt tcgatatatt     9060

accctaatga tgtgtcagat ggggattgtt tcgatatatt accctaatga tgtgtcagat     9120

ggggattgtt tcgatatatt accctaatga tggataataa gagtagttca cagttatgtt     9180

ttgatcctgc cacatagttt gagttttgtg atcagattta gttttactta tttgtgctta     9240

gttcggatgg gattgttctg atattgttcc aatagatgaa tagctcgtta ggttaaaatc     9300

tttaggttga gttaggcgac acatagttta tttcctctgg atttggattg gaattgtgtt     9360

cttagttttt ttcccctgga tttggattgg aattgtgtgg agctgggtta gagaattaca     9420

tctgtatcgt gtacacctac ttgaactgta gagcttgggt tctaaggtca atttaatctg     9480

tattgtatct ggctctttgc ctagttgaac tgtagtgctg atgttgtact gtgttttttt     9540

acccgtttta tttgctttac tcgtgcaaat caaatctgtc agatgctaga actaggtggc     9600

tttattctgt gttcttacat agatctgttg tcctgtagtt acttatgtca gttttgttat     9660

tatctgaaga tatttttggt tgttgcttgt tgatgtggtg tgagctgtga gcagcgctct     9720

tatgattaat gatgctgtcc aattgtagtg tagtatgatg tgattgatat gttcatctat     9780

tttgagctga cagtaccgat atcgtaggat ctggtgccaa cttattctcc agctgctttt     9840

ttttacctat gttaattcca atcctttctt gcctcttcca gatccagata atgcagaaac     9900

tcattaactc agtgcaaaac tatgcctggg gcagcaaaac ggcgttgact gaactttatg     9960

gtatggaaaa tccgtccagc cag                                             9983


<210>  57
<211>  13393
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, pAG2014

<400>  57
aattcatact aaagcttgca tgcctgcagg tcgactctag taacggccgc cagtgtgctg       60

gaattaattc ggcttgtcga ccacccaacc ccatatcgac agaggatgtg aagaacaggt      120

aaatcacgca gaagaaccca tctctgatag cagctatcga ttagaacaac gaatccatat      180

tgggtccgtg ggaaatactt actgcacagg aagggggcga tctgacgagg ccccgccacc      240

ggcctcgacc cgaggccgag gccgacgaag cgccggcgag tacggcgccg cggcggcctc      300

tgcccgtgcc ctctgcgcgt gggagggaga ggccgcggtg gtgggggcgc gcgcgcgcgc      360

gcgcgcagct ggtgcggcgg cgcgggggtc agccgccgag ccggcggcga cggaggagca      420

gggcggcgtg gacgcgaact tccgatcggt tggtcagagt gcgcgagttg ggcttagcca      480

attaggtctc aacaatctat tgggccgtaa aattcatggg ccctggtttg tctaggccca      540

atatcccgtt catttcagcc cacaaatatt tccccagagg attattaagg cccacacgca      600

gcttatagca gatcaagtac gatgtttcct gatcgttgga tcggaaacgt acggtcttga      660

tcaggcatgc cgacttcgtc aaagagaggc ggcatgacct gacgcggagt tggttccggg      720

caccgtctgg atggtcgtac cgggaccgga cacgtgtcgc gcctccaact acatggacac      780

gtgtggtgct gccattgggc cgtacgcgtg gcggtgaccg caccggatgc tgcctcgcac      840

cgccttgccc acgctttata tagagaggtt ttctctccat taatcgcata gcgagtcgaa      900

tcgaccgaag gggaggggga gcgaagcttt gcgttctcta atcgcctcgt caaggtaact      960

aatcaatcac ctcgtcctaa tcctcgaatc tctcgtggtg cccgtctaat ctcgcgattt     1020

tgatgctcgt ggtggaaagc gtaggaggat cccgtgcgag ttagtctcaa tctctcaggg     1080

tttcgtgcga ttttagggtg atccacctct taatcgagtt acggtttcgt gcgattttag     1140

ggtaatcctc ttaatctctc attgatttag ggtttcgtga gaatcgaggt agggatctgt     1200

gttatttata tcgatctaat agatggattg gttttgagat tgttctgtca gatggggatt     1260

gtttcgatat attaccctaa tgatgtgtca gatggggatt gtttcgatat attaccctaa     1320

tgatgtgtca gatggggatt gtttcgatat attaccctaa tgatggataa taagagtagt     1380

tcacagttat gttttgatcc tgccacatag tttgagtttt gtgatcagat ttagttttac     1440

ttatttgtgc ttagttcgga tgggattgtt ctgatattgt tccaatagat gaatagctcg     1500

ttaggttaaa atctttaggt tgagttaggc gacacatagt ttatttcctc tggatttgga     1560

ttggaattgt gttcttagtt tttttcccct ggatttggat tggaattgtg tggagctggg     1620

ttagagaatt acatctgtat cgtgtacacc tacttgaact gtagagcttg ggttctaagg     1680

tcaatttaat ctgtattgta tctggctctt tgcctagttg aactgtagtg ctgatgttgt     1740

actgtgtttt tttacccgtt ttatttgctt tactcgtgca aatcaaatct gtcagatgct     1800

agaactaggt ggctttattc tgtgttctta catagatctg ttgtcctgta gttacttatg     1860

tcagttttgt tattatctga agatattttt ggttgttgct tgttgatgtg gtgtgagctg     1920

tgagcagcgc tcttatgatt aatgatgctg tccaattgta gtgtagtatg atgtgattga     1980

tatgttcatc tattttgagc tgacagtacc gatatcgtag gatctggtgc caacttattc     2040

tccagctgct tttttttacc tatgttaatt ccaatccttt cttgcctctt ccagatccag     2100

ataatgcaga aactcattaa ctcagtgcaa aactatgcct ggggcagcaa aacggcgttg     2160

actgaacttt atggtatgga aaatccgtcc agccagccga tggccgagct gtggatgggc     2220

gcacatccga aaagcagttc acgagtgcag aatgccgccg gagatatcgt ttcactgcgt     2280

gatgtgattg agagtgataa atcgactctg ctcggagagg ccgttgccaa acgctttggc     2340

gaactgcctt tcctgttcaa agtattatgc gcagcacagc cactctccat tcaggttcat     2400

ccaaacaaac acaattctga aatcggtttt gccaaagaaa atgccgcagg tatcccgatg     2460

gatgccgccg agcgtaacta taaagatcct aaccacaagc cggagctggt ttttgcgctg     2520

acgcctttcc ttgcgatgaa cgcgtttcgt gaattttccg agattgtctc cctactccag     2580

ccggtcgcag gtgcacatcc ggcgattgct cactttttac aacagcctga tgccgaacgt     2640

ttaagcgaac tgttcgccag cctgttgaat atgcagggtg aagaaaaatc ccgcgcgctg     2700

gcgattttaa aatcggccct cgatagccag cagggtgaac cgtggcaaac gattcgttta     2760

atttctgaat tttacccgga agacagcggt ctgttctccc cgctattgct gaatgtggtg     2820

aaattgaacc ctggcgaagc gatgttcctg ttcgctgaaa caccgcacgc ttacctgcaa     2880

ggcgtggcgc tggaagtgat ggcaaactcc gataacgtgc tgcgtgcggg tctgacgcct     2940

aaatacattg atattccgga actggttgcc aatgtgaaat tcgaagccaa accggctaac     3000

cagttgttga cccagccggt gaaacaaggt gcagaactgg acttcccgat tccagtggat     3060

gattttgcct tctcgctgca tgaccttagt gataaagaaa ccaccattag ccagcagagt     3120

gccgccattt tgttctgcgt cgaaggcgat gcaacgttgt ggaaaggttc tcagcagtta     3180

cagcttaaac cgggtgaatc agcgtttatt gccgccaacg aatcaccggt gactgtcaaa     3240

ggccacggcc gtttagcgcg tgtttacaac aagctgtaag agcttactga aaaaattaac     3300

atctcttgct aagctgggag ctctagatcc ccgaatttcc ccgatcgttc aaacatttgg     3360

caataaagtt tcttaagatt gaatcctgtt gccggtcttg cgatgattat catataattt     3420

ctgttgaatt acgttaagca tgtaataatt aacatgtaat gcatgacgtt atttatgaga     3480

tgggttttta tgattagagt cccgcaatta tacatttaat acgcgataga aaacaaaata     3540

tagcgcgcaa actaggataa attatcgcgc gcggtgtcat ctatgttact agatcgggaa     3600

ttggcgagct cgaattaatt cagtacatta aaaacgtccg caatgtgtta ttaagttgtc     3660

taagcgtcaa tttgtttaca ccacaatata tcctgccacc agccagccaa cagctccccg     3720

accggcagct cggcacaaaa tcaccactcg atacaggcag cccatcagtc cgggacggcg     3780

tcagcgggag agccgttgta aggcggcaga ctttgctcat gttaccgatg ctattcggaa     3840

gaacggcaac taagctgccg ggtttgaaac acggatgatc tcgcggaggg tagcatgttg     3900

attgtaacga tgacagagcg ttgctgcctg tgatcaaata tcatctccct cgcagagatc     3960

cgaattatca gccttcttat tcatttctcg cttaaccgtg acaggctgtc gatcttgaga     4020

actatgccga cataatagga aatcgctgga taaagccgct gaggaagctg agtggcgcta     4080

tttctttaga agtgaacgtt gacgatcgtc gaccgtaccc cgatgaatta attcggacgt     4140

acgttctgaa cacagctgga tacttacttg ggcgattgtc atacatgaca tcaacaatgt     4200

acccgtttgt gtaaccgtct cttggaggtt cgtatgacac tagtggttcc cctcagcttg     4260

cgactagatg ttgaggccta acattttatt agagagcagg ctagttgctt agatacatga     4320

tcttcaggcc gttatctgtc agggcaagcg aaaattggcc atttatgacg accaatgccc     4380

cgcagaagct cccatctttg ccgccataga cgccgcgccc cccttttggg gtgtagaaca     4440

tccttttgcc agatgtggaa aagaagttcg ttgtcccatt gttggcaatg acgtagtagc     4500

cggcgaaagt gcgagaccca tttgcgctat atataagcct acgatttccg ttgcgactat     4560

tgtcgtaatt ggatgaacta ttatcgtagt tgctctcaga gttgtcgtaa tttgatggac     4620

tattgtcgta attgcttatg gagttgtcgt agttgcttgg agaaatgtcg tagttggatg     4680

gggagtagtc atagggaaga cgagcttcat ccactaaaac aattggcagg tcagcaagtg     4740

cctgccccga tgccatcgca agtacgaggc ttagaaccac cttcaacaga tcgcgcatag     4800

tcttccccag ctctctaacg cttgagttaa gccgcgccgc gaagcggcgt cggcttgaac     4860

gaattgttag acattatttg ccgactacct tggtgatctc gcctttcacg tagtgaacaa     4920

attcttccaa ctgatctgcg cgcgaggcca agcgatcttc ttgtccaaga taagcctgcc     4980

tagcttcaag tatgacgggc tgatactggg ccggcaggcg ctccattgcc cagtcggcag     5040

cgacatcctt cggcgcgatt ttgccggtta ctgcgctgta ccaaatgcgg gacaacgtaa     5100

gcactacatt tcgctcatcg ccagcccagt cgggcggcga gttccatagc gttaaggttt     5160

catttagcgc ctcaaataga tcctgttcag gaaccggatc aaagagttcc tccgccgctg     5220

gacctaccaa ggcaacgcta tgttctcttg cttttgtcag caagatagcc agatcaatgt     5280

cgatcgtggc tggctcgaag atacctgcaa gaatgtcatt gcgctgccat tctccaaatt     5340

gcagttcgcg cttagctgga taacgccacg gaatgatgtc gtcgtgcaca acaatggtga     5400

cttctacagc gcggagaatc tcgctctctc caggggaagc cgaagtttcc aaaaggtcgt     5460

tgatcaaagc tcgccgcgtt gtttcatcaa gccttacggt caccgtaacc agcaaatcaa     5520

tatcactgtg tggcttcagg ccgccatcca ctgcggagcc gtacaaatgt acggccagca     5580

acgtcggttc gagatggcgc tcgatgacgc caactacctc tgatagttga gtcgatactt     5640

cggcgatcac cgcttccctc atgatgttta actcctgaat taagccgcgc cgcgaagcgg     5700

tgtcggcttg aatgaattgt taggcgtcat cctgtgctcc cgagaaccag taccagtaca     5760

tcgctgtttc gttcgagact tgaggtctag ttttatacgt gaacaggtca atgccgccga     5820

gagtaaagcc acattttgcg tacaaattgc aggcaggtac attgttcgtt tgtgtctcta     5880

atcgtatgcc aaggagctgt ctgcttagtg cccacttttt cgcaaattcg atgagactgt     5940

gcgcgactcc tttgcctcgg tgcgtgtgcg acacaacaat gtgttcgata gaggctagat     6000

cgttccatgt tgagttgagt tcaatcttcc cgacaagctc ttggtcgatg aatgcgccat     6060

agcaagcaga gtcttcatca gagtcatcat ccgagatgta atccttccgg taggggctca     6120

cacttctggt agatagttca aagccttggt cggataggtg cacatcgaac acttcacgaa     6180

caatgaaatg gttctcagca tccaatgttt ccgccacctg ctcagggatc accgaaatct     6240

tcatatgacg cctaacgcct ggcacagcgg atcgcaaacc tggcgcggct tttggcacaa     6300

aaggcgtgac aggtttgcga atccgttgct gccacttgtt aacccttttg ccagatttgg     6360

taactataat ttatgttaga ggcgaagtct tgggtaaaaa ctggcctaaa attgctgggg     6420

atttcaggaa agtaaacatc accttccggc tcgatgtcta ttgtagatat atgtagtgta     6480

tctacttgat cgggggatct gctgcctcgc gcgtttcggt gatgacggtg aaaacctctg     6540

acacatgcag ctcccggaga cggtcacagc ttgtctgtaa gcggatgccg ggagcagaca     6600

agcccgtcag ggcgcgtcag cgggtgttgg cgggtgtcgg ggcgcagcca tgacccagtc     6660

acgtagcgat agcggagtgt atactggctt aactatgcgg catcagagca gattgtactg     6720

agagtgcacc atatgcggtg tgaaataccg cacagatgcg taaggagaaa ataccgcatc     6780

aggcgctctt ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg gctgcggcga     6840

gcggtatcag ctcactcaaa ggcggtaata cggttatcca cagaatcagg ggataacgca     6900

ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg     6960

ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt     7020

cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc tggaagctcc     7080

ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct     7140

tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atctcagttc ggtgtaggtc     7200

gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta     7260

tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc actggcagca     7320

gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga gttcttgaag     7380

tggtggccta actacggcta cactagaagg acagtatttg gtatctgcgc tctgctgaag     7440

ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac caccgctggt     7500

agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg atctcaagaa     7560

gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc acgttaaggg     7620

attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga     7680

agttttaaat caatctaaag tatatatgag taaacttggt ctgacagtta ccaatgctta     7740

atcagtgagg cacctatctc agcgatctgt ctatttcgtt catccatagt tgcctgactc     7800

cccgtcgtgt agataactac gatacgggag ggcttaccat ctggccccag tgctgcaatg     7860

ataccgcgag acccacgctc accggctcca gatttatcag caataaacca gccagccgga     7920

agggccgagc gcagaagtgg tcctgcaact ttatccgcct ccatccagtc tattaattgt     7980

tgccgggaag ctagagtaag tagttcgcca gttaatagtt tgcgcaacgt tgttgccatt     8040

gctgcagggg gggggggggg ggggttccat tgttcattcc acggacaaaa acagagaaag     8100

gaaacgacag aggccaaaaa gctcgctttc agcacctgtc gtttcctttc ttttcagagg     8160

gtattttaaa taaaaacatt aagttatgac gaagaagaac ggaaacgcct taaaccggaa     8220

aattttcata aatagcgaaa acccgcgagg tcgccgcccc gtaacctgtc ggatcaccgg     8280

aaaggacccg taaagtgata atgattatca tctacatatc acaacgtgcg tggaggccat     8340

caaaccacgt caaataatca attatgacgc aggtatcgta ttaattgatc tgcatcaact     8400

taacgtaaaa acaacttcag acaatacaaa tcagcgacac tgaatacggg gcaacctcat     8460

gtcccccccc ccccccccct gcaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt     8520

cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa     8580

aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat     8640

cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct     8700

tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga     8760

gttgctcttg cccggcgtca acacgggata ataccgcgcc acatagcaga actttaaaag     8820

tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga     8880

gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca     8940

ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg     9000

cgacacggaa atgttgaata ctcatactct tcctttttca atattattga agcatttatc     9060

agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag     9120

gggttccgcg cacatttccc cgaaaagtgc cacctgacgt ctaagaaacc attattatca     9180

tgacattaac ctataaaaat aggcgtatca cgaggccctt tcgtcttcaa gaattggtcg     9240

acgatcttgc tgcgttcgga tattttcgtg gagttcccgc cacagacccg gattgaaggc     9300

gagatccagc aactcgcgcc agatcatcct gtgacggaac tttggcgcgt gatgactggc     9360

caggacgtcg gccgaaagag cgacaagcag atcacgcttt tcgacagcgt cggatttgcg     9420

atcgaggatt tttcggcgct gcgctacgtc cgcgaccgcg ttgagggatc aagccacagc     9480

agcccactcg accttctagc cgacccagac gagccaaggg atctttttgg aatgctgctc     9540

cgtcgtcagg ctttccgacg tttgggtggt tgaacagaag tcattatcgc acggaatgcc     9600

aagcactccc gaggggaacc ctgtggttgg catgcacata caaatggacg aacggataaa     9660

ccttttcacg cccttttaaa tatccgatta ttctaataaa cgctcttttc tcttaggttt     9720

acccgccaat atatcctgtc aaacactgat agtttaaact gaaggcggga aacgacaacc     9780

tgatcatgag cggagaatta agggagtcac gttatgaccc ccgccgatga cgcgggacaa     9840

gccgttttac gtttggaact gacagaaccg caacgttgaa ggagccactc agcttaatta     9900

agtctaactc gagttactgg tacgtaccaa atccatggaa tcaaggtacc gtcgactcta     9960

gtaacggccg ccagtgtgct ggaattaatt cggcttgtcg accacccaac cccatatcga    10020

cagaggatgt gaagaacagg taaatcacgc agaagaaccc atctctgata gcagctatcg    10080

attagaacaa cgaatccata ttgggtccgt gggaaatact tactgcacag gaagggggcg    10140

atctgacgag gccccgccac cggcctcgac ccgaggccga ggccgacgaa gcgccggcga    10200

gtacggcgcc gcggcggcct ctgcccgtgc cctctgcgcg tgggagggag aggccgcggt    10260

ggtgggggcg cgcgcgcgcg cgcgcgcagc tggtgcggcg gcgcgggggt cagccgccga    10320

gccggcggcg acggaggagc agggcggcgt ggacgcgaac ttccgatcgg ttggtcagag    10380

tgcgcgagtt gggcttagcc aattaggtct caacaatcta ttgggccgta aaattcatgg    10440

gccctggttt gtctaggccc aatatcccgt tcatttcagc ccacaaatat ttccccagag    10500

gattattaag gcccacacgc agcttatagc agatcaagta cgatgtttcc tgatcgttgg    10560

atcggaaacg tacggtcttg atcaggcatg ccgacttcgt caaagagagg cggcatgacc    10620

tgacgcggag ttggttccgg gcaccgtctg gatggtcgta ccgggaccgg acacgtgtcg    10680

cgcctccaac tacatggaca cgtgtggtgc tgccattggg ccgtacgcgt ggcggtgacc    10740

gcaccggatg ctgcctcgca ccgccttgcc cacgctttat atagagaggt tttctctcca    10800

ttaatcgcat agcgagtcga atcgaccgaa ggggaggggg agcgaagctt tgcgttctct    10860

aatcgcctcg tcaaggtaac taatcaatca cctcgtccta atcctcgaat ctctcgtggt    10920

gcccgtctaa tctcgcgatt ttgatgctcg tggtggaaag cgtaggagga tcccgtgcga    10980

gttagtctca atctctcagg gtttcgtgcg attttagggt gatccacctc ttaatcgagt    11040

tacggtttcg tgcgatttta gggtaatcct cttaatctct cattgattta gggtttcgtg    11100

agaatcgagg tagggatctg tgttatttat atcgatctaa tagatggatt ggttttgaga    11160

ttgttctgtc agatggggat tgtttcgata tattacccta atgatgtgtc agatggggat    11220

tgtttcgata tattacccta atgatgtgtc agatggggat tgtttcgata tattacccta    11280

atgatggata ataagagtag ttcacagtta tgttttgatc ctgccacata gtttgagttt    11340

tgtgatcaga tttagtttta cttatttgtg cttagttcgg atgggattgt tctgatattg    11400

ttccaataga tgaatagctc gttaggttaa aatctttagg ttgagttagg cgacacatag    11460

tttatttcct ctggatttgg attggaattg tgttcttagt ttttttcccc tggatttgga    11520

ttggaattgt gtggagctgg gttagagaat tacatctgta tcgtgtacac ctacttgaac    11580

tgtagagctt gggttctaag gtcaatttaa tctgtattgt atctggctct ttgcctagtt    11640

gaactgtagt gctgatgttg tactgtgttt ttttacccgt tttatttgct ttactcgtgc    11700

aaatcaaatc tgtcagatgc tagaactagg tggctttatt ctgtgttctt acatagatct    11760

gttgtcctgt agttacttat gtcagttttg ttattatctg aagatatttt tggttgttgc    11820

ttgttgatgt ggtgtgagct gtgagcagcg ctcttatgat taatgatgct gtccaattgt    11880

agtgtagtat gatgtgattg atatgttcat ctattttgag ctgacagtac cgatatcgta    11940

ggatctggtg ccaacttatt ctccagctgc ttttttttac ctatgttaat tccaatcctt    12000

tcttgcctct tccagatcca gataatggcg aacaaacatt tgtccctctc cctcttcctc    12060

gtcctccttg gcctgtcggc cagcttggcc tccgggcaac aaacaagcat tactctgaca    12120

tccaacgcat ccggtacgtt tgacggttac tattacgaac tctggaagga tactggcaat    12180

acaacaatga cggtctacac tcaaggtcgc ttttcctgcc agtggtcgaa catcaataac    12240

gcgttgttta ggaccgggaa gaaatacaac cagaattggc agtctcttgg cacaatccgg    12300

atcacgtact ctgcgactta caacccaaac gggaactcct acttgtgtat ctatggctgg    12360

tctaccaacc cattggtcga gttctacatc gttgagtcct gggggaactg gagaccgcct    12420

ggtgccacgt ccctgggcca agtgacaatc gatggcggga cctacgacat ctataggacg    12480

acacgcgtca accagccttc cattgtgggg acagccacgt tcgatcagta ctggagcgtg    12540

cgcacctcta agcggacttc aggaacagtg accgtgaccg atcacttccg cgcctgggcg    12600

aaccggggcc tgaacctcgg cacaatagac caaattacat tgtgcgtgga gggttaccaa    12660

agctctggat cagccaacat cacccagaac accttctctc agggctcttc ttccggcagt    12720

tcgggtggct catccggctc cacaacgact actcgcatcg agtgtgagaa catgtccttg    12780

tccggaccct acgttagcag gatcaccaat ccctttaatg gtattgcgct gtacgccaac    12840

ggagacacag cccgcgctac cgttaacttc cccgcaagtc gcaactacaa tttccgcctg    12900

cggggttgcg gcaacaacaa taatcttgcc cgtgtggacc tgaggatcga cggacggacc    12960

gtcgggacct tttattacca gggcacatac ccctgggagg ccccaattga caatgtttat    13020

gtcagtgcgg ggagtcatac agtcgaaatc actgttactg cggataacgg cacatgggac    13080

gtgtatgccg actacctggt gatacagtga cctaggtccc cgaatttccc cgatcgttca    13140

aacatttggc aataaagttt cttaagattg aatcctgttg ccggtcttgc gatgattatc    13200

atataatttc tgttgaatta cgttaagcat gtaataatta acatgtaatg catgacgtta    13260

tttatgagat gggtttttat gattagagtc ccgcaattat acatttaata cgcgatagaa    13320

aacaaaatat agcgcgcaaa ctaggataaa ttatcgcgcg cggtgtcatc tatgttacta    13380

gatcgggaat tgg                                                       13393


<210>  58
<211>  14662
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, pAG2029

<400>  58
aattcatact aaagcttgca tgcctgcagg tcgactctag taacggccgc cagtgtgctg       60

gaattaattc ggcttgtcga ccacccaacc ccatatcgac agaggatgtg aagaacaggt      120

aaatcacgca gaagaaccca tctctgatag cagctatcga ttagaacaac gaatccatat      180

tgggtccgtg ggaaatactt actgcacagg aagggggcga tctgacgagg ccccgccacc      240

ggcctcgacc cgaggccgag gccgacgaag cgccggcgag tacggcgccg cggcggcctc      300

tgcccgtgcc ctctgcgcgt gggagggaga ggccgcggtg gtgggggcgc gcgcgcgcgc      360

gcgcgcagct ggtgcggcgg cgcgggggtc agccgccgag ccggcggcga cggaggagca      420

gggcggcgtg gacgcgaact tccgatcggt tggtcagagt gcgcgagttg ggcttagcca      480

attaggtctc aacaatctat tgggccgtaa aattcatggg ccctggtttg tctaggccca      540

atatcccgtt catttcagcc cacaaatatt tccccagagg attattaagg cccacacgca      600

gcttatagca gatcaagtac gatgtttcct gatcgttgga tcggaaacgt acggtcttga      660

tcaggcatgc cgacttcgtc aaagagaggc ggcatgacct gacgcggagt tggttccggg      720

caccgtctgg atggtcgtac cgggaccgga cacgtgtcgc gcctccaact acatggacac      780

gtgtggtgct gccattgggc cgtacgcgtg gcggtgaccg caccggatgc tgcctcgcac      840

cgccttgccc acgctttata tagagaggtt ttctctccat taatcgcata gcgagtcgaa      900

tcgaccgaag gggaggggga gcgaagcttt gcgttctcta atcgcctcgt caaggtaact      960

aatcaatcac ctcgtcctaa tcctcgaatc tctcgtggtg cccgtctaat ctcgcgattt     1020

tgatgctcgt ggtggaaagc gtaggaggat cccgtgcgag ttagtctcaa tctctcaggg     1080

tttcgtgcga ttttagggtg atccacctct taatcgagtt acggtttcgt gcgattttag     1140

ggtaatcctc ttaatctctc attgatttag ggtttcgtga gaatcgaggt agggatctgt     1200

gttatttata tcgatctaat agatggattg gttttgagat tgttctgtca gatggggatt     1260

gtttcgatat attaccctaa tgatgtgtca gatggggatt gtttcgatat attaccctaa     1320

tgatgtgtca gatggggatt gtttcgatat attaccctaa tgatggataa taagagtagt     1380

tcacagttat gttttgatcc tgccacatag tttgagtttt gtgatcagat ttagttttac     1440

ttatttgtgc ttagttcgga tgggattgtt ctgatattgt tccaatagat gaatagctcg     1500

ttaggttaaa atctttaggt tgagttaggc gacacatagt ttatttcctc tggatttgga     1560

ttggaattgt gttcttagtt tttttcccct ggatttggat tggaattgtg tggagctggg     1620

ttagagaatt acatctgtat cgtgtacacc tacttgaact gtagagcttg ggttctaagg     1680

tcaatttaat ctgtattgta tctggctctt tgcctagttg aactgtagtg ctgatgttgt     1740

actgtgtttt tttacccgtt ttatttgctt tactcgtgca aatcaaatct gtcagatgct     1800

agaactaggt ggctttattc tgtgttctta catagatctg ttgtcctgta gttacttatg     1860

tcagttttgt tattatctga agatattttt ggttgttgct tgttgatgtg gtgtgagctg     1920

tgagcagcgc tcttatgatt aatgatgctg tccaattgta gtgtagtatg atgtgattga     1980

tatgttcatc tattttgagc tgacagtacc gatatcgtag gatctggtgc caacttattc     2040

tccagctgct tttttttacc tatgttaatt ccaatccttt cttgcctctt ccagatccag     2100

ataatgcaga aactcattaa ctcagtgcaa aactatgcct ggggcagcaa aacggcgttg     2160

actgaacttt atggtatgga aaatccgtcc agccagccga tggccgagct gtggatgggc     2220

gcacatccga aaagcagttc acgagtgcag aatgccgccg gagatatcgt ttcactgcgt     2280

gatgtgattg agagtgataa atcgactctg ctcggagagg ccgttgccaa acgctttggc     2340

gaactgcctt tcctgttcaa agtattatgc gcagcacagc cactctccat tcaggttcat     2400

ccaaacaaac acaattctga aatcggtttt gccaaagaaa atgccgcagg tatcccgatg     2460

gatgccgccg agcgtaacta taaagatcct aaccacaagc cggagctggt ttttgcgctg     2520

acgcctttcc ttgcgatgaa cgcgtttcgt gaattttccg agattgtctc cctactccag     2580

ccggtcgcag gtgcacatcc ggcgattgct cactttttac aacagcctga tgccgaacgt     2640

ttaagcgaac tgttcgccag cctgttgaat atgcagggtg aagaaaaatc ccgcgcgctg     2700

gcgattttaa aatcggccct cgatagccag cagggtgaac cgtggcaaac gattcgttta     2760

atttctgaat tttacccgga agacagcggt ctgttctccc cgctattgct gaatgtggtg     2820

aaattgaacc ctggcgaagc gatgttcctg ttcgctgaaa caccgcacgc ttacctgcaa     2880

ggcgtggcgc tggaagtgat ggcaaactcc gataacgtgc tgcgtgcggg tctgacgcct     2940

aaatacattg atattccgga actggttgcc aatgtgaaat tcgaagccaa accggctaac     3000

cagttgttga cccagccggt gaaacaaggt gcagaactgg acttcccgat tccagtggat     3060

gattttgcct tctcgctgca tgaccttagt gataaagaaa ccaccattag ccagcagagt     3120

gccgccattt tgttctgcgt cgaaggcgat gcaacgttgt ggaaaggttc tcagcagtta     3180

cagcttaaac cgggtgaatc agcgtttatt gccgccaacg aatcaccggt gactgtcaaa     3240

ggccacggcc gtttagcgcg tgtttacaac aagctgtaag agcttactga aaaaattaac     3300

atctcttgct aagctgggag ctctagatcc ccgaatttcc ccgatcgttc aaacatttgg     3360

caataaagtt tcttaagatt gaatcctgtt gccggtcttg cgatgattat catataattt     3420

ctgttgaatt acgttaagca tgtaataatt aacatgtaat gcatgacgtt atttatgaga     3480

tgggttttta tgattagagt cccgcaatta tacatttaat acgcgataga aaacaaaata     3540

tagcgcgcaa actaggataa attatcgcgc gcggtgtcat ctatgttact agatcgggaa     3600

ttggcgagct cgaattaatt cagtacatta aaaacgtccg caatgtgtta ttaagttgtc     3660

taagcgtcaa tttgtttaca ccacaatata tcctgccacc agccagccaa cagctccccg     3720

accggcagct cggcacaaaa tcaccactcg atacaggcag cccatcagtc cgggacggcg     3780

tcagcgggag agccgttgta aggcggcaga ctttgctcat gttaccgatg ctattcggaa     3840

gaacggcaac taagctgccg ggtttgaaac acggatgatc tcgcggaggg tagcatgttg     3900

attgtaacga tgacagagcg ttgctgcctg tgatcaaata tcatctccct cgcagagatc     3960

cgaattatca gccttcttat tcatttctcg cttaaccgtg acaggctgtc gatcttgaga     4020

actatgccga cataatagga aatcgctgga taaagccgct gaggaagctg agtggcgcta     4080

tttctttaga agtgaacgtt gacgatcgtc gaccgtaccc cgatgaatta attcggacgt     4140

acgttctgaa cacagctgga tacttacttg ggcgattgtc atacatgaca tcaacaatgt     4200

acccgtttgt gtaaccgtct cttggaggtt cgtatgacac tagtggttcc cctcagcttg     4260

cgactagatg ttgaggccta acattttatt agagagcagg ctagttgctt agatacatga     4320

tcttcaggcc gttatctgtc agggcaagcg aaaattggcc atttatgacg accaatgccc     4380

cgcagaagct cccatctttg ccgccataga cgccgcgccc cccttttggg gtgtagaaca     4440

tccttttgcc agatgtggaa aagaagttcg ttgtcccatt gttggcaatg acgtagtagc     4500

cggcgaaagt gcgagaccca tttgcgctat atataagcct acgatttccg ttgcgactat     4560

tgtcgtaatt ggatgaacta ttatcgtagt tgctctcaga gttgtcgtaa tttgatggac     4620

tattgtcgta attgcttatg gagttgtcgt agttgcttgg agaaatgtcg tagttggatg     4680

gggagtagtc atagggaaga cgagcttcat ccactaaaac aattggcagg tcagcaagtg     4740

cctgccccga tgccatcgca agtacgaggc ttagaaccac cttcaacaga tcgcgcatag     4800

tcttccccag ctctctaacg cttgagttaa gccgcgccgc gaagcggcgt cggcttgaac     4860

gaattgttag acattatttg ccgactacct tggtgatctc gcctttcacg tagtgaacaa     4920

attcttccaa ctgatctgcg cgcgaggcca agcgatcttc ttgtccaaga taagcctgcc     4980

tagcttcaag tatgacgggc tgatactggg ccggcaggcg ctccattgcc cagtcggcag     5040

cgacatcctt cggcgcgatt ttgccggtta ctgcgctgta ccaaatgcgg gacaacgtaa     5100

gcactacatt tcgctcatcg ccagcccagt cgggcggcga gttccatagc gttaaggttt     5160

catttagcgc ctcaaataga tcctgttcag gaaccggatc aaagagttcc tccgccgctg     5220

gacctaccaa ggcaacgcta tgttctcttg cttttgtcag caagatagcc agatcaatgt     5280

cgatcgtggc tggctcgaag atacctgcaa gaatgtcatt gcgctgccat tctccaaatt     5340

gcagttcgcg cttagctgga taacgccacg gaatgatgtc gtcgtgcaca acaatggtga     5400

cttctacagc gcggagaatc tcgctctctc caggggaagc cgaagtttcc aaaaggtcgt     5460

tgatcaaagc tcgccgcgtt gtttcatcaa gccttacggt caccgtaacc agcaaatcaa     5520

tatcactgtg tggcttcagg ccgccatcca ctgcggagcc gtacaaatgt acggccagca     5580

acgtcggttc gagatggcgc tcgatgacgc caactacctc tgatagttga gtcgatactt     5640

cggcgatcac cgcttccctc atgatgttta actcctgaat taagccgcgc cgcgaagcgg     5700

tgtcggcttg aatgaattgt taggcgtcat cctgtgctcc cgagaaccag taccagtaca     5760

tcgctgtttc gttcgagact tgaggtctag ttttatacgt gaacaggtca atgccgccga     5820

gagtaaagcc acattttgcg tacaaattgc aggcaggtac attgttcgtt tgtgtctcta     5880

atcgtatgcc aaggagctgt ctgcttagtg cccacttttt cgcaaattcg atgagactgt     5940

gcgcgactcc tttgcctcgg tgcgtgtgcg acacaacaat gtgttcgata gaggctagat     6000

cgttccatgt tgagttgagt tcaatcttcc cgacaagctc ttggtcgatg aatgcgccat     6060

agcaagcaga gtcttcatca gagtcatcat ccgagatgta atccttccgg taggggctca     6120

cacttctggt agatagttca aagccttggt cggataggtg cacatcgaac acttcacgaa     6180

caatgaaatg gttctcagca tccaatgttt ccgccacctg ctcagggatc accgaaatct     6240

tcatatgacg cctaacgcct ggcacagcgg atcgcaaacc tggcgcggct tttggcacaa     6300

aaggcgtgac aggtttgcga atccgttgct gccacttgtt aacccttttg ccagatttgg     6360

taactataat ttatgttaga ggcgaagtct tgggtaaaaa ctggcctaaa attgctgggg     6420

atttcaggaa agtaaacatc accttccggc tcgatgtcta ttgtagatat atgtagtgta     6480

tctacttgat cgggggatct gctgcctcgc gcgtttcggt gatgacggtg aaaacctctg     6540

acacatgcag ctcccggaga cggtcacagc ttgtctgtaa gcggatgccg ggagcagaca     6600

agcccgtcag ggcgcgtcag cgggtgttgg cgggtgtcgg ggcgcagcca tgacccagtc     6660

acgtagcgat agcggagtgt atactggctt aactatgcgg catcagagca gattgtactg     6720

agagtgcacc atatgcggtg tgaaataccg cacagatgcg taaggagaaa ataccgcatc     6780

aggcgctctt ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg gctgcggcga     6840

gcggtatcag ctcactcaaa ggcggtaata cggttatcca cagaatcagg ggataacgca     6900

ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg     6960

ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt     7020

cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc tggaagctcc     7080

ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct     7140

tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atctcagttc ggtgtaggtc     7200

gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta     7260

tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc actggcagca     7320

gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga gttcttgaag     7380

tggtggccta actacggcta cactagaagg acagtatttg gtatctgcgc tctgctgaag     7440

ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac caccgctggt     7500

agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg atctcaagaa     7560

gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc acgttaaggg     7620

attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga     7680

agttttaaat caatctaaag tatatatgag taaacttggt ctgacagtta ccaatgctta     7740

atcagtgagg cacctatctc agcgatctgt ctatttcgtt catccatagt tgcctgactc     7800

cccgtcgtgt agataactac gatacgggag ggcttaccat ctggccccag tgctgcaatg     7860

ataccgcgag acccacgctc accggctcca gatttatcag caataaacca gccagccgga     7920

agggccgagc gcagaagtgg tcctgcaact ttatccgcct ccatccagtc tattaattgt     7980

tgccgggaag ctagagtaag tagttcgcca gttaatagtt tgcgcaacgt tgttgccatt     8040

gctgcagggg gggggggggg ggggttccat tgttcattcc acggacaaaa acagagaaag     8100

gaaacgacag aggccaaaaa gctcgctttc agcacctgtc gtttcctttc ttttcagagg     8160

gtattttaaa taaaaacatt aagttatgac gaagaagaac ggaaacgcct taaaccggaa     8220

aattttcata aatagcgaaa acccgcgagg tcgccgcccc gtaacctgtc ggatcaccgg     8280

aaaggacccg taaagtgata atgattatca tctacatatc acaacgtgcg tggaggccat     8340

caaaccacgt caaataatca attatgacgc aggtatcgta ttaattgatc tgcatcaact     8400

taacgtaaaa acaacttcag acaatacaaa tcagcgacac tgaatacggg gcaacctcat     8460

gtcccccccc ccccccccct gcaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt     8520

cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa     8580

aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat     8640

cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct     8700

tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga     8760

gttgctcttg cccggcgtca acacgggata ataccgcgcc acatagcaga actttaaaag     8820

tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga     8880

gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca     8940

ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg     9000

cgacacggaa atgttgaata ctcatactct tcctttttca atattattga agcatttatc     9060

agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag     9120

gggttccgcg cacatttccc cgaaaagtgc cacctgacgt ctaagaaacc attattatca     9180

tgacattaac ctataaaaat aggcgtatca cgaggccctt tcgtcttcaa gaattggtcg     9240

acgatcttgc tgcgttcgga tattttcgtg gagttcccgc cacagacccg gattgaaggc     9300

gagatccagc aactcgcgcc agatcatcct gtgacggaac tttggcgcgt gatgactggc     9360

caggacgtcg gccgaaagag cgacaagcag atcacgcttt tcgacagcgt cggatttgcg     9420

atcgaggatt tttcggcgct gcgctacgtc cgcgaccgcg ttgagggatc aagccacagc     9480

agcccactcg accttctagc cgacccagac gagccaaggg atctttttgg aatgctgctc     9540

cgtcgtcagg ctttccgacg tttgggtggt tgaacagaag tcattatcgc acggaatgcc     9600

aagcactccc gaggggaacc ctgtggttgg catgcacata caaatggacg aacggataaa     9660

ccttttcacg cccttttaaa tatccgatta ttctaataaa cgctcttttc tcttaggttt     9720

acccgccaat atatcctgtc aaacactgat agtttaaact gaaggcggga aacgacaacc     9780

tgatcatgag cggagaatta agggagtcac gttatgaccc ccgccgatga cgcgggacaa     9840

gccgttttac gtttggaact gacagaaccg caacgttgaa ggagccactc agcttaatta     9900

agtctaactc gagttactgg tacgtaccaa atccatggaa tcaaggtacc gtcgactcta     9960

gtaacggccg ccagtgtgct ggaattaatt cggcttgtcg accacccaac cccatatcga    10020

cagaggatgt gaagaacagg taaatcacgc agaagaaccc atctctgata gcagctatcg    10080

attagaacaa cgaatccata ttgggtccgt gggaaatact tactgcacag gaagggggcg    10140

atctgacgag gccccgccac cggcctcgac ccgaggccga ggccgacgaa gcgccggcga    10200

gtacggcgcc gcggcggcct ctgcccgtgc cctctgcgcg tgggagggag aggccgcggt    10260

ggtgggggcg cgcgcgcgcg cgcgcgcagc tggtgcggcg gcgcgggggt cagccgccga    10320

gccggcggcg acggaggagc agggcggcgt ggacgcgaac ttccgatcgg ttggtcagag    10380

tgcgcgagtt gggcttagcc aattaggtct caacaatcta ttgggccgta aaattcatgg    10440

gccctggttt gtctaggccc aatatcccgt tcatttcagc ccacaaatat ttccccagag    10500

gattattaag gcccacacgc agcttatagc agatcaagta cgatgtttcc tgatcgttgg    10560

atcggaaacg tacggtcttg atcaggcatg ccgacttcgt caaagagagg cggcatgacc    10620

tgacgcggag ttggttccgg gcaccgtctg gatggtcgta ccgggaccgg acacgtgtcg    10680

cgcctccaac tacatggaca cgtgtggtgc tgccattggg ccgtacgcgt ggcggtgacc    10740

gcaccggatg ctgcctcgca ccgccttgcc cacgctttat atagagaggt tttctctcca    10800

ttaatcgcat agcgagtcga atcgaccgaa ggggaggggg agcgaagctt tgcgttctct    10860

aatcgcctcg tcaaggtaac taatcaatca cctcgtccta atcctcgaat ctctcgtggt    10920

gcccgtctaa tctcgcgatt ttgatgctcg tggtggaaag cgtaggagga tcccgtgcga    10980

gttagtctca atctctcagg gtttcgtgcg attttagggt gatccacctc ttaatcgagt    11040

tacggtttcg tgcgatttta gggtaatcct cttaatctct cattgattta gggtttcgtg    11100

agaatcgagg tagggatctg tgttatttat atcgatctaa tagatggatt ggttttgaga    11160

ttgttctgtc agatggggat tgtttcgata tattacccta atgatgtgtc agatggggat    11220

tgtttcgata tattacccta atgatgtgtc agatggggat tgtttcgata tattacccta    11280

atgatggata ataagagtag ttcacagtta tgttttgatc ctgccacata gtttgagttt    11340

tgtgatcaga tttagtttta cttatttgtg cttagttcgg atgggattgt tctgatattg    11400

ttccaataga tgaatagctc gttaggttaa aatctttagg ttgagttagg cgacacatag    11460

tttatttcct ctggatttgg attggaattg tgttcttagt ttttttcccc tggatttgga    11520

ttggaattgt gtggagctgg gttagagaat tacatctgta tcgtgtacac ctacttgaac    11580

tgtagagctt gggttctaag gtcaatttaa tctgtattgt atctggctct ttgcctagtt    11640

gaactgtagt gctgatgttg tactgtgttt ttttacccgt tttatttgct ttactcgtgc    11700

aaatcaaatc tgtcagatgc tagaactagg tggctttatt ctgtgttctt acatagatct    11760

gttgtcctgt agttacttat gtcagttttg ttattatctg aagatatttt tggttgttgc    11820

ttgttgatgt ggtgtgagct gtgagcagcg ctcttatgat taatgatgct gtccaattgt    11880

agtgtagtat gatgtgattg atatgttcat ctattttgag ctgacagtac cgatatcgta    11940

ggatctggtg ccaacttatt ctccagctgc ttttttttac ctatgttaat tccaatcctt    12000

tcttgcctct tccagatcca gataatggcg aacaaacatt tgtccctctc cctcttcctc    12060

gtcctccttg gcctgtcggc cagcttggcc tccgggcaac aaacaagcat tactctgaca    12120

tccaacgcat ccggtacgtt tgacggttac tattacgaac tctggaagga tactggcaat    12180

acaacaatga cggtctacac tcaaggtcgc ttttcctgcc agtggtcgaa catcaataac    12240

gcgttgttta ggaccgggaa gaaatacaac cagaattggc agtctcttgg cacaatccgg    12300

atcacgtact ctgcgactta caacccaaac gggaactcct acttgtgtat ctatggctgg    12360

tctaccaacc cattggtcga gttctacatc gttgagtcct gggggaactg gagaccgcct    12420

ggtgcctgcc tggccgaggg ctcgctcgtc ttggacgcgg ctaccgggca gagggtccct    12480

atcgaaaagg tgcgtccggg gatggaagtt ttctccttgg gacctgatta cagactgtat    12540

cgggtgcccg ttttggaggt ccttgagagc ggggttaggg aagttgtgcg cctcagaact    12600

cggtcaggga gaacgctggt gttgacacca gatcacccgc ttttgacccc cgaaggttgg    12660

aaacctcttt gtgacctccc gcttggaact ccaattgcag tccccgcaga actgcctgtg    12720

gcgggccact tggccccacc tgaagaacgt gttacgctcc tggctcttct gttgggggat    12780

gggaacacaa agctgtcggg tcggagaggt acacgtccta atgcctcctt ctacagcaaa    12840

gaccccgaat tgctcgcggc ttatcgccgg tgtgcagaag ccttgggtgc aaaggtgaaa    12900

gcatacgtcc acccgactac gggggtggtt acactcgcaa ccctcgctcc acgtcctgga    12960

gctcaagatc ctgtcaaacg cctcgttgtc gaggcgggaa tggttgctaa agccgaagag    13020

aagagggtcc cggaggaggt gtttcgttac cggcgtgagg cgttggccct tttcttgggc    13080

cgtttgttct cgacagacgg ctctgttgaa aagaagagga tctcttattc aagtgccagt    13140

ttgggactgg cccaggatgt cgcacatctc ttgctgcgcc ttggaattag atctcaactc    13200

cgttcgagag ggccacgggc tcacgaggtt cttatatcgg gccgcgagga tattttgcga    13260

tttgctgaac ttatcggacc ctacctcttg ggggccaaga gggagagact tgcagcgctg    13320

gaagctgagg cccgcaggcg tttgcctgga cagggatggc acttgcggct tgttcttcct    13380

gccgtggcgt acagagtgag cgaggctaaa aggcgctcgg gattttcgtg gagtgaagcc    13440

ggtcggcgcg tcgcagttgc gggatcgtgt ttgtcatctg gactcaacct caaattgccc    13500

agacgctacc tttctcggca ccggttgtcg ctgctcggtg aggcttttgc cgaccctggg    13560

ctggaagcgc tcgcggaagg ccaagtgctc tgggacccta ttgttgctgt cgaaccggcc    13620

ggtaaggcga gaacattcga cttgcgcgtt ccaccctttg caaacttcgt gagcgaggac    13680

ctggtggtgc ataacacgtc cctgggccaa gtgacaatcg atggcgggac ctacgacatc    13740

tataggacga cacgcgtcaa ccagccttcc attgtgggga cagccacgtt cgatcagtac    13800

tggagcgtgc gcacctctaa gcggacttca ggaacagtga ccgtgaccga tcacttccgc    13860

gcctgggcga accggggcct gaacctcggc acaatagacc aaattacatt gtgcgtggag    13920

ggttaccaaa gctctggatc agccaacatc acccagaaca ccttctctca gggctcttct    13980

tccggcagtt cgggtggctc atccggctcc acaacgacta ctcgcatcga gtgtgagaac    14040

atgtccttgt ccggacccta cgttagcagg atcaccaatc cctttaatgg tattgcgctg    14100

tacgccaacg gagacacagc ccgcgctacc gttaacttcc ccgcaagtcg caactacaat    14160

ttccgcctgc ggggttgcgg caacaacaat aatcttgccc gtgtggacct gaggatcgac    14220

ggacggaccg tcgggacctt ttattaccag ggcacatacc cctgggaggc cccaattgac    14280

aatgtttatg tcagtgcggg gagtcataca gtcgaaatca ctgttactgc ggataacggc    14340

acatgggacg tgtatgccga ctacctggtg atacagtgac ctaggtcccc gaatttcccc    14400

gatcgttcaa acatttggca ataaagtttc ttaagattga atcctgttgc cggtcttgcg    14460

atgattatca tataatttct gttgaattac gttaagcatg taataattaa catgtaatgc    14520

atgacgttat ttatgagatg ggtttttatg attagagtcc cgcaattata catttaatac    14580

gcgatagaaa acaaaatata gcgcgcaaac taggataaat tatcgcgcgc ggtgtcatct    14640

atgttactag atcgggaatt gg                                             14662


<210>  59
<211>  2286
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, P77T134-100-101 Nucleotide sequence

<400>  59
atgcaaacaa gcattactct gacatccaac gcatccggta cgtttgacgg ttactattac       60

gaactctgga aggatactgg caatacaaca atgacggtct acactcaagg tcgcttttcc      120

tgccagtggt cgaacatcaa taacgcgttg tttaggaccg ggaagaaata caaccagaat      180

tggcagtctc ttggcacaat ccggatcacg tactctgcga cttacaaccc aaacgggaac      240

tcctacttgt gtatctatgg ctggtctacc aacccattgg tcgagttcta catcgttgag      300

tcctggggga actggagacc gcctggtgcc tgcctggccg agggctcgct cgtcttggac      360

gcggctaccg ggcagagggt ccctatcgaa aaggtgcgtc cggggatgga agttttctcc      420

ttgggacctg attacagact gtatcgggtg cccgttttgg aggtccttga gagcggggtt      480

agggaagttg tgcgcctcag aactcggtca gggagaacgc tggtgttgac accagatcac      540

ccgcttttga cccccgaagg ttggaaacct ctttgtgacc tcccgcttgg aactccaatt      600

gcagtccccg cagaactgcc tgtggcgggc cacttggccc cacctgaaga acgtgttacg      660

ctcctggctc ttctgttggg ggatgggaac acaaagctgt cgggtcggag aggtacacgt      720

cctaatgcct tcttctacag caaaaacccc gaattgctcg cggcttatcg ccggtgtgca      780

gaagccttgg gtgcaaaggt gaaagcatac gtccacccga ctacgggggt ggttacactc      840

gcaaccctcg ctccacgtcc tggagctcaa gatcctgtca aacgcctcgt tgtcgaggcg      900

ggaatggttg ctaaagccga agagaagagg gtcccggagg aggtgtttcg ttaccggcgt      960

gaggcgttgg cccttttctt gggccgtttg ttctcgacag acggctctgt tgaaaagaag     1020

aggatctctt attcaagtgc cagtttggga ctggcccagg atgtcgcaca tctcttgctg     1080

cgccttggaa ttacatctca actccgttcg agagggccac gggctcacga ggttcttata     1140

tcgggccgcg aggatatttt gcggtttgct gaacttatcg gaccctacct cttgggggcc     1200

aagagggaga gacttgcagc gctggaagct gaggcccgca ggcgtttgcc tggacaggga     1260

tggcacttgc ggcttgttct tcctgccgtg gcgtacagag tgggcgaggc ggaaaggcgc     1320

tcgggatttt cgtggagtga agccggtcgg cgcgtcgcag ttgcgggatc gtgtttgtca     1380

tctggactca acctcaaatt gcccagacgc tacctttctc ggcaccggtt gtcgctgctc     1440

ggtgaggctt ttgccgaccc tgggctggaa gcgctcgcgg aaggccaagt gctctgggac     1500

cctattgttg ctgtcgaacc ggccggtaag gcgagaacat tcgacttgcg cgttccaccc     1560

tttgcaaact tcgtgagcga ggacctggtg gtgcataaca ccgtccccct gggccaagtg     1620

acaatcgatg gcgggaccta cgacatctat aggacgacac gcgtcaacca gccttccatt     1680

gtggggacag ccacgttcga tcagtactgg agcgtgcgca cctctaagcg gacttcagga     1740

acagtgaccg tgaccgatca cttccgcgcc tgggcgaacc ggggcctgaa cctcggcaca     1800

atagaccaaa ttacattgtg cgtggagggt taccaaagct ctggatcagc caacatcacc     1860

cagaacacct tctctcaggg ctcttcttcc ggcagttcgg gtggctcatc cggctccaca     1920

acgactactc gcatcgagtg tgagaacatg tccttgtccg gaccctacgt tagcaggatc     1980

accaatccct ttaatggtat tgcgctgtac gccaacggag acacagcccg cgctaccgtt     2040

aacttccccg caagtcgcaa ctacaatttc cgcctgcggg gttgcggcaa caacaataat     2100

cttgcccgtg tggacctgag gatcgacgga cggaccgtcg ggacctttta ttaccagggc     2160

acatacccct gggaggcccc aattgacaat gtttatgtca gtgcggggag tcatacagtc     2220

gaaatcactg ttactgcgga taacggcaca tgggacgtgt atgccgacta cctggtgata     2280

cagtga                                                                2286


<210>  60
<211>  761
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, P77T134-100-101 Amino acid sequence

<400>  60

Met Gln Thr Ser Ile Thr Leu Thr Ser Asn Ala Ser Gly Thr Phe Asp 
1               5                   10                  15      


Gly Tyr Tyr Tyr Glu Leu Trp Lys Asp Thr Gly Asn Thr Thr Met Thr 
            20                  25                  30          


Val Tyr Thr Gln Gly Arg Phe Ser Cys Gln Trp Ser Asn Ile Asn Asn 
        35                  40                  45              


Ala Leu Phe Arg Thr Gly Lys Lys Tyr Asn Gln Asn Trp Gln Ser Leu 
    50                  55                  60                  


Gly Thr Ile Arg Ile Thr Tyr Ser Ala Thr Tyr Asn Pro Asn Gly Asn 
65                  70                  75                  80  


Ser Tyr Leu Cys Ile Tyr Gly Trp Ser Thr Asn Pro Leu Val Glu Phe 
                85                  90                  95      


Tyr Ile Val Glu Ser Trp Gly Asn Trp Arg Pro Pro Gly Ala Cys Leu 
            100                 105                 110         


Ala Glu Gly Ser Leu Val Leu Asp Ala Ala Thr Gly Gln Arg Val Pro 
        115                 120                 125             


Ile Glu Lys Val Arg Pro Gly Met Glu Val Phe Ser Leu Gly Pro Asp 
    130                 135                 140                 


Tyr Arg Leu Tyr Arg Val Pro Val Leu Glu Val Leu Glu Ser Gly Val 
145                 150                 155                 160 


Arg Glu Val Val Arg Leu Arg Thr Arg Ser Gly Arg Thr Leu Val Leu 
                165                 170                 175     


Thr Pro Asp His Pro Leu Leu Thr Pro Glu Gly Trp Lys Pro Leu Cys 
            180                 185                 190         


Asp Leu Pro Leu Gly Thr Pro Ile Ala Val Pro Ala Glu Leu Pro Val 
        195                 200                 205             


Ala Gly His Leu Ala Pro Pro Glu Glu Arg Val Thr Leu Leu Ala Leu 
    210                 215                 220                 


Leu Leu Gly Asp Gly Asn Thr Lys Leu Ser Gly Arg Arg Gly Thr Arg 
225                 230                 235                 240 


Pro Asn Ala Phe Phe Tyr Ser Lys Asn Pro Glu Leu Leu Ala Ala Tyr 
                245                 250                 255     


Arg Arg Cys Ala Glu Ala Leu Gly Ala Lys Val Lys Ala Tyr Val His 
            260                 265                 270         


Pro Thr Thr Gly Val Val Thr Leu Ala Thr Leu Ala Pro Arg Pro Gly 
        275                 280                 285             


Ala Gln Asp Pro Val Lys Arg Leu Val Val Glu Ala Gly Met Val Ala 
    290                 295                 300                 


Lys Ala Glu Glu Lys Arg Val Pro Glu Glu Val Phe Arg Tyr Arg Arg 
305                 310                 315                 320 


Glu Ala Leu Ala Leu Phe Leu Gly Arg Leu Phe Ser Thr Asp Gly Ser 
                325                 330                 335     


Val Glu Lys Lys Arg Ile Ser Tyr Ser Ser Ala Ser Leu Gly Leu Ala 
            340                 345                 350         


Gln Asp Val Ala His Leu Leu Leu Arg Leu Gly Ile Thr Ser Gln Leu 
        355                 360                 365             


Arg Ser Arg Gly Pro Arg Ala His Glu Val Leu Ile Ser Gly Arg Glu 
    370                 375                 380                 


Asp Ile Leu Arg Phe Ala Glu Leu Ile Gly Pro Tyr Leu Leu Gly Ala 
385                 390                 395                 400 


Lys Arg Glu Arg Leu Ala Ala Leu Glu Ala Glu Ala Arg Arg Arg Leu 
                405                 410                 415     


Pro Gly Gln Gly Trp His Leu Arg Leu Val Leu Pro Ala Val Ala Tyr 
            420                 425                 430         


Arg Val Gly Glu Ala Glu Arg Arg Ser Gly Phe Ser Trp Ser Glu Ala 
        435                 440                 445             


Gly Arg Arg Val Ala Val Ala Gly Ser Cys Leu Ser Ser Gly Leu Asn 
    450                 455                 460                 


Leu Lys Leu Pro Arg Arg Tyr Leu Ser Arg His Arg Leu Ser Leu Leu 
465                 470                 475                 480 


Gly Glu Ala Phe Ala Asp Pro Gly Leu Glu Ala Leu Ala Glu Gly Gln 
                485                 490                 495     


Val Leu Trp Asp Pro Ile Val Ala Val Glu Pro Ala Gly Lys Ala Arg 
            500                 505                 510         


Thr Phe Asp Leu Arg Val Pro Pro Phe Ala Asn Phe Val Ser Glu Asp 
        515                 520                 525             


Leu Val Val His Asn Thr Val Pro Leu Gly Gln Val Thr Ile Asp Gly 
    530                 535                 540                 


Gly Thr Tyr Asp Ile Tyr Arg Thr Thr Arg Val Asn Gln Pro Ser Ile 
545                 550                 555                 560 


Val Gly Thr Ala Thr Phe Asp Gln Tyr Trp Ser Val Arg Thr Ser Lys 
                565                 570                 575     


Arg Thr Ser Gly Thr Val Thr Val Thr Asp His Phe Arg Ala Trp Ala 
            580                 585                 590         


Asn Arg Gly Leu Asn Leu Gly Thr Ile Asp Gln Ile Thr Leu Cys Val 
        595                 600                 605             


Glu Gly Tyr Gln Ser Ser Gly Ser Ala Asn Ile Thr Gln Asn Thr Phe 
    610                 615                 620                 


Ser Gln Gly Ser Ser Ser Gly Ser Ser Gly Gly Ser Ser Gly Ser Thr 
625                 630                 635                 640 


Thr Thr Thr Arg Ile Glu Cys Glu Asn Met Ser Leu Ser Gly Pro Tyr 
                645                 650                 655     


Val Ser Arg Ile Thr Asn Pro Phe Asn Gly Ile Ala Leu Tyr Ala Asn 
            660                 665                 670         


Gly Asp Thr Ala Arg Ala Thr Val Asn Phe Pro Ala Ser Arg Asn Tyr 
        675                 680                 685             


Asn Phe Arg Leu Arg Gly Cys Gly Asn Asn Asn Asn Leu Ala Arg Val 
    690                 695                 700                 


Asp Leu Arg Ile Asp Gly Arg Thr Val Gly Thr Phe Tyr Tyr Gln Gly 
705                 710                 715                 720 


Thr Tyr Pro Trp Glu Ala Pro Ile Asp Asn Val Tyr Val Ser Ala Gly 
                725                 730                 735     


Ser His Thr Val Glu Ile Thr Val Thr Ala Asp Asn Gly Thr Trp Asp 
            740                 745                 750         


Val Tyr Ala Asp Tyr Leu Val Ile Gln 
        755                 760     


<210>  61
<211>  2358
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, BAASS:P77T134-100-101 Nucleotide sequence

<400>  61
atggcgaaca aacatttgtc cctctccctc ttcctcgtcc tccttggcct gtcggccagc       60

ttggcctccg ggcaacaaac aagcattact ctgacatcca acgcatccgg tacgtttgac      120

ggttactatt acgaactctg gaaggatact ggcaatacaa caatgacggt ctacactcaa      180

ggtcgctttt cctgccagtg gtcgaacatc aataacgcgt tgtttaggac cgggaagaaa      240

tacaaccaga attggcagtc tcttggcaca atccggatca cgtactctgc gacttacaac      300

ccaaacggga actcctactt gtgtatctat ggctggtcta ccaacccatt ggtcgagttc      360

tacatcgttg agtcctgggg gaactggaga ccgcctggtg cctgcctggc cgagggctcg      420

ctcgtcttgg acgcggctac cgggcagagg gtccctatcg aaaaggtgcg tccggggatg      480

gaagttttct ccttgggacc tgattacaga ctgtatcggg tgcccgtttt ggaggtcctt      540

gagagcgggg ttagggaagt tgtgcgcctc agaactcggt cagggagaac gctggtgttg      600

acaccagatc acccgctttt gacccccgaa ggttggaaac ctctttgtga cctcccgctt      660

ggaactccaa ttgcagtccc cgcagaactg cctgtggcgg gccacttggc cccacctgaa      720

gaacgtgtta cgctcctggc tcttctgttg ggggatggga acacaaagct gtcgggtcgg      780

agaggtacac gtcctaatgc cttcttctac agcaaaaacc ccgaattgct cgcggcttat      840

cgccggtgtg cagaagcctt gggtgcaaag gtgaaagcat acgtccaccc gactacgggg      900

gtggttacac tcgcaaccct cgctccacgt cctggagctc aagatcctgt caaacgcctc      960

gttgtcgagg cgggaatggt tgctaaagcc gaagagaaga gggtcccgga ggaggtgttt     1020

cgttaccggc gtgaggcgtt ggcccttttc ttgggccgtt tgttctcgac agacggctct     1080

gttgaaaaga agaggatctc ttattcaagt gccagtttgg gactggccca ggatgtcgca     1140

catctcttgc tgcgccttgg aattacatct caactccgtt cgagagggcc acgggctcac     1200

gaggttctta tatcgggccg cgaggatatt ttgcggtttg ctgaacttat cggaccctac     1260

ctcttggggg ccaagaggga gagacttgca gcgctggaag ctgaggcccg caggcgtttg     1320

cctggacagg gatggcactt gcggcttgtt cttcctgccg tggcgtacag agtgggcgag     1380

gcggaaaggc gctcgggatt ttcgtggagt gaagccggtc ggcgcgtcgc agttgcggga     1440

tcgtgtttgt catctggact caacctcaaa ttgcccagac gctacctttc tcggcaccgg     1500

ttgtcgctgc tcggtgaggc ttttgccgac cctgggctgg aagcgctcgc ggaaggccaa     1560

gtgctctggg accctattgt tgctgtcgaa ccggccggta aggcgagaac attcgacttg     1620

cgcgttccac cctttgcaaa cttcgtgagc gaggacctgg tggtgcataa caccgtcccc     1680

ctgggccaag tgacaatcga tggcgggacc tacgacatct ataggacgac acgcgtcaac     1740

cagccttcca ttgtggggac agccacgttc gatcagtact ggagcgtgcg cacctctaag     1800

cggacttcag gaacagtgac cgtgaccgat cacttccgcg cctgggcgaa ccggggcctg     1860

aacctcggca caatagacca aattacattg tgcgtggagg gttaccaaag ctctggatca     1920

gccaacatca cccagaacac cttctctcag ggctcttctt ccggcagttc gggtggctca     1980

tccggctcca caacgactac tcgcatcgag tgtgagaaca tgtccttgtc cggaccctac     2040

gttagcagga tcaccaatcc ctttaatggt attgcgctgt acgccaacgg agacacagcc     2100

cgcgctaccg ttaacttccc cgcaagtcgc aactacaatt tccgcctgcg gggttgcggc     2160

aacaacaata atcttgcccg tgtggacctg aggatcgacg gacggaccgt cgggaccttt     2220

tattaccagg gcacataccc ctgggaggcc ccaattgaca atgtttatgt cagtgcgggg     2280

agtcatacag tcgaaatcac tgttactgcg gataacggca catgggacgt gtatgccgac     2340

tacctggtga tacagtga                                                   2358


<210>  62
<211>  785
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, BAASS:P77T134-100-101 Amino acid sequence

<400>  62

Met Ala Asn Lys His Leu Ser Leu Ser Leu Phe Leu Val Leu Leu Gly 
1               5                   10                  15      


Leu Ser Ala Ser Leu Ala Ser Gly Gln Gln Thr Ser Ile Thr Leu Thr 
            20                  25                  30          


Ser Asn Ala Ser Gly Thr Phe Asp Gly Tyr Tyr Tyr Glu Leu Trp Lys 
        35                  40                  45              


Asp Thr Gly Asn Thr Thr Met Thr Val Tyr Thr Gln Gly Arg Phe Ser 
    50                  55                  60                  


Cys Gln Trp Ser Asn Ile Asn Asn Ala Leu Phe Arg Thr Gly Lys Lys 
65                  70                  75                  80  


Tyr Asn Gln Asn Trp Gln Ser Leu Gly Thr Ile Arg Ile Thr Tyr Ser 
                85                  90                  95      


Ala Thr Tyr Asn Pro Asn Gly Asn Ser Tyr Leu Cys Ile Tyr Gly Trp 
            100                 105                 110         


Ser Thr Asn Pro Leu Val Glu Phe Tyr Ile Val Glu Ser Trp Gly Asn 
        115                 120                 125             


Trp Arg Pro Pro Gly Ala Cys Leu Ala Glu Gly Ser Leu Val Leu Asp 
    130                 135                 140                 


Ala Ala Thr Gly Gln Arg Val Pro Ile Glu Lys Val Arg Pro Gly Met 
145                 150                 155                 160 


Glu Val Phe Ser Leu Gly Pro Asp Tyr Arg Leu Tyr Arg Val Pro Val 
                165                 170                 175     


Leu Glu Val Leu Glu Ser Gly Val Arg Glu Val Val Arg Leu Arg Thr 
            180                 185                 190         


Arg Ser Gly Arg Thr Leu Val Leu Thr Pro Asp His Pro Leu Leu Thr 
        195                 200                 205             


Pro Glu Gly Trp Lys Pro Leu Cys Asp Leu Pro Leu Gly Thr Pro Ile 
    210                 215                 220                 


Ala Val Pro Ala Glu Leu Pro Val Ala Gly His Leu Ala Pro Pro Glu 
225                 230                 235                 240 


Glu Arg Val Thr Leu Leu Ala Leu Leu Leu Gly Asp Gly Asn Thr Lys 
                245                 250                 255     


Leu Ser Gly Arg Arg Gly Thr Arg Pro Asn Ala Phe Phe Tyr Ser Lys 
            260                 265                 270         


Asn Pro Glu Leu Leu Ala Ala Tyr Arg Arg Cys Ala Glu Ala Leu Gly 
        275                 280                 285             


Ala Lys Val Lys Ala Tyr Val His Pro Thr Thr Gly Val Val Thr Leu 
    290                 295                 300                 


Ala Thr Leu Ala Pro Arg Pro Gly Ala Gln Asp Pro Val Lys Arg Leu 
305                 310                 315                 320 


Val Val Glu Ala Gly Met Val Ala Lys Ala Glu Glu Lys Arg Val Pro 
                325                 330                 335     


Glu Glu Val Phe Arg Tyr Arg Arg Glu Ala Leu Ala Leu Phe Leu Gly 
            340                 345                 350         


Arg Leu Phe Ser Thr Asp Gly Ser Val Glu Lys Lys Arg Ile Ser Tyr 
        355                 360                 365             


Ser Ser Ala Ser Leu Gly Leu Ala Gln Asp Val Ala His Leu Leu Leu 
    370                 375                 380                 


Arg Leu Gly Ile Thr Ser Gln Leu Arg Ser Arg Gly Pro Arg Ala His 
385                 390                 395                 400 


Glu Val Leu Ile Ser Gly Arg Glu Asp Ile Leu Arg Phe Ala Glu Leu 
                405                 410                 415     


Ile Gly Pro Tyr Leu Leu Gly Ala Lys Arg Glu Arg Leu Ala Ala Leu 
            420                 425                 430         


Glu Ala Glu Ala Arg Arg Arg Leu Pro Gly Gln Gly Trp His Leu Arg 
        435                 440                 445             


Leu Val Leu Pro Ala Val Ala Tyr Arg Val Gly Glu Ala Glu Arg Arg 
    450                 455                 460                 


Ser Gly Phe Ser Trp Ser Glu Ala Gly Arg Arg Val Ala Val Ala Gly 
465                 470                 475                 480 


Ser Cys Leu Ser Ser Gly Leu Asn Leu Lys Leu Pro Arg Arg Tyr Leu 
                485                 490                 495     


Ser Arg His Arg Leu Ser Leu Leu Gly Glu Ala Phe Ala Asp Pro Gly 
            500                 505                 510         


Leu Glu Ala Leu Ala Glu Gly Gln Val Leu Trp Asp Pro Ile Val Ala 
        515                 520                 525             


Val Glu Pro Ala Gly Lys Ala Arg Thr Phe Asp Leu Arg Val Pro Pro 
    530                 535                 540                 


Phe Ala Asn Phe Val Ser Glu Asp Leu Val Val His Asn Thr Val Pro 
545                 550                 555                 560 


Leu Gly Gln Val Thr Ile Asp Gly Gly Thr Tyr Asp Ile Tyr Arg Thr 
                565                 570                 575     


Thr Arg Val Asn Gln Pro Ser Ile Val Gly Thr Ala Thr Phe Asp Gln 
            580                 585                 590         


Tyr Trp Ser Val Arg Thr Ser Lys Arg Thr Ser Gly Thr Val Thr Val 
        595                 600                 605             


Thr Asp His Phe Arg Ala Trp Ala Asn Arg Gly Leu Asn Leu Gly Thr 
    610                 615                 620                 


Ile Asp Gln Ile Thr Leu Cys Val Glu Gly Tyr Gln Ser Ser Gly Ser 
625                 630                 635                 640 


Ala Asn Ile Thr Gln Asn Thr Phe Ser Gln Gly Ser Ser Ser Gly Ser 
                645                 650                 655     


Ser Gly Gly Ser Ser Gly Ser Thr Thr Thr Thr Arg Ile Glu Cys Glu 
            660                 665                 670         


Asn Met Ser Leu Ser Gly Pro Tyr Val Ser Arg Ile Thr Asn Pro Phe 
        675                 680                 685             


Asn Gly Ile Ala Leu Tyr Ala Asn Gly Asp Thr Ala Arg Ala Thr Val 
    690                 695                 700                 


Asn Phe Pro Ala Ser Arg Asn Tyr Asn Phe Arg Leu Arg Gly Cys Gly 
705                 710                 715                 720 


Asn Asn Asn Asn Leu Ala Arg Val Asp Leu Arg Ile Asp Gly Arg Thr 
                725                 730                 735     


Val Gly Thr Phe Tyr Tyr Gln Gly Thr Tyr Pro Trp Glu Ala Pro Ile 
            740                 745                 750         


Asp Asn Val Tyr Val Ser Ala Gly Ser His Thr Val Glu Ile Thr Val 
        755                 760                 765             


Thr Ala Asp Asn Gly Thr Trp Asp Val Tyr Ala Asp Tyr Leu Val Ile 
    770                 775                 780                 


Gln 
785 


<210>  63
<211>  2376
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, BAASS:P77T134-100-101:SEKDEL Nucleotide 
       sequence

<400>  63
atggcgaaca aacatttgtc cctctccctc ttcctcgtcc tccttggcct gtcggccagc       60

ttggcctccg ggcaacaaac aagcattact ctgacatcca acgcatccgg tacgtttgac      120

ggttactatt acgaactctg gaaggatact ggcaatacaa caatgacggt ctacactcaa      180

ggtcgctttt cctgccagtg gtcgaacatc aataacgcgt tgtttaggac cgggaagaaa      240

tacaaccaga attggcagtc tcttggcaca atccggatca cgtactctgc gacttacaac      300

ccaaacggga actcctactt gtgtatctat ggctggtcta ccaacccatt ggtcgagttc      360

tacatcgttg agtcctgggg gaactggaga ccgcctggtg cctgcctggc cgagggctcg      420

ctcgtcttgg acgcggctac cgggcagagg gtccctatcg aaaaggtgcg tccggggatg      480

gaagttttct ccttgggacc tgattacaga ctgtatcggg tgcccgtttt ggaggtcctt      540

gagagcgggg ttagggaagt tgtgcgcctc agaactcggt cagggagaac gctggtgttg      600

acaccagatc acccgctttt gacccccgaa ggttggaaac ctctttgtga cctcccgctt      660

ggaactccaa ttgcagtccc cgcagaactg cctgtggcgg gccacttggc cccacctgaa      720

gaacgtgtta cgctcctggc tcttctgttg ggggatggga acacaaagct gtcgggtcgg      780

agaggtacac gtcctaatgc cttcttctac agcaaaaacc ccgaattgct cgcggcttat      840

cgccggtgtg cagaagcctt gggtgcaaag gtgaaagcat acgtccaccc gactacgggg      900

gtggttacac tcgcaaccct cgctccacgt cctggagctc aagatcctgt caaacgcctc      960

gttgtcgagg cgggaatggt tgctaaagcc gaagagaaga gggtcccgga ggaggtgttt     1020

cgttaccggc gtgaggcgtt ggcccttttc ttgggccgtt tgttctcgac agacggctct     1080

gttgaaaaga agaggatctc ttattcaagt gccagtttgg gactggccca ggatgtcgca     1140

catctcttgc tgcgccttgg aattacatct caactccgtt cgagagggcc acgggctcac     1200

gaggttctta tatcgggccg cgaggatatt ttgcggtttg ctgaacttat cggaccctac     1260

ctcttggggg ccaagaggga gagacttgca gcgctggaag ctgaggcccg caggcgtttg     1320

cctggacagg gatggcactt gcggcttgtt cttcctgccg tggcgtacag agtgggcgag     1380

gcggaaaggc gctcgggatt ttcgtggagt gaagccggtc ggcgcgtcgc agttgcggga     1440

tcgtgtttgt catctggact caacctcaaa ttgcccagac gctacctttc tcggcaccgg     1500

ttgtcgctgc tcggtgaggc ttttgccgac cctgggctgg aagcgctcgc ggaaggccaa     1560

gtgctctggg accctattgt tgctgtcgaa ccggccggta aggcgagaac attcgacttg     1620

cgcgttccac cctttgcaaa cttcgtgagc gaggacctgg tggtgcataa caccgtcccc     1680

ctgggccaag tgacaatcga tggcgggacc tacgacatct ataggacgac acgcgtcaac     1740

cagccttcca ttgtggggac agccacgttc gatcagtact ggagcgtgcg cacctctaag     1800

cggacttcag gaacagtgac cgtgaccgat cacttccgcg cctgggcgaa ccggggcctg     1860

aacctcggca caatagacca aattacattg tgcgtggagg gttaccaaag ctctggatca     1920

gccaacatca cccagaacac cttctctcag ggctcttctt ccggcagttc gggtggctca     1980

tccggctcca caacgactac tcgcatcgag tgtgagaaca tgtccttgtc cggaccctac     2040

gttagcagga tcaccaatcc ctttaatggt attgcgctgt acgccaacgg agacacagcc     2100

cgcgctaccg ttaacttccc cgcaagtcgc aactacaatt tccgcctgcg gggttgcggc     2160

aacaacaata atcttgcccg tgtggacctg aggatcgacg gacggaccgt cgggaccttt     2220

tattaccagg gcacataccc ctgggaggcc ccaattgaca atgtttatgt cagtgcgggg     2280

agtcatacag tcgaaatcac tgttactgcg gataacggca catgggacgt gtatgccgac     2340

tacctggtga tacagagcga gaaggacgag ctgtga                               2376


<210>  64
<211>  791
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, BAASS:P77T134-100-101:SEKDEL Amino acid 
       sequence

<400>  64

Met Ala Asn Lys His Leu Ser Leu Ser Leu Phe Leu Val Leu Leu Gly 
1               5                   10                  15      


Leu Ser Ala Ser Leu Ala Ser Gly Gln Gln Thr Ser Ile Thr Leu Thr 
            20                  25                  30          


Ser Asn Ala Ser Gly Thr Phe Asp Gly Tyr Tyr Tyr Glu Leu Trp Lys 
        35                  40                  45              


Asp Thr Gly Asn Thr Thr Met Thr Val Tyr Thr Gln Gly Arg Phe Ser 
    50                  55                  60                  


Cys Gln Trp Ser Asn Ile Asn Asn Ala Leu Phe Arg Thr Gly Lys Lys 
65                  70                  75                  80  


Tyr Asn Gln Asn Trp Gln Ser Leu Gly Thr Ile Arg Ile Thr Tyr Ser 
                85                  90                  95      


Ala Thr Tyr Asn Pro Asn Gly Asn Ser Tyr Leu Cys Ile Tyr Gly Trp 
            100                 105                 110         


Ser Thr Asn Pro Leu Val Glu Phe Tyr Ile Val Glu Ser Trp Gly Asn 
        115                 120                 125             


Trp Arg Pro Pro Gly Ala Cys Leu Ala Glu Gly Ser Leu Val Leu Asp 
    130                 135                 140                 


Ala Ala Thr Gly Gln Arg Val Pro Ile Glu Lys Val Arg Pro Gly Met 
145                 150                 155                 160 


Glu Val Phe Ser Leu Gly Pro Asp Tyr Arg Leu Tyr Arg Val Pro Val 
                165                 170                 175     


Leu Glu Val Leu Glu Ser Gly Val Arg Glu Val Val Arg Leu Arg Thr 
            180                 185                 190         


Arg Ser Gly Arg Thr Leu Val Leu Thr Pro Asp His Pro Leu Leu Thr 
        195                 200                 205             


Pro Glu Gly Trp Lys Pro Leu Cys Asp Leu Pro Leu Gly Thr Pro Ile 
    210                 215                 220                 


Ala Val Pro Ala Glu Leu Pro Val Ala Gly His Leu Ala Pro Pro Glu 
225                 230                 235                 240 


Glu Arg Val Thr Leu Leu Ala Leu Leu Leu Gly Asp Gly Asn Thr Lys 
                245                 250                 255     


Leu Ser Gly Arg Arg Gly Thr Arg Pro Asn Ala Phe Phe Tyr Ser Lys 
            260                 265                 270         


Asn Pro Glu Leu Leu Ala Ala Tyr Arg Arg Cys Ala Glu Ala Leu Gly 
        275                 280                 285             


Ala Lys Val Lys Ala Tyr Val His Pro Thr Thr Gly Val Val Thr Leu 
    290                 295                 300                 


Ala Thr Leu Ala Pro Arg Pro Gly Ala Gln Asp Pro Val Lys Arg Leu 
305                 310                 315                 320 


Val Val Glu Ala Gly Met Val Ala Lys Ala Glu Glu Lys Arg Val Pro 
                325                 330                 335     


Glu Glu Val Phe Arg Tyr Arg Arg Glu Ala Leu Ala Leu Phe Leu Gly 
            340                 345                 350         


Arg Leu Phe Ser Thr Asp Gly Ser Val Glu Lys Lys Arg Ile Ser Tyr 
        355                 360                 365             


Ser Ser Ala Ser Leu Gly Leu Ala Gln Asp Val Ala His Leu Leu Leu 
    370                 375                 380                 


Arg Leu Gly Ile Thr Ser Gln Leu Arg Ser Arg Gly Pro Arg Ala His 
385                 390                 395                 400 


Glu Val Leu Ile Ser Gly Arg Glu Asp Ile Leu Arg Phe Ala Glu Leu 
                405                 410                 415     


Ile Gly Pro Tyr Leu Leu Gly Ala Lys Arg Glu Arg Leu Ala Ala Leu 
            420                 425                 430         


Glu Ala Glu Ala Arg Arg Arg Leu Pro Gly Gln Gly Trp His Leu Arg 
        435                 440                 445             


Leu Val Leu Pro Ala Val Ala Tyr Arg Val Gly Glu Ala Glu Arg Arg 
    450                 455                 460                 


Ser Gly Phe Ser Trp Ser Glu Ala Gly Arg Arg Val Ala Val Ala Gly 
465                 470                 475                 480 


Ser Cys Leu Ser Ser Gly Leu Asn Leu Lys Leu Pro Arg Arg Tyr Leu 
                485                 490                 495     


Ser Arg His Arg Leu Ser Leu Leu Gly Glu Ala Phe Ala Asp Pro Gly 
            500                 505                 510         


Leu Glu Ala Leu Ala Glu Gly Gln Val Leu Trp Asp Pro Ile Val Ala 
        515                 520                 525             


Val Glu Pro Ala Gly Lys Ala Arg Thr Phe Asp Leu Arg Val Pro Pro 
    530                 535                 540                 


Phe Ala Asn Phe Val Ser Glu Asp Leu Val Val His Asn Thr Val Pro 
545                 550                 555                 560 


Leu Gly Gln Val Thr Ile Asp Gly Gly Thr Tyr Asp Ile Tyr Arg Thr 
                565                 570                 575     


Thr Arg Val Asn Gln Pro Ser Ile Val Gly Thr Ala Thr Phe Asp Gln 
            580                 585                 590         


Tyr Trp Ser Val Arg Thr Ser Lys Arg Thr Ser Gly Thr Val Thr Val 
        595                 600                 605             


Thr Asp His Phe Arg Ala Trp Ala Asn Arg Gly Leu Asn Leu Gly Thr 
    610                 615                 620                 


Ile Asp Gln Ile Thr Leu Cys Val Glu Gly Tyr Gln Ser Ser Gly Ser 
625                 630                 635                 640 


Ala Asn Ile Thr Gln Asn Thr Phe Ser Gln Gly Ser Ser Ser Gly Ser 
                645                 650                 655     


Ser Gly Gly Ser Ser Gly Ser Thr Thr Thr Thr Arg Ile Glu Cys Glu 
            660                 665                 670         


Asn Met Ser Leu Ser Gly Pro Tyr Val Ser Arg Ile Thr Asn Pro Phe 
        675                 680                 685             


Asn Gly Ile Ala Leu Tyr Ala Asn Gly Asp Thr Ala Arg Ala Thr Val 
    690                 695                 700                 


Asn Phe Pro Ala Ser Arg Asn Tyr Asn Phe Arg Leu Arg Gly Cys Gly 
705                 710                 715                 720 


Asn Asn Asn Asn Leu Ala Arg Val Asp Leu Arg Ile Asp Gly Arg Thr 
                725                 730                 735     


Val Gly Thr Phe Tyr Tyr Gln Gly Thr Tyr Pro Trp Glu Ala Pro Ile 
            740                 745                 750         


Asp Asn Val Tyr Val Ser Ala Gly Ser His Thr Val Glu Ile Thr Val 
        755                 760                 765             


Thr Ala Asp Asn Gly Thr Trp Asp Val Tyr Ala Asp Tyr Leu Val Ile 
    770                 775                 780                 


Gln Ser Glu Lys Asp Glu Leu 
785                 790     


<210>  65
<211>  4654
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, expression cassette in pAG2227 is 
       OsUbi3P:P77853-T134-100-101

<400>  65
ggtaccgtcg actctagtaa cggccgccag tgtgctggaa ttaattcggc ttgtcgacca       60

cccaacccca tatcgacaga ggatgtgaag aacaggtaaa tcacgcagaa gaacccatct      120

ctgatagcag ctatcgatta gaacaacgaa tccatattgg gtccgtggga aatacttact      180

gcacaggaag ggggcgatct gacgaggccc cgccaccggc ctcgacccga ggccgaggcc      240

gacgaagcgc cggcgagtac ggcgccgcgg cggcctctgc ccgtgccctc tgcgcgtggg      300

agggagaggc cgcggtggtg ggggcgcgcg cgcgcgcgcg cgcagctggt gcggcggcgc      360

gggggtcagc cgccgagccg gcggcgacgg aggagcaggg cggcgtggac gcgaacttcc      420

gatcggttgg tcagagtgcg cgagttgggc ttagccaatt aggtctcaac aatctattgg      480

gccgtaaaat tcatgggccc tggtttgtct aggcccaata tcccgttcat ttcagcccac      540

aaatatttcc ccagaggatt attaaggccc acacgcagct tatagcagat caagtacgat      600

gtttcctgat cgttggatcg gaaacgtacg gtcttgatca ggcatgccga cttcgtcaaa      660

gagaggcggc atgacctgac gcggagttgg ttccgggcac cgtctggatg gtcgtaccgg      720

gaccggacac gtgtcgcgcc tccaactaca tggacacgtg tggtgctgcc attgggccgt      780

acgcgtggcg gtgaccgcac cggatgctgc ctcgcaccgc cttgcccacg ctttatatag      840

agaggttttc tctccattaa tcgcatagcg agtcgaatcg accgaagggg agggggagcg      900

aagctttgcg ttctctaatc gcctcgtcaa ggtaactaat caatcacctc gtcctaatcc      960

tcgaatctct cgtggtgccc gtctaatctc gcgattttga tgctcgtggt ggaaagcgta     1020

ggaggatccc gtgcgagtta gtctcaatct ctcagggttt cgtgcgattt tagggtgatc     1080

cacctcttaa tcgagttacg gtttcgtgcg attttagggt aatcctctta atctctcatt     1140

gatttagggt ttcgtgagaa tcgaggtagg gatctgtgtt atttatatcg atctaataga     1200

tggattggtt ttgagattgt tctgtcagat ggggattgtt tcgatatatt accctaatga     1260

tgtgtcagat ggggattgtt tcgatatatt accctaatga tgtgtcagat ggggattgtt     1320

tcgatatatt accctaatga tggataataa gagtagttca cagttatgtt ttgatcctgc     1380

cacatagttt gagttttgtg atcagattta gttttactta tttgtgctta gttcggatgg     1440

gattgttctg atattgttcc aatagatgaa tagctcgtta ggttaaaatc tttaggttga     1500

gttaggcgac acatagttta tttcctctgg atttggattg gaattgtgtt cttagttttt     1560

ttcccctgga tttggattgg aattgtgtgg agctgggtta gagaattaca tctgtatcgt     1620

gtacacctac ttgaactgta gagcttgggt tctaaggtca atttaatctg tattgtatct     1680

ggctctttgc ctagttgaac tgtagtgctg atgttgtact gtgttttttt acccgtttta     1740

tttgctttac tcgtgcaaat caaatctgtc agatgctaga actaggtggc tttattctgt     1800

gttcttacat agatctgttg tcctgtagtt acttatgtca gttttgttat tatctgaaga     1860

tatttttggt tgttgcttgt tgatgtggtg tgagctgtga gcagcgctct tatgattaat     1920

gatgctgtcc aattgtagtg tagtatgatg tgattgatat gttcatctat tttgagctga     1980

cagtaccgat atcgtaggat ctggtgccaa cttattctcc agctgctttt ttttacctat     2040

gttaattcca atcctttctt gcctcttcca gatccagata atgcaaacaa gcattactct     2100

gacatccaac gcatccggta cgtttgacgg ttactattac gaactctgga aggatactgg     2160

caatacaaca atgacggtct acactcaagg tcgcttttcc tgccagtggt cgaacatcaa     2220

taacgcgttg tttaggaccg ggaagaaata caaccagaat tggcagtctc ttggcacaat     2280

ccggatcacg tactctgcga cttacaaccc aaacgggaac tcctacttgt gtatctatgg     2340

ctggtctacc aacccattgg tcgagttcta catcgttgag tcctggggga actggagacc     2400

gcctggtgcc tgcctggccg agggctcgct cgtcttggac gcggctaccg ggcagagggt     2460

ccctatcgaa aaggtgcgtc cggggatgga agttttctcc ttgggacctg attacagact     2520

gtatcgggtg cccgttttgg aggtccttga gagcggggtt agggaagttg tgcgcctcag     2580

aactcggtca gggagaacgc tggtgttgac accagatcac ccgcttttga cccccgaagg     2640

ttggaaacct ctttgtgacc tcccgcttgg aactccaatt gcagtccccg cagaactgcc     2700

tgtggcgggc cacttggccc cacctgaaga acgtgttacg ctcctggctc ttctgttggg     2760

ggatgggaac acaaagctgt cgggtcggag aggtacacgt cctaatgcct tcttctacag     2820

caaaaacccc gaattgctcg cggcttatcg ccggtgtgca gaagccttgg gtgcaaaggt     2880

gaaagcatac gtccacccga ctacgggggt ggttacactc gcaaccctcg ctccacgtcc     2940

tggagctcaa gatcctgtca aacgcctcgt tgtcgaggcg ggaatggttg ctaaagccga     3000

agagaagagg gtcccggagg aggtgtttcg ttaccggcgt gaggcgttgg cccttttctt     3060

gggccgtttg ttctcgacag acggctctgt tgaaaagaag aggatctctt attcaagtgc     3120

cagtttggga ctggcccagg atgtcgcaca tctcttgctg cgccttggaa ttacatctca     3180

actccgttcg agagggccac gggctcacga ggttcttata tcgggccgcg aggatatttt     3240

gcggtttgct gaacttatcg gaccctacct cttgggggcc aagagggaga gacttgcagc     3300

gctggaagct gaggcccgca ggcgtttgcc tggacaggga tggcacttgc ggcttgttct     3360

tcctgccgtg gcgtacagag tgggcgaggc ggaaaggcgc tcgggatttt cgtggagtga     3420

agccggtcgg cgcgtcgcag ttgcgggatc gtgtttgtca tctggactca acctcaaatt     3480

gcccagacgc tacctttctc ggcaccggtt gtcgctgctc ggtgaggctt ttgccgaccc     3540

tgggctggaa gcgctcgcgg aaggccaagt gctctgggac cctattgttg ctgtcgaacc     3600

ggccggtaag gcgagaacat tcgacttgcg cgttccaccc tttgcaaact tcgtgagcga     3660

ggacctggtg gtgcataaca ccgtccccct gggccaagtg acaatcgatg gcgggaccta     3720

cgacatctat aggacgacac gcgtcaacca gccttccatt gtggggacag ccacgttcga     3780

tcagtactgg agcgtgcgca cctctaagcg gacttcagga acagtgaccg tgaccgatca     3840

cttccgcgcc tgggcgaacc ggggcctgaa cctcggcaca atagaccaaa ttacattgtg     3900

cgtggagggt taccaaagct ctggatcagc caacatcacc cagaacacct tctctcaggg     3960

ctcttcttcc ggcagttcgg gtggctcatc cggctccaca acgactactc gcatcgagtg     4020

tgagaacatg tccttgtccg gaccctacgt tagcaggatc accaatccct ttaatggtat     4080

tgcgctgtac gccaacggag acacagcccg cgctaccgtt aacttccccg caagtcgcaa     4140

ctacaatttc cgcctgcggg gttgcggcaa caacaataat cttgcccgtg tggacctgag     4200

gatcgacgga cggaccgtcg ggacctttta ttaccagggc acatacccct gggaggcccc     4260

aattgacaat gtttatgtca gtgcggggag tcatacagtc gaaatcactg ttactgcgga     4320

taacggcaca tgggacgtgt atgccgacta cctggtgata cagtgaccta ggtccccgaa     4380

tttccccgat cgttcaaaca tttggcaata aagtttctta agattgaatc ctgttgccgg     4440

tcttgcgatg attatcatat aatttctgtt gaattacgtt aagcatgtaa taattaacat     4500

gtaatgcatg acgttattta tgagatgggt ttttatgatt agagtcccgc aattatacat     4560

ttaatacgcg atagaaaaca aaatatagcg cgcaaactag gataaattat cgcgcgcggt     4620

gtcatctatg ttactagatc gggaattgga attc                                 4654


<210>  66
<211>  4726
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, Expression cassette in pAG2228 - 
       OsUbi3P:BAASS:P77853-T134-100-101:NosT

<400>  66
ggtaccgtcg actctagtaa cggccgccag tgtgctggaa ttaattcggc ttgtcgacca       60

cccaacccca tatcgacaga ggatgtgaag aacaggtaaa tcacgcagaa gaacccatct      120

ctgatagcag ctatcgatta gaacaacgaa tccatattgg gtccgtggga aatacttact      180

gcacaggaag ggggcgatct gacgaggccc cgccaccggc ctcgacccga ggccgaggcc      240

gacgaagcgc cggcgagtac ggcgccgcgg cggcctctgc ccgtgccctc tgcgcgtggg      300

agggagaggc cgcggtggtg ggggcgcgcg cgcgcgcgcg cgcagctggt gcggcggcgc      360

gggggtcagc cgccgagccg gcggcgacgg aggagcaggg cggcgtggac gcgaacttcc      420

gatcggttgg tcagagtgcg cgagttgggc ttagccaatt aggtctcaac aatctattgg      480

gccgtaaaat tcatgggccc tggtttgtct aggcccaata tcccgttcat ttcagcccac      540

aaatatttcc ccagaggatt attaaggccc acacgcagct tatagcagat caagtacgat      600

gtttcctgat cgttggatcg gaaacgtacg gtcttgatca ggcatgccga cttcgtcaaa      660

gagaggcggc atgacctgac gcggagttgg ttccgggcac cgtctggatg gtcgtaccgg      720

gaccggacac gtgtcgcgcc tccaactaca tggacacgtg tggtgctgcc attgggccgt      780

acgcgtggcg gtgaccgcac cggatgctgc ctcgcaccgc cttgcccacg ctttatatag      840

agaggttttc tctccattaa tcgcatagcg agtcgaatcg accgaagggg agggggagcg      900

aagctttgcg ttctctaatc gcctcgtcaa ggtaactaat caatcacctc gtcctaatcc      960

tcgaatctct cgtggtgccc gtctaatctc gcgattttga tgctcgtggt ggaaagcgta     1020

ggaggatccc gtgcgagtta gtctcaatct ctcagggttt cgtgcgattt tagggtgatc     1080

cacctcttaa tcgagttacg gtttcgtgcg attttagggt aatcctctta atctctcatt     1140

gatttagggt ttcgtgagaa tcgaggtagg gatctgtgtt atttatatcg atctaataga     1200

tggattggtt ttgagattgt tctgtcagat ggggattgtt tcgatatatt accctaatga     1260

tgtgtcagat ggggattgtt tcgatatatt accctaatga tgtgtcagat ggggattgtt     1320

tcgatatatt accctaatga tggataataa gagtagttca cagttatgtt ttgatcctgc     1380

cacatagttt gagttttgtg atcagattta gttttactta tttgtgctta gttcggatgg     1440

gattgttctg atattgttcc aatagatgaa tagctcgtta ggttaaaatc tttaggttga     1500

gttaggcgac acatagttta tttcctctgg atttggattg gaattgtgtt cttagttttt     1560

ttcccctgga tttggattgg aattgtgtgg agctgggtta gagaattaca tctgtatcgt     1620

gtacacctac ttgaactgta gagcttgggt tctaaggtca atttaatctg tattgtatct     1680

ggctctttgc ctagttgaac tgtagtgctg atgttgtact gtgttttttt acccgtttta     1740

tttgctttac tcgtgcaaat caaatctgtc agatgctaga actaggtggc tttattctgt     1800

gttcttacat agatctgttg tcctgtagtt acttatgtca gttttgttat tatctgaaga     1860

tatttttggt tgttgcttgt tgatgtggtg tgagctgtga gcagcgctct tatgattaat     1920

gatgctgtcc aattgtagtg tagtatgatg tgattgatat gttcatctat tttgagctga     1980

cagtaccgat atcgtaggat ctggtgccaa cttattctcc agctgctttt ttttacctat     2040

gttaattcca atcctttctt gcctcttcca gatccagata atggcgaaca aacatttgtc     2100

cctctccctc ttcctcgtcc tccttggcct gtcggccagc ttggcctccg ggcaacaaac     2160

aagcattact ctgacatcca acgcatccgg tacgtttgac ggttactatt acgaactctg     2220

gaaggatact ggcaatacaa caatgacggt ctacactcaa ggtcgctttt cctgccagtg     2280

gtcgaacatc aataacgcgt tgtttaggac cgggaagaaa tacaaccaga attggcagtc     2340

tcttggcaca atccggatca cgtactctgc gacttacaac ccaaacggga actcctactt     2400

gtgtatctat ggctggtcta ccaacccatt ggtcgagttc tacatcgttg agtcctgggg     2460

gaactggaga ccgcctggtg cctgcctggc cgagggctcg ctcgtcttgg acgcggctac     2520

cgggcagagg gtccctatcg aaaaggtgcg tccggggatg gaagttttct ccttgggacc     2580

tgattacaga ctgtatcggg tgcccgtttt ggaggtcctt gagagcgggg ttagggaagt     2640

tgtgcgcctc agaactcggt cagggagaac gctggtgttg acaccagatc acccgctttt     2700

gacccccgaa ggttggaaac ctctttgtga cctcccgctt ggaactccaa ttgcagtccc     2760

cgcagaactg cctgtggcgg gccacttggc cccacctgaa gaacgtgtta cgctcctggc     2820

tcttctgttg ggggatggga acacaaagct gtcgggtcgg agaggtacac gtcctaatgc     2880

cttcttctac agcaaaaacc ccgaattgct cgcggcttat cgccggtgtg cagaagcctt     2940

gggtgcaaag gtgaaagcat acgtccaccc gactacgggg gtggttacac tcgcaaccct     3000

cgctccacgt cctggagctc aagatcctgt caaacgcctc gttgtcgagg cgggaatggt     3060

tgctaaagcc gaagagaaga gggtcccgga ggaggtgttt cgttaccggc gtgaggcgtt     3120

ggcccttttc ttgggccgtt tgttctcgac agacggctct gttgaaaaga agaggatctc     3180

ttattcaagt gccagtttgg gactggccca ggatgtcgca catctcttgc tgcgccttgg     3240

aattacatct caactccgtt cgagagggcc acgggctcac gaggttctta tatcgggccg     3300

cgaggatatt ttgcggtttg ctgaacttat cggaccctac ctcttggggg ccaagaggga     3360

gagacttgca gcgctggaag ctgaggcccg caggcgtttg cctggacagg gatggcactt     3420

gcggcttgtt cttcctgccg tggcgtacag agtgggcgag gcggaaaggc gctcgggatt     3480

ttcgtggagt gaagccggtc ggcgcgtcgc agttgcggga tcgtgtttgt catctggact     3540

caacctcaaa ttgcccagac gctacctttc tcggcaccgg ttgtcgctgc tcggtgaggc     3600

ttttgccgac cctgggctgg aagcgctcgc ggaaggccaa gtgctctggg accctattgt     3660

tgctgtcgaa ccggccggta aggcgagaac attcgacttg cgcgttccac cctttgcaaa     3720

cttcgtgagc gaggacctgg tggtgcataa caccgtcccc ctgggccaag tgacaatcga     3780

tggcgggacc tacgacatct ataggacgac acgcgtcaac cagccttcca ttgtggggac     3840

agccacgttc gatcagtact ggagcgtgcg cacctctaag cggacttcag gaacagtgac     3900

cgtgaccgat cacttccgcg cctgggcgaa ccggggcctg aacctcggca caatagacca     3960

aattacattg tgcgtggagg gttaccaaag ctctggatca gccaacatca cccagaacac     4020

cttctctcag ggctcttctt ccggcagttc gggtggctca tccggctcca caacgactac     4080

tcgcatcgag tgtgagaaca tgtccttgtc cggaccctac gttagcagga tcaccaatcc     4140

ctttaatggt attgcgctgt acgccaacgg agacacagcc cgcgctaccg ttaacttccc     4200

cgcaagtcgc aactacaatt tccgcctgcg gggttgcggc aacaacaata atcttgcccg     4260

tgtggacctg aggatcgacg gacggaccgt cgggaccttt tattaccagg gcacataccc     4320

ctgggaggcc ccaattgaca atgtttatgt cagtgcgggg agtcatacag tcgaaatcac     4380

tgttactgcg gataacggca catgggacgt gtatgccgac tacctggtga tacagtgacc     4440

taggtccccg aatttccccg atcgttcaaa catttggcaa taaagtttct taagattgaa     4500

tcctgttgcc ggtcttgcga tgattatcat ataatttctg ttgaattacg ttaagcatgt     4560

aataattaac atgtaatgca tgacgttatt tatgagatgg gtttttatga ttagagtccc     4620

gcaattatac atttaatacg cgatagaaaa caaaatatag cgcgcaaact aggataaatt     4680

atcgcgcgcg gtgtcatcta tgttactaga tcgggaattg gaattc                    4726


<210>  67
<211>  4744
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, Expression cassette in pAG2229 - 
       OsUbi3P:BAASS:P77853-T134-100-101:SEKDEL:NosT

<400>  67
ggtaccgtcg actctagtaa cggccgccag tgtgctggaa ttaattcggc ttgtcgacca       60

cccaacccca tatcgacaga ggatgtgaag aacaggtaaa tcacgcagaa gaacccatct      120

ctgatagcag ctatcgatta gaacaacgaa tccatattgg gtccgtggga aatacttact      180

gcacaggaag ggggcgatct gacgaggccc cgccaccggc ctcgacccga ggccgaggcc      240

gacgaagcgc cggcgagtac ggcgccgcgg cggcctctgc ccgtgccctc tgcgcgtggg      300

agggagaggc cgcggtggtg ggggcgcgcg cgcgcgcgcg cgcagctggt gcggcggcgc      360

gggggtcagc cgccgagccg gcggcgacgg aggagcaggg cggcgtggac gcgaacttcc      420

gatcggttgg tcagagtgcg cgagttgggc ttagccaatt aggtctcaac aatctattgg      480

gccgtaaaat tcatgggccc tggtttgtct aggcccaata tcccgttcat ttcagcccac      540

aaatatttcc ccagaggatt attaaggccc acacgcagct tatagcagat caagtacgat      600

gtttcctgat cgttggatcg gaaacgtacg gtcttgatca ggcatgccga cttcgtcaaa      660

gagaggcggc atgacctgac gcggagttgg ttccgggcac cgtctggatg gtcgtaccgg      720

gaccggacac gtgtcgcgcc tccaactaca tggacacgtg tggtgctgcc attgggccgt      780

acgcgtggcg gtgaccgcac cggatgctgc ctcgcaccgc cttgcccacg ctttatatag      840

agaggttttc tctccattaa tcgcatagcg agtcgaatcg accgaagggg agggggagcg      900

aagctttgcg ttctctaatc gcctcgtcaa ggtaactaat caatcacctc gtcctaatcc      960

tcgaatctct cgtggtgccc gtctaatctc gcgattttga tgctcgtggt ggaaagcgta     1020

ggaggatccc gtgcgagtta gtctcaatct ctcagggttt cgtgcgattt tagggtgatc     1080

cacctcttaa tcgagttacg gtttcgtgcg attttagggt aatcctctta atctctcatt     1140

gatttagggt ttcgtgagaa tcgaggtagg gatctgtgtt atttatatcg atctaataga     1200

tggattggtt ttgagattgt tctgtcagat ggggattgtt tcgatatatt accctaatga     1260

tgtgtcagat ggggattgtt tcgatatatt accctaatga tgtgtcagat ggggattgtt     1320

tcgatatatt accctaatga tggataataa gagtagttca cagttatgtt ttgatcctgc     1380

cacatagttt gagttttgtg atcagattta gttttactta tttgtgctta gttcggatgg     1440

gattgttctg atattgttcc aatagatgaa tagctcgtta ggttaaaatc tttaggttga     1500

gttaggcgac acatagttta tttcctctgg atttggattg gaattgtgtt cttagttttt     1560

ttcccctgga tttggattgg aattgtgtgg agctgggtta gagaattaca tctgtatcgt     1620

gtacacctac ttgaactgta gagcttgggt tctaaggtca atttaatctg tattgtatct     1680

ggctctttgc ctagttgaac tgtagtgctg atgttgtact gtgttttttt acccgtttta     1740

tttgctttac tcgtgcaaat caaatctgtc agatgctaga actaggtggc tttattctgt     1800

gttcttacat agatctgttg tcctgtagtt acttatgtca gttttgttat tatctgaaga     1860

tatttttggt tgttgcttgt tgatgtggtg tgagctgtga gcagcgctct tatgattaat     1920

gatgctgtcc aattgtagtg tagtatgatg tgattgatat gttcatctat tttgagctga     1980

cagtaccgat atcgtaggat ctggtgccaa cttattctcc agctgctttt ttttacctat     2040

gttaattcca atcctttctt gcctcttcca gatccagata atggcgaaca aacatttgtc     2100

cctctccctc ttcctcgtcc tccttggcct gtcggccagc ttggcctccg ggcaacaaac     2160

aagcattact ctgacatcca acgcatccgg tacgtttgac ggttactatt acgaactctg     2220

gaaggatact ggcaatacaa caatgacggt ctacactcaa ggtcgctttt cctgccagtg     2280

gtcgaacatc aataacgcgt tgtttaggac cgggaagaaa tacaaccaga attggcagtc     2340

tcttggcaca atccggatca cgtactctgc gacttacaac ccaaacggga actcctactt     2400

gtgtatctat ggctggtcta ccaacccatt ggtcgagttc tacatcgttg agtcctgggg     2460

gaactggaga ccgcctggtg cctgcctggc cgagggctcg ctcgtcttgg acgcggctac     2520

cgggcagagg gtccctatcg aaaaggtgcg tccggggatg gaagttttct ccttgggacc     2580

tgattacaga ctgtatcggg tgcccgtttt ggaggtcctt gagagcgggg ttagggaagt     2640

tgtgcgcctc agaactcggt cagggagaac gctggtgttg acaccagatc acccgctttt     2700

gacccccgaa ggttggaaac ctctttgtga cctcccgctt ggaactccaa ttgcagtccc     2760

cgcagaactg cctgtggcgg gccacttggc cccacctgaa gaacgtgtta cgctcctggc     2820

tcttctgttg ggggatggga acacaaagct gtcgggtcgg agaggtacac gtcctaatgc     2880

cttcttctac agcaaaaacc ccgaattgct cgcggcttat cgccggtgtg cagaagcctt     2940

gggtgcaaag gtgaaagcat acgtccaccc gactacgggg gtggttacac tcgcaaccct     3000

cgctccacgt cctggagctc aagatcctgt caaacgcctc gttgtcgagg cgggaatggt     3060

tgctaaagcc gaagagaaga gggtcccgga ggaggtgttt cgttaccggc gtgaggcgtt     3120

ggcccttttc ttgggccgtt tgttctcgac agacggctct gttgaaaaga agaggatctc     3180

ttattcaagt gccagtttgg gactggccca ggatgtcgca catctcttgc tgcgccttgg     3240

aattacatct caactccgtt cgagagggcc acgggctcac gaggttctta tatcgggccg     3300

cgaggatatt ttgcggtttg ctgaacttat cggaccctac ctcttggggg ccaagaggga     3360

gagacttgca gcgctggaag ctgaggcccg caggcgtttg cctggacagg gatggcactt     3420

gcggcttgtt cttcctgccg tggcgtacag agtgggcgag gcggaaaggc gctcgggatt     3480

ttcgtggagt gaagccggtc ggcgcgtcgc agttgcggga tcgtgtttgt catctggact     3540

caacctcaaa ttgcccagac gctacctttc tcggcaccgg ttgtcgctgc tcggtgaggc     3600

ttttgccgac cctgggctgg aagcgctcgc ggaaggccaa gtgctctggg accctattgt     3660

tgctgtcgaa ccggccggta aggcgagaac attcgacttg cgcgttccac cctttgcaaa     3720

cttcgtgagc gaggacctgg tggtgcataa caccgtcccc ctgggccaag tgacaatcga     3780

tggcgggacc tacgacatct ataggacgac acgcgtcaac cagccttcca ttgtggggac     3840

agccacgttc gatcagtact ggagcgtgcg cacctctaag cggacttcag gaacagtgac     3900

cgtgaccgat cacttccgcg cctgggcgaa ccggggcctg aacctcggca caatagacca     3960

aattacattg tgcgtggagg gttaccaaag ctctggatca gccaacatca cccagaacac     4020

cttctctcag ggctcttctt ccggcagttc gggtggctca tccggctcca caacgactac     4080

tcgcatcgag tgtgagaaca tgtccttgtc cggaccctac gttagcagga tcaccaatcc     4140

ctttaatggt attgcgctgt acgccaacgg agacacagcc cgcgctaccg ttaacttccc     4200

cgcaagtcgc aactacaatt tccgcctgcg gggttgcggc aacaacaata atcttgcccg     4260

tgtggacctg aggatcgacg gacggaccgt cgggaccttt tattaccagg gcacataccc     4320

ctgggaggcc ccaattgaca atgtttatgt cagtgcgggg agtcatacag tcgaaatcac     4380

tgttactgcg gataacggca catgggacgt gtatgccgac tacctggtga tacagagcga     4440

gaaggacgag ctgtgaccta ggtccccgaa tttccccgat cgttcaaaca tttggcaata     4500

aagtttctta agattgaatc ctgttgccgg tcttgcgatg attatcatat aatttctgtt     4560

gaattacgtt aagcatgtaa taattaacat gtaatgcatg acgttattta tgagatgggt     4620

ttttatgatt agagtcccgc aattatacat ttaatacgcg atagaaaaca aaatatagcg     4680

cgcaaactag gataaattat cgcgcgcggt gtcatctatg ttactagatc gggaattgga     4740

attc                                                                  4744


<210>  68
<211>  4683
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, Expression cassettes in pAG2361 and pAG4004
       - ZmUbi1P:mmUBQ:ZmKozak:BAASS:P77853-T134-100-101:SEKDEL:NosT

<400>  68
ggtaccctgc agtgcagcgt gacccggtcg tgcccctctc tagagataat gagcattgca       60

tgtctaagtt ataaaaaatt accacatatt ttttttgtca cacttgtttg aagtgcagtt      120

tatctatctt tatacatata tttaaacttt actctacgaa taatataatc tatagtacta      180

caataatatc agtgttttag agaatcatat aaatgaacag ttagacatgg tctaaaggac      240

aattgagtat tttgacaaca ggactctaca gttttatctt tttagtgtgc atgtgttctc      300

cttttttttt gcaaatagct tcacctatat aatacttcat ccattttatt agtacatcca      360

tttagggttt agggttaatg gtttttatag actaattttt ttagtacatc tattttattc      420

tattttagcc tctaaattaa gaaaactaaa actctatttt agttttttta tttaataatt      480

tagatataaa atagaataaa ataaagtgac taaaaattaa acaaataccc tttaagaaat      540

taaaaaaact aaggaaacat ttttcttgtt tcgagtagat aatgccagcc tgttaaacgc      600

cgtcgacgag tctaacggac accaaccagc gaaccagcag cgtcgcgtcg ggccaagcga      660

agcagacggc acggcatctc tgtcgctgcc tctggacccc tctcgagagt tccgctccac      720

cgttggactt gctccgctgt cggcatccag aaattgcgtg gcggagcggc agacgtgagc      780

cggcacggca ggcggcctcc tcctcctctc acggcacggc agctacgggg gattcctttc      840

ccaccgctcc ttcgctttcc cttcctcgcc cgccgtaata aatagacacc ccctccacac      900

cctctttccc caacctcgtg ttgttcggag cgcacacaca cacaaccaga tctcccccaa      960

atccacccgt cggcacctcc gcttcaaggt acgccgctcg tcctcccccc ccccccctct     1020

ctaccttctc tagatcggcg ttccggtcca tggttagggc ccggtagttc tacttctgtt     1080

catgtttgtg ttagatccgt gtttgtgtta gatccgtgct gctagcgttc gtacacggat     1140

gcgacctgta cgtcagacac gttctgattg ctaacttgcc agtgtttctc tttggggaat     1200

cctgggatgg ctctagccgt tccgcagacg ggatcgattt catgattttt tttgtttcgt     1260

tgcatagggt ttggtttgcc cttttccttt atttcaatat atgccgtgca cttgtttgtc     1320

gggtcatctt ttcatgcttt tttttgtctt ggttgtgatg atgtggtctg gttgggcggt     1380

cgttctagat cggagtagaa ttctgtttca aactacctgg tggatttatt aattttggat     1440

ctgtatgtgt gtgccataca tattcatagt tacgaattga agatgatgga tggaaatatc     1500

gatctaggat aggtatacat gttgatgcgg gttttactga tgcatataca gagatgcttt     1560

ttgttcgctt ggttgtgatg atgtggtgtg gttgggcggt cgttcattcg ttctagatcg     1620

gagtagaata ctgtttcaaa ctacctggtg tatttattaa ttttggaact gtatgtgtgt     1680

gtcatacatc ttcatagtta cgagtttaag atggatggaa atatcgatct aggataggta     1740

tacatgttga tgtgggtttt actgatgcat atacatgatg gcatatgcag catctattca     1800

tatgctctaa ccttgagtac ctatctatta taataaacaa gtatgtttta taattatttt     1860

gatcttgata tacttggatg atggcatatg cagcagctat atgtggattt ttttagccct     1920

gccttcatac gctatttatt tgcttggtac tgtttctttt gtcgatgctc accctgttgt     1980

ttggtgttac ttctgcagat ccagatcgga tcctaaacca tggcgaacaa acatttgtcc     2040

ctctccctct tcctcgtcct ccttggcctg tcggccagct tggcctccgg gcaacaaaca     2100

agcattactc tgacatccaa cgcatccggt acgtttgacg gttactatta cgaactctgg     2160

aaggatactg gcaatacaac aatgacggtc tacactcaag gtcgcttttc ctgccagtgg     2220

tcgaacatca ataacgcgtt gtttaggacc gggaagaaat acaaccagaa ttggcagtct     2280

cttggcacaa tccggatcac gtactctgcg acttacaacc caaacgggaa ctcctacttg     2340

tgtatctatg gctggtctac caacccattg gtcgagttct acatcgttga gtcctggggg     2400

aactggagac cgcctggtgc ctgcctggcc gagggctcgc tcgtcttgga cgcggctacc     2460

gggcagaggg tccctatcga aaaggtgcgt ccggggatgg aagttttctc cttgggacct     2520

gattacagac tgtatcgggt gcccgttttg gaggtccttg agagcggggt tagggaagtt     2580

gtgcgcctca gaactcggtc agggagaacg ctggtgttga caccagatca cccgcttttg     2640

acccccgaag gttggaaacc tctttgtgac ctcccgcttg gaactccaat tgcagtcccc     2700

gcagaactgc ctgtggcggg ccacttggcc ccacctgaag aacgtgttac gctcctggct     2760

cttctgttgg gggatgggaa cacaaagctg tcgggtcgga gaggtacacg tcctaatgcc     2820

ttcttctaca gcaaaaaccc cgaattgctc gcggcttatc gccggtgtgc agaagccttg     2880

ggtgcaaagg tgaaagcata cgtccacccg actacggggg tggttacact cgcaaccctc     2940

gctccacgtc ctggagctca agatcctgtc aaacgcctcg ttgtcgaggc gggaatggtt     3000

gctaaagccg aagagaagag ggtcccggag gaggtgtttc gttaccggcg tgaggcgttg     3060

gcccttttct tgggccgttt gttctcgaca gacggctctg ttgaaaagaa gaggatctct     3120

tattcaagtg ccagtttggg actggcccag gatgtcgcac atctcttgct gcgccttgga     3180

attacatctc aactccgttc gagagggcca cgggctcacg aggttcttat atcgggccgc     3240

gaggatattt tgcggtttgc tgaacttatc ggaccctacc tcttgggggc caagagggag     3300

agacttgcag cgctggaagc tgaggcccgc aggcgtttgc ctggacaggg atggcacttg     3360

cggcttgttc ttcctgccgt ggcgtacaga gtgggcgagg ctgaaaggcg ctcgggattt     3420

tcgtggagtg aagccggtcg gcgcgtcgca gttgcgggat cgtgtttgtc atctggactc     3480

aacctcaaat tgcccagacg ctacctttct cggcaccggt tgtcgctgct cggtgaggct     3540

tttgccgacc ctgggctgga agcgctcgcg gaaggccaag tgctctggga ccctattgtt     3600

gctgtcgaac cggccggtaa ggcgagaaca ttcgacttgc gcgttccacc ctttgcaaac     3660

ttcgtgagcg aggacctggt ggtgcataac accgtccccc tgggccaagt gacaatcgat     3720

ggcgggacct acgacatcta taggacgaca cgcgtcaacc agccttccat tgtggggaca     3780

gccacgttcg atcagtactg gagcgtgcgc acctctaagc ggacttcagg aacagtgacc     3840

gtgaccgatc acttccgcgc ctgggcgaac cggggcctga acctcggcac aatagaccaa     3900

attacattgt gcgtggaggg ttaccaaagc tctggatcag ccaacatcac ccagaacacc     3960

ttctctcagg gctcttcttc cggcagttcg ggtggctcat ccggctccac aacgactact     4020

cgcatcgagt gtgagaacat gtccttgtcc ggaccctacg ttagcaggat caccaatccc     4080

tttaatggta ttgcgctgta cgccaacgga gacacagccc gcgctaccgt taacttcccc     4140

gcaagtcgca actacaattt ccgcctgcgg ggttgcggca acaacaataa tcttgcccgt     4200

gtggacctga ggatcgacgg acggaccgtc gggacctttt attaccaggg cacatacccc     4260

tgggaggccc caattgacaa tgtttatgtc agtgcgggga gtcatacagt cgaaatcact     4320

gttactgcgg ataacggcac atgggacgtg tatgccgact acctggtgat acagagcgag     4380

aaggacgagc tgtgacctag gtccccgaat ttccccgatc gttcaaacat ttggcaataa     4440

agtttcttaa gattgaatcc tgttgccggt cttgcgatga ttatcatata atttctgttg     4500

aattacgtta agcatgtaat aattaacatg taatgcatga cgttatttat gagatgggtt     4560

tttatgatta gagtcccgca attatacatt taatacgcga tagaaaacaa aatatagcgc     4620

gcaaactagg ataaattatc gcgcgcggtg tcatctatgt tactagatcg ggaattggaa     4680

ttc                                                                   4683


<210>  69
<211>  10146
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, pAG4000

<400>  69
aattcctgca gtgcagcgtg acccggtcgt gcccctctct agagataatg agcattgcat       60

gtctaagtta taaaaaatta ccacatattt tttttgtcac acttgtttga agtgcagttt      120

atctatcttt atacatatat ttaaacttta ctctacgaat aatataatct atagtactac      180

aataatatca gtgttttaga gaatcatata aatgaacagt tagacatggt ctaaaggaca      240

attgagtatt ttgacaacag gactctacag ttttatcttt ttagtgtgca tgtgttctcc      300

tttttttttg caaatagctt cacctatata atacttcatc cattttatta gtacatccat      360

ttagggttta gggttaatgg tttttataga ctaatttttt tagtacatct attttattct      420

attttagcct ctaaattaag aaaactaaaa ctctatttta gtttttttat ttaataattt      480

agatataaaa tagaataaaa taaagtgact aaaaattaaa caaataccct ttaagaaatt      540

aaaaaaacta aggaaacatt tttcttgttt cgagtagata atgccagcct gttaaacgcc      600

gtcgacgagt ctaacggaca ccaaccagcg aaccagcagc gtcgcgtcgg gccaagcgaa      660

gcagacggca cggcatctct gtcgctgcct ctggacccct ctcgagagtt ccgctccacc      720

gttggacttg ctccgctgtc ggcatccaga aattgcgtgg cggagcggca gacgtgagcc      780

ggcacggcag gcggcctcct cctcctctca cggcacggca gctacggggg attcctttcc      840

caccgctcct tcgctttccc ttcctcgccc gccgtaataa atagacaccc cctccacacc      900

ctctttcccc aacctcgtgt tgttcggagc gcacacacac acaaccagat ctcccccaaa      960

tccacccgtc ggcacctccg cttcaaggta cgccgctcgt cctccccccc cccccctctc     1020

taccttctct agatcggcgt tccggtccat ggttagggcc cggtagttct acttctgttc     1080

atgtttgtgt tagatccgtg tttgtgttag atccgtgctg ctagcgttcg tacacggatg     1140

cgacctgtac gtcagacacg ttctgattgc taacttgcca gtgtttctct ttggggaatc     1200

ctgggatggc tctagccgtt ccgcagacgg gatcgatttc atgatttttt ttgtttcgtt     1260

gcatagggtt tggtttgccc ttttccttta tttcaatata tgccgtgcac ttgtttgtcg     1320

ggtcatcttt tcatgctttt ttttgtcttg gttgtgatga tgtggtctgg ttgggcggtc     1380

gttctagatc ggagtagaat tctgtttcaa actacctggt ggatttatta attttggatc     1440

tgtatgtgtg tgccatacat attcatagtt acgaattgaa gatgatggat ggaaatatcg     1500

atctaggata ggtatacatg ttgatgcggg ttttactgat gcatatacag agatgctttt     1560

tgttcgcttg gttgtgatga tgtggtgtgg ttgggcggtc gttcattcgt tctagatcgg     1620

agtagaatac tgtttcaaac tacctggtgt atttattaat tttggaactg tatgtgtgtg     1680

tcatacatct tcatagttac gagtttaaga tggatggaaa tatcgatcta ggataggtat     1740

acatgttgat gtgggtttta ctgatgcata tacatgatgg catatgcagc atctattcat     1800

atgctctaac cttgagtacc tatctattat aataaacaag tatgttttat aattattttg     1860

atcttgatat acttggatga tggcatatgc agcagctata tgtggatttt tttagccctg     1920

ccttcatacg ctatttattt gcttggtact gtttcttttg tcgatgctca ccctgttgtt     1980

tggtgttact tctgcagatg cagaaactca ttaactcagt gcaaaactat gcctggggca     2040

gcaaaacggc gttgactgaa ctttatggta tggaaaatcc gtccagccag ccgatggccg     2100

agctgtggat gggcgcacat ccgaaaagca gttcacgagt gcagaatgcc gccggagata     2160

tcgtttcact gcgtgatgtg attgagagtg ataaatcgac tctgctcgga gaggccgttg     2220

ccaaacgctt tggcgaactg cctttcctgt tcaaagtatt atgcgcagca cagccactct     2280

ccattcaggt tcatccaaac aaacacaatt ctgaaatcgg ttttgccaaa gaaaatgccg     2340

caggtatccc gatggatgcc gccgagcgta actataaaga tcctaaccac aagccggagc     2400

tggtttttgc gctgacgcct ttccttgcga tgaacgcgtt tcgtgaattt tccgagattg     2460

tctccctact ccagccggtc gcaggtgcac atccggcgat tgctcacttt ttacaacagc     2520

ctgatgccga acgtttaagc gaactgttcg ccagcctgtt gaatatgcag ggtgaagaaa     2580

aatcccgcgc gctggcgatt ttaaaatcgg ccctcgatag ccagcagggt gaaccgtggc     2640

aaacgattcg tttaatttct gaattttacc cggaagacag cggtctgttc tccccgctat     2700

tgctgaatgt ggtgaaattg aaccctggcg aagcgatgtt cctgttcgct gaaacaccgc     2760

acgcttacct gcaaggcgtg gcgctggaag tgatggcaaa ctccgataac gtgctgcgtg     2820

cgggtctgac gcctaaatac attgatattc cggaactggt tgccaatgtg aaattcgaag     2880

ccaaaccggc taaccagttg ttgacccagc cggtgaaaca aggtgcagaa ctggacttcc     2940

cgattccagt ggatgatttt gccttctcgc tgcatgacct tagtgataaa gaaaccacca     3000

ttagccagca gagtgccgcc attttgttct gcgtcgaagg cgatgcaacg ttgtggaaag     3060

gttctcagca gttacagctt aaaccgggtg aatcagcgtt tattgccgcc aacgaatcac     3120

cggtgactgt caaaggccac ggccgtttag cgcgtgttta caacaagctg taagagctta     3180

ctgaaaaaat taacatctct tgctaagctg ggagctctag atccccgaat ttccccgatc     3240

gttcaaacat ttggcaataa agtttcttaa gattgaatcc tgttgccggt cttgcgatga     3300

ttatcatata atttctgttg aattacgtta agcatgtaat aattaacatg taatgcatga     3360

cgttatttat gagatgggtt tttatgatta gagtcccgca attatacatt taatacgcga     3420

tagaaaacaa aatatagcgc gcaaactagg ataaattatc gcgcgcggtg tcatctatgt     3480

tactagatcg ggaattggcg agctcgaatt aattcagtac attaaaaacg tccgcaatgt     3540

gttattaagt tgtctaagcg tcaatttgtt tacaccacaa tatatcctgc caccagccag     3600

ccaacagctc cccgaccggc agctcggcac aaaatcacca ctcgatacag gcagcccatc     3660

agtccgggac ggcgtcagcg ggagagccgt tgtaaggcgg cagactttgc tcatgttacc     3720

gatgctattc ggaagaacgg caactaagct gccgggtttg aaacacggat gatctcgcgg     3780

agggtagcat gttgattgta acgatgacag agcgttgctg cctgtgatca aatatcatct     3840

ccctcgcaga gatccgaatt atcagccttc ttattcattt ctcgcttaac cgtgacaggc     3900

tgtcgatctt gagaactatg ccgacataat aggaaatcgc tggataaagc cgctgaggaa     3960

gctgagtggc gctatttctt tagaagtgaa cgttgacgat cgtcgaccgt accccgatga     4020

attaattcgg acgtacgttc tgaacacagc tggatactta cttgggcgat tgtcatacat     4080

gacatcaaca atgtacccgt ttgtgtaacc gtctcttgga ggttcgtatg acactagtgg     4140

ttcccctcag cttgcgacta gatgttgagg cctaacattt tattagagag caggctagtt     4200

gcttagatac atgatcttca ggccgttatc tgtcagggca agcgaaaatt ggccatttat     4260

gacgaccaat gccccgcaga agctcccatc tttgccgcca tagacgccgc gccccccttt     4320

tggggtgtag aacatccttt tgccagatgt ggaaaagaag ttcgttgtcc cattgttggc     4380

aatgacgtag tagccggcga aagtgcgaga cccatttgcg ctatatataa gcctacgatt     4440

tccgttgcga ctattgtcgt aattggatga actattatcg tagttgctct cagagttgtc     4500

gtaatttgat ggactattgt cgtaattgct tatggagttg tcgtagttgc ttggagaaat     4560

gtcgtagttg gatggggagt agtcataggg aagacgagct tcatccacta aaacaattgg     4620

caggtcagca agtgcctgcc ccgatgccat cgcaagtacg aggcttagaa ccaccttcaa     4680

cagatcgcgc atagtcttcc ccagctctct aacgcttgag ttaagccgcg ccgcgaagcg     4740

gcgtcggctt gaacgaattg ttagacatta tttgccgact accttggtga tctcgccttt     4800

cacgtagtga acaaattctt ccaactgatc tgcgcgcgag gccaagcgat cttcttgtcc     4860

aagataagcc tgcctagctt caagtatgac gggctgatac tgggccggca ggcgctccat     4920

tgcccagtcg gcagcgacat ccttcggcgc gattttgccg gttactgcgc tgtaccaaat     4980

gcgggacaac gtaagcacta catttcgctc atcgccagcc cagtcgggcg gcgagttcca     5040

tagcgttaag gtttcattta gcgcctcaaa tagatcctgt tcaggaaccg gatcaaagag     5100

ttcctccgcc gctggaccta ccaaggcaac gctatgttct cttgcttttg tcagcaagat     5160

agccagatca atgtcgatcg tggctggctc gaagatacct gcaagaatgt cattgcgctg     5220

ccattctcca aattgcagtt cgcgcttagc tggataacgc cacggaatga tgtcgtcgtg     5280

cacaacaatg gtgacttcta cagcgcggag aatctcgctc tctccagggg aagccgaagt     5340

ttccaaaagg tcgttgatca aagctcgccg cgttgtttca tcaagcctta cggtcaccgt     5400

aaccagcaaa tcaatatcac tgtgtggctt caggccgcca tccactgcgg agccgtacaa     5460

atgtacggcc agcaacgtcg gttcgagatg gcgctcgatg acgccaacta cctctgatag     5520

ttgagtcgat acttcggcga tcaccgcttc cctcatgatg tttaactcct gaattaagcc     5580

gcgccgcgaa gcggtgtcgg cttgaatgaa ttgttaggcg tcatcctgtg ctcccgagaa     5640

ccagtaccag tacatcgctg tttcgttcga gacttgaggt ctagttttat acgtgaacag     5700

gtcaatgccg ccgagagtaa agccacattt tgcgtacaaa ttgcaggcag gtacattgtt     5760

cgtttgtgtc tctaatcgta tgccaaggag ctgtctgctt agtgcccact ttttcgcaaa     5820

ttcgatgaga ctgtgcgcga ctcctttgcc tcggtgcgtg tgcgacacaa caatgtgttc     5880

gatagaggct agatcgttcc atgttgagtt gagttcaatc ttcccgacaa gctcttggtc     5940

gatgaatgcg ccatagcaag cagagtcttc atcagagtca tcatccgaga tgtaatcctt     6000

ccggtagggg ctcacacttc tggtagatag ttcaaagcct tggtcggata ggtgcacatc     6060

gaacacttca cgaacaatga aatggttctc agcatccaat gtttccgcca cctgctcagg     6120

gatcaccgaa atcttcatat gacgcctaac gcctggcaca gcggatcgca aacctggcgc     6180

ggcttttggc acaaaaggcg tgacaggttt gcgaatccgt tgctgccact tgttaaccct     6240

tttgccagat ttggtaacta taatttatgt tagaggcgaa gtcttgggta aaaactggcc     6300

taaaattgct ggggatttca ggaaagtaaa catcaccttc cggctcgatg tctattgtag     6360

atatatgtag tgtatctact tgatcggggg atctgctgcc tcgcgcgttt cggtgatgac     6420

ggtgaaaacc tctgacacat gcagctcccg gagacggtca cagcttgtct gtaagcggat     6480

gccgggagca gacaagcccg tcagggcgcg tcagcgggtg ttggcgggtg tcggggcgca     6540

gccatgaccc agtcacgtag cgatagcgga gtgtatactg gcttaactat gcggcatcag     6600

agcagattgt actgagagtg caccatatgc ggtgtgaaat accgcacaga tgcgtaagga     6660

gaaaataccg catcaggcgc tcttccgctt cctcgctcac tgactcgctg cgctcggtcg     6720

ttcggctgcg gcgagcggta tcagctcact caaaggcggt aatacggtta tccacagaat     6780

caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta     6840

aaaaggccgc gttgctggcg tttttccata ggctccgccc ccctgacgag catcacaaaa     6900

atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagatac caggcgtttc     6960

cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc ggatacctgt     7020

ccgcctttct cccttcggga agcgtggcgc tttctcatag ctcacgctgt aggtatctca     7080

gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc gttcagcccg     7140

accgctgcgc cttatccggt aactatcgtc ttgagtccaa cccggtaaga cacgacttat     7200

cgccactggc agcagccact ggtaacagga ttagcagagc gaggtatgta ggcggtgcta     7260

cagagttctt gaagtggtgg cctaactacg gctacactag aaggacagta tttggtatct     7320

gcgctctgct gaagccagtt accttcggaa aaagagttgg tagctcttga tccggcaaac     7380

aaaccaccgc tggtagcggt ggtttttttg tttgcaagca gcagattacg cgcagaaaaa     7440

aaggatctca agaagatcct ttgatctttt ctacggggtc tgacgctcag tggaacgaaa     7500

actcacgtta agggattttg gtcatgagat tatcaaaaag gatcttcacc tagatccttt     7560

taaattaaaa atgaagtttt aaatcaatct aaagtatata tgagtaaact tggtctgaca     7620

gttaccaatg cttaatcagt gaggcaccta tctcagcgat ctgtctattt cgttcatcca     7680

tagttgcctg actccccgtc gtgtagataa ctacgatacg ggagggctta ccatctggcc     7740

ccagtgctgc aatgataccg cgagacccac gctcaccggc tccagattta tcagcaataa     7800

accagccagc cggaagggcc gagcgcagaa gtggtcctgc aactttatcc gcctccatcc     7860

agtctattaa ttgttgccgg gaagctagag taagtagttc gccagttaat agtttgcgca     7920

acgttgttgc cattgctgca gggggggggg ggggggggtt ccattgttca ttccacggac     7980

aaaaacagag aaaggaaacg acagaggcca aaaagctcgc tttcagcacc tgtcgtttcc     8040

tttcttttca gagggtattt taaataaaaa cattaagtta tgacgaagaa gaacggaaac     8100

gccttaaacc ggaaaatttt cataaatagc gaaaacccgc gaggtcgccg ccccgtaacc     8160

tgtcggatca ccggaaagga cccgtaaagt gataatgatt atcatctaca tatcacaacg     8220

tgcgtggagg ccatcaaacc acgtcaaata atcaattatg acgcaggtat cgtattaatt     8280

gatctgcatc aacttaacgt aaaaacaact tcagacaata caaatcagcg acactgaata     8340

cggggcaacc tcatgtcccc cccccccccc ccctgcaggc atcgtggtgt cacgctcgtc     8400

gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta catgatcccc     8460

catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca gaagtaagtt     8520

ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta ctgtcatgcc     8580

atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct gagaatagtg     8640

tatgcggcga ccgagttgct cttgcccggc gtcaacacgg gataataccg cgccacatag     8700

cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac tctcaaggat     8760

cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact gatcttcagc     8820

atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa     8880

aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt ttcaatatta     8940

ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa     9000

aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga     9060

aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtct     9120

tcaagaattg gtcgacgatc ttgctgcgtt cggatatttt cgtggagttc ccgccacaga     9180

cccggattga aggcgagatc cagcaactcg cgccagatca tcctgtgacg gaactttggc     9240

gcgtgatgac tggccaggac gtcggccgaa agagcgacaa gcagatcacg cttttcgaca     9300

gcgtcggatt tgcgatcgag gatttttcgg cgctgcgcta cgtccgcgac cgcgttgagg     9360

gatcaagcca cagcagccca ctcgaccttc tagccgaccc agacgagcca agggatcttt     9420

ttggaatgct gctccgtcgt caggctttcc gacgtttggg tggttgaaca gaagtcatta     9480

tcgcacggaa tgccaagcac tcccgagggg aaccctgtgg ttggcatgca catacaaatg     9540

gacgaacgga taaacctttt cacgcccttt taaatatccg attattctaa taaacgctct     9600

tttctcttag gtttacccgc caatatatcc tgtcaaacac tgatagttta aactgaaggc     9660

gggaaacgac aacctgatca tgagcggaga attaagggag tcacgttatg acccccgccg     9720

atgacgcggg acaagccgtt ttacgtttgg aactgacaga accgcaacgt tgaaggagcc     9780

actcagctta attaagtcta actcgagtta ctggtacgta ccaaatccat ggaatcaagg     9840

taccatcaat cccgggtatt catcctaggt ccccgaattt ccccgatcgt tcaaacattt     9900

ggcaataaag tttcttaaga ttgaatcctg ttgccggtct tgcgatgatt atcatataat     9960

ttctgttgaa ttacgttaag catgtaataa ttaacatgta atgcatgacg ttatttatga    10020

gatgggtttt tatgattaga gtcccgcaat tatacattta atacgcgata gaaaacaaaa    10080

tatagcgcgc aaactaggat aaattatcgc gcgcggtgtc atctatgtta ctagatcggg    10140

aattgg                                                               10146


<210>  70
<211>  14622
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, pAG2361

<400>  70
ctaggtcccc gaatttcccc gatcgttcaa acatttggca ataaagtttc ttaagattga       60

atcctgttgc cggtcttgcg atgattatca tataatttct gttgaattac gttaagcatg      120

taataattaa catgtaatgc atgacgttat ttatgagatg ggtttttatg attagagtcc      180

cgcaattata catttaatac gcgatagaaa acaaaatata gcgcgcaaac taggataaat      240

tatcgcgcgc ggtgtcatct atgttactag atcgggaatt ggaattcata ctaaagcttg      300

catgcctgca ggtcgactct agtaacggcc gccagtgtgc tggaattaat tcggcttgtc      360

gaccacccaa ccccatatcg acagaggatg tgaagaacag gtaaatcacg cagaagaacc      420

catctctgat agcagctatc gattagaaca acgaatccat attgggtccg tgggaaatac      480

ttactgcaca ggaagggggc gatctgacga ggccccgcca ccggcctcga cccgaggccg      540

aggccgacga agcgccggcg agtacggcgc cgcggcggcc tctgcccgtg ccctctgcgc      600

gtgggaggga gaggccgcgg tggtgggggc gcgcgcgcgc gcgcgcgcag ctggtgcggc      660

ggcgcggggg tcagccgccg agccggcggc gacggaggag cagggcggcg tggacgcgaa      720

cttccgatcg gttggtcaga gtgcgcgagt tgggcttagc caattaggtc tcaacaatct      780

attgggccgt aaaattcatg ggccctggtt tgtctaggcc caatatcccg ttcatttcag      840

cccacaaata tttccccaga ggattattaa ggcccacacg cagcttatag cagatcaagt      900

acgatgtttc ctgatcgttg gatcggaaac gtacggtctt gatcaggcat gccgacttcg      960

tcaaagagag gcggcatgac ctgacgcgga gttggttccg ggcaccgtct ggatggtcgt     1020

accgggaccg gacacgtgtc gcgcctccaa ctacatggac acgtgtggtg ctgccattgg     1080

gccgtacgcg tggcggtgac cgcaccggat gctgcctcgc accgccttgc ccacgcttta     1140

tatagagagg ttttctctcc attaatcgca tagcgagtcg aatcgaccga aggggagggg     1200

gagcgaagct ttgcgttctc taatcgcctc gtcaaggtaa ctaatcaatc acctcgtcct     1260

aatcctcgaa tctctcgtgg tgcccgtcta atctcgcgat tttgatgctc gtggtggaaa     1320

gcgtaggagg atcccgtgcg agttagtctc aatctctcag ggtttcgtgc gattttaggg     1380

tgatccacct cttaatcgag ttacggtttc gtgcgatttt agggtaatcc tcttaatctc     1440

tcattgattt agggtttcgt gagaatcgag gtagggatct gtgttattta tatcgatcta     1500

atagatggat tggttttgag attgttctgt cagatgggga ttgtttcgat atattaccct     1560

aatgatgtgt cagatgggga ttgtttcgat atattaccct aatgatgtgt cagatgggga     1620

ttgtttcgat atattaccct aatgatggat aataagagta gttcacagtt atgttttgat     1680

cctgccacat agtttgagtt ttgtgatcag atttagtttt acttatttgt gcttagttcg     1740

gatgggattg ttctgatatt gttccaatag atgaatagct cgttaggtta aaatctttag     1800

gttgagttag gcgacacata gtttatttcc tctggatttg gattggaatt gtgttcttag     1860

tttttttccc ctggatttgg attggaattg tgtggagctg ggttagagaa ttacatctgt     1920

atcgtgtaca cctacttgaa ctgtagagct tgggttctaa ggtcaattta atctgtattg     1980

tatctggctc tttgcctagt tgaactgtag tgctgatgtt gtactgtgtt tttttacccg     2040

ttttatttgc tttactcgtg caaatcaaat ctgtcagatg ctagaactag gtggctttat     2100

tctgtgttct tacatagatc tgttgtcctg tagttactta tgtcagtttt gttattatct     2160

gaagatattt ttggttgttg cttgttgatg tggtgtgagc tgtgagcagc gctcttatga     2220

ttaatgatgc tgtccaattg tagtgtagta tgatgtgatt gatatgttca tctattttga     2280

gctgacagta ccgatatcgt aggatctggt gccaacttat tctccagctg ctttttttta     2340

cctatgttaa ttccaatcct ttcttgcctc ttccagatcc agataatgca gaaactcatt     2400

aactcagtgc aaaactatgc ctggggcagc aaaacggcgt tgactgaact ttatggtatg     2460

gaaaatccgt ccagccagcc gatggccgag ctgtggatgg gcgcacatcc gaaaagcagt     2520

tcacgagtgc agaatgccgc cggagatatc gtttcactgc gtgatgtgat tgagagtgat     2580

aaatcgactc tgctcggaga ggccgttgcc aaacgctttg gcgaactgcc tttcctgttc     2640

aaagtattat gcgcagcaca gccactctcc attcaggttc atccaaacaa acacaattct     2700

gaaatcggtt ttgccaaaga aaatgccgca ggtatcccga tggatgccgc cgagcgtaac     2760

tataaagatc ctaaccacaa gccggagctg gtttttgcgc tgacgccttt ccttgcgatg     2820

aacgcgtttc gtgaattttc cgagattgtc tccctactcc agccggtcgc aggtgcacat     2880

ccggcgattg ctcacttttt acaacagcct gatgccgaac gtttaagcga actgttcgcc     2940

agcctgttga atatgcaggg tgaagaaaaa tcccgcgcgc tggcgatttt aaaatcggcc     3000

ctcgatagcc agcagggtga accgtggcaa acgattcgtt taatttctga attttacccg     3060

gaagacagcg gtctgttctc cccgctattg ctgaatgtgg tgaaattgaa ccctggcgaa     3120

gcgatgttcc tgttcgctga aacaccgcac gcttacctgc aaggcgtggc gctggaagtg     3180

atggcaaact ccgataacgt gctgcgtgcg ggtctgacgc ctaaatacat tgatattccg     3240

gaactggttg ccaatgtgaa attcgaagcc aaaccggcta accagttgtt gacccagccg     3300

gtgaaacaag gtgcagaact ggacttcccg attccagtgg atgattttgc cttctcgctg     3360

catgacctta gtgataaaga aaccaccatt agccagcaga gtgccgccat tttgttctgc     3420

gtcgaaggcg atgcaacgtt gtggaaaggt tctcagcagt tacagcttaa accgggtgaa     3480

tcagcgttta ttgccgccaa cgaatcaccg gtgactgtca aaggccacgg ccgtttagcg     3540

cgtgtttaca acaagctgta agagcttact gaaaaaatta acatctcttg ctaagctggg     3600

agctctagat ccccgaattt ccccgatcgt tcaaacattt ggcaataaag tttcttaaga     3660

ttgaatcctg ttgccggtct tgcgatgatt atcatataat ttctgttgaa ttacgttaag     3720

catgtaataa ttaacatgta atgcatgacg ttatttatga gatgggtttt tatgattaga     3780

gtcccgcaat tatacattta atacgcgata gaaaacaaaa tatagcgcgc aaactaggat     3840

aaattatcgc gcgcggtgtc atctatgtta ctagatcggg aattggcgag ctcgaattaa     3900

ttcagtacat taaaaacgtc cgcaatgtgt tattaagttg tctaagcgtc aatttgttta     3960

caccacaata tatcctgcca ccagccagcc aacagctccc cgaccggcag ctcggcacaa     4020

aatcaccact cgatacaggc agcccatcag tccgggacgg cgtcagcggg agagccgttg     4080

taaggcggca gactttgctc atgttaccga tgctattcgg aagaacggca actaagctgc     4140

cgggtttgaa acacggatga tctcgcggag ggtagcatgt tgattgtaac gatgacagag     4200

cgttgctgcc tgtgatcaaa tatcatctcc ctcgcagaga tccgaattat cagccttctt     4260

attcatttct cgcttaaccg tgacaggctg tcgatcttga gaactatgcc gacataatag     4320

gaaatcgctg gataaagccg ctgaggaagc tgagtggcgc tatttcttta gaagtgaacg     4380

ttgacgatcg tcgaccgtac cccgatgaat taattcggac gtacgttctg aacacagctg     4440

gatacttact tgggcgattg tcatacatga catcaacaat gtacccgttt gtgtaaccgt     4500

ctcttggagg ttcgtatgac actagtggtt cccctcagct tgcgactaga tgttgaggcc     4560

taacatttta ttagagagca ggctagttgc ttagatacat gatcttcagg ccgttatctg     4620

tcagggcaag cgaaaattgg ccatttatga cgaccaatgc cccgcagaag ctcccatctt     4680

tgccgccata gacgccgcgc cccccttttg gggtgtagaa catccttttg ccagatgtgg     4740

aaaagaagtt cgttgtccca ttgttggcaa tgacgtagta gccggcgaaa gtgcgagacc     4800

catttgcgct atatataagc ctacgatttc cgttgcgact attgtcgtaa ttggatgaac     4860

tattatcgta gttgctctca gagttgtcgt aatttgatgg actattgtcg taattgctta     4920

tggagttgtc gtagttgctt ggagaaatgt cgtagttgga tggggagtag tcatagggaa     4980

gacgagcttc atccactaaa acaattggca ggtcagcaag tgcctgcccc gatgccatcg     5040

caagtacgag gcttagaacc accttcaaca gatcgcgcat agtcttcccc agctctctaa     5100

cgcttgagtt aagccgcgcc gcgaagcggc gtcggcttga acgaattgtt agacattatt     5160

tgccgactac cttggtgatc tcgcctttca cgtagtgaac aaattcttcc aactgatctg     5220

cgcgcgaggc caagcgatct tcttgtccaa gataagcctg cctagcttca agtatgacgg     5280

gctgatactg ggccggcagg cgctccattg cccagtcggc agcgacatcc ttcggcgcga     5340

ttttgccggt tactgcgctg taccaaatgc gggacaacgt aagcactaca tttcgctcat     5400

cgccagccca gtcgggcggc gagttccata gcgttaaggt ttcatttagc gcctcaaata     5460

gatcctgttc aggaaccgga tcaaagagtt cctccgccgc tggacctacc aaggcaacgc     5520

tatgttctct tgcttttgtc agcaagatag ccagatcaat gtcgatcgtg gctggctcga     5580

agatacctgc aagaatgtca ttgcgctgcc attctccaaa ttgcagttcg cgcttagctg     5640

gataacgcca cggaatgatg tcgtcgtgca caacaatggt gacttctaca gcgcggagaa     5700

tctcgctctc tccaggggaa gccgaagttt ccaaaaggtc gttgatcaaa gctcgccgcg     5760

ttgtttcatc aagccttacg gtcaccgtaa ccagcaaatc aatatcactg tgtggcttca     5820

ggccgccatc cactgcggag ccgtacaaat gtacggccag caacgtcggt tcgagatggc     5880

gctcgatgac gccaactacc tctgatagtt gagtcgatac ttcggcgatc accgcttccc     5940

tcatgatgtt taactcctga attaagccgc gccgcgaagc ggtgtcggct tgaatgaatt     6000

gttaggcgtc atcctgtgct cccgagaacc agtaccagta catcgctgtt tcgttcgaga     6060

cttgaggtct agttttatac gtgaacaggt caatgccgcc gagagtaaag ccacattttg     6120

cgtacaaatt gcaggcaggt acattgttcg tttgtgtctc taatcgtatg ccaaggagct     6180

gtctgcttag tgcccacttt ttcgcaaatt cgatgagact gtgcgcgact cctttgcctc     6240

ggtgcgtgtg cgacacaaca atgtgttcga tagaggctag atcgttccat gttgagttga     6300

gttcaatctt cccgacaagc tcttggtcga tgaatgcgcc atagcaagca gagtcttcat     6360

cagagtcatc atccgagatg taatccttcc ggtaggggct cacacttctg gtagatagtt     6420

caaagccttg gtcggatagg tgcacatcga acacttcacg aacaatgaaa tggttctcag     6480

catccaatgt ttccgccacc tgctcaggga tcaccgaaat cttcatatga cgcctaacgc     6540

ctggcacagc ggatcgcaaa cctggcgcgg cttttggcac aaaaggcgtg acaggtttgc     6600

gaatccgttg ctgccacttg ttaacccttt tgccagattt ggtaactata atttatgtta     6660

gaggcgaagt cttgggtaaa aactggccta aaattgctgg ggatttcagg aaagtaaaca     6720

tcaccttccg gctcgatgtc tattgtagat atatgtagtg tatctacttg atcgggggat     6780

ctgctgcctc gcgcgtttcg gtgatgacgg tgaaaacctc tgacacatgc agctcccgga     6840

gacggtcaca gcttgtctgt aagcggatgc cgggagcaga caagcccgtc agggcgcgtc     6900

agcgggtgtt ggcgggtgtc ggggcgcagc catgacccag tcacgtagcg atagcggagt     6960

gtatactggc ttaactatgc ggcatcagag cagattgtac tgagagtgca ccatatgcgg     7020

tgtgaaatac cgcacagatg cgtaaggaga aaataccgca tcaggcgctc ttccgcttcc     7080

tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca     7140

aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca     7200

aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg     7260

ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg     7320

acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt     7380

ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt     7440

tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc     7500

tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt     7560

gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt     7620

agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc     7680

tacactagaa ggacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa     7740

agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt     7800

tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct     7860

acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta     7920

tcaaaaagga tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa     7980

agtatatatg agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc     8040

tcagcgatct gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact     8100

acgatacggg agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc     8160

tcaccggctc cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt     8220

ggtcctgcaa ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta     8280

agtagttcgc cagttaatag tttgcgcaac gttgttgcca ttgctgcagg gggggggggg     8340

ggggggttcc attgttcatt ccacggacaa aaacagagaa aggaaacgac agaggccaaa     8400

aagctcgctt tcagcacctg tcgtttcctt tcttttcaga gggtatttta aataaaaaca     8460

ttaagttatg acgaagaaga acggaaacgc cttaaaccgg aaaattttca taaatagcga     8520

aaacccgcga ggtcgccgcc ccgtaacctg tcggatcacc ggaaaggacc cgtaaagtga     8580

taatgattat catctacata tcacaacgtg cgtggaggcc atcaaaccac gtcaaataat     8640

caattatgac gcaggtatcg tattaattga tctgcatcaa cttaacgtaa aaacaacttc     8700

agacaataca aatcagcgac actgaatacg gggcaacctc atgtcccccc cccccccccc     8760

ctgcaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc     8820

aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg     8880

gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag     8940

cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt     9000

actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct tgcccggcgt     9060

caacacggga taataccgcg ccacatagca gaactttaaa agtgctcatc attggaaaac     9120

gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt tcgatgtaac     9180

ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt tctgggtgag     9240

caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa     9300

tactcatact cttccttttt caatattatt gaagcattta tcagggttat tgtctcatga     9360

gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc     9420

cccgaaaagt gccacctgac gtctaagaaa ccattattat catgacatta acctataaaa     9480

ataggcgtat cacgaggccc tttcgtcttc aagaattggt cgacgatctt gctgcgttcg     9540

gatattttcg tggagttccc gccacagacc cggattgaag gcgagatcca gcaactcgcg     9600

ccagatcatc ctgtgacgga actttggcgc gtgatgactg gccaggacgt cggccgaaag     9660

agcgacaagc agatcacgct tttcgacagc gtcggatttg cgatcgagga tttttcggcg     9720

ctgcgctacg tccgcgaccg cgttgaggga tcaagccaca gcagcccact cgaccttcta     9780

gccgacccag acgagccaag ggatcttttt ggaatgctgc tccgtcgtca ggctttccga     9840

cgtttgggtg gttgaacaga agtcattatc gcacggaatg ccaagcactc ccgaggggaa     9900

ccctgtggtt ggcatgcaca tacaaatgga cgaacggata aaccttttca cgccctttta     9960

aatatccgat tattctaata aacgctcttt tctcttaggt ttacccgcca atatatcctg    10020

tcaaacactg atagtttaaa ctgaaggcgg gaaacgacaa cctgatcatg agcggagaat    10080

taagggagtc acgttatgac ccccgccgat gacgcgggac aagccgtttt acgtttggaa    10140

ctgacagaac cgcaacgttg aaggagccac tcagcttaat taagtctaac tcgagttact    10200

ggtacgtacc aaatccatgg aatcaaggta ccctgcagtg cagcgtgacc cggtcgtgcc    10260

cctctctaga gataatgagc attgcatgtc taagttataa aaaattacca catatttttt    10320

ttgtcacact tgtttgaagt gcagtttatc tatctttata catatattta aactttactc    10380

tacgaataat ataatctata gtactacaat aatatcagtg ttttagagaa tcatataaat    10440

gaacagttag acatggtcta aaggacaatt gagtattttg acaacaggac tctacagttt    10500

tatcttttta gtgtgcatgt gttctccttt ttttttgcaa atagcttcac ctatataata    10560

cttcatccat tttattagta catccattta gggtttaggg ttaatggttt ttatagacta    10620

atttttttag tacatctatt ttattctatt ttagcctcta aattaagaaa actaaaactc    10680

tattttagtt tttttattta ataatttaga tataaaatag aataaaataa agtgactaaa    10740

aattaaacaa atacccttta agaaattaaa aaaactaagg aaacattttt cttgtttcga    10800

gtagataatg ccagcctgtt aaacgccgtc gacgagtcta acggacacca accagcgaac    10860

cagcagcgtc gcgtcgggcc aagcgaagca gacggcacgg catctctgtc gctgcctctg    10920

gacccctctc gagagttccg ctccaccgtt ggacttgctc cgctgtcggc atccagaaat    10980

tgcgtggcgg agcggcagac gtgagccggc acggcaggcg gcctcctcct cctctcacgg    11040

cacggcagct acgggggatt cctttcccac cgctccttcg ctttcccttc ctcgcccgcc    11100

gtaataaata gacaccccct ccacaccctc tttccccaac ctcgtgttgt tcggagcgca    11160

cacacacaca accagatctc ccccaaatcc acccgtcggc acctccgctt caaggtacgc    11220

cgctcgtcct cccccccccc ccctctctac cttctctaga tcggcgttcc ggtccatggt    11280

tagggcccgg tagttctact tctgttcatg tttgtgttag atccgtgttt gtgttagatc    11340

cgtgctgcta gcgttcgtac acggatgcga cctgtacgtc agacacgttc tgattgctaa    11400

cttgccagtg tttctctttg gggaatcctg ggatggctct agccgttccg cagacgggat    11460

cgatttcatg attttttttg tttcgttgca tagggtttgg tttgcccttt tcctttattt    11520

caatatatgc cgtgcacttg tttgtcgggt catcttttca tgcttttttt tgtcttggtt    11580

gtgatgatgt ggtctggttg ggcggtcgtt ctagatcgga gtagaattct gtttcaaact    11640

acctggtgga tttattaatt ttggatctgt atgtgtgtgc catacatatt catagttacg    11700

aattgaagat gatggatgga aatatcgatc taggataggt atacatgttg atgcgggttt    11760

tactgatgca tatacagaga tgctttttgt tcgcttggtt gtgatgatgt ggtgtggttg    11820

ggcggtcgtt cattcgttct agatcggagt agaatactgt ttcaaactac ctggtgtatt    11880

tattaatttt ggaactgtat gtgtgtgtca tacatcttca tagttacgag tttaagatgg    11940

atggaaatat cgatctagga taggtataca tgttgatgtg ggttttactg atgcatatac    12000

atgatggcat atgcagcatc tattcatatg ctctaacctt gagtacctat ctattataat    12060

aaacaagtat gttttataat tattttgatc ttgatatact tggatgatgg catatgcagc    12120

agctatatgt ggattttttt agccctgcct tcatacgcta tttatttgct tggtactgtt    12180

tcttttgtcg atgctcaccc tgttgtttgg tgttacttct gcagatccag atcggatcct    12240

aaaccatggc gaacaaacat ttgtccctct ccctcttcct cgtcctcctt ggcctgtcgg    12300

ccagcttggc ctccgggcaa caaacaagca ttactctgac atccaacgca tccggtacgt    12360

ttgacggtta ctattacgaa ctctggaagg atactggcaa tacaacaatg acggtctaca    12420

ctcaaggtcg cttttcctgc cagtggtcga acatcaataa cgcgttgttt aggaccggga    12480

agaaatacaa ccagaattgg cagtctcttg gcacaatccg gatcacgtac tctgcgactt    12540

acaacccaaa cgggaactcc tacttgtgta tctatggctg gtctaccaac ccattggtcg    12600

agttctacat cgttgagtcc tgggggaact ggagaccgcc tggtgcctgc ctggccgagg    12660

gctcgctcgt cttggacgcg gctaccgggc agagggtccc tatcgaaaag gtgcgtccgg    12720

ggatggaagt tttctccttg ggacctgatt acagactgta tcgggtgccc gttttggagg    12780

tccttgagag cggggttagg gaagttgtgc gcctcagaac tcggtcaggg agaacgctgg    12840

tgttgacacc agatcacccg cttttgaccc ccgaaggttg gaaacctctt tgtgacctcc    12900

cgcttggaac tccaattgca gtccccgcag aactgcctgt ggcgggccac ttggccccac    12960

ctgaagaacg tgttacgctc ctggctcttc tgttggggga tgggaacaca aagctgtcgg    13020

gtcggagagg tacacgtcct aatgccttct tctacagcaa aaaccccgaa ttgctcgcgg    13080

cttatcgccg gtgtgcagaa gccttgggtg caaaggtgaa agcatacgtc cacccgacta    13140

cgggggtggt tacactcgca accctcgctc cacgtcctgg agctcaagat cctgtcaaac    13200

gcctcgttgt cgaggcggga atggttgcta aagccgaaga gaagagggtc ccggaggagg    13260

tgtttcgtta ccggcgtgag gcgttggccc ttttcttggg ccgtttgttc tcgacagacg    13320

gctctgttga aaagaagagg atctcttatt caagtgccag tttgggactg gcccaggatg    13380

tcgcacatct cttgctgcgc cttggaatta catctcaact ccgttcgaga gggccacggg    13440

ctcacgaggt tcttatatcg ggccgcgagg atattttgcg gtttgctgaa cttatcggac    13500

cctacctctt gggggccaag agggagagac ttgcagcgct ggaagctgag gcccgcaggc    13560

gtttgcctgg acagggatgg cacttgcggc ttgttcttcc tgccgtggcg tacagagtgg    13620

gcgaggctga aaggcgctcg ggattttcgt ggagtgaagc cggtcggcgc gtcgcagttg    13680

cgggatcgtg tttgtcatct ggactcaacc tcaaattgcc cagacgctac ctttctcggc    13740

accggttgtc gctgctcggt gaggcttttg ccgaccctgg gctggaagcg ctcgcggaag    13800

gccaagtgct ctgggaccct attgttgctg tcgaaccggc cggtaaggcg agaacattcg    13860

acttgcgcgt tccacccttt gcaaacttcg tgagcgagga cctggtggtg cataacaccg    13920

tccccctggg ccaagtgaca atcgatggcg ggacctacga catctatagg acgacacgcg    13980

tcaaccagcc ttccattgtg gggacagcca cgttcgatca gtactggagc gtgcgcacct    14040

ctaagcggac ttcaggaaca gtgaccgtga ccgatcactt ccgcgcctgg gcgaaccggg    14100

gcctgaacct cggcacaata gaccaaatta cattgtgcgt ggagggttac caaagctctg    14160

gatcagccaa catcacccag aacaccttct ctcagggctc ttcttccggc agttcgggtg    14220

gctcatccgg ctccacaacg actactcgca tcgagtgtga gaacatgtcc ttgtccggac    14280

cctacgttag caggatcacc aatcccttta atggtattgc gctgtacgcc aacggagaca    14340

cagcccgcgc taccgttaac ttccccgcaa gtcgcaacta caatttccgc ctgcggggtt    14400

gcggcaacaa caataatctt gcccgtgtgg acctgaggat cgacggacgg accgtcggga    14460

ccttttatta ccagggcaca tacccctggg aggccccaat tgacaatgtt tatgtcagtg    14520

cggggagtca tacagtcgaa atcactgtta ctgcggataa cggcacatgg gacgtgtatg    14580

ccgactacct ggtgatacag agcgagaagg acgagctgtg ac                       14622


<210>  71
<211>  14531
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, pAG4004

<400>  71
ctaggtcccc gaatttcccc gatcgttcaa acatttggca ataaagtttc ttaagattga       60

atcctgttgc cggtcttgcg atgattatca tataatttct gttgaattac gttaagcatg      120

taataattaa catgtaatgc atgacgttat ttatgagatg ggtttttatg attagagtcc      180

cgcaattata catttaatac gcgatagaaa acaaaatata gcgcgcaaac taggataaat      240

tatcgcgcgc ggtgtcatct atgttactag atcgggaatt ggaattcctg cagtgcagcg      300

tgacccggtc gtgcccctct ctagagataa tgagcattgc atgtctaagt tataaaaaat      360

taccacatat tttttttgtc acacttgttt gaagtgcagt ttatctatct ttatacatat      420

atttaaactt tactctacga ataatataat ctatagtact acaataatat cagtgtttta      480

gagaatcata taaatgaaca gttagacatg gtctaaagga caattgagta ttttgacaac      540

aggactctac agttttatct ttttagtgtg catgtgttct cctttttttt tgcaaatagc      600

ttcacctata taatacttca tccattttat tagtacatcc atttagggtt tagggttaat      660

ggtttttata gactaatttt tttagtacat ctattttatt ctattttagc ctctaaatta      720

agaaaactaa aactctattt tagttttttt atttaataat ttagatataa aatagaataa      780

aataaagtga ctaaaaatta aacaaatacc ctttaagaaa ttaaaaaaac taaggaaaca      840

tttttcttgt ttcgagtaga taatgccagc ctgttaaacg ccgtcgacga gtctaacgga      900

caccaaccag cgaaccagca gcgtcgcgtc gggccaagcg aagcagacgg cacggcatct      960

ctgtcgctgc ctctggaccc ctctcgagag ttccgctcca ccgttggact tgctccgctg     1020

tcggcatcca gaaattgcgt ggcggagcgg cagacgtgag ccggcacggc aggcggcctc     1080

ctcctcctct cacggcacgg cagctacggg ggattccttt cccaccgctc cttcgctttc     1140

ccttcctcgc ccgccgtaat aaatagacac cccctccaca ccctctttcc ccaacctcgt     1200

gttgttcgga gcgcacacac acacaaccag atctccccca aatccacccg tcggcacctc     1260

cgcttcaagg tacgccgctc gtcctccccc cccccccctc tctaccttct ctagatcggc     1320

gttccggtcc atggttaggg cccggtagtt ctacttctgt tcatgtttgt gttagatccg     1380

tgtttgtgtt agatccgtgc tgctagcgtt cgtacacgga tgcgacctgt acgtcagaca     1440

cgttctgatt gctaacttgc cagtgtttct ctttggggaa tcctgggatg gctctagccg     1500

ttccgcagac gggatcgatt tcatgatttt ttttgtttcg ttgcataggg tttggtttgc     1560

ccttttcctt tatttcaata tatgccgtgc acttgtttgt cgggtcatct tttcatgctt     1620

ttttttgtct tggttgtgat gatgtggtct ggttgggcgg tcgttctaga tcggagtaga     1680

attctgtttc aaactacctg gtggatttat taattttgga tctgtatgtg tgtgccatac     1740

atattcatag ttacgaattg aagatgatgg atggaaatat cgatctagga taggtataca     1800

tgttgatgcg ggttttactg atgcatatac agagatgctt tttgttcgct tggttgtgat     1860

gatgtggtgt ggttgggcgg tcgttcattc gttctagatc ggagtagaat actgtttcaa     1920

actacctggt gtatttatta attttggaac tgtatgtgtg tgtcatacat cttcatagtt     1980

acgagtttaa gatggatgga aatatcgatc taggataggt atacatgttg atgtgggttt     2040

tactgatgca tatacatgat ggcatatgca gcatctattc atatgctcta accttgagta     2100

cctatctatt ataataaaca agtatgtttt ataattattt tgatcttgat atacttggat     2160

gatggcatat gcagcagcta tatgtggatt tttttagccc tgccttcata cgctatttat     2220

ttgcttggta ctgtttcttt tgtcgatgct caccctgttg tttggtgtta cttctgcaga     2280

tccagatcta aaccatgcag aaactcatta actcagtgca aaactatgcc tggggcagca     2340

aaacggcgtt gactgaactt tatggtatgg aaaatccgtc cagccagccg atggccgagc     2400

tgtggatggg cgcacatccg aaaagcagtt cacgagtgca gaatgccgcc ggagatatcg     2460

tttcactgcg tgatgtgatt gagagtgata aatcgactct gctcggagag gccgttgcca     2520

aacgctttgg cgaactgcct ttcctgttca aagtattatg cgcagcacag ccactctcca     2580

ttcaggttca tccaaacaaa cacaattctg aaatcggttt tgccaaagaa aatgccgcag     2640

gtatcccgat ggatgccgcc gagcgtaact ataaagatcc taaccacaag ccggagctgg     2700

tttttgcgct gacgcctttc cttgcgatga acgcgtttcg tgaattttcc gagattgtct     2760

ccctactcca gccggtcgca ggtgcacatc cggcgattgc tcacttttta caacagcctg     2820

atgccgaacg tttaagcgaa ctgttcgcca gcctgttgaa tatgcagggt gaagaaaaat     2880

cccgcgcgct ggcgatttta aaatcggccc tcgatagcca gcagggtgaa ccgtggcaaa     2940

cgattcgttt aatttctgaa ttttacccgg aagacagcgg tctgttctcc ccgctattgc     3000

tgaatgtggt gaaattgaac cctggcgaag cgatgttcct gttcgctgaa acaccgcacg     3060

cttacctgca aggcgtggcg ctggaagtga tggcaaactc cgataacgtg ctgcgtgcgg     3120

gtctgacgcc taaatacatt gatattccgg aactggttgc caatgtgaaa ttcgaagcca     3180

aaccggctaa ccagttgttg acccagccgg tgaaacaagg tgcagaactg gacttcccga     3240

ttccagtgga tgattttgcc ttctcgctgc atgaccttag tgataaagaa accaccatta     3300

gccagcagag tgccgccatt ttgttctgcg tcgaaggcga tgcaacgttg tggaaaggtt     3360

ctcagcagtt acagcttaaa ccgggtgaat cagcgtttat tgccgccaac gaatcaccgg     3420

tgactgtcaa aggccacggc cgtttagcgc gtgtttacaa caagctgtaa gagcttactg     3480

aaaaaattaa catctcttgc taagctggga gctctagatc cccgaatttc cccgatcgtt     3540

caaacatttg gcaataaagt ttcttaagat tgaatcctgt tgccggtctt gcgatgatta     3600

tcatataatt tctgttgaat tacgttaagc atgtaataat taacatgtaa tgcatgacgt     3660

tatttatgag atgggttttt atgattagag tcccgcaatt atacatttaa tacgcgatag     3720

aaaacaaaat atagcgcgca aactaggata aattatcgcg cgcggtgtca tctatgttac     3780

tagatcggga attggcgagc tcgaattaat tcagtacatt aaaaacgtcc gcaatgtgtt     3840

attaagttgt ctaagcgtca atttgtttac accacaatat atcctgccac cagccagcca     3900

acagctcccc gaccggcagc tcggcacaaa atcaccactc gatacaggca gcccatcagt     3960

ccgggacggc gtcagcggga gagccgttgt aaggcggcag actttgctca tgttaccgat     4020

gctattcgga agaacggcaa ctaagctgcc gggtttgaaa cacggatgat ctcgcggagg     4080

gtagcatgtt gattgtaacg atgacagagc gttgctgcct gtgatcaaat atcatctccc     4140

tcgcagagat ccgaattatc agccttctta ttcatttctc gcttaaccgt gacaggctgt     4200

cgatcttgag aactatgccg acataatagg aaatcgctgg ataaagccgc tgaggaagct     4260

gagtggcgct atttctttag aagtgaacgt tgacgatcgt cgaccgtacc ccgatgaatt     4320

aattcggacg tacgttctga acacagctgg atacttactt gggcgattgt catacatgac     4380

atcaacaatg tacccgtttg tgtaaccgtc tcttggaggt tcgtatgaca ctagtggttc     4440

ccctcagctt gcgactagat gttgaggcct aacattttat tagagagcag gctagttgct     4500

tagatacatg atcttcaggc cgttatctgt cagggcaagc gaaaattggc catttatgac     4560

gaccaatgcc ccgcagaagc tcccatcttt gccgccatag acgccgcgcc ccccttttgg     4620

ggtgtagaac atccttttgc cagatgtgga aaagaagttc gttgtcccat tgttggcaat     4680

gacgtagtag ccggcgaaag tgcgagaccc atttgcgcta tatataagcc tacgatttcc     4740

gttgcgacta ttgtcgtaat tggatgaact attatcgtag ttgctctcag agttgtcgta     4800

atttgatgga ctattgtcgt aattgcttat ggagttgtcg tagttgcttg gagaaatgtc     4860

gtagttggat ggggagtagt catagggaag acgagcttca tccactaaaa caattggcag     4920

gtcagcaagt gcctgccccg atgccatcgc aagtacgagg cttagaacca ccttcaacag     4980

atcgcgcata gtcttcccca gctctctaac gcttgagtta agccgcgccg cgaagcggcg     5040

tcggcttgaa cgaattgtta gacattattt gccgactacc ttggtgatct cgcctttcac     5100

gtagtgaaca aattcttcca actgatctgc gcgcgaggcc aagcgatctt cttgtccaag     5160

ataagcctgc ctagcttcaa gtatgacggg ctgatactgg gccggcaggc gctccattgc     5220

ccagtcggca gcgacatcct tcggcgcgat tttgccggtt actgcgctgt accaaatgcg     5280

ggacaacgta agcactacat ttcgctcatc gccagcccag tcgggcggcg agttccatag     5340

cgttaaggtt tcatttagcg cctcaaatag atcctgttca ggaaccggat caaagagttc     5400

ctccgccgct ggacctacca aggcaacgct atgttctctt gcttttgtca gcaagatagc     5460

cagatcaatg tcgatcgtgg ctggctcgaa gatacctgca agaatgtcat tgcgctgcca     5520

ttctccaaat tgcagttcgc gcttagctgg ataacgccac ggaatgatgt cgtcgtgcac     5580

aacaatggtg acttctacag cgcggagaat ctcgctctct ccaggggaag ccgaagtttc     5640

caaaaggtcg ttgatcaaag ctcgccgcgt tgtttcatca agccttacgg tcaccgtaac     5700

cagcaaatca atatcactgt gtggcttcag gccgccatcc actgcggagc cgtacaaatg     5760

tacggccagc aacgtcggtt cgagatggcg ctcgatgacg ccaactacct ctgatagttg     5820

agtcgatact tcggcgatca ccgcttccct catgatgttt aactcctgaa ttaagccgcg     5880

ccgcgaagcg gtgtcggctt gaatgaattg ttaggcgtca tcctgtgctc ccgagaacca     5940

gtaccagtac atcgctgttt cgttcgagac ttgaggtcta gttttatacg tgaacaggtc     6000

aatgccgccg agagtaaagc cacattttgc gtacaaattg caggcaggta cattgttcgt     6060

ttgtgtctct aatcgtatgc caaggagctg tctgcttagt gcccactttt tcgcaaattc     6120

gatgagactg tgcgcgactc ctttgcctcg gtgcgtgtgc gacacaacaa tgtgttcgat     6180

agaggctaga tcgttccatg ttgagttgag ttcaatcttc ccgacaagct cttggtcgat     6240

gaatgcgcca tagcaagcag agtcttcatc agagtcatca tccgagatgt aatccttccg     6300

gtaggggctc acacttctgg tagatagttc aaagccttgg tcggataggt gcacatcgaa     6360

cacttcacga acaatgaaat ggttctcagc atccaatgtt tccgccacct gctcagggat     6420

caccgaaatc ttcatatgac gcctaacgcc tggcacagcg gatcgcaaac ctggcgcggc     6480

ttttggcaca aaaggcgtga caggtttgcg aatccgttgc tgccacttgt taaccctttt     6540

gccagatttg gtaactataa tttatgttag aggcgaagtc ttgggtaaaa actggcctaa     6600

aattgctggg gatttcagga aagtaaacat caccttccgg ctcgatgtct attgtagata     6660

tatgtagtgt atctacttga tcgggggatc tgctgcctcg cgcgtttcgg tgatgacggt     6720

gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta agcggatgcc     6780

gggagcagac aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg gggcgcagcc     6840

atgacccagt cacgtagcga tagcggagtg tatactggct taactatgcg gcatcagagc     6900

agattgtact gagagtgcac catatgcggt gtgaaatacc gcacagatgc gtaaggagaa     6960

aataccgcat caggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc     7020

ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag     7080

gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa     7140

aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc     7200

gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc     7260

ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg     7320

cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg tatctcagtt     7380

cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc     7440

gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc     7500

cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag     7560

agttcttgaa gtggtggcct aactacggct acactagaag gacagtattt ggtatctgcg     7620

ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa     7680

ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag     7740

gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact     7800

cacgttaagg gattttggtc atgagattat caaaaaggat cttcacctag atccttttaa     7860

attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt     7920

accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag     7980

ttgcctgact ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca     8040

gtgctgcaat gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc     8100

agccagccgg aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt     8160

ctattaattg ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg     8220

ttgttgccat tgctgcaggg gggggggggg gggggttcca ttgttcattc cacggacaaa     8280

aacagagaaa ggaaacgaca gaggccaaaa agctcgcttt cagcacctgt cgtttccttt     8340

cttttcagag ggtattttaa ataaaaacat taagttatga cgaagaagaa cggaaacgcc     8400

ttaaaccgga aaattttcat aaatagcgaa aacccgcgag gtcgccgccc cgtaacctgt     8460

cggatcaccg gaaaggaccc gtaaagtgat aatgattatc atctacatat cacaacgtgc     8520

gtggaggcca tcaaaccacg tcaaataatc aattatgacg caggtatcgt attaattgat     8580

ctgcatcaac ttaacgtaaa aacaacttca gacaatacaa atcagcgaca ctgaatacgg     8640

ggcaacctca tgtccccccc cccccccccc tgcaggcatc gtggtgtcac gctcgtcgtt     8700

tggtatggct tcattcagct ccggttccca acgatcaagg cgagttacat gatcccccat     8760

gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc gttgtcagaa gtaagttggc     8820

cgcagtgtta tcactcatgg ttatggcagc actgcataat tctcttactg tcatgccatc     8880

cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag tcattctgag aatagtgtat     8940

gcggcgaccg agttgctctt gcccggcgtc aacacgggat aataccgcgc cacatagcag     9000

aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct caaggatctt     9060

accgctgttg agatccagtt cgatgtaacc cactcgtgca cccaactgat cttcagcatc     9120

ttttactttc accagcgttt ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa     9180

gggaataagg gcgacacgga aatgttgaat actcatactc ttcctttttc aatattattg     9240

aagcatttat cagggttatt gtctcatgag cggatacata tttgaatgta tttagaaaaa     9300

taaacaaata ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg tctaagaaac     9360

cattattatc atgacattaa cctataaaaa taggcgtatc acgaggccct ttcgtcttca     9420

agaattggtc gacgatcttg ctgcgttcgg atattttcgt ggagttcccg ccacagaccc     9480

ggattgaagg cgagatccag caactcgcgc cagatcatcc tgtgacggaa ctttggcgcg     9540

tgatgactgg ccaggacgtc ggccgaaaga gcgacaagca gatcacgctt ttcgacagcg     9600

tcggatttgc gatcgaggat ttttcggcgc tgcgctacgt ccgcgaccgc gttgagggat     9660

caagccacag cagcccactc gaccttctag ccgacccaga cgagccaagg gatctttttg     9720

gaatgctgct ccgtcgtcag gctttccgac gtttgggtgg ttgaacagaa gtcattatcg     9780

cacggaatgc caagcactcc cgaggggaac cctgtggttg gcatgcacat acaaatggac     9840

gaacggataa accttttcac gcccttttaa atatccgatt attctaataa acgctctttt     9900

ctcttaggtt tacccgccaa tatatcctgt caaacactga tagtttaaac tgaaggcggg     9960

aaacgacaac ctgatcatga gcggagaatt aagggagtca cgttatgacc cccgccgatg    10020

acgcgggaca agccgtttta cgtttggaac tgacagaacc gcaacgttga aggagccact    10080

cagcttaatt aagtctaact cgagttactg gtacgtacca aatccatgga atcaaggtac    10140

cctgcagtgc agcgtgaccc ggtcgtgccc ctctctagag ataatgagca ttgcatgtct    10200

aagttataaa aaattaccac atattttttt tgtcacactt gtttgaagtg cagtttatct    10260

atctttatac atatatttaa actttactct acgaataata taatctatag tactacaata    10320

atatcagtgt tttagagaat catataaatg aacagttaga catggtctaa aggacaattg    10380

agtattttga caacaggact ctacagtttt atctttttag tgtgcatgtg ttctcctttt    10440

tttttgcaaa tagcttcacc tatataatac ttcatccatt ttattagtac atccatttag    10500

ggtttagggt taatggtttt tatagactaa tttttttagt acatctattt tattctattt    10560

tagcctctaa attaagaaaa ctaaaactct attttagttt ttttatttaa taatttagat    10620

ataaaataga ataaaataaa gtgactaaaa attaaacaaa taccctttaa gaaattaaaa    10680

aaactaagga aacatttttc ttgtttcgag tagataatgc cagcctgtta aacgccgtcg    10740

acgagtctaa cggacaccaa ccagcgaacc agcagcgtcg cgtcgggcca agcgaagcag    10800

acggcacggc atctctgtcg ctgcctctgg acccctctcg agagttccgc tccaccgttg    10860

gacttgctcc gctgtcggca tccagaaatt gcgtggcgga gcggcagacg tgagccggca    10920

cggcaggcgg cctcctcctc ctctcacggc acggcagcta cgggggattc ctttcccacc    10980

gctccttcgc tttcccttcc tcgcccgccg taataaatag acaccccctc cacaccctct    11040

ttccccaacc tcgtgttgtt cggagcgcac acacacacaa ccagatctcc cccaaatcca    11100

cccgtcggca cctccgcttc aaggtacgcc gctcgtcctc cccccccccc cctctctacc    11160

ttctctagat cggcgttccg gtccatggtt agggcccggt agttctactt ctgttcatgt    11220

ttgtgttaga tccgtgtttg tgttagatcc gtgctgctag cgttcgtaca cggatgcgac    11280

ctgtacgtca gacacgttct gattgctaac ttgccagtgt ttctctttgg ggaatcctgg    11340

gatggctcta gccgttccgc agacgggatc gatttcatga ttttttttgt ttcgttgcat    11400

agggtttggt ttgccctttt cctttatttc aatatatgcc gtgcacttgt ttgtcgggtc    11460

atcttttcat gctttttttt gtcttggttg tgatgatgtg gtctggttgg gcggtcgttc    11520

tagatcggag tagaattctg tttcaaacta cctggtggat ttattaattt tggatctgta    11580

tgtgtgtgcc atacatattc atagttacga attgaagatg atggatggaa atatcgatct    11640

aggataggta tacatgttga tgcgggtttt actgatgcat atacagagat gctttttgtt    11700

cgcttggttg tgatgatgtg gtgtggttgg gcggtcgttc attcgttcta gatcggagta    11760

gaatactgtt tcaaactacc tggtgtattt attaattttg gaactgtatg tgtgtgtcat    11820

acatcttcat agttacgagt ttaagatgga tggaaatatc gatctaggat aggtatacat    11880

gttgatgtgg gttttactga tgcatataca tgatggcata tgcagcatct attcatatgc    11940

tctaaccttg agtacctatc tattataata aacaagtatg ttttataatt attttgatct    12000

tgatatactt ggatgatggc atatgcagca gctatatgtg gattttttta gccctgcctt    12060

catacgctat ttatttgctt ggtactgttt cttttgtcga tgctcaccct gttgtttggt    12120

gttacttctg cagatccaga tcggatccta aaccatggcg aacaaacatt tgtccctctc    12180

cctcttcctc gtcctccttg gcctgtcggc cagcttggcc tccgggcaac aaacaagcat    12240

tactctgaca tccaacgcat ccggtacgtt tgacggttac tattacgaac tctggaagga    12300

tactggcaat acaacaatga cggtctacac tcaaggtcgc ttttcctgcc agtggtcgaa    12360

catcaataac gcgttgttta ggaccgggaa gaaatacaac cagaattggc agtctcttgg    12420

cacaatccgg atcacgtact ctgcgactta caacccaaac gggaactcct acttgtgtat    12480

ctatggctgg tctaccaacc cattggtcga gttctacatc gttgagtcct gggggaactg    12540

gagaccgcct ggtgcctgcc tggccgaggg ctcgctcgtc ttggacgcgg ctaccgggca    12600

gagggtccct atcgaaaagg tgcgtccggg gatggaagtt ttctccttgg gacctgatta    12660

cagactgtat cgggtgcccg ttttggaggt ccttgagagc ggggttaggg aagttgtgcg    12720

cctcagaact cggtcaggga gaacgctggt gttgacacca gatcacccgc ttttgacccc    12780

cgaaggttgg aaacctcttt gtgacctccc gcttggaact ccaattgcag tccccgcaga    12840

actgcctgtg gcgggccact tggccccacc tgaagaacgt gttacgctcc tggctcttct    12900

gttgggggat gggaacacaa agctgtcggg tcggagaggt acacgtccta atgccttctt    12960

ctacagcaaa aaccccgaat tgctcgcggc ttatcgccgg tgtgcagaag ccttgggtgc    13020

aaaggtgaaa gcatacgtcc acccgactac gggggtggtt acactcgcaa ccctcgctcc    13080

acgtcctgga gctcaagatc ctgtcaaacg cctcgttgtc gaggcgggaa tggttgctaa    13140

agccgaagag aagagggtcc cggaggaggt gtttcgttac cggcgtgagg cgttggccct    13200

tttcttgggc cgtttgttct cgacagacgg ctctgttgaa aagaagagga tctcttattc    13260

aagtgccagt ttgggactgg cccaggatgt cgcacatctc ttgctgcgcc ttggaattac    13320

atctcaactc cgttcgagag ggccacgggc tcacgaggtt cttatatcgg gccgcgagga    13380

tattttgcgg tttgctgaac ttatcggacc ctacctcttg ggggccaaga gggagagact    13440

tgcagcgctg gaagctgagg cccgcaggcg tttgcctgga cagggatggc acttgcggct    13500

tgttcttcct gccgtggcgt acagagtggg cgaggctgaa aggcgctcgg gattttcgtg    13560

gagtgaagcc ggtcggcgcg tcgcagttgc gggatcgtgt ttgtcatctg gactcaacct    13620

caaattgccc agacgctacc tttctcggca ccggttgtcg ctgctcggtg aggcttttgc    13680

cgaccctggg ctggaagcgc tcgcggaagg ccaagtgctc tgggacccta ttgttgctgt    13740

cgaaccggcc ggtaaggcga gaacattcga cttgcgcgtt ccaccctttg caaacttcgt    13800

gagcgaggac ctggtggtgc ataacaccgt ccccctgggc caagtgacaa tcgatggcgg    13860

gacctacgac atctatagga cgacacgcgt caaccagcct tccattgtgg ggacagccac    13920

gttcgatcag tactggagcg tgcgcacctc taagcggact tcaggaacag tgaccgtgac    13980

cgatcacttc cgcgcctggg cgaaccgggg cctgaacctc ggcacaatag accaaattac    14040

attgtgcgtg gagggttacc aaagctctgg atcagccaac atcacccaga acaccttctc    14100

tcagggctct tcttccggca gttcgggtgg ctcatccggc tccacaacga ctactcgcat    14160

cgagtgtgag aacatgtcct tgtccggacc ctacgttagc aggatcacca atccctttaa    14220

tggtattgcg ctgtacgcca acggagacac agcccgcgct accgttaact tccccgcaag    14280

tcgcaactac aatttccgcc tgcggggttg cggcaacaac aataatcttg cccgtgtgga    14340

cctgaggatc gacggacgga ccgtcgggac cttttattac cagggcacat acccctggga    14400

ggccccaatt gacaatgttt atgtcagtgc ggggagtcat acagtcgaaa tcactgttac    14460

tgcggataac ggcacatggg acgtgtatgc cgactacctg gtgatacaga gcgagaagga    14520

cgagctgtga c                                                         14531


<210>  72
<211>  14593
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, pAG2227

<400>  72
aattcatact aaagcttgca tgcctgcagg tcgactctag taacggccgc cagtgtgctg       60

gaattaattc ggcttgtcga ccacccaacc ccatatcgac agaggatgtg aagaacaggt      120

aaatcacgca gaagaaccca tctctgatag cagctatcga ttagaacaac gaatccatat      180

tgggtccgtg ggaaatactt actgcacagg aagggggcga tctgacgagg ccccgccacc      240

ggcctcgacc cgaggccgag gccgacgaag cgccggcgag tacggcgccg cggcggcctc      300

tgcccgtgcc ctctgcgcgt gggagggaga ggccgcggtg gtgggggcgc gcgcgcgcgc      360

gcgcgcagct ggtgcggcgg cgcgggggtc agccgccgag ccggcggcga cggaggagca      420

gggcggcgtg gacgcgaact tccgatcggt tggtcagagt gcgcgagttg ggcttagcca      480

attaggtctc aacaatctat tgggccgtaa aattcatggg ccctggtttg tctaggccca      540

atatcccgtt catttcagcc cacaaatatt tccccagagg attattaagg cccacacgca      600

gcttatagca gatcaagtac gatgtttcct gatcgttgga tcggaaacgt acggtcttga      660

tcaggcatgc cgacttcgtc aaagagaggc ggcatgacct gacgcggagt tggttccggg      720

caccgtctgg atggtcgtac cgggaccgga cacgtgtcgc gcctccaact acatggacac      780

gtgtggtgct gccattgggc cgtacgcgtg gcggtgaccg caccggatgc tgcctcgcac      840

cgccttgccc acgctttata tagagaggtt ttctctccat taatcgcata gcgagtcgaa      900

tcgaccgaag gggaggggga gcgaagcttt gcgttctcta atcgcctcgt caaggtaact      960

aatcaatcac ctcgtcctaa tcctcgaatc tctcgtggtg cccgtctaat ctcgcgattt     1020

tgatgctcgt ggtggaaagc gtaggaggat cccgtgcgag ttagtctcaa tctctcaggg     1080

tttcgtgcga ttttagggtg atccacctct taatcgagtt acggtttcgt gcgattttag     1140

ggtaatcctc ttaatctctc attgatttag ggtttcgtga gaatcgaggt agggatctgt     1200

gttatttata tcgatctaat agatggattg gttttgagat tgttctgtca gatggggatt     1260

gtttcgatat attaccctaa tgatgtgtca gatggggatt gtttcgatat attaccctaa     1320

tgatgtgtca gatggggatt gtttcgatat attaccctaa tgatggataa taagagtagt     1380

tcacagttat gttttgatcc tgccacatag tttgagtttt gtgatcagat ttagttttac     1440

ttatttgtgc ttagttcgga tgggattgtt ctgatattgt tccaatagat gaatagctcg     1500

ttaggttaaa atctttaggt tgagttaggc gacacatagt ttatttcctc tggatttgga     1560

ttggaattgt gttcttagtt tttttcccct ggatttggat tggaattgtg tggagctggg     1620

ttagagaatt acatctgtat cgtgtacacc tacttgaact gtagagcttg ggttctaagg     1680

tcaatttaat ctgtattgta tctggctctt tgcctagttg aactgtagtg ctgatgttgt     1740

actgtgtttt tttacccgtt ttatttgctt tactcgtgca aatcaaatct gtcagatgct     1800

agaactaggt ggctttattc tgtgttctta catagatctg ttgtcctgta gttacttatg     1860

tcagttttgt tattatctga agatattttt ggttgttgct tgttgatgtg gtgtgagctg     1920

tgagcagcgc tcttatgatt aatgatgctg tccaattgta gtgtagtatg atgtgattga     1980

tatgttcatc tattttgagc tgacagtacc gatatcgtag gatctggtgc caacttattc     2040

tccagctgct tttttttacc tatgttaatt ccaatccttt cttgcctctt ccagatccag     2100

ataatgcaga aactcattaa ctcagtgcaa aactatgcct ggggcagcaa aacggcgttg     2160

actgaacttt atggtatgga aaatccgtcc agccagccga tggccgagct gtggatgggc     2220

gcacatccga aaagcagttc acgagtgcag aatgccgccg gagatatcgt ttcactgcgt     2280

gatgtgattg agagtgataa atcgactctg ctcggagagg ccgttgccaa acgctttggc     2340

gaactgcctt tcctgttcaa agtattatgc gcagcacagc cactctccat tcaggttcat     2400

ccaaacaaac acaattctga aatcggtttt gccaaagaaa atgccgcagg tatcccgatg     2460

gatgccgccg agcgtaacta taaagatcct aaccacaagc cggagctggt ttttgcgctg     2520

acgcctttcc ttgcgatgaa cgcgtttcgt gaattttccg agattgtctc cctactccag     2580

ccggtcgcag gtgcacatcc ggcgattgct cactttttac aacagcctga tgccgaacgt     2640

ttaagcgaac tgttcgccag cctgttgaat atgcagggtg aagaaaaatc ccgcgcgctg     2700

gcgattttaa aatcggccct cgatagccag cagggtgaac cgtggcaaac gattcgttta     2760

atttctgaat tttacccgga agacagcggt ctgttctccc cgctattgct gaatgtggtg     2820

aaattgaacc ctggcgaagc gatgttcctg ttcgctgaaa caccgcacgc ttacctgcaa     2880

ggcgtggcgc tggaagtgat ggcaaactcc gataacgtgc tgcgtgcggg tctgacgcct     2940

aaatacattg atattccgga actggttgcc aatgtgaaat tcgaagccaa accggctaac     3000

cagttgttga cccagccggt gaaacaaggt gcagaactgg acttcccgat tccagtggat     3060

gattttgcct tctcgctgca tgaccttagt gataaagaaa ccaccattag ccagcagagt     3120

gccgccattt tgttctgcgt cgaaggcgat gcaacgttgt ggaaaggttc tcagcagtta     3180

cagcttaaac cgggtgaatc agcgtttatt gccgccaacg aatcaccggt gactgtcaaa     3240

ggccacggcc gtttagcgcg tgtttacaac aagctgtaag agcttactga aaaaattaac     3300

atctcttgct aagctgggag ctctagatcc ccgaatttcc ccgatcgttc aaacatttgg     3360

caataaagtt tcttaagatt gaatcctgtt gccggtcttg cgatgattat catataattt     3420

ctgttgaatt acgttaagca tgtaataatt aacatgtaat gcatgacgtt atttatgaga     3480

tgggttttta tgattagagt cccgcaatta tacatttaat acgcgataga aaacaaaata     3540

tagcgcgcaa actaggataa attatcgcgc gcggtgtcat ctatgttact agatcgggaa     3600

ttggcgagct cgaattaatt cagtacatta aaaacgtccg caatgtgtta ttaagttgtc     3660

taagcgtcaa tttgtttaca ccacaatata tcctgccacc agccagccaa cagctccccg     3720

accggcagct cggcacaaaa tcaccactcg atacaggcag cccatcagtc cgggacggcg     3780

tcagcgggag agccgttgta aggcggcaga ctttgctcat gttaccgatg ctattcggaa     3840

gaacggcaac taagctgccg ggtttgaaac acggatgatc tcgcggaggg tagcatgttg     3900

attgtaacga tgacagagcg ttgctgcctg tgatcaaata tcatctccct cgcagagatc     3960

cgaattatca gccttcttat tcatttctcg cttaaccgtg acaggctgtc gatcttgaga     4020

actatgccga cataatagga aatcgctgga taaagccgct gaggaagctg agtggcgcta     4080

tttctttaga agtgaacgtt gacgatcgtc gaccgtaccc cgatgaatta attcggacgt     4140

acgttctgaa cacagctgga tacttacttg ggcgattgtc atacatgaca tcaacaatgt     4200

acccgtttgt gtaaccgtct cttggaggtt cgtatgacac tagtggttcc cctcagcttg     4260

cgactagatg ttgaggccta acattttatt agagagcagg ctagttgctt agatacatga     4320

tcttcaggcc gttatctgtc agggcaagcg aaaattggcc atttatgacg accaatgccc     4380

cgcagaagct cccatctttg ccgccataga cgccgcgccc cccttttggg gtgtagaaca     4440

tccttttgcc agatgtggaa aagaagttcg ttgtcccatt gttggcaatg acgtagtagc     4500

cggcgaaagt gcgagaccca tttgcgctat atataagcct acgatttccg ttgcgactat     4560

tgtcgtaatt ggatgaacta ttatcgtagt tgctctcaga gttgtcgtaa tttgatggac     4620

tattgtcgta attgcttatg gagttgtcgt agttgcttgg agaaatgtcg tagttggatg     4680

gggagtagtc atagggaaga cgagcttcat ccactaaaac aattggcagg tcagcaagtg     4740

cctgccccga tgccatcgca agtacgaggc ttagaaccac cttcaacaga tcgcgcatag     4800

tcttccccag ctctctaacg cttgagttaa gccgcgccgc gaagcggcgt cggcttgaac     4860

gaattgttag acattatttg ccgactacct tggtgatctc gcctttcacg tagtgaacaa     4920

attcttccaa ctgatctgcg cgcgaggcca agcgatcttc ttgtccaaga taagcctgcc     4980

tagcttcaag tatgacgggc tgatactggg ccggcaggcg ctccattgcc cagtcggcag     5040

cgacatcctt cggcgcgatt ttgccggtta ctgcgctgta ccaaatgcgg gacaacgtaa     5100

gcactacatt tcgctcatcg ccagcccagt cgggcggcga gttccatagc gttaaggttt     5160

catttagcgc ctcaaataga tcctgttcag gaaccggatc aaagagttcc tccgccgctg     5220

gacctaccaa ggcaacgcta tgttctcttg cttttgtcag caagatagcc agatcaatgt     5280

cgatcgtggc tggctcgaag atacctgcaa gaatgtcatt gcgctgccat tctccaaatt     5340

gcagttcgcg cttagctgga taacgccacg gaatgatgtc gtcgtgcaca acaatggtga     5400

cttctacagc gcggagaatc tcgctctctc caggggaagc cgaagtttcc aaaaggtcgt     5460

tgatcaaagc tcgccgcgtt gtttcatcaa gccttacggt caccgtaacc agcaaatcaa     5520

tatcactgtg tggcttcagg ccgccatcca ctgcggagcc gtacaaatgt acggccagca     5580

acgtcggttc gagatggcgc tcgatgacgc caactacctc tgatagttga gtcgatactt     5640

cggcgatcac cgcttccctc atgatgttta actcctgaat taagccgcgc cgcgaagcgg     5700

tgtcggcttg aatgaattgt taggcgtcat cctgtgctcc cgagaaccag taccagtaca     5760

tcgctgtttc gttcgagact tgaggtctag ttttatacgt gaacaggtca atgccgccga     5820

gagtaaagcc acattttgcg tacaaattgc aggcaggtac attgttcgtt tgtgtctcta     5880

atcgtatgcc aaggagctgt ctgcttagtg cccacttttt cgcaaattcg atgagactgt     5940

gcgcgactcc tttgcctcgg tgcgtgtgcg acacaacaat gtgttcgata gaggctagat     6000

cgttccatgt tgagttgagt tcaatcttcc cgacaagctc ttggtcgatg aatgcgccat     6060

agcaagcaga gtcttcatca gagtcatcat ccgagatgta atccttccgg taggggctca     6120

cacttctggt agatagttca aagccttggt cggataggtg cacatcgaac acttcacgaa     6180

caatgaaatg gttctcagca tccaatgttt ccgccacctg ctcagggatc accgaaatct     6240

tcatatgacg cctaacgcct ggcacagcgg atcgcaaacc tggcgcggct tttggcacaa     6300

aaggcgtgac aggtttgcga atccgttgct gccacttgtt aacccttttg ccagatttgg     6360

taactataat ttatgttaga ggcgaagtct tgggtaaaaa ctggcctaaa attgctgggg     6420

atttcaggaa agtaaacatc accttccggc tcgatgtcta ttgtagatat atgtagtgta     6480

tctacttgat cgggggatct gctgcctcgc gcgtttcggt gatgacggtg aaaacctctg     6540

acacatgcag ctcccggaga cggtcacagc ttgtctgtaa gcggatgccg ggagcagaca     6600

agcccgtcag ggcgcgtcag cgggtgttgg cgggtgtcgg ggcgcagcca tgacccagtc     6660

acgtagcgat agcggagtgt atactggctt aactatgcgg catcagagca gattgtactg     6720

agagtgcacc atatgcggtg tgaaataccg cacagatgcg taaggagaaa ataccgcatc     6780

aggcgctctt ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg gctgcggcga     6840

gcggtatcag ctcactcaaa ggcggtaata cggttatcca cagaatcagg ggataacgca     6900

ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg     6960

ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt     7020

cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc tggaagctcc     7080

ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct     7140

tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atctcagttc ggtgtaggtc     7200

gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta     7260

tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc actggcagca     7320

gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga gttcttgaag     7380

tggtggccta actacggcta cactagaagg acagtatttg gtatctgcgc tctgctgaag     7440

ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac caccgctggt     7500

agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg atctcaagaa     7560

gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc acgttaaggg     7620

attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga     7680

agttttaaat caatctaaag tatatatgag taaacttggt ctgacagtta ccaatgctta     7740

atcagtgagg cacctatctc agcgatctgt ctatttcgtt catccatagt tgcctgactc     7800

cccgtcgtgt agataactac gatacgggag ggcttaccat ctggccccag tgctgcaatg     7860

ataccgcgag acccacgctc accggctcca gatttatcag caataaacca gccagccgga     7920

agggccgagc gcagaagtgg tcctgcaact ttatccgcct ccatccagtc tattaattgt     7980

tgccgggaag ctagagtaag tagttcgcca gttaatagtt tgcgcaacgt tgttgccatt     8040

gctgcagggg gggggggggg ggggttccat tgttcattcc acggacaaaa acagagaaag     8100

gaaacgacag aggccaaaaa gctcgctttc agcacctgtc gtttcctttc ttttcagagg     8160

gtattttaaa taaaaacatt aagttatgac gaagaagaac ggaaacgcct taaaccggaa     8220

aattttcata aatagcgaaa acccgcgagg tcgccgcccc gtaacctgtc ggatcaccgg     8280

aaaggacccg taaagtgata atgattatca tctacatatc acaacgtgcg tggaggccat     8340

caaaccacgt caaataatca attatgacgc aggtatcgta ttaattgatc tgcatcaact     8400

taacgtaaaa acaacttcag acaatacaaa tcagcgacac tgaatacggg gcaacctcat     8460

gtcccccccc ccccccccct gcaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt     8520

cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa     8580

aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat     8640

cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct     8700

tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga     8760

gttgctcttg cccggcgtca acacgggata ataccgcgcc acatagcaga actttaaaag     8820

tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga     8880

gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca     8940

ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg     9000

cgacacggaa atgttgaata ctcatactct tcctttttca atattattga agcatttatc     9060

agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag     9120

gggttccgcg cacatttccc cgaaaagtgc cacctgacgt ctaagaaacc attattatca     9180

tgacattaac ctataaaaat aggcgtatca cgaggccctt tcgtcttcaa gaattggtcg     9240

acgatcttgc tgcgttcgga tattttcgtg gagttcccgc cacagacccg gattgaaggc     9300

gagatccagc aactcgcgcc agatcatcct gtgacggaac tttggcgcgt gatgactggc     9360

caggacgtcg gccgaaagag cgacaagcag atcacgcttt tcgacagcgt cggatttgcg     9420

atcgaggatt tttcggcgct gcgctacgtc cgcgaccgcg ttgagggatc aagccacagc     9480

agcccactcg accttctagc cgacccagac gagccaaggg atctttttgg aatgctgctc     9540

cgtcgtcagg ctttccgacg tttgggtggt tgaacagaag tcattatcgc acggaatgcc     9600

aagcactccc gaggggaacc ctgtggttgg catgcacata caaatggacg aacggataaa     9660

ccttttcacg cccttttaaa tatccgatta ttctaataaa cgctcttttc tcttaggttt     9720

acccgccaat atatcctgtc aaacactgat agtttaaact gaaggcggga aacgacaacc     9780

tgatcatgag cggagaatta agggagtcac gttatgaccc ccgccgatga cgcgggacaa     9840

gccgttttac gtttggaact gacagaaccg caacgttgaa ggagccactc agcttaatta     9900

agtctaactc gagttactgg tacgtaccaa atccatggaa tcaaggtacc gtcgactcta     9960

gtaacggccg ccagtgtgct ggaattaatt cggcttgtcg accacccaac cccatatcga    10020

cagaggatgt gaagaacagg taaatcacgc agaagaaccc atctctgata gcagctatcg    10080

attagaacaa cgaatccata ttgggtccgt gggaaatact tactgcacag gaagggggcg    10140

atctgacgag gccccgccac cggcctcgac ccgaggccga ggccgacgaa gcgccggcga    10200

gtacggcgcc gcggcggcct ctgcccgtgc cctctgcgcg tgggagggag aggccgcggt    10260

ggtgggggcg cgcgcgcgcg cgcgcgcagc tggtgcggcg gcgcgggggt cagccgccga    10320

gccggcggcg acggaggagc agggcggcgt ggacgcgaac ttccgatcgg ttggtcagag    10380

tgcgcgagtt gggcttagcc aattaggtct caacaatcta ttgggccgta aaattcatgg    10440

gccctggttt gtctaggccc aatatcccgt tcatttcagc ccacaaatat ttccccagag    10500

gattattaag gcccacacgc agcttatagc agatcaagta cgatgtttcc tgatcgttgg    10560

atcggaaacg tacggtcttg atcaggcatg ccgacttcgt caaagagagg cggcatgacc    10620

tgacgcggag ttggttccgg gcaccgtctg gatggtcgta ccgggaccgg acacgtgtcg    10680

cgcctccaac tacatggaca cgtgtggtgc tgccattggg ccgtacgcgt ggcggtgacc    10740

gcaccggatg ctgcctcgca ccgccttgcc cacgctttat atagagaggt tttctctcca    10800

ttaatcgcat agcgagtcga atcgaccgaa ggggaggggg agcgaagctt tgcgttctct    10860

aatcgcctcg tcaaggtaac taatcaatca cctcgtccta atcctcgaat ctctcgtggt    10920

gcccgtctaa tctcgcgatt ttgatgctcg tggtggaaag cgtaggagga tcccgtgcga    10980

gttagtctca atctctcagg gtttcgtgcg attttagggt gatccacctc ttaatcgagt    11040

tacggtttcg tgcgatttta gggtaatcct cttaatctct cattgattta gggtttcgtg    11100

agaatcgagg tagggatctg tgttatttat atcgatctaa tagatggatt ggttttgaga    11160

ttgttctgtc agatggggat tgtttcgata tattacccta atgatgtgtc agatggggat    11220

tgtttcgata tattacccta atgatgtgtc agatggggat tgtttcgata tattacccta    11280

atgatggata ataagagtag ttcacagtta tgttttgatc ctgccacata gtttgagttt    11340

tgtgatcaga tttagtttta cttatttgtg cttagttcgg atgggattgt tctgatattg    11400

ttccaataga tgaatagctc gttaggttaa aatctttagg ttgagttagg cgacacatag    11460

tttatttcct ctggatttgg attggaattg tgttcttagt ttttttcccc tggatttgga    11520

ttggaattgt gtggagctgg gttagagaat tacatctgta tcgtgtacac ctacttgaac    11580

tgtagagctt gggttctaag gtcaatttaa tctgtattgt atctggctct ttgcctagtt    11640

gaactgtagt gctgatgttg tactgtgttt ttttacccgt tttatttgct ttactcgtgc    11700

aaatcaaatc tgtcagatgc tagaactagg tggctttatt ctgtgttctt acatagatct    11760

gttgtcctgt agttacttat gtcagttttg ttattatctg aagatatttt tggttgttgc    11820

ttgttgatgt ggtgtgagct gtgagcagcg ctcttatgat taatgatgct gtccaattgt    11880

agtgtagtat gatgtgattg atatgttcat ctattttgag ctgacagtac cgatatcgta    11940

ggatctggtg ccaacttatt ctccagctgc ttttttttac ctatgttaat tccaatcctt    12000

tcttgcctct tccagatcca gataatgcaa acaagcatta ctctgacatc caacgcatcc    12060

ggtacgtttg acggttacta ttacgaactc tggaaggata ctggcaatac aacaatgacg    12120

gtctacactc aaggtcgctt ttcctgccag tggtcgaaca tcaataacgc gttgtttagg    12180

accgggaaga aatacaacca gaattggcag tctcttggca caatccggat cacgtactct    12240

gcgacttaca acccaaacgg gaactcctac ttgtgtatct atggctggtc taccaaccca    12300

ttggtcgagt tctacatcgt tgagtcctgg gggaactgga gaccgcctgg tgcctgcctg    12360

gccgagggct cgctcgtctt ggacgcggct accgggcaga gggtccctat cgaaaaggtg    12420

cgtccgggga tggaagtttt ctccttggga cctgattaca gactgtatcg ggtgcccgtt    12480

ttggaggtcc ttgagagcgg ggttagggaa gttgtgcgcc tcagaactcg gtcagggaga    12540

acgctggtgt tgacaccaga tcacccgctt ttgacccccg aaggttggaa acctctttgt    12600

gacctcccgc ttggaactcc aattgcagtc cccgcagaac tgcctgtggc gggccacttg    12660

gccccacctg aagaacgtgt tacgctcctg gctcttctgt tgggggatgg gaacacaaag    12720

ctgtcgggtc ggagaggtac acgtcctaat gccttcttct acagcaaaaa ccccgaattg    12780

ctcgcggctt atcgccggtg tgcagaagcc ttgggtgcaa aggtgaaagc atacgtccac    12840

ccgactacgg gggtggttac actcgcaacc ctcgctccac gtcctggagc tcaagatcct    12900

gtcaaacgcc tcgttgtcga ggcgggaatg gttgctaaag ccgaagagaa gagggtcccg    12960

gaggaggtgt ttcgttaccg gcgtgaggcg ttggcccttt tcttgggccg tttgttctcg    13020

acagacggct ctgttgaaaa gaagaggatc tcttattcaa gtgccagttt gggactggcc    13080

caggatgtcg cacatctctt gctgcgcctt ggaattacat ctcaactccg ttcgagaggg    13140

ccacgggctc acgaggttct tatatcgggc cgcgaggata ttttgcggtt tgctgaactt    13200

atcggaccct acctcttggg ggccaagagg gagagacttg cagcgctgga agctgaggcc    13260

cgcaggcgtt tgcctggaca gggatggcac ttgcggcttg ttcttcctgc cgtggcgtac    13320

agagtgggcg aggcggaaag gcgctcggga ttttcgtgga gtgaagccgg tcggcgcgtc    13380

gcagttgcgg gatcgtgttt gtcatctgga ctcaacctca aattgcccag acgctacctt    13440

tctcggcacc ggttgtcgct gctcggtgag gcttttgccg accctgggct ggaagcgctc    13500

gcggaaggcc aagtgctctg ggaccctatt gttgctgtcg aaccggccgg taaggcgaga    13560

acattcgact tgcgcgttcc accctttgca aacttcgtga gcgaggacct ggtggtgcat    13620

aacaccgtcc ccctgggcca agtgacaatc gatggcggga cctacgacat ctataggacg    13680

acacgcgtca accagccttc cattgtgggg acagccacgt tcgatcagta ctggagcgtg    13740

cgcacctcta agcggacttc aggaacagtg accgtgaccg atcacttccg cgcctgggcg    13800

aaccggggcc tgaacctcgg cacaatagac caaattacat tgtgcgtgga gggttaccaa    13860

agctctggat cagccaacat cacccagaac accttctctc agggctcttc ttccggcagt    13920

tcgggtggct catccggctc cacaacgact actcgcatcg agtgtgagaa catgtccttg    13980

tccggaccct acgttagcag gatcaccaat ccctttaatg gtattgcgct gtacgccaac    14040

ggagacacag cccgcgctac cgttaacttc cccgcaagtc gcaactacaa tttccgcctg    14100

cggggttgcg gcaacaacaa taatcttgcc cgtgtggacc tgaggatcga cggacggacc    14160

gtcgggacct tttattacca gggcacatac ccctgggagg ccccaattga caatgtttat    14220

gtcagtgcgg ggagtcatac agtcgaaatc actgttactg cggataacgg cacatgggac    14280

gtgtatgccg actacctggt gatacagtga cctaggtccc cgaatttccc cgatcgttca    14340

aacatttggc aataaagttt cttaagattg aatcctgttg ccggtcttgc gatgattatc    14400

atataatttc tgttgaatta cgttaagcat gtaataatta acatgtaatg catgacgtta    14460

tttatgagat gggtttttat gattagagtc ccgcaattat acatttaata cgcgatagaa    14520

aacaaaatat agcgcgcaaa ctaggataaa ttatcgcgcg cggtgtcatc tatgttacta    14580

gatcgggaat tgg                                                       14593


<210>  73
<211>  14665
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, pAG2228

<400>  73
aattcatact aaagcttgca tgcctgcagg tcgactctag taacggccgc cagtgtgctg       60

gaattaattc ggcttgtcga ccacccaacc ccatatcgac agaggatgtg aagaacaggt      120

aaatcacgca gaagaaccca tctctgatag cagctatcga ttagaacaac gaatccatat      180

tgggtccgtg ggaaatactt actgcacagg aagggggcga tctgacgagg ccccgccacc      240

ggcctcgacc cgaggccgag gccgacgaag cgccggcgag tacggcgccg cggcggcctc      300

tgcccgtgcc ctctgcgcgt gggagggaga ggccgcggtg gtgggggcgc gcgcgcgcgc      360

gcgcgcagct ggtgcggcgg cgcgggggtc agccgccgag ccggcggcga cggaggagca      420

gggcggcgtg gacgcgaact tccgatcggt tggtcagagt gcgcgagttg ggcttagcca      480

attaggtctc aacaatctat tgggccgtaa aattcatggg ccctggtttg tctaggccca      540

atatcccgtt catttcagcc cacaaatatt tccccagagg attattaagg cccacacgca      600

gcttatagca gatcaagtac gatgtttcct gatcgttgga tcggaaacgt acggtcttga      660

tcaggcatgc cgacttcgtc aaagagaggc ggcatgacct gacgcggagt tggttccggg      720

caccgtctgg atggtcgtac cgggaccgga cacgtgtcgc gcctccaact acatggacac      780

gtgtggtgct gccattgggc cgtacgcgtg gcggtgaccg caccggatgc tgcctcgcac      840

cgccttgccc acgctttata tagagaggtt ttctctccat taatcgcata gcgagtcgaa      900

tcgaccgaag gggaggggga gcgaagcttt gcgttctcta atcgcctcgt caaggtaact      960

aatcaatcac ctcgtcctaa tcctcgaatc tctcgtggtg cccgtctaat ctcgcgattt     1020

tgatgctcgt ggtggaaagc gtaggaggat cccgtgcgag ttagtctcaa tctctcaggg     1080

tttcgtgcga ttttagggtg atccacctct taatcgagtt acggtttcgt gcgattttag     1140

ggtaatcctc ttaatctctc attgatttag ggtttcgtga gaatcgaggt agggatctgt     1200

gttatttata tcgatctaat agatggattg gttttgagat tgttctgtca gatggggatt     1260

gtttcgatat attaccctaa tgatgtgtca gatggggatt gtttcgatat attaccctaa     1320

tgatgtgtca gatggggatt gtttcgatat attaccctaa tgatggataa taagagtagt     1380

tcacagttat gttttgatcc tgccacatag tttgagtttt gtgatcagat ttagttttac     1440

ttatttgtgc ttagttcgga tgggattgtt ctgatattgt tccaatagat gaatagctcg     1500

ttaggttaaa atctttaggt tgagttaggc gacacatagt ttatttcctc tggatttgga     1560

ttggaattgt gttcttagtt tttttcccct ggatttggat tggaattgtg tggagctggg     1620

ttagagaatt acatctgtat cgtgtacacc tacttgaact gtagagcttg ggttctaagg     1680

tcaatttaat ctgtattgta tctggctctt tgcctagttg aactgtagtg ctgatgttgt     1740

actgtgtttt tttacccgtt ttatttgctt tactcgtgca aatcaaatct gtcagatgct     1800

agaactaggt ggctttattc tgtgttctta catagatctg ttgtcctgta gttacttatg     1860

tcagttttgt tattatctga agatattttt ggttgttgct tgttgatgtg gtgtgagctg     1920

tgagcagcgc tcttatgatt aatgatgctg tccaattgta gtgtagtatg atgtgattga     1980

tatgttcatc tattttgagc tgacagtacc gatatcgtag gatctggtgc caacttattc     2040

tccagctgct tttttttacc tatgttaatt ccaatccttt cttgcctctt ccagatccag     2100

ataatgcaga aactcattaa ctcagtgcaa aactatgcct ggggcagcaa aacggcgttg     2160

actgaacttt atggtatgga aaatccgtcc agccagccga tggccgagct gtggatgggc     2220

gcacatccga aaagcagttc acgagtgcag aatgccgccg gagatatcgt ttcactgcgt     2280

gatgtgattg agagtgataa atcgactctg ctcggagagg ccgttgccaa acgctttggc     2340

gaactgcctt tcctgttcaa agtattatgc gcagcacagc cactctccat tcaggttcat     2400

ccaaacaaac acaattctga aatcggtttt gccaaagaaa atgccgcagg tatcccgatg     2460

gatgccgccg agcgtaacta taaagatcct aaccacaagc cggagctggt ttttgcgctg     2520

acgcctttcc ttgcgatgaa cgcgtttcgt gaattttccg agattgtctc cctactccag     2580

ccggtcgcag gtgcacatcc ggcgattgct cactttttac aacagcctga tgccgaacgt     2640

ttaagcgaac tgttcgccag cctgttgaat atgcagggtg aagaaaaatc ccgcgcgctg     2700

gcgattttaa aatcggccct cgatagccag cagggtgaac cgtggcaaac gattcgttta     2760

atttctgaat tttacccgga agacagcggt ctgttctccc cgctattgct gaatgtggtg     2820

aaattgaacc ctggcgaagc gatgttcctg ttcgctgaaa caccgcacgc ttacctgcaa     2880

ggcgtggcgc tggaagtgat ggcaaactcc gataacgtgc tgcgtgcggg tctgacgcct     2940

aaatacattg atattccgga actggttgcc aatgtgaaat tcgaagccaa accggctaac     3000

cagttgttga cccagccggt gaaacaaggt gcagaactgg acttcccgat tccagtggat     3060

gattttgcct tctcgctgca tgaccttagt gataaagaaa ccaccattag ccagcagagt     3120

gccgccattt tgttctgcgt cgaaggcgat gcaacgttgt ggaaaggttc tcagcagtta     3180

cagcttaaac cgggtgaatc agcgtttatt gccgccaacg aatcaccggt gactgtcaaa     3240

ggccacggcc gtttagcgcg tgtttacaac aagctgtaag agcttactga aaaaattaac     3300

atctcttgct aagctgggag ctctagatcc ccgaatttcc ccgatcgttc aaacatttgg     3360

caataaagtt tcttaagatt gaatcctgtt gccggtcttg cgatgattat catataattt     3420

ctgttgaatt acgttaagca tgtaataatt aacatgtaat gcatgacgtt atttatgaga     3480

tgggttttta tgattagagt cccgcaatta tacatttaat acgcgataga aaacaaaata     3540

tagcgcgcaa actaggataa attatcgcgc gcggtgtcat ctatgttact agatcgggaa     3600

ttggcgagct cgaattaatt cagtacatta aaaacgtccg caatgtgtta ttaagttgtc     3660

taagcgtcaa tttgtttaca ccacaatata tcctgccacc agccagccaa cagctccccg     3720

accggcagct cggcacaaaa tcaccactcg atacaggcag cccatcagtc cgggacggcg     3780

tcagcgggag agccgttgta aggcggcaga ctttgctcat gttaccgatg ctattcggaa     3840

gaacggcaac taagctgccg ggtttgaaac acggatgatc tcgcggaggg tagcatgttg     3900

attgtaacga tgacagagcg ttgctgcctg tgatcaaata tcatctccct cgcagagatc     3960

cgaattatca gccttcttat tcatttctcg cttaaccgtg acaggctgtc gatcttgaga     4020

actatgccga cataatagga aatcgctgga taaagccgct gaggaagctg agtggcgcta     4080

tttctttaga agtgaacgtt gacgatcgtc gaccgtaccc cgatgaatta attcggacgt     4140

acgttctgaa cacagctgga tacttacttg ggcgattgtc atacatgaca tcaacaatgt     4200

acccgtttgt gtaaccgtct cttggaggtt cgtatgacac tagtggttcc cctcagcttg     4260

cgactagatg ttgaggccta acattttatt agagagcagg ctagttgctt agatacatga     4320

tcttcaggcc gttatctgtc agggcaagcg aaaattggcc atttatgacg accaatgccc     4380

cgcagaagct cccatctttg ccgccataga cgccgcgccc cccttttggg gtgtagaaca     4440

tccttttgcc agatgtggaa aagaagttcg ttgtcccatt gttggcaatg acgtagtagc     4500

cggcgaaagt gcgagaccca tttgcgctat atataagcct acgatttccg ttgcgactat     4560

tgtcgtaatt ggatgaacta ttatcgtagt tgctctcaga gttgtcgtaa tttgatggac     4620

tattgtcgta attgcttatg gagttgtcgt agttgcttgg agaaatgtcg tagttggatg     4680

gggagtagtc atagggaaga cgagcttcat ccactaaaac aattggcagg tcagcaagtg     4740

cctgccccga tgccatcgca agtacgaggc ttagaaccac cttcaacaga tcgcgcatag     4800

tcttccccag ctctctaacg cttgagttaa gccgcgccgc gaagcggcgt cggcttgaac     4860

gaattgttag acattatttg ccgactacct tggtgatctc gcctttcacg tagtgaacaa     4920

attcttccaa ctgatctgcg cgcgaggcca agcgatcttc ttgtccaaga taagcctgcc     4980

tagcttcaag tatgacgggc tgatactggg ccggcaggcg ctccattgcc cagtcggcag     5040

cgacatcctt cggcgcgatt ttgccggtta ctgcgctgta ccaaatgcgg gacaacgtaa     5100

gcactacatt tcgctcatcg ccagcccagt cgggcggcga gttccatagc gttaaggttt     5160

catttagcgc ctcaaataga tcctgttcag gaaccggatc aaagagttcc tccgccgctg     5220

gacctaccaa ggcaacgcta tgttctcttg cttttgtcag caagatagcc agatcaatgt     5280

cgatcgtggc tggctcgaag atacctgcaa gaatgtcatt gcgctgccat tctccaaatt     5340

gcagttcgcg cttagctgga taacgccacg gaatgatgtc gtcgtgcaca acaatggtga     5400

cttctacagc gcggagaatc tcgctctctc caggggaagc cgaagtttcc aaaaggtcgt     5460

tgatcaaagc tcgccgcgtt gtttcatcaa gccttacggt caccgtaacc agcaaatcaa     5520

tatcactgtg tggcttcagg ccgccatcca ctgcggagcc gtacaaatgt acggccagca     5580

acgtcggttc gagatggcgc tcgatgacgc caactacctc tgatagttga gtcgatactt     5640

cggcgatcac cgcttccctc atgatgttta actcctgaat taagccgcgc cgcgaagcgg     5700

tgtcggcttg aatgaattgt taggcgtcat cctgtgctcc cgagaaccag taccagtaca     5760

tcgctgtttc gttcgagact tgaggtctag ttttatacgt gaacaggtca atgccgccga     5820

gagtaaagcc acattttgcg tacaaattgc aggcaggtac attgttcgtt tgtgtctcta     5880

atcgtatgcc aaggagctgt ctgcttagtg cccacttttt cgcaaattcg atgagactgt     5940

gcgcgactcc tttgcctcgg tgcgtgtgcg acacaacaat gtgttcgata gaggctagat     6000

cgttccatgt tgagttgagt tcaatcttcc cgacaagctc ttggtcgatg aatgcgccat     6060

agcaagcaga gtcttcatca gagtcatcat ccgagatgta atccttccgg taggggctca     6120

cacttctggt agatagttca aagccttggt cggataggtg cacatcgaac acttcacgaa     6180

caatgaaatg gttctcagca tccaatgttt ccgccacctg ctcagggatc accgaaatct     6240

tcatatgacg cctaacgcct ggcacagcgg atcgcaaacc tggcgcggct tttggcacaa     6300

aaggcgtgac aggtttgcga atccgttgct gccacttgtt aacccttttg ccagatttgg     6360

taactataat ttatgttaga ggcgaagtct tgggtaaaaa ctggcctaaa attgctgggg     6420

atttcaggaa agtaaacatc accttccggc tcgatgtcta ttgtagatat atgtagtgta     6480

tctacttgat cgggggatct gctgcctcgc gcgtttcggt gatgacggtg aaaacctctg     6540

acacatgcag ctcccggaga cggtcacagc ttgtctgtaa gcggatgccg ggagcagaca     6600

agcccgtcag ggcgcgtcag cgggtgttgg cgggtgtcgg ggcgcagcca tgacccagtc     6660

acgtagcgat agcggagtgt atactggctt aactatgcgg catcagagca gattgtactg     6720

agagtgcacc atatgcggtg tgaaataccg cacagatgcg taaggagaaa ataccgcatc     6780

aggcgctctt ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg gctgcggcga     6840

gcggtatcag ctcactcaaa ggcggtaata cggttatcca cagaatcagg ggataacgca     6900

ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg     6960

ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt     7020

cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc tggaagctcc     7080

ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct     7140

tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atctcagttc ggtgtaggtc     7200

gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta     7260

tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc actggcagca     7320

gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga gttcttgaag     7380

tggtggccta actacggcta cactagaagg acagtatttg gtatctgcgc tctgctgaag     7440

ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac caccgctggt     7500

agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg atctcaagaa     7560

gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc acgttaaggg     7620

attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga     7680

agttttaaat caatctaaag tatatatgag taaacttggt ctgacagtta ccaatgctta     7740

atcagtgagg cacctatctc agcgatctgt ctatttcgtt catccatagt tgcctgactc     7800

cccgtcgtgt agataactac gatacgggag ggcttaccat ctggccccag tgctgcaatg     7860

ataccgcgag acccacgctc accggctcca gatttatcag caataaacca gccagccgga     7920

agggccgagc gcagaagtgg tcctgcaact ttatccgcct ccatccagtc tattaattgt     7980

tgccgggaag ctagagtaag tagttcgcca gttaatagtt tgcgcaacgt tgttgccatt     8040

gctgcagggg gggggggggg ggggttccat tgttcattcc acggacaaaa acagagaaag     8100

gaaacgacag aggccaaaaa gctcgctttc agcacctgtc gtttcctttc ttttcagagg     8160

gtattttaaa taaaaacatt aagttatgac gaagaagaac ggaaacgcct taaaccggaa     8220

aattttcata aatagcgaaa acccgcgagg tcgccgcccc gtaacctgtc ggatcaccgg     8280

aaaggacccg taaagtgata atgattatca tctacatatc acaacgtgcg tggaggccat     8340

caaaccacgt caaataatca attatgacgc aggtatcgta ttaattgatc tgcatcaact     8400

taacgtaaaa acaacttcag acaatacaaa tcagcgacac tgaatacggg gcaacctcat     8460

gtcccccccc ccccccccct gcaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt     8520

cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa     8580

aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat     8640

cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct     8700

tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga     8760

gttgctcttg cccggcgtca acacgggata ataccgcgcc acatagcaga actttaaaag     8820

tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga     8880

gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca     8940

ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg     9000

cgacacggaa atgttgaata ctcatactct tcctttttca atattattga agcatttatc     9060

agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag     9120

gggttccgcg cacatttccc cgaaaagtgc cacctgacgt ctaagaaacc attattatca     9180

tgacattaac ctataaaaat aggcgtatca cgaggccctt tcgtcttcaa gaattggtcg     9240

acgatcttgc tgcgttcgga tattttcgtg gagttcccgc cacagacccg gattgaaggc     9300

gagatccagc aactcgcgcc agatcatcct gtgacggaac tttggcgcgt gatgactggc     9360

caggacgtcg gccgaaagag cgacaagcag atcacgcttt tcgacagcgt cggatttgcg     9420

atcgaggatt tttcggcgct gcgctacgtc cgcgaccgcg ttgagggatc aagccacagc     9480

agcccactcg accttctagc cgacccagac gagccaaggg atctttttgg aatgctgctc     9540

cgtcgtcagg ctttccgacg tttgggtggt tgaacagaag tcattatcgc acggaatgcc     9600

aagcactccc gaggggaacc ctgtggttgg catgcacata caaatggacg aacggataaa     9660

ccttttcacg cccttttaaa tatccgatta ttctaataaa cgctcttttc tcttaggttt     9720

acccgccaat atatcctgtc aaacactgat agtttaaact gaaggcggga aacgacaacc     9780

tgatcatgag cggagaatta agggagtcac gttatgaccc ccgccgatga cgcgggacaa     9840

gccgttttac gtttggaact gacagaaccg caacgttgaa ggagccactc agcttaatta     9900

agtctaactc gagttactgg tacgtaccaa atccatggaa tcaaggtacc gtcgactcta     9960

gtaacggccg ccagtgtgct ggaattaatt cggcttgtcg accacccaac cccatatcga    10020

cagaggatgt gaagaacagg taaatcacgc agaagaaccc atctctgata gcagctatcg    10080

attagaacaa cgaatccata ttgggtccgt gggaaatact tactgcacag gaagggggcg    10140

atctgacgag gccccgccac cggcctcgac ccgaggccga ggccgacgaa gcgccggcga    10200

gtacggcgcc gcggcggcct ctgcccgtgc cctctgcgcg tgggagggag aggccgcggt    10260

ggtgggggcg cgcgcgcgcg cgcgcgcagc tggtgcggcg gcgcgggggt cagccgccga    10320

gccggcggcg acggaggagc agggcggcgt ggacgcgaac ttccgatcgg ttggtcagag    10380

tgcgcgagtt gggcttagcc aattaggtct caacaatcta ttgggccgta aaattcatgg    10440

gccctggttt gtctaggccc aatatcccgt tcatttcagc ccacaaatat ttccccagag    10500

gattattaag gcccacacgc agcttatagc agatcaagta cgatgtttcc tgatcgttgg    10560

atcggaaacg tacggtcttg atcaggcatg ccgacttcgt caaagagagg cggcatgacc    10620

tgacgcggag ttggttccgg gcaccgtctg gatggtcgta ccgggaccgg acacgtgtcg    10680

cgcctccaac tacatggaca cgtgtggtgc tgccattggg ccgtacgcgt ggcggtgacc    10740

gcaccggatg ctgcctcgca ccgccttgcc cacgctttat atagagaggt tttctctcca    10800

ttaatcgcat agcgagtcga atcgaccgaa ggggaggggg agcgaagctt tgcgttctct    10860

aatcgcctcg tcaaggtaac taatcaatca cctcgtccta atcctcgaat ctctcgtggt    10920

gcccgtctaa tctcgcgatt ttgatgctcg tggtggaaag cgtaggagga tcccgtgcga    10980

gttagtctca atctctcagg gtttcgtgcg attttagggt gatccacctc ttaatcgagt    11040

tacggtttcg tgcgatttta gggtaatcct cttaatctct cattgattta gggtttcgtg    11100

agaatcgagg tagggatctg tgttatttat atcgatctaa tagatggatt ggttttgaga    11160

ttgttctgtc agatggggat tgtttcgata tattacccta atgatgtgtc agatggggat    11220

tgtttcgata tattacccta atgatgtgtc agatggggat tgtttcgata tattacccta    11280

atgatggata ataagagtag ttcacagtta tgttttgatc ctgccacata gtttgagttt    11340

tgtgatcaga tttagtttta cttatttgtg cttagttcgg atgggattgt tctgatattg    11400

ttccaataga tgaatagctc gttaggttaa aatctttagg ttgagttagg cgacacatag    11460

tttatttcct ctggatttgg attggaattg tgttcttagt ttttttcccc tggatttgga    11520

ttggaattgt gtggagctgg gttagagaat tacatctgta tcgtgtacac ctacttgaac    11580

tgtagagctt gggttctaag gtcaatttaa tctgtattgt atctggctct ttgcctagtt    11640

gaactgtagt gctgatgttg tactgtgttt ttttacccgt tttatttgct ttactcgtgc    11700

aaatcaaatc tgtcagatgc tagaactagg tggctttatt ctgtgttctt acatagatct    11760

gttgtcctgt agttacttat gtcagttttg ttattatctg aagatatttt tggttgttgc    11820

ttgttgatgt ggtgtgagct gtgagcagcg ctcttatgat taatgatgct gtccaattgt    11880

agtgtagtat gatgtgattg atatgttcat ctattttgag ctgacagtac cgatatcgta    11940

ggatctggtg ccaacttatt ctccagctgc ttttttttac ctatgttaat tccaatcctt    12000

tcttgcctct tccagatcca gataatggcg aacaaacatt tgtccctctc cctcttcctc    12060

gtcctccttg gcctgtcggc cagcttggcc tccgggcaac aaacaagcat tactctgaca    12120

tccaacgcat ccggtacgtt tgacggttac tattacgaac tctggaagga tactggcaat    12180

acaacaatga cggtctacac tcaaggtcgc ttttcctgcc agtggtcgaa catcaataac    12240

gcgttgttta ggaccgggaa gaaatacaac cagaattggc agtctcttgg cacaatccgg    12300

atcacgtact ctgcgactta caacccaaac gggaactcct acttgtgtat ctatggctgg    12360

tctaccaacc cattggtcga gttctacatc gttgagtcct gggggaactg gagaccgcct    12420

ggtgcctgcc tggccgaggg ctcgctcgtc ttggacgcgg ctaccgggca gagggtccct    12480

atcgaaaagg tgcgtccggg gatggaagtt ttctccttgg gacctgatta cagactgtat    12540

cgggtgcccg ttttggaggt ccttgagagc ggggttaggg aagttgtgcg cctcagaact    12600

cggtcaggga gaacgctggt gttgacacca gatcacccgc ttttgacccc cgaaggttgg    12660

aaacctcttt gtgacctccc gcttggaact ccaattgcag tccccgcaga actgcctgtg    12720

gcgggccact tggccccacc tgaagaacgt gttacgctcc tggctcttct gttgggggat    12780

gggaacacaa agctgtcggg tcggagaggt acacgtccta atgccttctt ctacagcaaa    12840

aaccccgaat tgctcgcggc ttatcgccgg tgtgcagaag ccttgggtgc aaaggtgaaa    12900

gcatacgtcc acccgactac gggggtggtt acactcgcaa ccctcgctcc acgtcctgga    12960

gctcaagatc ctgtcaaacg cctcgttgtc gaggcgggaa tggttgctaa agccgaagag    13020

aagagggtcc cggaggaggt gtttcgttac cggcgtgagg cgttggccct tttcttgggc    13080

cgtttgttct cgacagacgg ctctgttgaa aagaagagga tctcttattc aagtgccagt    13140

ttgggactgg cccaggatgt cgcacatctc ttgctgcgcc ttggaattac atctcaactc    13200

cgttcgagag ggccacgggc tcacgaggtt cttatatcgg gccgcgagga tattttgcgg    13260

tttgctgaac ttatcggacc ctacctcttg ggggccaaga gggagagact tgcagcgctg    13320

gaagctgagg cccgcaggcg tttgcctgga cagggatggc acttgcggct tgttcttcct    13380

gccgtggcgt acagagtggg cgaggcggaa aggcgctcgg gattttcgtg gagtgaagcc    13440

ggtcggcgcg tcgcagttgc gggatcgtgt ttgtcatctg gactcaacct caaattgccc    13500

agacgctacc tttctcggca ccggttgtcg ctgctcggtg aggcttttgc cgaccctggg    13560

ctggaagcgc tcgcggaagg ccaagtgctc tgggacccta ttgttgctgt cgaaccggcc    13620

ggtaaggcga gaacattcga cttgcgcgtt ccaccctttg caaacttcgt gagcgaggac    13680

ctggtggtgc ataacaccgt ccccctgggc caagtgacaa tcgatggcgg gacctacgac    13740

atctatagga cgacacgcgt caaccagcct tccattgtgg ggacagccac gttcgatcag    13800

tactggagcg tgcgcacctc taagcggact tcaggaacag tgaccgtgac cgatcacttc    13860

cgcgcctggg cgaaccgggg cctgaacctc ggcacaatag accaaattac attgtgcgtg    13920

gagggttacc aaagctctgg atcagccaac atcacccaga acaccttctc tcagggctct    13980

tcttccggca gttcgggtgg ctcatccggc tccacaacga ctactcgcat cgagtgtgag    14040

aacatgtcct tgtccggacc ctacgttagc aggatcacca atccctttaa tggtattgcg    14100

ctgtacgcca acggagacac agcccgcgct accgttaact tccccgcaag tcgcaactac    14160

aatttccgcc tgcggggttg cggcaacaac aataatcttg cccgtgtgga cctgaggatc    14220

gacggacgga ccgtcgggac cttttattac cagggcacat acccctggga ggccccaatt    14280

gacaatgttt atgtcagtgc ggggagtcat acagtcgaaa tcactgttac tgcggataac    14340

ggcacatggg acgtgtatgc cgactacctg gtgatacagt gacctaggtc cccgaatttc    14400

cccgatcgtt caaacatttg gcaataaagt ttcttaagat tgaatcctgt tgccggtctt    14460

gcgatgatta tcatataatt tctgttgaat tacgttaagc atgtaataat taacatgtaa    14520

tgcatgacgt tatttatgag atgggttttt atgattagag tcccgcaatt atacatttaa    14580

tacgcgatag aaaacaaaat atagcgcgca aactaggata aattatcgcg cgcggtgtca    14640

tctatgttac tagatcggga attgg                                          14665


<210>  74
<211>  14683
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic construct, pAG2229

<400>  74
aattcatact aaagcttgca tgcctgcagg tcgactctag taacggccgc cagtgtgctg       60

gaattaattc ggcttgtcga ccacccaacc ccatatcgac agaggatgtg aagaacaggt      120

aaatcacgca gaagaaccca tctctgatag cagctatcga ttagaacaac gaatccatat      180

tgggtccgtg ggaaatactt actgcacagg aagggggcga tctgacgagg ccccgccacc      240

ggcctcgacc cgaggccgag gccgacgaag cgccggcgag tacggcgccg cggcggcctc      300

tgcccgtgcc ctctgcgcgt gggagggaga ggccgcggtg gtgggggcgc gcgcgcgcgc      360

gcgcgcagct ggtgcggcgg cgcgggggtc agccgccgag ccggcggcga cggaggagca      420

gggcggcgtg gacgcgaact tccgatcggt tggtcagagt gcgcgagttg ggcttagcca      480

attaggtctc aacaatctat tgggccgtaa aattcatggg ccctggtttg tctaggccca      540

atatcccgtt catttcagcc cacaaatatt tccccagagg attattaagg cccacacgca      600

gcttatagca gatcaagtac gatgtttcct gatcgttgga tcggaaacgt acggtcttga      660

tcaggcatgc cgacttcgtc aaagagaggc ggcatgacct gacgcggagt tggttccggg      720

caccgtctgg atggtcgtac cgggaccgga cacgtgtcgc gcctccaact acatggacac      780

gtgtggtgct gccattgggc cgtacgcgtg gcggtgaccg caccggatgc tgcctcgcac      840

cgccttgccc acgctttata tagagaggtt ttctctccat taatcgcata gcgagtcgaa      900

tcgaccgaag gggaggggga gcgaagcttt gcgttctcta atcgcctcgt caaggtaact      960

aatcaatcac ctcgtcctaa tcctcgaatc tctcgtggtg cccgtctaat ctcgcgattt     1020

tgatgctcgt ggtggaaagc gtaggaggat cccgtgcgag ttagtctcaa tctctcaggg     1080

tttcgtgcga ttttagggtg atccacctct taatcgagtt acggtttcgt gcgattttag     1140

ggtaatcctc ttaatctctc attgatttag ggtttcgtga gaatcgaggt agggatctgt     1200

gttatttata tcgatctaat agatggattg gttttgagat tgttctgtca gatggggatt     1260

gtttcgatat attaccctaa tgatgtgtca gatggggatt gtttcgatat attaccctaa     1320

tgatgtgtca gatggggatt gtttcgatat attaccctaa tgatggataa taagagtagt     1380

tcacagttat gttttgatcc tgccacatag tttgagtttt gtgatcagat ttagttttac     1440

ttatttgtgc ttagttcgga tgggattgtt ctgatattgt tccaatagat gaatagctcg     1500

ttaggttaaa atctttaggt tgagttaggc gacacatagt ttatttcctc tggatttgga     1560

ttggaattgt gttcttagtt tttttcccct ggatttggat tggaattgtg tggagctggg     1620

ttagagaatt acatctgtat cgtgtacacc tacttgaact gtagagcttg ggttctaagg     1680

tcaatttaat ctgtattgta tctggctctt tgcctagttg aactgtagtg ctgatgttgt     1740

actgtgtttt tttacccgtt ttatttgctt tactcgtgca aatcaaatct gtcagatgct     1800

agaactaggt ggctttattc tgtgttctta catagatctg ttgtcctgta gttacttatg     1860

tcagttttgt tattatctga agatattttt ggttgttgct tgttgatgtg gtgtgagctg     1920

tgagcagcgc tcttatgatt aatgatgctg tccaattgta gtgtagtatg atgtgattga     1980

tatgttcatc tattttgagc tgacagtacc gatatcgtag gatctggtgc caacttattc     2040

tccagctgct tttttttacc tatgttaatt ccaatccttt cttgcctctt ccagatccag     2100

ataatgcaga aactcattaa ctcagtgcaa aactatgcct ggggcagcaa aacggcgttg     2160

actgaacttt atggtatgga aaatccgtcc agccagccga tggccgagct gtggatgggc     2220

gcacatccga aaagcagttc acgagtgcag aatgccgccg gagatatcgt ttcactgcgt     2280

gatgtgattg agagtgataa atcgactctg ctcggagagg ccgttgccaa acgctttggc     2340

gaactgcctt tcctgttcaa agtattatgc gcagcacagc cactctccat tcaggttcat     2400

ccaaacaaac acaattctga aatcggtttt gccaaagaaa atgccgcagg tatcccgatg     2460

gatgccgccg agcgtaacta taaagatcct aaccacaagc cggagctggt ttttgcgctg     2520

acgcctttcc ttgcgatgaa cgcgtttcgt gaattttccg agattgtctc cctactccag     2580

ccggtcgcag gtgcacatcc ggcgattgct cactttttac aacagcctga tgccgaacgt     2640

ttaagcgaac tgttcgccag cctgttgaat atgcagggtg aagaaaaatc ccgcgcgctg     2700

gcgattttaa aatcggccct cgatagccag cagggtgaac cgtggcaaac gattcgttta     2760

atttctgaat tttacccgga agacagcggt ctgttctccc cgctattgct gaatgtggtg     2820

aaattgaacc ctggcgaagc gatgttcctg ttcgctgaaa caccgcacgc ttacctgcaa     2880

ggcgtggcgc tggaagtgat ggcaaactcc gataacgtgc tgcgtgcggg tctgacgcct     2940

aaatacattg atattccgga actggttgcc aatgtgaaat tcgaagccaa accggctaac     3000

cagttgttga cccagccggt gaaacaaggt gcagaactgg acttcccgat tccagtggat     3060

gattttgcct tctcgctgca tgaccttagt gataaagaaa ccaccattag ccagcagagt     3120

gccgccattt tgttctgcgt cgaaggcgat gcaacgttgt ggaaaggttc tcagcagtta     3180

cagcttaaac cgggtgaatc agcgtttatt gccgccaacg aatcaccggt gactgtcaaa     3240

ggccacggcc gtttagcgcg tgtttacaac aagctgtaag agcttactga aaaaattaac     3300

atctcttgct aagctgggag ctctagatcc ccgaatttcc ccgatcgttc aaacatttgg     3360

caataaagtt tcttaagatt gaatcctgtt gccggtcttg cgatgattat catataattt     3420

ctgttgaatt acgttaagca tgtaataatt aacatgtaat gcatgacgtt atttatgaga     3480

tgggttttta tgattagagt cccgcaatta tacatttaat acgcgataga aaacaaaata     3540

tagcgcgcaa actaggataa attatcgcgc gcggtgtcat ctatgttact agatcgggaa     3600

ttggcgagct cgaattaatt cagtacatta aaaacgtccg caatgtgtta ttaagttgtc     3660

taagcgtcaa tttgtttaca ccacaatata tcctgccacc agccagccaa cagctccccg     3720

accggcagct cggcacaaaa tcaccactcg atacaggcag cccatcagtc cgggacggcg     3780

tcagcgggag agccgttgta aggcggcaga ctttgctcat gttaccgatg ctattcggaa     3840

gaacggcaac taagctgccg ggtttgaaac acggatgatc tcgcggaggg tagcatgttg     3900

attgtaacga tgacagagcg ttgctgcctg tgatcaaata tcatctccct cgcagagatc     3960

cgaattatca gccttcttat tcatttctcg cttaaccgtg acaggctgtc gatcttgaga     4020

actatgccga cataatagga aatcgctgga taaagccgct gaggaagctg agtggcgcta     4080

tttctttaga agtgaacgtt gacgatcgtc gaccgtaccc cgatgaatta attcggacgt     4140

acgttctgaa cacagctgga tacttacttg ggcgattgtc atacatgaca tcaacaatgt     4200

acccgtttgt gtaaccgtct cttggaggtt cgtatgacac tagtggttcc cctcagcttg     4260

cgactagatg ttgaggccta acattttatt agagagcagg ctagttgctt agatacatga     4320

tcttcaggcc gttatctgtc agggcaagcg aaaattggcc atttatgacg accaatgccc     4380

cgcagaagct cccatctttg ccgccataga cgccgcgccc cccttttggg gtgtagaaca     4440

tccttttgcc agatgtggaa aagaagttcg ttgtcccatt gttggcaatg acgtagtagc     4500

cggcgaaagt gcgagaccca tttgcgctat atataagcct acgatttccg ttgcgactat     4560

tgtcgtaatt ggatgaacta ttatcgtagt tgctctcaga gttgtcgtaa tttgatggac     4620

tattgtcgta attgcttatg gagttgtcgt agttgcttgg agaaatgtcg tagttggatg     4680

gggagtagtc atagggaaga cgagcttcat ccactaaaac aattggcagg tcagcaagtg     4740

cctgccccga tgccatcgca agtacgaggc ttagaaccac cttcaacaga tcgcgcatag     4800

tcttccccag ctctctaacg cttgagttaa gccgcgccgc gaagcggcgt cggcttgaac     4860

gaattgttag acattatttg ccgactacct tggtgatctc gcctttcacg tagtgaacaa     4920

attcttccaa ctgatctgcg cgcgaggcca agcgatcttc ttgtccaaga taagcctgcc     4980

tagcttcaag tatgacgggc tgatactggg ccggcaggcg ctccattgcc cagtcggcag     5040

cgacatcctt cggcgcgatt ttgccggtta ctgcgctgta ccaaatgcgg gacaacgtaa     5100

gcactacatt tcgctcatcg ccagcccagt cgggcggcga gttccatagc gttaaggttt     5160

catttagcgc ctcaaataga tcctgttcag gaaccggatc aaagagttcc tccgccgctg     5220

gacctaccaa ggcaacgcta tgttctcttg cttttgtcag caagatagcc agatcaatgt     5280

cgatcgtggc tggctcgaag atacctgcaa gaatgtcatt gcgctgccat tctccaaatt     5340

gcagttcgcg cttagctgga taacgccacg gaatgatgtc gtcgtgcaca acaatggtga     5400

cttctacagc gcggagaatc tcgctctctc caggggaagc cgaagtttcc aaaaggtcgt     5460

tgatcaaagc tcgccgcgtt gtttcatcaa gccttacggt caccgtaacc agcaaatcaa     5520

tatcactgtg tggcttcagg ccgccatcca ctgcggagcc gtacaaatgt acggccagca     5580

acgtcggttc gagatggcgc tcgatgacgc caactacctc tgatagttga gtcgatactt     5640

cggcgatcac cgcttccctc atgatgttta actcctgaat taagccgcgc cgcgaagcgg     5700

tgtcggcttg aatgaattgt taggcgtcat cctgtgctcc cgagaaccag taccagtaca     5760

tcgctgtttc gttcgagact tgaggtctag ttttatacgt gaacaggtca atgccgccga     5820

gagtaaagcc acattttgcg tacaaattgc aggcaggtac attgttcgtt tgtgtctcta     5880

atcgtatgcc aaggagctgt ctgcttagtg cccacttttt cgcaaattcg atgagactgt     5940

gcgcgactcc tttgcctcgg tgcgtgtgcg acacaacaat gtgttcgata gaggctagat     6000

cgttccatgt tgagttgagt tcaatcttcc cgacaagctc ttggtcgatg aatgcgccat     6060

agcaagcaga gtcttcatca gagtcatcat ccgagatgta atccttccgg taggggctca     6120

cacttctggt agatagttca aagccttggt cggataggtg cacatcgaac acttcacgaa     6180

caatgaaatg gttctcagca tccaatgttt ccgccacctg ctcagggatc accgaaatct     6240

tcatatgacg cctaacgcct ggcacagcgg atcgcaaacc tggcgcggct tttggcacaa     6300

aaggcgtgac aggtttgcga atccgttgct gccacttgtt aacccttttg ccagatttgg     6360

taactataat ttatgttaga ggcgaagtct tgggtaaaaa ctggcctaaa attgctgggg     6420

atttcaggaa agtaaacatc accttccggc tcgatgtcta ttgtagatat atgtagtgta     6480

tctacttgat cgggggatct gctgcctcgc gcgtttcggt gatgacggtg aaaacctctg     6540

acacatgcag ctcccggaga cggtcacagc ttgtctgtaa gcggatgccg ggagcagaca     6600

agcccgtcag ggcgcgtcag cgggtgttgg cgggtgtcgg ggcgcagcca tgacccagtc     6660

acgtagcgat agcggagtgt atactggctt aactatgcgg catcagagca gattgtactg     6720

agagtgcacc atatgcggtg tgaaataccg cacagatgcg taaggagaaa ataccgcatc     6780

aggcgctctt ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg gctgcggcga     6840

gcggtatcag ctcactcaaa ggcggtaata cggttatcca cagaatcagg ggataacgca     6900

ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg     6960

ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt     7020

cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc tggaagctcc     7080

ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct     7140

tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atctcagttc ggtgtaggtc     7200

gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta     7260

tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc actggcagca     7320

gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga gttcttgaag     7380

tggtggccta actacggcta cactagaagg acagtatttg gtatctgcgc tctgctgaag     7440

ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac caccgctggt     7500

agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg atctcaagaa     7560

gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc acgttaaggg     7620

attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga     7680

agttttaaat caatctaaag tatatatgag taaacttggt ctgacagtta ccaatgctta     7740

atcagtgagg cacctatctc agcgatctgt ctatttcgtt catccatagt tgcctgactc     7800

cccgtcgtgt agataactac gatacgggag ggcttaccat ctggccccag tgctgcaatg     7860

ataccgcgag acccacgctc accggctcca gatttatcag caataaacca gccagccgga     7920

agggccgagc gcagaagtgg tcctgcaact ttatccgcct ccatccagtc tattaattgt     7980

tgccgggaag ctagagtaag tagttcgcca gttaatagtt tgcgcaacgt tgttgccatt     8040

gctgcagggg gggggggggg ggggttccat tgttcattcc acggacaaaa acagagaaag     8100

gaaacgacag aggccaaaaa gctcgctttc agcacctgtc gtttcctttc ttttcagagg     8160

gtattttaaa taaaaacatt aagttatgac gaagaagaac ggaaacgcct taaaccggaa     8220

aattttcata aatagcgaaa acccgcgagg tcgccgcccc gtaacctgtc ggatcaccgg     8280

aaaggacccg taaagtgata atgattatca tctacatatc acaacgtgcg tggaggccat     8340

caaaccacgt caaataatca attatgacgc aggtatcgta ttaattgatc tgcatcaact     8400

taacgtaaaa acaacttcag acaatacaaa tcagcgacac tgaatacggg gcaacctcat     8460

gtcccccccc ccccccccct gcaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt     8520

cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa     8580

aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat     8640

cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct     8700

tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga     8760

gttgctcttg cccggcgtca acacgggata ataccgcgcc acatagcaga actttaaaag     8820

tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga     8880

gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca     8940

ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg     9000

cgacacggaa atgttgaata ctcatactct tcctttttca atattattga agcatttatc     9060

agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag     9120

gggttccgcg cacatttccc cgaaaagtgc cacctgacgt ctaagaaacc attattatca     9180

tgacattaac ctataaaaat aggcgtatca cgaggccctt tcgtcttcaa gaattggtcg     9240

acgatcttgc tgcgttcgga tattttcgtg gagttcccgc cacagacccg gattgaaggc     9300

gagatccagc aactcgcgcc agatcatcct gtgacggaac tttggcgcgt gatgactggc     9360

caggacgtcg gccgaaagag cgacaagcag atcacgcttt tcgacagcgt cggatttgcg     9420

atcgaggatt tttcggcgct gcgctacgtc cgcgaccgcg ttgagggatc aagccacagc     9480

agcccactcg accttctagc cgacccagac gagccaaggg atctttttgg aatgctgctc     9540

cgtcgtcagg ctttccgacg tttgggtggt tgaacagaag tcattatcgc acggaatgcc     9600

aagcactccc gaggggaacc ctgtggttgg catgcacata caaatggacg aacggataaa     9660

ccttttcacg cccttttaaa tatccgatta ttctaataaa cgctcttttc tcttaggttt     9720

acccgccaat atatcctgtc aaacactgat agtttaaact gaaggcggga aacgacaacc     9780

tgatcatgag cggagaatta agggagtcac gttatgaccc ccgccgatga cgcgggacaa     9840

gccgttttac gtttggaact gacagaaccg caacgttgaa ggagccactc agcttaatta     9900

agtctaactc gagttactgg tacgtaccaa atccatggaa tcaaggtacc gtcgactcta     9960

gtaacggccg ccagtgtgct ggaattaatt cggcttgtcg accacccaac cccatatcga    10020

cagaggatgt gaagaacagg taaatcacgc agaagaaccc atctctgata gcagctatcg    10080

attagaacaa cgaatccata ttgggtccgt gggaaatact tactgcacag gaagggggcg    10140

atctgacgag gccccgccac cggcctcgac ccgaggccga ggccgacgaa gcgccggcga    10200

gtacggcgcc gcggcggcct ctgcccgtgc cctctgcgcg tgggagggag aggccgcggt    10260

ggtgggggcg cgcgcgcgcg cgcgcgcagc tggtgcggcg gcgcgggggt cagccgccga    10320

gccggcggcg acggaggagc agggcggcgt ggacgcgaac ttccgatcgg ttggtcagag    10380

tgcgcgagtt gggcttagcc aattaggtct caacaatcta ttgggccgta aaattcatgg    10440

gccctggttt gtctaggccc aatatcccgt tcatttcagc ccacaaatat ttccccagag    10500

gattattaag gcccacacgc agcttatagc agatcaagta cgatgtttcc tgatcgttgg    10560

atcggaaacg tacggtcttg atcaggcatg ccgacttcgt caaagagagg cggcatgacc    10620

tgacgcggag ttggttccgg gcaccgtctg gatggtcgta ccgggaccgg acacgtgtcg    10680

cgcctccaac tacatggaca cgtgtggtgc tgccattggg ccgtacgcgt ggcggtgacc    10740

gcaccggatg ctgcctcgca ccgccttgcc cacgctttat atagagaggt tttctctcca    10800

ttaatcgcat agcgagtcga atcgaccgaa ggggaggggg agcgaagctt tgcgttctct    10860

aatcgcctcg tcaaggtaac taatcaatca cctcgtccta atcctcgaat ctctcgtggt    10920

gcccgtctaa tctcgcgatt ttgatgctcg tggtggaaag cgtaggagga tcccgtgcga    10980

gttagtctca atctctcagg gtttcgtgcg attttagggt gatccacctc ttaatcgagt    11040

tacggtttcg tgcgatttta gggtaatcct cttaatctct cattgattta gggtttcgtg    11100

agaatcgagg tagggatctg tgttatttat atcgatctaa tagatggatt ggttttgaga    11160

ttgttctgtc agatggggat tgtttcgata tattacccta atgatgtgtc agatggggat    11220

tgtttcgata tattacccta atgatgtgtc agatggggat tgtttcgata tattacccta    11280

atgatggata ataagagtag ttcacagtta tgttttgatc ctgccacata gtttgagttt    11340

tgtgatcaga tttagtttta cttatttgtg cttagttcgg atgggattgt tctgatattg    11400

ttccaataga tgaatagctc gttaggttaa aatctttagg ttgagttagg cgacacatag    11460

tttatttcct ctggatttgg attggaattg tgttcttagt ttttttcccc tggatttgga    11520

ttggaattgt gtggagctgg gttagagaat tacatctgta tcgtgtacac ctacttgaac    11580

tgtagagctt gggttctaag gtcaatttaa tctgtattgt atctggctct ttgcctagtt    11640

gaactgtagt gctgatgttg tactgtgttt ttttacccgt tttatttgct ttactcgtgc    11700

aaatcaaatc tgtcagatgc tagaactagg tggctttatt ctgtgttctt acatagatct    11760

gttgtcctgt agttacttat gtcagttttg ttattatctg aagatatttt tggttgttgc    11820

ttgttgatgt ggtgtgagct gtgagcagcg ctcttatgat taatgatgct gtccaattgt    11880

agtgtagtat gatgtgattg atatgttcat ctattttgag ctgacagtac cgatatcgta    11940

ggatctggtg ccaacttatt ctccagctgc ttttttttac ctatgttaat tccaatcctt    12000

tcttgcctct tccagatcca gataatggcg aacaaacatt tgtccctctc cctcttcctc    12060

gtcctccttg gcctgtcggc cagcttggcc tccgggcaac aaacaagcat tactctgaca    12120

tccaacgcat ccggtacgtt tgacggttac tattacgaac tctggaagga tactggcaat    12180

acaacaatga cggtctacac tcaaggtcgc ttttcctgcc agtggtcgaa catcaataac    12240

gcgttgttta ggaccgggaa gaaatacaac cagaattggc agtctcttgg cacaatccgg    12300

atcacgtact ctgcgactta caacccaaac gggaactcct acttgtgtat ctatggctgg    12360

tctaccaacc cattggtcga gttctacatc gttgagtcct gggggaactg gagaccgcct    12420

ggtgcctgcc tggccgaggg ctcgctcgtc ttggacgcgg ctaccgggca gagggtccct    12480

atcgaaaagg tgcgtccggg gatggaagtt ttctccttgg gacctgatta cagactgtat    12540

cgggtgcccg ttttggaggt ccttgagagc ggggttaggg aagttgtgcg cctcagaact    12600

cggtcaggga gaacgctggt gttgacacca gatcacccgc ttttgacccc cgaaggttgg    12660

aaacctcttt gtgacctccc gcttggaact ccaattgcag tccccgcaga actgcctgtg    12720

gcgggccact tggccccacc tgaagaacgt gttacgctcc tggctcttct gttgggggat    12780

gggaacacaa agctgtcggg tcggagaggt acacgtccta atgccttctt ctacagcaaa    12840

aaccccgaat tgctcgcggc ttatcgccgg tgtgcagaag ccttgggtgc aaaggtgaaa    12900

gcatacgtcc acccgactac gggggtggtt acactcgcaa ccctcgctcc acgtcctgga    12960

gctcaagatc ctgtcaaacg cctcgttgtc gaggcgggaa tggttgctaa agccgaagag    13020

aagagggtcc cggaggaggt gtttcgttac cggcgtgagg cgttggccct tttcttgggc    13080

cgtttgttct cgacagacgg ctctgttgaa aagaagagga tctcttattc aagtgccagt    13140

ttgggactgg cccaggatgt cgcacatctc ttgctgcgcc ttggaattac atctcaactc    13200

cgttcgagag ggccacgggc tcacgaggtt cttatatcgg gccgcgagga tattttgcgg    13260

tttgctgaac ttatcggacc ctacctcttg ggggccaaga gggagagact tgcagcgctg    13320

gaagctgagg cccgcaggcg tttgcctgga cagggatggc acttgcggct tgttcttcct    13380

gccgtggcgt acagagtggg cgaggcggaa aggcgctcgg gattttcgtg gagtgaagcc    13440

ggtcggcgcg tcgcagttgc gggatcgtgt ttgtcatctg gactcaacct caaattgccc    13500

agacgctacc tttctcggca ccggttgtcg ctgctcggtg aggcttttgc cgaccctggg    13560

ctggaagcgc tcgcggaagg ccaagtgctc tgggacccta ttgttgctgt cgaaccggcc    13620

ggtaaggcga gaacattcga cttgcgcgtt ccaccctttg caaacttcgt gagcgaggac    13680

ctggtggtgc ataacaccgt ccccctgggc caagtgacaa tcgatggcgg gacctacgac    13740

atctatagga cgacacgcgt caaccagcct tccattgtgg ggacagccac gttcgatcag    13800

tactggagcg tgcgcacctc taagcggact tcaggaacag tgaccgtgac cgatcacttc    13860

cgcgcctggg cgaaccgggg cctgaacctc ggcacaatag accaaattac attgtgcgtg    13920

gagggttacc aaagctctgg atcagccaac atcacccaga acaccttctc tcagggctct    13980

tcttccggca gttcgggtgg ctcatccggc tccacaacga ctactcgcat cgagtgtgag    14040

aacatgtcct tgtccggacc ctacgttagc aggatcacca atccctttaa tggtattgcg    14100

ctgtacgcca acggagacac agcccgcgct accgttaact tccccgcaag tcgcaactac    14160

aatttccgcc tgcggggttg cggcaacaac aataatcttg cccgtgtgga cctgaggatc    14220

gacggacgga ccgtcgggac cttttattac cagggcacat acccctggga ggccccaatt    14280

gacaatgttt atgtcagtgc ggggagtcat acagtcgaaa tcactgttac tgcggataac    14340

ggcacatggg acgtgtatgc cgactacctg gtgatacaga gcgagaagga cgagctgtga    14400

cctaggtccc cgaatttccc cgatcgttca aacatttggc aataaagttt cttaagattg    14460

aatcctgttg ccggtcttgc gatgattatc atataatttc tgttgaatta cgttaagcat    14520

gtaataatta acatgtaatg catgacgtta tttatgagat gggtttttat gattagagtc    14580

ccgcaattat acatttaata cgcgatagaa aacaaaatat agcgcgcaaa ctaggataaa    14640

ttatcgcgcg cggtgtcatc tatgttacta gatcgggaat tgg                      14683


