
                                SEQUENCE LISTING

<110> Cathy Hass
      Peter Jauert
      Panagiota Kiriakou
      Nandita Kohli
      Eric Lenneman
      Hans Liao
      Catherine Poor
      Brian Rush
      Robbyn Weaver
      Cargill Inc.

<120> ENZYMES AND METHODS FOR PRODUCTION OF MALONIC ACID
      AND DERIVATIVES THEREOF
  

<130> 4361.178WO1

<150> US 63/055,624
<151> 2020-07-23

<160> 26

<170> FastSEQ for Windows Version 4.0

<210> 1
<211> 4476
<212> DNA
<213> Artificial Sequence

<220> 
<223> A synthetic oligonucleotide

<400> 1
ctaaattcgg ccttgctcag agactcctgg attttggcta acaacgcagt cccttcgatg      60
catatagcta ggccacaaat tatgccaata acggtccatg ggttgatgtt ttcttgaatt     120
ctttcgtttt tcatgctatt tgcgtcttcc caagtcccag cgttccagta ttcatactgc     180
gcgttagagt ggtagccata agagccggca tattggtaat tttcagtatt aacgttagaa     240
cgtggtgaat acgatgtggt ccagccttgc ctcgttgtgt catatacgat ctttttcttt     300
gggtcacaaa gaatatcata tgcttgagag atgactttaa atctatgtag tttttcgctt     360
gatgttagca gcagcggtga tttactatca ctgttggtaa ccttttctga gctaaatatt     420
tgaatgttat cggaatggtc agggtggtac aattttacat aacgatgata tttttttttt     480
aacgacttct tgtccagttt aggatttcca gatccggcct ttggaatgcc aaaaatatca     540
tagggagttg gatctgccaa ctcaggccat tgttcatccc ttatcgtaag ttttctattg     600
ccatttttat cgttcgctgt agcatactta gctataaaag tgatttgtgg gggacacttt     660
tctacacatg ataagtgcca cttgaataaa aatgggtata cgaacttatg gtgtagcata     720
acaaatatat tgcaagtagt gacctatggt gtgtagatat acgtacagtt agttacgagc     780
ctaaagacac aacgtgtttg ttaattatac tgtcgctgta atatcttctc ttccattatc     840
accggtcatt ccttgcaggg gcggtagtac ccggagaccc tgaacttttc tttttttttt     900
tgcgaaatta aaaagttcat tttcaattcg acaatgagat ctacaagcca ttgttttatg     960
ttgatgagag ccagcttaaa gagttctcga gatctcccga gtttatcatt atcaatactg    1020
ccatttcaaa gaatacgtaa ataattaata gtagtgattt tcctaacttt atttagtcaa    1080
aaaattggcc ttttaattct gctgtaaccc gtacatgccc aaaatagggg gcgggttaca    1140
cagaatatat aacatcatag gtgtctgggt gaacagttta ttcctggcat ccactaaata    1200
taatggagcc cgcttttttt aagctggcat ccagaaaaaa aaagaatccc agcaccaaaa    1260
tattgttttc ttcaccaacc atcagttcat aggtccattc tcttagcgca actacacaga    1320
acaggggcac aaacaggcaa aaaacgggca caacctcaat ggagtgatgc aacctgcttg    1380
gagtaaatga tgacacaagg caattgacct acgcatgtat ctatctcatt ttcttacacc    1440
ttctattacc ttctgctctc tctgatttgg aaaaagctga aaaaaaaggt tgaaaccagt    1500
tccctgaaat tattccccta tttgactaat aagtatataa agacggtagg tattgattgt    1560
aattctgtaa atctatttct taaacttctt aaattctact tttatagtta gtcttttttt    1620
tagtttaaaa caccaagaac ttagtttcga ataaacacac ataaacaaac aaatctagaa    1680
tgtctattag tgaaaaatat tttcctcaag aacctcaatc tcctgaattg aaaactgcaa    1740
ttccaggtcc tcaatcaaag gcaaagctcg aggaattatc tgctgtctat gatacaaagg    1800
ctgcatattt tgttaccgac tactacaaat ctcttggtaa ctatattgtg gatgcagatg    1860
gcaacaagct actagattct tattgccaaa tctcttctat cgcattgggt tacaataatc    1920
cagcattatt aaaagtagca cattctgatg aaatggcagt tgctttatgt aacagacctg    1980
ctttggcatg ttttccatcc actgattact atgaaatact aaagaaggga ttgttgtccg    2040
ttgctccaaa gggattagat aaggtttgta ctgcacacac gggatctgat gccaatgaaa    2100
tggcatttaa ggctgcattt ttgtttcaag caagtaagaa gagaggtgac aaaccattta    2160
ccagcgaaga gctggaatct gtcatggaga acaagttgcc aggcacctct gacatggtta    2220
tcctgtcatt tgaaaaaggg ttccatggta gattgtttgg atctttatct accactagat    2280
ctaaagctat tcacaaactg gatattcctg cgtttgaatg gccaaaggct ccattccctc    2340
agttaaagta tcctctggat caattccaag ctgaaaacaa agcagaagaa gaaagatgtt    2400
tgaaggcttt agaggaaatt attgtcaact ctcctgccaa aattgcagct gcaatcattg    2460
aaccggtcca atctgaaggt ggtgataatc atgcttcacc agaattcttc caaggtatta    2520
gagaaatcac caaaaagcac ggtgtcattc ttattgttga tgaagttcaa acaggaggtg    2580
gtgcttctgg taagatgtgg ttacatgaac actatggcat tgtcccagac atcatgactt    2640
tttctaaaaa aatgcaaaat gcaggtttct ttttcagtga agcaggtctt gctggggacc    2700
aaccattcag acaattcaat acctggtgcg gtgatccatc aaaagctcta attgcaagaa    2760
ccataattga agaaattaaa gataagaacc tattgactag tgttaccgaa acaggtgact    2820
acctatattc aaagctcgaa gcaatttcag caaagtatga caaaatgatc aacttgagag    2880
gtaagggaag aggtttcttt attgcatttg atgccccaac accggagtta agaaacaaat    2940
ttattgctga atgtaagaaa ttaggtttaa acattggtgg atgcggtgaa caaggtgtta    3000
gattgagacc tgcattagtt tttgaaaaga agcatgctga tatcttagcc tccattattg    3060
atcaagcttt ttccaaaatt taattaatta aacaggcccc ttttcctttg tcgatatcat    3120
gtaattagtt atgtcacgct tacattcacg ccctcctccc acatccgctc taaccgaaaa    3180
ggaaggagtt agacaacctg aagtctaggt ccctatttat ttttttatag ttatgttagt    3240
attaagaacg ttatttatat ttcaaatttt tctttttttt ctgtacaaac gcgtgtacgc    3300
atgtaacggg cagacggccg gccataactt cgtataatgt atgctatacg aagttatggc    3360
aacggttcat catctcatgg atctgcacat gaacaaacac cagagtcaaa cgacgttgaa    3420
attgaggcta ctgcgccaat tgatgacaat acagacgatg ataacaaacc gaagttatct    3480
gatgtagaaa aggattagag atgctaagag atagtgatga tatttcataa ataatgtaat    3540
tctatatatg ttaattacct tttttgcgag gcatatttat ggtgaaggat aagttttgac    3600
catcaaagaa ggttaatgtg gctgtggttt cagggtccat aaagcttttc aattcatctt    3660
tttttttttt gttctttttt ttgattccgg tttctttgaa atttttttga ttcggtaatc    3720
tccgagcaga aggaagaacg aaggaaggag cacagactta gattggtata tatacgcata    3780
tgtggtgttg aagaaacatg aaattgccca gtattcttaa cccaactgca cagaacaaaa    3840
acctgcagga aacgaagata aatcatgtcg aaagctacat ataaggaacg tgctgctact    3900
catcctagtc ctgttgctgc caagctattt aatatcatgc acgaaaagca aacaaacttg    3960
tgtgcttcat tggatgttcg taccaccaag gaattactgg agttagttga agcattaggt    4020
cccaaaattt gtttactaaa aacacatgtg gatatcttga ctgatttttc catggagggc    4080
acagttaagc cgctaaaggc attatccgcc aagtacaatt ttttactctt cgaagacaga    4140
aaatttgctg acattggtaa tacagtcaaa ttgcagtact ctgcgggtgt atacagaata    4200
gcagaatggg cagacattac gaatgcacac ggtgtggtgg gcccaggtat tgttagcggt    4260
ttgaagcagg cggcggaaga agtaacaaag gaacctagag gccttttgat gttagcagaa    4320
ttgtcatgca agggctccct agctactgga gaatatacta agggtactgt tgacattgcg    4380
aagagcgaca aagattttgt tatcggcttt attgctcaaa gagacatggg tggaagagat    4440
gaaggttacg attggttgat tatgacacgc ggccgc                              4476

<210> 2
<211> 4355
<212> DNA
<213> Artificial Sequence

<220> 
<223> A synthetic oligonucleotide

<400> 2
ggccgctcca tggagggcac agttaagccg ctaaaggcat tatccgccaa gtacaatttt      60
ttactcttcg aagacagaaa atttgctgac attggtaata cagtcaaatt gcagtactct     120
gcgggtgtat acagaatagc agaatgggca gacattacga atgcacacgg tgtggtgggc     180
ccaggtattg ttagcggttt gaagcaggcg gcggaagaag taacaaagga acctagaggc     240
cttttgatgt tagcagaatt gtcatgcaag ggctccctag ctactggaga atatactaag     300
ggtactgttg acattgcgaa gagcgacaaa gattttgtta tcggctttat tgctcaaaga     360
gacatgggtg gaagagatga aggttacgat tggttgatta tgacacccgg tgtgggttta     420
gatgacaagg gagacgcatt gggtcaacag tatagaaccg tggatgatgt ggtctctaca     480
ggatctgaca ttattattgt tggaagagga ctatttgcaa agggaaggga tgctaaggta     540
gagggtgaac gttacagaaa agcaggctgg gaagcatatt tgagaagatg cggccagcaa     600
aactaaaaaa ctgtattata agtaaatgca tgtatactaa actcacaaat tagagcttca     660
atttaattat atcagttatt acccgggaat ctcggtcgta atgattttta taatgacgaa     720
aaaaaaaaaa ttggaaagaa aaagcttcat ggcctttata aaaaggaacc atccaatacc     780
tcgccagaac caagtaacag tattttacgg ggcacaaatc aagaacaata agacaggact     840
gtaaagatgg acgcattgaa ctccaaagaa caacaagagt tccaaaaagt agtggaacaa     900
aagcaaatga aggatttcat gcgtttgata acttcgtata atgtatgcta tacgaagtta     960
tctcgagggc cagaaaaagg aagtgtttcc ctccttcttg aattgatgtt accctcataa    1020
agcacgtggc ctcttatcga gaaagaaatt accgtcgctc gtgatttgtt tgcaaaaaga    1080
acaaaactga aaaaacccag acacgctcga cttcctgtct tcctgttgat tgcagcttcc    1140
aatttcgtca cacaacaagg tcctagcgac ggctcacagg ttttgtaaca agcaatcgaa    1200
ggttctggaa tggcgggaaa gggtttagta ccacatgcta tgatgcccac tgtgatctcc    1260
agagcaaagt tcgttcgatc gtactgttac tctctctctt tcaaacagaa ttgtccgaat    1320
cgtgtgacaa caacagcctg ttctcacaca ctcttttctt ctaaccaagg gggtggttta    1380
gtttagtaga acctcgtgaa acttacattt acatatatat aaacttgcat aaattggtca    1440
atgcaagaaa tacatatttg gtcttttcta attcgtagtt tttcaagttc ttagatgctt    1500
tctttttctc ttttttacag atcatcaagg aagtaattat ctacttttta caagtctaga    1560
atgtctatta gtgaaaaata ttttcctcaa gaacctcaat ctcctgaatt gaaaactgca    1620
attccaggtc ctcaatcaaa ggcaaagctc gaggaattat ctgctgtcta tgatacaaag    1680
gctgcatatt ttgttaccga ctactacaaa tctcttggta actatattgt ggatgcagat    1740
ggcaacaagc tactagattc ttattgccaa atctcttcta tcgcattggg ttacaataat    1800
ccagcattat taaaagtagc acattctgat gaaatggcag ttgctttatg taacagacct    1860
gctttggcat gttttccatc cactgattac tatgaaatac taaagaaggg attgttgtcc    1920
gttgctccaa agggattaga taaggtttgt actgcacaca cgggatctga tgccaatgaa    1980
atggcattta aggctgcatt tttgtttcaa gcaagtaaga agagaggtga caaaccattt    2040
accagcgaag agctggaatc tgtcatggag aacaagttgc caggcacctc tgacatggtt    2100
atcctgtcat ttgaaaaagg gttccatggt agattgtttg gatctttatc taccactaga    2160
tctaaagcta ttcacaaact ggatattcct gcgtttgaat ggccaaaggc tccattccct    2220
cagttaaagt atcctctgga tcaattccaa gctgaaaaca aagcagaaga agaaagatgt    2280
ttgaaggctt tagaggaaat tattgtcaac tctcctgcca aaattgcagc tgcaatcatt    2340
gaaccggtcc aatctgaagg tggtgataat catgcttcac cagaattctt ccaaggtatt    2400
agagaaatca ccaaaaagca cggtgtcatt cttattgttg atgaagttca aacaggaggt    2460
ggtgcttctg gtaagatgtg gttacatgaa cactatggca ttgtcccaga catcatgact    2520
ttttctaaaa aaatgcaaaa tgcaggtttc tttttcagtg aagcaggtct tgctggggac    2580
caaccattca gacaattcaa tacctggtgc ggtgatccat caaaagctct aattgcaaga    2640
accataattg aagaaattaa agataagaac ctattgacta gtgttaccga aacaggtgac    2700
tacctatatt caaagctcga agcaatttca gcaaagtatg acaaaatgat caacttgaga    2760
ggtaagggaa gaggtttctt tattgcattt gatgccccaa caccggagtt aagaaacaaa    2820
tttattgctg aatgtaagaa attaggttta aacattggtg gatgcggtga acaaggtgtt    2880
agattgagac ctgcattagt ttttgaaaag aagcatgctg atatcttagc ctccattatt    2940
gatcaagctt tttccaaaat ttaattaatt aatttaccag cttactatcc ttcttgaaaa    3000
tatgcactct atatctttta gttcttaatt gcaacacata gatttgctgt ataacgaatt    3060
ttatgctatt tttttaattt ggagttcggt gatgaaagtg tcacagcgaa tttcctcaca    3120
tgtagggacc gaattgttta caagttctct gtaccaccat ggagacatca aagattgaaa    3180
atctatggaa agatatggac ggtagcaaca agaatatagc acgagccgcg gagttcattt    3240
cgttactttt gatatcgctc acaactattg cgaagcgctt cagtgaaaaa atcataagga    3300
aaagttgtaa atattattgg tagtattcgt ttggtaaagt agagggggta atttttcccc    3360
tttattttgt tcatacattc ttaaattgct ttgcctctcc ttttggaaag ctatacttcg    3420
gagcactgtt gagcgaaggc tcaggccggc agcacgcagc acgctgtatt tacgtattta    3480
attttatata tttgtgcata cactactagg gaagacttga aaaaaaccta ggaaatgaaa    3540
aaacgacaca ggaagtcccg tatttactat tttttccttc cttttgatgg ggcagggcgg    3600
aaatagagga taggataagc ctactgctta gctgtttccg tctctacttc ggtagttgtc    3660
tcaattgtcg tttcagtatt acctttagag ccgctagacg atggttgagc tatttgttga    3720
gggaaaacta agttcatgta acacacgcat aacccgatta aactcatgaa tagcttgatt    3780
gcaggaggct ggtccattgg agatggtgcc ttattttcct tataggcaac gatgatgtct    3840
tcgtcggtgt tcaggtagta gtgtacactc tgaatcaggg agaaccaggc aatgaacttg    3900
ttcctcaaga aaatagcggc cataggcatg gattggttaa ccacaccaga tatgcttggt    3960
gtggcagaat atagtccttt tggtggcgca attttcttgt acctgtggta gaaagggagc    4020
ggttgaactg ttagtatata ttggcaatat cagcaaattt gaaagaaaat tgtcggtgaa    4080
aaacatacga aacacaaagg tcgggccttg caacgttatt caaagtcatt gtttagttga    4140
ggaggtagca gcggagtata tgtattcctt ttttttgcct atggatgttg taccatgccc    4200
attctgctca agcttttgtt aaaattattt ttcagtattt tttcttccat gttgcgcgtt    4260
acgagaacag aagcgacaga taaccgcaat catacaacta gcgctactgc ggggtgtaaa    4320
aagcacaaga actaagccaa gatcacaaca gttat                               4355

<210> 3
<211> 467
<212> PRT
<213> Issachenkia orientalis

<400> 3
Met Ser Ile Ser Glu Lys Tyr Phe Pro Gln Glu Pro Gln Ser Pro Glu
 1               5                  10                  15
Leu Lys Thr Ala Ile Pro Gly Pro Gln Ser Lys Ala Lys Leu Glu Glu
            20                  25                  30
Leu Ser Ala Val Tyr Asp Thr Lys Ala Ala Tyr Phe Val Thr Asp Tyr
        35                  40                  45
Tyr Lys Ser Leu Gly Asn Tyr Ile Val Asp Ala Asp Gly Asn Lys Leu
    50                  55                  60
Leu Asp Ser Tyr Cys Gln Ile Ser Ser Ile Ala Leu Gly Tyr Asn Asn
65                  70                  75                  80
Pro Ala Leu Leu Lys Val Ala His Ser Asp Glu Met Ala Val Ala Leu
                85                  90                  95
Cys Asn Arg Pro Ala Leu Ala Cys Phe Pro Ser Thr Asp Tyr Tyr Glu
            100                 105                 110
Ile Leu Lys Lys Gly Leu Leu Ser Val Ala Pro Lys Gly Leu Asp Lys
        115                 120                 125
Val Cys Thr Ala His Thr Gly Ser Asp Ala Asn Glu Met Ala Phe Lys
    130                 135                 140
Ala Ala Phe Leu Phe Gln Ala Ser Lys Lys Arg Gly Asp Lys Pro Phe
145                 150                 155                 160
Thr Ser Glu Glu Leu Glu Ser Val Met Glu Asn Lys Leu Pro Gly Thr
                165                 170                 175
Ser Asp Met Val Ile Leu Ser Phe Glu Lys Gly Phe His Gly Arg Leu
            180                 185                 190
Phe Gly Ser Leu Ser Thr Thr Arg Ser Lys Ala Ile His Lys Leu Asp
        195                 200                 205
Ile Pro Ala Phe Glu Trp Pro Lys Ala Pro Phe Pro Gln Leu Lys Tyr
    210                 215                 220
Pro Leu Asp Gln Phe Gln Ala Glu Asn Lys Ala Glu Glu Glu Arg Cys
225                 230                 235                 240
Leu Lys Ala Leu Glu Glu Ile Ile Val Asn Ser Pro Ala Lys Ile Ala
                245                 250                 255
Ala Ala Ile Ile Glu Pro Val Gln Ser Glu Gly Gly Asp Asn His Ala
            260                 265                 270
Ser Pro Glu Phe Phe Gln Gly Ile Arg Glu Ile Thr Lys Lys His Gly
        275                 280                 285
Val Ile Leu Ile Val Asp Glu Val Gln Thr Gly Gly Gly Ala Ser Gly
    290                 295                 300
Lys Met Trp Leu His Glu His Tyr Gly Ile Val Pro Asp Ile Met Thr
305                 310                 315                 320
Phe Ser Lys Lys Met Gln Asn Ala Gly Phe Phe Phe Ser Glu Ala Gly
                325                 330                 335
Leu Ala Gly Asp Gln Pro Phe Arg Gln Phe Asn Thr Trp Cys Gly Asp
            340                 345                 350
Pro Ser Lys Ala Leu Ile Ala Arg Thr Ile Ile Glu Glu Ile Lys Asp
        355                 360                 365
Lys Asn Leu Leu Thr Ser Val Thr Glu Thr Gly Asp Tyr Leu Tyr Ser
    370                 375                 380
Lys Leu Glu Ala Ile Ser Ala Lys Tyr Asp Lys Met Ile Asn Leu Arg
385                 390                 395                 400
Gly Lys Gly Arg Gly Phe Phe Ile Ala Phe Asp Ala Pro Thr Pro Glu
                405                 410                 415
Leu Arg Asn Lys Phe Ile Ala Glu Cys Lys Lys Leu Gly Leu Asn Ile
            420                 425                 430
Gly Gly Cys Gly Glu Gln Gly Val Arg Leu Arg Pro Ala Leu Val Phe
        435                 440                 445
Glu Lys Lys His Ala Asp Ile Leu Ala Ser Ile Ile Asp Gln Ala Phe
    450                 455                 460
Ser Lys Ile
465

<210> 4
<211> 8446
<212> DNA
<213> Artificial Sequence

<220> 
<223> A synthetic oligonucleotide

<400> 4
catctcccat cacataggaa gcaacaggcg cgttggactt ttaattttcg aggaccgcga      60
atccttacat cacacccaat cccccacaag tgatccccca cacaccatag cttcaaaatg     120
tttctactcc ttttttactc ttccagattt tctcggactc cgcgcatcgc cgtaccactt     180
caaaacaccc aagcacagca tactaaattc cccctctttc ttcctctagg gtgtcgttaa     240
ttacccgtac taaaggtttg gaaaagaaaa aagagaccgc ctcgtttctt tttcttcgtc     300
gaaaaaggca ataaaaattt ttatcacgtt tctttttctt gaaaattttt ttttttgttt     360
ttttttctct ttcgatgacc tcccattgat atttaagtta ataaatggtc ttcaatttct     420
caagtttcag tttcattttt cttgttctat tacaactttt tttacttctt gctcattaga     480
aagaaagcat agcaatctaa tctaagtttt aattacaaat ctagaatggg gaaagagaag     540
acgcatgtct ctaggcccag actgaatagc aatatggatg ccgacttata cgggtataag     600
tgggccagag ataatgttgg tcagagcggg gcgaccatct atcgtcttta tggtaagcca     660
gacgcgccag aattattcct aaaacacggt aagggttctg ttgctaatga cgttacggat     720
gaaatggtta ggctgaactg gctgactgag tttatgcctt tacctacaat aaagcacttc     780
ataagaactc cggatgacgc ctggttacta acgactgcta ttcccggtaa gaccgcattc     840
caagtattgg aggagtaccc ggacagtggg gagaatatag ttgacgcatt agctgttttt     900
ttaaggaggc tacattctat accagtatgt aactgcccct ttaatagtga cagagtgttt     960
cgtcttgctc aggcgcagag tcgtatgaat aacgggttag ttgatgcgtc tgatttcgac    1020
gatgagagaa acggttggcc cgttgaacaa gtctggaaag agatgcacaa attgcttcct    1080
tttagtcccg attctgtggt aacacacggt gacttctctt tggataatct aatatttgat    1140
gaaggaaagt taatcgggtg catagatgtg ggccgtgtag gcatagcaga tcgttaccaa    1200
gatttagcca ttttatggaa ttgtctaggg gagtttagtc cctcactgca aaaaaggtta    1260
tttcagaaat acggcatcga caaccccgac atgaacaagc tgcagttcca cctaatgttg    1320
gacgaatttt tttaattaat ttaccagctt actatccttc ttgaaaatat gcactctata    1380
tcttttagtt cttaattgca acacatagat ttgctgtata acgaatttta tgctattttt    1440
ttaatttgga gttcggtgat gaaagtgtca cagcgaattt cctcacatgt agggaccgaa    1500
ttgtttacaa gttctctgta ccaccatgga gacatcaaag attgaaaatc tatggaaaga    1560
tatggacggt agcaacaaga atatagcacg agccgcggag ttcatttcgt tacttttgat    1620
atcgctcaca actattgcga agcgcttcag tgaaaaaatc ataaggaaaa gttgtaaata    1680
ttattggtag tattcgtttg gtaaagtaga gggggtaatt tttccccttt attttgttca    1740
tacattctta aattgctttg cctctccttt tggaaagcta tacttcggag cactgttgag    1800
cgaaggctca gcggccgcgc cagaaaaagg aagtgtttcc ctccttcttg aattgatgtt    1860
accctcataa agcacgtggc ctcttatcga gaaagaaatt accgtcgctc gtgatttgtt    1920
tgcaaaaaga acaaaactga aaaaacccag acacgctcga cttcctgtct tcctgttgat    1980
tgcagcttcc aatttcgtca cacaacaagg tcctagcgac ggctcacagg ttttgtaaca    2040
agcaatcgaa ggttctggaa tggcgggaaa gggtttagta ccacatgcta tgatgcccac    2100
tgtgatctcc agagcaaagt tcgttcgatc gtactgttac tctctctctt tcaaacagaa    2160
ttgtccgaat cgtgtgacaa caacagcctg ttctcacaca ctcttttctt ctaaccaagg    2220
gggtggttta gtttagtaga acctcgtgaa acttacattt acatatatat aaacttgcat    2280
aaattggtca atgcaagaaa tacatatttg gtcttttcta attcgtagtt tttcaagttc    2340
ttagatgctt tctttttctc ttttttacag atcatcaagg aagtaattat ctacttttta    2400
caagaattca tgtctattta cttactgttc accaaaactt gcctgcatta ccagttgacg    2460
caacctccga tgaagtcaga aagaacctta tggatatgtt tagagataga caagctttct    2520
ccgaacatac ttggaaaatg ttattatccg tttgtagatc ctgggccgct tggtgtaaac    2580
ttaacaatag aaaatggttt cctgctgaac cagaagacgt cagagattac ttactttact    2640
tacaagctag aggtttggct gttaaaacta tccaacaaca cttaggtcaa ttgaatatgt    2700
tacacagaag atccggttta ccaagaccat ccgattccaa cgcagtttcc cttgttatga    2760
gaagaattag aaaagaaaat gttgacgctg gtgaaagagc taaacaagca ttagcatttg    2820
aaagaaccga tttcgatcaa gttagatcct taatggaaaa ttccgataga tgtcaagata    2880
ttagaaactt agctttctta ggtattgctt acaacacatt attaagaatc gctgaaattg    2940
ctagaattag agttaaagat atttcaagaa ccgatggcgg tagaatgtta atccacattg    3000
gcagaacaaa aaccttagtc tccacagcag gcgtcgaaaa agcattatca ttaggtgtta    3060
ctaaattagt tgaacgttgg atttccgttt ccggtgttgc agatgaccca aacaactact    3120
tattctgtcg tgttagaaaa aatggtgttg ccgctccttc cgctacctca caattatcca    3180
caagagcatt agaaggcatt tttgaagcta cccacagact tatttatggt gcaaaagacg    3240
attccggtca aagatattta gcttggtctg gtcattccgc tagagttggt gccgcaagag    3300
acatggcaag agctggtgtt tctattcctg aaattatgca agccggtggt tggactaatg    3360
ttaacattgt tatgaactat atcagaaact tagattccga aacaggtgct atggttagat    3420
tacttgaaga cggtgattaa gctagctaag atccgctcta accgaaaagg aaggagttag    3480
acaacctgaa gtctaggtcc ctatttattt ttttatagtt atgttagtat taagaacgtt    3540
atttatattt caaatttttc ttttttttct gtacagacgc gtgtacgcat gtaacattat    3600
actgaaaacc ttgcttgaga aggttttggg acgctcgaag gagctccaat tcgccctata    3660
gtgagtcgta ttacaattca ctggccgtcg ttttacaacg tcgtgactgg gaaaaccctg    3720
gcgttaccca acttaatcgc cttgcagcac atcccccctt cgccagctgg cgtaatagcg    3780
aagaggcccg caccgatcgc ccttcccaac agttgcgcag cctgaatggc gaatggcgcg    3840
acgcgccctg tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc agcgtgaccg    3900
ctacacttgc cagcgcccta gcgcccgctc ctttcgcttt cttcccttcc tttctcgcca    3960
cgttcgccgg ctttccccgt caagctctaa atcgggggct ccctttaggg ttccgattta    4020
gtgctttacg gcacctcgac cccaaaaaac ttgattaggg tgatggttca cgtagtgggc    4080
catcgccctg atagacggtt tttcgccctt tgacgttgga gtccacgttc tttaatagtg    4140
gactcttgtt ccaaactgga acaacactca accctatctc ggtctattct tttgatttat    4200
aagggatttt gccgatttcg gcctattggt taaaaaatga gctgatttaa caaaaattta    4260
acgcgaattt taacaaaata ttaacgttta caatttcctg atgcggtatt ttctccttac    4320
gcatctgtgc ggtatttcac accgcagggt aataactgat ataattaaat tgaagctcta    4380
atttgtgagt ttagtataca tgcatttact tataatacag ttttttagtt ttgctggccg    4440
catcttctca aatatgcttc ccagcctgct tttctgtaac gttcaccctc taccttagca    4500
tcccttccct ttgcaaatag tcctcttcca acaataataa tgtcagatcc tgtagagacc    4560
acatcatcca cggttctata ctgttgaccc aatgcgtctc ccttgtcatc taaacccaca    4620
ccgggtgtca taatcaacca atcgtaacct tcatctcttc cacccatgtc tctttgagca    4680
ataaagccga taacaaaatc tttgtcgctc ttcgcaatgt caacagtacc cttagtatat    4740
tctccagtag atagggagcc cttgcatgac aattctgcta acatcaaaag gcctctaggt    4800
tcctttgtta cttcttctgc cgcctgcttc aaaccgctaa caatacctgg gcccaccaca    4860
ccgtgtgcat tcgtaatgtc tgcccattct gctattctgt atacacccgc agagtactgc    4920
aatttgactg tattaccaat gtcagcaaat tttctgtctt cgaagagtaa aaaattgtac    4980
ttggcggata atgcctttag cggcttaact gtgccctcca tggaaaaatc agtcaagata    5040
tccacatgtg tttttagtaa acaaattttg ggacctaatg cttcaactaa ctccagtaat    5100
tccttggtgg tacgaacatc caatgaagca cacaagtttg tttgcttttc gtgcatgata    5160
ttaaatagct tggcagcaac aggactagga tgagtagcag cacgttcctt atatgtagct    5220
ttcgacatga tttatcttcg tttcctgcag gtttttgttc tgtgcagttg ggttaagaat    5280
actgggcaat ttcatgtttc ttcaacacta catatgcgta tatataccaa tctaagtctg    5340
tgctccttcc ttcgttcttc cttctgttcg gagattaccg aatcaaaaaa atttcaaaga    5400
aaccgaaatc aaaaaaaaga ataaaaaaaa aatgatgaat tgaattgaaa agcgtggtgc    5460
actctcagta caatctgctc tgatgccgca tagttaagcc agccccgaca cccgccaaca    5520
cccgctgacg cgccctgacg ggcttgtctg ctcccggcat ccgcttacag acaagctgtg    5580
accgtctccg ggagctgcat gtgtcagagg ttttcaccgt catcaccgaa acgcgcgaga    5640
cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct    5700
taggacggat cgcttgcctg taacttacac gcgcctcgta tcttttaatg atggaataat    5760
ttgggaattt actctgtgtt tatttatttt tatgttttgt atttggattt tagaaagtaa    5820
ataaagaagg tagaagagtt acggaatgaa gaaaaaaaaa taaacaaagg tttaaaaaat    5880
ttcaacaaaa agcgtacttt acatatatat ttattagaca agaaaagcag attaaataga    5940
tatacattcg attaacgata agtaaaatgt aaaatcacag gattttcgtg tgtggtcttc    6000
tacacagaca agatgaaaca attcggcatt aatacctgag agcaggaaga gcaagataaa    6060
aggtagtatt tgttggcgat ccccctagag tcttttacat cttcggaaaa caaaaactat    6120
tttttcttta atttcttttt ttactttcta tttttaattt atatatttat attaaaaaat    6180
ttaaattata attattttta tagcacgtga tgaaaaggac ccaggtggca cttttcgggg    6240
aaatgtgcgc ggaaccccta tttgtttatt tttctaaata cattcaaata tgtatccgct    6300
catgagacaa taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgagtat    6360
tcaacatttc cgtgtcgccc ttattccctt ttttgcggca ttttgccttc ctgtttttgc    6420
tcacccagaa acgctggtga aagtaaaaga tgctgaagat cagttgggtg cacgagtggg    6480
ttacatcgaa ctggatctca acagcggtaa gatccttgag agttttcgcc ccgaagaacg    6540
ttttccaatg atgagcactt ttaaagttct gctatgtggc gcggtattat cccgtattga    6600
cgccgggcaa gagcaactcg gtcgccgcat acactattct cagaatgact tggttgagta    6660
ctcaccagtc acagaaaagc atcttacgga tggcatgaca gtaagagaat tatgcagtgc    6720
tgccataacc atgagtgata acactgcggc caacttactt ctgacaacga tcggaggacc    6780
gaaggagcta accgcttttt ttcacaacat gggggatcat gtaactcgcc ttgatcgttg    6840
ggaaccggag ctgaatgaag ccataccaaa cgacgagcgt gacaccacga tgcctgtagc    6900
aatggcaaca acgttgcgca aactattaac tggcgaacta cttactctag cttcccggca    6960
acaattaata gactggatgg aggcggataa agttgcagga ccacttctgc gctcggccct    7020
tccggctggc tggtttattg ctgataaatc tggagccggt gagcgtgggt ctcgcggtat    7080
cattgcagca ctggggccag atggtaagcc ctcccgtatc gtagttatct acacgacggg    7140
cagtcaggca actatggatg aacgaaatag acagatcgct gagataggtg cctcactgat    7200
taagcattgg taactgtcag accaagttta ctcatatata ctttagattg atttaaaact    7260
tcatttttaa tttaaaagga tctaggtgaa gatccttttt gataatctca tgaccaaaat    7320
cccttaacgt gagttttcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc    7380
ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct    7440
accagcggtg gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg    7500
cttcagcaga gcgcagatac caaatactgt ccttctagtg tagccgtagt taggccacca    7560
cttcaagaac tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc    7620
tgctgccagt ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga    7680
taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac    7740
gacctacacc gaactgagat acctacagcg tgagcattga gaaagcgcca cgcttcccga    7800
agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag    7860
ggagcttcca ggggggaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg    7920
acttgagcgt cgatttttgt gatgctcgtc aggggggccg agcctatgga aaaacgccag    7980
caacgcggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca tgttctttcc    8040
tgcgttatcc cctgattctg tggataaccg tattaccgcc tttgagtgag ctgataccgc    8100
tcgccgcagc cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg aagagcgccc    8160
aatacgcaaa ccgcctctcc ccgcgcgttg gccgattcat taatgcagct ggcacgacag    8220
gtttcccgac tggaaagcgg gcagtgagcg caacgcaatt aatgtgagtt acctcactca    8280
ttaggcaccc caggctttac actttatgct tccggctcct atgttgtgtg gaattgtgag    8340
cggataacaa tttcacacag gaaacagcta tgaccatgat tacgccaagc tcggaattaa    8400
ccctcactaa agggaacaaa agctgggtac cgggcccccc ctcgag                   8446



<210> 5
<211> 2622
<212> DNA
<213> Aspergillus nidulans

<400> 5
tttttcttat cggagcgata taaaaagctg aagaaaggag gatagatgaa acagcatggc      60
gcatagaaag tgttcaagct cactagtaaa ggcgggaaat agaacattga gaacgtattt     120
tgataggaaa cgaagataaa gcggccgcat aacttcgtat aatgtatgct atacgaagtt     180
atccttacat cacacccaat cccccacaag tgatccccca cacaccatag cttcaaaatg     240
tttctactcc ttttttactc ttccagattt tctcggactc cgcgcatcgc cgtaccactt     300
caaaacaccc aagcacagca tactaaattt cccctctttc ttcctctagg gtggcgttaa     360
ttacccgtac taaaggtttg gaaaagaaaa aagagaccgc ctcgtttctt tttcttcgtc     420
gaaaaaggca ataaaaattt ttatcacgtt tctttttctt gaaaaatttt ttttttgatt     480
tttttctctt tcgatgacct cccattgata tttaagttaa taaatggtct tcaatttctc     540
aagtttcagt ttcgtttttc ttgttctatt acaacttttt ttacttcttg ctcattagaa     600
agaaagcata gcaatctaat ctaagtttta attacaaaat gccacaatcc tgggaagaat     660
tggccgccga caaacgtgcc cgtttggcta aaaccattcc tgacgaatgg aaggttcaaa     720
ctttgcctgc cgaagattcc gttattgatt tcccaaagaa gtccggtatt ttgtctgagg     780
ctgaattgaa gattaccgaa gcctctgctg ctgatttggt ctccaagttg gccgctggtg     840
agttgacttc tgttgaagtc actttggctt tttgtaagag agctgctatt gctcaacaat     900
taaccaactg tgctcacgaa ttcttcccag atgctgcttt agctcaagct agagaattag     960
atgaatacta cgctaagcat aagagaccag ttggtccatt acacggttta ccaatctctt    1020
taaaggacca attgcgtgtt aagggttacg aaacctccat gggttacatt tcctggttaa    1080
acaaatacga tgaaggtgat tccgtcttaa ccaccatgtt gagaaaagct ggtgctgttt    1140
tctacgttaa gacctctgtc ccacaaacct tgatggtctg tgaaaccgtc aacaacatca    1200
ttggtagaac tgtcaatcca agaaacaaaa attggtcctg tggtggttct tctggtggtg    1260
aaggtgctat tgttggtatt agaggtggtg ttattggtgt cggtactgac attggtggtt    1320
ccattagagt cccagctgct ttcaactttt tatacggttt gagaccatct cacggtagat    1380
tgccatatgc taaaatggct aactctatgg aaggtcaaga aaccgttcac tccgtcgttg    1440
gtcctatcac tcactccgtc gaagacttga gattgttcac caaatctgtc ttgggtcaag    1500
aaccttggaa gtacgactct aaggtcatcc caatgccatg gagacaatct gaatctgaca    1560
tcattgcctc taagattaag aatggtggtt tgaacattgg ttattacaat ttcgacggta    1620
acgtcttgcc acacccacca attttacgtg gtgtcgaaac taccgttgcc gctttggcca    1680
aggctggtca caccgttact ccatggactc catacaagca tgatttcggt catgacttga    1740
tttcccacat ctatgctgct gatggttctg ccgacgtcat gagagacatt tctgcctctg    1800
gtgagccagc catccctaac attaaggact tgttgaaccc aaatattaag gctgttaaca    1860
tgaacgaatt gtgggacact catttacaaa agtggaacta tcaaatggaa tacttggaaa    1920
agtggcgtga agctgaagaa aaagctggta aggaattgga cgctattatc gctccaatta    1980
ctcctaccgc cgctgtcaga cacgatcaat tcagatacta cggttacgcc tccgttatta    2040
acttattgga tttcacctct gttgtcgtcc cagtcacttt cgctgataag aatattgata    2100
agaagaacga atcttttaaa gctgtttccg aattggatgc tttggttcaa gaagaatacg    2160
acccagaggc ttatcacggt gctcctgttg ctgttcaagt tattggtaga agattgtccg    2220
aagagagaac tttggctatc gccgaagaag tcggtaaatt gttgggtaac gtcgtcactc    2280
cataaggagg ttgataagac ttttctagtt gcatatcttt tatatttaaa tcttatctat    2340
tagttaattt tttgtaattt atccttatat atagtctggt tattctaaaa tatcatttca    2400
gtatctaaaa attcccctct tttttcagtt atatcttaac aggcgataac ttcgtataat    2460
gtatgctata cgaagttatg cggccgcgag aagatgcacc tagctaaact aagtaaatct    2520
gtatactttt tatacatgta agacttttta gctatttcat tcttccctga accgttttgc    2580
gcgattctac ggaatatacc ggcgaaataa aagagataaa ct                       2622

<210> 6
<211> 490
<212> PRT
<213> Paraburkholderia xenovorans

<400> 6
Met Thr Arg Thr Phe Glu Leu Gly Glu Leu Ile Arg Ser Asp Asn Phe
 1               5                  10                  15
Ile Asp Gly Ala Trp Thr Pro Ala Gln Asp Asn Leu Arg Phe Ala Val
            20                  25                  30
Thr Asn Pro Ala Ser Gly Glu Ile Ile Ala Glu Val Ala Asp Ser Ser
        35                  40                  45
Pro Ala Asp Ala Arg Ala Ala Thr Asp Ala Ala Ala Arg Ala Leu Pro
    50                  55                  60
Ala Trp Arg Ala Arg Leu Pro Lys Glu Arg Ala Ala Val Leu His Arg
65                  70                  75                  80
Trp His Ala Leu Ile Met Ala Asn Leu Asp Ala Leu Gly Ala Leu Ile
                85                  90                  95
Ser Leu Glu Gln Gly Lys Pro Leu Ala Glu Gly Lys Gly Glu Val Ala
            100                 105                 110
Tyr Gly Ala Ser Tyr Val Ala Trp Phe Ala Glu Glu Ala Thr Arg Ile
        115                 120                 125
Tyr Gly Asp Leu Ile Pro Gln Gln Gln Arg Gly Lys Arg Met Thr Ala
    130                 135                 140
Val Lys Glu Pro Val Gly Val Val Ala Ala Ile Thr Pro Trp Asn Phe
145                 150                 155                 160
Pro Leu Ala Met Ile Ala Arg Lys Ile Ala Pro Ala Leu Ala Ala Gly
                165                 170                 175
Cys Thr Val Val Ala Lys Pro Ala Glu Asp Thr Pro Leu Thr Ala Ser
            180                 185                 190
Ala Leu Val Leu Leu Ala His Glu Ala Gly Val Pro Pro Gly Val Leu
        195                 200                 205
Asn Leu Ile Thr Ala Ser Arg Asp His Ala Val Ala Ala Val Ala Glu
    210                 215                 220
Trp Leu His Asp Ala Arg Val Arg Lys Ile Thr Phe Thr Gly Ser Thr
225                 230                 235                 240
Pro Val Gly Lys Tyr Leu Ala Arg Glu Ser Ala Glu Thr Leu Lys Lys
                245                 250                 255
Leu Ser Leu Glu Leu Gly Gly Asn Ala Pro Phe Ile Val Phe Asp Asp
            260                 265                 270
Ala Asp Leu Glu Ala Ala Val Ala Gly Leu Met Ala Ala Lys Phe Arg
        275                 280                 285
Asn Gly Gly Gln Thr Cys Val Cys Pro Asn Arg Val Tyr Val Gln Ala
    290                 295                 300
Gly Val Tyr Glu Arg Phe Gly Ala Leu Leu Ala Glu Arg Val Gly Ala
305                 310                 315                 320
Leu Lys Val Ala Pro Ala Thr Asp Pro Ala Ala Gln Ile Gly Pro Met
                325                 330                 335
Ile Asn Ser Arg Ala Leu Asp Lys Ile Ala Arg His Val Asp Asp Ala
            340                 345                 350
Val Ala His Gly Ala Arg Val Leu Thr Gly Gly Lys Arg Leu Ala Glu
        355                 360                 365
Leu Gly Pro His Tyr Tyr Ala Pro Thr Val Leu Ala Asp Ala Thr Ala
    370                 375                 380
Ala Met Gln Leu Asn Ser Glu Glu Thr Phe Gly Pro Ile Val Pro Leu
385                 390                 395                 400
Phe Arg Phe Glu Asp Glu Ala Glu Ala Val Asn Ala Ala Asn Asp Thr
                405                 410                 415
Pro Phe Gly Leu Ala Ala Tyr Phe Tyr Ser Glu Gly Val Lys Arg Ile
            420                 425                 430
Asp Arg Val Ala Arg Ala Leu Glu Ala Gly Ile Val Gly Ile Asn Glu
        435                 440                 445
Gly Ala Val Ala Ser Glu Ala Ala Pro Phe Gly Gly Val Lys Glu Ser
    450                 455                 460
Gly Tyr Gly Arg Glu Gly Ser Lys Tyr Gly Leu Asp Asp Tyr Leu Ser
465                 470                 475                 480
Ile Lys Tyr Leu Cys Gln Gly Asn Leu Glu
                485                 490

<210> 7
<211> 7216
<212> DNA
<213> Artificial Sequence

<220> 
<223> A synthetic oligonucleotide

<400> 7
gtttattttt ctaaatacat tcaaatatgt atccgctcat gagacaataa ccctgataaa      60
tgcttcaata atattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta     120
ttcccttttt tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag     180
taaaagatgc tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca     240
gcggtaagat ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta     300
aagttctgct atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc     360
gccgcataca ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc     420
ttacggatgg catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca     480
ctgcggccaa cttacttctg acaacgatcg gaggaccgaa ggagctaacc gctttttttc     540
acaacatggg ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca     600
taccaaacga cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac     660
tattaactgg cgaactactt actctagctt cccggcaaca attaatagac tggatggagg     720
cggataaagt tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg     780
ataaatctgg agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg     840
gtaagccctc ccgtatcgta gttatctaca cgacgggcag tcaggcaact atggatgaac     900
gaaatagaca gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc     960
aagtttactc atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct    1020
aggtgaagat cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc    1080
actgagcgtc agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc    1140
gcgtaatctg ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg    1200
atcaagagct accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa    1260
atactgtcct tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc    1320
ctacatacct cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt    1380
gtcttaccgg gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa    1440
cggggggttc gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc    1500
tacagcgtga gcattgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc    1560
cggtaagcgg cagggtcgga acaggagagc gcacgaggga gcttccaggg gggaacgcct    1620
ggtatcttta tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat    1680
gctcgtcagg ggggccgagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc    1740
tggccttttg ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg    1800
ataaccgtat taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc    1860
gcagcgagtc agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg    1920
cgcgttggcc gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca    1980
gtgagcgcaa cgcaattaat gtgagttacc tcactcatta ggcaccccag gctttacact    2040
ttatgcttcc ggctcctatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa    2100
acagctatga ccatgattac gccaagctcg gaattaaccc tcactaaagg gaacaaaagc    2160
tgggtaccgg gccccccctc gagatctccc gagtttatca ttatcaatac tgccatttca    2220
aagaatacgt aaataattaa tagtagtgat tttcctaact ttatttagtc aaaaaattgg    2280
ccttttaatt ctgctgtaac ccgtacatgc ccaaaatagg gggcgggtta cacagaatat    2340
ataacatcat aggtgtctgg gtgaacagtt tattcctggc atccactaaa tataatggag    2400
cccgcttttt ttaagctggc atccagaaaa aaaaagaatc ccagcaccaa aatattgttt    2460
tcttcaccaa ccatcagttc ataggtccat tctcttagcg caactacaca gaacaggggc    2520
acaaacaggc aaaaaacggg cacaacctca atggagtgat gcaacctgct tggagtaaat    2580
gatgacacaa ggcaattgac ctacgcatgt atctatctca ttttcttaca ccttctatta    2640
ccttctgctc tctctgattt ggaaaaagct gaaaaaaaag gttgaaacca gttccctgaa    2700
attattcccc tatttgacta ataagtatat aaagacggta ggtattgatt gtaattctgt    2760
aaatctattt cttaaacttc ttaaattcta cttttatagt tagtcttttt tttagtttta    2820
aaacactaag aacttagttt cgaataaaca cacataaaaa caaatctaga atgactagaa    2880
cttttgaatt gggtgaattg attcgttctg ataacttcat tgatggtgct tggactccag    2940
cacaagacaa cttgaggttc gctgtcacta acccagcttc tggagagata attgctgagg    3000
tcgctgactc ttctccagct gatgcaagag ctgctactga tgcagctgca agagctttgc    3060
cagcttggag ggctagattg ccaaaagaga gagctgcagt cttgcatcgt tggcacgctt    3120
tgataatggc taacttggat gcattgggtg ctttaatatc tttggagcaa ggtaaacctt    3180
tggctgaggg taagggtgag gtcgcttatg gtgcttctta cgtcgcatgg ttcgcagagg    3240
aagcaacaag aatttacggt gatttgattc ctcagcaaca gaggggtaag aggatgactg    3300
ctgtcaaaga gccagtcgga gtcgttgctg ctattacacc atggaattgg ccattggcaa    3360
tgattgcaag aaagatagca cccgctttgg cagctggttg tactgttgtc gctaagccag    3420
ctgaggacac tccattgact gcttcagctt tggtcttgtt ggctcacgaa gctggtgttc    3480
cacccggtgt tttgaacttg attactgcat cccgtgatca tgctgtcgca gcagtcgctg    3540
agtggttgca tgacgctaga gttagaaaaa ttacttttac tggatcaact ccagtcggta    3600
agtacttggc tagggaatct gctgaaactt taaagaagtt atctttggag ttgggtggta    3660
atgctccatt tattgttttt gacgatgctg acttggaggc tgcagtcgct ggtttgatgg    3720
ctgctaagtt tcgtaacggt ggtcagactt gtgtctgtcc aaatcgtgtc tacgtccaag    3780
ctggtgtcta cgagaggttt ggtgctttgt tggctgaaag ggttggtgct ttgaaggttg    3840
ctccagcaac tgatccagct gctcaaattg gtccaatgat taattctagg gcattggaca    3900
agattgctag gcacgtcgat gacgctgttg cacatggtgc tagagtcttg actggtggta    3960
agaggttggc agagttgggt ccacactact acgctccaac tgttttggct gacgcaacag    4020
cagcaatgca gttgaactca gaggaaactt tcggtccaat agtcccattg tttcgtttcg    4080
aggacgaggc tgaagcagtt aacgctgcta acgacactcc attcggttta gcagcttatt    4140
tttattctga aggtgtcaaa agaattgata gggtcgctag ggctttggaa gctggtattg    4200
ttggtataaa tgaaggtgca gtcgcttcag aagctgctcc attcggtggt gttaaggaat    4260
ctggttacgg tagagaaggt tctaaatatg gtttggatga ttatttgtct attaaatatt    4320
tatgtcaagg taatttagaa taattaatta aacaggcccc ttttcctttg tcgatatcat    4380
gtaattagtt atgtcacgct tacattcacg ccctcctccc acatccgctc taaccgaaaa    4440
ggaaggagtt agacaacctg aagtctaggt ccctatttat ttttttatag ttatgttagt    4500
attaagaacg ttatttatat ttcaaatttt tctttttttt ctgtacaaac gcgtgtacgc    4560
atgtaacggg cagacgcggc cgccaccgcg gtggagctcc aattcgccct atagtgagtc    4620
gtattacaat tcactggccg tcgttttaca acgtcgtgac tgggaaaacc ctggcgttac    4680
ccaacttaat cgccttgcag cacatccccc cttcgccagc tggcgtaata gcgaagaggc    4740
ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgaatggc gcgacgcgcc    4800
ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga ccgctacact    4860
tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg ccacgttcgc    4920
cggctttccc cgtcaagctc taaatcgggg gctcccttta gggttccgat ttagtgcttt    4980
acggcacctc gaccccaaaa aacttgatta gggtgatggt tcacgtagtg ggccatcgcc    5040
ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg ttctttaata gtggactctt    5100
gttccaaact ggaacaacac tcaaccctat ctcggtctat tcttttgatt tataagggat    5160
tttgccgatt tcggcctatt ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa    5220
ttttaacaaa atattaacgt ttacaatttc ctgatgcggt attttctcct tacgcatctg    5280
tgcggtattt cacaccgcag ggtaataact gatataatta aattgaagct ctaatttgtg    5340
agtttagtat acatgcattt acttataata cagtttttta gttttgctgg ccgcatcttc    5400
tcaaatatgc ttcccagcct gcttttctgt aacgttcacc ctctacctta gcatcccttc    5460
cctttgcaaa tagtcctctt ccaacaataa taatgtcaga tcctgtagag accacatcat    5520
ccacggttct atactgttga cccaatgcgt ctcccttgtc atctaaaccc acaccgggtg    5580
tcataatcaa ccaatcgtaa ccttcatctc ttccacccat gtctctttga gcaataaagc    5640
cgataacaaa atctttgtcg ctcttcgcaa tgtcaacagt acccttagta tattctccag    5700
tagataggga gcccttgcat gacaattctg ctaacatcaa aaggcctcta ggttcctttg    5760
ttacttcttc tgccgcctgc ttcaaaccgc taacaatacc tgggcccacc acaccgtgtg    5820
cattcgtaat gtctgcccat tctgctattc tgtatacacc cgcagagtac tgcaatttga    5880
ctgtattacc aatgtcagca aattttctgt cttcgaagag taaaaaattg tacttggcgg    5940
ataatgcctt tagcggctta actgtgccct ccatggaaaa atcagtcaag atatccacat    6000
gtgtttttag taaacaaatt ttgggaccta atgcttcaac taactccagt aattccttgg    6060
tggtacgaac atccaatgaa gcacacaagt ttgtttgctt ttcgtgcatg atattaaata    6120
gcttggcagc aacaggacta ggatgagtag cagcacgttc cttatatgta gctttcgaca    6180
tgatttatct tcgtttcctg caggtttttg ttctgtgcag ttgggttaag aatactgggc    6240
aatttcatgt ttcttcaaca ctacatatgc gtatatatac caatctaagt ctgtgctcct    6300
tccttcgttc ttccttctgt tcggagatta ccgaatcaaa aaaatttcaa agaaaccgaa    6360
atcaaaaaaa agaataaaaa aaaaatgatg aattgaattg aaaagcgtgg tgcactctca    6420
gtacaatctg ctctgatgcc gcatagttaa gccagccccg acacccgcca acacccgctg    6480
acgcgccctg acgggcttgt ctgctcccgg catccgctta cagacaagct gtgaccgtct    6540
ccgggagctg catgtgtcag aggttttcac cgtcatcacc gaaacgcgcg agacgaaagg    6600
gcctcgtgat acgcctattt ttataggtta atgtcatgat aataatggtt tcttaggacg    6660
gatcgcttgc ctgtaactta cacgcgcctc gtatctttta atgatggaat aatttgggaa    6720
tttactctgt gtttatttat ttttatgttt tgtatttgga ttttagaaag taaataaaga    6780
aggtagaaga gttacggaat gaagaaaaaa aaataaacaa aggtttaaaa aatttcaaca    6840
aaaagcgtac tttacatata tatttattag acaagaaaag cagattaaat agatatacat    6900
tcgattaacg ataagtaaaa tgtaaaatca caggattttc gtgtgtggtc ttctacacag    6960
acaagatgaa acaattcggc attaatacct gagagcagga agagcaagat aaaaggtagt    7020
atttgttggc gatcccccta gagtctttta catcttcgga aaacaaaaac tattttttct    7080
ttaatttctt tttttacttt ctatttttaa tttatatatt tatattaaaa aatttaaatt    7140
ataattattt ttatagcacg tgatgaaaag gacccaggtg gcacttttcg gggaaatgtg    7200
cgcggaaccc ctattt                                                    7216



<210> 8
<211> 490
<212> PRT
<213> Artificial Sequence

<220> 
<223> A synthetic polypeptide

<400> 8
Met Thr Arg Thr Phe Glu Leu Gly Glu Leu Ile Arg Ser Asp Asn Phe
 1               5                  10                  15
Ile Asp Gly Ala Trp Thr Pro Ala Gln Asp Asn Leu Arg Phe Ala Val
            20                  25                  30
Thr Asn Pro Ala Ser Gly Glu Ile Ile Ala Glu Val Ala Asp Ser Ser
        35                  40                  45
Pro Ala Asp Ala Arg Ala Ala Thr Asp Ala Ala Ala Arg Ala Leu Pro
    50                  55                  60
Ala Trp Arg Ala Arg Leu Pro Lys Glu Arg Ala Ala Val Leu His Arg
65                  70                  75                  80
Trp His Ala Leu Ile Met Ala Asn Leu Asp Ala Leu Gly Ala Leu Ile
                85                  90                  95
Ser Leu Glu Gln Gly Lys Pro Leu Ala Glu Gly Lys Gly Glu Val Ala
            100                 105                 110
Tyr Gly Ala Ser Tyr Val Ala Trp Phe Ala Glu Glu Ala Thr Arg Ile
        115                 120                 125
Tyr Gly Asp Leu Ile Pro Gln Gln Gln Arg Gly Lys Arg Met Thr Ala
    130                 135                 140
Val Lys Glu Pro Val Gly Val Val Ala Ala Ile Thr Pro Trp Asn Trp
145                 150                 155                 160
Pro Leu Ala Met Ile Ala Arg Lys Ile Ala Pro Ala Leu Ala Ala Gly
                165                 170                 175
Cys Thr Val Val Ala Lys Pro Ala Glu Asp Thr Pro Leu Thr Ala Ser
            180                 185                 190
Ala Leu Val Leu Leu Ala His Glu Ala Gly Val Pro Pro Gly Val Leu
        195                 200                 205
Asn Leu Ile Thr Ala Ser Arg Asp His Ala Val Ala Ala Val Ala Glu
    210                 215                 220
Trp Leu His Asp Ala Arg Val Arg Lys Ile Thr Phe Thr Gly Ser Thr
225                 230                 235                 240
Pro Val Gly Lys Tyr Leu Ala Arg Glu Ser Ala Glu Thr Leu Lys Lys
                245                 250                 255
Leu Ser Leu Glu Leu Gly Gly Asn Ala Pro Phe Ile Val Phe Asp Asp
            260                 265                 270
Ala Asp Leu Glu Ala Ala Val Ala Gly Leu Met Ala Ala Lys Phe Arg
        275                 280                 285
Asn Gly Gly Gln Thr Cys Val Cys Pro Asn Arg Val Tyr Val Gln Ala
    290                 295                 300
Gly Val Tyr Glu Arg Phe Gly Ala Leu Leu Ala Glu Arg Val Gly Ala
305                 310                 315                 320
Leu Lys Val Ala Pro Ala Thr Asp Pro Ala Ala Gln Ile Gly Pro Met
                325                 330                 335
Ile Asn Ser Arg Ala Leu Asp Lys Ile Ala Arg His Val Asp Asp Ala
            340                 345                 350
Val Ala His Gly Ala Arg Val Leu Thr Gly Gly Lys Arg Leu Ala Glu
        355                 360                 365
Leu Gly Pro His Tyr Tyr Ala Pro Thr Val Leu Ala Asp Ala Thr Ala
    370                 375                 380
Ala Met Gln Leu Asn Ser Glu Glu Thr Phe Gly Pro Ile Val Pro Leu
385                 390                 395                 400
Phe Arg Phe Glu Asp Glu Ala Glu Ala Val Asn Ala Ala Asn Asp Thr
                405                 410                 415
Pro Phe Gly Leu Ala Ala Tyr Phe Tyr Ser Glu Gly Val Lys Arg Ile
            420                 425                 430
Asp Arg Val Ala Arg Ala Leu Glu Ala Gly Ile Val Gly Ile Asn Glu
        435                 440                 445
Gly Ala Val Ala Ser Glu Ala Ala Pro Phe Gly Gly Val Lys Glu Ser
    450                 455                 460
Gly Tyr Gly Arg Glu Gly Ser Lys Tyr Gly Leu Asp Asp Tyr Leu Ser
465                 470                 475                 480
Ile Lys Tyr Leu Cys Gln Gly Asn Leu Glu
                485                 490

<210> 9
<211> 2319
<212> DNA
<213> Artificial Sequence

<220> 
<223> A synthetic oligonucleotide

<400> 9
ctagcgtacc gtatcacagt atagtctaat attccgtatc ttattgtatc ctatcctatt      60
cgatcctatt gtatttcagt gcaccatttt aatttctatt gctataatgt ccttattagt     120
tgccactgtg aggtgaccaa tggacgaggg cgagccgttc agaagccgcg aagggtgttc     180
ttcccatgaa tttcttaagg agggcggctc agctccgaga gtgaggcgag acgtctcggt     240
cagcgtatcc cccttcctcg gcttttacaa atgatgcgct cttaatagtg tgtcgttatc     300
cttttggcat tgacggggga gggaaattga ttgagcgcat ccatattttt gcggactgct     360
gaggacaatg gtggtttttc cgggtggcgt gggctacaaa tgatacgatg gtttttttct     420
tttcggagaa ggcgtataaa aaggacacgg agaacccatt tattctaaaa acagttgagc     480
ttctttaatt attttttgat ataatattct attattatat attttcttcc caataaaaca     540
aaataaaaca aaacacagca aaacacaaaa gaattcgccc ttacatatga taacttcgta     600
taatgtatgc tatacgaagt tatcatagcc tcatgaaatc agccatttgc ttttgttcaa     660
cgatcttttg aaattgttgt tgttcttggt agttaagttg atccatcttg gcttatgttg     720
tgtgtatgtt gtagttattc ttagtatatt cctgtcctga gtttagtgaa acataatatc     780
gccttgaaat gaaaatgctg aaattcgtcg acatacaatt tttcaaactt tttttttttc     840
ttggtgcacg gacatgtttt taaaggaagt actctatacc agttattctt cacaaattta     900
attgctggag aatagatctt caacgcttta ataaagtagt ttgtttgtca aggatggcgt     960
catacaaaga aagatcagaa tcacacactt cccctgttgc taggagactt ttctccatca    1020
tggaggaaaa gaagtctaac ctttgtgcat cattggatat tactgaaact gaaaagcttc    1080
tctctatttt ggacactatt ggtccttaca tctgtctagt taaaacacac atcgatattg    1140
tttctgattt tacgtatgaa ggaactgtgt tgcctttgaa ggagcttgcc aagaaacata    1200
attttatgat ttttgaagat agaaaatttg ctgatattgg taacactgtt aaaaatcaat    1260
ataaatctgg tgtcttccgt attgccgaat gggctgacat cactaatgca catggtgtaa    1320
cgggtgcagg tattgtttct ggcttgaagg aggcagccca agaaacaacc agtgaaccta    1380
gaggtttgct aatgcttgct gagttatcat caaagggttc tttagcatat ggtgaatata    1440
cagaaaaaac agtagaaatt gctaaatctg ataaagagtt tgtcattggt tttattgcgc    1500
aacacgatat gggcggtaga gaagaaggtt ttgactggat cattatgact ccaggggttg    1560
gtttagatga caaaggtgat gcacttggtc aacaatatag aactgttgat gaagttgtaa    1620
agactggaac ggatatcata attgttggta gaggtttgta cggtcaagga agagatccta    1680
tagagcaagc taaaagatac caacaagctg gttggaatgc ttatttaaac agatttaaat    1740
gattcttaca caaagatttg atacatgtac actagtttaa ataagcatga aaagaattac    1800
acaagcaaaa aaaaaaaaat aaatgaggta ctttacgttc acctacaacc aaaaaaacta    1860
gatagagtaa aatcttaaga tttagaaaaa gttgtttaac aaaggcttta gtatgtgaat    1920
ttttaatgta gcaaagcgat aactaataaa cataaacaaa agtatggttt tctttatcag    1980
tcaaatcatt atcgattgat tgttccgcgt atctgcagat aacttcgtat aatgtatgct    2040
atacgaagtt atagatccgc ggccgcttaa catctgaatg taaaatgaac attaaaatga    2100
attactaaac tttacgtcta ctttacaatc tataaacttt gtttaatcat ataacgaaat    2160
acactaatac acaatcctgt acgtatgtaa tacttttatc catcaaggat tgagaaaaaa    2220
aagtaatgat tccctgggcc attaaaactt agacccccaa gcttggatag gtcactctct    2280
attttcgttt ctcccttccc tgatagaagg gtgagagct                           2319

<210> 10
<211> 3252
<212> DNA
<213> Artificial Sequence

<220> 
<223> A synthetic oligonucleotide

<400> 10
ctagcgtacc gtatcacagt atagtctaat attccgtatc ttattgtatc ctatcctatt      60
cgatcctatt gtatttcagt gcaccatttt aatttctatt gctataatgt ccttattagt     120
tgccactgtg aggtgaccaa tggacgaggg cgagccgttc agaagccgcg aagggtgttc     180
ttcccatgaa tttcttaagg agggcggctc agctccgaga gtgaggcgag acgtctcggt     240
cagcgtatcc cccttcctcg gcttttacaa atgatgcgct cttaatagtg tgtcgttatc     300
cttttggcat tgacggggga gggaaattga ttgagcgcat ccatattttt gcggactgct     360
gaggacaatg gtggtttttc cgggtggcgt gggctacaaa tgatacgatg gtttttttct     420
tttcggagaa ggcgtataaa aaggacacgg agaacccatt tattctaaaa acagttgagc     480
ttctttaatt attttttgat ataatattct attattatat attttcttcc caataaaaca     540
aaataaaaca aaacacagca aaacacaaaa gaattcgccc ttacatatgg ataacttcgt     600
ataatgtatg ctatacgaag ttatgctgca acggcaacat caatgtccac gtttacacac     660
ctacatttat atctatattt atatttatat ttatttattt atgctactta gcttctatag     720
ttagttaatg cactcacgat attcaaaatt gacacccttc aactactccc tactattgtc     780
tactactgtc tactactcct ctttactata gctgctccca ataggctcca ccaataggct     840
ctgtcaatac attttgcgcc gccacctttc aggttgtgtc actcctgaag gaccatattg     900
ggtaatcgtg caatttctgg aagagagtgc cgcgagaagt gaggccccca ctgtaaatcc     960
tcgagggggc atggagtatg gggcatggag gatggaggat gggggggggg ggggaaaata    1020
ggtagcgaaa ggacccgcta tcaccccacc cggagaactc gttgccggga agtcatattt    1080
cgacactccg gggagtctat aaaaggcggg ttttgtcttt tgccagttga tgttgctgag    1140
aggacttgtt tgccgtttct tccgatttaa cagtatagaa tcaaccactg ttaattatac    1200
acgttatact aacacaacaa aaacaaaaac aacgacaaca acaacaacaa tgtttgcttt    1260
ctactttctc accgcatgca ccactttgaa gggtgttttc ggagtttctc cgagttacaa    1320
tggtcttggt ctcaccccac agatgggttg ggacagctgg aatacgtttg cctgcgatgt    1380
cagtgaacag ctacttctag acactgctga tagaatttct gacttggggc taaaggatat    1440
gggttacaag tatgtcatcc tagatgactg ttggtctagc ggcagggatt ccgacggttt    1500
cctcgttgca gacaagcaca aatttcccaa cggtatgggc catgttgcag accacctgca    1560
taataacagc tttcttttcg gtatgtattc gtctgctggt gagtacacct gtgctgggta    1620
ccctgggtct ctggggcgtg aggaagaaga tgctcaattc tttgcaaata accgcgttga    1680
ctacttgaag tatgataatt gttacaataa aggtcaattt ggtacaccag acgtttctta    1740
ccaccgttac aaggccatgt cagatgcttt gaataaaact ggtaggccta ttttctattc    1800
tctatgtaac tggggtcagg atttgacatt ttactggggc tctggtatcg ccaattcttg    1860
gagaatgagc ggagatatta ctgctgagtt cacccgtcca gatagcagat gtccctgtga    1920
cggtgacgaa tatgattgca agtacgccgg tttccattgt tctattatga atattcttaa    1980
caaggcagct ccaatggggc aaaatgcagg tgttggtggt tggaacgatc tggacaatct    2040
agaggtcgga gtcggtaatt tgactgacga tgaggaaaag gcccatttct ctatgtgggc    2100
aatggtaaag tccccactta tcattggtgc cgacgtgaat cacttaaagg catcttcgta    2160
ctcgatctac agtcaagcct ctgtcatcgc aattaatcaa gatccaaagg gtattccagc    2220
cacaagagtc tggagatatt atgtttcaga caccgatgaa tatggacaag gtgaaattca    2280
aatgtggagt ggtccgcttg acaatggtga ccaagtggtt gctttattga atggaggaag    2340
cgtagcaaga ccaatgaaca cgaccttgga agagattttc tttgacagca atttgggttc    2400
aaaggaactg acatcgactt gggatattta cgacttatgg gccaacagag ttgacaactc    2460
tacggcgtct gctatccttg aacagaataa ggcagccacc ggtattctct acaatgctac    2520
agagcagtct tataaagacg gtttgtctaa gaatgataca agactgtttg gccagaaaat    2580
tggtagtctt tctccaaatg ctatacttaa cacaactgtt ccagctcatg gtatcgcctt    2640
ctataggttg agaccctcgg cttaagctca atgttgagca aagcaggacg agaaaaaaaa    2700
aaataatgat tgttaagaag ttcatgaaaa aaaaaaggaa aaatactcaa atacttataa    2760
cagagtgatt aaataataaa cggcagtata ccctatcagg tattgagata gttttatttt    2820
tgtaggtata taatctgaag cctttgaact attttctcgt atatatcatg gagtatacat    2880
tgcattagca acattgcata ctagttcata acttcgtata atgtatgcta tacgaagtta    2940
ttaattaaca agggcgattt ctgcagatat cggccggccc catggagatc cgcggccgct    3000
taacatctga atgtaaaatg aacattaaaa tgaattacta aactttacgt ctactttaca    3060
atctataaac tttgtttaat catataacga aatacactaa tacacaatcc tgtacgtatg    3120
taatactttt atccatcaag gattgagaaa aaaaagtaat gattccctgg gccattaaaa    3180
cttagacccc caagcttgga taggtcactc tctattttcg tttctccctt ccctgataga    3240
agggtgagag ct                                                        3252

<210> 11
<211> 9629
<212> DNA
<213> Artificial Sequence

<220> 
<223> A synthetic oligonucleotide

<400> 11
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc      60
attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga     120
gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc     180
caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc     240
ctaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag     300
cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa     360
agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac     420
cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg     480
caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg     540
gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg     600
taaaacgacg gccagtgagc gcgcgtaata cgactcacta tagggcgaat tggtcgagga     660
gtccatcggt tcctgtcaga tgggatactc ttgacgtgga aaattcaaac agaaaaaaaa     720
ccccaataat gaaaaataac actacgttat atccgtggta tcctctatcg tatcgtatcg     780
tagcgtatcg tagcgtaccg tatcacagta tagtctaata ttccgtatct tattgtatcc     840
tatcctattc gatcctattg tatttcagtg caccatttta atttctattg ctataatgtc     900
cttattagtt gccactgtga ggtgaccaat ggacgagggc gagccgttca gaagccgcga     960
agggtgttct tcccatgaat ttcttaagga gggcggctca gctccgagag tgaggcgaga    1020
cgtctcggtc agcgtatccc ccttcctcgg cttttacaaa tgatgcgctc ttaatagtgt    1080
gtcgttatcc ttttggcatt gacgggggag ggaaattgat tgagcgcatc catatttttg    1140
cggactgctg aggacaatgg tggtttttcc gggtggcgtg ggctacaaat gatacgatgg    1200
tttttttctt ttcggagaag gcgtataaaa aggacacgga gaacccattt attctaaaaa    1260
cagttgagct tctttaatta ttttttgata taatattcta ttattatata ttttcttccc    1320
aataaaacaa aataaaacaa aacacagcaa aacacaaaaa ggatccatgt ctaatttact    1380
tactgttcac caaaacttgc ctgcattacc agttgacgca acctccgatg aagtcagaaa    1440
gaaccttatg gatatgttta gagatagaca agctttctcc gaacatactt ggaaaatgtt    1500
attatccgtt tgtagatcct gggccgcttg gtgtaaactt aacaatagaa aatggtttcc    1560
tgctgaacca gaagacgtca gagattactt actttactta caagctagag gtttggctgt    1620
taaaactatc caacaacact taggtcaatt gaatatgtta cacagaagat ccggtttacc    1680
aagaccatcc gattccaacg cagtttccct tgttatgaga agaattagaa aagaaaatgt    1740
tgacgctggt gaaagagcta aacaagcatt agcatttgaa agaaccgatt tcgatcaagt    1800
tagatcctta atggaaaatt ccgatagatg tcaagatatt agaaacttag ctttcttagg    1860
tattgcttac aacacattat taagaatcgc tgaaattgct agaattagag ttaaagatat    1920
ttcaagaacc gatggcggta gaatgttaat ccacattggc agaacaaaaa ccttagtctc    1980
cacagcaggc gtcgaaaaag cattatcatt aggtgttact aaattagttg aacgttggat    2040
ttccgtttcc ggtgttgcag atgacccaaa caactactta ttctgtcgtg ttagaaaaaa    2100
tggtgttgcc gctccttccg ctacctcaca attatccaca agagcattag aaggcatttt    2160
tgaagctacc cacagactta tttatggtgc aaaagacgat tccggtcaaa gatatttagc    2220
ttggtctggt cattccgcta gagttggtgc cgcaagagac atggcaagag ctggtgtttc    2280
tattcctgaa attatgcaag ccggtggttg gactaatgtt aacattgtta tgaactatat    2340
cagaaactta gattccgaaa caggtgctat ggttagatta cttgaagacg gtgattaagt    2400
taattaacat ctgaatgtaa aatgaacatt aaaatgaatt actaaacttt acgtctactt    2460
tacaatctat aaactttgtt taatcatata acgaaataca ctaatacaca atcctgtacg    2520
tatgtaatac ttttatccat caaggattga gaaaaaaaag taatgattcc ctgggccatt    2580
aaaacttaga cccccaagct tggataggtc actctctatt ttcgtttctc ccttccctga    2640
tagaagggtg atatgtaatt aagaataata tataatttta taataaaaga attcggcaga    2700
tctggatcga tcccccgggc tgcatgcaac ggcaacatca atgtccacgt ttacacacct    2760
acatttatat ctatatttat atttatattt atttatttat gctacttagc ttctatagtt    2820
agttaatgca ctcacgatat tcaaaattga cacccttcaa ctactcccta ctattgtcta    2880
ctactgtcta ctactcctct ttactatagc tgctcccaat aggctccacc aataggctct    2940
gtcaatacat tttgcgccgc cacctttcag gttgtgtcac tcctgaagga ccatattggg    3000
taatcgtgca atttctggaa gagagtgccg cgagaagtga ggcccccact gtaaatcctc    3060
gagggggcat ggagtatggg gcatgaagga tggaggatgg gggggggggg ggaaaatagg    3120
tagcgaaagg acccgctatc accccacccg gagaactcgt tgccgggaag tcatatttcg    3180
acactccggg gagtctataa aaggcgggtt ttgtcttttg ccagttgatg ttgctgagag    3240
gacttgtttg ccgtttcttc cgatttaaca gtatagaatc aaccactgtt aattatacac    3300
gttatactaa cacaacaaaa acaaaaacaa cgacaacaac aacaacctgc aggaaatgct    3360
tttgcaagct ttccttttcc ttttggctgg ttttgcagcc aaaatatctg catcaatgac    3420
aaacgaaact agcgatagac ctttggtcca cttcacaccc aacaagggct ggatgaatga    3480
cccaaatggg ttgtggtacg atgaaaaaga tgccaaatgg catctgtact ttcaatacaa    3540
cccaaatgac accgtatggg gtacgccatt gttttggggc catgctactt ccgatgattt    3600
gactaattgg gaagatcaac ccattgctat cgctcccaag cgtaacgatt caggtgcttt    3660
ctctggctcc atggtggttg attacaacaa cacgagtggg tttttcaatg atactattga    3720
tccaagacaa agatgcgttg cgatttggac ttataacact cctgaaagtg aagagcaata    3780
cattagctat tctcttgatg gtggttacac ttttactgaa taccaaaaga accctgtttt    3840
agctgccaac tccactcaat tcagagatcc aaaggtgttc tggtatgaac cttctcaaaa    3900
atggattatg acggctgcca aatcacaaga ctacaaaatt gaaatttact cctctgatga    3960
cttgaagtcc tggaagctag aatctgcatt tgccaatgaa ggtttcttag gctaccaata    4020
cgaatgtcca ggtttgattg aagtcccaac tgagcaagat ccttccaaat cttattgggt    4080
catgtttatt tctatcaacc caggtgcacc tgctggcggt tccttcaacc aatattttgt    4140
tggatccttc aatggtactc attttgaagc gtttgacaat caatctagag tggtagattt    4200
tggtaaggac tactatgcct tgcaaacttt cttcaacact gacccaacct acggttcagc    4260
attaggtatt gcctgggctt caaactggga gtacagtgcc tttgtcccaa ctaacccatg    4320
gagatcatcc atgtctttgg tccgcaagtt ttctttgaac actgaatatc aagctaatcc    4380
agagactgaa ttgatcaatt tgaaagccga accaatattg aacattagta atgctggtcc    4440
ctggtctcgt tttgctacta acacaactct aactaaggcc aattcttaca atgtcgattt    4500
gagcaactcg actggtaccc tagagtttga gttggtttac gctgttaaca ccacacaaac    4560
catatccaaa tccgtctttg ccgacttatc actttggttc aagggtttag aagatcctga    4620
agaatatttg agaatgggtt ttgaagtcag tgcttcttcc ttctttttgg accgtggtaa    4680
ctctaaggtc aagtttgtca aggagaaccc atatttcaca aacagaatgt ctgtcaacaa    4740
ccaaccattc aagtctgaga acgacctaag ttactataaa gtgtacggcc tactggatca    4800
aaacatcttg gaattgtact tcaacgatgg agatgtggtt tctacaaata cctacttcat    4860
gaccaccggt aacgctctag gatctgtgaa catgaccact ggtgtcgata atttgttcta    4920
cattgacaag ttccaagtaa gggaagtaaa atagcctgca ggcacgtccg acggcggccc    4980
acgggtccca ggcctcggag atccgtcccc cttttccttt gtcgatatca tgtaattagt    5040
tatgtcacgc ttacattcac gccctccccc cacatccgct ctaaccgaaa aggaaggagt    5100
tagacaacct gaagtctagg tccctattta tttttttata gttatgttag tattaagaac    5160
gttatttata tttcaaattt ttcttttttt tctgtacaga cgcgtgtacg catgtaacat    5220
tatactgaaa accttgcttg agaaggtttt gggacgctcg aaggctttaa tttgcaagct    5280
gaattcccgg gccttaccgt cgacgaattt cagcattttc atttcaaggc gatattatgt    5340
ttcactaaac tcaggacagg aatatactaa gaataactac aacatacaca caacataagc    5400
caagatggat caacttaact accaagaaca acaacaattt caaaagatcg ttgaacaaaa    5460
gcaaatggct gatttcatga ggctatgaat tcgcccttga tctgggtgta tactgcacaa    5520
cctcattgtt cgggaatttg attctcatct cacatacagg cctgtagtat tgcgccctct    5580
ccttctcctt ctccttctcc ttctccaaga gagacttctc tctcatcgcc ctcgtcatca    5640
atggctgctc gctgtattgt cgttggagca tctcccgata cttctgcaac tgtgataaac    5700
tcatctcagg tgacccatcc gattctgtat cggtgtctcc atctggggct acatctcggg    5760
ccagtctaga tttaaacttt gcagaacctt cactttgggg gatatacact agtgtctctc    5820
ccgtgactac atcaccgaca ccctcaactg taccattatt attgtcattg ttttcctcta    5880
agttctcgct ttggtcttca tccatctctc cttcgggtgc tgtatcactc ttgatgattt    5940
ctctaaccct aatacggaga ctgtgattgc ctgaaataat acccacatct ttcaacttct    6000
gatgaagtga atctccagag atgaccttca tcagcacttg cacatcaacc acatcaccct    6060
ccttttgagc atccctcatg attccataga ctacatcccg tagcgtctcc ttgttcttgt    6120
acttcttaac aacagtctcg ccacagacat ggcccctgat aatcacctcc tgtctctcct    6180
catggccatc ctggtcgcca ttgtcttcgt cgctcggctc aattgccaat gtagcaccct    6240
gtggaagatt gcttagtctg tatggaacag actcatcaac tcttttgcca ttatgcatta    6300
acttgtactt tcgcccttgg ctaagttgaa aatgtttaca tccttcttca agtacattgg    6360
acatgattgt ccctgcattg acatttgtcc ggtaagtcct aaacccactc gctagattca    6420
ctgtaggcat attcaatcac gttccgtttg aaaaaaagga aaccaattta ttatctccag    6480
aaatagttgg cgtcttgcat cttgtttggt cttgatcttt cgtgtttttt ttttttctgt    6540
cttttttttt ctcctctctc caactttttg atttttagtg taccaaatcg cactgcttat    6600
ccacattcat cataaagagg gggggagaag aggggcagaa aataaaaggc catgtcacgt    6660
gcctgtgcat ttatttgtgt gtgtgtcacg tgctcaaaat gtcttttttt tacgttttta    6720
acattttccc tttctgtagt tgaatccatt tgcatgagtc gtacatgatg tttgctgtat    6780
ttacgttaag acactaattc aaatgacaaa cagctattat tcttagccat taatgcattt    6840
ttgcaaatct ttaactggat ttaactatgg ctaggtgaat ttgttctgga catcattgcc    6900
ttgacttgtt ttagtgccga tgtccttatc acttacactc gtaacacaac acaacagcag    6960
ctaatgttgt tgtgtatcgc ttgaccctta ataactgatt cttttttgat gaatgttaag    7020
aagaaacaaa caagaaaata aaatcaaaac aggcttcttt tgacctcttt caagagaagg    7080
ttttcttggt tgtttcatat accaagatct gaatatcttc tattattata caaaccactg    7140
attatacaaa tctattcatc gacagtatga gctacgaaaa cacactgata aagagagtca    7200
tttcttcccc ttctttttct ttttcttttt cttcttcttc ttagtatccc catcttcatt    7260
aactccacca agtagatcct ctacaccccc catggccgtt aaaaaatgtt cacgaaagaa    7320
atccatatca ttattcttac catccattaa actgtttaga tagatggtga tcatctccct    7380
tgcattgtct atatcttcaa cgtcgagtaa atgcgacgca atggtaccca gcttttgttc    7440
cctttagtga gggttaattg cgcgcttggc gtaatcatgg tcatagctgt ttcctgtgtg    7500
aaattgttat ccgctcacaa ttccacacaa catacgagcc ggaagcataa agtgtaaagc    7560
ctggggtgcc taatgagtga gctaactcac attaattgcg ttgcgctcac tgcccgcttt    7620
ccagtcggga aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg cggggagagg    7680
cggtttgcgt attgggcgct cttccgcttc ctcgctcact gactcgctgc gctcggtcgt    7740
tcggctgcgg cgagcggtat cagctcactc aaaggcggta atacggttat ccacagaatc    7800
aggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca ggaaccgtaa    7860
aaaggccgcg ttgctggcgt ttttccatag gctccgcccc cctgacgagc atcacaaaaa    7920
tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc    7980
ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg gatacctgtc    8040
cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc tcacgctgta ggtatctcag    8100
ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg ttcagcccga    8160
ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac acgacttatc    8220
gccactggca gcagccactg gtaacaggat tagcagagcg aggtatgtag gcggtgctac    8280
agagttcttg aagtggtggc ctaactacgg ctacactaga aggacagtat ttggtatctg    8340
cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcttgat ccggcaaaca    8400
aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag cagattacgc gcagaaaaaa    8460
aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgctcagt ggaacgaaaa    8520
ctcacgttaa gggattttgg tcatgagatt atcaaaaagg atcttcacct agatcctttt    8580
aaattaaaaa tgaagtttta aatcaatcta aagtatatat gagtaaactt ggtctgacag    8640
ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc tgtctatttc gttcatccat    8700
agttgcctga ctccccgtcg tgtagataac tacgatacgg gagggcttac catctggccc    8760
cagtgctgca atgataccgc gagacccacg ctcaccggct ccagatttat cagcaataaa    8820
ccagccagcc ggaagggccg agcgcagaag tggtcctgca actttatccg cctccatcca    8880
gtctattaat tgttgccggg aagctagagt aagtagttcg ccagttaata gtttgcgcaa    8940
cgttgttgcc attgctacag gcatcgtggt gtcacgctcg tcgtttggta tggcttcatt    9000
cagctccggt tcccaacgat caaggcgagt tacatgatcc cccatgttgt gcaaaaaagc    9060
ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag ttggccgcag tgttatcact    9120
catggttatg gcagcactgc ataattctct tactgtcatg ccatccgtaa gatgcttttc    9180
tgtgactggt gagtactcaa ccaagtcatt ctgagaatag tgtatgcggc gaccgagttg    9240
ctcttgcccg gcgtcaatac gggataatac cgcgccacat agcagaactt taaaagtgct    9300
catcattgga aaacgttctt cggggcgaaa actctcaagg atcttaccgc tgttgagatc    9360
cagttcgatg taacccactc gtgcacccaa ctgatcttca gcatctttta ctttcaccag    9420
cgtttctggg tgagcaaaaa caggaaggca aaatgccgca aaaaagggaa taagggcgac    9480
acggaaatgt tgaatactca tactcttcct ttttcaatat tattgaagca tttatcaggg    9540
ttattgtctc atgagcggat acatatttga atgtatttag aaaaataaac aaataggggt    9600
tccgcgcaca tttccccgaa aagtgccac                                      9629



<210> 12
<211> 4770
<212> DNA
<213> Artificial Sequence

<220> 
<223> A synthetic oligonucleotide

<400> 12
cgccattcgc cattcaggct gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg      60
ctattacgcc agctggcgaa agggggatgt gctgcaaggc gattaagttg ggtaacgcca     120
gggttttccc agtcacgacg ttgtaaaacg acggccagtg aattcgttaa ccccatgcac     180
tcaactatcc ccgtaaacat agtagtaggc tataggcttt cttctcgtca tcacagagaa     240
gccatataga gacatataca tattaataca tggacttcta ggcttacttt ctcacatact     300
tttcacgtac tttctcacat atttactaat acacaatttt tacattatgt gcatttgtaa     360
attgcaatgt aggattttcc tcattgacaa ggacaaaccc tcatttggga ggaaaagaga     420
aaataacttc tccgtttttt tttgtttttt tttctcagat gagccgccac tgtttttttt     480
ttttttgcat cccccttttt ccaatttgtc atgactataa aagcgcgaat tccccccccc     540
cccttgtctc ccgtttcttt cctgccatgc gagattgctc attcaatttg accttctctt     600
gatgtggcaa tttatctcca aaactgacag cttaagcttc tttccaatgt tggtttctgg     660
taaacgaatg ccatagagaa aggctggaga tacagagaca ggtctgtggt tgtttcccac     720
acgggtttgt ttttgattct cctcgacatg gtttgtctgc tgaatatata taaatattca     780
tacacctcta tctaactatc tatctataaa aaatcgcaat gtagtagcta tattaacatt     840
aactatggca ataataacaa tagcaatact gaaaacaatg atgttattat tattatcgtg     900
ggcaatacca aaaaccgagg tcattcaaaa gggcgcacat tggtggaaaa acaatacttg     960
cgcggaaaaa aaaaaattaa tatctgccaa atcctaaact gcaaatcatg gattgcaatt    1020
ctccgttatc tatataaggc taggaggttg ccaatcggta tataatagtt gggggggagg    1080
aggaggagga ttctctttag attatcactt gatacaaatc aatcggttat cttttattga    1140
aaagtcaaga cacaaaccaa ccaactaaac gagcggccgc gagtccatcg gttcctgtca    1200
gatgggatac tcttgacgtg gaaaattcaa acagaaaaaa aaccccaata atgaaaaata    1260
acactacgtt atatccgtgg tatcctctat cgtatcgtat cgtagcgtat cgtagcgtac    1320
cgtatcacag tatagtctaa tattccgtat cttattgtat cctatcctat tcgatcctat    1380
tgtatttcag tgcaccattt taatttctat tgctataatg tccttattag ttgccactgt    1440
gaggtgacca atggacgagg gcgagccgtt cagaagccgc gaagggtgtt cttcccatga    1500
atttcttaag gagggcggct cagctccgag agtgaggcga gacgtctcgg tcagcgtatc    1560
ccccttcctc ggcttttaca aatgatgcgc tcttaatagt gtgtcgttat ccttttggca    1620
ttgacggggg agggaaattg attgagcgca tccatatttt tgcggactgc tgaggacaat    1680
ggtggttttt ccgggtggcg tgggctacaa atgatacgat ggtttttttc ttttcggaga    1740
aggcgtataa aaaggacacg gagaacccat ttattctaaa aacagttgag cttctttaat    1800
tattttttga tataatattc tattattata tattttcttc ccaataaaac aaaataaaac    1860
aaaacacagc aaaacacaaa aagctagcta aaatgagagt tgattccgag atcattgtca    1920
agaaggaaac acaagatgaa aacctctacc aatctttagc agaaagatcc aaacatgaag    1980
agttcttaag aaaggccgtt gacttacttg tcgaaagagt tgttttcggc agatcaacca    2040
gatcttccaa ggttgttgag tgggctgcac cagacgagat taagaaagca attgacttaa    2100
agccacgttt gggtcctgct tctcacgatg agttgttggc ctttatggca aacgttgcta    2160
ggtattctgt caatactggt catccatact tcgttaacca gttattctca tccgttgatc    2220
catatggttt agttggtcaa tggttaaccg acgcattgaa cccatccgtt tacacctttg    2280
aagtcgctcc tgtctttacc ttgatggagg aagaggtttt gcgtgaaatg cgtaagattg    2340
tcggttggcc tgagggcgaa ggtgatggta tcttctgccc aggtggttct atcgccaacg    2400
gttacgctat ttcctgtgct agacatcact tttacccaga ggtcaagtat aagggtgttc    2460
atgctgttcc taagttagtt ttgtttacct ccgaactcgc gcactacagt accaaaaaga    2520
tggctgcttt catgggtatt ggttccgata actgtgttaa cattaagacc gatgatgttg    2580
gtaagatgaa catcgttgat ttagaaatga agatcaagat tgctattgac aacaagtgta    2640
ccccattcat ggttacagct acctctggta ctaccgtttt cggtgctttc gacccattag    2700
tcgcaatctc cgatctttgc aagaagtaca atctgtggtt gcatgtcgat gctgcttggg    2760
gtggtggtgc tctcatgtct aagaagcaca gacacctttt gaacggtatc gaattagccg    2820
actctgttac ttggaaccct cataagcttt tagccgctcc acaacaatgt tccacctttt    2880
tgaccagaca caagaaagtt ctatccgaag gtcattcatc caatgcaaag tacttattcc    2940
aaaaagacaa gttttacgac acatcctacg acactggtga taaacatatt caatgtggta    3000
gaagagcaga tgttcttaag ttctggttca tgtggaaggc taagggtact gaaggtttcg    3060
agaaacacgt tgacaagctt tttgataacg caaagtactt cttagaccac attaagcaga    3120
gagaaggttt ccaattggtt attgcagaac cacaatgcac caatatcatg ttctggtaca    3180
tcccaaagtg cctgagaggt tgtgaaaatg atgcagacta ctatgagaga ttacataaag    3240
ttgcgcctaa gatcaaggaa agaatgatca aggaaggttc aatgatggtc acctaccaac    3300
cacaaggtga tcttgttaac ttctttagaa ttgtttttca gaactctgca ttggaccata    3360
aggacatggt ttacttcgca aatgaattcg aaagattggg ctccgatatg atcgtttaat    3420
taattaattt attttactag tttatttttg ctcctgagaa taggattaca aacacttaaa    3480
gtctttaatt acaactatat ataatattct gttggttttc ttgaattggt tcgctgcgat    3540
tcatgcctcc cattcaccaa aggtggagtg ggaaataacg gttttactgc ggtaattagc    3600
agaggcaaga acaggataca ctttttgatg ataaatctgt attatagtcg agcctattta    3660
ggaaatcaaa ttttcttgtg tttacttttc aaataaataa tgttcgaaaa tttttacttt    3720
actccttcat ttaactatac cagacgttat atcatcaaca ccttctgacc atatacagct    3780
caagatgttt aagagtctgt taaatttttt caatccattt catggagtac caggaggtgc    3840
tacaaaagga attcatagcc tcatgaaatc agccatttgc ttttgttcaa cgatcttttg    3900
aaattgttgt tgttcttggt agttaagttg atccatcttg gcttatgttg tgtgtatgtt    3960
gtagttattc ttagtatatt cctgtcctga gtttagtgaa acataatatc gccttgaaat    4020
gaaaatgctg aaattcgtcg acatacaatt tttcaaactt tttttttttc ttggtgcacg    4080
gacatgtttt taaaggaagt actctatacc agttattctt cacaaattta attgctggag    4140
aatagatctt caacgcttta ataaagtagt ttgtttgtca aggatggcgt catacaaaga    4200
aagatcagaa tcacacactt cccctgttgc taggagactt ttctccatca tggaggaaaa    4260
gaagtctaac ctttgtgcat cattggatat tactgaaact gaaaagcttc tctctatttt    4320
ggacactatt ggtccttaca tctgtctagt taaaacacac atcgatattg tttctgattt    4380
tacgtatgaa ggaactgtgt tgcctttgaa ggagcttgcc aagaaacata attttatgat    4440
ttttgaagat agaaaatttg ctgatattgg taacactgtt aaaaatcaat ataaatctgg    4500
tgtcttccgt attgccgaat gggctgacat cactaatgca catggtgtaa cgggtgcagg    4560
tattgtttct ggcttgaagg aggcagccca agaaacaacc agtgaaccta gaggtttgct    4620
aatgcttgct gagttatcat caaagggttc tttagcatat ggtgaatata cagaaaaaac    4680
agtagaaatt gctaaatctg ataaagagtt tgtcattggt tttattgcgc aacacgatat    4740
gggcggtaga gaagaaggtt ttgactccgc                                     4770

<210> 13
<211> 456
<212> PRT
<213> Danaus plexippus

<400> 13
Lys Val Val Glu Trp Ala Ala Pro Asp Glu Ile Lys Lys Ala Ile Asp
 1               5                  10                  15
Leu Lys Pro Arg Leu Gly Pro Ala Ser His Asp Glu Leu Leu Ala Phe
            20                  25                  30
Met Ala Asn Val Ala Arg Tyr Ser Val Asn Thr Gly His Pro Tyr Phe
        35                  40                  45
Val Asn Gln Leu Phe Ser Ser Val Asp Pro Tyr Gly Leu Val Gly Gln
    50                  55                  60
Trp Leu Thr Asp Ala Leu Asn Pro Ser Val Tyr Thr Phe Glu Val Ala
65                  70                  75                  80
Pro Val Phe Thr Leu Met Glu Glu Glu Val Leu Arg Glu Met Arg Lys
                85                  90                  95
Ile Val Gly Trp Pro Glu Gly Glu Gly Asp Gly Ile Phe Cys Pro Gly
            100                 105                 110
Gly Ser Ile Ala Asn Gly Tyr Ala Ile Ser Cys Ala Arg His His Phe
        115                 120                 125
Tyr Pro Glu Val Lys Tyr Lys Gly Val His Ala Val Pro Lys Leu Val
    130                 135                 140
Leu Phe Thr Ser Glu Leu Ala His Tyr Ser Thr Lys Lys Met Ala Ala
145                 150                 155                 160
Phe Met Gly Ile Gly Ser Asp Asn Cys Val Asn Ile Lys Thr Asp Asp
                165                 170                 175
Val Gly Lys Met Asn Ile Val Asp Leu Glu Met Lys Ile Lys Ile Ala
            180                 185                 190
Ile Asp Asn Lys Cys Thr Pro Phe Met Val Thr Ala Thr Ser Gly Thr
        195                 200                 205
Thr Val Phe Gly Ala Phe Asp Pro Leu Val Ala Ile Ser Asp Leu Cys
    210                 215                 220
Lys Lys Tyr Asn Leu Trp Leu His Val Asp Ala Ala Trp Gly Gly Gly
225                 230                 235                 240
Ala Leu Met Ser Lys Lys His Arg His Leu Leu Asn Gly Ile Glu Leu
                245                 250                 255
Ala Asp Ser Val Thr Trp Asn Pro His Lys Leu Leu Ala Ala Pro Gln
            260                 265                 270
Gln Cys Ser Thr Phe Leu Thr Arg His Lys Lys Val Leu Ser Glu Gly
        275                 280                 285
His Ser Ser Asn Ala Lys Tyr Leu Phe Gln Lys Asp Lys Phe Tyr Asp
    290                 295                 300
Thr Ser Tyr Asp Thr Gly Asp Lys His Ile Gln Cys Gly Arg Arg Ala
305                 310                 315                 320
Asp Val Leu Lys Phe Trp Phe Met Trp Lys Ala Lys Gly Thr Glu Gly
                325                 330                 335
Phe Glu Lys His Val Asp Lys Leu Phe Asp Asn Ala Lys Tyr Phe Leu
            340                 345                 350
Asp His Ile Lys Gln Arg Glu Gly Phe Gln Leu Val Ile Ala Glu Pro
        355                 360                 365
Gln Cys Thr Asn Ile Met Phe Trp Tyr Ile Pro Lys Cys Leu Arg Gly
    370                 375                 380
Cys Glu Asn Asp Ala Asp Tyr Tyr Glu Arg Leu His Lys Val Ala Pro
385                 390                 395                 400
Lys Ile Lys Glu Arg Met Ile Lys Glu Gly Ser Met Met Val Thr Tyr
                405                 410                 415
Gln Pro Gln Gly Asp Leu Val Asn Phe Phe Arg Ile Val Phe Gln Asn
            420                 425                 430
Ser Ala Leu Asp His Lys Asp Met Val Tyr Phe Ala Asn Glu Phe Glu
        435                 440                 445
Arg Leu Gly Ser Asp Met Ile Val
    450                 455

<210> 14
<211> 5134
<212> DNA
<213> Artificial Sequence

<220> 
<223> A synthetic oligonucleotide

<400> 14
cgccattcgc cattcaggct gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg      60
ctattacgcc agctggcgaa agggggatgt gctgcaaggc gattaagttg ggtaacgcca     120
gggttttccc agtcacgacg ttgtaaaacg acggccagtg aattctttga aggagcttgc     180
caagaaacat aattttatga tttttgaaga tagaaaattt gctgatattg gtaacactgt     240
taaaaatcaa tataaatctg gtgtcttccg tattgccgaa tgggctgaca tcactaatgc     300
acatggtgta acgggtgcag gtattgtttc tggcttgaag gaggcagccc aagaaacaac     360
cagtgaacct agaggtttgc taatgcttgc tgagttatca tcaaagggtt ctttagcata     420
tggtgaatat acagaaaaaa cagtagaaat tgctaaatct gataaagagt ttgtcattgg     480
ttttattgcg caacacgata tgggcggtag agaagaaggt tttgactgga tcattatgac     540
tccaggggtt ggtttagatg acaaaggtga tgcacttggt caacaatata gaactgttga     600
tgaagttgta aagactggaa cggatatcat aattgttggt agaggtttgt acggtcaagg     660
aagagatcct atagagcaag ctaaaagata ccaacaagct ggttggaatg cttatttaaa     720
cagatttaaa tgattcttac acaaagattt gatacatgta cactagttta aataagcatg     780
aaaagaatta cacaagcaaa aaaaaaaaaa taaatgaggt actttacgtt cacctacaac     840
caaaaaaact agatagagta aaatcttaag atttagaaaa agttgtttaa caaaggcttt     900
agtatgtgaa tttttaatgt agcaaagcga taactaataa acataaacaa aagtatggtt     960
ttctttatca gtcaaatcat tatcgattga ttgttccgcg tatctgcaga tagcctcatg    1020
aaatcagcca tttgcttttg ttcaacgatc ttttgaaatt gttgttgttc ttggtagtta    1080
agttgatcca tcttggctta tgttgtgtgt atgttgtagt tattcttagt atattcctgt    1140
cctgagttta gtgaaacata atatcgcctt gaaatgaaaa tgctgaaatt cgtcgacata    1200
caatttttca aacttttttt ttttcttggt gcacggacat gtttttaaag gaagtactct    1260
ataccagtta ttcttcacaa atttaattgc tggagaatag atcttcaacg cgtttaaaca    1320
gcaatttgag gaaggaatag gagaaggaga agcaatttct aggaaagagc aaggtgtgca    1380
acagcatgct ctgaatgata ttttcagcaa tagttcagtt gaagaacctg ttggcgtatc    1440
tacatcactt cctacaaaca acaccacgaa ttgcgtccgt ggtgacgcaa ctacgaatgg    1500
cattgtcaat gccaatgcca gtgcacatac acgtgcaagt cccaccggtt ccctgcccgg    1560
ctatggtaga gacaagaagg acgataccgg catcgacatc aacagtttca acagcaatgc    1620
gtttggcgtc gacgcgtcga tggggctgcc gtatttggat ttggacgggc tagatttcga    1680
tatggatatg gatatggata tggatatgga gatgaatttg aatttagatt tgggtcttga    1740
tttggggttg gaattaaaag gggataacaa tgagggtttt cctgttgatt taaacaatgg    1800
acgtgggagg tgattgattt aacctgatcc aaaaggggta tgtctatttt ttagagtgtg    1860
tctttgtgtc aaattatggt agaatgtgta aagtagtata aactttcctc tcaaatgacg    1920
aggtttaaaa caccccccgg gtgagccgag ccgagaatgg ggcaattgtt caatgtgaaa    1980
tagaagtatc gagtgagaaa cttgggtgtt ggccagccaa gggggaagga aaatggcgcg    2040
aatgctcagg tgagattgtt ttggaattgg gtgaagcgag gaaatgagcg acccggaggt    2100
tgtgacttta gtggcggagg aggacggagg aaaagccaag agggaagtgt atataagggg    2160
agcaatttgc caccaggata gaattggatg agttataatt ctactgtatt tattgtataa    2220
tttatttctc cttttatatc aaacacatta caaaacacac aaaacacaca aacaaacaca    2280
tctagataaa atgagggttg actccaaaat catcgtcaaa aaggaaaccc aagatgaaaa    2340
cttgtaccaa tctttggctg aaagatccaa gcatgaagag tttttgagaa aagccgttga    2400
tttgttggtt gaaagagttg tttttggtag atccactaga tcctccaagg tcgttgaatg    2460
ggcagctcca gatgaaatca aaaaggccat cgatcttaag cctaggttag gcccagcttc    2520
ccatgatgaa ttgttagcct ttatggcaaa tgttgcaaga tattccgtca atactggtca    2580
cccatacttc gttaatcaat tgttttcatc tgttgatcca tacggtctag ttggtcaatg    2640
gttgaccgat gccttaaacc cttctgtcta tacctttgaa gttgctccag tctttacatt    2700
gatggaggaa gaggttttga gagaaatgag gaaaatcgtt ggttggccag aaggtgaggg    2760
tgatggcatc ttttgtccag gtggttctat cgccaacggt tacgctattt cttgtgctcg    2820
tcatcacttc tacccagaag ttaagtacaa aggtgttcat gctgtcccta agttagtctt    2880
gtttacctct gaattagctc actattctac aaagaaaatg gctgcattca tgggtattgg    2940
ttccgacaat tgtgtcaata tcaaaacaga cgacgttggt aagatgaaca ttgttgacct    3000
agaaatgaag attaagatcg caattgacaa caagtgtacc cctttcatgg ttacagctac    3060
atcaggtact actgttttcg gtgcttttga cccacttgtc gcaatctccg atttgtgtaa    3120
gaagtacaac ttatggctac atgtcgatgc ggcatggggt ggtggtgcgt taatgtcaaa    3180
gaagcataga catcttctaa acggtatcga attagcagac tccgttacct ggaatccaca    3240
caaattgtta gctgctcctc aacaatgttc cacattcctt accagacaca agaaggtcct    3300
gtctgaaggt cattcttcca acgccaaata cttgtttcag aaggacaagt tttacgacac    3360
ttcttatgac actggtgaca aacatattca gtgtggtaga agagcagatg ttttgaagtt    3420
ctggtttatg tggaaggcaa agggtacgga gggtttcgag aagcacgttg ataagttgtt    3480
tgacaatgct aagtacttcc tagatcacat taagcaaagg gaaggtttcc aattggtcat    3540
tgcagaacct cagtgtacca acatcatgtt ctggtatatc ccaaagtgtc taagaggttg    3600
tgaaaatgat gcagactact atgaaagatt gcataaagtc gctccaaaga tcaaagaaag    3660
aatgatcaaa gaaggttcta tgatggtcac ctaccaacca caaggtgatt tggtcaattt    3720
ctttagaatc gttttccaga attctgcttt agatcataag gatatggtct actttgcaaa    3780
cgaattcgaa agattaggtt ccgacatgat cgtttaatta acatctgaat gtaaaatgaa    3840
cattaaaatg aattactaaa ctttacgtct actttacaat ctataaactt tgtttaatca    3900
tataacgaaa tacactaata cacaatcctg tacgtatgta atacttttat ccatcaagga    3960
ttgagaaaaa aaagtaatga ttccctgggc cattaaaact tagaccccca agcttggata    4020
ggtcactctc tattttcgtt tctcccttcc ctgatagaag ggtgatatgt aattaagaat    4080
aatatataat tttataataa aagcggccgc caacttggtt caaaccaaga aagccgtcgt    4140
gtactttgcg tattttcaac ttcttccctt caactccaac cttataaccc aatccgacag    4200
ttagcataca gtagaaaatc cataagttaa ttgcaaaaac cacataattc ccaatgactg    4260
gcacaatttg catgaacggc gcaaaaccaa cacctatcct tcttgacttg tggacaactt    4320
ctccggcatc ggctccagtt tctggtttag gcgacacttt caaggtaccc ttgttgacta    4380
ggttctcagc aacatactcc tcctcgataa ctttgaccgt ctttgtgcca aacacattgt    4440
tcaatagttt ggacgctcgc ataccgaaga cagactcgtc catccaatag catgtactcc    4500
gaatgctctc aatcaacggt atcaaatcgc ttggtgtctt tggaggcggt gttctaggtt    4560
tcatcttcgc cttcccgttt cctgtttctt ggacttcaaa atacgggtca tatgcatctt    4620
cattggtgaa cctgaacagt ttgggccgag tagtgccaaa caccttcaac agtataatct    4680
tcactccttg gccgattaga aacatgctta tatgtttgcc ggtattctta aagtgtatac    4740
tggagccctt ttaatcacat tttttttact tgtcagtctc catacggagt ttaatgtcct    4800
tatatcgatc ttcatcagct ccaacgggac aggaatcaat aaccttcctt gcctgtccca    4860
aaaagaatga attttttttc aaaagcttta cgatgcatac cacaaaagga agattattcc    4920
cacatgttcc agaagtgtgc ggagatacaa agggttcatg aaaacgtgaa tcttctaaaa    4980
acttagcaca acaataaaaa tctacaatgt tacagtaagt attattttct ttttgtcgac    5040
acactccaac ggttagattt ccaagtattc aatccaatgt attacttgtc agacagccat    5100
ccactcccat cttagaacat cacttccgaa ccgc                                5134



<210> 15
<211> 4685
<212> DNA
<213> Artificial Sequence

<220> 
<223> A synthetic oligonucleotide

<400> 15
ctggcgaaag ggggatgtgc tgcaaggcga ttaagttggg taacgccagg gttttcccag      60
tcacgacgtt gtaaaacgac ggccagtgaa ttcgttaacc ccatgcactc aactatcccc     120
gtaaacatag tagtaggcta taggctttct tctcgtcatc acagagaagc catatagaga     180
catatacata ttaatacatg gacttctagg cttactttct cacatacttt tcacgtactt     240
tctcacatat ttactaatac acaattttta cattatgtgc atttgtaaat tgcaatgtag     300
gattttcctc attgacaagg acaaaccctc atttgggagg aaaagagaaa ataacttctc     360
cgtttttttt tgtttttttt tctcagatga gccgccactg tttttttttt ttttgcatcc     420
ccctttttcc aatttgtcat gactataaaa gcgcgaattc cccccccccc cttgtctccc     480
gtttctttcc tgccatgcga gattgctcat tcaatttgac cttctcttga tgtggcaatt     540
tatctccaaa actgacagct taagcttctt tccaatgttg gtttctggta aacgaatgcc     600
atagagaaag gctggagata cagagacagg tctgtggttg tttcccacac gggtttgttt     660
ttgattctcc tcgacatggt ttgtctgctg aatatatata aatattcata cacctctatc     720
taactatcta tctataaaaa atcgcaatgt agtagctata ttaacattaa ctatggcaat     780
aataacaata gcaatactga aaacaatgat gttattatta ttatcgtggg caataccaaa     840
aaccgaggtc attcaaaagg gcgcacattg gtggaaaaac aatacttgcg cggaaaaaaa     900
aaaattaata tctgccaaat cctaaactgc aaatcatgga ttgcaattct ccgttatcta     960
tataaggcta ggaggttgcc aatcggtata taatagttgg gggggaggag gaggaggatt    1020
ctctttagat tatcacttga tacaaatcaa tcggttatct tttattgaaa agtcaagaca    1080
caaaccaacc aactaaacga gcggccgcga gtccatcggt tcctgtcaga tgggatactc    1140
ttgacgtgga aaattcaaac agaaaaaaaa ccccaataat gaaaaataac actacgttat    1200
atccgtggta tcctctatcg tatcgtatcg tagcgtatcg tagcgtaccg tatcacagta    1260
tagtctaata ttccgtatct tattgtatcc tatcctattc gatcctattg tatttcagtg    1320
caccatttta atttctattg ctataatgtc cttattagtt gccactgtga ggtgaccaat    1380
ggacgagggc gagccgttca gaagccgcga agggtgttct tcccatgaat ttcttaagga    1440
gggcggctca gctccgagag tgaggcgaga cgtctcggtc agcgtatccc ccttcctcgg    1500
cttttacaaa tgatgcgctc ttaatagtgt gtcgttatcc ttttggcatt gacgggggag    1560
ggaaattgat tgagcgcatc catatttttg cggactgctg aggacaatgg tggtttttcc    1620
gggtggcgtg ggctacaaat gatacgatgg tttttttctt ttcggagaag gcgtataaaa    1680
aggacacgga gaacccattt attctaaaaa cagttgagct tctttaatta ttttttgata    1740
taatattcta ttattatata ttttcttccc aataaaacaa aataaaacaa aacacagcaa    1800
aacacaaaaa gctagctaaa atgagagttg attccaagat cattgtcaag aaagagactc    1860
aagacgaaaa cttgtatcaa tccttgcaag gttgttgagt gggctgcacc agacgagatt    1920
aagaaagcaa ttgacttaaa gccacgtttg ggtcctgctt ctcacgatga gttgttggcc    1980
tttatggcaa acgttgctag gtattctgtc aatactggtc atccatactt cgttaaccag    2040
ttattctcat ccgttgatcc atatggttta gttggtcaat ggttaaccga cgcattgaac    2100
ccatccgttt acacctttga agtcgctcct gtctttacct tgatggagga agaggttttg    2160
cgtgaaatgc gtaagattgt cggttggcct gagggcgaag gtgatggtat cttctgccca    2220
ggtggttcta tcgccaacgg ttacgctatt tcctgtgcta gacatcactt ttacccagag    2280
gtcaagtata agggtgttca tgctgttcct aagttagttt tgtttacctc cgaactcgcg    2340
cactacagta ccaaaaagat ggctgctttc atgggtattg gttccgataa ctgtgttaac    2400
attaagaccg atgatgttgg taagatgaac atcgttgatt tagaaatgaa gatcaagatt    2460
gctattgaca acaagtgtac cccattcatg gttacagcta cctctggtac taccgttttc    2520
ggtgctttcg acccattagt cgcaatctcc gatctttgca agaagtacaa tctgtggttg    2580
catgtcgatg ctgcttgggg tggtggtgct ctcatgtcta agaagcacag acaccttttg    2640
aacggtatcg aattagccga ctctgttact tggaaccctc ataagctttt agccgctcca    2700
caacaatgtt ccaccttttt gaccagacac aagaaagttc tatccgaagg tcattcatcc    2760
aatgcaaagt acttattcca aaaagacaag ttttacgaca catcctacga cactggtgat    2820
aaacatattc aatgtggtag aagagcagat gttcttaagt tctggttcat gtggaaggct    2880
aagggtactg aaggtttcga gaaacacgtt gacaagcttt ttgataacgc aaagtacttc    2940
ttagaccaca ttaagcagag agaaggtttc caattggtta ttgcagaacc acaatgcacc    3000
aatatcatgt tctggtacat cccaaagtgc ctgagaggtt gtgaaaatga tgcagactac    3060
tatgagagat tacataaagt tgcgcctaag atcaaggaaa gaatgatcaa ggaaggttca    3120
atgatggtca cctaccaacc acaaggtgat cttgttaact tctttagaat tgtttttcag    3180
aactctgcat tggaccataa ggacatggtt tacttcgcaa atgaattcga aagattgggc    3240
tccgatatga tcgtttaatt aattaattta ttttactagt ttatttttgc tcctgagaat    3300
aggattacaa acacttaaag tctttaatta caactatata taatattctg ttggttttct    3360
tgaattggtt cgctgcgatt catgcctccc attcaccaaa ggtggagtgg gaaataacgg    3420
ttttactgcg gtaattagca gaggcaagaa caggatacac tttttgatga taaatctgta    3480
ttatagtcga gcctatttag gaaatcaaat tttcttgtgt ttacttttca aataaataat    3540
gttcgaaaat ttttacttta ctccttcatt taactatacc agacgttata tcatcaacac    3600
cttctgacca tatacagctc aagatgttta agagtctgtt aaattttttc aatccatttc    3660
atggcctgca gggataactt cgtataatgt atgctatacg aagttatgct gcaacggcaa    3720
catcaatgtc cacgtttaca cacctacatt tatatctata tttatattta tatttattta    3780
tttatgctac ttagcttcta tagttagtta atgcactcac gatattcaaa attgacaccc    3840
ttcaactact ccctactatt gtctactact gtctactact cctctttact atagctgctc    3900
ccaataggct ccaccaatag gctctgtcaa tacattttgc gccgccacct ttcaggttgt    3960
gtcactcctg aaggaccata ttgggtaatc gtgcaatttc tggaagagag tgccgcgaga    4020
agtgaggccc ccactgtaaa tcctcgaggg ggcatggagt atggggcatg aaggatggag    4080
gatggggggg gggggggaaa ataggtagcg aaaggacccg ctatcacccc acccggagaa    4140
ctcgttgccg ggaagtcata tttcgacact ccggggagtc tataaaaggc gggttttgtc    4200
ttttgccagt tgatgttgct gagaggactt gtttgccgtt tcttccgatt taacagtata    4260
gaatcaacca ctgttaatta tacacgttat actaacacaa caaaaacaaa aacaacgaca    4320
acaacaatct agatgaaaaa gcctgaactc accgcgacgt ctgtcgagaa gtttctgatc    4380
gaaaagttcg acagcgtctc cgacctgatg cagctctcgg agggcgaaga atctcgtgct    4440
ttcagcttcg atgtaggagg gcgtggatat gtcctgcggg taaatagctg cgccgatggt    4500
ttctacaaag atcgttatgt ttatcggcac tttgcatcgg ccgcgctccc gattccggaa    4560
gtgcttgaca ttggggaatt cagcgagagc ctgacctatt gcatctcccg ccgtgcacag    4620
ggtgtcacgt tgcaagacct gcctgaaacc gaactgcccg ctgttctgca gccggtcgcg    4680
gaggc                                                                4685

<210> 16
<211> 5656
<212> DNA
<213> Artificial Sequence

<220> 
<223> A synthetic oligonucleotide

<400> 16
tcgagggggc atggagtatg gggcatgaag gatggaggat gggggggggg ggggaaaata      60
ggtagcgaaa ggacccgcta tcaccccacc cggagaactc gttgccggga agtcatattt     120
cgacactccg gggagtctat aaaaggcggg ttttgtcttt tgccagttga tgttgctgag     180
aggacttgtt tgccgtttct tccgatttaa cagtatagaa tcaaccactg ttaattatac     240
acgttatact aacacaacaa aaacaaaaac aacgacaaca acaatctaga tgaaaaagcc     300
tgaactcacc gcgacgtctg tcgagaagtt tctgatcgaa aagttcgaca gcgtctccga     360
cctgatgcag ctctcggagg gcgaagaatc tcgtgctttc agcttcgatg taggagggcg     420
tggatatgtc ctgcgggtaa atagctgcgc cgatggtttc tacaaagatc gttatgttta     480
tcggcacttt gcatcggccg cgctcccgat tccggaagtg cttgacattg gggaattcag     540
cgagagcctg acctattgca tctcccgccg tgcacagggt gtcacgttgc aagacctgcc     600
tgaaaccgaa ctgcccgctg ttctgcagcc ggtcgcggag gccatggatg cgatcgctgc     660
ggccgatctt agccagacga gcgggttcgg cccattcgga ccgcaaggaa tcggtcaata     720
cactacatgg cgtgatttca tatgcgcgat tgctgatccc catgtgtatc actggcaaac     780
tgtgatggac gacaccgtca gtgcgtccgt cgcgcaggct ctcgatgagc tgatgctttg     840
ggccgaggac tgccccgaag tccggcacct cgtgcacgcg gatttcggct ccaacaatgt     900
cctgacggac aatggccgca taacagcggt cattgactgg agcgaggcga tgttcgggga     960
ttcccaatac gaggtcgcca acatcttctt ctggaggccg tggttggctt gtatggagca    1020
gcagacgcgc tacttcgagc ggaggcatcc ggagcttgca ggatcgccgc ggctccgggc    1080
gtatatgctc cgcattggtc ttgaccaact ctatcagagc ttggttgacg gcaatttcga    1140
tgatgcagct tgggcgcagg gtcgatgcga cgcaatcgtc cgatccggag ccgggactgt    1200
cgggcgtaca caaatcgccc gcagaagcgc ggccgtctgg accgatggct gtgtagaagt    1260
actcgccgat agtggaaacc gacgccccag cactcgtccg agggcaaagg aatagaaatt    1320
cggatccggt agatacattg atgctatcaa tccagagaac tggaaagatt gtgtagcctt    1380
gaaaaacggt gaaacttacg ggtccaagat tgtctacaga ttttcctgat ttgccagctt    1440
actatccttc ttgaaaatat gcactctata tcttttagtt cttaattgca acacatagat    1500
ttgctgtata acgaatttta tgctattttt taaatttgga gttcagtgat aaaagtgtca    1560
cagcgaattt cctcacatgt agggaccgaa ttgtttacaa gttctctgta ccataacttc    1620
gtataatgta tgctatacga agttattaag ttaaacagca atttgaggaa ggaataggag    1680
aaggagaagc aatttctagg aaagagcaag gtgtgcaaca gcatgctctg aatgatattt    1740
tcagcaatag ttcagttgaa gaacctgttg gcgtatctac atcacttcct acaaacaaca    1800
ccacgaattg cgtccgtggt gacgcaacta cgaatggcat tgtcaatgcc aatgccagtg    1860
cacatacacg tgcaagtccc accggttccc tgcccggcta tggtagagac aagaaggacg    1920
ataccggcat cgacatcaac agtttcaaca gcaatgcgtt tggcgtcgac gcgtcgatgg    1980
ggctgccgta tttggatttg gacgggctag atttcgatat ggatatggat atggatatgg    2040
atatggagat gaatttgaat ttagatttgg gtcttgattt ggggttggaa ttaaaagggg    2100
ataacaatga gggttttcct gttgatttaa acaatggacg tgggaggtga ttgatttaac    2160
ctgatccaaa aggggtatgt ctatttttta gagtgtgtct ttgtgtcaaa ttatggtaga    2220
atgtgtaaag tagtataaac tttcctctca aatgacgagg tttaaaacac cccccgggtg    2280
agccgagccg agaatggggc aattgttcaa tgtgaaatag aagtatcgag tgagaaactt    2340
gggtgttggc cagccaaggg ggaaggaaaa tggcgcgaat gctcaggtga gattgttttg    2400
gaattgggtg aagcgaggaa atgagcgacc cggaggttgt gactttagtg gcggaggagg    2460
acggaggaaa agccaagagg gaagtgtata taaggggagc aatttgccac caggatagaa    2520
ttggatgagt tataattcta ctgtatttat tgtataattt atttctcctt ttatatcaaa    2580
cacattacaa aacacacaaa acacacaaac aaacacatct agataaaatg agggttgact    2640
ccaaaatcat cgtcaaaaag gaaacccaag atgaaaactt gtaccaatct ttggctgaaa    2700
gatccaagca tgaagagttt ttgagaaaag ccgttgattt gttggttgaa agagttgttt    2760
ttggtagatc cactagatcc tccaaggtcg ttgaatgggc agctccagat gaaatcaaaa    2820
aggccatcga tcttaagcct aggttaggcc cagcttccca tgatgaattg ttagccttta    2880
tggcaaatgt tgcaagatat tccgtcaata ctggtcaccc atacttcgtt aatcaattgt    2940
tttcatctgt tgatccatac ggtctagttg gtcaatggtt gaccgatgcc ttaaaccctt    3000
ctgtctatac ctttgaagtt gctccagtct ttacattgat ggaggaagag gttttgagag    3060
aaatgaggaa aatcgttggt tggccagaag gtgagggtga tggcatcttt tgtccaggtg    3120
gttctatcgc caacggttac gctatttctt gtgctcgtca tcacttctac ccagaagtta    3180
agtacaaagg tgttcatgct gtccctaagt tagtcttgtt tacctctgaa ttagctcact    3240
attctacaaa gaaaatggct gcattcatgg gtattggttc cgacaattgt gtcaatatca    3300
aaacagacga cgttggtaag atgaacattg ttgacctaga aatgaagatt aagatcgcaa    3360
ttgacaacaa gtgtacccct ttcatggtta cagctacatc aggtactact gttttcggtg    3420
cttttgaccc acttgtcgca atctccgatt tgtgtaagaa gtacaactta tggctacatg    3480
tcgatgcggc atggggtggt ggtgcgttaa tgtcaaagaa gcatagacat cttctaaacg    3540
gtatcgaatt agcagactcc gttacctgga atccacacaa attgttagct gctcctcaac    3600
aatgttccac attccttacc agacacaaga aggtcctgtc tgaaggtcat tcttccaacg    3660
ccaaatactt gtttcagaag gacaagtttt acgacacttc ttatgacact ggtgacaaac    3720
atattcagtg tggtagaaga gcagatgttt tgaagttctg gtttatgtgg aaggcaaagg    3780
gtacggaggg tttcgagaag cacgttgata agttgtttga caatgctaag tacttcctag    3840
atcacattaa gcaaagggaa ggtttccaat tggtcattgc agaacctcag tgtaccaaca    3900
tcatgttctg gtatatccca aagtgtctaa gaggttgtga aaatgatgca gactactatg    3960
aaagattgca taaagtcgct ccaaagatca aagaaagaat gatcaaagaa ggttctatga    4020
tggtcaccta ccaaccacaa ggtgatttgg tcaatttctt tagaatcgtt ttccagaatt    4080
ctgctttaga tcataaggat atggtctact ttgcaaacga attcgaaaga ttaggttccg    4140
acatgatcgt ttaattaaca tctgaatgta aaatgaacat taaaatgaat tactaaactt    4200
tacgtctact ttacaatcta taaactttgt ttaatcatat aacgaaatac actaatacac    4260
aatcctgtac gtatgtaata cttttatcca tcaaggattg agaaaaaaaa gtaatgattc    4320
cctgggccat taaaacttag acccccaagc ttggataggt cactctctat tttcgtttct    4380
cccttccctg atagaagggt gatatgtaat taagaataat atataatttt ataataaaag    4440
cggccgccaa cttggttcaa accaagaaag ccgtcgtgta ctttgcgtat tttcaacttc    4500
ttcccttcaa ctccaacctt ataacccaat ccgacagtta gcatacagta gaaaatccat    4560
aagttaattg caaaaaccac ataattccca atgactggca caatttgcat gaacggcgca    4620
aaaccaacac ctatccttct tgacttgtgg acaacttctc cggcatcggc tccagtttct    4680
ggtttaggcg acactttcaa ggtacccttg ttgactaggt tctcagcaac atactcctcc    4740
tcgataactt tgaccgtctt tgtgccaaac acattgttca atagtttgga cgctcgcata    4800
ccgaagacag actcgtccat ccaatagcat gtactccgaa tgctctcaat caacggtatc    4860
aaatcgcttg gtgtctttgg aggcggtgtt ctaggtttca tcttcgcctt cccgtttcct    4920
gtttcttgga cttcaaaata cgggtcatat gcatcttcat tggtgaacct gaacagtttg    4980
ggccgagtag tgccaaacac cttcaacagt ataatcttca ctccttggcc gattagaaac    5040
atgcttatat gtttgccggt attcttaaag tgtatactgg agccctttta atcacatttt    5100
ttttacttgt cagtctccat acggagttta atgtccttat atcgatcttc atcagctcca    5160
acgggacagg aatcaataac cttccttgcc tgtcccaaaa agaatgaatt ttttttcaaa    5220
agctttacga tgcataccac aaaaggaaga ttattcccac atgttccaga agtgtgcgga    5280
gatacaaagg gttcatgaaa acgtgaatct tctaaaaact tagcacaaca ataaaaatct    5340
acaatgttac agtaagtatt attttctttt tgtcgacaca ctccaacggt tagatttcca    5400
agtattcaat ccaatgtatt acttgtcaga cagccatcca ctcccatctt agaacatcac    5460
ttccgaaccg cggagcttgg cgtaatcatg gtcatagctg tttcctgtgt gaaattgtta    5520
tccgctcaca attccacaca acatacgagc cggaagcata aagtgtaaag cctggggtgc    5580
ctaatgagtg agctaactca cattaattgc gttgcgctca ctgcccgctt tccagtcggg    5640
aaacctgtcg tgccag                                                    5656



<210> 17
<211> 3235
<212> DNA
<213> Artificial Sequence

<220> 
<223> A synthetic oligonucleotide

<400> 17
cgcccagcag atatcaatgc tttacttggg gaaatccatc aaggtgcaag gttgaagaag      60
gtcgacgaca gcgagaaaca cattgctgac ggtgccgttg ttggcagagt attgtaattg     120
caacatgttc ctgtttagtc tagcatatat atacttactt ataaagctaa aatattgcgt     180
tttgttctag ttttaactgc cgtctctttc ttgccgagcc ccctttattt tcttccttta     240
gwgaggcaat aawatttcga taagcgcaat cccgaatttt ttctgcggcg agagagatgc     300
gggactgggc ttccgaagaa cgtaaccaaa ctgattttat ccaatataaa agagagggtg     360
ccacagctgt ttgcgggaga aggtactctt atttacacgg tagtcttctc tcacatttca     420
gatataatac aagcaatatt caacagcaga caggcctgcg gccgcggatc aattcgccct     480
tacatatgga taacttcgta taatgtatgc tatacgaagt tatgctgcaa cggcaacatc     540
aatgtccacg tttacacacc tacatttata tctatattta tatttatatt tatttattta     600
tgctacttag cttctatagt tagttaatgc actcacgata ttcaaaattg acacccttca     660
actactccct actattgtct actactgtct actactcctc tttactatag ctgctcccaa     720
taggctccac caataggctc tgtcaataca ttttgcgccg ccacctttca ggttgtgtca     780
ctcctgaagg accatattgg gtaatcgtgc aatttctgga agagagtgcc gcgagaagtg     840
aggcccccac tgtaaatcct cgagggggca tggagtatgg ggcatggagg atggaggatg     900
gggggggggg gggaaaatag gtagcgaaag gacccgctat caccccaccc ggagaactcg     960
ttgccgggaa gtcatatttc gacactccgg ggagtctata aaaggcgggt tttgtctttt    1020
gccagttgat gttgctgaga ggacttgttt gccgtttctt ccgatttaac agtatagaat    1080
caaccactgt taattataca cgttatacta acacaacaaa aacaaaaaca acgacaacaa    1140
caacaacata aaatgaagtt tacacaaatt gcacaagctt tagcactagc aggatctgca    1200
actgctgttt ctccgagtta caatggtctt ggtctcaccc cacagatggg ttgggacagc    1260
tggaatacgt ttgcctgcga tgtcagtgaa cagctacttc tagacactgc tgatagagtt    1320
tctgacttgg ggctaaagga tatgggttac aagtatgtca tcctagatga ctgttggtct    1380
agcggcaggg attccgacgg tttcctcgtt gcagacaagc acaaatttcc caacggtatg    1440
ggccatgttg cagaccacct gcataataac agctttcttt tcggtatgta ttcgtctgct    1500
ggtgagtaca cctgtgctgg gtaccctggg tctctggggc gtgaggaaga agatgctcaa    1560
ttctttgcaa ataaccgcgt tgactacttg aagtatgata attgttacaa taaaggtcaa    1620
tttggtacac cagacgtttc ttaccaccgt tacaaggcca tgtcagatgc tttgaataaa    1680
actggtaggc ctattttcta ttctctatgt aactggggtc aggatttgac attttactgg    1740
ggctctggta tcgccaattc ttggagaatg agcggagata ttactgctga gttcacccgc    1800
ccagatagca gatgtccctg tgacggtgac gaatatgatt gcaagtacgc cggtttccat    1860
tgttctatta tgaatattct taacaaggca gctccaatgg ggcaaaatgc aggtgttggt    1920
ggttggaacg atctggacaa tctagaggtc ggagtcggta acttgactga cgatgaggaa    1980
aaggcccatt tctctatgtg ggcaatggta aagtccccac ttatcattgg tgccgacgtg    2040
aatcacttaa aggcatcttc gtactcgatc tacagtcaag cctctgtcat cgcaattaat    2100
caagatccaa agggtattcc agccacaaga gtctggagat attatgtttc agacaccgat    2160
gaatatggac aaggtgaaat tcaaatgtgg agtggtccgc ttgacaatgg tgaccaagtg    2220
gttgctttat tgaatggagg aagcgtagca agaccaatga acacgacctt ggaagagatt    2280
ttctttgaca gcaatttggg ttcaaaggaa ctgacatcga cttgggatat ttacgactta    2340
tgggccaaca gagttgacaa ctctacggcg tctgctatcc ttgaacagaa taaggcagcc    2400
accggtattc tctacaatgc tacagagcag tcttataaag acggtttgtc taagaatgat    2460
acaagactgt ttggccagaa aattggtagt ctttctccaa atgctatact taacacaact    2520
gttccagctc atggtatcgc cttctatagg ttgagaccct cggcttaagc tcaatgttga    2580
gcaaagcagg acgagaaaaa aaaaaataat gattgttaag aagttcatga aaaaaaaaag    2640
gaaaaatact caaatactta taacagagtg attaaataat aaacggcagt ataccctatc    2700
aggtattgag atagttttat ttttgtaggt atataatctg aagcctttga actattttct    2760
cgtatatatc atggagtata cattgcatta gcaacattgc atactagttc ataacttcgt    2820
ataatgtatg ctatacgaag ttattaatta agccgaaaag gtctatgagg gaaccgagcc    2880
attgtatggt accgatattg cagaattgat tctatttgca gtttctagac ctcaaaacac    2940
tgttattgca gaaacacttg tttttgctag taaccaagct tctgcttacc atattttcag    3000
aggatcatta gataaataga tttgatataa acgcttctat aataataaat aayatcaaca    3060
agtgactaac cccaagtatc caattttaga yctaatgcct gaaagtgcct tgcacgattt    3120
aggattcggc accagaaagt ttgtgccgtg ctcgcttatt ggacttgtat agtgataagg    3180
caaaaaaaaa aaacttgaaa gtacttgcgt aactcagagg ttgccttttc gggcc         3235

<210> 18
<211> 3132
<212> DNA
<213> Artificial Sequence

<220> 
<223> A synthetic oligonucleotide

<400> 18
cgcccagcag atatcaatgc tttacttggg gaaatccatc aaggtgcaag gttgaagaag      60
gtcgacgaca gcgagaaaca cattgctgac ggtgccgttg ttggcagagt attgtaattg     120
caacatgttc ctgtttagtc tagcatatat atacttactt ataaagctaa aatattgcgt     180
tttgttctag ttttaactgc cgtctctttc ttgccgagcc ccctttattt tcttccttta     240
gagaggcaat aaaatttcga taagcgcaat cccgaatttt ttctgcggcg agagagatgc     300
gggactgggc ttccgaagaa cgtaaccaaa ctgattttat ccaatataaa agagagggtg     360
ccacagctgt ttgcgggaga aggtactctt atttacacgg tagtcttctc tcacatttca     420
gatataatac aagcaatatt caacagcaga cgcggccgcg ttgttgatgc tgcgcacctg     480
tggttgccca acatggttgt atatcgtgta accacaccaa cacatgtgca gcacatgtgt     540
ttaaaagagt gtcatggagg tggatcatga tggaagtgga ctttaccact tgggaactgt     600
ctccactccc gggaagaaaa gacccggcgt atcacgcggt tgcctcaatg gggcaatttg     660
gaaggagaaa tatagggaaa atcacgtcgc tctcggacgg ggaagagttc cagactatga     720
gggggggggg tggtatataa agacaggaga tgtccacccc cagagagagg aagaagttgg     780
aactttagaa gagagagata actttcccca gtgtccatca atacacaacc aaacacaaac     840
tctatattta cacatataac cccctccaac caaaaggcta gcatatatcg atcgtaaatc     900
taactaatgc ttttactaaa tatctagtac aatttttaca gtccctacgt ttataaatga     960
atttaatgaa aaaaaaatat tttgtaacga tgtgtttatt aagttgcgct cttccgataa    1020
tcccggactt tggttaattt ctcaatgggt ttttttttca aaaccattgt tgtagtgtaa    1080
cagactttaa caaaaggaca tcactctaca gggcagcttt aaaatccctc agtgtaattg    1140
ttcttcattc ataacgtggc agtcaaggac tcgaggagtc catcggttcc tgtcagatgg    1200
gatactcttg acgtggaaaa ttcaaacaga aaaaaaaccc caataatgaa aaataacact    1260
acgttatatc cgtggtatcc tctatcgtat cgtatcgtag cgtatcgtag cgtaccgtat    1320
cacagtatag tctaatattc cgtatcttat tgtatcctat cctattcgat cctattgtat    1380
ttcagtgcac cattttaatt tctattgcta taatgtcctt attagttgcc actgtgaggt    1440
gaccaatgga cgagggcgag ccgttcagaa gccgcgaagg gtgttcttcc catgaatttc    1500
ttaaggaggg cggctcagct ccgagagtga ggcgagacgt ctcggtcagc gtatccccct    1560
tcctcggctt ttacaaatga tgcgctctta atagtgtgtc gttatccttt tggcattgac    1620
gggggaggga aattgattga gcgcatccat atttttgcgg actgctgagg acaatggtgg    1680
tttttccggg tggcgtgggc tacaaatgat acgatggttt ttttcttttc ggagaaggcg    1740
tataaaaagg acacggagaa cccatttatt ctaaaaacag ttgagcttct ttaattattt    1800
tttgatataa tattctatta ttatatattt tcttcccaat aaaacaaaat aaaacaaaac    1860
acagcaaaac acaaaaattc tagactatct taattaacat ctgaatgtaa aatgaacatt    1920
aaaatgaatt actaaacttt acgtctactt tacaatctat aaactttgtt taatcatata    1980
acgaaataca ctaatacaca atcctgtacg tatgtaatac ttttatccat caaggattga    2040
gaaaaaaaag taatgattcc ctgggccatt aaaacttaga cccccaagct tggataggtc    2100
actctctatt ttcgtttctc ccttccctga tagaagggtg atatgtaatt aagaataata    2160
tataatttta taataaaaga attcgccctt acatatgata acttcgtata atgtatgcta    2220
tacgaagtta tcatagcctc atgaaatcag ccatttgctt ttgttcaacg atcttttgaa    2280
attgttgttg ttcttggtag ttaagttgat ccatcttggc ttatgttgtg tgtatgttgt    2340
agttattctt agtatattcc tgtcctgagt ttagtgaaac ataatatcgc cttgaaatga    2400
aaatgctgaa attcgtcgac atacaatttt tcaaactttt tttttttctt ggtgcacgga    2460
catgttttta aaggaagtac tctataccag ttattcttca caaatttaat tgctggagaa    2520
tagatcttca acgctttaat aaagtagttt gtttgtcaag gatggcgtca tacaaagaaa    2580
gatcagaatc acacacttcc cctgttgcta ggagactttt ctccatcatg gaggaaaaga    2640
agtctaacct ttgtgcatca ttggatatta ctgaaactga aaagcttctc tctattttgg    2700
acactattgg tccttacatc tgtctagtta aaacacacat cgatattgtt tctgatttta    2760
cgtatgaagg aactgtgttg cctttgaagg agcttgccaa gaaacataat tttatgattt    2820
ttgaagatag aaaatttgct gatattggta acactgttaa aaatcaatat aaatctggtg    2880
tcttccgtat tgccgaatgg gctgacatca ctaatgcaca tggtgtaacg ggtgcaggta    2940
ttgtttctgg cttgaaggag gcagcccaag aaacaaccag tgaacctaga ggtttgctaa    3000
tgcttgctga gttatcatca aagggttctt tagcatatgg tgaatataca gaaaaaacag    3060
tagaaattgc taaatctgat aaagagtttg tcattggttt tattgcgcaa cacgatatgg    3120
gcggtagaga ag                                                        3132

<210> 19
<211> 4228
<212> DNA
<213> Artificial Sequence

<220> 
<223> A synthetic oligonucleotide

<400> 19
ccacatggtg taacgggtgc aggtattgtt tctggcttga aggaggcagc ccaagaaaca      60
accagtgaac ctagaggttt gctaatgctt gctgagttat catcaaaggg ttctttagca     120
tatggtgaat atacagaaaa aacagtagaa attgctaaat ctgataaaga gtttgtcatt     180
ggttttattg cgcaacacga tatgggcggt agagaagaag gttttgactg gatcattatg     240
actccagggg ttggtttaga tgacaaaggt gatgcacttg gtcaacaata tagaactgtt     300
gatgaagttg taaagactgg aacggatatc ataattgttg gtagaggttt gtacggtcaa     360
ggaagagatc ctatagagca agctaaaaga taccaacaag ctggttggaa tgcttattta     420
aacagattta aatgattctt acacaaagat ttgatacatg tacactagtt taaataagca     480
tgaaaagaat tacacaagca aaaaaaaaaa aataaatgag gtactttacg ttcacctaca     540
accaaaaaaa ctagatagag taaaatctta agatttagaa aaagttgttt aacaaaggct     600
ttagtatgtg aatttttaat gtagcaaagc gataactaat aaacataaac aaaagtatgg     660
ttttctttat cagtcaaatc attatcgatt gattgttccg cgtatctgca gataacttcg     720
tataatgtat gctatacgaa gttatagatc gcggccgcta acctgatcca aaaggggtat     780
gtctattttt tagagtgtgt ctttgtgtca aattatggta gaatgtgtaa agtagtataa     840
actttcctct caaatgacga ggtttaaaac accccccggg tgagccgagc cgagaatggg     900
gcaattgttc aatgtgaaat agaagtatcg agtgagaaac ttgggtgttg gccagccaag     960
ggggaaggaa aatggcgcga atgctcaggt gagattgttt tggaattggg tgaagcgagg    1020
aaatgagcga cccggaggtt gtgactttag tggcggagga ggacggagga aaagccaaga    1080
gggaagtgta tataagggga gcaatttgcc accaggatag aattggatga gttataattc    1140
tactgtattt attgtataat ttatttctcc ttttatatca aacacattac aaaacacaca    1200
aaacacacaa acaaacacaa ttacaaaaag ctagcatgac tagaactttt gaattgggtg    1260
aattgattcg ttctgataac ttcattgatg gtgcttggac tccagcacaa gacaacttga    1320
ggttcgctgt cactaaccca gcttctggag agataattgc tgaggtcgct gactcttctc    1380
cagctgatgc aagagctgct actgatgcag ctgcaagagc tttgccagct tggagggcta    1440
gattgccaaa agagagagct gcagtcttgc atcgttggca cgctttgata atggctaact    1500
tggatgcatt gggtgcttta atatctttgg agcaaggtaa acctttggct gagggtaagg    1560
gtgaggtcgc ttatggtgct tcttacgtcg catggttcgc agaggaagca acaagaattt    1620
acggtgattt gattcctcag caacagaggg gtaagaggat gactgctgtc aaagagccag    1680
tcggagtcgt tgctgctatt acaccatgga attggccatt ggcaatgatt gcaagaaaga    1740
tagcacccgc tttggcagct ggttgtactg ttgtcgctaa gccagctgag gacactccat    1800
tgactgcttc agctttggtc ttgttggctc acgaagctgg tgttccaccc ggtgttttga    1860
acttgattac tgcatcccgt gatcatgctg tcgcagcagt cgctgagtgg ttgcatgacg    1920
ctagagttag aaaaattact tttactggat caactccagt cggtaagtac ttggctaggg    1980
aatctgctga aactttaaag aagttatctt tggagttggg tggtaatgct ccatttattg    2040
tttttgacga tgctgacttg gaggctgcag tcgctggttt gatggctgct aagtttcgta    2100
acggtggtca gacttgtgtc tgtccaaatc gtgtctacgt ccaagctggt gtctacgaga    2160
ggtttggtgc tttgttggct gaaagggttg gtgctttgaa ggttgctcca gcaactgatc    2220
cagctgctca aattggtcca atgattaatt ctagggcatt ggacaagatt gctaggcacg    2280
tcgatgacgc tgttgcacat ggtgctagag tcttgactgg tggtaagagg ttggcagagt    2340
tgggtccaca ctactacgct ccaactgttt tggctgacgc aacagcagca atgcagttga    2400
actcagagga aactttcggt ccaatagtcc cattgtttcg tttcgaggac gaggctgaag    2460
cagttaacgc tgctaacgac actccattcg gtttagcagc ttatttttat tctgaaggtg    2520
tcaaaagaat tgatagggtc gctagggctt tggaagctgg tattgttggt ataaatgaag    2580
gtgcagtcgc ttcagaagct gctccattcg gtggtgttaa ggaatctggt tacggtagag    2640
aaggttctaa atatggtttg gatgattatt tgtctattaa atatttatgt caaggtaatt    2700
tagaataacg atcgtaagcg gcgaatctct ggctcatggg ggatatcctc tttgtttggc    2760
ttttttttcc cattctctgt tttgattatc taatgactca ttgggaggat tttctcactt    2820
caagcttttt tttcttgcac tctttcataa ctccagctct ctctaactga ggctacaatg    2880
ccttttaacg aacttatgag acgtttctaa attatatagg tatatgccaa tatataatta    2940
cacataaaaa taaatataaa taaaatataa aaataaaaat aaacatcgaa aaagaagatg    3000
tgaaattgcg aagactagaa agcacaaacg agcggtctat atcggcgact cgaggctcta    3060
caagcctcat atgggttcaa tgggtctgca atgaccgcat acggacttgg acaattacct    3120
tctattgaat ttctgagaag agatacatct gaccagcaat gtaagcagac aatcccaatt    3180
ctgtaaacaa cctctttgtc cataattccc catcagaaga gtgaaaaatg ccctcaaaac    3240
gcatgcgcca ctcccacctc tcagctgcac tgcgccacct ctgagggtcc tttcaggggt    3300
cgactacccc ggacacctcg cagaggagcg agatcacgta cttttaaaat ggcagagacg    3360
cgcagtttct tgaagaaagg ataaaaatga aatggtgcgg aaatgcgaaa atgatgaaaa    3420
attttcttgg tggcgaggaa attgagtgca ataattggca cgaggttgtt gccacccgag    3480
tgtgagtata tatcctagtt tctgcacttt tcttcttctt ttccttgcgt tttcttttca    3540
acttttttta ctttttcctt caacagacaa atctaactta tatatcacat ctagactatc    3600
ttaattaagt atagccatat agtttaattc ctttatactt tttataacta tttcttacac    3660
taattattat tatcaattat ttattgtaga acttgactct tgcgtcgatc accatgacag    3720
ggctatctta acaaggggta atttttgttg atggagtcaa gtagcattcc gacgggaagt    3780
gtcgatgcct ctgaacgaaa tcttccgatt agctctgcaa agaagtggaa attgtcagcg    3840
cagaattcgc cgaaaaggtc tatgagggaa ccgagccatt gtatggtacc gatattgcag    3900
aattgattct atttgcagtt tctacacctc aaaacactgt tattgcagaa acacttgttt    3960
ttgctagtaa ccaagcttct gcttaccata ttttcagagg atcattagat aaatagattt    4020
gatataaacg cttctataat aataaataac atcaacaagt gactaacccc aagtatccaa    4080
ttttagacct aatgcctgaa agtgccttgc acgatttagg attcggcacc agaaagtttg    4140
tgccgtgctc gcttattgga cttgtatagt gataaggcaa aaaaaaaaaa cttgaaagta    4200
cttgcgtaac tcagaggttg ccttttcg                                       4228

<210> 20
<211> 4228
<212> DNA
<213> Artificial Sequence

<220> 
<223> A synthetic oligonucleotide

<400> 20
ccacatggtg taacgggtgc aggtattgtt tctggcttga aggaggcagc ccaagaaaca      60
accagtgaac ctagaggttt gctaatgctt gctgagttat catcaaaggg ttctttagca     120
tatggtgaat atacagaaaa aacagtagaa attgctaaat ctgataaaga gtttgtcatt     180
ggttttattg cgcaacacga tatgggcggt agagaagaag gttttgactg gatcattatg     240
actccagggg ttggtttaga tgacaaaggt gatgcacttg gtcaacaata tagaactgtt     300
gatgaagttg taaagactgg aacggatatc ataattgttg gtagaggttt gtacggtcaa     360
ggaagagatc ctatagagca agctaaaaga taccaacaag ctggttggaa tgcttattta     420
aacagattta aatgattctt acacaaagat ttgatacatg tacactagtt taaataagca     480
tgaaaagaat tacacaagca aaaaaaaaaa aataaatgag gtactttacg ttcacctaca     540
accaaaaaaa ctagatagag taaaatctta agatttagaa aaagttgttt aacaaaggct     600
ttagtatgtg aatttttaat gtagcaaagc gataactaat aaacataaac aaaagtatgg     660
ttttctttat cagtcaaatc attatcgatt gattgttccg cgtatctgca gataacttcg     720
tataatgtat gctatacgaa gttatagatc gcggccgcta acctgatcca aaaggggtat     780
gtctattttt tagagtgtgt ctttgtgtca aattatggta gaatgtgtaa agtagtataa     840
actttcctct caaatgacga ggtttaaaac accccccggg tgagccgagc cgagaatggg     900
gcaattgttc aatgtgaaat agaagtatcg agtgagaaac ttgggtgttg gccagccaag     960
ggggaaggaa aatggcgcga atgctcaggt gagattgttt tggaattggg tgaagcgagg    1020
aaatgagcga cccggaggtt gtgactttag tggcggagga ggacggagga aaagccaaga    1080
gggaagtgta tataagggga gcaatttgcc accaggatag aattggatga gttataattc    1140
tactgtattt attgtataat ttatttctcc ttttatatca aacacattac aaaacacaca    1200
aaacacacaa acaaacacaa ttacaaaaag ctagcatgac tagaactttt gaattgggtg    1260
aattgattcg ttctgataac ttcattgatg gtgcttggac tccagcacaa gacaacttga    1320
ggttcgctgt cactaaccca gcttctggag agataattgc tgaggtcgct gactcttctc    1380
cagctgatgc aagagctgct actgatgcag ctgcaagagc tttgccagct tggagggcta    1440
gattgccaaa agagagagct gcagtcttgc atcgttggca cgctttgata atggctaact    1500
ctgatgcatt gggtgcttta atatctttgg agcaaggtaa acctttggct gagggtaagg    1560
gtgaggtcgc ttatggtgct tcttacgtcg catggttcgc agaggaagca acaagaattt    1620
acggtgattt gattcctcag caacagaggg gtaagaggat gactgctgtc aaagagccag    1680
tcggagtcgt tgctgctatt acaccatgga attggccatt ggcaatgatt gcaagaaaga    1740
tagcacccgc tttggcagct ggttgtactg ttgtcgctaa gccagctgag gacactccat    1800
tgactgcttc agctttggtc ttgttggctc acgaagctgg tgttccaccc ggtgttttga    1860
acttgattac tgcatcccgt gatcatgctg tcgcagcagt cgctgagtgg ttgcatgacg    1920
ctagagttag aaaaattact tttactggat caactccagt cggtaagtac ttggctaggg    1980
aatctgctga aactttaaag aagttatctt tggagttggg tggtaatgct ccatttattg    2040
tttttgacga tgctgacttg gaggctgcag tcgctggttt gatggctgct aagtttcgta    2100
actctggtca gacttgtgtc tgtccaaatc gtgtctacgt ccaagctggt gtctacgaga    2160
ggtttggtgc tttgttggct gaaagggttg gtgctttgaa ggttgctcca gcaactgatc    2220
cagctgctca aattggtcca atgattaatt ctagggcatt ggacaagatt gctaggcacg    2280
tcgatgacgc tgttgcacat ggtgctagag tcttgactgg tggtaagagg ttggcagagt    2340
tgggtccaca ctactacgct ccaactgttt tggctgacgc aacagcagca atgcagttga    2400
actcagagga aactttcggt ccaatagtcc cattgtttcg tttcgaggac gaggctgaag    2460
cagttaacgc tgctaacgac actccattcg gtttagcagc ttatttttat tctgaaggtg    2520
tcaaaagaat tgatagggtc gctagggctt tggaagctgg tattgttggt ataaatgaag    2580
gtgcagtcgc ttcagaagct gctccattcg gtggtgttaa ggaatctggt tacggtagag    2640
aaggttctaa atatggtttg gatgattatt tgtctattaa atatttatgt caaggtaatt    2700
tagaataacg atcgtaagcg gcgaatctct ggctcatggg ggatatcctc tttgtttggc    2760
ttttttttcc cattctctgt tttgattatc taatgactca ttgggaggat tttctcactt    2820
caagcttttt tttcttgcac tctttcataa ctccagctct ctctaactga ggctacaatg    2880
ccttttaacg aacttatgag acgtttctaa attatatagg tatatgccaa tatataatta    2940
cacataaaaa taaatataaa taaaatataa aaataaaaat aaacatcgaa aaagaagatg    3000
tgaaattgcg aagactagaa agcacaaacg agcggtctat atcggcgact cgaggctcta    3060
caagcctcat atgggttcaa tgggtctgca atgaccgcat acggacttgg acaattacct    3120
tctattgaat ttctgagaag agatacatct gaccagcaat gtaagcagac aatcccaatt    3180
ctgtaaacaa cctctttgtc cataattccc catcagaaga gtgaaaaatg ccctcaaaac    3240
gcatgcgcca ctcccacctc tcagctgcac tgcgccacct ctgagggtcc tttcaggggt    3300
cgactacccc ggacacctcg cagaggagcg agatcacgta cttttaaaat ggcagagacg    3360
cgcagtttct tgaagaaagg ataaaaatga aatggtgcgg aaatgcgaaa atgatgaaaa    3420
attttcttgg tggcgaggaa attgagtgca ataattggca cgaggttgtt gccacccgag    3480
tgtgagtata tatcctagtt tctgcacttt tcttcttctt ttccttgcgt tttcttttca    3540
acttttttta ctttttcctt caacagacaa atctaactta tatatcacat ctagactatc    3600
ttaattaagt atagccatat agtttaattc ctttatactt tttataacta tttcttacac    3660
taattattat tatcaattat ttattgtaga acttgactct tgcgtcgatc accatgacag    3720
ggctatctta acaaggggta atttttgttg atggagtcaa gtagcattcc gacgggaagt    3780
gtcgatgcct ctgaacgaaa tcttccgatt agctctgcaa agaagtggaa attgtcagcg    3840
cagaattcgc cgaaaaggtc tatgagggaa ccgagccatt gtatggtacc gatattgcag    3900
aattgattct atttgcagtt tctacacctc aaaacactgt tattgcagaa acacttgttt    3960
ttgctagtaa ccaagcttct gcttaccata ttttcagagg atcattagat aaatagattt    4020
gatataaacg cttctataat aataaataac atcaacaagt gactaacccc aagtatccaa    4080
ttttagacct aatgcctgaa agtgccttgc acgatttagg attcggcacc agaaagtttg    4140
tgccgtgctc gcttattgga cttgtatagt gataaggcaa aaaaaaaaaa cttgaaagta    4200
cttgcgtaac tcagaggttg ccttttcg                                       4228

<210> 21
<211> 490
<212> PRT
<213> Artificial Sequence

<220> 
<223> A synthetic polypeptide

<400> 21
Met Thr Arg Thr Phe Glu Leu Gly Glu Leu Ile Arg Ser Asp Asn Phe
 1               5                  10                  15
Ile Asp Gly Ala Trp Thr Pro Ala Gln Asp Asn Leu Arg Phe Ala Val
            20                  25                  30
Thr Asn Pro Ala Ser Gly Glu Ile Ile Ala Glu Val Ala Asp Ser Ser
        35                  40                  45
Pro Ala Asp Ala Arg Ala Ala Thr Asp Ala Ala Ala Arg Ala Leu Pro
    50                  55                  60
Ala Trp Arg Ala Arg Leu Pro Lys Glu Arg Ala Ala Val Leu His Arg
65                  70                  75                  80
Trp His Ala Leu Ile Met Ala Asn Ser Asp Ala Leu Gly Ala Leu Ile
                85                  90                  95
Ser Leu Glu Gln Gly Lys Pro Leu Ala Glu Gly Lys Gly Glu Val Ala
            100                 105                 110
Tyr Gly Ala Ser Tyr Val Ala Trp Phe Ala Glu Glu Ala Thr Arg Ile
        115                 120                 125
Tyr Gly Asp Leu Ile Pro Gln Gln Gln Arg Gly Lys Arg Met Thr Ala
    130                 135                 140
Val Lys Glu Pro Val Gly Val Val Ala Ala Ile Thr Pro Trp Asn Trp
145                 150                 155                 160
Pro Leu Ala Met Ile Ala Arg Lys Ile Ala Pro Ala Leu Ala Ala Gly
                165                 170                 175
Cys Thr Val Val Ala Lys Pro Ala Glu Asp Thr Pro Leu Thr Ala Ser
            180                 185                 190
Ala Leu Val Leu Leu Ala His Glu Ala Gly Val Pro Pro Gly Val Leu
        195                 200                 205
Asn Leu Ile Thr Ala Ser Arg Asp His Ala Val Ala Ala Val Ala Glu
    210                 215                 220
Trp Leu His Asp Ala Arg Val Arg Lys Ile Thr Phe Thr Gly Ser Thr
225                 230                 235                 240
Pro Val Gly Lys Tyr Leu Ala Arg Glu Ser Ala Glu Thr Leu Lys Lys
                245                 250                 255
Leu Ser Leu Glu Leu Gly Gly Asn Ala Pro Phe Ile Val Phe Asp Asp
            260                 265                 270
Ala Asp Leu Glu Ala Ala Val Ala Gly Leu Met Ala Ala Lys Phe Arg
        275                 280                 285
Asn Ser Gly Gln Thr Cys Val Cys Pro Asn Arg Val Tyr Val Gln Ala
    290                 295                 300
Gly Val Tyr Glu Arg Phe Gly Ala Leu Leu Ala Glu Arg Val Gly Ala
305                 310                 315                 320
Leu Lys Val Ala Pro Ala Thr Asp Pro Ala Ala Gln Ile Gly Pro Met
                325                 330                 335
Ile Asn Ser Arg Ala Leu Asp Lys Ile Ala Arg His Val Asp Asp Ala
            340                 345                 350
Val Ala His Gly Ala Arg Val Leu Thr Gly Gly Lys Arg Leu Ala Glu
        355                 360                 365
Leu Gly Pro His Tyr Tyr Ala Pro Thr Val Leu Ala Asp Ala Thr Ala
    370                 375                 380
Ala Met Gln Leu Asn Ser Glu Glu Thr Phe Gly Pro Ile Val Pro Leu
385                 390                 395                 400
Phe Arg Phe Glu Asp Glu Ala Glu Ala Val Asn Ala Ala Asn Asp Thr
                405                 410                 415
Pro Phe Gly Leu Ala Ala Tyr Phe Tyr Ser Glu Gly Val Lys Arg Ile
            420                 425                 430
Asp Arg Val Ala Arg Ala Leu Glu Ala Gly Ile Val Gly Ile Asn Glu
        435                 440                 445
Gly Ala Val Ala Ser Glu Ala Ala Pro Phe Gly Gly Val Lys Glu Ser
    450                 455                 460
Gly Tyr Gly Arg Glu Gly Ser Lys Tyr Gly Leu Asp Asp Tyr Leu Ser
465                 470                 475                 480
Ile Lys Tyr Leu Cys Gln Gly Asn Leu Glu
                485                 490

<210> 22
<211> 4228
<212> DNA
<213> Artificial Sequence

<220> 
<223> A synthetic oligonucleotide

<400> 22
ccacatggtg taacgggtgc aggtattgtt tctggcttga aggaggcagc ccaagaaaca      60
accagtgaac ctagaggttt gctaatgctt gctgagttat catcaaaggg ttctttagca     120
tatggtgaat atacagaaaa aacagtagaa attgctaaat ctgataaaga gtttgtcatt     180
ggttttattg cgcaacacga tatgggcggt agagaagaag gttttgactg gatcattatg     240
actccagggg ttggtttaga tgacaaaggt gatgcacttg gtcaacaata tagaactgtt     300
gatgaagttg taaagactgg aacggatatc ataattgttg gtagaggttt gtacggtcaa     360
ggaagagatc ctatagagca agctaaaaga taccaacaag ctggttggaa tgcttattta     420
aacagattta aatgattctt acacaaagat ttgatacatg tacactagtt taaataagca     480
tgaaaagaat tacacaagca aaaaaaaaaa aataaatgag gtactttacg ttcacctaca     540
accaaaaaaa ctagatagag taaaatctta agatttagaa aaagttgttt aacaaaggct     600
ttagtatgtg aatttttaat gtagcaaagc gataactaat aaacataaac aaaagtatgg     660
ttttctttat cagtcaaatc attatcgatt gattgttccg cgtatctgca gataacttcg     720
tataatgtat gctatacgaa gttatagatc gcggccgcta acctgatcca aaaggggtat     780
gtctattttt tagagtgtgt ctttgtgtca aattatggta gaatgtgtaa agtagtataa     840
actttcctct caaatgacga ggtttaaaac accccccggg tgagccgagc cgagaatggg     900
gcaattgttc aatgtgaaat agaagtatcg agtgagaaac ttgggtgttg gccagccaag     960
ggggaaggaa aatggcgcga atgctcaggt gagattgttt tggaattggg tgaagcgagg    1020
aaatgagcga cccggaggtt gtgactttag tggcggagga ggacggagga aaagccaaga    1080
gggaagtgta tataagggga gcaatttgcc accaggatag aattggatga gttataattc    1140
tactgtattt attgtataat ttatttctcc ttttatatca aacacattac aaaacacaca    1200
aaacacacaa acaaacacaa ttacaaaaag ctagcatgac tagaactttt gaattgggtg    1260
aattgattcg ttctgataac ttcattgatg gtgcttggat tccagcacaa gacaacttga    1320
ggttcgctgt cactaaccca gcttctggag agataattgc tgaggtcgct gactcttctc    1380
cagctgatgc aagagctgct actgatgcag ctgcaagagc tttgccagct tggagggtta    1440
gattgccaaa agagagagct gcagtcttgc atcgttggca cgctttgata atggctaact    1500
ctgatgcatt gggtgcttta atatctttgg agcaaggtaa acctttggct cagggtaagg    1560
gtgaggtcgc ttatggtgct tcttacgtcg catggttcgc agaggaagca acaagaattt    1620
acggtgattt gattcctcag caacagaggg gtaagaggat gactgctgtc aaagagccag    1680
tcggagtcgt tgctgctatt acaccatgga attggccatt ggcaatgatt gcaagaaaga    1740
tagcacccgc tttggcagct ggttgtactg ttgtcgctaa gccagctgag gacactccat    1800
tgactgcttc agctttggtc ttgttggctc acgaagctgg cgttccaccc ggtgttttga    1860
acttgattac tgcatcccgt gatcatgctg tcgcagcagt cgctgagtgg ttgcatgacg    1920
ctagagttag aaaaattact tttactggat caactccagt cggtaagtac ttggctaggg    1980
aatctgctga aactttaaag aagttatctt tggagttggg tggtaatgct ccatttattg    2040
tttttgacga tgctgacttg gaggcagcag tcgctggttt gatggctgct aagtttcgta    2100
actctggtca gacttgtgtc tgtccaaatc gtgtctacgt ccaagctgat gtctacgaga    2160
ggtttggtgc tttgttggct gaaagggttg gtgctttgaa ggttgctcca gcaactgatc    2220
cagctgctca aattggtcca atgattaatt ctagggcatt ggacaagatt gctaggcacg    2280
tcgatgacgc tgttgcacat ggtgctagag tcttgactgg tggtaagagg ttggcagagt    2340
tgggtccaca ctactacgct ccaactgttt tggctgacgc aacagcagca atgcagttga    2400
actcagagga aactttcggt ccaatagtcc cattgtttcg tttcgaggac gaggctgaag    2460
cagttaacgc tgctaacaac actccattcg gtttagcagc ttatttttat tctgaaggtg    2520
tcaaaagaat tgatagggtc gctagggctt tggaagctgg tattgttggt ataaatgaag    2580
gtgcagtcgc ttcagaagct gctccattcg gtggtgttaa ggaatctggt tacggtagag    2640
aaggttctaa atatggtttg gatgattatt tgtctattaa atatttatgt caaggtaatt    2700
tagaataacg atcgtaagcg gcgaatctct ggctcatggg ggatatcctc tttgtttggc    2760
ttttttttcc cattctctgt tttgattatc taatgactca ttgggaggat tttctcactt    2820
caagcttttt tttcttgcac tctttcataa ctccagctct ctctaactga ggctacaatg    2880
ccttttaacg aacttatgag acgtttctaa attatatagg tatatgccaa tatataatta    2940
cacataaaaa taaatataaa taaaatataa aaataaaaat aaacatcgaa aaagaagatg    3000
tgaaattgcg aagactagaa agcacaaacg agcggtctat atcggcgact cgaggctcta    3060
caagcctcat atgggttcaa tgggtctgca atgaccgcat acggacttgg acaattacct    3120
tctattgaat ttctgagaag agatacatct gaccagcaat gtaagcagac aatcccaatt    3180
ctgtaaacaa cctctttgtc cataattccc catcagaaga gtgaaaaatg ccctcaaaac    3240
gcatgcgcca ctcccacctc tcagctgcac tgcgccacct ctgagggtcc tttcaggggt    3300
cgactacccc ggacacctcg cagaggagcg agatcacgta cttttaaaat ggcagagacg    3360
cgcagtttct tgaagaaagg ataaaaatga aatggtgcgg aaatgcgaaa atgatgaaaa    3420
attttcttgg tggcgaggaa attgagtgca ataattggca cgaggttgtt gccacccgag    3480
tgtgagtata tatcctagtt tctgcacttt tcttcttctt ttccttgcgt tttcttttca    3540
acttttttta ctttttcctt caacagacaa atctaactta tatatcacat ctagactatc    3600
ttaattaagt atagccatat agtttaattc ctttatactt tttataacta tttcttacac    3660
taattattat tatcaattat ttattgtaga acttgactct tgcgtcgatc accatgacag    3720
ggctatctta acaaggggta atttttgttg atggagtcaa gtagcattcc gacgggaagt    3780
gtcgatgcct ctgaacgaaa tcttccgatt agctctgcaa agaagtggaa attgtcagcg    3840
cagaattcgc cgaaaaggtc tatgagggaa ccgagccatt gtatggtacc gatattgcag    3900
aattgattct atttgcagtt tctacacctc aaaacactgt tattgcagaa acacttgttt    3960
ttgctagtaa ccaagcttct gcttaccata ttttcagagg atcattagat aaatagattt    4020
gatataaacg cttctataat aataaataac atcaacaagt gactaacccc aagtatccaa    4080
ttttagacct aatgcctgaa agtgccttgc acgatttagg attcggcacc agaaagtttg    4140
tgccgtgctc gcttattgga cttgtatagt gataaggcaa aaaaaaaaaa cttgaaagta    4200
cttgcgtaac tcagaggttg ccttttcg                                       4228

<210> 23
<211> 490
<212> PRT
<213> Artificial Sequence

<220> 
<223> A synthetic polypeptide

<400> 23
Met Thr Arg Thr Phe Glu Leu Gly Glu Leu Ile Arg Ser Asp Asn Phe
 1               5                  10                  15
Ile Asp Gly Ala Trp Ile Pro Ala Gln Asp Asn Leu Arg Phe Ala Val
            20                  25                  30
Thr Asn Pro Ala Ser Gly Glu Ile Ile Ala Glu Val Ala Asp Ser Ser
        35                  40                  45
Pro Ala Asp Ala Arg Ala Ala Thr Asp Ala Ala Ala Arg Ala Leu Pro
    50                  55                  60
Ala Trp Arg Val Arg Leu Pro Lys Glu Arg Ala Ala Val Leu His Arg
65                  70                  75                  80
Trp His Ala Leu Ile Met Ala Asn Ser Asp Ala Leu Gly Ala Leu Ile
                85                  90                  95
Ser Leu Glu Gln Gly Lys Pro Leu Ala Gln Gly Lys Gly Glu Val Ala
            100                 105                 110
Tyr Gly Ala Ser Tyr Val Ala Trp Phe Ala Glu Glu Ala Thr Arg Ile
        115                 120                 125
Tyr Gly Asp Leu Ile Pro Gln Gln Gln Arg Gly Lys Arg Met Thr Ala
    130                 135                 140
Val Lys Glu Pro Val Gly Val Val Ala Ala Ile Thr Pro Trp Asn Trp
145                 150                 155                 160
Pro Leu Ala Met Ile Ala Arg Lys Ile Ala Pro Ala Leu Ala Ala Gly
                165                 170                 175
Cys Thr Val Val Ala Lys Pro Ala Glu Asp Thr Pro Leu Thr Ala Ser
            180                 185                 190
Ala Leu Val Leu Leu Ala His Glu Ala Gly Val Pro Pro Gly Val Leu
        195                 200                 205
Asn Leu Ile Thr Ala Ser Arg Asp His Ala Val Ala Ala Val Ala Glu
    210                 215                 220
Trp Leu His Asp Ala Arg Val Arg Lys Ile Thr Phe Thr Gly Ser Thr
225                 230                 235                 240
Pro Val Gly Lys Tyr Leu Ala Arg Glu Ser Ala Glu Thr Leu Lys Lys
                245                 250                 255
Leu Ser Leu Glu Leu Gly Gly Asn Ala Pro Phe Ile Val Phe Asp Asp
            260                 265                 270
Ala Asp Leu Glu Ala Ala Val Ala Gly Leu Met Ala Ala Lys Phe Arg
        275                 280                 285
Asn Ser Gly Gln Thr Cys Val Cys Pro Asn Arg Val Tyr Val Gln Ala
    290                 295                 300
Asp Val Tyr Glu Arg Phe Gly Ala Leu Leu Ala Glu Arg Val Gly Ala
305                 310                 315                 320
Leu Lys Val Ala Pro Ala Thr Asp Pro Ala Ala Gln Ile Gly Pro Met
                325                 330                 335
Ile Asn Ser Arg Ala Leu Asp Lys Ile Ala Arg His Val Asp Asp Ala
            340                 345                 350
Val Ala His Gly Ala Arg Val Leu Thr Gly Gly Lys Arg Leu Ala Glu
        355                 360                 365
Leu Gly Pro His Tyr Tyr Ala Pro Thr Val Leu Ala Asp Ala Thr Ala
    370                 375                 380
Ala Met Gln Leu Asn Ser Glu Glu Thr Phe Gly Pro Ile Val Pro Leu
385                 390                 395                 400
Phe Arg Phe Glu Asp Glu Ala Glu Ala Val Asn Ala Ala Asn Asn Thr
                405                 410                 415
Pro Phe Gly Leu Ala Ala Tyr Phe Tyr Ser Glu Gly Val Lys Arg Ile
            420                 425                 430
Asp Arg Val Ala Arg Ala Leu Glu Ala Gly Ile Val Gly Ile Asn Glu
        435                 440                 445
Gly Ala Val Ala Ser Glu Ala Ala Pro Phe Gly Gly Val Lys Glu Ser
    450                 455                 460
Gly Tyr Gly Arg Glu Gly Ser Lys Tyr Gly Leu Asp Asp Tyr Leu Ser
465                 470                 475                 480
Ile Lys Tyr Leu Cys Gln Gly Asn Leu Glu
                485                 490

<210> 24
<211> 1180
<212> PRT
<213> Issachenkia orientalis

<400> 24
Met Ser Thr Val Glu Asp His Ser Ser Leu His Lys Leu Arg Lys Glu
 1               5                  10                  15
Ser Glu Ile Leu Ser Asn Ala Asn Lys Ile Leu Val Ala Asn Arg Gly
            20                  25                  30
Glu Ile Pro Ile Arg Ile Phe Arg Ser Ala His Glu Leu Ser Met His
        35                  40                  45
Thr Val Ala Ile Tyr Ser His Glu Asp Arg Leu Ser Met His Arg Leu
    50                  55                  60
Lys Ala Asp Glu Ala Tyr Ala Ile Gly Lys Thr Gly Gln Tyr Ser Pro
65                  70                  75                  80
Val Gln Ala Tyr Leu Gln Ile Asp Glu Ile Ile Lys Ile Ala Lys Glu
                85                  90                  95
His Asp Val Ser Met Ile His Pro Gly Tyr Gly Phe Leu Ser Glu Asn
            100                 105                 110
Ser Glu Phe Ala Lys Lys Val Glu Glu Ser Gly Met Ile Trp Val Gly
        115                 120                 125
Pro Pro Ala Glu Val Ile Asp Ser Val Gly Asp Lys Val Ser Ala Arg
    130                 135                 140
Asn Leu Ala Ile Lys Cys Asp Val Pro Val Val Pro Gly Thr Asp Gly
145                 150                 155                 160
Pro Ile Glu Asp Ile Glu Gln Ala Lys Gln Phe Val Glu Gln Tyr Gly
                165                 170                 175
Tyr Pro Val Ile Ile Lys Ala Ala Phe Gly Gly Gly Gly Arg Gly Met
            180                 185                 190
Arg Val Val Arg Glu Gly Asp Asp Ile Val Asp Ala Phe Gln Arg Ala
        195                 200                 205
Ser Ser Glu Ala Lys Ser Ala Phe Gly Asn Gly Thr Cys Phe Ile Glu
    210                 215                 220
Arg Phe Leu Asp Lys Pro Lys His Ile Glu Val Gln Leu Leu Ala Asp
225                 230                 235                 240
Asn Tyr Gly Asn Thr Ile His Leu Phe Glu Arg Asp Cys Ser Val Gln
                245                 250                 255
Arg Arg His Gln Lys Val Val Glu Ile Ala Pro Ala Lys Thr Leu Pro
            260                 265                 270
Val Glu Val Arg Asn Ala Ile Leu Lys Asp Ala Val Thr Leu Ala Lys
        275                 280                 285
Thr Ala Asn Tyr Arg Asn Ala Gly Thr Ala Glu Phe Leu Val Asp Ser
    290                 295                 300
Gln Asn Arg His Tyr Phe Ile Glu Ile Asn Pro Arg Ile Gln Val Glu
305                 310                 315                 320
His Thr Ile Thr Glu Glu Ile Thr Gly Val Asp Ile Val Ala Ala Gln
                325                 330                 335
Ile Gln Ile Ala Ala Gly Ala Ser Leu Glu Gln Leu Gly Leu Leu Gln
            340                 345                 350
Asn Lys Ile Thr Thr Arg Gly Phe Ala Ile Gln Cys Arg Ile Thr Thr
        355                 360                 365
Glu Asp Pro Ala Lys Asn Phe Ala Pro Asp Thr Gly Lys Ile Glu Val
    370                 375                 380
Tyr Arg Ser Ala Gly Gly Asn Gly Val Arg Leu Asp Gly Gly Asn Gly
385                 390                 395                 400
Phe Ala Gly Ala Val Ile Ser Pro His Tyr Asp Ser Met Leu Val Lys
                405                 410                 415
Cys Ser Thr Ser Gly Ser Asn Tyr Glu Ile Ala Arg Arg Lys Met Ile
            420                 425                 430
Arg Ala Leu Val Glu Phe Arg Ile Arg Gly Val Lys Thr Asn Ile Pro
        435                 440                 445
Phe Leu Leu Ala Leu Leu Thr His Pro Val Phe Ile Ser Gly Asp Cys
    450                 455                 460
Trp Thr Thr Phe Ile Asp Asp Thr Pro Ser Leu Phe Glu Met Val Ser
465                 470                 475                 480
Ser Lys Asn Arg Ala Gln Lys Leu Leu Ala Tyr Ile Gly Asp Leu Cys
                485                 490                 495
Val Asn Gly Ser Ser Ile Lys Gly Gln Ile Gly Phe Pro Lys Leu Asn
            500                 505                 510
Lys Glu Ala Glu Ile Pro Asp Leu Leu Asp Pro Asn Asp Glu Val Ile
        515                 520                 525
Asp Val Ser Lys Pro Ser Thr Asn Gly Leu Arg Pro Tyr Leu Leu Lys
    530                 535                 540
Tyr Gly Pro Asp Ala Phe Ser Lys Lys Val Arg Glu Phe Asp Gly Cys
545                 550                 555                 560
Met Ile Met Asp Thr Thr Trp Arg Asp Ala His Gln Ser Leu Leu Ala
                565                 570                 575
Thr Arg Val Arg Thr Ile Asp Leu Leu Arg Ile Ala Pro Thr Thr Ser
            580                 585                 590
His Ala Leu Gln Asn Ala Phe Ala Leu Glu Cys Trp Gly Gly Ala Thr
        595                 600                 605
Phe Asp Val Ala Met Arg Phe Leu Tyr Glu Asp Pro Trp Glu Arg Leu
    610                 615                 620
Arg Gln Leu Arg Lys Ala Val Pro Asn Ile Pro Phe Gln Met Leu Leu
625                 630                 635                 640
Arg Gly Ala Asn Gly Val Ala Tyr Ser Ser Leu Pro Asp Asn Ala Ile
                645                 650                 655
Asp His Phe Val Lys Gln Ala Lys Asp Asn Gly Val Asp Ile Phe Arg
            660                 665                 670
Val Phe Asp Ala Leu Asn Asp Leu Glu Gln Leu Lys Val Gly Val Asp
        675                 680                 685
Ala Val Lys Lys Ala Gly Gly Val Val Glu Ala Thr Val Cys Tyr Ser
    690                 695                 700
Gly Asp Met Leu Ile Pro Gly Lys Lys Tyr Asn Leu Asp Tyr Tyr Leu
705                 710                 715                 720
Glu Thr Val Gly Lys Ile Val Glu Met Gly Thr His Ile Leu Gly Ile
                725                 730                 735
Lys Asp Met Ala Gly Thr Leu Lys Pro Lys Ala Ala Lys Leu Leu Ile
            740                 745                 750
Gly Ser Ile Arg Ser Lys Tyr Pro Asp Leu Val Ile His Val His Thr
        755                 760                 765
His Asp Ser Ala Gly Thr Gly Ile Ser Thr Tyr Val Ala Cys Ala Leu
    770                 775                 780
Ala Gly Ala Asp Ile Val Asp Cys Ala Ile Asn Ser Met Ser Gly Leu
785                 790                 795                 800
Thr Ser Gln Pro Ser Met Ser Ala Phe Ile Ala Ala Leu Asp Gly Asp
                805                 810                 815
Ile Glu Thr Gly Val Pro Glu His Phe Ala Arg Gln Leu Asp Ala Tyr
            820                 825                 830
Trp Ala Glu Met Arg Leu Leu Tyr Ser Cys Phe Glu Ala Asp Leu Lys
        835                 840                 845
Gly Pro Asp Pro Glu Val Tyr Lys His Glu Ile Pro Gly Gly Gln Leu
    850                 855                 860
Thr Asn Leu Ile Phe Gln Ala Gln Gln Val Gly Leu Gly Glu Gln Trp
865                 870                 875                 880
Glu Glu Thr Lys Lys Lys Tyr Glu Asp Ala Asn Met Leu Leu Gly Asp
                885                 890                 895
Ile Val Lys Val Thr Pro Thr Ser Lys Val Val Gly Asp Leu Ala Gln
            900                 905                 910
Phe Met Val Ser Asn Lys Leu Glu Lys Glu Asp Val Glu Lys Leu Ala
        915                 920                 925
Asn Glu Leu Asp Phe Pro Asp Ser Val Leu Asp Phe Phe Glu Gly Leu
    930                 935                 940
Met Gly Thr Pro Tyr Gly Gly Phe Pro Glu Pro Leu Arg Thr Asn Val
945                 950                 955                 960
Ile Ser Gly Lys Arg Arg Lys Leu Lys Gly Arg Pro Gly Leu Glu Leu
                965                 970                 975
Glu Pro Phe Asn Leu Glu Glu Ile Arg Glu Asn Leu Val Ser Arg Phe
            980                 985                 990
Gly Pro Gly Ile Thr Glu Cys Asp Val Ala Ser Tyr Asn Met Tyr Pro
        995                 1000                1005
Lys Val Tyr Glu Gln Tyr Arg Lys Val Val Glu Lys Tyr Gly Asp Leu
    1010                1015                1020
Ser Val Leu Pro Thr Lys Ala Phe Leu Ala Pro Pro Thr Ile Gly Glu
1025                1030                1035                1040
Glu Val His Val Glu Ile Glu Gln Gly Lys Thr Leu Ile Ile Lys Leu
                1045                1050                1055
Leu Ala Ile Ser Asp Leu Ser Lys Ser His Gly Thr Arg Glu Val Tyr
            1060                1065                1070
Phe Glu Leu Asn Gly Glu Met Arg Lys Val Thr Ile Glu Asp Lys Thr
        1075                1080                1085
Ala Ala Ile Glu Thr Val Thr Arg Ala Lys Ala Asp Gly His Asn Pro
    1090                1095                1100
Asn Glu Val Gly Ala Pro Met Ala Gly Val Val Val Glu Val Arg Val
1105                1110                1115                1120
Lys His Gly Thr Glu Val Lys Lys Gly Asp Pro Leu Ala Val Leu Ser
                1125                1130                1135
Ala Met Lys Met Glu Met Val Ile Ser Ala Pro Val Ser Gly Arg Val
            1140                1145                1150
Gly Glu Val Phe Val Asn Glu Gly Asp Ser Val Asp Met Gly Asp Leu
        1155                1160                1165
Leu Val Lys Ile Ala Lys Asp Glu Ala Pro Ala Ala
    1170                1175                1180

<210> 25
<211> 419
<212> PRT
<213> Issachenkia orientalis

<400> 25
Met Ser Arg Gly Phe Phe Thr Glu Asn Ile Thr Gln Leu Pro Pro Asp
 1               5                  10                  15
Pro Leu Phe Gly Leu Lys Ala Arg Phe Ser Asn Asp Ser Arg Glu Asn
            20                  25                  30
Lys Val Asp Leu Gly Ile Gly Ala Tyr Arg Asp Asp Asn Gly Lys Pro
        35                  40                  45
Trp Ile Leu Pro Ser Val Arg Leu Ala Glu Asn Leu Ile Gln Asn Ser
    50                  55                  60
Pro Asp Tyr Asn His Glu Tyr Leu Pro Ile Gly Gly Leu Ala Asp Phe
65                  70                  75                  80
Thr Ser Ala Ala Ala Arg Val Val Phe Gly Gly Asp Ser Lys Ala Ile
                85                  90                  95
Ser Gln Asn Arg Leu Val Ser Ile Gln Ser Leu Ser Gly Thr Gly Ala
            100                 105                 110
Leu His Val Ala Gly Leu Phe Ile Lys Arg Gln Tyr Lys Ser Leu Asp
        115                 120                 125
Gly Thr Ser Glu Asp Pro Leu Ile Tyr Leu Ser Glu Pro Thr Trp Ala
    130                 135                 140
Asn His Val Gln Ile Phe Glu Val Ile Gly Leu Lys Pro Val Phe Tyr
145                 150                 155                 160
Pro Tyr Trp His Ala Ala Ser Lys Thr Leu Asp Leu Lys Gly Tyr Leu
                165                 170                 175
Lys Ala Ile Asn Asp Ala Pro Glu Gly Ser Val Phe Val Leu His Ala
            180                 185                 190
Thr Ala His Asn Pro Thr Gly Leu Asp Pro Thr Gln Glu Gln Trp Met
        195                 200                 205
Glu Ile Leu Ala Ala Ile Ser Ala Lys Lys His Leu Pro Leu Phe Asp
    210                 215                 220
Cys Ala Tyr Gln Gly Phe Thr Ser Gly Ser Leu Asp Arg Asp Ala Trp
225                 230                 235                 240
Ala Val Arg Glu Ala Val Asn Asn Asp Lys Tyr Glu Phe Pro Gly Ile
                245                 250                 255
Ile Val Cys Gln Ser Phe Ala Lys Asn Val Gly Met Tyr Gly Glu Arg
            260                 265                 270
Ile Gly Ala Val His Ile Val Leu Pro Glu Ser Asp Ala Ser Leu Asn
        275                 280                 285
Ser Ala Ile Phe Ser Gln Leu Gln Lys Thr Ile Arg Ser Glu Ile Ser
    290                 295                 300
Asn Pro Pro Gly Tyr Gly Ala Lys Ile Val Ser Lys Val Leu Asn Thr
305                 310                 315                 320
Pro Glu Leu Tyr Lys Gln Trp Glu Gln Asp Leu Ile Thr Met Ser Ser
                325                 330                 335
Arg Ile Thr Ala Met Arg Lys Glu Leu Val Asn Glu Leu Glu Arg Leu
            340                 345                 350
Gly Thr Pro Gly Thr Trp Arg His Ile Thr Glu Gln Gln Gly Met Phe
        355                 360                 365
Ser Phe Thr Gly Leu Asn Pro Glu Gln Val Ala Lys Leu Glu Lys Glu
    370                 375                 380
His Gly Val Tyr Leu Val Arg Ser Gly Arg Ala Ser Ile Ala Gly Leu
385                 390                 395                 400
Asn Met Gly Asn Val Lys Tyr Val Ala Lys Ala Ile Asp Ser Val Val
                405                 410                 415
Arg Asp Leu


<210> 26
<211> 344
<212> PRT
<213> Issachenkia orientalis

<400> 26
Met Ser Lys Pro Lys Val Leu Leu Ile Gly Phe Gly Gly Val Gly Thr
 1               5                  10                  15
Ile Val Ser Tyr Thr Leu Glu His Leu Gly Arg Ala Glu Val Ser Ala
            20                  25                  30
Val Ser Arg Pro Glu Thr His Asp Ser Ile Val Asn Gly Phe Arg Ile
        35                  40                  45
Glu Ser Ile Asp Tyr Gly Ile Val Glu Asn Tyr Val Pro Thr Asn Val
    50                  55                  60
Tyr Val Thr Ala Lys Glu Ala Tyr Lys Gln Gln Gly Pro Phe Asp Tyr
65                  70                  75                  80
Ile Ile Ile Thr Thr Lys Asn Ile Pro Asp Ile Ala Pro Val Val Asp
                85                  90                  95
Met Ile Asp Gly Cys Tyr Asn Glu Lys Ser Val Ile Val Leu Ile Gln
            100                 105                 110
Asn Gly Ile Gly Ile Glu Ile Pro Ile Tyr Arg Arg Tyr Pro Asn Ala
        115                 120                 125
Ile Ile Leu Ser Gly Val Thr Leu Ile Gly Thr Thr Leu Tyr Glu Ala
    130                 135                 140
Thr Val Lys His Val Ala Arg Asp Asp Ile Lys Phe Gly Pro Phe Ile
145                 150                 155                 160
Asn Tyr Asn Leu Asp Lys Gln Leu Gln Ile Asn Lys Cys Lys Glu Phe
                165                 170                 175
Ile Glu Leu Tyr Glu Asn Asp Lys Asn Leu Val Glu Tyr Glu Glu Asp
            180                 185                 190
Val Lys Phe Thr Arg Trp Arg Lys Leu Val Tyr Asn Ala Cys Ile Asn
        195                 200                 205
Thr Thr Cys Ala Leu Ala Asn Leu Asp Ala Gly Arg Val Gln Ile Phe
    210                 215                 220
Gly Gly Phe Glu Thr Leu Val Lys Pro Ala Met Leu Glu Val Ile Ala
225                 230                 235                 240
Val Ala Lys Ser Glu Gly Val Glu Leu Pro Ala Lys Glu Val Met Asp
                245                 250                 255
Thr Met Cys Asn Met Gly Lys Asp Val Tyr Tyr Pro Pro Ser Met Leu
            260                 265                 270
Ile Asp Val Arg Asn Gly Thr Tyr Leu Glu His Ile Val Ile Ile Gly
        275                 280                 285
Asn Val Val Lys Tyr Gly Ser Arg Asn Gly Val Pro Ile Pro Thr Leu
    290                 295                 300
Thr Val Leu Asn Asn Leu Leu Lys Leu Val Gln Met Arg Thr Met Glu
305                 310                 315                 320
Ala Asn Lys Arg Phe Val Leu Pro Glu Lys Arg Pro Leu Pro Glu Glu
                325                 330                 335
Asn Tyr Gln Ile Glu Tyr Leu Tyr
            340
