                         SEQUENCE LISTING

<110>  Universit de Genve
 
<120>  FlmG-dependent soluble protein O-glycosylation systems in 
       bacteria

<130>  PAT7460PC00

<150>  EP20202114.3
<151>  2020-10-15

<160>  33    

<170>  PatentIn version 3.5

<210>  1
<211>  596
<212>  PRT
<213>  Caulobacter crescentus

<400>  1

Met Ser Arg Lys Ser Ala Leu Glu Ser Ser Ala Ser Val Leu Ala Gln 
1               5                   10                  15      


Ala Asp Val Gly Ala Ser Gly Ile His Pro Ser Val Ile Ala Asp Ala 
            20                  25                  30          


Met Gly Asp Ser Ala Ser Ala Glu Ala Leu Glu Arg Leu Asn Arg Ala 
        35                  40                  45              


Ala Gln Asp Thr Lys Asn Val Asp Asn Ala Lys His Leu Ala Arg Ala 
    50                  55                  60                  


Ile Gln Ala Val Gln Leu Gln Asp Tyr Ala Lys Ala Asp Lys Leu Ala 
65                  70                  75                  80  


Leu Lys Leu Leu Glu Lys Asp Glu Arg Leu Gly Leu Ala Trp His Ile 
                85                  90                  95      


Leu Ala Ile Ala Arg Glu Lys Thr Gly Asp Phe Ala Ser Ser Leu Arg 
            100                 105                 110         


Ala Tyr Glu Ala Ala Leu Ala Leu Leu Pro Asp His Gly Pro Val Ala 
        115                 120                 125             


Gly Asp Leu Gly Arg Leu Ala Phe Arg Met Asn Met Pro Glu Leu Ala 
    130                 135                 140                 


Ala Lys Phe Phe Ala His Tyr Arg Leu Ala Arg Pro Asp Asp Val Glu 
145                 150                 155                 160 


Gly Ala Asn Asn Leu Ala Cys Ala Leu Arg Glu Leu Asn Arg Glu Ser 
                165                 170                 175     


Glu Ala Ile Glu Val Leu Lys Ala Ala Leu Gly Ala Asn Pro Glu Ala 
            180                 185                 190         


Ala Val Leu Trp Asn Thr Leu Gly Thr Val Leu Cys Asn Ile Gly Asp 
        195                 200                 205             


Ala Ala Gly Ser Ile Val Phe Phe Asp Glu Ser Leu Arg Leu Ala Pro 
    210                 215                 220                 


Asp Phe Ser Lys Ala Tyr His Asn Arg Ala Phe Ala Arg Leu Asp Leu 
225                 230                 235                 240 


Gly Glu Ile Glu Ala Ala Leu Ala Asp Cys Glu Ala Ala Met Arg Ser 
                245                 250                 255     


Pro Gly Ser Pro Glu Asp Leu Ala Met Met Gln Phe Ala Arg Ala Thr 
            260                 265                 270         


Ile Leu Leu Ala Leu Gly Arg Val Gly Glu Gly Trp Glu Ala Tyr Glu 
        275                 280                 285             


Ser Arg Phe Ser Pro Ala Leu Ser Asp Ala Pro Arg Phe Gln Ile Pro 
    290                 295                 300                 


Gly Val Arg Trp Ser Gly Gln Asp Leu Arg Gly Lys Arg Leu Met Ile 
305                 310                 315                 320 


Thr Thr Glu Gln Gly Leu Gly Asp Glu Val Met Phe Ala Asn Met Leu 
                325                 330                 335     


Pro Asp Ile Val Glu Ala Leu Gly Pro Asp Gly Phe Leu Ser Leu Ala 
            340                 345                 350         


Val Glu Arg Arg Leu Ala Pro Leu Phe Glu Arg Thr Phe Pro Lys Val 
        355                 360                 365             


Glu Val Thr Ala His Arg Thr Ile Ala Tyr Glu Gly Arg Val Phe Arg 
    370                 375                 380                 


Ala Ala Pro Tyr Ile Glu Asn Trp Asp Arg Phe Asp Tyr Trp Ala Ala 
385                 390                 395                 400 


Ile Gly Asp Phe Leu Pro Ser Leu Arg Pro Thr Ala Glu Ala Phe Pro 
                405                 410                 415     


Lys Arg Asn Ala Phe Leu Gln Pro Asp Pro Ala Arg Val Ala His Trp 
            420                 425                 430         


Lys Ala Gln Leu Glu Lys Leu Gly Pro Gly Pro Lys Val Gly Leu Leu 
        435                 440                 445             


Trp Lys Ser Leu Lys Leu Asn Ala Glu Arg Ala Arg Gln Phe Ser Pro 
    450                 455                 460                 


Phe His Leu Trp Glu Pro Val Leu His Thr Pro Gly Val Val Phe Val 
465                 470                 475                 480 


Asn Leu Gln Tyr Gly Asp Cys Glu Glu Glu Ile Ala Phe Ala Lys Glu 
                485                 490                 495     


Glu Leu Gly Val Glu Ile Trp Gln Pro Glu Gly Ile Asp Leu Lys Ala 
            500                 505                 510         


Asp Leu Asp Asp Val Ala Ala Leu Cys Ala Ala Val Asp Leu Val Ile 
        515                 520                 525             


Gly Phe Ser Asn Ala Thr Ile Asn Leu Ala Gly Ala Val Gly Thr Pro 
    530                 535                 540                 


Ile Phe Met Leu Thr Gly Ala Ser Ser Trp Thr Arg Leu Gly Thr Glu 
545                 550                 555                 560 


Tyr Tyr Pro Trp Tyr Pro Ser Val Arg Cys Phe Val Thr Glu Gln Tyr 
                565                 570                 575     


Gly Val Trp Glu Pro Thr Met Gly Arg Val Ala Thr Ala Leu Arg Asp 
            580                 585                 590         


Phe Ala Ala Ser 
        595     


<210>  2
<211>  1791
<212>  DNA
<213>  Caulobacter crescentus

<400>  2
atgtcccgta aaagcgccct ggaatcctcg gcaagcgtcc tggcccaggc cgatgtcggc       60

gcttcgggca tccaccccag cgttatcgcc gacgccatgg gcgattcggc gtccgccgag      120

gcgctggagc gcctgaatcg ggcggcgcag gacaccaaga acgtcgacaa cgccaagcac      180

ttggcgcgcg cgatccaggc cgtgcagctg caggactacg ccaaggccga caagctggcc      240

ctgaagctgc tggagaagga cgagcgactg ggcctagcct ggcacatcct ggcgattgca      300

cgcgagaaga ccggcgattt cgcctcctcg ctgcgggcct atgaagccgc gctggctctg      360

ctgcccgacc atggccccgt cgccggcgac ctgggccgct tggccttccg catgaacatg      420

ccggagctgg cggccaagtt cttcgcacac taccgtctcg ctcggcccga cgacgtcgag      480

ggcgccaaca acctggcgtg cgccctgcgc gagcttaatc gcgaaagcga agccatcgaa      540

gtcctcaagg ccgccctggg cgccaacccc gaggctgcgg tgctgtggaa cacgctgggc      600

acggtgcttt gcaatatcgg cgacgcggcg ggctcgatcg tgttcttcga cgagtccctg      660

cgcctcgcgc ccgacttttc gaaagcctat cacaaccgcg ccttcgccag gctcgatctg      720

ggcgagatag aggccgcgct ggccgattgc gaagccgcca tgcgcagccc cggctcaccg      780

gaagatctgg cgatgatgca gttcgcccgc gccacgattc tcctggctct gggccgcgtc      840

ggcgaaggct gggaggctta tgagtcacgc ttctcgccgg cgctgagcga cgcgccacgg      900

ttccagattc ctggcgtccg ctggtcagga caggacctca ggggcaagcg tttgatgatc      960

accaccgagc agggcctcgg cgacgaggtg atgttcgcca acatgttgcc cgacatcgtc     1020

gaagccttgg gcccagacgg cttcctgtcc ctggcggtcg agcgccgtct ggcgccgctg     1080

ttcgagcgca ccttcccgaa ggtcgaggtg accgcccacc gtacgatcgc ctacgaaggc     1140

cgcgtgttcc gggccgcgcc ctatatcgag aactgggacc gcttcgacta ttgggcggcc     1200

atcggcgact tcctgccgag ccttcgcccc accgccgagg cgtttcccaa gcgcaacgcc     1260

ttcctgcagc cggatccggc gcgggtggcc cactggaagg cccaactcga gaagcttggc     1320

cccggcccga aagtcggcct gctctggaag agcctgaaac tgaacgcgga acgcgcgcgg     1380

cagttttcgc ccttccacct gtgggagccg gttttgcaca cgccaggcgt ggtgttcgtg     1440

aacctgcagt atggcgactg cgaggaagag atcgccttcg ccaaggaaga gctgggcgtg     1500

gagatctggc agccggaagg cattgatctg aaggccgacc tcgacgacgt ggccgctctc     1560

tgcgcggcgg tggacctggt gatcgggttc tccaacgcca cgatcaatct ggccggtgcg     1620

gtggggacgc cgatcttcat gctgaccggc gcctcgtcct ggacccgcct cggcaccgaa     1680

tattacccct ggtatccgag cgttcgctgc ttcgtcaccg agcagtacgg ggtctgggaa     1740

ccgaccatgg gtcgcgtcgc caccgctctg cgcgatttcg ccgcatcctg a              1791


<210>  3
<211>  273
<212>  PRT
<213>  Caulobacter crescentus

<400>  3

Met Ala Leu Asn Ser Ile Asn Thr Asn Ala Gly Ala Met Ile Ala Leu 
1               5                   10                  15      


Gln Asn Leu Asn Gly Thr Asn Ser Glu Leu Thr Thr Val Gln Gln Arg 
            20                  25                  30          


Ile Asn Thr Gly Lys Lys Ile Ala Ser Ala Lys Asp Asn Gly Ala Ile 
        35                  40                  45              


Trp Ala Thr Ala Lys Asn Gln Ser Ala Thr Ala Ala Ser Met Asn Ala 
    50                  55                  60                  


Val Lys Asp Ser Leu Gln Arg Gly Gln Ser Thr Ile Asp Val Ala Leu 
65                  70                  75                  80  


Ala Ala Gly Asp Thr Ile Thr Asp Leu Leu Gly Lys Met Lys Glu Lys 
                85                  90                  95      


Ala Leu Ala Ala Ser Asp Thr Ser Leu Asn Thr Ala Ser Phe Asn Ala 
            100                 105                 110         


Leu Lys Ser Asp Phe Asp Ser Leu Arg Asp Gln Ile Glu Lys Ala Ala 
        115                 120                 125             


Thr Asn Ala Lys Phe Asn Gly Val Ser Ile Ala Asp Gly Ser Thr Thr 
    130                 135                 140                 


Lys Leu Thr Phe Leu Ala Asn Ser Asp Gly Ser Gly Phe Thr Val Asn 
145                 150                 155                 160 


Ala Lys Thr Ile Ser Leu Ala Gly Ile Gly Leu Thr Thr Thr Ser Thr 
                165                 170                 175     


Phe Thr Thr Ala Ala Ala Ala Lys Thr Met Ile Gly Thr Ile Asp Thr 
            180                 185                 190         


Ala Leu Gln Thr Ala Thr Asn Lys Leu Ala Ser Leu Gly Thr Ser Ser 
        195                 200                 205             


Val Gly Leu Asp Thr His Leu Thr Phe Val Gly Lys Leu Gln Asp Ser 
    210                 215                 220                 


Leu Asp Ala Gly Val Gly Asn Leu Val Asp Ala Asp Leu Ala Lys Glu 
225                 230                 235                 240 


Ser Ala Lys Leu Gln Ser Leu Gln Thr Lys Gln Gln Leu Gly Val Gln 
                245                 250                 255     


Ala Leu Ser Ile Ala Asn Gln Ser Ser Ser Ser Ile Leu Ser Leu Phe 
            260                 265                 270         


Arg 
    


<210>  4
<211>  822
<212>  DNA
<213>  Caulobacter crescentus

<400>  4
atggcgctga acagcatcaa tacgaacgcg ggcgcgatga tcgccctgca aaatctgaat       60

ggcacgaatt ccgagctgac gaccgttcag cagcggatca ataccggcaa gaagatcgcc      120

agcgccaagg acaatggcgc catctgggcg accgccaaga accagtcggc caccgccgcc      180

agcatgaacg ccgtgaagga ctcgctgcaa cgcggccagt cgacgatcga cgtcgcgctc      240

gccgccggcg acaccatcac cgacctgctc ggcaagatga aggaaaaggc cctggccgct      300

tccgacacct cgctgaacac cgcctcgttc aacgccctga agtcggactt cgactcgctg      360

cgtgaccaaa tcgaaaaggc cgcgacgaac gccaagttca acggtgtcag catcgcggac      420

ggttcgacca ccaagctgac cttcctggcc aactcggacg gcagcggctt cacggtcaac      480

gccaagacca tctccctggc gggtatcggt ctgacgacca cctcgacctt caccacggcc      540

gccgccgcca agacgatgat cggcaccatc gacacggcgc tgcagacggc gaccaacaag      600

ctggcctcgc tgggcaccag ctcggtcggt ctggacactc acctgacctt cgtcggcaag      660

ctgcaagaca gcctggacgc gggtgtgggc aacctggtgg acgccgacct cgccaaggaa      720

agcgccaagc tgcagtcgct gcaaaccaag cagcagctgg gcgtccaggc gctgtcgatc      780

gccaaccagt cttcgtcctc gatcctgagc ctgttccgtt aa                         822


<210>  5
<211>  821
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic sequence

<400>  5
atggctctga actctattaa caccaacgcc ggcgctatga ttgcactgca gaacctgaat       60

ggtaccaact ccgagctgac gactgttcag cagcgtatca acaccggtaa gaaaattgcg      120

tccgccaaag ataacggcgc aatctgggct accgcaaaaa accaatctgc cactgcagct      180

agcatgaacg ctgttaaaga ctccctgcag cgtggccagt ctaccatcga cgttgccctg      240

gccgccggtg acactatcac cgatctgctg ggtaaaatga aagaaaaggc tctggccgcg      300

tctgacacct ccctgaatac cgcgtctttt aacgccctga aatctgactt cgatagcctg      360

cgtgaccaga tcgagaaggc agcgactaat gctaaattca acggtgttag catcgccgac      420

ggctctacca ctaaactgac cttcctggcc aactctgacg gttctggttt caccgttaac      480

gccaaaacta tttctctggc tggtatcggc ctgaccacta cctccacctt taccaccgcg      540

gcggcggcga agaccatgat cggcactatc gatactgcgc tgcagactgc caccaacaaa      600

ctggcctctc tgggtacctc ctctgtgggt ctggacacgc atctgacttt tgtgggcaaa      660

ctgcaggata gcctggatgc gggcgttggt aacctggtgg atgctgacct ggcaaaggag      720

tctgctaagc tgcaatctct gcagactaaa caacagctgg gcgttcaggc actgtccatc      780

gccaaccaat cttcttcctc tattctgtct ctgtttcgct g                          821


<210>  6
<211>  107
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic sequence

<400>  6
ggatccagga tccggatgtg agcggataac aattacgagc ttcatgcaca gtgaaatcat       60

gaaaaattta ttggctttgt gagcggataa caattataat atgtgga                    107


<210>  7
<211>  1068
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic sequence

<400>  7
aagaaggaga taccatatgg ggcgttttag cccaaagagt ttggatctgg acggaaaggt       60

tatcttggta acgggcggta ctggaagctt cgggcgtcgt ttcatcgaga ctgtcttgcg      120

ccgttacgat ccccgcaaag ttatcgtcta ttcgcgcgat gaattaaaac agagtgacat      180

gcaaattgag cttcgcgagc aattcgatga ggccaccgta gcaaagatgc gtttttttct      240

gggcgacgtg cgtgatcgtg agcgtttaac gttagcgctt cgtggagtcg acattgtcat      300

tcatgcagcc gcacttaaac aggtaccagc ggcagaatat aatccctccg aatgtatcca      360

cacgaatgtg ttgggtgcgg aaaacgtagt atgggcgtca ctggctaacg ccgttaagca      420

ggtggtcgcc ttatctacgg acaaagcttg taatccgact aacctgtatg gtgcaacgaa      480

gttggcctct gacaagacgt tcgtggctgc caacaatctg agtggagaca tcgggacccg      540

cttttgcgtg gttcgctatg gtaacgtagt cgggtctcgc ggctcagtag taccacttta      600

tcgtcgtctg ttgagccaag gggcgacgga gttgccagtc acggaccctc gcatgacccg      660

cttctggatt acgttgaatg agggcgtgga cttcgtactt tcttcattga ccatgatgcg      720

cggaggcgag atttttgtgc cgaagatccc cagtatggca atgcctgatt tggtaaaagc      780

catgtctagc actgctgcaa tgaaggtaat cggtatccgc ccaggagaga aacttcatga      840

aatcatgatc agcgcggatg atgcccgcag caccgtggag ttcgatgacc gctatgcaat      900

cgaaccgaat ttcgcagaat ttggccgtga gccctacgca gcaagtgacg gcgctaaacc      960

cgtggccgag gacttcagct acagctcaga caataatcat gactggttgt ctcccgaagg     1020

cttgttagcc atgttagaag agaaggccac gtgaagatct cccccggg                  1068


<210>  8
<211>  345
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic sequence

<400>  8

Met Gly Arg Phe Ser Pro Lys Ser Leu Asp Leu Asp Gly Lys Val Ile 
1               5                   10                  15      


Leu Val Thr Gly Gly Thr Gly Ser Phe Gly Arg Arg Phe Ile Glu Thr 
            20                  25                  30          


Val Leu Arg Arg Tyr Asp Pro Arg Lys Val Ile Val Tyr Ser Arg Asp 
        35                  40                  45              


Glu Leu Lys Gln Ser Asp Met Gln Ile Glu Leu Arg Glu Gln Phe Asp 
    50                  55                  60                  


Glu Ala Thr Val Ala Lys Met Arg Phe Phe Leu Gly Asp Val Arg Asp 
65                  70                  75                  80  


Arg Glu Arg Leu Thr Leu Ala Leu Arg Gly Val Asp Ile Val Ile His 
                85                  90                  95      


Ala Ala Ala Leu Lys Gln Val Pro Ala Ala Glu Tyr Asn Pro Ser Glu 
            100                 105                 110         


Cys Ile His Thr Asn Val Leu Gly Ala Glu Asn Val Val Trp Ala Ser 
        115                 120                 125             


Leu Ala Asn Ala Val Lys Gln Val Val Ala Leu Ser Thr Asp Lys Ala 
    130                 135                 140                 


Cys Asn Pro Thr Asn Leu Tyr Gly Ala Thr Lys Leu Ala Ser Asp Lys 
145                 150                 155                 160 


Thr Phe Val Ala Ala Asn Asn Leu Ser Gly Asp Ile Gly Thr Arg Phe 
                165                 170                 175     


Cys Val Val Arg Tyr Gly Asn Val Val Gly Ser Arg Gly Ser Val Val 
            180                 185                 190         


Pro Leu Tyr Arg Arg Leu Leu Ser Gln Gly Ala Thr Glu Leu Pro Val 
        195                 200                 205             


Thr Asp Pro Arg Met Thr Arg Phe Trp Ile Thr Leu Asn Glu Gly Val 
    210                 215                 220                 


Asp Phe Val Leu Ser Ser Leu Thr Met Met Arg Gly Gly Glu Ile Phe 
225                 230                 235                 240 


Val Pro Lys Ile Pro Ser Met Ala Met Pro Asp Leu Val Lys Ala Met 
                245                 250                 255     


Ser Ser Thr Ala Ala Met Lys Val Ile Gly Ile Arg Pro Gly Glu Lys 
            260                 265                 270         


Leu His Glu Ile Met Ile Ser Ala Asp Asp Ala Arg Ser Thr Val Glu 
        275                 280                 285             


Phe Asp Asp Arg Tyr Ala Ile Glu Pro Asn Phe Ala Glu Phe Gly Arg 
    290                 295                 300                 


Glu Pro Tyr Ala Ala Ser Asp Gly Ala Lys Pro Val Ala Glu Asp Phe 
305                 310                 315                 320 


Ser Tyr Ser Ser Asp Asn Asn His Asp Trp Leu Ser Pro Glu Gly Leu 
                325                 330                 335     


Leu Ala Met Leu Glu Glu Lys Ala Thr 
            340                 345 


<210>  9
<211>  1190
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic sequence

<400>  9
aagaaggaga tataccatga caggcggatt tttaccttat gggcgtcaga ctattgagga       60

ggatgacatc gctgcggtag cggaagcatt gcgcggcgac tttctgacga ctggccctac      120

agtggaagct ttcgagacag cgttcgccgc taaagtcggc gctgatcacg caatcgcggt      180

atcgaacgga acagctacct tgcaccttgc catgatggct cttggtattg gtgaaggtga      240

tgtatgcgta gcaccaagtg tgacttttct ggcaaccgct aactgtgcac gttatgtagg      300

tgcggaagta gtgttcgccg acgttgaccc ggacagtggt cttatgacac cagacaccct      360

ggcgcgcgct ttggcaggtg cacgtgataa gcgtgttaaa gctgtacttc cagtacatct      420

gcgtggggac gtatgtgatc ttcccgcgtt gaaagcaatg gcatcagcga gcggcgccgt      480

gcttgtggaa gatgccccgc atgccctggg ttcgatcgct acctttgatg gcgtagcgca      540

tccagtcgga gatggtgcgt acagttcatt cgcaagtttc tcctttcacc ccgtaaagac      600

gctggccaca ggggagggag ggatgttgac caccaacgac cccgcactgg ccgcaaaggc      660

gcgtttgctt cgcagtcacg ggatggtccg ccagccgggt ggagatccgt ggtggtacga      720

gatgcccgaa ctgggattca attaccgcat tcctgatgtt ttatgtgcct taggtttatc      780

ccaactggcg aaacttgacc gttttgttgc acgccgtcgt gaccttactg ccctttacgc      840

tcgcttattg gcggagcgcg ctccccgtgc gcgtttagcc accagcccgg accactcaga      900

cgctgcctta cacctgttga cggttttaat tgatttcgag gccgagggta tttcccgccg      960

taccgtagtt gaatccctta aaactcaagg agtaggaacg caggtgcact acatcccggt     1020

gcaccgtcag ccatattatg cacagcgcta cggggtcgcc gacttgcccg gagctgacgc     1080

gtggtacgcc cgttgcttaa ccttgccgct gtatccagct atgactaatg gagacgttga     1140

gcgcgttgtc ggtgctttag ccactgtttt agggtgagct agcggagctc                1190


<210>  10
<211>  386
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic sequence

<400>  10

Met Thr Gly Gly Phe Leu Pro Tyr Gly Arg Gln Thr Ile Glu Glu Asp 
1               5                   10                  15      


Asp Ile Ala Ala Val Ala Glu Ala Leu Arg Gly Asp Phe Leu Thr Thr 
            20                  25                  30          


Gly Pro Thr Val Glu Ala Phe Glu Thr Ala Phe Ala Ala Lys Val Gly 
        35                  40                  45              


Ala Asp His Ala Ile Ala Val Ser Asn Gly Thr Ala Thr Leu His Leu 
    50                  55                  60                  


Ala Met Met Ala Leu Gly Ile Gly Glu Gly Asp Val Cys Val Ala Pro 
65                  70                  75                  80  


Ser Val Thr Phe Leu Ala Thr Ala Asn Cys Ala Arg Tyr Val Gly Ala 
                85                  90                  95      


Glu Val Val Phe Ala Asp Val Asp Pro Asp Ser Gly Leu Met Thr Pro 
            100                 105                 110         


Asp Thr Leu Ala Arg Ala Leu Ala Gly Ala Arg Asp Lys Arg Val Lys 
        115                 120                 125             


Ala Val Leu Pro Val His Leu Arg Gly Asp Val Cys Asp Leu Pro Ala 
    130                 135                 140                 


Leu Lys Ala Met Ala Ser Ala Ser Gly Ala Val Leu Val Glu Asp Ala 
145                 150                 155                 160 


Pro His Ala Leu Gly Ser Ile Ala Thr Phe Asp Gly Val Ala His Pro 
                165                 170                 175     


Val Gly Asp Gly Ala Tyr Ser Ser Phe Ala Ser Phe Ser Phe His Pro 
            180                 185                 190         


Val Lys Thr Leu Ala Thr Gly Glu Gly Gly Met Leu Thr Thr Asn Asp 
        195                 200                 205             


Pro Ala Leu Ala Ala Lys Ala Arg Leu Leu Arg Ser His Gly Met Val 
    210                 215                 220                 


Arg Gln Pro Gly Gly Asp Pro Trp Trp Tyr Glu Met Pro Glu Leu Gly 
225                 230                 235                 240 


Phe Asn Tyr Arg Ile Pro Asp Val Leu Cys Ala Leu Gly Leu Ser Gln 
                245                 250                 255     


Leu Ala Lys Leu Asp Arg Phe Val Ala Arg Arg Arg Asp Leu Thr Ala 
            260                 265                 270         


Leu Tyr Ala Arg Leu Leu Ala Glu Arg Ala Pro Arg Ala Arg Leu Ala 
        275                 280                 285             


Thr Ser Pro Asp His Ser Asp Ala Ala Leu His Leu Leu Thr Val Leu 
    290                 295                 300                 


Ile Asp Phe Glu Ala Glu Gly Ile Ser Arg Arg Thr Val Val Glu Ser 
305                 310                 315                 320 


Leu Lys Thr Gln Gly Val Gly Thr Gln Val His Tyr Ile Pro Val His 
                325                 330                 335     


Arg Gln Pro Tyr Tyr Ala Gln Arg Tyr Gly Val Ala Asp Leu Pro Gly 
            340                 345                 350         


Ala Asp Ala Trp Tyr Ala Arg Cys Leu Thr Leu Pro Leu Tyr Pro Ala 
        355                 360                 365             


Met Thr Asn Gly Asp Val Glu Arg Val Val Gly Ala Leu Ala Thr Val 
    370                 375                 380                 


Leu Gly 
385     


<210>  11
<211>  612
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic sequence

<400>  11
aagaaggaga tataccatgg ttgcggtgcg ccttcgtaac ctggtcgaat ctgatcgcga       60

acgccttctt atttggcgca acagtccaga tgttagcgca tatatgtact cagatcataa      120

gattggtcac gaagaacatg accactggtt cgacgtcgcg cgtcatgacc cacgtcgtcg      180

ctactggatt atcgaggctg acggggagcc ggtcggtctt gccaatcttg ctgacattga      240

tttggttcac cgtcgctgtg cttgggccta ctacttggca agccccaaag tgcgtggact      300

gggtgtcggc agttttgttg agttccaaat tatcgaatac gttttcaatc agttgcacct      360

gaacaaattg tggtgcgaag tccttatcag taatgaatcc gtatggcgtc tgcatgaact      420

ttacggcttc cagcgcgagg ctttatttcg ccagcatgtt atgaaacagg gccatgaagt      480

ggacgtaatt ggtttaggac tgcttgccag tgactgggcc gctcgccgcg atgccatggc      540

cgaacgcttg tgtgcgaaag gatatacaat ccccgacttg acctgccgcg cggcctgaga      600

tatcgcggcc gc                                                          612


<210>  12
<211>  193
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic sequence

<400>  12

Met Val Ala Val Arg Leu Arg Asn Leu Val Glu Ser Asp Arg Glu Arg 
1               5                   10                  15      


Leu Leu Ile Trp Arg Asn Ser Pro Asp Val Ser Ala Tyr Met Tyr Ser 
            20                  25                  30          


Asp His Lys Ile Gly His Glu Glu His Asp His Trp Phe Asp Val Ala 
        35                  40                  45              


Arg His Asp Pro Arg Arg Arg Tyr Trp Ile Ile Glu Ala Asp Gly Glu 
    50                  55                  60                  


Pro Val Gly Leu Ala Asn Leu Ala Asp Ile Asp Leu Val His Arg Arg 
65                  70                  75                  80  


Cys Ala Trp Ala Tyr Tyr Leu Ala Ser Pro Lys Val Arg Gly Leu Gly 
                85                  90                  95      


Val Gly Ser Phe Val Glu Phe Gln Ile Ile Glu Tyr Val Phe Asn Gln 
            100                 105                 110         


Leu His Leu Asn Lys Leu Trp Cys Glu Val Leu Ile Ser Asn Glu Ser 
        115                 120                 125             


Val Trp Arg Leu His Glu Leu Tyr Gly Phe Gln Arg Glu Ala Leu Phe 
    130                 135                 140                 


Arg Gln His Val Met Lys Gln Gly His Glu Val Asp Val Ile Gly Leu 
145                 150                 155                 160 


Gly Leu Leu Ala Ser Asp Trp Ala Ala Arg Arg Asp Ala Met Ala Glu 
                165                 170                 175     


Arg Leu Cys Ala Lys Gly Tyr Thr Ile Pro Asp Leu Thr Cys Arg Ala 
            180                 185                 190         


Ala 
    


<210>  13
<211>  1018
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic sequence

<400>  13
aagaaggaga tataccatgt ccttacgcat cgtctttgta tgcgcagccg gcccatctgt       60

gggtggtgga catgtcatgc gttccttgac acttgcacgc gcgttagcgg cgcgcggagc      120

gacatgtgcg tttttgggaa cacccgaggt agcagcagtc ttagacgcct tcggtcctga      180

tatggcgcgt gccgacaccg ccgagccctt cgaagctgta gtctttgact cctatgcact      240

taccgcggac gaccatcgcc gtatcgcggc gggacgtccc gcgttagtaa tcgacgattt      300

agccgaccgc cctcttgcag cagacctggt gcttgatgct ggaccggctc gccgcgccga      360

ggattacgca ggactggtgc ccgcacatgc acgtcttctg ctgggtccga atcacgcacc      420

ggtccgtcca gcttttgttg cgttacgcga ggcagcctta gcacgccgtg cgcagcaggg      480

accggtacgt cgcattcttg tatctctggg catgacggac gtggggggaa ttacaggacg      540

tgtggtcgca cttcttgccc caatccttgg ggaggtcact ctggatcttg tggtgggagc      600

gggagccccg agcttgcctg ctctgcgtgc attagccgct gaagaccctc gccttgttct      660

tcatattgac acgcaggata tgccacgcct tgttcttgaa gccgacttgg ccatcggcgc      720

aggaggttcc acgacgtggg agcgctgtgt ccttgccttg ccagctttga ctcttatctt      780

agccgataac caaattgccg cggcacgtgc tcttgaagca gctggcgtaa ccccttgttt      840

ggacgtaaca gccccggatt ttgacacggc ctttgcagct cttgcgcaga acctgattgc      900

tgatccggat cgtcgtgccg cacttagtgc tgcctcagct acggtctgtg atggacgtgg      960

cgcggagcgc gtggctgaag cattcttggg agtcaccacc acatgacaat tgctgcag       1018


<210>  14
<211>  329
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic sequence

<400>  14

Met Ser Leu Arg Ile Val Phe Val Cys Ala Ala Gly Pro Ser Val Gly 
1               5                   10                  15      


Gly Gly His Val Met Arg Ser Leu Thr Leu Ala Arg Ala Leu Ala Ala 
            20                  25                  30          


Arg Gly Ala Thr Cys Ala Phe Leu Gly Thr Pro Glu Val Ala Ala Val 
        35                  40                  45              


Leu Asp Ala Phe Gly Pro Asp Met Ala Arg Ala Asp Thr Ala Glu Pro 
    50                  55                  60                  


Phe Glu Ala Val Val Phe Asp Ser Tyr Ala Leu Thr Ala Asp Asp His 
65                  70                  75                  80  


Arg Arg Ile Ala Ala Gly Arg Pro Ala Leu Val Ile Asp Asp Leu Ala 
                85                  90                  95      


Asp Arg Pro Leu Ala Ala Asp Leu Val Leu Asp Ala Gly Pro Ala Arg 
            100                 105                 110         


Arg Ala Glu Asp Tyr Ala Gly Leu Val Pro Ala His Ala Arg Leu Leu 
        115                 120                 125             


Leu Gly Pro Asn His Ala Pro Val Arg Pro Ala Phe Val Ala Leu Arg 
    130                 135                 140                 


Glu Ala Ala Leu Ala Arg Arg Ala Gln Gln Gly Pro Val Arg Arg Ile 
145                 150                 155                 160 


Leu Val Ser Leu Gly Met Thr Asp Val Gly Gly Ile Thr Gly Arg Val 
                165                 170                 175     


Val Ala Leu Leu Ala Pro Ile Leu Gly Glu Val Thr Leu Asp Leu Val 
            180                 185                 190         


Val Gly Ala Gly Ala Pro Ser Leu Pro Ala Leu Arg Ala Leu Ala Ala 
        195                 200                 205             


Glu Asp Pro Arg Leu Val Leu His Ile Asp Thr Gln Asp Met Pro Arg 
    210                 215                 220                 


Leu Val Leu Glu Ala Asp Leu Ala Ile Gly Ala Gly Gly Ser Thr Thr 
225                 230                 235                 240 


Trp Glu Arg Cys Val Leu Ala Leu Pro Ala Leu Thr Leu Ile Leu Ala 
                245                 250                 255     


Asp Asn Gln Ile Ala Ala Ala Arg Ala Leu Glu Ala Ala Gly Val Thr 
            260                 265                 270         


Pro Cys Leu Asp Val Thr Ala Pro Asp Phe Asp Thr Ala Phe Ala Ala 
        275                 280                 285             


Leu Ala Gln Asn Leu Ile Ala Asp Pro Asp Arg Arg Ala Ala Leu Ser 
    290                 295                 300                 


Ala Ala Ser Ala Thr Val Cys Asp Gly Arg Gly Ala Glu Arg Val Ala 
305                 310                 315                 320 


Glu Ala Phe Leu Gly Val Thr Thr Thr 
                325                 


<210>  15
<211>  1101
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic sequence

<400>  15
aagaaggaga tataccatgt ccgctccgag taccgagatc cccccttcga ttgagattgc       60

tggacgcaag atcggggccg atcacagccc ctacgttatc tgtgagttgt cgggcaatca      120

taatgggtcc cttgaacgtt gcttagctat ggtagacgct gccgcagata ccggatgcga      180

cgccatcaaa attcaaactt acacagccga cacaatcacc ttggatgtag atcgtccgga      240

gttcaaaatc cacgggggat tgtgggatgg acgcactctg tatgagcttt atgaggaagc      300

tcatactccc tttgagtggc acgcggccat cttcgaacgc gctcgtcagc gcggtgtcac      360

gattttttct tctccatttg acgagactgc cgtcgacctt ttagattcgc tgggggcgcc      420

agcttttaaa attgcaagct ttgaagcggt agaccttccg cttatcaaat acgcggcagc      480

caaagggaaa cccttaatta tttccactgg aatggcgaac cttacggaga tgcaaaccgc      540

ccttgataca gctttgtcag caggcgctcc gggagtgtta cttttacact gtgtttcttc      600

ataccctgct acgttcgcag acgcgaacgt ccgcaccgtg ccggatatgg cggcacgctt      660

cggatgcccg attggccttt ccgatcacac gcccggtaca gcagctagtg tcgccgctgt      720

gagcttaggg gcgtgtgcag tagaaaaaca tttcacgctt gcccgtgccg atggtggtcc      780

ggacgccgca ttctctcttg aacctgcgga gtttaaggca ttagttgatg acacaaagaa      840

tgcttgggct gcgttgggac gtgcacacta cgatgtgctg gggtcagagg caacatcact      900

gttattccgc cgttctctgt acgttacagc cgacgtgaag gctggcgaac ctttaacgcg      960

cgcgaatgtt cgttcagtgc gtcccggcaa tgggttgcca cctgcggatt tggataaagt     1020

tctggcggga aaggcaaccc gcgatcttgc gcgcggcgag cctcttgact ggtcaatggt     1080

cggttgaaac tagtacttaa g                                               1101


<210>  16
<211>  356
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic sequence

<400>  16

Met Ser Ala Pro Ser Thr Glu Ile Pro Pro Ser Ile Glu Ile Ala Gly 
1               5                   10                  15      


Arg Lys Ile Gly Ala Asp His Ser Pro Tyr Val Ile Cys Glu Leu Ser 
            20                  25                  30          


Gly Asn His Asn Gly Ser Leu Glu Arg Cys Leu Ala Met Val Asp Ala 
        35                  40                  45              


Ala Ala Asp Thr Gly Cys Asp Ala Ile Lys Ile Gln Thr Tyr Thr Ala 
    50                  55                  60                  


Asp Thr Ile Thr Leu Asp Val Asp Arg Pro Glu Phe Lys Ile His Gly 
65                  70                  75                  80  


Gly Leu Trp Asp Gly Arg Thr Leu Tyr Glu Leu Tyr Glu Glu Ala His 
                85                  90                  95      


Thr Pro Phe Glu Trp His Ala Ala Ile Phe Glu Arg Ala Arg Gln Arg 
            100                 105                 110         


Gly Val Thr Ile Phe Ser Ser Pro Phe Asp Glu Thr Ala Val Asp Leu 
        115                 120                 125             


Leu Asp Ser Leu Gly Ala Pro Ala Phe Lys Ile Ala Ser Phe Glu Ala 
    130                 135                 140                 


Val Asp Leu Pro Leu Ile Lys Tyr Ala Ala Ala Lys Gly Lys Pro Leu 
145                 150                 155                 160 


Ile Ile Ser Thr Gly Met Ala Asn Leu Thr Glu Met Gln Thr Ala Leu 
                165                 170                 175     


Asp Thr Ala Leu Ser Ala Gly Ala Pro Gly Val Leu Leu Leu His Cys 
            180                 185                 190         


Val Ser Ser Tyr Pro Ala Thr Phe Ala Asp Ala Asn Val Arg Thr Val 
        195                 200                 205             


Pro Asp Met Ala Ala Arg Phe Gly Cys Pro Ile Gly Leu Ser Asp His 
    210                 215                 220                 


Thr Pro Gly Thr Ala Ala Ser Val Ala Ala Val Ser Leu Gly Ala Cys 
225                 230                 235                 240 


Ala Val Glu Lys His Phe Thr Leu Ala Arg Ala Asp Gly Gly Pro Asp 
                245                 250                 255     


Ala Ala Phe Ser Leu Glu Pro Ala Glu Phe Lys Ala Leu Val Asp Asp 
            260                 265                 270         


Thr Lys Asn Ala Trp Ala Ala Leu Gly Arg Ala His Tyr Asp Val Leu 
        275                 280                 285             


Gly Ser Glu Ala Thr Ser Leu Leu Phe Arg Arg Ser Leu Tyr Val Thr 
    290                 295                 300                 


Ala Asp Val Lys Ala Gly Glu Pro Leu Thr Arg Ala Asn Val Arg Ser 
305                 310                 315                 320 


Val Arg Pro Gly Asn Gly Leu Pro Pro Ala Asp Leu Asp Lys Val Leu 
                325                 330                 335     


Ala Gly Lys Ala Thr Arg Asp Leu Ala Arg Gly Glu Pro Leu Asp Trp 
            340                 345                 350         


Ser Met Val Gly 
        355     


<210>  17
<211>  759
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic sequence

<400>  17
aagaaggaga tataccatga tcttggccat cctgcaagcc cgcatgtcat ccacgcgcct       60

tcctggcaag gtactgatgc ccctgcaacg ccagcccatg atcgttcgtc agatcgaacg      120

tgttgcccgc tcaaaacgca ttgataagtt agttgtggcc acgtcggacc gcccagagga      180

cgatgcaatc gaagcagccg ttcgccgtga aggtattgcg gtgtttcgcg ggtcattaga      240

caatgtccag cagcgtttta ttggggcatt ggatgcccac cctgccgacc atgtagtacg      300

tctgaccgcc gattgtcccc ttgccgaccc gacacttatt gatgccacaa tcgatttatg      360

tctttcaaag ggcgcggact acgtatctaa tacaccggag ggtcacgccc atccaaaagg      420

gaccgatgta gaggtaatga ccgcagcggc attgcgtcgc gcggctgctg aagccacgac      480

caaagaagca tttgaacacg ttacttggga cctttggaac caacctcaac gctggacgtg      540

tgcatggttg ccgtgcttcc cagatcaagg agcggtacgc tggactgtgg atcgtccgga      600

tgattatgct tttgtcgctg ctgtatacga tgccctgtac ccagcaaatc gcgcctttac      660

gtcggatgac atccgtgcgt ttgtcgctgg tcgccccgac ctgcaagatt atggtggtga      720

tcgccgtgca tgagaattcc tcggcctcta gaactcgag                             759


<210>  18
<211>  238
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Synthetic sequence

<400>  18

Met Ile Leu Ala Ile Leu Gln Ala Arg Met Ser Ser Thr Arg Leu Pro 
1               5                   10                  15      


Gly Lys Val Leu Met Pro Leu Gln Arg Gln Pro Met Ile Val Arg Gln 
            20                  25                  30          


Ile Glu Arg Val Ala Arg Ser Lys Arg Ile Asp Lys Leu Val Val Ala 
        35                  40                  45              


Thr Ser Asp Arg Pro Glu Asp Asp Ala Ile Glu Ala Ala Val Arg Arg 
    50                  55                  60                  


Glu Gly Ile Ala Val Phe Arg Gly Ser Leu Asp Asn Val Gln Gln Arg 
65                  70                  75                  80  


Phe Ile Gly Ala Leu Asp Ala His Pro Ala Asp His Val Val Arg Leu 
                85                  90                  95      


Thr Ala Asp Cys Pro Leu Ala Asp Pro Thr Leu Ile Asp Ala Thr Ile 
            100                 105                 110         


Asp Leu Cys Leu Ser Lys Gly Ala Asp Tyr Val Ser Asn Thr Pro Glu 
        115                 120                 125             


Gly His Ala His Pro Lys Gly Thr Asp Val Glu Val Met Thr Ala Ala 
    130                 135                 140                 


Ala Leu Arg Arg Ala Ala Ala Glu Ala Thr Thr Lys Glu Ala Phe Glu 
145                 150                 155                 160 


His Val Thr Trp Asp Leu Trp Asn Gln Pro Gln Arg Trp Thr Cys Ala 
                165                 170                 175     


Trp Leu Pro Cys Phe Pro Asp Gln Gly Ala Val Arg Trp Thr Val Asp 
            180                 185                 190         


Arg Pro Asp Asp Tyr Ala Phe Val Ala Ala Val Tyr Asp Ala Leu Tyr 
        195                 200                 205             


Pro Ala Asn Arg Ala Phe Thr Ser Asp Asp Ile Arg Ala Phe Val Ala 
    210                 215                 220                 


Gly Arg Pro Asp Leu Gln Asp Tyr Gly Gly Asp Arg Arg Ala 
225                 230                 235             


<210>  19
<211>  5848
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic sequence

<400>  19
ggatccggat gtgagcggat aacaattacg agcttcatgc acagtgaaat catgaaaaat       60

ttattggctt tgtgagcgga taacaattat aatatgtgga aagaaggaga taccatatgg      120

ggcgttttag cccaaagagt ttggatctgg acggaaaggt tatcttggta acgggcggta      180

ctggaagctt cgggcgtcgt ttcatcgaga ctgtcttgcg ccgttacgat ccccgcaaag      240

ttatcgtcta ttcgcgcgat gaattaaaac agagtgacat gcaaattgag cttcgcgagc      300

aattcgatga ggccaccgta gcaaagatgc gtttttttct gggcgacgtg cgtgatcgtg      360

agcgtttaac gttagcgctt cgtggagtcg acattgtcat tcatgcagcc gcacttaaac      420

aggtaccagc ggcagaatat aatccctccg aatgtatcca cacgaatgtg ttgggtgcgg      480

aaaacgtagt atgggcgtca ctggctaacg ccgttaagca ggtggtcgcc ttatctacgg      540

acaaagcttg taatccgact aacctgtatg gtgcaacgaa gttggcctct gacaagacgt      600

tcgtggctgc caacaatctg agtggagaca tcgggacccg cttttgcgtg gttcgctatg      660

gtaacgtagt cgggtctcgc ggctcagtag taccacttta tcgtcgtctg ttgagccaag      720

gggcgacgga gttgccagtc acggaccctc gcatgacccg cttctggatt acgttgaatg      780

agggcgtgga cttcgtactt tcttcattga ccatgatgcg cggaggcgag atttttgtgc      840

cgaagatccc cagtatggca atgcctgatt tggtaaaagc catgtctagc actgctgcaa      900

tgaaggtaat cggtatccgc ccaggagaga aacttcatga aatcatgatc agcgcggatg      960

atgcccgcag caccgtggag ttcgatgacc gctatgcaat cgaaccgaat ttcgcagaat     1020

ttggccgtga gccctacgca gcaagtgacg gcgctaaacc cgtggccgag gacttcagct     1080

acagctcaga caataatcat gactggttgt ctcccgaagg cttgttagcc atgttagaag     1140

agaaggccac gtgaagatct cccccgggaa gaaggagata taccatgaca ggcggatttt     1200

taccttatgg gcgtcagact attgaggagg atgacatcgc tgcggtagcg gaagcattgc     1260

gcggcgactt tctgacgact ggccctacag tggaagcttt cgagacagcg ttcgccgcta     1320

aagtcggcgc tgatcacgca atcgcggtat cgaacggaac agctaccttg caccttgcca     1380

tgatggctct tggtattggt gaaggtgatg tatgcgtagc accaagtgtg acttttctgg     1440

caaccgctaa ctgtgcacgt tatgtaggtg cggaagtagt gttcgccgac gttgacccgg     1500

acagtggtct tatgacacca gacaccctgg cgcgcgcttt ggcaggtgca cgtgataagc     1560

gtgttaaagc tgtacttcca gtacatctgc gtggggacgt atgtgatctt cccgcgttga     1620

aagcaatggc atcagcgagc ggcgccgtgc ttgtggaaga tgccccgcat gccctgggtt     1680

cgatcgctac ctttgatggc gtagcgcatc cagtcggaga tggtgcgtac agttcattcg     1740

caagtttctc ctttcacccc gtaaagacgc tggccacagg ggagggaggg atgttgacca     1800

ccaacgaccc cgcactggcc gcaaaggcgc gtttgcttcg cagtcacggg atggtccgcc     1860

agccgggtgg agatccgtgg tggtacgaga tgcccgaact gggattcaat taccgcattc     1920

ctgatgtttt atgtgcctta ggtttatccc aactggcgaa acttgaccgt tttgttgcac     1980

gccgtcgtga ccttactgcc ctttacgctc gcttattggc ggagcgcgct ccccgtgcgc     2040

gtttagccac cagcccggac cactcagacg ctgccttaca cctgttgacg gttttaattg     2100

atttcgaggc cgagggtatt tcccgccgta ccgtagttga atcccttaaa actcaaggag     2160

taggaacgca ggtgcactac atcccggtgc accgtcagcc atattatgca cagcgctacg     2220

gggtcgccga cttgcccgga gctgacgcgt ggtacgcccg ttgcttaacc ttgccgctgt     2280

atccagctat gactaatgga gacgttgagc gcgttgtcgg tgctttagcc actgttttag     2340

ggtgagctag cggagctcaa gaaggagata taccatggtt gcggtgcgcc ttcgtaacct     2400

ggtcgaatct gatcgcgaac gccttcttat ttggcgcaac agtccagatg ttagcgcata     2460

tatgtactca gatcataaga ttggtcacga agaacatgac cactggttcg acgtcgcgcg     2520

tcatgaccca cgtcgtcgct actggattat cgaggctgac ggggagccgg tcggtcttgc     2580

caatcttgct gacattgatt tggttcaccg tcgctgtgct tgggcctact acttggcaag     2640

ccccaaagtg cgtggactgg gtgtcggcag ttttgttgag ttccaaatta tcgaatacgt     2700

tttcaatcag ttgcacctga acaaattgtg gtgcgaagtc cttatcagta atgaatccgt     2760

atggcgtctg catgaacttt acggcttcca gcgcgaggct ttatttcgcc agcatgttat     2820

gaaacagggc catgaagtgg acgtaattgg tttaggactg cttgccagtg actgggccgc     2880

tcgccgcgat gccatggccg aacgcttgtg tgcgaaagga tatacaatcc ccgacttgac     2940

ctgccgcgcg gcctgagata tcgcggccgc aagaaggaga tataccatgt ccttacgcat     3000

cgtctttgta tgcgcagccg gcccatctgt gggtggtgga catgtcatgc gttccttgac     3060

acttgcacgc gcgttagcgg cgcgcggagc gacatgtgcg tttttgggaa cacccgaggt     3120

agcagcagtc ttagacgcct tcggtcctga tatggcgcgt gccgacaccg ccgagccctt     3180

cgaagctgta gtctttgact cctatgcact taccgcggac gaccatcgcc gtatcgcggc     3240

gggacgtccc gcgttagtaa tcgacgattt agccgaccgc cctcttgcag cagacctggt     3300

gcttgatgct ggaccggctc gccgcgccga ggattacgca ggactggtgc ccgcacatgc     3360

acgtcttctg ctgggtccga atcacgcacc ggtccgtcca gcttttgttg cgttacgcga     3420

ggcagcctta gcacgccgtg cgcagcaggg accggtacgt cgcattcttg tatctctggg     3480

catgacggac gtggggggaa ttacaggacg tgtggtcgca cttcttgccc caatccttgg     3540

ggaggtcact ctggatcttg tggtgggagc gggagccccg agcttgcctg ctctgcgtgc     3600

attagccgct gaagaccctc gccttgttct tcatattgac acgcaggata tgccacgcct     3660

tgttcttgaa gccgacttgg ccatcggcgc aggaggttcc acgacgtggg agcgctgtgt     3720

ccttgccttg ccagctttga ctcttatctt agccgataac caaattgccg cggcacgtgc     3780

tcttgaagca gctggcgtaa ccccttgttt ggacgtaaca gccccggatt ttgacacggc     3840

ctttgcagct cttgcgcaga acctgattgc tgatccggat cgtcgtgccg cacttagtgc     3900

tgcctcagct acggtctgtg atggacgtgg cgcggagcgc gtggctgaag cattcttggg     3960

agtcaccacc acatgacaat tgctgcagaa gaaggagata taccatgtcc gctccgagta     4020

ccgagatccc cccttcgatt gagattgctg gacgcaagat cggggccgat cacagcccct     4080

acgttatctg tgagttgtcg ggcaatcata atgggtccct tgaacgttgc ttagctatgg     4140

tagacgctgc cgcagatacc ggatgcgacg ccatcaaaat tcaaacttac acagccgaca     4200

caatcacctt ggatgtagat cgtccggagt tcaaaatcca cgggggattg tgggatggac     4260

gcactctgta tgagctttat gaggaagctc atactccctt tgagtggcac gcggccatct     4320

tcgaacgcgc tcgtcagcgc ggtgtcacga ttttttcttc tccatttgac gagactgccg     4380

tcgacctttt agattcgctg ggggcgccag cttttaaaat tgcaagcttt gaagcggtag     4440

accttccgct tatcaaatac gcggcagcca aagggaaacc cttaattatt tccactggaa     4500

tggcgaacct tacggagatg caaaccgccc ttgatacagc tttgtcagca ggcgctccgg     4560

gagtgttact tttacactgt gtttcttcat accctgctac gttcgcagac gcgaacgtcc     4620

gcaccgtgcc ggatatggcg gcacgcttcg gatgcccgat tggcctttcc gatcacacgc     4680

ccggtacagc agctagtgtc gccgctgtga gcttaggggc gtgtgcagta gaaaaacatt     4740

tcacgcttgc ccgtgccgat ggtggtccgg acgccgcatt ctctcttgaa cctgcggagt     4800

ttaaggcatt agttgatgac acaaagaatg cttgggctgc gttgggacgt gcacactacg     4860

atgtgctggg gtcagaggca acatcactgt tattccgccg ttctctgtac gttacagccg     4920

acgtgaaggc tggcgaacct ttaacgcgcg cgaatgttcg ttcagtgcgt cccggcaatg     4980

ggttgccacc tgcggatttg gataaagttc tggcgggaaa ggcaacccgc gatcttgcgc     5040

gcggcgagcc tcttgactgg tcaatggtcg gttgaaacta gtacttaaga agaaggagat     5100

ataccatgat cttggccatc ctgcaagccc gcatgtcatc cacgcgcctt cctggcaagg     5160

tactgatgcc cctgcaacgc cagcccatga tcgttcgtca gatcgaacgt gttgcccgct     5220

caaaacgcat tgataagtta gttgtggcca cgtcggaccg cccagaggac gatgcaatcg     5280

aagcagccgt tcgccgtgaa ggtattgcgg tgtttcgcgg gtcattagac aatgtccagc     5340

agcgttttat tggggcattg gatgcccacc ctgccgacca tgtagtacgt ctgaccgccg     5400

attgtcccct tgccgacccg acacttattg atgccacaat cgatttatgt ctttcaaagg     5460

gcgcggacta cgtatctaat acaccggagg gtcacgccca tccaaaaggg accgatgtag     5520

aggtaatgac cgcagcggca ttgcgtcgcg cggctgctga agccacgacc aaagaagcat     5580

ttgaacacgt tacttgggac ctttggaacc aacctcaacg ctggacgtgt gcatggttgc     5640

cgtgcttccc agatcaagga gcggtacgct ggactgtgga tcgtccggat gattatgctt     5700

ttgtcgctgc tgtatacgat gccctgtacc cagcaaatcg cgcctttacg tcggatgaca     5760

tccgtgcgtt tgtcgctggt cgccccgacc tgcaagatta tggtggtgat cgccgtgcat     5820

gagaattcct cggcctctag aactcgag                                        5848


<210>  20
<211>  8600
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic sequence

<400>  20
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca       60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg      120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc      180

accaaatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc      240

attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcatcgctat      300

tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt      360

tttcccagtc acgacgttgt aaaacgacgg ccagtgcaac gcgatgacga tggatagcga      420

ttcatcgatg agctgacccg atcgccgccg ccggagggtt gcgtttgaga cgggcgacag      480

atggatccgg atgtgagcgg ataacaatta cgagcttcat gcacagtgaa atcatgaaaa      540

atttattggc tttgtgagcg gataacaatt ataatatgtg gaaagaagga gataccatat      600

ggggcgtttt agcccaaaga gtttggatct ggacggaaag gttatcttgg taacgggcgg      660

tactggaagc ttcgggcgtc gtttcatcga gactgtcttg cgccgttacg atccccgcaa      720

agttatcgtc tattcgcgcg atgaattaaa acagagtgac atgcaaattg agcttcgcga      780

gcaattcgat gaggccaccg tagcaaagat gcgttttttt ctgggcgacg tgcgtgatcg      840

tgagcgttta acgttagcgc ttcgtggagt cgacattgtc attcatgcag ccgcacttaa      900

acaggtacca gcggcagaat ataatccctc cgaatgtatc cacacgaatg tgttgggtgc      960

ggaaaacgta gtatgggcgt cactggctaa cgccgttaag caggtggtcg ccttatctac     1020

ggacaaagct tgtaatccga ctaacctgta tggtgcaacg aagttggcct ctgacaagac     1080

gttcgtggct gccaacaatc tgagtggaga catcgggacc cgcttttgcg tggttcgcta     1140

tggtaacgta gtcgggtctc gcggctcagt agtaccactt tatcgtcgtc tgttgagcca     1200

aggggcgacg gagttgccag tcacggaccc tcgcatgacc cgcttctgga ttacgttgaa     1260

tgagggcgtg gacttcgtac tttcttcatt gaccatgatg cgcggaggcg agatttttgt     1320

gccgaagatc cccagtatgg caatgcctga tttggtaaaa gccatgtcta gcactgctgc     1380

aatgaaggta atcggtatcc gcccaggaga gaaacttcat gaaatcatga tcagcgcgga     1440

tgatgcccgc agcaccgtgg agttcgatga ccgctatgca atcgaaccga atttcgcaga     1500

atttggccgt gagccctacg cagcaagtga cggcgctaaa cccgtggccg aggacttcag     1560

ctacagctca gacaataatc atgactggtt gtctcccgaa ggcttgttag ccatgttaga     1620

agagaaggcc acgtgaagat ctcccccggg aagaaggaga tataccatga caggcggatt     1680

tttaccttat gggcgtcaga ctattgagga ggatgacatc gctgcggtag cggaagcatt     1740

gcgcggcgac tttctgacga ctggccctac agtggaagct ttcgagacag cgttcgccgc     1800

taaagtcggc gctgatcacg caatcgcggt atcgaacgga acagctacct tgcaccttgc     1860

catgatggct cttggtattg gtgaaggtga tgtatgcgta gcaccaagtg tgacttttct     1920

ggcaaccgct aactgtgcac gttatgtagg tgcggaagta gtgttcgccg acgttgaccc     1980

ggacagtggt cttatgacac cagacaccct ggcgcgcgct ttggcaggtg cacgtgataa     2040

gcgtgttaaa gctgtacttc cagtacatct gcgtggggac gtatgtgatc ttcccgcgtt     2100

gaaagcaatg gcatcagcga gcggcgccgt gcttgtggaa gatgccccgc atgccctggg     2160

ttcgatcgct acctttgatg gcgtagcgca tccagtcgga gatggtgcgt acagttcatt     2220

cgcaagtttc tcctttcacc ccgtaaagac gctggccaca ggggagggag ggatgttgac     2280

caccaacgac cccgcactgg ccgcaaaggc gcgtttgctt cgcagtcacg ggatggtccg     2340

ccagccgggt ggagatccgt ggtggtacga gatgcccgaa ctgggattca attaccgcat     2400

tcctgatgtt ttatgtgcct taggtttatc ccaactggcg aaacttgacc gttttgttgc     2460

acgccgtcgt gaccttactg ccctttacgc tcgcttattg gcggagcgcg ctccccgtgc     2520

gcgtttagcc accagcccgg accactcaga cgctgcctta cacctgttga cggttttaat     2580

tgatttcgag gccgagggta tttcccgccg taccgtagtt gaatccctta aaactcaagg     2640

agtaggaacg caggtgcact acatcccggt gcaccgtcag ccatattatg cacagcgcta     2700

cggggtcgcc gacttgcccg gagctgacgc gtggtacgcc cgttgcttaa ccttgccgct     2760

gtatccagct atgactaatg gagacgttga gcgcgttgtc ggtgctttag ccactgtttt     2820

agggtgagct agcggagctc aagaaggaga tataccatgg ttgcggtgcg ccttcgtaac     2880

ctggtcgaat ctgatcgcga acgccttctt atttggcgca acagtccaga tgttagcgca     2940

tatatgtact cagatcataa gattggtcac gaagaacatg accactggtt cgacgtcgcg     3000

cgtcatgacc cacgtcgtcg ctactggatt atcgaggctg acggggagcc ggtcggtctt     3060

gccaatcttg ctgacattga tttggttcac cgtcgctgtg cttgggccta ctacttggca     3120

agccccaaag tgcgtggact gggtgtcggc agttttgttg agttccaaat tatcgaatac     3180

gttttcaatc agttgcacct gaacaaattg tggtgcgaag tccttatcag taatgaatcc     3240

gtatggcgtc tgcatgaact ttacggcttc cagcgcgagg ctttatttcg ccagcatgtt     3300

atgaaacagg gccatgaagt ggacgtaatt ggtttaggac tgcttgccag tgactgggcc     3360

gctcgccgcg atgccatggc cgaacgcttg tgtgcgaaag gatatacaat ccccgacttg     3420

acctgccgcg cggcctgaga tatcgcggcc gcaagaagga gatataccat gtccttacgc     3480

atcgtctttg tatgcgcagc cggcccatct gtgggtggtg gacatgtcat gcgttccttg     3540

acacttgcac gcgcgttagc ggcgcgcgga gcgacatgtg cgtttttggg aacacccgag     3600

gtagcagcag tcttagacgc cttcggtcct gatatggcgc gtgccgacac cgccgagccc     3660

ttcgaagctg tagtctttga ctcctatgca cttaccgcgg acgaccatcg ccgtatcgcg     3720

gcgggacgtc ccgcgttagt aatcgacgat ttagccgacc gccctcttgc agcagacctg     3780

gtgcttgatg ctggaccggc tcgccgcgcc gaggattacg caggactggt gcccgcacat     3840

gcacgtcttc tgctgggtcc gaatcacgca ccggtccgtc cagcttttgt tgcgttacgc     3900

gaggcagcct tagcacgccg tgcgcagcag ggaccggtac gtcgcattct tgtatctctg     3960

ggcatgacgg acgtgggggg aattacagga cgtgtggtcg cacttcttgc cccaatcctt     4020

ggggaggtca ctctggatct tgtggtggga gcgggagccc cgagcttgcc tgctctgcgt     4080

gcattagccg ctgaagaccc tcgccttgtt cttcatattg acacgcagga tatgccacgc     4140

cttgttcttg aagccgactt ggccatcggc gcaggaggtt ccacgacgtg ggagcgctgt     4200

gtccttgcct tgccagcttt gactcttatc ttagccgata accaaattgc cgcggcacgt     4260

gctcttgaag cagctggcgt aaccccttgt ttggacgtaa cagccccgga ttttgacacg     4320

gcctttgcag ctcttgcgca gaacctgatt gctgatccgg atcgtcgtgc cgcacttagt     4380

gctgcctcag ctacggtctg tgatggacgt ggcgcggagc gcgtggctga agcattcttg     4440

ggagtcacca ccacatgaca attgctgcag aagaaggaga tataccatgt ccgctccgag     4500

taccgagatc cccccttcga ttgagattgc tggacgcaag atcggggccg atcacagccc     4560

ctacgttatc tgtgagttgt cgggcaatca taatgggtcc cttgaacgtt gcttagctat     4620

ggtagacgct gccgcagata ccggatgcga cgccatcaaa attcaaactt acacagccga     4680

cacaatcacc ttggatgtag atcgtccgga gttcaaaatc cacgggggat tgtgggatgg     4740

acgcactctg tatgagcttt atgaggaagc tcatactccc tttgagtggc acgcggccat     4800

cttcgaacgc gctcgtcagc gcggtgtcac gattttttct tctccatttg acgagactgc     4860

cgtcgacctt ttagattcgc tgggggcgcc agcttttaaa attgcaagct ttgaagcggt     4920

agaccttccg cttatcaaat acgcggcagc caaagggaaa cccttaatta tttccactgg     4980

aatggcgaac cttacggaga tgcaaaccgc ccttgataca gctttgtcag caggcgctcc     5040

gggagtgtta cttttacact gtgtttcttc ataccctgct acgttcgcag acgcgaacgt     5100

ccgcaccgtg ccggatatgg cggcacgctt cggatgcccg attggccttt ccgatcacac     5160

gcccggtaca gcagctagtg tcgccgctgt gagcttaggg gcgtgtgcag tagaaaaaca     5220

tttcacgctt gcccgtgccg atggtggtcc ggacgccgca ttctctcttg aacctgcgga     5280

gtttaaggca ttagttgatg acacaaagaa tgcttgggct gcgttgggac gtgcacacta     5340

cgatgtgctg gggtcagagg caacatcact gttattccgc cgttctctgt acgttacagc     5400

cgacgtgaag gctggcgaac ctttaacgcg cgcgaatgtt cgttcagtgc gtcccggcaa     5460

tgggttgcca cctgcggatt tggataaagt tctggcggga aaggcaaccc gcgatcttgc     5520

gcgcggcgag cctcttgact ggtcaatggt cggttgaaac tagtacttaa gaagaaggag     5580

atataccatg atcttggcca tcctgcaagc ccgcatgtca tccacgcgcc ttcctggcaa     5640

ggtactgatg cccctgcaac gccagcccat gatcgttcgt cagatcgaac gtgttgcccg     5700

ctcaaaacgc attgataagt tagttgtggc cacgtcggac cgcccagagg acgatgcaat     5760

cgaagcagcc gttcgccgtg aaggtattgc ggtgtttcgc gggtcattag acaatgtcca     5820

gcagcgtttt attggggcat tggatgccca ccctgccgac catgtagtac gtctgaccgc     5880

cgattgtccc cttgccgacc cgacacttat tgatgccaca atcgatttat gtctttcaaa     5940

gggcgcggac tacgtatcta atacaccgga gggtcacgcc catccaaaag ggaccgatgt     6000

agaggtaatg accgcagcgg cattgcgtcg cgcggctgct gaagccacga ccaaagaagc     6060

atttgaacac gttacttggg acctttggaa ccaacctcaa cgctggacgt gtgcatggtt     6120

gccgtgcttc ccagatcaag gagcggtacg ctggactgtg gatcgtccgg atgattatgc     6180

ttttgtcgct gctgtatacg atgccctgta cccagcaaat cgcgccttta cgtcggatga     6240

catccgtgcg tttgtcgctg gtcgccccga cctgcaagat tatggtggtg atcgccgtgc     6300

atgagaattc ctcggcctct agaactcgag atcagttctg gaccagcgag ctgtgctgcg     6360

actcgtggcg taatcatggt catagctgtt tcctgtgtga aattgttatc cgctcacaat     6420

tccacacaac atacgagccg gaagcataaa gtgtaaagcc tggggtgcct aatgagtgag     6480

ctaactcaca ttaattgcgt tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg     6540

ccagctgcat taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc     6600

ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc     6660

agctcactca aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa     6720

catgtgagca aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt     6780

tttccatagg ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg     6840

gcgaaacccg acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg     6900

ctctcctgtt ccgaccctgt cgcttaccgg atacctgtcc gcctttctcc cttcgggaag     6960

cgtggcgctt tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc     7020

caagctgggc tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa     7080

ctatcgtctt gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg     7140

taacaggatt agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc     7200

taactacggc tacactagaa gaacagtatt tggtatctgc gctctgctga agccagttac     7260

cttcggaaaa agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg     7320

tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt     7380

gatcttttct acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt     7440

catgagatta tcaaaaagga tcttcaccta gatcctttta aattaaaaat gaagttttaa     7500

atcaatctaa agtatatatg agtaaacttg gtctgacagt taccaatgct taatcagtga     7560

ggcacctatc tcagcgatct gtctatttcg ttcatccata gttgcctgac tccccgtcgt     7620

gtagataact acgatacggg agggcttacc atctggcccc agtgctgcaa tgataccgcg     7680

agacccacgc tcaccggctc cagatttatc agcaataaac cagccagccg gaagggccga     7740

gcgcagaagt ggtcctgcaa ctttatccgc ctccatccag tctattaatt gttgccggga     7800

agctagagta agtagttcgc cagttaatag tttgcgcaac gttgttgcca ttgctacagg     7860

catcgtggtg tcacgctcgt cgtttggtat ggcttcattc agctccggtt cccaacgatc     7920

aaggcgagtt acatgatccc ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc     7980

gatcgttgtc agaagtaagt tggccgcagt gttatcactc atggttatgg cagcactgca     8040

taattctctt actgtcatgc catccgtaag atgcttttct gtgactggtg agtactcaac     8100

caagtcattc tgagaatagt gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg     8160

ggataatacc gcgccacata gcagaacttt aaaagtgctc atcattggaa aacgttcttc     8220

ggggcgaaaa ctctcaagga tcttaccgct gttgagatcc agttcgatgt aacccactcg     8280

tgcacccaac tgatcttcag catcttttac tttcaccagc gtttctgggt gagcaaaaac     8340

aggaaggcaa aatgccgcaa aaaagggaat aagggcgaca cggaaatgtt gaatactcat     8400

actctacctt tttcaatatt attgaagcat ttatcagggt tattgtctca tgagcggata     8460

catatttgaa tgtatttaga aaaataaaca aataggggtt ccgcgcacat ttccccgaaa     8520

agtgccacct gacgtctaag aaaccattat tatcatgaca ttaacctata aaaataggcg     8580

tatcacgagg ccctttcgtc                                                 8600


<210>  21
<211>  2639
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic sequence

<400>  21
catatggctc tgaactctat taacaccaac gccggcgcta tgattgcact gcagaacctg       60

aatggtacca actccgagct gacgactgtt cagcagcgta tcaacaccgg taagaaaatt      120

gcgtccgcca aagataacgg cgcaatctgg gctaccgcaa aaaaccaatc tgccactgca      180

gctagcatga acgctgttaa agactccctg cagcgtggcc agtctaccat cgacgttgcc      240

ctggccgccg gtgacactat caccgatctg ctgggtaaaa tgaaagaaaa ggctctggcc      300

gcgtctgaca cctccctgaa taccgcgtct tttaacgccc tgaaatctga cttcgatagc      360

ctgcgtgacc agatcgagaa ggcagcgact aatgctaaat tcaacggtgt tagcatcgcc      420

gacggctcta ccactaaact gaccttcctg gccaactctg acggttctgg tttcaccgtt      480

aacgccaaaa ctatttctct ggctggtatc ggcctgacca ctacctccac ctttaccacc      540

gcggcggcgg cgaagaccat gatcggcact atcgatactg cgctgcagac tgccaccaac      600

aaactggcct ctctgggtac ctcctctgtg ggtctggaca cgcatctgac ttttgtgggc      660

aaactgcagg atagcctgga tgcgggcgtt ggtaacctgg tggatgctga cctggcaaag      720

gagtctgcta agctgcaatc tctgcagact aaacaacagc tgggcgttca ggcactgtcc      780

atcgccaacc aatcttcttc ctctattctg tctctgtttc gctgaattca ggaggtaaaa      840

aaatgtcccg taaaagcgcc ctggaatcct cggcaagcgt cctggcccag gccgatgtcg      900

gcgcttcggg catccacccc agcgttatcg ccgacgccat gggcgattcg gcgtccgccg      960

aggcgctgga gcgcctgaat cgggcggcgc aggacaccaa gaacgtcgac aacgccaagc     1020

acttggcgcg cgcgatccag gccgtgcagc tgcaggacta cgccaaggcc gacaagctgg     1080

ccctgaagct gctggagaag gacgagcgac tgggcctagc ctggcacatc ctggcgattg     1140

cacgcgagaa gaccggcgat ttcgcctcct cgctgcgggc ctatgaagcc gcgctggctc     1200

tgctgcccga ccatggcccc gtcgccggcg acctgggccg cttggccttc cgcatgaaca     1260

tgccggagct ggcggccaag ttcttcgcac actaccgtct cgctcggccc gacgacgtcg     1320

agggcgccaa caacctggcg tgcgccctgc gcgagcttaa tcgcgaaagc gaagccatcg     1380

aagtcctcaa ggccgccctg ggcgccaacc ccgaggctgc ggtgctgtgg aacacgctgg     1440

gcacggtgct ttgcaatatc ggcgacgcgg cgggctcgat cgtgttcttc gacgagtccc     1500

tgcgcctcgc gcccgacttt tcgaaagcct atcacaaccg cgccttcgcc aggctcgatc     1560

tgggcgagat agaggccgcg ctggccgatt gcgaagccgc catgcgcagc cccggctcac     1620

cggaagatct ggcgatgatg cagttcgccc gcgccacgat tctcctggct ctgggccgcg     1680

tcggcgaagg ctgggaggct tatgagtcac gcttctcgcc ggcgctgagc gacgcgccac     1740

ggttccagat tcctggcgtc cgctggtcag gacaggacct caggggcaag cgtttgatga     1800

tcaccaccga gcagggcctc ggcgacgagg tgatgttcgc caacatgttg cccgacatcg     1860

tcgaagcctt gggcccagac ggcttcctgt ccctggcggt cgagcgccgt ctggcgccgc     1920

tgttcgagcg caccttcccg aaggtcgagg tgaccgccca ccgtacgatc gcctacgaag     1980

gccgcgtgtt ccgggccgcg ccctatatcg agaactggga ccgcttcgac tattgggcgg     2040

ccatcggcga cttcctgccg agccttcgcc ccaccgccga ggcgtttccc aagcgcaacg     2100

ccttcctgca gccggatccg gcgcgggtgg cccactggaa ggcccaactc gagaagcttg     2160

gccccggccc gaaagtcggc ctgctctgga agagcctgaa actgaacgcg gaacgcgcgc     2220

ggcagttttc gcccttccac ctgtgggagc cggttttgca cacgccaggc gtggtgttcg     2280

tgaacctgca gtatggcgac tgcgaggaag agatcgcctt cgccaaggaa gagctgggcg     2340

tggagatctg gcagccggaa ggcattgatc tgaaggccga cctcgacgac gtggccgctc     2400

tctgcgcggc ggtggacctg gtgatcgggt tctccaacgc cacgatcaat ctggccggtg     2460

cggtggggac gccgatcttc atgctgaccg gcgcctcgtc ctggacccgc ctcggcaccg     2520

aatattaccc ctggtatccg agcgttcgct gcttcgtcac cgagcagtac ggggtctggg     2580

aaccgaccat gggtcgcgtc gccaccgctc tgcgcgattt cgccgcatcc taatctaga      2639


<210>  22
<211>  5728
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic sequence

<400>  22
catatggggc gttttagccc aaagagtttg gatctggacg gaaaggttat cttggtaacg       60

ggcggtactg gaagcttcgg gcgtcgtttc atcgagactg tcttgcgccg ttacgatccc      120

cgcaaagtta tcgtctattc gcgcgatgaa ttaaaacaga gtgacatgca aattgagctt      180

cgcgagcaat tcgatgaggc caccgtagca aagatgcgtt tttttctggg cgacgtgcgt      240

gatcgtgagc gtttaacgtt agcgcttcgt ggagtcgaca ttgtcattca tgcagccgca      300

cttaaacagg taccagcggc agaatataat ccctccgaat gtatccacac gaatgtgttg      360

ggtgcggaaa acgtagtatg ggcgtcactg gctaacgccg ttaagcaggt ggtcgcctta      420

tctacggaca aagcttgtaa tccgactaac ctgtatggtg caacgaagtt ggcctctgac      480

aagacgttcg tggctgccaa caatctgagt ggagacatcg ggacccgctt ttgcgtggtt      540

cgctatggta acgtagtcgg gtctcgcggc tcagtagtac cactttatcg tcgtctgttg      600

agccaagggg cgacggagtt gccagtcacg gaccctcgca tgacccgctt ctggattacg      660

ttgaatgagg gcgtggactt cgtactttct tcattgacca tgatgcgcgg aggcgagatt      720

tttgtgccga agatccccag tatggcaatg cctgatttgg taaaagccat gtctagcact      780

gctgcaatga aggtaatcgg tatccgccca ggagagaaac ttcatgaaat catgatcagc      840

gcggatgatg cccgcagcac cgtggagttc gatgaccgct atgcaatcga accgaatttc      900

gcagaatttg gccgtgagcc ctacgcagca agtgacggcg ctaaacccgt ggccgaggac      960

ttcagctaca gctcagacaa taatcatgac tggttgtctc ccgaaggctt gttagccatg     1020

ttagaagaga aggccacgtg aagatctccc ccgggaagaa ggagatatac catgacaggc     1080

ggatttttac cttatgggcg tcagactatt gaggaggatg acatcgctgc ggtagcggaa     1140

gcattgcgcg gcgactttct gacgactggc cctacagtgg aagctttcga gacagcgttc     1200

gccgctaaag tcggcgctga tcacgcaatc gcggtatcga acggaacagc taccttgcac     1260

cttgccatga tggctcttgg tattggtgaa ggtgatgtat gcgtagcacc aagtgtgact     1320

tttctggcaa ccgctaactg tgcacgttat gtaggtgcgg aagtagtgtt cgccgacgtt     1380

gacccggaca gtggtcttat gacaccagac accctggcgc gcgctttggc aggtgcacgt     1440

gataagcgtg ttaaagctgt acttccagta catctgcgtg gggacgtatg tgatcttccc     1500

gcgttgaaag caatggcatc agcgagcggc gccgtgcttg tggaagatgc cccgcatgcc     1560

ctgggttcga tcgctacctt tgatggcgta gcgcatccag tcggagatgg tgcgtacagt     1620

tcattcgcaa gtttctcctt tcaccccgta aagacgctgg ccacagggga gggagggatg     1680

ttgaccacca acgaccccgc actggccgca aaggcgcgtt tgcttcgcag tcacgggatg     1740

gtccgccagc cgggtggaga tccgtggtgg tacgagatgc ccgaactggg attcaattac     1800

cgcattcctg atgttttatg tgccttaggt ttatcccaac tggcgaaact tgaccgtttt     1860

gttgcacgcc gtcgtgacct tactgccctt tacgctcgct tattggcgga gcgcgctccc     1920

cgtgcgcgtt tagccaccag cccggaccac tcagacgctg ccttacacct gttgacggtt     1980

ttaattgatt tcgaggccga gggtatttcc cgccgtaccg tagttgaatc ccttaaaact     2040

caaggagtag gaacgcaggt gcactacatc ccggtgcacc gtcagccata ttatgcacag     2100

cgctacgggg tcgccgactt gcccggagct gacgcgtggt acgcccgttg cttaaccttg     2160

ccgctgtatc cagctatgac taatggagac gttgagcgcg ttgtcggtgc tttagccact     2220

gttttagggt gagctagcgg agctcaagaa ggagatatac catggttgcg gtgcgccttc     2280

gtaacctggt cgaatctgat cgcgaacgcc ttcttatttg gcgcaacagt ccagatgtta     2340

gcgcatatat gtactcagat cataagattg gtcacgaaga acatgaccac tggttcgacg     2400

tcgcgcgtca tgacccacgt cgtcgctact ggattatcga ggctgacggg gagccggtcg     2460

gtcttgccaa tcttgctgac attgatttgg ttcaccgtcg ctgtgcttgg gcctactact     2520

tggcaagccc caaagtgcgt ggactgggtg tcggcagttt tgttgagttc caaattatcg     2580

aatacgtttt caatcagttg cacctgaaca aattgtggtg cgaagtcctt atcagtaatg     2640

aatccgtatg gcgtctgcat gaactttacg gcttccagcg cgaggcttta tttcgccagc     2700

atgttatgaa acagggccat gaagtggacg taattggttt aggactgctt gccagtgact     2760

gggccgctcg ccgcgatgcc atggccgaac gcttgtgtgc gaaaggatat acaatccccg     2820

acttgacctg ccgcgcggcc tgagatatcg cggccgcaag aaggagatat accatgtcct     2880

tacgcatcgt ctttgtatgc gcagccggcc catctgtggg tggtggacat gtcatgcgtt     2940

ccttgacact tgcacgcgcg ttagcggcgc gcggagcgac atgtgcgttt ttgggaacac     3000

ccgaggtagc agcagtctta gacgccttcg gtcctgatat ggcgcgtgcc gacaccgccg     3060

agcccttcga agctgtagtc tttgactcct atgcacttac cgcggacgac catcgccgta     3120

tcgcggcggg acgtcccgcg ttagtaatcg acgatttagc cgaccgccct cttgcagcag     3180

acctggtgct tgatgctgga ccggctcgcc gcgccgagga ttacgcagga ctggtgcccg     3240

cacatgcacg tcttctgctg ggtccgaatc acgcaccggt ccgtccagct tttgttgcgt     3300

tacgcgaggc agccttagca cgccgtgcgc agcagggacc ggtacgtcgc attcttgtat     3360

ctctgggcat gacggacgtg gggggaatta caggacgtgt ggtcgcactt cttgccccaa     3420

tccttgggga ggtcactctg gatcttgtgg tgggagcggg agccccgagc ttgcctgctc     3480

tgcgtgcatt agccgctgaa gaccctcgcc ttgttcttca tattgacacg caggatatgc     3540

cacgccttgt tcttgaagcc gacttggcca tcggcgcagg aggttccacg acgtgggagc     3600

gctgtgtcct tgccttgcca gctttgactc ttatcttagc cgataaccaa attgccgcgg     3660

cacgtgctct tgaagcagct ggcgtaaccc cttgtttgga cgtaacagcc ccggattttg     3720

acacggcctt tgcagctctt gcgcagaacc tgattgctga tccggatcgt cgtgccgcac     3780

ttagtgctgc ctcagctacg gtctgtgatg gacgtggcgc ggagcgcgtg gctgaagcat     3840

tcttgggagt caccaccaca tgacaattgc tgcagaagaa ggagatatac catgtccgct     3900

ccgagtaccg agatcccccc ttcgattgag attgctggac gcaagatcgg ggccgatcac     3960

agcccctacg ttatctgtga gttgtcgggc aatcataatg ggtcccttga acgttgctta     4020

gctatggtag acgctgccgc agataccgga tgcgacgcca tcaaaattca aacttacaca     4080

gccgacacaa tcaccttgga tgtagatcgt ccggagttca aaatccacgg gggattgtgg     4140

gatggacgca ctctgtatga gctttatgag gaagctcata ctccctttga gtggcacgcg     4200

gccatcttcg aacgcgctcg tcagcgcggt gtcacgattt tttcttctcc atttgacgag     4260

actgccgtcg accttttaga ttcgctgggg gcgccagctt ttaaaattgc aagctttgaa     4320

gcggtagacc ttccgcttat caaatacgcg gcagccaaag ggaaaccctt aattatttcc     4380

actggaatgg cgaaccttac ggagatgcaa accgcccttg atacagcttt gtcagcaggc     4440

gctccgggag tgttactttt acactgtgtt tcttcatacc ctgctacgtt cgcagacgcg     4500

aacgtccgca ccgtgccgga tatggcggca cgcttcggat gcccgattgg cctttccgat     4560

cacacgcccg gtacagcagc tagtgtcgcc gctgtgagct taggggcgtg tgcagtagaa     4620

aaacatttca cgcttgcccg tgccgatggt ggtccggacg ccgcattctc tcttgaacct     4680

gcggagttta aggcattagt tgatgacaca aagaatgctt gggctgcgtt gggacgtgca     4740

cactacgatg tgctggggtc agaggcaaca tcactgttat tccgccgttc tctgtacgtt     4800

acagccgacg tgaaggctgg cgaaccttta acgcgcgcga atgttcgttc agtgcgtccc     4860

ggcaatgggt tgccacctgc ggatttggat aaagttctgg cgggaaaggc aacccgcgat     4920

cttgcgcgcg gcgagcctct tgactggtca atggtcggtt gaaactagta cttaagaaga     4980

aggagatata ccatgatctt ggccatcctg caagcccgca tgtcatccac gcgccttcct     5040

ggcaaggtac tgatgcccct gcaacgccag cccatgatcg ttcgtcagat cgaacgtgtt     5100

gcccgctcaa aacgcattga taagttagtt gtggccacgt cggaccgccc agaggacgat     5160

gcaatcgaag cagccgttcg ccgtgaaggt attgcggtgt ttcgcgggtc attagacaat     5220

gtccagcagc gttttattgg ggcattggat gcccaccctg ccgaccatgt agtacgtctg     5280

accgccgatt gtccccttgc cgacccgaca cttattgatg ccacaatcga tttatgtctt     5340

tcaaagggcg cggactacgt atctaataca ccggagggtc acgcccatcc aaaagggacc     5400

gatgtagagg taatgaccgc agcggcattg cgtcgcgcgg ctgctgaagc cacgaccaaa     5460

gaagcatttg aacacgttac ttgggacctt tggaaccaac ctcaacgctg gacgtgtgca     5520

tggttgccgt gcttcccaga tcaaggagcg gtacgctgga ctgtggatcg tccggatgat     5580

tatgcttttg tcgctgctgt atacgatgcc ctgtacccag caaatcgcgc ctttacgtcg     5640

gatgacatcc gtgcgtttgt cgctggtcgc cccgacctgc aagattatgg tggtgatcgc     5700

cgtgcatgag aattcctcgg cctctaga                                        5728


<210>  23
<211>  612
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Mutated FlmG protein sequence

<400>  23

Met Ser Arg Lys Arg Ala Leu Glu Leu Ser Ala Ser Ala Leu Ala Gln 
1               5                   10                  15      


Ala Asp Leu Gly Ala Ala Gly Ile His Arg Ala Val Ile Ala Gly Ala 
            20                  25                  30          


Val Gly Asp Ser Ala Ser Ala Glu Ala Ile Glu Arg Leu Lys Arg Ser 
        35                  40                  45              


Ala Gln Gln Ile Lys Asn Asn Asp Asn Ala Lys Ala Val Thr Arg Ala 
    50                  55                  60                  


Ile Gln Ala Ile His Leu Arg Asp Tyr Ala Lys Ala Asp Lys Leu Ala 
65                  70                  75                  80  


Leu Ala Leu Leu Lys Lys Asp Glu His Leu Gly Leu Ala Trp His Ile 
                85                  90                  95      


Leu Gly Ile Ala Arg Glu Lys Gln Gly Asp Phe Ala Thr Ser Leu Arg 
            100                 105                 110         


Ala Tyr Glu Ala Ala Leu Lys Leu Leu Ala Asp His Gly Pro Val Ala 
        115                 120                 125             


Gly Asp Leu Gly Arg Leu Ala Phe Arg Met Asn Met Pro Glu Ile Ala 
    130                 135                 140                 


Ala Gln Phe Phe Ala His Tyr Arg Leu Ala Lys Pro Asp Asp Val Glu 
145                 150                 155                 160 


Gly Ala Asn Asn Leu Ala Cys Ala Leu Arg Glu Leu Asn Arg Glu Gly 
                165                 170                 175     


Glu Ala Val Glu Val Leu Lys Ala Ala Ile Gly Ala Asn Pro Thr Ala 
            180                 185                 190         


Ala Leu Leu Trp Asn Thr Leu Gly Thr Val Leu Cys Asn Val Gly Asp 
        195                 200                 205             


Ala Gly Gly Ser Leu Val Phe Phe Asp Glu Ala Leu Arg Leu Arg Pro 
    210                 215                 220                 


Asp Phe Ser Lys Ala His His Asn Arg Ala Phe Ala Lys Leu Asp Leu 
225                 230                 235                 240 


Gly Gln Val Glu Glu Ala Leu Val Asp Cys Glu Ala Ala Ile Lys Ser 
                245                 250                 255     


Pro Glu Ser Pro Glu Asp Leu Ala Met Met Gln Phe Ala Gln Ala Thr 
            260                 265                 270         


Ile Leu Leu Gly Leu Gly Arg Val Ala Glu Gly Trp Glu Ala Tyr Glu 
        275                 280                 285             


Ala Arg Phe Ala Pro Ala Leu Val Glu Ala Pro Arg Phe Gln Ile Pro 
    290                 295                 300                 


Gly Thr Arg Trp Ser Gly Gln Asp Leu Ala Gly Lys Thr Leu Met Ile 
305                 310                 315                 320 


Ser Thr Glu Gln Gly Leu Gly Asp Glu Val Met Phe Ala Gly Met Leu 
                325                 330                 335     


Pro Asp Ile Leu Glu Arg Leu Gly Pro Asp Gly Ser Leu Ser Leu Ala 
            340                 345                 350         


Val Glu Arg Arg Leu Ile Pro Leu Phe Gln Arg Ser Phe Pro Gly Ile 
        355                 360                 365             


Glu Val Thr Ala His Arg Thr Val Ala Tyr Glu Gly Arg Thr Tyr Arg 
    370                 375                 380                 


Ala Ala Pro Glu Ile Glu Asp Trp Asp Arg Phe Asp Tyr Trp Ala Ala 
385                 390                 395                 400 


Ile Gly Asp Phe Leu Pro Ser Leu Arg Gly Ser Val Glu Ala Phe Pro 
                405                 410                 415     


Arg Arg Asp His Tyr Leu Thr Pro Asp Pro Glu Arg Val Ala His Trp 
            420                 425                 430         


Lys Ala Glu Leu Glu Lys Leu Gly Pro Ala Pro Lys Val Gly Leu Leu 
        435                 440                 445             


Trp Lys Ser Leu Lys Leu Gly Ala Glu Arg Gly Arg Gln Phe Ser Pro 
    450                 455                 460                 


Phe Glu Ala Trp Arg Ala Val Leu Gln Thr Pro Gly Ala Val Phe Val 
465                 470                 475                 480 


Asn Leu Gln Tyr Gly Asp Cys Asp Glu Glu Ile Ala Tyr Ala Lys Glu 
                485                 490                 495     


Thr Phe Gly Val Glu Ile Trp Gln Pro Pro Gly Ile Asp Leu Lys Lys 
            500                 505                 510         


Asp Leu Asp Asp Val Ala Ala Leu Cys Ala Ala Val Asp Leu Ile Ile 
        515                 520                 525             


Gly Phe Ser Asn Ala Thr Ile Asn Leu Ala Gly Ala Val Gly Ala Pro 
    530                 535                 540                 


Ile Trp Met Met Thr Ala Pro Lys Val Trp Thr Lys Leu Gly Thr Asp 
545                 550                 555                 560 


Arg Tyr Pro Trp Tyr Pro Gln Ala Gln Val Phe Ser Pro Ala Asp Phe 
                565                 570                 575     


Ser Asp Trp Glu Pro Val Met Glu Glu Val Ala Arg Ala Leu Ala Ala 
            580                 585                 590         


Lys Ile Ala Gly Pro Val Met Glu Glu Val Ala Arg Ala Leu Ala Ala 
        595                 600                 605             


Lys Ile Ala Gly 
    610         


<210>  24
<211>  168
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic sequence

<400>  24

Met Ala Leu Asn Ser Ile Asn Thr Asn Ala Gly Ala Met Ile Ala Leu 
1               5                   10                  15      


Gln Asn Leu Asn Gly Thr Asn Ser Glu Leu Thr Thr Val Gln Gln Arg 
            20                  25                  30          


Ile Asn Thr Gly Thr Ser Leu Asn Thr Ala Ser Phe Asn Ala Leu Lys 
        35                  40                  45              


Ser Asp Phe Asp Ser Leu Arg Asp Gln Ile Glu Lys Ala Ala Thr Asn 
    50                  55                  60                  


Ala Lys Phe Asn Gly Val Ser Ile Ala Asp Gly Ser Thr Thr Lys Leu 
65                  70                  75                  80  


Thr Phe Leu Ala Asn Ser Asp Gly Ser Gly Phe Thr Val Asn Ala Lys 
                85                  90                  95      


Thr Ile Ser Leu Ala Gly Ile Gly Leu Thr Thr Thr Ser Thr Phe Thr 
            100                 105                 110         


Thr Ala Ala Ala Ala Lys Thr Met Ile Gly Thr Ile Asp Thr Ala Leu 
        115                 120                 125             


Gln Thr Ala Thr Asn Lys Leu Ala Ser Leu Gly Thr Ser Ser Val Gly 
    130                 135                 140                 


Leu Asp Thr His Leu Thr Phe Val Gly Lys Leu Gln Asp Ser Leu Asp 
145                 150                 155                 160 


Ala Gly Val Gly Asn Leu Val Asp 
                165             


<210>  25
<211>  1847
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic sequence

<400>  25

Met Gly Arg Phe Ser Pro Lys Ser Leu Asp Leu Asp Gly Lys Val Ile 
1               5                   10                  15      


Leu Val Thr Gly Gly Thr Gly Ser Phe Gly Arg Arg Phe Ile Glu Thr 
            20                  25                  30          


Val Leu Arg Arg Tyr Asp Pro Arg Lys Val Ile Val Tyr Ser Arg Asp 
        35                  40                  45              


Glu Leu Lys Gln Ser Asp Met Gln Ile Glu Leu Arg Glu Gln Phe Asp 
    50                  55                  60                  


Glu Ala Thr Val Ala Lys Met Arg Phe Phe Leu Gly Asp Val Arg Asp 
65                  70                  75                  80  


Arg Glu Arg Leu Thr Leu Ala Leu Arg Gly Val Asp Ile Val Ile His 
                85                  90                  95      


Ala Ala Ala Leu Lys Gln Val Pro Ala Ala Glu Tyr Asn Pro Ser Glu 
            100                 105                 110         


Cys Ile His Thr Asn Val Leu Gly Ala Glu Asn Val Val Trp Ala Ser 
        115                 120                 125             


Leu Ala Asn Ala Val Lys Gln Val Val Ala Leu Ser Thr Asp Lys Ala 
    130                 135                 140                 


Cys Asn Pro Thr Asn Leu Tyr Gly Ala Thr Lys Leu Ala Ser Asp Lys 
145                 150                 155                 160 


Thr Phe Val Ala Ala Asn Asn Leu Ser Gly Asp Ile Gly Thr Arg Phe 
                165                 170                 175     


Cys Val Val Arg Tyr Gly Asn Val Val Gly Ser Arg Gly Ser Val Val 
            180                 185                 190         


Pro Leu Tyr Arg Arg Leu Leu Ser Gln Gly Ala Thr Glu Leu Pro Val 
        195                 200                 205             


Thr Asp Pro Arg Met Thr Arg Phe Trp Ile Thr Leu Asn Glu Gly Val 
    210                 215                 220                 


Asp Phe Val Leu Ser Ser Leu Thr Met Met Arg Gly Gly Glu Ile Phe 
225                 230                 235                 240 


Val Pro Lys Ile Pro Ser Met Ala Met Pro Asp Leu Val Lys Ala Met 
                245                 250                 255     


Ser Ser Thr Ala Ala Met Lys Val Ile Gly Ile Arg Pro Gly Glu Lys 
            260                 265                 270         


Leu His Glu Ile Met Ile Ser Ala Asp Asp Ala Arg Ser Thr Val Glu 
        275                 280                 285             


Phe Asp Asp Arg Tyr Ala Ile Glu Pro Asn Phe Ala Glu Phe Gly Arg 
    290                 295                 300                 


Glu Pro Tyr Ala Ala Ser Asp Gly Ala Lys Pro Val Ala Glu Asp Phe 
305                 310                 315                 320 


Ser Tyr Ser Ser Asp Asn Asn His Asp Trp Leu Ser Pro Glu Gly Leu 
                325                 330                 335     


Leu Ala Met Leu Glu Glu Lys Ala Thr Met Thr Gly Gly Phe Leu Pro 
            340                 345                 350         


Tyr Gly Arg Gln Thr Ile Glu Glu Asp Asp Ile Ala Ala Val Ala Glu 
        355                 360                 365             


Ala Leu Arg Gly Asp Phe Leu Thr Thr Gly Pro Thr Val Glu Ala Phe 
    370                 375                 380                 


Glu Thr Ala Phe Ala Ala Lys Val Gly Ala Asp His Ala Ile Ala Val 
385                 390                 395                 400 


Ser Asn Gly Thr Ala Thr Leu His Leu Ala Met Met Ala Leu Gly Ile 
                405                 410                 415     


Gly Glu Gly Asp Val Cys Val Ala Pro Ser Val Thr Phe Leu Ala Thr 
            420                 425                 430         


Ala Asn Cys Ala Arg Tyr Val Gly Ala Glu Val Val Phe Ala Asp Val 
        435                 440                 445             


Asp Pro Asp Ser Gly Leu Met Thr Pro Asp Thr Leu Ala Arg Ala Leu 
    450                 455                 460                 


Ala Gly Ala Arg Asp Lys Arg Val Lys Ala Val Leu Pro Val His Leu 
465                 470                 475                 480 


Arg Gly Asp Val Cys Asp Leu Pro Ala Leu Lys Ala Met Ala Ser Ala 
                485                 490                 495     


Ser Gly Ala Val Leu Val Glu Asp Ala Pro His Ala Leu Gly Ser Ile 
            500                 505                 510         


Ala Thr Phe Asp Gly Val Ala His Pro Val Gly Asp Gly Ala Tyr Ser 
        515                 520                 525             


Ser Phe Ala Ser Phe Ser Phe His Pro Val Lys Thr Leu Ala Thr Gly 
    530                 535                 540                 


Glu Gly Gly Met Leu Thr Thr Asn Asp Pro Ala Leu Ala Ala Lys Ala 
545                 550                 555                 560 


Arg Leu Leu Arg Ser His Gly Met Val Arg Gln Pro Gly Gly Asp Pro 
                565                 570                 575     


Trp Trp Tyr Glu Met Pro Glu Leu Gly Phe Asn Tyr Arg Ile Pro Asp 
            580                 585                 590         


Val Leu Cys Ala Leu Gly Leu Ser Gln Leu Ala Lys Leu Asp Arg Phe 
        595                 600                 605             


Val Ala Arg Arg Arg Asp Leu Thr Ala Leu Tyr Ala Arg Leu Leu Ala 
    610                 615                 620                 


Glu Arg Ala Pro Arg Ala Arg Leu Ala Thr Ser Pro Asp His Ser Asp 
625                 630                 635                 640 


Ala Ala Leu His Leu Leu Thr Val Leu Ile Asp Phe Glu Ala Glu Gly 
                645                 650                 655     


Ile Ser Arg Arg Thr Val Val Glu Ser Leu Lys Thr Gln Gly Val Gly 
            660                 665                 670         


Thr Gln Val His Tyr Ile Pro Val His Arg Gln Pro Tyr Tyr Ala Gln 
        675                 680                 685             


Arg Tyr Gly Val Ala Asp Leu Pro Gly Ala Asp Ala Trp Tyr Ala Arg 
    690                 695                 700                 


Cys Leu Thr Leu Pro Leu Tyr Pro Ala Met Thr Asn Gly Asp Val Glu 
705                 710                 715                 720 


Arg Val Val Gly Ala Leu Ala Thr Val Leu Gly Met Val Ala Val Arg 
                725                 730                 735     


Leu Arg Asn Leu Val Glu Ser Asp Arg Glu Arg Leu Leu Ile Trp Arg 
            740                 745                 750         


Asn Ser Pro Asp Val Ser Ala Tyr Met Tyr Ser Asp His Lys Ile Gly 
        755                 760                 765             


His Glu Glu His Asp His Trp Phe Asp Val Ala Arg His Asp Pro Arg 
    770                 775                 780                 


Arg Arg Tyr Trp Ile Ile Glu Ala Asp Gly Glu Pro Val Gly Leu Ala 
785                 790                 795                 800 


Asn Leu Ala Asp Ile Asp Leu Val His Arg Arg Cys Ala Trp Ala Tyr 
                805                 810                 815     


Tyr Leu Ala Ser Pro Lys Val Arg Gly Leu Gly Val Gly Ser Phe Val 
            820                 825                 830         


Glu Phe Gln Ile Ile Glu Tyr Val Phe Asn Gln Leu His Leu Asn Lys 
        835                 840                 845             


Leu Trp Cys Glu Val Leu Ile Ser Asn Glu Ser Val Trp Arg Leu His 
    850                 855                 860                 


Glu Leu Tyr Gly Phe Gln Arg Glu Ala Leu Phe Arg Gln His Val Met 
865                 870                 875                 880 


Lys Gln Gly His Glu Val Asp Val Ile Gly Leu Gly Leu Leu Ala Ser 
                885                 890                 895     


Asp Trp Ala Ala Arg Arg Asp Ala Met Ala Glu Arg Leu Cys Ala Lys 
            900                 905                 910         


Gly Tyr Thr Ile Pro Asp Leu Thr Cys Arg Ala Ala Met Ser Leu Arg 
        915                 920                 925             


Ile Val Phe Val Cys Ala Ala Gly Pro Ser Val Gly Gly Gly His Val 
    930                 935                 940                 


Met Arg Ser Leu Thr Leu Ala Arg Ala Leu Ala Ala Arg Gly Ala Thr 
945                 950                 955                 960 


Cys Ala Phe Leu Gly Thr Pro Glu Val Ala Ala Val Leu Asp Ala Phe 
                965                 970                 975     


Gly Pro Asp Met Ala Arg Ala Asp Thr Ala Glu Pro Phe Glu Ala Val 
            980                 985                 990         


Val Phe Asp Ser Tyr Ala Leu Thr  Ala Asp Asp His Arg  Arg Ile Ala 
        995                 1000                 1005             


Ala Gly  Arg Pro Ala Leu Val  Ile Asp Asp Leu Ala  Asp Arg Pro 
    1010                 1015                 1020             


Leu Ala  Ala Asp Leu Val Leu  Asp Ala Gly Pro Ala  Arg Arg Ala 
    1025                 1030                 1035             


Glu Asp  Tyr Ala Gly Leu Val  Pro Ala His Ala Arg  Leu Leu Leu 
    1040                 1045                 1050             


Gly Pro  Asn His Ala Pro Val  Arg Pro Ala Phe Val  Ala Leu Arg 
    1055                 1060                 1065             


Glu Ala  Ala Leu Ala Arg Arg  Ala Gln Gln Gly Pro  Val Arg Arg 
    1070                 1075                 1080             


Ile Leu  Val Ser Leu Gly Met  Thr Asp Val Gly Gly  Ile Thr Gly 
    1085                 1090                 1095             


Arg Val  Val Ala Leu Leu Ala  Pro Ile Leu Gly Glu  Val Thr Leu 
    1100                 1105                 1110             


Asp Leu  Val Val Gly Ala Gly  Ala Pro Ser Leu Pro  Ala Leu Arg 
    1115                 1120                 1125             


Ala Leu  Ala Ala Glu Asp Pro  Arg Leu Val Leu His  Ile Asp Thr 
    1130                 1135                 1140             


Gln Asp  Met Pro Arg Leu Val  Leu Glu Ala Asp Leu  Ala Ile Gly 
    1145                 1150                 1155             


Ala Gly  Gly Ser Thr Thr Trp  Glu Arg Cys Val Leu  Ala Leu Pro 
    1160                 1165                 1170             


Ala Leu  Thr Leu Ile Leu Ala  Asp Asn Gln Ile Ala  Ala Ala Arg 
    1175                 1180                 1185             


Ala Leu  Glu Ala Ala Gly Val  Thr Pro Cys Leu Asp  Val Thr Ala 
    1190                 1195                 1200             


Pro Asp  Phe Asp Thr Ala Phe  Ala Ala Leu Ala Gln  Asn Leu Ile 
    1205                 1210                 1215             


Ala Asp  Pro Asp Arg Arg Ala  Ala Leu Ser Ala Ala  Ser Ala Thr 
    1220                 1225                 1230             


Val Cys  Asp Gly Arg Gly Ala  Glu Arg Val Ala Glu  Ala Phe Leu 
    1235                 1240                 1245             


Gly Val  Thr Thr Thr Met Ser  Ala Pro Ser Thr Glu  Ile Pro Pro 
    1250                 1255                 1260             


Ser Ile  Glu Ile Ala Gly Arg  Lys Ile Gly Ala Asp  His Ser Pro 
    1265                 1270                 1275             


Tyr Val  Ile Cys Glu Leu Ser  Gly Asn His Asn Gly  Ser Leu Glu 
    1280                 1285                 1290             


Arg Cys  Leu Ala Met Val Asp  Ala Ala Ala Asp Thr  Gly Cys Asp 
    1295                 1300                 1305             


Ala Ile  Lys Ile Gln Thr Tyr  Thr Ala Asp Thr Ile  Thr Leu Asp 
    1310                 1315                 1320             


Val Asp  Arg Pro Glu Phe Lys  Ile His Gly Gly Leu  Trp Asp Gly 
    1325                 1330                 1335             


Arg Thr  Leu Tyr Glu Leu Tyr  Glu Glu Ala His Thr  Pro Phe Glu 
    1340                 1345                 1350             


Trp His  Ala Ala Ile Phe Glu  Arg Ala Arg Gln Arg  Gly Val Thr 
    1355                 1360                 1365             


Ile Phe  Ser Ser Pro Phe Asp  Glu Thr Ala Val Asp  Leu Leu Asp 
    1370                 1375                 1380             


Ser Leu  Gly Ala Pro Ala Phe  Lys Ile Ala Ser Phe  Glu Ala Val 
    1385                 1390                 1395             


Asp Leu  Pro Leu Ile Lys Tyr  Ala Ala Ala Lys Gly  Lys Pro Leu 
    1400                 1405                 1410             


Ile Ile  Ser Thr Gly Met Ala  Asn Leu Thr Glu Met  Gln Thr Ala 
    1415                 1420                 1425             


Leu Asp  Thr Ala Leu Ser Ala  Gly Ala Pro Gly Val  Leu Leu Leu 
    1430                 1435                 1440             


His Cys  Val Ser Ser Tyr Pro  Ala Thr Phe Ala Asp  Ala Asn Val 
    1445                 1450                 1455             


Arg Thr  Val Pro Asp Met Ala  Ala Arg Phe Gly Cys  Pro Ile Gly 
    1460                 1465                 1470             


Leu Ser  Asp His Thr Pro Gly  Thr Ala Ala Ser Val  Ala Ala Val 
    1475                 1480                 1485             


Ser Leu  Gly Ala Cys Ala Val  Glu Lys His Phe Thr  Leu Ala Arg 
    1490                 1495                 1500             


Ala Asp  Gly Gly Pro Asp Ala  Ala Phe Ser Leu Glu  Pro Ala Glu 
    1505                 1510                 1515             


Phe Lys  Ala Leu Val Asp Asp  Thr Lys Asn Ala Trp  Ala Ala Leu 
    1520                 1525                 1530             


Gly Arg  Ala His Tyr Asp Val  Leu Gly Ser Glu Ala  Thr Ser Leu 
    1535                 1540                 1545             


Leu Phe  Arg Arg Ser Leu Tyr  Val Thr Ala Asp Val  Lys Ala Gly 
    1550                 1555                 1560             


Glu Pro  Leu Thr Arg Ala Asn  Val Arg Ser Val Arg  Pro Gly Asn 
    1565                 1570                 1575             


Gly Leu  Pro Pro Ala Asp Leu  Asp Lys Val Leu Ala  Gly Lys Ala 
    1580                 1585                 1590             


Thr Arg  Asp Leu Ala Arg Gly  Glu Pro Leu Asp Trp  Ser Met Val 
    1595                 1600                 1605             


Gly Met  Ile Leu Ala Ile Leu  Gln Ala Arg Met Ser  Ser Thr Arg 
    1610                 1615                 1620             


Leu Pro  Gly Lys Val Leu Met  Pro Leu Gln Arg Gln  Pro Met Ile 
    1625                 1630                 1635             


Val Arg  Gln Ile Glu Arg Val  Ala Arg Ser Lys Arg  Ile Asp Lys 
    1640                 1645                 1650             


Leu Val  Val Ala Thr Ser Asp  Arg Pro Glu Asp Asp  Ala Ile Glu 
    1655                 1660                 1665             


Ala Ala  Val Arg Arg Glu Gly  Ile Ala Val Phe Arg  Gly Ser Leu 
    1670                 1675                 1680             


Asp Asn  Val Gln Gln Arg Phe  Ile Gly Ala Leu Asp  Ala His Pro 
    1685                 1690                 1695             


Ala Asp  His Val Val Arg Leu  Thr Ala Asp Cys Pro  Leu Ala Asp 
    1700                 1705                 1710             


Pro Thr  Leu Ile Asp Ala Thr  Ile Asp Leu Cys Leu  Ser Lys Gly 
    1715                 1720                 1725             


Ala Asp  Tyr Val Ser Asn Thr  Pro Glu Gly His Ala  His Pro Lys 
    1730                 1735                 1740             


Gly Thr  Asp Val Glu Val Met  Thr Ala Ala Ala Leu  Arg Arg Ala 
    1745                 1750                 1755             


Ala Ala  Glu Ala Thr Thr Lys  Glu Ala Phe Glu His  Val Thr Trp 
    1760                 1765                 1770             


Asp Leu  Trp Asn Gln Pro Gln  Arg Trp Thr Cys Ala  Trp Leu Pro 
    1775                 1780                 1785             


Cys Phe  Pro Asp Gln Gly Ala  Val Arg Trp Thr Val  Asp Arg Pro 
    1790                 1795                 1800             


Asp Asp  Tyr Ala Phe Val Ala  Ala Val Tyr Asp Ala  Leu Tyr Pro 
    1805                 1810                 1815             


Ala Asn  Arg Ala Phe Thr Ser  Asp Asp Ile Arg Ala  Phe Val Ala 
    1820                 1825                 1830             


Gly Arg  Pro Asp Leu Gln Asp  Tyr Gly Gly Asp Arg  Arg Ala 
    1835                 1840                 1845         


<210>  26
<211>  1806
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic sequence

<400>  26
atgtcacgta agagcgcctt ggaaagttcc gcgtcagtct tagctcaggc ggatgtgggc       60

gctagtggca ttcacccttc ggtcatcgcg gacgcgatgg gcgactcagc gtcagccgag      120

gcgttagagc gcttgaatcg tgcggcacag gacacaaaaa atgtcgataa cgcgaaacac      180

ctggctcgcg caattcaagc tgttcaatta caagactacg caaaggcaga caagcttgca      240

ttgaaactgt tggaaaaaga cgagcgcctg ggccttgcct ggcacattct tgccattgct      300

cgcgaaaaaa cgggcgattt cgcttctagc cttcgcgcct acgaggctgc acttgcgctg      360

cttccggatc atggcccggt tgcgggcgac cttggtcgtc ttgcatttcg tatgaacatg      420

ccggagctgg ccgccaagtt cttcgctcac taccgtctgg cgcgtcccga tgacgtggaa      480

ggggcaaata acctggcctg tgccctgcgt gaacttaatc gcgagagcga agcaattgaa      540

gttttgaaag ccgctctggg cgcaaaccca gaggcagcag tgctgtggaa cacattaggg      600

acagtgttgt gcaacatcgg cgatgcggct ggatctatcg tcttctttga tgaatcactg      660

cgcttagccc ctgacttctc gaaagcttac cacaaccgcg cgttcgcccg cttagatctt      720

ggggagattg aagcggcgtt ggcggactgt gaagctgcca tgcgtagtcc cggctcacca      780

gaggacctgg caatgatgca gttcgctcgc gccacaattt tactggcact tgggcgcgtt      840

ggcgaagggt gggaagcgta cgagtcacgc ttttcccccg cattaagcga cgcacctcgt      900

ttccaaattc cgggaacccg ttggtcgggg caagaccttg ctggaaagac actgatgatc      960

tcaactgaac aagggttagg cgacgaagtg atgtttgcag gtatgttacc cgacatcctg     1020

gaacgtcttg gaccagacgg gtccctgtct ctggcagttg agcgccgcct gattccgctg     1080

tttcagcgta gctttcctgg tattgaagtg actgcccacc gcacggttgc atacgaaggt     1140

cgtacctatc gtgctgcccc agagattgag gattgggatc gcttcgatta ttgggccgcg     1200

attggagact tccttccctc cttacgcggg agtgttgaag cattcccgcg tcgcgatcac     1260

tacttgacgc cagatccgga gcgcgttgct cattggaagg cggaacttga aaagttgggc     1320

cctgccccta aagtagggtt attgtggaag agtctgaaat tgggagcaga acgcggacgt     1380

cagttctccc ctttcgaagc atggcgcgca gtacttcaaa cccctggcgc ggttttcgtg     1440

aatctgcagt atggtgattg tgacgaagag atcgcctacg caaaggagac gttcggagtt     1500

gaaatttggc agcccccagg tattgattta aaaaaggatt tggacgatgt tgcagcgtta     1560

tgtgcagcag tcgacttgat tatcggcttc tctaacgcaa ccattaattt ggcgggtgca     1620

gtaggtgcac cgatttggat gatgacagct cctaaggcat ggactaaatt aggaactgat     1680

cgttatccct ggtacccgca ggcgcaggta ttttcaccgg cagattttag cgactgggag     1740

ccagtcatgg aagaagtagc ccgtgcattg gcagctaaaa ttgcgggttg aattctagac     1800

tcgaga                                                                1806


<210>  27
<211>  596
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic sequence

<400>  27

Met Ser Arg Lys Ser Ala Leu Glu Ser Ser Ala Ser Val Leu Ala Gln 
1               5                   10                  15      


Ala Asp Val Gly Ala Ser Gly Ile His Pro Ser Val Ile Ala Asp Ala 
            20                  25                  30          


Met Gly Asp Ser Ala Ser Ala Glu Ala Leu Glu Arg Leu Asn Arg Ala 
        35                  40                  45              


Ala Gln Asp Thr Lys Asn Val Asp Asn Ala Lys His Leu Ala Arg Ala 
    50                  55                  60                  


Ile Gln Ala Val Gln Leu Gln Asp Tyr Ala Lys Ala Asp Lys Leu Ala 
65                  70                  75                  80  


Leu Lys Leu Leu Glu Lys Asp Glu Arg Leu Gly Leu Ala Trp His Ile 
                85                  90                  95      


Leu Ala Ile Ala Arg Glu Lys Thr Gly Asp Phe Ala Ser Ser Leu Arg 
            100                 105                 110         


Ala Tyr Glu Ala Ala Leu Ala Leu Leu Pro Asp His Gly Pro Val Ala 
        115                 120                 125             


Gly Asp Leu Gly Arg Leu Ala Phe Arg Met Asn Met Pro Glu Leu Ala 
    130                 135                 140                 


Ala Lys Phe Phe Ala His Tyr Arg Leu Ala Arg Pro Asp Asp Val Glu 
145                 150                 155                 160 


Gly Ala Asn Asn Leu Ala Cys Ala Leu Arg Glu Leu Asn Arg Glu Ser 
                165                 170                 175     


Glu Ala Ile Glu Val Leu Lys Ala Ala Leu Gly Ala Asn Pro Glu Ala 
            180                 185                 190         


Ala Val Leu Trp Asn Thr Leu Gly Thr Val Leu Cys Asn Ile Gly Asp 
        195                 200                 205             


Ala Ala Gly Ser Ile Val Phe Phe Asp Glu Ser Leu Arg Leu Ala Pro 
    210                 215                 220                 


Asp Phe Ser Lys Ala Tyr His Asn Arg Ala Phe Ala Arg Leu Asp Leu 
225                 230                 235                 240 


Gly Glu Ile Glu Ala Ala Leu Ala Asp Cys Glu Ala Ala Met Arg Ser 
                245                 250                 255     


Pro Gly Ser Pro Glu Asp Leu Ala Met Met Gln Phe Ala Arg Ala Thr 
            260                 265                 270         


Ile Leu Leu Ala Leu Gly Arg Val Gly Glu Gly Trp Glu Ala Tyr Glu 
        275                 280                 285             


Ser Arg Phe Ser Pro Ala Leu Ser Asp Ala Pro Arg Phe Gln Ile Pro 
    290                 295                 300                 


Gly Thr Arg Trp Ser Gly Gln Asp Leu Ala Gly Lys Thr Leu Met Ile 
305                 310                 315                 320 


Ser Thr Glu Gln Gly Leu Gly Asp Glu Val Met Phe Ala Gly Met Leu 
                325                 330                 335     


Pro Asp Ile Leu Glu Arg Leu Gly Pro Asp Gly Ser Leu Ser Leu Ala 
            340                 345                 350         


Val Glu Arg Arg Leu Ile Pro Leu Phe Gln Arg Ser Phe Pro Gly Ile 
        355                 360                 365             


Glu Val Thr Ala His Arg Thr Val Ala Tyr Glu Gly Arg Thr Tyr Arg 
    370                 375                 380                 


Ala Ala Pro Glu Ile Glu Asp Trp Asp Arg Phe Asp Tyr Trp Ala Ala 
385                 390                 395                 400 


Ile Gly Asp Phe Leu Pro Ser Leu Arg Gly Ser Val Glu Ala Phe Pro 
                405                 410                 415     


Arg Arg Asp His Tyr Leu Thr Pro Asp Pro Glu Arg Val Ala His Trp 
            420                 425                 430         


Lys Ala Glu Leu Glu Lys Leu Gly Pro Ala Pro Lys Val Gly Leu Leu 
        435                 440                 445             


Trp Lys Ser Leu Lys Leu Gly Ala Glu Arg Gly Arg Gln Phe Ser Pro 
    450                 455                 460                 


Phe Glu Ala Trp Arg Ala Val Leu Gln Thr Pro Gly Ala Val Phe Val 
465                 470                 475                 480 


Asn Leu Gln Tyr Gly Asp Cys Asp Glu Glu Ile Ala Tyr Ala Lys Glu 
                485                 490                 495     


Thr Phe Gly Val Glu Ile Trp Gln Pro Pro Gly Ile Asp Leu Lys Lys 
            500                 505                 510         


Asp Leu Asp Asp Val Ala Ala Leu Cys Ala Ala Val Asp Leu Ile Ile 
        515                 520                 525             


Gly Phe Ser Asn Ala Thr Ile Asn Leu Ala Gly Ala Val Gly Ala Pro 
    530                 535                 540                 


Ile Trp Met Met Thr Ala Pro Lys Ala Trp Thr Lys Leu Gly Thr Asp 
545                 550                 555                 560 


Arg Tyr Pro Trp Tyr Pro Gln Ala Gln Val Phe Ser Pro Ala Asp Phe 
                565                 570                 575     


Ser Asp Trp Glu Pro Val Met Glu Glu Val Ala Arg Ala Leu Ala Ala 
            580                 585                 590         


Lys Ile Ala Gly 
        595     


<210>  28
<211>  10397
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic sequence

<400>  28
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca       60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg      120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc      180

accaaatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc      240

attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcatcgctat      300

tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt      360

tttcccagtc acgacgttgt aaaacgacgg ccagtgcaac gcgatgacga tggatagcga      420

ttcatcgatg agctgacccg atcgccgccg ccggagggtt gcgtttgaga cgggcgacag      480

atggatccgg atgtgagcgg ataacaatta cgagcttcat gcacagtgaa atcatgaaaa      540

atttattggc tttgtgagcg gataacaatt ataatatgtg gaaagaagga gataccatat      600

ggggcgtttt agcccaaaga gtttggatct ggacggaaag gttatcttgg taacgggcgg      660

tactggaagc ttcgggcgtc gtttcatcga gactgtcttg cgccgttacg atccccgcaa      720

agttatcgtc tattcgcgcg atgaattaaa acagagtgac atgcaaattg agcttcgcga      780

gcaattcgat gaggccaccg tagcaaagat gcgttttttt ctgggcgacg tgcgtgatcg      840

tgagcgttta acgttagcgc ttcgtggagt cgacattgtc attcatgcag ccgcacttaa      900

acaggtacca gcggcagaat ataatccctc cgaatgtatc cacacgaatg tgttgggtgc      960

ggaaaacgta gtatgggcgt cactggctaa cgccgttaag caggtggtcg ccttatctac     1020

ggacaaagct tgtaatccga ctaacctgta tggtgcaacg aagttggcct ctgacaagac     1080

gttcgtggct gccaacaatc tgagtggaga catcgggacc cgcttttgcg tggttcgcta     1140

tggtaacgta gtcgggtctc gcggctcagt agtaccactt tatcgtcgtc tgttgagcca     1200

aggggcgacg gagttgccag tcacggaccc tcgcatgacc cgcttctgga ttacgttgaa     1260

tgagggcgtg gacttcgtac tttcttcatt gaccatgatg cgcggaggcg agatttttgt     1320

gccgaagatc cccagtatgg caatgcctga tttggtaaaa gccatgtcta gcactgctgc     1380

aatgaaggta atcggtatcc gcccaggaga gaaacttcat gaaatcatga tcagcgcgga     1440

tgatgcccgc agcaccgtgg agttcgatga ccgctatgca atcgaaccga atttcgcaga     1500

atttggccgt gagccctacg cagcaagtga cggcgctaaa cccgtggccg aggacttcag     1560

ctacagctca gacaataatc atgactggtt gtctcccgaa ggcttgttag ccatgttaga     1620

agagaaggcc acgtgaagat ctcccccggg aagaaggaga tataccatga caggcggatt     1680

tttaccttat gggcgtcaga ctattgagga ggatgacatc gctgcggtag cggaagcatt     1740

gcgcggcgac tttctgacga ctggccctac agtggaagct ttcgagacag cgttcgccgc     1800

taaagtcggc gctgatcacg caatcgcggt atcgaacgga acagctacct tgcaccttgc     1860

catgatggct cttggtattg gtgaaggtga tgtatgcgta gcaccaagtg tgacttttct     1920

ggcaaccgct aactgtgcac gttatgtagg tgcggaagta gtgttcgccg acgttgaccc     1980

ggacagtggt cttatgacac cagacaccct ggcgcgcgct ttggcaggtg cacgtgataa     2040

gcgtgttaaa gctgtacttc cagtacatct gcgtggggac gtatgtgatc ttcccgcgtt     2100

gaaagcaatg gcatcagcga gcggcgccgt gcttgtggaa gatgccccgc atgccctggg     2160

ttcgatcgct acctttgatg gcgtagcgca tccagtcgga gatggtgcgt acagttcatt     2220

cgcaagtttc tcctttcacc ccgtaaagac gctggccaca ggggagggag ggatgttgac     2280

caccaacgac cccgcactgg ccgcaaaggc gcgtttgctt cgcagtcacg ggatggtccg     2340

ccagccgggt ggagatccgt ggtggtacga gatgcccgaa ctgggattca attaccgcat     2400

tcctgatgtt ttatgtgcct taggtttatc ccaactggcg aaacttgacc gttttgttgc     2460

acgccgtcgt gaccttactg ccctttacgc tcgcttattg gcggagcgcg ctccccgtgc     2520

gcgtttagcc accagcccgg accactcaga cgctgcctta cacctgttga cggttttaat     2580

tgatttcgag gccgagggta tttcccgccg taccgtagtt gaatccctta aaactcaagg     2640

agtaggaacg caggtgcact acatcccggt gcaccgtcag ccatattatg cacagcgcta     2700

cggggtcgcc gacttgcccg gagctgacgc gtggtacgcc cgttgcttaa ccttgccgct     2760

gtatccagct atgactaatg gagacgttga gcgcgttgtc ggtgctttag ccactgtttt     2820

agggtgagct agcggagctc aagaaggaga tataccatgg ttgcggtgcg ccttcgtaac     2880

ctggtcgaat ctgatcgcga acgccttctt atttggcgca acagtccaga tgttagcgca     2940

tatatgtact cagatcataa gattggtcac gaagaacatg accactggtt cgacgtcgcg     3000

cgtcatgacc cacgtcgtcg ctactggatt atcgaggctg acggggagcc ggtcggtctt     3060

gccaatcttg ctgacattga tttggttcac cgtcgctgtg cttgggccta ctacttggca     3120

agccccaaag tgcgtggact gggtgtcggc agttttgttg agttccaaat tatcgaatac     3180

gttttcaatc agttgcacct gaacaaattg tggtgcgaag tccttatcag taatgaatcc     3240

gtatggcgtc tgcatgaact ttacggcttc cagcgcgagg ctttatttcg ccagcatgtt     3300

atgaaacagg gccatgaagt ggacgtaatt ggtttaggac tgcttgccag tgactgggcc     3360

gctcgccgcg atgccatggc cgaacgcttg tgtgcgaaag gatatacaat ccccgacttg     3420

acctgccgcg cggcctgaga tatcgcggcc gcaagaagga gatataccat gtccttacgc     3480

atcgtctttg tatgcgcagc cggcccatct gtgggtggtg gacatgtcat gcgttccttg     3540

acacttgcac gcgcgttagc ggcgcgcgga gcgacatgtg cgtttttggg aacacccgag     3600

gtagcagcag tcttagacgc cttcggtcct gatatggcgc gtgccgacac cgccgagccc     3660

ttcgaagctg tagtctttga ctcctatgca cttaccgcgg acgaccatcg ccgtatcgcg     3720

gcgggacgtc ccgcgttagt aatcgacgat ttagccgacc gccctcttgc agcagacctg     3780

gtgcttgatg ctggaccggc tcgccgcgcc gaggattacg caggactggt gcccgcacat     3840

gcacgtcttc tgctgggtcc gaatcacgca ccggtccgtc cagcttttgt tgcgttacgc     3900

gaggcagcct tagcacgccg tgcgcagcag ggaccggtac gtcgcattct tgtatctctg     3960

ggcatgacgg acgtgggggg aattacagga cgtgtggtcg cacttcttgc cccaatcctt     4020

ggggaggtca ctctggatct tgtggtggga gcgggagccc cgagcttgcc tgctctgcgt     4080

gcattagccg ctgaagaccc tcgccttgtt cttcatattg acacgcagga tatgccacgc     4140

cttgttcttg aagccgactt ggccatcggc gcaggaggtt ccacgacgtg ggagcgctgt     4200

gtccttgcct tgccagcttt gactcttatc ttagccgata accaaattgc cgcggcacgt     4260

gctcttgaag cagctggcgt aaccccttgt ttggacgtaa cagccccgga ttttgacacg     4320

gcctttgcag ctcttgcgca gaacctgatt gctgatccgg atcgtcgtgc cgcacttagt     4380

gctgcctcag ctacggtctg tgatggacgt ggcgcggagc gcgtggctga agcattcttg     4440

ggagtcacca ccacatgaca attgctgcag aagaaggaga tataccatgt ccgctccgag     4500

taccgagatc cccccttcga ttgagattgc tggacgcaag atcggggccg atcacagccc     4560

ctacgttatc tgtgagttgt cgggcaatca taatgggtcc cttgaacgtt gcttagctat     4620

ggtagacgct gccgcagata ccggatgcga cgccatcaaa attcaaactt acacagccga     4680

cacaatcacc ttggatgtag atcgtccgga gttcaaaatc cacgggggat tgtgggatgg     4740

acgcactctg tatgagcttt atgaggaagc tcatactccc tttgagtggc acgcggccat     4800

cttcgaacgc gctcgtcagc gcggtgtcac gattttttct tctccatttg acgagactgc     4860

cgtcgacctt ttagattcgc tgggggcgcc agcttttaaa attgcaagct ttgaagcggt     4920

agaccttccg cttatcaaat acgcggcagc caaagggaaa cccttaatta tttccactgg     4980

aatggcgaac cttacggaga tgcaaaccgc ccttgataca gctttgtcag caggcgctcc     5040

gggagtgtta cttttacact gtgtttcttc ataccctgct acgttcgcag acgcgaacgt     5100

ccgcaccgtg ccggatatgg cggcacgctt cggatgcccg attggccttt ccgatcacac     5160

gcccggtaca gcagctagtg tcgccgctgt gagcttaggg gcgtgtgcag tagaaaaaca     5220

tttcacgctt gcccgtgccg atggtggtcc ggacgccgca ttctctcttg aacctgcgga     5280

gtttaaggca ttagttgatg acacaaagaa tgcttgggct gcgttgggac gtgcacacta     5340

cgatgtgctg gggtcagagg caacatcact gttattccgc cgttctctgt acgttacagc     5400

cgacgtgaag gctggcgaac ctttaacgcg cgcgaatgtt cgttcagtgc gtcccggcaa     5460

tgggttgcca cctgcggatt tggataaagt tctggcggga aaggcaaccc gcgatcttgc     5520

gcgcggcgag cctcttgact ggtcaatggt cggttgaaac tagtacttaa gaagaaggag     5580

atataccatg atcttggcca tcctgcaagc ccgcatgtca tccacgcgcc ttcctggcaa     5640

ggtactgatg cccctgcaac gccagcccat gatcgttcgt cagatcgaac gtgttgcccg     5700

ctcaaaacgc attgataagt tagttgtggc cacgtcggac cgcccagagg acgatgcaat     5760

cgaagcagcc gttcgccgtg aaggtattgc ggtgtttcgc gggtcattag acaatgtcca     5820

gcagcgtttt attggggcat tggatgccca ccctgccgac catgtagtac gtctgaccgc     5880

cgattgtccc cttgccgacc cgacacttat tgatgccaca atcgatttat gtctttcaaa     5940

gggcgcggac tacgtatcta atacaccgga gggtcacgcc catccaaaag ggaccgatgt     6000

agaggtaatg accgcagcgg cattgcgtcg cgcggctgct gaagccacga ccaaagaagc     6060

atttgaacac gttacttggg acctttggaa ccaacctcaa cgctggacgt gtgcatggtt     6120

gccgtgcttc ccagatcaag gagcggtacg ctggactgtg gatcgtccgg atgattatgc     6180

ttttgtcgct gctgtatacg atgccctgta cccagcaaat cgcgccttta cgtcggatga     6240

catccgtgcg tttgtcgctg gtcgccccga cctgcaagat tatggtggtg atcgccgtgc     6300

atgagaattc aggaggtaaa aaaatgtccc gtaaaagcgc cctggaatcc tcggcaagcg     6360

tcctggccca ggccgatgtc ggcgcttcgg gcatccaccc cagcgttatc gccgacgcca     6420

tgggcgattc ggcgtccgcc gaggcgctgg agcgcctgaa tcgggcggcg caggacacca     6480

agaacgtcga caacgccaag cacttggcgc gcgcgatcca ggccgtgcag ctgcaggact     6540

acgccaaggc cgacaagctg gccctgaagc tgctggagaa ggacgagcga ctgggcctag     6600

cctggcacat cctggcgatt gcacgcgaga agaccggcga tttcgcctcc tcgctgcggg     6660

cctatgaagc cgcgctggct ctgctgcccg accatggccc cgtcgccggc gacctgggcc     6720

gcttggcctt ccgcatgaac atgccggagc tggcggccaa gttcttcgca cactaccgtc     6780

tcgctcggcc cgacgacgtc gagggcgcca acaacctggc gtgcgccctg cgcgagctta     6840

atcgcgaaag cgaagccatc gaagtcctca aggccgccct gggcgccaac cccgaggctg     6900

cggtgctgtg gaacacgctg ggcacggtgc tttgcaatat cggcgacgcg gcgggctcga     6960

tcgtgttctt cgacgagtcc ctgcgcctcg cgcccgactt ttcgaaagcc tatcacaacc     7020

gcgccttcgc caggctcgat ctgggcgaga tagaggccgc gctggccgat tgcgaagccg     7080

ccatgcgcag ccccggctca ccggaagatc tggcgatgat gcagttcgcc cgcgccacga     7140

ttctcctggc tctgggccgc gtcggcgaag gctgggaggc ttatgagtca cgcttctcgc     7200

cggcgctgag cgacgcgcca cggttccaga ttcctggcgt ccgctggtca ggacaggacc     7260

tcaggggcaa gcgtttgatg atcaccaccg agcagggcct cggcgacgag gtgatgttcg     7320

ccaacatgtt gcccgacatc gtcgaagcct tgggcccaga cggcttcctg tccctggcgg     7380

tcgagcgccg tctggcgccg ctgttcgagc gcaccttccc gaaggtcgag gtgaccgccc     7440

accgtacgat cgcctacgaa ggccgcgtgt tccgggccgc gccctatatc gagaactggg     7500

accgcttcga ctattgggcg gccatcggcg acttcctgcc gagccttcgc cccaccgccg     7560

aggcgtttcc caagcgcaac gccttcctgc agccggatcc ggcgcgggtg gcccactgga     7620

aggcccaact cgagaagctt ggccccggcc cgaaagtcgg cctgctctgg aagagcctga     7680

aactgaacgc ggaacgcgcg cggcagtttt cgcccttcca cctgtgggag ccggttttgc     7740

acacgccagg cgtggtgttc gtgaacctgc agtatggcga ctgcgaggaa gagatcgcct     7800

tcgccaagga agagctgggc gtggagatct ggcagccgga aggcattgat ctgaaggccg     7860

acctcgacga cgtggccgct ctctgcgcgg cggtggacct ggtgatcggg ttctccaacg     7920

ccacgatcaa tctggccggt gcggtgggga cgccgatctt catgctgacc ggcgcctcgt     7980

cctggacccg cctcggcacc gaatattacc cctggtatcc gagcgttcgc tgcttcgtca     8040

ccgagcagta cggggtctgg gaaccgacca tgggtcgcgt cgccaccgct ctgcgcgatt     8100

tcgccgcatc ctgatctaga actcgagatc agttctggac cagcgagctg tgctgcgact     8160

cgtggcgtaa tcatggtcat agctgtttcc tgtgtgaaat tgttatccgc tcacaattcc     8220

acacaacata cgagccggaa gcataaagtg taaagcctgg ggtgcctaat gagtgagcta     8280

actcacatta attgcgttgc gctcactgcc cgctttccag tcgggaaacc tgtcgtgcca     8340

gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg ggcgctcttc     8400

cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc     8460

tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat     8520

gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt     8580

ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg     8640

aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc     8700

tcctgttccg accctgtcgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt     8760

ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa     8820

gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta     8880

tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa     8940

caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa     9000

ctacggctac actagaagaa cagtatttgg tatctgcgct ctgctgaagc cagttacctt     9060

cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt     9120

ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat     9180

cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat     9240

gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc     9300

aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc     9360

acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc ccgtcgtgta     9420

gataactacg atacgggagg gcttaccatc tggccccagt gctgcaatga taccgcgaga     9480

cccacgctca ccggctccag atttatcagc aataaaccag ccagccggaa gggccgagcg     9540

cagaagtggt cctgcaactt tatccgcctc catccagtct attaattgtt gccgggaagc     9600

tagagtaagt agttcgccag ttaatagttt gcgcaacgtt gttgccattg ctacaggcat     9660

cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc aacgatcaag     9720

gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg gtcctccgat     9780

cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag cactgcataa     9840

ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt actcaaccaa     9900

gtcattctga gaatagtgta tgcggcgacc gagttgctct tgcccggcgt caatacggga     9960

taataccgcg ccacatagca gaactttaaa agtgctcatc attggaaaac gttcttcggg    10020

gcgaaaactc tcaaggatct taccgctgtt gagatccagt tcgatgtaac ccactcgtgc    10080

acccaactga tcttcagcat cttttacttt caccagcgtt tctgggtgag caaaaacagg    10140

aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa tactcatact    10200

ctaccttttt caatattatt gaagcattta tcagggttat tgtctcatga gcggatacat    10260

atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc cccgaaaagt    10320

gccacctgac gtctaagaaa ccattattat catgacatta acctataaaa ataggcgtat    10380

cacgaggccc tttcgtc                                                   10397


<210>  29
<211>  10397
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic sequence

<400>  29
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca       60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg      120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc      180

accaaatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc      240

attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcatcgctat      300

tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt      360

tttcccagtc acgacgttgt aaaacgacgg ccagtgcaac gcgatgacga tggatagcga      420

ttcatcgatg agctgacccg atcgccgccg ccggagggtt gcgtttgaga cgggcgacag      480

atggatccgg atgtgagcgg ataacaatta cgagcttcat gcacagtgaa atcatgaaaa      540

atttattggc tttgtgagcg gataacaatt ataatatgtg gaaagaagga gataccatat      600

ggggcgtttt agcccaaaga gtttggatct ggacggaaag gttatcttgg taacgggcgg      660

tactggaagc ttcgggcgtc gtttcatcga gactgtcttg cgccgttacg atccccgcaa      720

agttatcgtc tattcgcgcg atgaattaaa acagagtgac atgcaaattg agcttcgcga      780

gcaattcgat gaggccaccg tagcaaagat gcgttttttt ctgggcgacg tgcgtgatcg      840

tgagcgttta acgttagcgc ttcgtggagt cgacattgtc attcatgcag ccgcacttaa      900

acaggtacca gcggcagaat ataatccctc cgaatgtatc cacacgaatg tgttgggtgc      960

ggaaaacgta gtatgggcgt cactggctaa cgccgttaag caggtggtcg ccttatctac     1020

ggacaaagct tgtaatccga ctaacctgta tggtgcaacg aagttggcct ctgacaagac     1080

gttcgtggct gccaacaatc tgagtggaga catcgggacc cgcttttgcg tggttcgcta     1140

tggtaacgta gtcgggtctc gcggctcagt agtaccactt tatcgtcgtc tgttgagcca     1200

aggggcgacg gagttgccag tcacggaccc tcgcatgacc cgcttctgga ttacgttgaa     1260

tgagggcgtg gacttcgtac tttcttcatt gaccatgatg cgcggaggcg agatttttgt     1320

gccgaagatc cccagtatgg caatgcctga tttggtaaaa gccatgtcta gcactgctgc     1380

aatgaaggta atcggtatcc gcccaggaga gaaacttcat gaaatcatga tcagcgcgga     1440

tgatgcccgc agcaccgtgg agttcgatga ccgctatgca atcgaaccga atttcgcaga     1500

atttggccgt gagccctacg cagcaagtga cggcgctaaa cccgtggccg aggacttcag     1560

ctacagctca gacaataatc atgactggtt gtctcccgaa ggcttgttag ccatgttaga     1620

agagaaggcc acgtgaagat ctcccccggg aagaaggaga tataccatga caggcggatt     1680

tttaccttat gggcgtcaga ctattgagga ggatgacatc gctgcggtag cggaagcatt     1740

gcgcggcgac tttctgacga ctggccctac agtggaagct ttcgagacag cgttcgccgc     1800

taaagtcggc gctgatcacg caatcgcggt atcgaacgga acagctacct tgcaccttgc     1860

catgatggct cttggtattg gtgaaggtga tgtatgcgta gcaccaagtg tgacttttct     1920

ggcaaccgct aactgtgcac gttatgtagg tgcggaagta gtgttcgccg acgttgaccc     1980

ggacagtggt cttatgacac cagacaccct ggcgcgcgct ttggcaggtg cacgtgataa     2040

gcgtgttaaa gctgtacttc cagtacatct gcgtggggac gtatgtgatc ttcccgcgtt     2100

gaaagcaatg gcatcagcga gcggcgccgt gcttgtggaa gatgccccgc atgccctggg     2160

ttcgatcgct acctttgatg gcgtagcgca tccagtcgga gatggtgcgt acagttcatt     2220

cgcaagtttc tcctttcacc ccgtaaagac gctggccaca ggggagggag ggatgttgac     2280

caccaacgac cccgcactgg ccgcaaaggc gcgtttgctt cgcagtcacg ggatggtccg     2340

ccagccgggt ggagatccgt ggtggtacga gatgcccgaa ctgggattca attaccgcat     2400

tcctgatgtt ttatgtgcct taggtttatc ccaactggcg aaacttgacc gttttgttgc     2460

acgccgtcgt gaccttactg ccctttacgc tcgcttattg gcggagcgcg ctccccgtgc     2520

gcgtttagcc accagcccgg accactcaga cgctgcctta cacctgttga cggttttaat     2580

tgatttcgag gccgagggta tttcccgccg taccgtagtt gaatccctta aaactcaagg     2640

agtaggaacg caggtgcact acatcccggt gcaccgtcag ccatattatg cacagcgcta     2700

cggggtcgcc gacttgcccg gagctgacgc gtggtacgcc cgttgcttaa ccttgccgct     2760

gtatccagct atgactaatg gagacgttga gcgcgttgtc ggtgctttag ccactgtttt     2820

agggtgagct agcggagctc aagaaggaga tataccatgg ttgcggtgcg ccttcgtaac     2880

ctggtcgaat ctgatcgcga acgccttctt atttggcgca acagtccaga tgttagcgca     2940

tatatgtact cagatcataa gattggtcac gaagaacatg accactggtt cgacgtcgcg     3000

cgtcatgacc cacgtcgtcg ctactggatt atcgaggctg acggggagcc ggtcggtctt     3060

gccaatcttg ctgacattga tttggttcac cgtcgctgtg cttgggccta ctacttggca     3120

agccccaaag tgcgtggact gggtgtcggc agttttgttg agttccaaat tatcgaatac     3180

gttttcaatc agttgcacct gaacaaattg tggtgcgaag tccttatcag taatgaatcc     3240

gtatggcgtc tgcatgaact ttacggcttc cagcgcgagg ctttatttcg ccagcatgtt     3300

atgaaacagg gccatgaagt ggacgtaatt ggtttaggac tgcttgccag tgactgggcc     3360

gctcgccgcg atgccatggc cgaacgcttg tgtgcgaaag gatatacaat ccccgacttg     3420

acctgccgcg cggcctgaga tatcgcggcc gcaagaagga gatataccat gtccttacgc     3480

atcgtctttg tatgcgcagc cggcccatct gtgggtggtg gacatgtcat gcgttccttg     3540

acacttgcac gcgcgttagc ggcgcgcgga gcgacatgtg cgtttttggg aacacccgag     3600

gtagcagcag tcttagacgc cttcggtcct gatatggcgc gtgccgacac cgccgagccc     3660

ttcgaagctg tagtctttga ctcctatgca cttaccgcgg acgaccatcg ccgtatcgcg     3720

gcgggacgtc ccgcgttagt aatcgacgat ttagccgacc gccctcttgc agcagacctg     3780

gtgcttgatg ctggaccggc tcgccgcgcc gaggattacg caggactggt gcccgcacat     3840

gcacgtcttc tgctgggtcc gaatcacgca ccggtccgtc cagcttttgt tgcgttacgc     3900

gaggcagcct tagcacgccg tgcgcagcag ggaccggtac gtcgcattct tgtatctctg     3960

ggcatgacgg acgtgggggg aattacagga cgtgtggtcg cacttcttgc cccaatcctt     4020

ggggaggtca ctctggatct tgtggtggga gcgggagccc cgagcttgcc tgctctgcgt     4080

gcattagccg ctgaagaccc tcgccttgtt cttcatattg acacgcagga tatgccacgc     4140

cttgttcttg aagccgactt ggccatcggc gcaggaggtt ccacgacgtg ggagcgctgt     4200

gtccttgcct tgccagcttt gactcttatc ttagccgata accaaattgc cgcggcacgt     4260

gctcttgaag cagctggcgt aaccccttgt ttggacgtaa cagccccgga ttttgacacg     4320

gcctttgcag ctcttgcgca gaacctgatt gctgatccgg atcgtcgtgc cgcacttagt     4380

gctgcctcag ctacggtctg tgatggacgt ggcgcggagc gcgtggctga agcattcttg     4440

ggagtcacca ccacatgaca attgctgcag aagaaggaga tataccatgt ccgctccgag     4500

taccgagatc cccccttcga ttgagattgc tggacgcaag atcggggccg atcacagccc     4560

ctacgttatc tgtgagttgt cgggcaatca taatgggtcc cttgaacgtt gcttagctat     4620

ggtagacgct gccgcagata ccggatgcga cgccatcaaa attcaaactt acacagccga     4680

cacaatcacc ttggatgtag atcgtccgga gttcaaaatc cacgggggat tgtgggatgg     4740

acgcactctg tatgagcttt atgaggaagc tcatactccc tttgagtggc acgcggccat     4800

cttcgaacgc gctcgtcagc gcggtgtcac gattttttct tctccatttg acgagactgc     4860

cgtcgacctt ttagattcgc tgggggcgcc agcttttaaa attgcaagct ttgaagcggt     4920

agaccttccg cttatcaaat acgcggcagc caaagggaaa cccttaatta tttccactgg     4980

aatggcgaac cttacggaga tgcaaaccgc ccttgataca gctttgtcag caggcgctcc     5040

gggagtgtta cttttacact gtgtttcttc ataccctgct acgttcgcag acgcgaacgt     5100

ccgcaccgtg ccggatatgg cggcacgctt cggatgcccg attggccttt ccgatcacac     5160

gcccggtaca gcagctagtg tcgccgctgt gagcttaggg gcgtgtgcag tagaaaaaca     5220

tttcacgctt gcccgtgccg atggtggtcc ggacgccgca ttctctcttg aacctgcgga     5280

gtttaaggca ttagttgatg acacaaagaa tgcttgggct gcgttgggac gtgcacacta     5340

cgatgtgctg gggtcagagg caacatcact gttattccgc cgttctctgt acgttacagc     5400

cgacgtgaag gctggcgaac ctttaacgcg cgcgaatgtt cgttcagtgc gtcccggcaa     5460

tgggttgcca cctgcggatt tggataaagt tctggcggga aaggcaaccc gcgatcttgc     5520

gcgcggcgag cctcttgact ggtcaatggt cggttgaaac tagtacttaa gaagaaggag     5580

atataccatg atcttggcca tcctgcaagc ccgcatgtca tccacgcgcc ttcctggcaa     5640

ggtactgatg cccctgcaac gccagcccat gatcgttcgt cagatcgaac gtgttgcccg     5700

ctcaaaacgc attgataagt tagttgtggc cacgtcggac cgcccagagg acgatgcaat     5760

cgaagcagcc gttcgccgtg aaggtattgc ggtgtttcgc gggtcattag acaatgtcca     5820

gcagcgtttt attggggcat tggatgccca ccctgccgac catgtagtac gtctgaccgc     5880

cgattgtccc cttgccgacc cgacacttat tgatgccaca atcgatttat gtctttcaaa     5940

gggcgcggac tacgtatcta atacaccgga gggtcacgcc catccaaaag ggaccgatgt     6000

agaggtaatg accgcagcgg cattgcgtcg cgcggctgct gaagccacga ccaaagaagc     6060

atttgaacac gttacttggg acctttggaa ccaacctcaa cgctggacgt gtgcatggtt     6120

gccgtgcttc ccagatcaag gagcggtacg ctggactgtg gatcgtccgg atgattatgc     6180

ttttgtcgct gctgtatacg atgccctgta cccagcaaat cgcgccttta cgtcggatga     6240

catccgtgcg tttgtcgctg gtcgccccga cctgcaagat tatggtggtg atcgccgtgc     6300

atgagaattc aggaggtaaa aaaatgtcac gtaagagcgc cttggaaagt tccgcgtcag     6360

tcttagctca ggcggatgtg ggcgctagtg gcattcaccc ttcggtcatc gcggacgcga     6420

tgggcgactc agcgtcagcc gaggcgttag agcgcttgaa tcgtgcggca caggacacaa     6480

aaaatgtcga taacgcgaaa cacctggctc gcgcaattca agctgttcaa ttacaagact     6540

acgcaaaggc agacaagctt gcattgaaac tgttggaaaa agacgagcgc ctgggccttg     6600

cctggcacat tcttgccatt gctcgcgaaa aaacgggcga tttcgcttct agccttcgcg     6660

cctacgaggc tgcacttgcg ctgcttccgg atcatggccc ggttgcgggc gaccttggtc     6720

gtcttgcatt tcgtatgaac atgccggagc tggccgccaa gttcttcgct cactaccgtc     6780

tggcgcgtcc cgatgacgtg gaaggggcaa ataacctggc ctgtgccctg cgtgaactta     6840

atcgcgagag cgaagcaatt gaagttttga aagccgctct gggcgcaaac ccagaggcag     6900

cagtgctgtg gaacacatta gggacagtgt tgtgcaacat cggcgatgcg gctggatcta     6960

tcgtcttctt tgatgaatca ctgcgcttag cccctgactt ctcgaaagct taccacaacc     7020

gcgcgttcgc ccgcttagat cttggggaga ttgaagcggc gttggcggac tgtgaagctg     7080

ccatgcgtag tcccggctca ccagaggacc tggcaatgat gcagttcgct cgcgccacaa     7140

ttttactggc acttgggcgc gttggcgaag ggtgggaagc gtacgagtca cgcttttccc     7200

ccgcattaag cgacgcacct cgtttccaaa ttccgggaac ccgttggtcg gggcaagacc     7260

ttgctggaaa gacactgatg atctcaactg aacaagggtt aggcgacgaa gtgatgtttg     7320

caggtatgtt acccgacatc ctggaacgtc ttggaccaga cgggtccctg tctctggcag     7380

ttgagcgccg cctgattccg ctgtttcagc gtagctttcc tggtattgaa gtgactgccc     7440

accgcacggt tgcatacgaa ggtcgtacct atcgtgctgc cccagagatt gaggattggg     7500

atcgcttcga ttattgggcc gcgattggag acttccttcc ctccttacgc gggagtgttg     7560

aagcattccc gcgtcgcgat cactacttga cgccagatcc ggagcgcgtt gctcattgga     7620

aggcggaact tgaaaagttg ggccctgccc ctaaagtagg gttattgtgg aagagtctga     7680

aattgggagc agaacgcgga cgtcagttct cccctttcga agcatggcgc gcagtacttc     7740

aaacccctgg cgcggttttc gtgaatctgc agtatggtga ttgtgacgaa gagatcgcct     7800

acgcaaagga gacgttcgga gttgaaattt ggcagccccc aggtattgat ttaaaaaagg     7860

atttggacga tgttgcagcg ttatgtgcag cagtcgactt gattatcggc ttctctaacg     7920

caaccattaa tttggcgggt gcagtaggtg caccgatttg gatgatgaca gctcctaagg     7980

catggactaa attaggaact gatcgttatc cctggtaccc gcaggcgcag gtattttcac     8040

cggcagattt tagcgactgg gagccagtca tggaagaagt agcccgtgca ttggcagcta     8100

aaattgcggg ttgatctaga actcgagatc agttctggac cagcgagctg tgctgcgact     8160

cgtggcgtaa tcatggtcat agctgtttcc tgtgtgaaat tgttatccgc tcacaattcc     8220

acacaacata cgagccggaa gcataaagtg taaagcctgg ggtgcctaat gagtgagcta     8280

actcacatta attgcgttgc gctcactgcc cgctttccag tcgggaaacc tgtcgtgcca     8340

gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg ggcgctcttc     8400

cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc     8460

tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat     8520

gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt     8580

ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg     8640

aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc     8700

tcctgttccg accctgtcgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt     8760

ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa     8820

gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta     8880

tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa     8940

caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa     9000

ctacggctac actagaagaa cagtatttgg tatctgcgct ctgctgaagc cagttacctt     9060

cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt     9120

ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat     9180

cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat     9240

gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc     9300

aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc     9360

acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc ccgtcgtgta     9420

gataactacg atacgggagg gcttaccatc tggccccagt gctgcaatga taccgcgaga     9480

cccacgctca ccggctccag atttatcagc aataaaccag ccagccggaa gggccgagcg     9540

cagaagtggt cctgcaactt tatccgcctc catccagtct attaattgtt gccgggaagc     9600

tagagtaagt agttcgccag ttaatagttt gcgcaacgtt gttgccattg ctacaggcat     9660

cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc aacgatcaag     9720

gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg gtcctccgat     9780

cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag cactgcataa     9840

ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt actcaaccaa     9900

gtcattctga gaatagtgta tgcggcgacc gagttgctct tgcccggcgt caatacggga     9960

taataccgcg ccacatagca gaactttaaa agtgctcatc attggaaaac gttcttcggg    10020

gcgaaaactc tcaaggatct taccgctgtt gagatccagt tcgatgtaac ccactcgtgc    10080

acccaactga tcttcagcat cttttacttt caccagcgtt tctgggtgag caaaaacagg    10140

aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa tactcatact    10200

ctaccttttt caatattatt gaagcattta tcagggttat tgtctcatga gcggatacat    10260

atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc cccgaaaagt    10320

gccacctgac gtctaagaaa ccattattat catgacatta acctataaaa ataggcgtat    10380

cacgaggccc tttcgtc                                                   10397


<210>  30
<211>  825
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic sequence

<400>  30
atggcgctga atagcatcaa taccaatgca ggggccatga tcgcgttaca aaacttaaac       60

gggacaaact ccgaactgac aacagttcag cagcgcatca ataccggcaa aaaaatcgcc      120

tctgccaaag acaacggggc tatttgggct accgccaaga atcagagtgc gaccgccgcg      180

agcatgaatg ctgtgaagga ttcattgcaa cgcggtcaga gcactattga cgtggcatta      240

gcggctggcg acacaatcac ggatctgtta ggcaaaatga aggaaaaagc tctggctgcc      300

agtgacacta gccttaatac cgcatccttc aatgcgttaa agagcgattt cgacagcctg      360

cgtgatcaga ttgagaaggc cgctacaaac gcaaagttta acggagttaa cctgcttgat      420

aatagtacgg gcacgggcgg gtataaagcg ttgtccaaca ctgctggatc aacgattaaa      480

gttgccgggg agaacttatc tttaggtatc gggttaacca ccacgtccac tttcacaact      540

gctgccgccg caaagactat gatcggcacc atcgacacag cattgcaaac tgctacaaat      600

aaattggctt ctctgggcac atcctcggtg gggttagaca cgcatcttac ctttgtaggg      660

aagctgcagg attctctgga cgccggtgta ggcaatctgg tagatgcgga cttagcaaag      720

gagagtgcca aattacagtc gttacagaca aagcaacaac ttggcgtgca agcactgtcc      780

atcgcaaatc agtcttcctc atctattctg tcattgttcc gctga                      825


<210>  31
<211>  274
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic sequence

<400>  31

Met Ala Leu Asn Ser Ile Asn Thr Asn Ala Gly Ala Met Ile Ala Leu 
1               5                   10                  15      


Gln Asn Leu Asn Gly Thr Asn Ser Glu Leu Thr Thr Val Gln Gln Arg 
            20                  25                  30          


Ile Asn Thr Gly Lys Lys Ile Ala Ser Ala Lys Asp Asn Gly Ala Ile 
        35                  40                  45              


Trp Ala Thr Ala Lys Asn Gln Ser Ala Thr Ala Ala Ser Met Asn Ala 
    50                  55                  60                  


Val Lys Asp Ser Leu Gln Arg Gly Gln Ser Thr Ile Asp Val Ala Leu 
65                  70                  75                  80  


Ala Ala Gly Asp Thr Ile Thr Asp Leu Leu Gly Lys Met Lys Glu Lys 
                85                  90                  95      


Ala Leu Ala Ala Ser Asp Thr Ser Leu Asn Thr Ala Ser Phe Asn Ala 
            100                 105                 110         


Leu Lys Ser Asp Phe Asp Ser Leu Arg Asp Gln Ile Glu Lys Ala Ala 
        115                 120                 125             


Thr Asn Ala Lys Phe Asn Gly Val Asn Leu Leu Asp Asn Ser Thr Gly 
    130                 135                 140                 


Thr Gly Gly Tyr Lys Ala Leu Ser Asn Thr Ala Gly Ser Thr Ile Lys 
145                 150                 155                 160 


Val Ala Gly Glu Asn Leu Ser Leu Gly Ile Gly Leu Thr Thr Thr Ser 
                165                 170                 175     


Thr Phe Thr Thr Ala Ala Ala Ala Lys Thr Met Ile Gly Thr Ile Asp 
            180                 185                 190         


Thr Ala Leu Gln Thr Ala Thr Asn Lys Leu Ala Ser Leu Gly Thr Ser 
        195                 200                 205             


Ser Val Gly Leu Asp Thr His Leu Thr Phe Val Gly Lys Leu Gln Asp 
    210                 215                 220                 


Ser Leu Asp Ala Gly Val Gly Asn Leu Val Asp Ala Asp Leu Ala Lys 
225                 230                 235                 240 


Glu Ser Ala Lys Leu Gln Ser Leu Gln Thr Lys Gln Gln Leu Gly Val 
                245                 250                 255     


Gln Ala Leu Ser Ile Ala Asn Gln Ser Ser Ser Ser Ile Leu Ser Leu 
            260                 265                 270         


Phe Arg 
        


<210>  32
<211>  831
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic sequence

<400>  32
atggccttga acagcgtgaa taccaattcg ggggcattag tagcacttca aaacctgcaa       60

tccacgaaca gtgagttagc gacagtacag agtcgtatta acacgggtaa aaaagtgaac      120

agcgccaaag acaacggtgc cgtttgggcc attgcacagg ggcaacgctc cgaggtcaat      180

gctcttggcg cagtcaagga ttccttagcg cgtgggtcga gcgctgtcga tgtctcgatt      240

gcggcaggag aaagcgtgtc tgatttactg ttacagttaa aagaaaaagc gttgagcgcc      300

accgataagt cgcttacgac ggcggcgcgc actgccttga acgaggattt taaggccatc      360

cgcgatcaga ttactacggt tgtaacaaat gccaagttca atggcgtctc tattgcagac      420

gggagtacca ccaagctgac gtttttggct aattcagatg gttcgggctt cactgtcaac      480

gctaaaacca tcagccttgc tgggggtacc aacgtaaccg tcgctactac aacgactatc      540

gggacgtcaa ccttggctac taccgccctg gggctggtaa acgcttcaat cgacaaagtc      600

tcggcctctt tagcccgtct gggtacggga gcgaaggctc tggacactca tagcacattt      660

gttggaaagt tatcggatgc gctggagaat ggcattggca acttagttga cgctgacctt      720

gcaaaagagt ctgctcgcct gcaaagttta cagacgaagc aacagttggg ggttcaggcg      780

ttgagcatcg ctaaccaatc gtcttctatt cttctgggtc ttttccgctg a               831


<210>  33
<211>  276
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic sequence

<400>  33

Met Ala Leu Asn Ser Val Asn Thr Asn Ser Gly Ala Leu Val Ala Leu 
1               5                   10                  15      


Gln Asn Leu Gln Ser Thr Asn Ser Glu Leu Ala Thr Val Gln Ser Arg 
            20                  25                  30          


Ile Asn Thr Gly Lys Lys Val Asn Ser Ala Lys Asp Asn Gly Ala Val 
        35                  40                  45              


Trp Ala Ile Ala Gln Gly Gln Arg Ser Glu Val Asn Ala Leu Gly Ala 
    50                  55                  60                  


Val Lys Asp Ser Leu Ala Arg Gly Ser Ser Ala Val Asp Val Ser Ile 
65                  70                  75                  80  


Ala Ala Gly Glu Ser Val Ser Asp Leu Leu Leu Gln Leu Lys Glu Lys 
                85                  90                  95      


Ala Leu Ser Ala Thr Asp Lys Ser Leu Thr Thr Ala Ala Arg Thr Ala 
            100                 105                 110         


Leu Asn Glu Asp Phe Lys Ala Ile Arg Asp Gln Ile Thr Thr Val Val 
        115                 120                 125             


Thr Asn Ala Lys Phe Asn Gly Val Ser Ile Ala Asp Gly Ser Thr Thr 
    130                 135                 140                 


Lys Leu Thr Phe Leu Ala Asn Ser Asp Gly Ser Gly Phe Thr Val Asn 
145                 150                 155                 160 


Ala Lys Thr Ile Ser Leu Ala Gly Gly Thr Asn Val Thr Val Ala Thr 
                165                 170                 175     


Thr Thr Thr Ile Gly Thr Ser Thr Leu Ala Thr Thr Ala Leu Gly Leu 
            180                 185                 190         


Val Asn Ala Ser Ile Asp Lys Val Ser Ala Ser Leu Ala Arg Leu Gly 
        195                 200                 205             


Thr Gly Ala Lys Ala Leu Asp Thr His Ser Thr Phe Val Gly Lys Leu 
    210                 215                 220                 


Ser Asp Ala Leu Glu Asn Gly Ile Gly Asn Leu Val Asp Ala Asp Leu 
225                 230                 235                 240 


Ala Lys Glu Ser Ala Arg Leu Gln Ser Leu Gln Thr Lys Gln Gln Leu 
                245                 250                 255     


Gly Val Gln Ala Leu Ser Ile Ala Asn Gln Ser Ser Ser Ile Leu Leu 
            260                 265                 270         


Gly Leu Phe Arg 
        275     


