                         SEQUENCE LISTING

<110>  DANISCO US, Inc.
       De Lange, Dennis
       Frisch, Ryan L.
       Masson, Helen Olivia
 
<120>  Modified 5'-Untranslated Region (UTR) Sequences For Increased 
       Protein Production in Bacillus Cells

<130>  NB41250-WO-PCT

<150>  US 62/558304
<151>  2017-09-13

<160>  21    

<170>  PatentIn version 3.5

<210>  1
<211>  58
<212>  DNA
<213>  Bacillus subtilis

<400>  1
acagaatagt cttttaagta agtctactct gaattttttt aaaaggagag ggtaaaga         58

<210>  2
<211>  57
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Modified 5'-UTR

<400>  2
acagaatagt cttttaagta agtctactct gaattttttt aaaaggagag ggtaaag          57

<210>  3
<211>  579
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  comK

<400>  3
atgagcacag aggatatgac aaaggatacg tatgaagtaa acagttcgac aatggctgtc       60
ctgcctctgg gtgaggggga gaaatccgcc tcaaaaatac ttgagaccga caggactttc      120
cgcgtcaata tgaagccgtt tcaaattatc gaaagaagct gccgctattt cggatcgagc      180
tatgcgggaa gaaaagcggg cacatatgaa gtcattaaag tttcccataa accgccgatc      240
atggtggatc actcaaacaa catttttctt ttccccacat tttcctcaac tcgtcctcag      300
tgcgggtggc tttcccatgc gcatgttcac gagttttgcg cggcaaagta tgacaacacg      360
tttgtcacgt ttgtcaacgg ggaaacgctg gagctgcccg tatccatctc atctttcgaa      420
aaccaggttt accgaacggc atggctgaga acaaaattta tcgacaggat tgaaggaaac      480
cccatgcaga agaaacagga atttatgctc tatccgaaag aagaccggaa tcagctgata      540
tacgaattca tcctcaggga gctgaaaaag cgctattga                             579

<210>  4
<211>  6208
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic WT-5'-UTR construct

<400>  4
ggctatgacg aaacgattgc agcctatgag gcgagtctcg aaaagctcgg acttgactac       60
cttgatttat acctgatcca ctggcctgtt gaaggacgct acaaagcggc gtggaaagcg      120
cttgaaacac tttatgaaca aggacgcgta aaagcaatcg gagtgagcaa ttttcagatt      180
caccatctgg aagacttgct gaaagatgcc gccgtcaaac cggcgatcaa ccaggttgag      240
tatcatccgc ggctgacgca gaaagagctg caagcgtttt gccgtgcgca cggcatccag      300
ctgcaagcat ggtcgccgct gatgcaaggc caattgctca gccatccact gctgaaagat      360
atcgcggaca agtacggcaa gacaccggcc caagtcattt tgcgctggga tttgcaaaac      420
ggggtcgtta cgattccgaa gtcgactaaa gcggagcgga ttgcccaaaa cgcggacata      480
tttgattttg aactgaccac cgaggaaatg aagcaaattg acgcgctgaa tgaaaacacc      540
cgtgtcggcc ctgatcccga taactttgac ttttaacaaa acggccccgt tcgacattcg      600
aacggggctt taattgaatt gtgcggttac accgccggac tccatcatca tcagttcttt      660
tttcatatcc aatccgcccc ggtatcccgt gagctgcccg cttttaccga taacccgatg      720
gcaaggcacc accattaaca gcggatttgc gccgatcgcc gcgcctactg cccgcacagc      780
ggcctgcttt tcaatatgct cggcgatatc ggaataggag caagtgctgc cgtaagggat      840
ttcggagagc gccttccaca ctgccagctg aaaaggcgtg ccggcaaggt cgacaggaaa      900
gctgaaatga gttcgcttgc cgttcaaata cgcctgcagc tgctcggcgt attctgccaa      960
tcctttgtca tcccgaatga aaactggctg tgtaaatctt ttttcagccc aagcggccaa     1020
atcctcgaag ccttgattcc atccccctgt aaaacagagc ccgcgggcag tcgccccaat     1080
gtgaatctgc caacctcggc aaataagcgt acgccagtat acgatttgat cgtccatatg     1140
tttacctccg tttcatttgc cggtacgacg tcggcgattg cccagtcttc tttttaaaca     1200
aagaggcaaa atattccgca ttcgcaatgc ctaccattga agcgatttct gcgatcgatc     1260
gttctgaatg agcaagcaaa tcgaccgctt tctcaatcct tttctgcagg atgtattctg     1320
ccggcgagac gcctttgatt cgtttaaatg tccgctgcag gtgaaaaggg ctgatatggc     1380
acctgtcagc caaagcttgc agagacagcg gatcgcgata agattcctcg atgatttcca     1440
ccacacgctg tgccagctct tcatccggca gcagcgcccc ggccggattg cagcgtttgc     1500
aggggcggta cccttctgat aaagcatctt ttgcattgaa aaagatctgc acattgtcga     1560
tttgcggaac tctcgatttg caggaagggc ggcaaaatat gccggtcgtt ttgaccgcgt     1620
aataaaaaac tccgtcatag gcggaatcgt tttccgtaat cgcccgccac atttcaggcg     1680
tcaatcgtga tttgctgttc atatcttcac cccgatctat gtcagtataa cctatatgac     1740
agccggaggt ggagaggcgg agaacggcac agcaagaaga caaagaagaa gagagactgt     1800
tgcctggacc tccgaaacgc gctacaattc atttacaaca caggatgggg tgagaatatt     1860
gccggaatca gtgaagcagg cctcctaaaa taaaaatcta tattttagga ggtaaaacat     1920
gaattttcaa acaatcgagc ttgacacatg gtatagaaaa tcttattttg accattacat     1980
gaaggaagcg aaatgttctt tcagcatcac ggcaaacgtc aatgtgacaa atttgctcgc     2040
cgtgctcaag aaaaagaagc tcaagctgta tccggctttt atttatatcg tatcaagggt     2100
cattcattcg cgccctgagt ttagaacaac gtttgatgac aaaggacagc tgggttattg     2160
ggaacaaatg catccgtgct atgcgatttt tcatcaggac gaccaaacgt tttccgccct     2220
ctggacggaa tactcagacg atttttcgca gttttatcat caatatcttc tggacgccga     2280
gcgctttgga gacaaaaggg gcctttgggc taagccggac atcccgccca atacgttttc     2340
agtttcttct attccatggg tgcgcttttc aaacttcaat ttaaaccttg ataacagcga     2400
acacttgctg ccgattatta caaacgggaa atacttttca gaaggcaggg aaacattttt     2460
gcccgtttcc ttgcaagttc accatgcagt gtgtgacggc tatcatgccg gcgcttttat     2520
aaacgagttg gaacggcttg ccgccgattg tgaggagtgg cttgtgtgac agaggaaagg     2580
ccgatatgat tcggcctttt ttatatgtac ttcttagcgg gtctcttaac ccccctcgag     2640
gtcgctgata aacagctgac atcaatatcc tattttttca aaaaatattt taaaaagttg     2700
ttgacttaaa agaagctaaa tgttatagta ataaaacaga atagtctttt aagtaagtct     2760
actctgaatt tttttaaaag gagagggtaa agaatgaaac aacaaaaacg gctttacgcc     2820
cgattgctga cgctgttatt tgcgctcatc ttcttgctgc ctcattctgc agctagcgca     2880
gccgcaccgt ttaacggtac catgatgcag tattttgaat ggtacttgcc ggatgatggc     2940
acgttatgga ccaaagtggc caatgaagcc aacaacttat ccagccttgg catcaccgct     3000
ctttggctgc cgcccgctta caaaggaaca agccgcagcg acgtagggta cggagtatac     3060
gacttgtatg acctcggcga attcaatcaa aaagggaccg tccgcacaaa atatggaaca     3120
aaagctcaat atcttcaagc cattcaagcc gcccacgccg ctggaatgca agtgtacgcc     3180
gatgtcgtgt tcgaccataa aggcggcgct gacggcacgg aatgggtgga cgccgtcgaa     3240
gtcaatccgt ccgaccgcaa ccaagaaatc tcgggcacct atcaaatcca agcatggacg     3300
aaatttgatt ttcccgggcg gggcaacacc tactccagct ttaagtggcg ctggtaccat     3360
tttgacggcg ttgattggga cgaaagccga aaattaagcc gcatttacaa attcaggggc     3420
atcggcaaag cgtgggattg gccggtagac acagaaaacg gaaactatga ctacttaatg     3480
tatgccgacc ttgatatgga tcatcccgaa gtcgtgaccg agctgaaaaa ctgggggaaa     3540
tggtatgtca acacaacgaa cattgatggg ttccggcttg atgccgtcaa gcatattaag     3600
ttcagttttt ttcctgattg gttgtcgtat gtgcgttctc agactggcaa gccgctattt     3660
accgtcgggg aatattggag ctatgacatc aacaagttgc acaattacat tacgaaaaca     3720
aacggaacga tgtctttgtt tgatgccccg ttacacaaca aattttatac cgcttccaaa     3780
tcagggggcg catttgatat gcgcacgtta atgaccaata ctctcatgaa agatcaaccg     3840
acattggccg tcaccttcgt tgataatcat gacaccgaac ccggccaagc gcttcagtca     3900
tgggtcgacc catggttcaa accgttggct tacgccttta ttctaactcg gcaggaagga     3960
tacccgtgcg tcttttatgg tgactattat ggcattccac aatataacat tccttcgctg     4020
aaaagcaaaa tcgatccgct cctcatcgcg cgcagggatt atgcttacgg aacgcaacat     4080
gattatcttg atcactccga catcatcggg tggacaaggg aaggggtcac tgaaaaacca     4140
ggatccgggc tggccgcact gatcaccgat gggccgggag gaagcaaatg gatgtacgtt     4200
ggcaaacaac acgctggaaa agtgttctat gaccttaccg gcaaccggag tgacaccgtc     4260
accatcaaca gtgatggatg gggggaattc aaagtcaatg gcggttcggt ttcggtttgg     4320
gttcctagaa aaacgaccta aaagcttctc gaggttaaca gaggacggat ttcctgaagg     4380
aaatccgttt ttttattttt aacatctctc actgctgtgt gattttactc acggcatttg     4440
gaacgccggc tctcaacaaa ctttctgtag tgaaaatcat gaaccaaacg gatcgtcggc     4500
ctgattaaca gctgaaagct gccgatcaca aacatccata gtcccgccgg cttcagttcc     4560
tcggagaaaa agcagaagct cccgacaagg aataaaaggc cgatgagaaa atcgtttaat     4620
gtatgtagaa ctttgtatct ttttttgaaa aagagttcat atcgattgtt attgttttgc     4680
ggcattgctt gatcactcca atccttttat ttaccctgcc ggaagccgga gtgaaacgcc     4740
ggtatacata ggatttatga attaggaaaa catatgggga aataaaccat ccaggagtga     4800
aaaatatgcg gttattcata tgtgcatcgt gcctgttcgg cttgattgtt ccgtcatttg     4860
aaacgaaagc gctgacgttt gaagaattgc cggttaaaca agcttcaaaa caatgggaag     4920
ttcaaatcgg taaagccgaa gccggaaacg gaatggcgaa accggaaaaa ggagcgtttc     4980
atacttatgc tgtcgaaatc aaaaacattg gacacgatgt ggcttcggcg gaaatttttg     5040
tctatcggaa cgagcctaat tcttcaacga aattttcgct ttggaacatt cctcacgaaa     5100
atccggtttc tttagccaaa agcttaaatc acggaagctc tgtcaagcac cgcaatctgc     5160
ttatggcaga gaatgcgacc gaattggaag tggacatgat ttggacggaa aaaggaagcg     5220
aaggcagact tttaaaggaa acgttcattt tcaagggaga tgaatcatga agaaaaaatg     5280
gccgttcatc gtcaacggtc tttttttaat gacttaggca gccgatcgtt cggccatacg     5340
atatcgaagc gacctcgaac cagcagagct cgtcacaaaa catttgcatt taaagaaaaa     5400
tacaggatgt tttcaccaat atttttctca atgatgatac actattgaca agctgctact     5460
ttgggagggt gtttccatag atgccgatga agcaaaaaca ccaaatgtgt catgagagct     5520
ctctctaatc gatataaaag tagggtgaac cggggttgtc aatctgtaaa agatcttttt     5580
ttatcccgtg atacgctttt ggaattctga atcttcaaga aagtccccag ccttttgctg     5640
atcaatcgag aacaaaggat gatacatatg aaaagaatag ataaaatcta ccatcagctg     5700
ctggataatt ttcgcgaaaa gaatatcaat cagcttttaa agatacaagg gaattcggct     5760
aaagaaatcg ccgggcagct gcaaatggag cgttccaatg tcagctttga attaaacaat     5820
ctggttcggg ccaaaaaggt gatcaagatt aaaacgttcc ccgtccgcta catcccggtg     5880
gaaattgttg aaaacgtctt gaacatcaaa tggaattcag agttgatgga ggttgaagaa     5940
ctgaggcggc tggctgacgg ccaaaaaaag ccggcgcgca atatatccgc cgatcccctc     6000
gagctcatga tcggggctaa agggagcttg aaaaaggcaa tttctcaggc gaaagcggca     6060
gtcttttatc ctccgcacgg cttgcatatg ctgctgctcg ggccgacggg ttcggggaaa     6120
tcgctgtttg cgaatcggat ctaccagttc gccgtttatt ctgacatatt gaagcccgat     6180
tccccgttca tcacattcaa ctgtgcag                                        6208

<210>  5
<211>  6207
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  synthetic mod-5-UTR construct

<400>  5
ggctatgacg aaacgattgc agcctatgag gcgagtctcg aaaagctcgg acttgactac       60
cttgatttat acctgatcca ctggcctgtt gaaggacgct acaaagcggc gtggaaagcg      120
cttgaaacac tttatgaaca aggacgcgta aaagcaatcg gagtgagcaa ttttcagatt      180
caccatctgg aagacttgct gaaagatgcc gccgtcaaac cggcgatcaa ccaggttgag      240
tatcatccgc ggctgacgca gaaagagctg caagcgtttt gccgtgcgca cggcatccag      300
ctgcaagcat ggtcgccgct gatgcaaggc caattgctca gccatccact gctgaaagat      360
atcgcggaca agtacggcaa gacaccggcc caagtcattt tgcgctggga tttgcaaaac      420
ggggtcgtta cgattccgaa gtcgactaaa gcggagcgga ttgcccaaaa cgcggacata      480
tttgattttg aactgaccac cgaggaaatg aagcaaattg acgcgctgaa tgaaaacacc      540
cgtgtcggcc ctgatcccga taactttgac ttttaacaaa acggccccgt tcgacattcg      600
aacggggctt taattgaatt gtgcggttac accgccggac tccatcatca tcagttcttt      660
tttcatatcc aatccgcccc ggtatcccgt gagctgcccg cttttaccga taacccgatg      720
gcaaggcacc accattaaca gcggatttgc gccgatcgcc gcgcctactg cccgcacagc      780
ggcctgcttt tcaatatgct cggcgatatc ggaataggag caagtgctgc cgtaagggat      840
ttcggagagc gccttccaca ctgccagctg aaaaggcgtg ccggcaaggt cgacaggaaa      900
gctgaaatga gttcgcttgc cgttcaaata cgcctgcagc tgctcggcgt attctgccaa      960
tcctttgtca tcccgaatga aaactggctg tgtaaatctt ttttcagccc aagcggccaa     1020
atcctcgaag ccttgattcc atccccctgt aaaacagagc ccgcgggcag tcgccccaat     1080
gtgaatctgc caacctcggc aaataagcgt acgccagtat acgatttgat cgtccatatg     1140
tttacctccg tttcatttgc cggtacgacg tcggcgattg cccagtcttc tttttaaaca     1200
aagaggcaaa atattccgca ttcgcaatgc ctaccattga agcgatttct gcgatcgatc     1260
gttctgaatg agcaagcaaa tcgaccgctt tctcaatcct tttctgcagg atgtattctg     1320
ccggcgagac gcctttgatt cgtttaaatg tccgctgcag gtgaaaaggg ctgatatggc     1380
acctgtcagc caaagcttgc agagacagcg gatcgcgata agattcctcg atgatttcca     1440
ccacacgctg tgccagctct tcatccggca gcagcgcccc ggccggattg cagcgtttgc     1500
aggggcggta cccttctgat aaagcatctt ttgcattgaa aaagatctgc acattgtcga     1560
tttgcggaac tctcgatttg caggaagggc ggcaaaatat gccggtcgtt ttgaccgcgt     1620
aataaaaaac tccgtcatag gcggaatcgt tttccgtaat cgcccgccac atttcaggcg     1680
tcaatcgtga tttgctgttc atatcttcac cccgatctat gtcagtataa cctatatgac     1740
agccggaggt ggagaggcgg agaacggcac agcaagaaga caaagaagaa gagagactgt     1800
tgcctggacc tccgaaacgc gctacaattc atttacaaca caggatgggg tgagaatatt     1860
gccggaatca gtgaagcagg cctcctaaaa taaaaatcta tattttagga ggtaaaacat     1920
gaattttcaa acaatcgagc ttgacacatg gtatagaaaa tcttattttg accattacat     1980
gaaggaagcg aaatgttctt tcagcatcac ggcaaacgtc aatgtgacaa atttgctcgc     2040
cgtgctcaag aaaaagaagc tcaagctgta tccggctttt atttatatcg tatcaagggt     2100
cattcattcg cgccctgagt ttagaacaac gtttgatgac aaaggacagc tgggttattg     2160
ggaacaaatg catccgtgct atgcgatttt tcatcaggac gaccaaacgt tttccgccct     2220
ctggacggaa tactcagacg atttttcgca gttttatcat caatatcttc tggacgccga     2280
gcgctttgga gacaaaaggg gcctttgggc taagccggac atcccgccca atacgttttc     2340
agtttcttct attccatggg tgcgcttttc aaacttcaat ttaaaccttg ataacagcga     2400
acacttgctg ccgattatta caaacgggaa atacttttca gaaggcaggg aaacattttt     2460
gcccgtttcc ttgcaagttc accatgcagt gtgtgacggc tatcatgccg gcgcttttat     2520
aaacgagttg gaacggcttg ccgccgattg tgaggagtgg cttgtgtgac agaggaaagg     2580
ccgatatgat tcggcctttt ttatatgtac ttcttagcgg gtctcttaac ccccctcgag     2640
gtcgctgata aacagctgac atcaatatcc tattttttca aaaaatattt taaaaagttg     2700
ttgacttaaa agaagctaaa tgttatagta ataaaacaga atagtctttt aagtaagtct     2760
actctgaatt tttttaaaag gagagggtaa agatgaaaca acaaaaacgg ctttacgccc     2820
gattgctgac gctgttattt gcgctcatct tcttgctgcc tcattctgca gctagcgcag     2880
ccgcaccgtt taacggtacc atgatgcagt attttgaatg gtacttgccg gatgatggca     2940
cgttatggac caaagtggcc aatgaagcca acaacttatc cagccttggc atcaccgctc     3000
tttggctgcc gcccgcttac aaaggaacaa gccgcagcga cgtagggtac ggagtatacg     3060
acttgtatga cctcggcgaa ttcaatcaaa aagggaccgt ccgcacaaaa tatggaacaa     3120
aagctcaata tcttcaagcc attcaagccg cccacgccgc tggaatgcaa gtgtacgccg     3180
atgtcgtgtt cgaccataaa ggcggcgctg acggcacgga atgggtggac gccgtcgaag     3240
tcaatccgtc cgaccgcaac caagaaatct cgggcaccta tcaaatccaa gcatggacga     3300
aatttgattt tcccgggcgg ggcaacacct actccagctt taagtggcgc tggtaccatt     3360
ttgacggcgt tgattgggac gaaagccgaa aattaagccg catttacaaa ttcaggggca     3420
tcggcaaagc gtgggattgg ccggtagaca cagaaaacgg aaactatgac tacttaatgt     3480
atgccgacct tgatatggat catcccgaag tcgtgaccga gctgaaaaac tgggggaaat     3540
ggtatgtcaa cacaacgaac attgatgggt tccggcttga tgccgtcaag catattaagt     3600
tcagtttttt tcctgattgg ttgtcgtatg tgcgttctca gactggcaag ccgctattta     3660
ccgtcgggga atattggagc tatgacatca acaagttgca caattacatt acgaaaacaa     3720
acggaacgat gtctttgttt gatgccccgt tacacaacaa attttatacc gcttccaaat     3780
cagggggcgc atttgatatg cgcacgttaa tgaccaatac tctcatgaaa gatcaaccga     3840
cattggccgt caccttcgtt gataatcatg acaccgaacc cggccaagcg cttcagtcat     3900
gggtcgaccc atggttcaaa ccgttggctt acgcctttat tctaactcgg caggaaggat     3960
acccgtgcgt cttttatggt gactattatg gcattccaca atataacatt ccttcgctga     4020
aaagcaaaat cgatccgctc ctcatcgcgc gcagggatta tgcttacgga acgcaacatg     4080
attatcttga tcactccgac atcatcgggt ggacaaggga aggggtcact gaaaaaccag     4140
gatccgggct ggccgcactg atcaccgatg ggccgggagg aagcaaatgg atgtacgttg     4200
gcaaacaaca cgctggaaaa gtgttctatg accttaccgg caaccggagt gacaccgtca     4260
ccatcaacag tgatggatgg ggggaattca aagtcaatgg cggttcggtt tcggtttggg     4320
ttcctagaaa aacgacctaa aagcttctcg aggttaacag aggacggatt tcctgaagga     4380
aatccgtttt tttattttta acatctctca ctgctgtgtg attttactca cggcatttgg     4440
aacgccggct ctcaacaaac tttctgtagt gaaaatcatg aaccaaacgg atcgtcggcc     4500
tgattaacag ctgaaagctg ccgatcacaa acatccatag tcccgccggc ttcagttcct     4560
cggagaaaaa gcagaagctc ccgacaagga ataaaaggcc gatgagaaaa tcgtttaatg     4620
tatgtagaac tttgtatctt tttttgaaaa agagttcata tcgattgtta ttgttttgcg     4680
gcattgcttg atcactccaa tccttttatt taccctgccg gaagccggag tgaaacgccg     4740
gtatacatag gatttatgaa ttaggaaaac atatggggaa ataaaccatc caggagtgaa     4800
aaatatgcgg ttattcatat gtgcatcgtg cctgttcggc ttgattgttc cgtcatttga     4860
aacgaaagcg ctgacgtttg aagaattgcc ggttaaacaa gcttcaaaac aatgggaagt     4920
tcaaatcggt aaagccgaag ccggaaacgg aatggcgaaa ccggaaaaag gagcgtttca     4980
tacttatgct gtcgaaatca aaaacattgg acacgatgtg gcttcggcgg aaatttttgt     5040
ctatcggaac gagcctaatt cttcaacgaa attttcgctt tggaacattc ctcacgaaaa     5100
tccggtttct ttagccaaaa gcttaaatca cggaagctct gtcaagcacc gcaatctgct     5160
tatggcagag aatgcgaccg aattggaagt ggacatgatt tggacggaaa aaggaagcga     5220
aggcagactt ttaaaggaaa cgttcatttt caagggagat gaatcatgaa gaaaaaatgg     5280
ccgttcatcg tcaacggtct ttttttaatg acttaggcag ccgatcgttc ggccatacga     5340
tatcgaagcg acctcgaacc agcagagctc gtcacaaaac atttgcattt aaagaaaaat     5400
acaggatgtt ttcaccaata tttttctcaa tgatgataca ctattgacaa gctgctactt     5460
tgggagggtg tttccataga tgccgatgaa gcaaaaacac caaatgtgtc atgagagctc     5520
tctctaatcg atataaaagt agggtgaacc ggggttgtca atctgtaaaa gatctttttt     5580
tatcccgtga tacgcttttg gaattctgaa tcttcaagaa agtccccagc cttttgctga     5640
tcaatcgaga acaaaggatg atacatatga aaagaataga taaaatctac catcagctgc     5700
tggataattt tcgcgaaaag aatatcaatc agcttttaaa gatacaaggg aattcggcta     5760
aagaaatcgc cgggcagctg caaatggagc gttccaatgt cagctttgaa ttaaacaatc     5820
tggttcgggc caaaaaggtg atcaagatta aaacgttccc cgtccgctac atcccggtgg     5880
aaattgttga aaacgtcttg aacatcaaat ggaattcaga gttgatggag gttgaagaac     5940
tgaggcggct ggctgacggc caaaaaaagc cggcgcgcaa tatatccgcc gatcccctcg     6000
agctcatgat cggggctaaa gggagcttga aaaaggcaat ttctcaggcg aaagcggcag     6060
tcttttatcc tccgcacggc ttgcatatgc tgctgctcgg gccgacgggt tcggggaaat     6120
cgctgtttgc gaatcggatc taccagttcg ccgtttattc tgacatattg aagcccgatt     6180
ccccgttcat cacattcaac tgtgcag                                         6207

<210>  6
<211>  1702
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  5'-HR to catH locus

<400>  6
ggctatgacg aaacgattgc agcctatgag gcgagtctcg aaaagctcgg acttgactac       60
cttgatttat acctgatcca ctggcctgtt gaaggacgct acaaagcggc gtggaaagcg      120
cttgaaacac tttatgaaca aggacgcgta aaagcaatcg gagtgagcaa ttttcagatt      180
caccatctgg aagacttgct gaaagatgcc gccgtcaaac cggcgatcaa ccaggttgag      240
tatcatccgc ggctgacgca gaaagagctg caagcgtttt gccgtgcgca cggcatccag      300
ctgcaagcat ggtcgccgct gatgcaaggc caattgctca gccatccact gctgaaagat      360
atcgcggaca agtacggcaa gacaccggcc caagtcattt tgcgctggga tttgcaaaac      420
ggggtcgtta cgattccgaa gtcgactaaa gcggagcgga ttgcccaaaa cgcggacata      480
tttgattttg aactgaccac cgaggaaatg aagcaaattg acgcgctgaa tgaaaacacc      540
cgtgtcggcc ctgatcccga taactttgac ttttaacaaa acggccccgt tcgacattcg      600
aacggggctt taattgaatt gtgcggttac accgccggac tccatcatca tcagttcttt      660
tttcatatcc aatccgcccc ggtatcccgt gagctgcccg cttttaccga taacccgatg      720
gcaaggcacc accattaaca gcggatttgc gccgatcgcc gcgcctactg cccgcacagc      780
ggcctgcttt tcaatatgct cggcgatatc ggaataggag caagtgctgc cgtaagggat      840
ttcggagagc gccttccaca ctgccagctg aaaaggcgtg ccggcaaggt cgacaggaaa      900
gctgaaatga gttcgcttgc cgttcaaata cgcctgcagc tgctcggcgt attctgccaa      960
tcctttgtca tcccgaatga aaactggctg tgtaaatctt ttttcagccc aagcggccaa     1020
atcctcgaag ccttgattcc atccccctgt aaaacagagc ccgcgggcag tcgccccaat     1080
gtgaatctgc caacctcggc aaataagcgt acgccagtat acgatttgat cgtccatatg     1140
tttacctccg tttcatttgc cggtacgacg tcggcgattg cccagtcttc tttttaaaca     1200
aagaggcaaa atattccgca ttcgcaatgc ctaccattga agcgatttct gcgatcgatc     1260
gttctgaatg agcaagcaaa tcgaccgctt tctcaatcct tttctgcagg atgtattctg     1320
ccggcgagac gcctttgatt cgtttaaatg tccgctgcag gtgaaaaggg ctgatatggc     1380
acctgtcagc caaagcttgc agagacagcg gatcgcgata agattcctcg atgatttcca     1440
ccacacgctg tgccagctct tcatccggca gcagcgcccc ggccggattg cagcgtttgc     1500
aggggcggta cccttctgat aaagcatctt ttgcattgaa aaagatctgc acattgtcga     1560
tttgcggaac tctcgatttg caggaagggc ggcaaaatat gccggtcgtt ttgaccgcgt     1620
aataaaaaac tccgtcatag gcggaatcgt tttccgtaat cgcccgccac atttcaggcg     1680
tcaatcgtga tttgctgttc at                                              1702

<210>  7
<211>  938
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  catH gene

<400>  7
atcttcaccc cgatctatgt cagtataacc tatatgacag ccggaggtgg agaggcggag       60
aacggcacag caagaagaca aagaagaaga gagactgttg cctggacctc cgaaacgcgc      120
tacaattcat ttacaacaca ggatggggtg agaatattgc cggaatcagt gaagcaggcc      180
tcctaaaata aaaatctata ttttaggagg taaaacatga attttcaaac aatcgagctt      240
gacacatggt atagaaaatc ttattttgac cattacatga aggaagcgaa atgttctttc      300
agcatcacgg caaacgtcaa tgtgacaaat ttgctcgccg tgctcaagaa aaagaagctc      360
aagctgtatc cggcttttat ttatatcgta tcaagggtca ttcattcgcg ccctgagttt      420
agaacaacgt ttgatgacaa aggacagctg ggttattggg aacaaatgca tccgtgctat      480
gcgatttttc atcaggacga ccaaacgttt tccgccctct ggacggaata ctcagacgat      540
ttttcgcagt tttatcatca atatcttctg gacgccgagc gctttggaga caaaaggggc      600
ctttgggcta agccggacat cccgcccaat acgttttcag tttcttctat tccatgggtg      660
cgcttttcaa acttcaattt aaaccttgat aacagcgaac acttgctgcc gattattaca      720
aacgggaaat acttttcaga aggcagggaa acatttttgc ccgtttcctt gcaagttcac      780
catgcagtgt gtgacggcta tcatgccggc gcttttataa acgagttgga acggcttgcc      840
gccgattgtg aggagtggct tgtgtgacag aggaaaggcc gatatgattc ggcctttttt      900
atatgtactt cttagcgggt ctcttaaccc ccctcgag                              938

<210>  8
<211>  95
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  spoVGrrnIp promoter

<400>  8
gtcgctgata aacagctgac atcaatatcc tattttttca aaaaatattt taaaaagttg       60
ttgacttaaa agaagctaaa tgttatagta ataaa                                  95

<210>  9
<211>  87
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  amylase signal sequence

<400>  9
atgaaacaac aaaaacggct ttacgcccga ttgctgacgc tgttatttgc gctcatcttc       60
ttgctgcctc attctgcagc tagcgca                                           87

<210>  10
<211>  1461
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  variant amylase

<400>  10
gccgcaccgt ttaacggtac catgatgcag tattttgaat ggtacttgcc ggatgatggc       60
acgttatgga ccaaagtggc caatgaagcc aacaacttat ccagccttgg catcaccgct      120
ctttggctgc cgcccgctta caaaggaaca agccgcagcg acgtagggta cggagtatac      180
gacttgtatg acctcggcga attcaatcaa aaagggaccg tccgcacaaa atatggaaca      240
aaagctcaat atcttcaagc cattcaagcc gcccacgccg ctggaatgca agtgtacgcc      300
gatgtcgtgt tcgaccataa aggcggcgct gacggcacgg aatgggtgga cgccgtcgaa      360
gtcaatccgt ccgaccgcaa ccaagaaatc tcgggcacct atcaaatcca agcatggacg      420
aaatttgatt ttcccgggcg gggcaacacc tactccagct ttaagtggcg ctggtaccat      480
tttgacggcg ttgattggga cgaaagccga aaattaagcc gcatttacaa attcaggggc      540
atcggcaaag cgtgggattg gccggtagac acagaaaacg gaaactatga ctacttaatg      600
tatgccgacc ttgatatgga tcatcccgaa gtcgtgaccg agctgaaaaa ctgggggaaa      660
tggtatgtca acacaacgaa cattgatggg ttccggcttg atgccgtcaa gcatattaag      720
ttcagttttt ttcctgattg gttgtcgtat gtgcgttctc agactggcaa gccgctattt      780
accgtcgggg aatattggag ctatgacatc aacaagttgc acaattacat tacgaaaaca      840
aacggaacga tgtctttgtt tgatgccccg ttacacaaca aattttatac cgcttccaaa      900
tcagggggcg catttgatat gcgcacgtta atgaccaata ctctcatgaa agatcaaccg      960
acattggccg tcaccttcgt tgataatcat gacaccgaac ccggccaagc gcttcagtca     1020
tgggtcgacc catggttcaa accgttggct tacgccttta ttctaactcg gcaggaagga     1080
tacccgtgcg tcttttatgg tgactattat ggcattccac aatataacat tccttcgctg     1140
aaaagcaaaa tcgatccgct cctcatcgcg cgcagggatt atgcttacgg aacgcaacat     1200
gattatcttg atcactccga catcatcggg tggacaaggg aaggggtcac tgaaaaacca     1260
ggatccgggc tggccgcact gatcaccgat gggccgggag gaagcaaatg gatgtacgtt     1320
ggcaaacaac acgctggaaa agtgttctat gaccttaccg gcaaccggag tgacaccgtc     1380
accatcaaca gtgatggatg gggggaattc aaagtcaatg gcggttcggt ttcggtttgg     1440
gttcctagaa aaacgaccta a                                               1461

<210>  11
<211>  58
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  amylase terminator sequence

<400>  11
aagcttctcg aggttaacag aggacggatt tcctgaagga aatccgtttt tttatttt         58

<210>  12
<211>  1809
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  3'-HR to CatH locus

<400>  12
taacatctct cactgctgtg tgattttact cacggcattt ggaacgccgg ctctcaacaa       60
actttctgta gtgaaaatca tgaaccaaac ggatcgtcgg cctgattaac agctgaaagc      120
tgccgatcac aaacatccat agtcccgccg gcttcagttc ctcggagaaa aagcagaagc      180
tcccgacaag gaataaaagg ccgatgagaa aatcgtttaa tgtatgtaga actttgtatc      240
tttttttgaa aaagagttca tatcgattgt tattgttttg cggcattgct tgatcactcc      300
aatcctttta tttaccctgc cggaagccgg agtgaaacgc cggtatacat aggatttatg      360
aattaggaaa acatatgggg aaataaacca tccaggagtg aaaaatatgc ggttattcat      420
atgtgcatcg tgcctgttcg gcttgattgt tccgtcattt gaaacgaaag cgctgacgtt      480
tgaagaattg ccggttaaac aagcttcaaa acaatgggaa gttcaaatcg gtaaagccga      540
agccggaaac ggaatggcga aaccggaaaa aggagcgttt catacttatg ctgtcgaaat      600
caaaaacatt ggacacgatg tggcttcggc ggaaattttt gtctatcgga acgagcctaa      660
ttcttcaacg aaattttcgc tttggaacat tcctcacgaa aatccggttt ctttagccaa      720
aagcttaaat cacggaagct ctgtcaagca ccgcaatctg cttatggcag agaatgcgac      780
cgaattggaa gtggacatga tttggacgga aaaaggaagc gaaggcagac ttttaaagga      840
aacgttcatt ttcaagggag atgaatcatg aagaaaaaat ggccgttcat cgtcaacggt      900
ctttttttaa tgacttaggc agccgatcgt tcggccatac gatatcgaag cgacctcgaa      960
ccagcagagc tcgtcacaaa acatttgcat ttaaagaaaa atacaggatg ttttcaccaa     1020
tatttttctc aatgatgata cactattgac aagctgctac tttgggaggg tgtttccata     1080
gatgccgatg aagcaaaaac accaaatgtg tcatgagagc tctctctaat cgatataaaa     1140
gtagggtgaa ccggggttgt caatctgtaa aagatctttt tttatcccgt gatacgcttt     1200
tggaattctg aatcttcaag aaagtcccca gccttttgct gatcaatcga gaacaaagga     1260
tgatacatat gaaaagaata gataaaatct accatcagct gctggataat tttcgcgaaa     1320
agaatatcaa tcagctttta aagatacaag ggaattcggc taaagaaatc gccgggcagc     1380
tgcaaatgga gcgttccaat gtcagctttg aattaaacaa tctggttcgg gccaaaaagg     1440
tgatcaagat taaaacgttc cccgtccgct acatcccggt ggaaattgtt gaaaacgtct     1500
tgaacatcaa atggaattca gagttgatgg aggttgaaga actgaggcgg ctggctgacg     1560
gccaaaaaaa gccggcgcgc aatatatccg ccgatcccct cgagctcatg atcggggcta     1620
aagggagctt gaaaaaggca atttctcagg cgaaagcggc agtcttttat cctccgcacg     1680
gcttgcatat gctgctgctc gggccgacgg gttcggggaa atcgctgttt gcgaatcgga     1740
tctaccagtt cgccgtttat tctgacatat tgaagcccga ttccccgttc atcacattca     1800
actgtgcag                                                             1809

<210>  13
<211>  486
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  variant amylase

<400>  13
Ala Ala Pro Phe Asn Gly Thr Met Met Gln Tyr Phe Glu Trp Tyr Leu 
1               5                   10                  15      
Pro Asp Asp Gly Thr Leu Trp Thr Lys Val Ala Asn Glu Ala Asn Asn 
            20                  25                  30          
Leu Ser Ser Leu Gly Ile Thr Ala Leu Trp Leu Pro Pro Ala Tyr Lys 
        35                  40                  45              
Gly Thr Ser Arg Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu Tyr Asp 
    50                  55                  60                  
Leu Gly Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr 
65                  70                  75                  80  
Lys Ala Gln Tyr Leu Gln Ala Ile Gln Ala Ala His Ala Ala Gly Met 
                85                  90                  95      
Gln Val Tyr Ala Asp Val Val Phe Asp His Lys Gly Gly Ala Asp Gly 
            100                 105                 110         
Thr Glu Trp Val Asp Ala Val Glu Val Asn Pro Ser Asp Arg Asn Gln 
        115                 120                 125             
Glu Ile Ser Gly Thr Tyr Gln Ile Gln Ala Trp Thr Lys Phe Asp Phe 
    130                 135                 140                 
Pro Gly Arg Gly Asn Thr Tyr Ser Ser Phe Lys Trp Arg Trp Tyr His 
145                 150                 155                 160 
Phe Asp Gly Val Asp Trp Asp Glu Ser Arg Lys Leu Ser Arg Ile Tyr 
                165                 170                 175     
Lys Phe Arg Gly Ile Gly Lys Ala Trp Asp Trp Pro Val Asp Thr Glu 
            180                 185                 190         
Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp His 
        195                 200                 205             
Pro Glu Val Val Thr Glu Leu Lys Asn Trp Gly Lys Trp Tyr Val Asn 
    210                 215                 220                 
Thr Thr Asn Ile Asp Gly Phe Arg Leu Asp Ala Val Lys His Ile Lys 
225                 230                 235                 240 
Phe Ser Phe Phe Pro Asp Trp Leu Ser Tyr Val Arg Ser Gln Thr Gly 
                245                 250                 255     
Lys Pro Leu Phe Thr Val Gly Glu Tyr Trp Ser Tyr Asp Ile Asn Lys 
            260                 265                 270         
Leu His Asn Tyr Ile Thr Lys Thr Asn Gly Thr Met Ser Leu Phe Asp 
        275                 280                 285             
Ala Pro Leu His Asn Lys Phe Tyr Thr Ala Ser Lys Ser Gly Gly Ala 
    290                 295                 300                 
Phe Asp Met Arg Thr Leu Met Thr Asn Thr Leu Met Lys Asp Gln Pro 
305                 310                 315                 320 
Thr Leu Ala Val Thr Phe Val Asp Asn His Asp Thr Glu Pro Gly Gln 
                325                 330                 335     
Ala Leu Gln Ser Trp Val Asp Pro Trp Phe Lys Pro Leu Ala Tyr Ala 
            340                 345                 350         
Phe Ile Leu Thr Arg Gln Glu Gly Tyr Pro Cys Val Phe Tyr Gly Asp 
        355                 360                 365             
Tyr Tyr Gly Ile Pro Gln Tyr Asn Ile Pro Ser Leu Lys Ser Lys Ile 
    370                 375                 380                 
Asp Pro Leu Leu Ile Ala Arg Arg Asp Tyr Ala Tyr Gly Thr Gln His 
385                 390                 395                 400 
Asp Tyr Leu Asp His Ser Asp Ile Ile Gly Trp Thr Arg Glu Gly Val 
                405                 410                 415     
Thr Glu Lys Pro Gly Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly Pro 
            420                 425                 430         
Gly Gly Ser Lys Trp Met Tyr Val Gly Lys Gln His Ala Gly Lys Val 
        435                 440                 445             
Phe Tyr Asp Leu Thr Gly Asn Arg Ser Asp Thr Val Thr Ile Asn Ser 
    450                 455                 460                 
Asp Gly Trp Gly Glu Phe Lys Val Asn Gly Gly Ser Val Ser Val Trp 
465                 470                 475                 480 
Val Pro Arg Lys Thr Thr 
                485     

<210>  14
<211>  1967
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  colony pcr construct

<400>  14
tgtgtgacgg ctatcatgcc ggcgctttta taaacgagtt ggaacggctt gccgccgatt       60
gtgaggagtg gcttgtgtga cagaggaaag gccgatatga ttcggccttt tttatatgta      120
cttcttagcg ggtctcttaa cccccctcga ggtcgctgat aaacagctga catcaatatc      180
ctattttttc aaaaaatatt ttaaaaagtt gttgacttaa aagaagctaa atgttatagt      240
aataaaacag aatagtcttt taagtaagtc tactctgaat ttttttaaaa ggagagggta      300
aagaatgaaa caacaaaaac ggctttacgc ccgattgctg acgctgttat ttgcgctcat      360
cttcttgctg cctcattctg cagctagcgc agccgcaccg tttaacggta ccatgatgca      420
gtattttgaa tggtacttgc cggatgatgg cacgttatgg accaaagtgg ccaatgaagc      480
caacaactta tccagccttg gcatcaccgc tctttggctg ccgcccgctt acaaaggaac      540
aagccgcagc gacgtagggt acggagtata cgacttgtat gacctcggcg aattcaatca      600
aaaagggacc gtccgcacaa aatatggaac aaaagctcaa tatcttcaag ccattcaagc      660
cgcccacgcc gctggaatgc aagtgtacgc cgatgtcgtg ttcgaccata aaggcggcgc      720
tgacggcacg gaatgggtgg acgccgtcga agtcaatccg tccgaccgca accaagaaat      780
ctcgggcacc tatcaaatcc aagcatggac gaaatttgat tttcccgggc ggggcaacac      840
ctactccagc tttaagtggc gctggtacca ttttgacggc gttgattggg acgaaagccg      900
aaaattaagc cgcatttaca aattcagggg catcggcaaa gcgtgggatt ggccggtaga      960
cacagaaaac ggaaactatg actacttaat gtatgccgac cttgatatgg atcatcccga     1020
agtcgtgacc gagctgaaaa actgggggaa atggtatgtc aacacaacga acattgatgg     1080
gttccggctt gatgccgtca agcatattaa gttcagtttt tttcctgatt ggttgtcgta     1140
tgtgcgttct cagactggca agccgctatt taccgtcggg gaatattgga gctatgacat     1200
caacaagttg cacaattaca ttacgaaaac aaacggaacg atgtctttgt ttgatgcccc     1260
gttacacaac aaattttata ccgcttccaa atcagggggc gcatttgata tgcgcacgtt     1320
aatgaccaat actctcatga aagatcaacc gacattggcc gtcaccttcg ttgataatca     1380
tgacaccgaa cccggccaag cgcttcagtc atgggtcgac ccatggttca aaccgttggc     1440
ttacgccttt attctaactc ggcaggaagg atacccgtgc gtcttttatg gtgactatta     1500
tggcattcca caatataaca ttccttcgct gaaaagcaaa atcgatccgc tcctcatcgc     1560
gcgcagggat tatgcttacg gaacgcaaca tgattatctt gatcactccg acatcatcgg     1620
gtggacaagg gaaggggtca ctgaaaaacc aggatccggg ctggccgcac tgatcaccga     1680
tgggccggga ggaagcaaat ggatgtacgt tggcaaacaa cacgctggaa aagtgttcta     1740
tgaccttacc ggcaaccgga gtgacaccgt caccatcaac agtgatggat ggggggaatt     1800
caaagtcaat ggcggttcgg tttcggtttg ggttcctaga aaaacgacct aaaagcttct     1860
cgaggttaac agaggacgga tttcctgaag gaaatccgtt tttttatttt taacatctct     1920
cactgctgtg tgattttact cacggcattt ggaacgccgg ctctcaa                   1967

<210>  15
<211>  1966
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  colony pcr construct -1A

<400>  15
tgtgtgacgg ctatcatgcc ggcgctttta taaacgagtt ggaacggctt gccgccgatt       60
gtgaggagtg gcttgtgtga cagaggaaag gccgatatga ttcggccttt tttatatgta      120
cttcttagcg ggtctcttaa cccccctcga ggtcgctgat aaacagctga catcaatatc      180
ctattttttc aaaaaatatt ttaaaaagtt gttgacttaa aagaagctaa atgttatagt      240
aataaaacag aatagtcttt taagtaagtc tactctgaat ttttttaaaa ggagagggta      300
aagatgaaac aacaaaaacg gctttacgcc cgattgctga cgctgttatt tgcgctcatc      360
ttcttgctgc ctcattctgc agctagcgca gccgcaccgt ttaacggtac catgatgcag      420
tattttgaat ggtacttgcc ggatgatggc acgttatgga ccaaagtggc caatgaagcc      480
aacaacttat ccagccttgg catcaccgct ctttggctgc cgcccgctta caaaggaaca      540
agccgcagcg acgtagggta cggagtatac gacttgtatg acctcggcga attcaatcaa      600
aaagggaccg tccgcacaaa atatggaaca aaagctcaat atcttcaagc cattcaagcc      660
gcccacgccg ctggaatgca agtgtacgcc gatgtcgtgt tcgaccataa aggcggcgct      720
gacggcacgg aatgggtgga cgccgtcgaa gtcaatccgt ccgaccgcaa ccaagaaatc      780
tcgggcacct atcaaatcca agcatggacg aaatttgatt ttcccgggcg gggcaacacc      840
tactccagct ttaagtggcg ctggtaccat tttgacggcg ttgattggga cgaaagccga      900
aaattaagcc gcatttacaa attcaggggc atcggcaaag cgtgggattg gccggtagac      960
acagaaaacg gaaactatga ctacttaatg tatgccgacc ttgatatgga tcatcccgaa     1020
gtcgtgaccg agctgaaaaa ctgggggaaa tggtatgtca acacaacgaa cattgatggg     1080
ttccggcttg atgccgtcaa gcatattaag ttcagttttt ttcctgattg gttgtcgtat     1140
gtgcgttctc agactggcaa gccgctattt accgtcgggg aatattggag ctatgacatc     1200
aacaagttgc acaattacat tacgaaaaca aacggaacga tgtctttgtt tgatgccccg     1260
ttacacaaca aattttatac cgcttccaaa tcagggggcg catttgatat gcgcacgtta     1320
atgaccaata ctctcatgaa agatcaaccg acattggccg tcaccttcgt tgataatcat     1380
gacaccgaac ccggccaagc gcttcagtca tgggtcgacc catggttcaa accgttggct     1440
tacgccttta ttctaactcg gcaggaagga tacccgtgcg tcttttatgg tgactattat     1500
ggcattccac aatataacat tccttcgctg aaaagcaaaa tcgatccgct cctcatcgcg     1560
cgcagggatt atgcttacgg aacgcaacat gattatcttg atcactccga catcatcggg     1620
tggacaaggg aaggggtcac tgaaaaacca ggatccgggc tggccgcact gatcaccgat     1680
gggccgggag gaagcaaatg gatgtacgtt ggcaaacaac acgctggaaa agtgttctat     1740
gaccttaccg gcaaccggag tgacaccgtc accatcaaca gtgatggatg gggggaattc     1800
aaagtcaatg gcggttcggt ttcggtttgg gttcctagaa aaacgaccta aaagcttctc     1860
gaggttaaca gaggacggat ttcctgaagg aaatccgttt ttttattttt aacatctctc     1920
actgctgtgt gattttactc acggcatttg gaacgccggc tctcaa                    1966

<210>  16
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  16
tgtgtgacgg ctatcatgcc                                                   20

<210>  17
<211>  17
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  17
ttgagagccg gcgttcc                                                      17

<210>  18
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  18
aacgagttgg aacggcttgc                                                   20

<210>  19
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  19
ggcaacacct actccagctt                                                   20

<210>  20
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  20
gatcactccg acatcatcgg                                                   20

<210>  21
<211>  192
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  comK protein

<400>  21
Met Ser Thr Glu Asp Met Thr Lys Asp Thr Tyr Glu Val Asn Ser Ser 
1               5                   10                  15      
Thr Met Ala Val Leu Pro Leu Gly Glu Gly Glu Lys Ser Ala Ser Lys 
            20                  25                  30          
Ile Leu Glu Thr Asp Arg Thr Phe Arg Val Asn Met Lys Pro Phe Gln 
        35                  40                  45              
Ile Ile Glu Arg Ser Cys Arg Tyr Phe Gly Ser Ser Tyr Ala Gly Arg 
    50                  55                  60                  
Lys Ala Gly Thr Tyr Glu Val Ile Lys Val Ser His Lys Pro Pro Ile 
65                  70                  75                  80  
Met Val Asp His Ser Asn Asn Ile Phe Leu Phe Pro Thr Phe Ser Ser 
                85                  90                  95      
Thr Arg Pro Gln Cys Gly Trp Leu Ser His Ala His Val His Glu Phe 
            100                 105                 110         
Cys Ala Ala Lys Tyr Asp Asn Thr Phe Val Thr Phe Val Asn Gly Glu 
        115                 120                 125             
Thr Leu Glu Leu Pro Val Ser Ile Ser Ser Phe Glu Asn Gln Val Tyr 
    130                 135                 140                 
Arg Thr Ala Trp Leu Arg Thr Lys Phe Ile Asp Arg Ile Glu Gly Asn 
145                 150                 155                 160 
Pro Met Gln Lys Lys Gln Glu Phe Met Leu Tyr Pro Lys Glu Asp Arg 
                165                 170                 175     
Asn Gln Leu Ile Tyr Glu Phe Ile Leu Arg Glu Leu Lys Lys Arg Tyr 
            180                 185                 190         
