                         SEQUENCE LISTING

<110>  BASF SE

<120>  Improved Method for the Production of Isoprenoids

<130>  202025WO01

<150>  EP20175075
<151>  2020-05-15

<160>  491

<170>  According Wipo Std 25



<210> 1
<211> 582
<212> PRT
<213> Saccharomyces cerevisiae

<220>
<223> CK1

<400> 1
Met Val Gln Glu Ser Arg Pro Gly Ser Val Arg Ser Tyr Ser Val Gly
1               5                   10                  15
Tyr Gln Ala Arg Ser Arg Ser Ser Ser Gln Arg Arg His Ser Leu Thr
            20                  25                  30
Arg Gln Arg Ser Ser Gln Arg Leu Ile Arg Thr Ile Ser Ile Glu Ser
        35                  40                  45
Asp Val Ser Asn Ile Thr Asp Asp Asp Asp Leu Arg Ala Val Asn Glu
    50                  55                  60
Gly Val Ala Gly Val Gln Leu Asp Val Ser Glu Thr Ala Asn Lys Gly
65                  70                  75                  80
Pro Arg Arg Ala Ser Ala Thr Asp Val Thr Asp Ser Leu Gly Ser Thr
                85                  90                  95
Ser Ser Glu Tyr Ile Glu Ile Pro Phe Val Lys Glu Thr Leu Asp Ala
            100                 105                 110
Ser Leu Pro Ser Asp Tyr Leu Lys Gln Asp Ile Leu Asn Leu Ile Gln
        115                 120                 125
Ser Leu Lys Ile Ser Lys Trp Tyr Asn Asn Lys Lys Ile Gln Pro Val
    130                 135                 140
Ala Gln Asp Met Asn Leu Val Lys Ile Ser Gly Ala Met Thr Asn Ala
145                 150                 155                 160
Ile Phe Lys Val Glu Tyr Pro Lys Leu Pro Ser Leu Leu Leu Arg Ile
                165                 170                 175
Tyr Gly Pro Asn Ile Asp Asn Ile Ile Asp Arg Glu Tyr Glu Leu Gln
            180                 185                 190
Ile Leu Ala Arg Leu Ser Leu Lys Asn Ile Gly Pro Ser Leu Tyr Gly
        195                 200                 205
Cys Phe Val Asn Gly Arg Phe Glu Gln Phe Leu Glu Asn Ser Lys Thr
    210                 215                 220
Leu Thr Lys Asp Asp Ile Arg Asn Trp Lys Asn Ser Gln Arg Ile Ala
225                 230                 235                 240
Arg Arg Met Lys Glu Leu His Val Gly Val Pro Leu Leu Ser Ser Glu
                245                 250                 255
Arg Lys Asn Gly Ser Ala Cys Trp Gln Lys Ile Asn Gln Trp Leu Arg
            260                 265                 270
Thr Ile Glu Lys Val Asp Gln Trp Val Gly Asp Pro Lys Asn Ile Glu
        275                 280                 285
Asn Ser Leu Leu Cys Glu Asn Trp Ser Lys Phe Met Asp Ile Val Asp
    290                 295                 300
Arg Tyr His Lys Trp Leu Ile Ser Gln Glu Gln Gly Ile Glu Gln Val
305                 310                 315                 320
Asn Lys Asn Leu Ile Phe Cys His Asn Asp Ala Gln Tyr Gly Asn Leu
                325                 330                 335
Leu Phe Thr Ala Pro Val Met Asn Thr Pro Ser Leu Tyr Thr Ala Pro
            340                 345                 350
Ser Ser Thr Ser Leu Thr Ser Gln Ser Ser Ser Leu Phe Pro Ser Ser
        355                 360                 365
Ser Asn Val Ile Val Asp Asp Ile Ile Asn Pro Pro Lys Gln Glu Gln
    370                 375                 380
Ser Gln Asp Ser Lys Leu Val Val Ile Asp Phe Glu Tyr Ala Gly Ala
385                 390                 395                 400
Asn Pro Ala Ala Tyr Asp Leu Ala Asn His Leu Ser Glu Trp Met Tyr
                405                 410                 415
Asp Tyr Asn Asn Ala Lys Ala Pro His Gln Cys His Ala Asp Arg Tyr
            420                 425                 430
Pro Asp Lys Glu Gln Val Leu Asn Phe Leu Tyr Ser Tyr Val Ser His
        435                 440                 445
Leu Arg Gly Gly Ala Lys Glu Pro Ile Asp Glu Glu Val Gln Arg Leu
    450                 455                 460
Tyr Lys Ser Ile Ile Gln Trp Arg Pro Thr Val Gln Leu Phe Trp Ser
465                 470                 475                 480
Leu Trp Ala Ile Leu Gln Ser Gly Lys Leu Glu Lys Lys Glu Ala Ser
                485                 490                 495
Thr Ala Ile Thr Arg Glu Glu Ile Gly Pro Asn Gly Lys Lys Tyr Ile
            500                 505                 510
Ile Lys Thr Glu Pro Glu Ser Pro Glu Glu Asp Phe Val Glu Asn Asp
        515                 520                 525
Asp Glu Pro Glu Ala Gly Val Ser Ile Asp Thr Phe Asp Tyr Met Ala
    530                 535                 540
Tyr Gly Arg Asp Lys Ile Ala Val Phe Trp Gly Asp Leu Ile Gly Leu
545                 550                 555                 560
Gly Ile Ile Thr Glu Glu Glu Cys Lys Asn Phe Ser Ser Phe Lys Phe
                565                 570                 575
Leu Asp Thr Ser Tyr Leu
            580

<210> 2
<211> 1749
<212> DNA
<213> Saccharomyces cerevisiae

<220>
<223> CK1 wild type

<400> 2
atggtacaag aatcacgtcc agggagtgta agaagttact cggtcggtta ccaagcaagg     60
tccagatcga gttctcaaag aagacattcg ttaacacgcc aacgttcctc gcaaagactg    120
attagaacca tcagtatcga gtctgatgtg tctaatatta ctgacgatga cgatttgaga    180
gctgtcaatg agggagtagc gggtgtgcaa ctggacgtct ctgaaaccgc aaataaggga    240
ccaagaagag catcagcaac tgatgtcaca gatagtttgg gttcgacttc gtcggaatat    300
attgagattc cctttgttaa ggaaacattg gatgcaagtt taccttcgga ttatctgaag    360
caggacatat taaatctcat tcagagtttg aagatatcca aatggtataa caacaagaaa    420
atccaaccgg tagcacaaga tatgaactta gtcaagatct ctggtgcgat gacaaacgca    480
attttcaaag ttgaataccc taagttacca tcgttgctat tgagaatata cggaccgaat    540
attgataata tcattgacag ggaatatgaa ttgcagattt tggctaggct ttcattgaaa    600
aatataggtc cttcccttta cggctgtttt gtaaacggta gatttgagca gtttctggag    660
aattctaaga ctttaacaaa agacgacatt agaaactgga agaactctca aaggattgca    720
aggagaatga aggagttaca tgtaggtgtt cctctcttga gttcagaaag gaagaacggg    780
tcggcttgtt ggcaaaagat taaccagtgg ttgcgcacga ttgagaaagt cgaccaatgg    840
gtgggggatc ctaaaaacat tgaaaactct ttattatgtg agaattggtc caagtttatg    900
gatattgtcg atagatatca caagtggctt atttctcaag aacagggtat agagcaagtc    960
aacaaaaatc ttatattctg ccataatgat gcccaatacg gcaatttact tttcactgct   1020
cctgtgatga acacaccgag cctatacact gcaccttcgt ctacatcatt gacttcccaa   1080
tcaagttcct tatttccttc gagctccaat gtcattgtag atgatataat caacccgcca   1140
aagcaggagc aaagccaaga ttccaaattg gtcgtcattg attttgaata tgcaggtgcc   1200
aatcccgccg catatgattt agcgaatcat ctttccgagt ggatgtatga ttacaacaat   1260
gctaaggccc cacatcagtg ccacgctgat agatatcccg ataaagaaca ggttttgaat   1320
ttcttatact cttatgtttc gcatctaagg ggtggtgcta aggaacccat agatgaagag   1380
gttcaaagac tctataagtc aatcattcaa tggagaccca ctgtacaact attttggtcg   1440
ctctgggcca tcctacaaag tggtaaatta gagaaaaaag aagcctccac tgccatcact   1500
agagaagaaa ttggacccaa tggaaaaaaa tatatcatca agactgaacc cgaatcccct   1560
gaagaagact ttgttgaaaa tgacgacgag cctgaagctg gcgtcagcat tgacacgttc   1620
gattatatgg cttatggtcg tgacaagatt gcggtctttt ggggcgacct cattggctta   1680
ggcataatca ccgaagaaga atgcaaaaat ttcagctctt tcaagttcct cgatactagt   1740
tatttgtaa                                                           1749

<210> 3
<211> 1749
<212> DNA
<213> Artificial Sequence

<220>
<223> CK1 codon optimised

<400> 3
atggttcagg aaagtcgtcc gggcagtgtg cgtagctata gtgttggtta tcaggcccgc     60
agtcgcagca gtagtcagcg tcgtcatagc ctgacccgcc agcgcagtag ccagcgtctg    120
attcgcacca ttagtattga aagtgatgtg agcaatatca ccgatgatga tgatctgcgt    180
gccgttaatg aaggcgttgc cggcgtgcag ctggatgtta gtgaaaccgc caataaaggt    240
ccgcgccgcg ccagcgccac cgatgtgacc gatagcctgg gcagtaccag tagtgaatat    300
attgaaattc cgttcgtgaa agaaacctta gatgcaagcc tgccgagcga ttatctgaaa    360
caggatattc tgaatctgat tcagagtctg aaaattagta agtggtataa taacaagaag    420
atccagccgg tggcccagga tatgaatctg gtgaaaatta gcggcgccat gaccaatgcc    480
atcttcaaag tggaatatcc gaaactgccg agtctgctgc tgcgcatcta tggtccgaat    540
attgataata ttatcgaccg cgaatacgaa ctgcaaattc tggcacgcct gagcctgaaa    600
aatattggtc cgagcctgta tggttgcttc gttaatggtc gcttcgaaca gttcctggaa    660
aatagcaaaa ccttaaccaa agatgatatt cgcaattgga aaaatagcca gcgtattgca    720
cgtcgcatga aagaactgca tgttggcgtt ccgctgctga gtagtgaacg taaaaatggc    780
agtgcatgtt ggcagaaaat taatcagtgg ctgcgcacca ttgaaaaagt tgatcagtgg    840
gttggcgatc cgaaaaatat tgaaaatagt ctgctgtgcg aaaattggag caaattcatg    900
gatattgttg atcgctatca taaatggctg attagtcagg aacagggcat tgaacaggtt    960
aataaaaatc tgatcttctg ccataacgat gcacagtatg gtaatctgct gttcaccgcc   1020
ccggttatga ataccccgag cctgtacacc gcaccgagta gcaccagcct gaccagtcag   1080
agtagcagtc tgttcccgag cagtagtaat gtgattgtgg atgatattat caacccgccg   1140
aaacaggaac agagccagga tagtaaactg gttgttattg acttcgaata tgccggtgcc   1200
aatccggccg cctatgatct ggccaatcat ctgagcgaat ggatgtatga ttataataat   1260
gcaaaggccc cgcatcagtg ccatgcagat cgttatccgg ataaagaaca ggttctgaac   1320
ttcctgtata gttatgttag tcatctgcgt ggcggtgcca aagaaccgat tgatgaagaa   1380
gtgcagcgtc tgtataaaag tattattcag tggcgtccga ccgtgcagct gttctggagc   1440
ctgtgggcaa ttctgcaaag tggtaaactg gaaaaaaaag aagcaagtac cgcaattacc   1500
cgcgaagaaa ttggcccgaa tggtaaaaaa tatattatca aaaccgagcc ggaaagtccg   1560
gaagaagact tcgttgaaaa tgatgatgaa ccggaagcag gtgtgagtat tgataccttc   1620
gattatatgg cctatggtcg cgataaaatt gcagtgttct ggggtgatct gattggcctg   1680
ggcattatta ccgaagaaga atgtaaaaac ttcagtagct tcaaattcct ggataccagc   1740
tatctgtga                                                           1749

<210> 4
<211> 383
<212> PRT
<213> Galerina marginata

<220>
<223> CK2

<400> 4
Met Lys Glu Ser Trp His Tyr Lys Lys Ser Pro Phe Cys Leu Gln Leu
1               5                   10                  15
Leu Glu Ile Leu Arg Lys Leu His Val Arg Val Trp Ser Thr Ala Gln
            20                  25                  30
Ile Ser Pro Glu Asp Val Ile Ile His Lys Val Ser Gly Ala Leu Thr
        35                  40                  45
Asn Ala Val Phe Phe Val Ser Cys Pro Thr Val Pro Ser Ala Arg Thr
    50                  55                  60
Leu Leu Leu Arg Val Tyr Gly Ser Ser Ser Gly Ser Leu Ile Ser Arg
65                  70                  75                  80
Pro Arg Glu Leu His Thr Leu His Val Leu Ser Ser Gln Tyr Lys Ile
                85                  90                  95
Gly Pro Arg Val Tyr Gly Thr Phe Asp Asn Gly Arg Ile Glu Glu Tyr
            100                 105                 110
Phe Asp Ser Val Thr Leu Thr Ala Ala Asp Ile Arg Asp Pro Ile Thr
        115                 120                 125
Ser Gln Trp Ile Gly Ala Arg Met Ala Asp Leu His Ser Val Asp Ile
    130                 135                 140
Asp Val Val Tyr Glu Ala Gly Ser Leu Pro Leu Asn Glu Asn Arg Gly
145                 150                 155                 160
Phe Glu Ile Ala Ala Asn Val Ser Ser Trp Leu Asp Pro Ala Glu Gln
                165                 170                 175
Val Leu Asn Leu Pro Ala Val Ser Glu Ala Thr Ser Arg Glu Leu Asp
            180                 185                 190
Leu Pro Arg Phe Lys Lys Glu Trp Ala Arg Tyr Leu Cys Trp Ser Leu
        195                 200                 205
Asn Arg Pro His Ser Phe Gly Thr Arg Arg Val Phe Ala His Asn Asp
    210                 215                 220
Ala Gln Tyr Gly Asn Leu Leu Arg Leu Lys Asp Gly Ser Glu Gly Val
225                 230                 235                 240
Asp Glu His Arg Gln Ile Ile Val Val Asp Phe Glu Tyr Ala Ala Pro
                245                 250                 255
Asn Pro Ala Ala Phe Asp Ile Ala Asn His Phe His Glu Trp Thr Ala
            260                 265                 270
Asn Tyr His Cys Pro Thr Pro His Val Leu Ile Pro Ser Arg Tyr Pro
        275                 280                 285
Thr Phe Glu Glu Arg Arg Asn Phe Tyr Thr Ser Tyr Ile Arg His Ala
    290                 295                 300
Ala Met Leu Ala Glu Asp Pro Gly Leu Ser Asp Ala Asp Leu Asp Lys
305                 310                 315                 320
Met Ile Thr Glu Leu Asp Arg Asp Val Leu Ile Trp Gly Ala Ala Ser
                325                 330                 335
His Ala Gly Trp Ala Ile Trp Gly Ile Ile Gln Ala Arg Glu Asp Leu
            340                 345                 350
Glu Ala Ala Val Thr Glu Leu Glu Phe Asp Tyr Ile Gly Tyr Ala Lys
        355                 360                 365
Gly Arg Met Thr Ala Phe Arg Lys Asp Leu Gln Glu Phe Gly Ile
    370                 375                 380

<210> 5
<211> 1152
<212> DNA
<213> Galerina marginata

<220>
<223> CK2 wild type

<400> 5
atgaaggaat catggcacta taagaaatca cctttttgcc tccaacttct tgaaatctta     60
cggaagttgc acgttcgtgt gtggtctact gcacaaatct ccccagaaga cgtcataatc    120
cacaaggttt ccggagctct caccaatgct gtctttttcg tgtcatgccc aaccgtcccg    180
tccgcccgta cgctgttact acgcgtgtat ggttcttcct ccggatctct catttctcga    240
ccacgggaac tacacaccct gcacgtcctt tcgtcacagt acaagatcgg tcctcgagtt    300
tacgggactt ttgataacgg gagaattgag gagtatttcg actccgtgac cttgactgcg    360
gccgacattc gcgatccgat taccagtcaa tggattggag cacgaatggc agaccttcat    420
tctgtcgata tagacgttgt ctatgaagcc ggttcgcttc cactgaatga aaatcgtggt    480
tttgaaattg ctgccaatgt ttcttcttgg ttagatccag ccgaacaagt tctcaattta    540
cctgctgtgt ctgaagccac gtcacgagag ttggacctgc ctaggttcaa gaaagaatgg    600
gcacggtacc tatgttggtc attaaatcga ccgcattcat ttggcactag acgggtcttc    660
gctcacaatg atgctcaata cgggaattta ttgagactca aagacggcag tgagggtgtg    720
gatgagcatc gccagatcat cgtcgtcgat tttgagtacg ctgcccccaa tccagcagca    780
ttcgacattg ccaatcactt ccacgagtgg acggcaaatt atcattgccc gaccccacat    840
gtcctcatcc catcacgtta tcctacattt gaagagcgtc gcaattttta cacttcctac    900
atccgacatg cggcaatgtt ggcagaagat cctggattga gcgatgcgga cctcgacaaa    960
atgatcacag aattggatag ggatgttctt atctggggcg ccgcctcgca tgctggctgg   1020
gccatctggg gcatcatcca agcgagggaa gacttggagg cagctgtcac ggagctggag   1080
ttcgattata tcggttacgc aaaaggtcga atgacagctt tccgcaagga tttacaagaa   1140
tttggcattt aa                                                       1152

<210> 6
<211> 1152
<212> DNA
<213> Artificial Sequence

<220>
<223> CK2 codon optimised

<400> 6
atgaaggaaa gttggcatta taagaagagt ccgttctgtc tgcaactgct ggaaattctg     60
cgtaaactgc atgttcgcgt gtggagcacc gcccagatta gcccggaaga tgtgattatt    120
cataaagtta gtggtgcact gaccaatgca gtgttcttcg tgagttgtcc gaccgtgccg    180
agcgcacgca ccctgttact gcgtgtgtat ggcagtagca gtggtagcct gattagtcgt    240
ccgcgtgaac tgcataccct gcatgtgctg agtagccagt ataaaattgg cccgcgcgtg    300
tatggcacct tcgataatgg ccgtattgaa gaatacttcg atagcgtgac cctgaccgca    360
gccgatattc gcgatccgat taccagccag tggattggcg cccgcatggc agatctgcat    420
agcgttgata ttgatgttgt gtatgaagca ggcagcctgc cgctgaatga aaatcgtggc    480
ttcgaaattg cagcaaatgt gagtagttgg ctggaccctg ccgaacaggt gctgaatctg    540
ccggcagtta gcgaagcaac cagtcgcgaa ctggatctgc cgcgcttcaa aaaagaatgg    600
gcccgctatc tgtgttggag cctgaatcgt ccgcatagct tcggtacccg ccgcgtgttc    660
gcccataatg atgcccagta tggtaatctg ctgcgcctga aagatggtag tgaaggtgtg    720
gatgaacatc gtcagattat tgttgtggac ttcgaatatg cagcaccgaa tccggccgca    780
ttcgatattg caaatcactt ccatgaatgg accgcaaatt atcattgtcc gaccccgcat    840
gttctgattc cgagccgtta tccgaccttc gaagaacgcc gtaacttcta taccagttat    900
attcgccatg ccgccatgct ggccgaagat ccgggtctga gcgatgccga tctggataaa    960
atgattaccg aactggatcg tgatgttctg atctggggcg cagcaagcca tgcaggctgg   1020
gcaatctggg gtattattca ggcacgcgaa gatctggaag ccgccgtgac cgaactggag   1080
ttcgattata ttggctatgc aaaaggtcgc atgaccgcat tccgtaaaga tctgcaagag   1140
ttcggtattt ga                                                       1152

<210> 7
<211> 354
<212> PRT
<213> Piromyces finnis

<220>
<223> CK3

<400> 7
Met Ser Thr Ala Val Glu Glu Glu Tyr Phe Lys Asn Lys Ile Ala Ser
1               5                   10                  15
Ile Glu Lys Phe Asp Phe Thr Leu Asn Thr Glu Ser Gln Glu Thr Ile
            20                  25                  30
Phe Glu Gly Ile Lys Ile Ile Leu Lys His Phe Leu Pro Glu Trp Asn
        35                  40                  45
Asp Leu Lys Phe Thr Pro Gln Thr Asp Gly Ile Thr Asn Thr Leu Ile
    50                  55                  60
Leu Val Ser Cys Pro Gln Gly Lys Val Ile Val Arg Val Phe Gly Asn
65                  70                  75                  80
Gly Thr Glu Tyr Ile Ile Asn Arg Asn Ala Glu Gln Lys Asn Phe Ile
                85                  90                  95
Phe Leu Ser Asp Asn Lys Leu Ala Ala Pro Ile Ile Gly Asn Phe Asn
            100                 105                 110
Asn Gly Phe Val His Gly Tyr Val Glu Gly Ser Val Phe Ser Val Pro
        115                 120                 125
Asp Met Ser Asp Pro Gln Lys Ser Leu Leu Val Ala Lys Lys Ile Gly
    130                 135                 140
Lys Trp His Ser Leu Asn Phe Pro Phe Glu Lys Lys Ser Ser Val Tyr
145                 150                 155                 160
Asp Val Ile Asn Lys Trp Ile Asp Glu Ala Pro Glu Val Phe Glu Asp
                165                 170                 175
Lys Arg Lys Asn Glu Ile Tyr Tyr Ser Lys Asp Tyr Leu Arg Lys Glu
            180                 185                 190
Asn Leu Arg Asn Glu Ile Asn Phe Leu Lys Glu Lys Leu Asp Glu Ile
        195                 200                 205
Ser Ser Pro Leu Ala Phe Cys His Cys Asp Leu Leu Tyr Gly Asn Ile
    210                 215                 220
Ile Leu His Lys Asp Glu Asn Gly Asn Asp Asp Val Thr Phe Ile Asp
225                 230                 235                 240
Tyr Glu Tyr Gly Ser Ile Asn Pro Arg Gly Phe Asp Ile Gly Asn His
                245                 250                 255
Phe Asn Glu Tyr Ala Gly Phe Asp Cys Asp Tyr Asn Leu Tyr Pro Ser
            260                 265                 270
Lys Glu Phe Gln Tyr Lys Trp Leu Lys Val Tyr Leu Gln Ser Tyr Leu
        275                 280                 285
Gly Lys Glu Asn Ile Ser Glu Lys Glu Ile Glu Asp Leu Tyr Arg Glu
    290                 295                 300
Val Asn Lys Tyr Ala Leu Leu Ser His Tyr Tyr Trp Gly Val Trp Ala
305                 310                 315                 320
Ile Leu Gln Ala Lys Tyr Ser Gln Ile Asp Phe Asp Tyr Ile Ser Tyr
                325                 330                 335
Ser Ile Leu Arg Leu Asp Glu Tyr Tyr Asn Gln Lys Glu Arg Phe Leu
            340                 345                 350
Ser Leu

<210> 8
<211> 1065
<212> DNA
<213> Piromyces finnis

<220>
<223> CK3 wild type

<400> 8
atgtcaacag ctgtcgaaga agaatatttt aaaaataaaa ttgcttctat agagaaattt     60
gattttactt taaatacaga aagtcaagaa actattttcg aaggaattaa aattatttta    120
aaacattttt tacctgaatg gaatgattta aagtttactc cacaaactga tggtattaca    180
aatacactta ttttagtatc atgtcctcaa ggaaaagtta ttgtaagagt atttggtaat    240
ggtactgaat acattattaa tagaaatgct gaacaaaaga attttatatt tttatccgat    300
aataaactcg ctgctccaat tattggaaat ttcaacaatg gctttgttca tggatatgtt    360
gaaggaagtg tattttctgt tccagacatg tcagatccac aaaagtcttt attagttgca    420
aaaaaaattg gaaaatggca tagtttaaat ttcccatttg aaaagaagag ctctgtttat    480
gatgttatta ataagtggat tgatgaagct ccagaagttt ttgaagataa aagaaagaat    540
gaaatatatt attctaagga ctatttaaga aaagaaaatt taagaaatga aatcaacttt    600
ttaaaggaaa aacttgatga aatttcatct ccattagctt tttgtcattg tgatcttctt    660
tatggaaata tcattttaca taaggatgaa aatggaaatg atgatgtaac ctttattgat    720
tatgaatatg gttcaattaa tccaagagga tttgacattg gaaatcattt taatgaatat    780
gctggctttg attgtgacta taatttatat ccatcaaaag aatttcaata taaatggtta    840
aaggtttatt tacaaagtta cttaggtaaa gaaaatatta gtgaaaagga aatagaagat    900
ttatatcgtg aagttaataa atatgcctta ttatcacatt attattgggg tgtctgggcc    960
attcttcaag caaagtactc tcaaattgat tttgattata ttagttattc tattctaagg   1020
cttgatgaat actacaatca aaaagaaaga tttttaagtc tttaa                   1065

<210> 9
<211> 1065
<212> DNA
<213> Artificial Sequence

<220>
<223> CK3 codon optimised

<400> 9
atgagtaccg cagtggaaga agaatacttc aaaaataaaa tcgcgagcat tgaaaagttc     60
gacttcaccc tgaataccga aagtcaggaa accatcttcg aaggcattaa aattattctg    120
aagcacttcc tgccggaatg gaatgatctg aaattcaccc cgcagaccga tggtattacc    180
aataccctga ttctggttag ctgcccgcag ggtaaagtta ttgttcgtgt gttcggtaat    240
ggtaccgaat atattattaa ccgtaacgca gaacagaaaa acttcatctt cctgagtgat    300
aataagctgg ccgccccgat tattggtaac ttcaataatg gcttcgttca tggctatgtg    360
gaaggcagcg tgttcagcgt gccggatatg agtgatccgc agaaaagtct gctggtggcc    420
aaaaaaattg gcaaatggca tagcctgaac ttcccgttcg aaaaaaaaag cagcgtgtat    480
gatgtgatta ataaatggat tgacgaggca ccggaagtgt tcgaagataa acgtaaaaat    540
gaaatctact acagcaagga ttacctgcgt aaagaaaatc tgcgtaatga aattaacttc    600
ctgaaagaaa agctggatga aattagcagt ccgctggcct tctgtcattg tgatctgctg    660
tatggtaata ttattctgca taaagacgag aatggcaatg atgatgtgac cttcattgat    720
tatgaatatg gcagtattaa cccgcgtggc ttcgatattg gtaatcactt caatgaatac    780
gcaggcttcg attgtgatta taatctgtat ccgagcaaag agttccagta taaatggctg    840
aaagtgtatc tgcaaagcta tctgggcaaa gaaaatatta gcgaaaaaga aatcgaggat    900
ctgtatcgcg aagtgaataa atatgcactg ctgagccatt attattgggg tgtgtgggcc    960
attctgcaag ccaaatatag ccagattgac ttcgattata tcagctatag cattctgcgc   1020
ctggatgaat attataatca gaaagaacgc ttcctgagcc tgtga                   1065

<210> 10
<211> 537
<212> PRT
<213> Babjeviella inositovora

<220>
<223> CK4

<400> 10
Met Asp Thr Leu Ala Thr Glu Lys Pro Arg Ser Arg Ser Arg Ser Met
1               5                   10                  15
Ser Arg Ser Arg Ser Arg Ser Asn Ile Arg Pro Val Leu Thr Pro Leu
            20                  25                  30
Leu Ser Ser Asn Gln Val Lys Gln Val Pro Arg Arg Arg Ser His Ser
        35                  40                  45
Arg Arg Pro Ser Leu Ser Arg Gln Ser Ser Gly Ser Val Asp Glu Val
    50                  55                  60
Pro His Ile Lys Ala Asn Leu Asp Asn Ser Leu Pro Leu Asp Phe Met
65                  70                  75                  80
Lys Glu Glu Ile Met Ile Ile Val Lys Ala Leu Arg Ile Lys His Trp
                85                  90                  95
His Lys Leu Pro Glu Ser Ala Ala Ser Arg Ile Lys Val Asn Arg Ile
            100                 105                 110
Ser Gly Ala Leu Thr Asn Ser Ile Tyr Lys Leu Asn Leu Asp Glu Cys
        115                 120                 125
Pro Ala Leu Leu Leu Arg Val Tyr Gly Lys Asn Val Asp Glu Ile Ile
    130                 135                 140
Asp Arg Glu Ala Glu Leu Ile Ile Leu Lys Arg Leu Ser Ser Lys Arg
145                 150                 155                 160
Ile Gly Pro Arg Leu Leu Gly Thr Phe Thr Asn Gly Arg Phe Glu Gln
                165                 170                 175
Phe Leu Asp Gly Phe Ile Thr Leu Asn Lys Asp Gln Leu Arg Asn Lys
            180                 185                 190
Tyr Ile Ser Gln Met Ile Ala Lys Arg Met Lys Glu Leu His Val Asn
        195                 200                 205
Met Glu Leu Glu Ala Lys Asp Thr His Pro Met Ser Trp Ala Leu Ile
    210                 215                 220
Asp Lys Trp Phe Pro Leu Ala Glu Glu Val Val Lys Ser Tyr Glu Ala
225                 230                 235                 240
Asn Pro Asp Val Ser Glu Ala Asp Phe Leu Leu Thr Asn Phe Ala Thr
                245                 250                 255
Phe Lys Lys Asn Val Gln Ala Tyr Arg Thr Trp Leu Met Asn Lys Tyr
            260                 265                 270
Gly Lys Ala Glu Phe Pro Arg Glu Val Leu Cys Phe Cys His Asn Asp
        275                 280                 285
Thr Gln Tyr Gly Asn Leu Leu Leu His Ser Ser Leu Leu Glu Asp Ser
    290                 295                 300
Lys Thr Glu Val Ala Lys Val Ile Glu Lys Met Glu Ser Leu Ser Leu
305                 310                 315                 320
Asp Phe Asp Ser Asp Lys Leu Ala Ala Ala Ser His Ser Asn Leu Val
                325                 330                 335
Val Ile Asp Leu Glu Tyr Ser Gly Pro Asn Cys Pro Pro Phe Glu Phe
            340                 345                 350
Ala Asn His Phe Ser Glu Trp Met Ala Asp Tyr Leu Asp Ala Thr Asn
        355                 360                 365
Ser His Tyr Leu Asp Glu Arg Lys Tyr Pro Thr Thr Glu Glu Gln Leu
    370                 375                 380
Asn Phe Phe Arg Val Tyr Thr Glu Phe Ser Gly Arg Ala Thr Asp Pro
385                 390                 395                 400
Ala Asp Ser Thr Arg Pro Asp Glu Ala Ala Thr Lys Lys Leu Phe Asn
                405                 410                 415
Glu Thr Ile Trp Trp Arg Gly Thr Val Ser Val Tyr Trp Cys Leu Trp
            420                 425                 430
Gly Ile Val Gln Asn Gly Pro Trp Lys Pro Thr Pro Thr Pro Glu Ala
        435                 440                 445
Ala Thr Gly Glu Gly Phe Leu Gly Thr Tyr Lys Phe Ser Thr Glu Thr
    450                 455                 460
Glu Glu Gly Asp Asp Gln Gly Ala Glu Val Glu Ile Thr Glu Ser Ser
465                 470                 475                 480
Asp Asp Ala Phe Ser Tyr Ile Arg Tyr Ser Gln Gln Lys Ala Ala Met
                485                 490                 495
Phe Tyr Gly Asp Ala Val Gln Leu Gly Ile Val Asp Arg Asp Ala Ile
            500                 505                 510
Cys Glu Arg Tyr Leu Thr Gln Gly Glu Gly Ala Asp Glu Gly His Val
        515                 520                 525
Lys Phe Leu Ser Val Lys Glu Leu Asp
    530                 535

<210> 11
<211> 1614
<212> DNA
<213> Babjeviella inositovora

<220>
<223> CK4 wild type

<400> 11
atggacacct tagctaccga gaaaccgaga tcccgctctc ggtcgatgtc ccgctctaga     60
tcacggtcca acatcagacc ggttttgacc cccctgcttt cctccaacca ggtgaaacaa    120
gtccccagac gccgttcgca ctcgagaaga ccctcgttaa gcagacagag ctcggggagc    180
gtcgatgaag tgccgcacat taaggctaac ttggacaaca gcttgccatt ggacttcatg    240
aaggaggaga tcatgatcat tgtcaaggcg ttgcgtatca agcactggca caagctcccg    300
gaatctgctg cgtcgcgcat caaggtgaac cgaatcagcg gtgccttgac caactcgatt    360
tacaagttga atcttgacga gtgtccggcg ttgctcttgc gcgtgtatgg gaagaacgtc    420
gatgagatca tcgacagaga ggcggagctt atcattttaa agaggctctc gagtaagcgg    480
attggcccgc gtctcttggg aaccttcact aatgggcggt ttgagcagtt tttggacggg    540
ttcatcacct tgaacaagga ccagctccgt aacaagtaca tctcgcagat gattgccaag    600
cgcatgaagg agttgcatgt caacatggag ttggaggcca aggacacgca tcccatgtct    660
tgggcgttga ttgacaagtg gtttccgctc gcggaggagg tagtgaaatc ttatgaagcc    720
aacccagacg tttccgaagc cgactttctc ctcacaaatt ttgccacctt taagaagaat    780
gtgcaagcgt acagaacctg gttgatgaac aagtacggca aggctgagtt tccacgggag    840
gtcttgtgct tctgccacaa cgacacccag tacggaaact tgcttttgca tagcagcttg    900
ttggaggact ccaagactga ggttgctaag gtgatcgaaa agatggaatc actttcgctt    960
gactttgatt cggataagct tgccgcagcc tcgcattcca acttggtggt cattgattta   1020
gagtactctg gccctaactg tccgcctttc gaattcgcca accatttcag cgaatggatg   1080
gccgattacc ttgatgccac taactcccac tacttggacg aaaggaagta cccaaccacc   1140
gaagagcagt tgaacttttt ccgtgtctat accgagtttt ccggccgtgc caccgatcct   1200
gccgactcca ccaggccaga tgaagcagcc acgaaaaagt tgtttaacga aaccatctgg   1260
tggaggggta ccgtttctgt ctattggtgc ttgtggggga ttgtccagaa tgggccgtgg   1320
aagcctacgc ccacgccaga ggcagccacc ggcgagggat tcctggggac ttacaagttt   1380
agtaccgaaa cggaggaagg agatgaccaa ggagccgagg tggagattac cgagtcttca   1440
gacgatgcgt tcagttatat ccgatactct caacagaagg cggccatgtt ctacggagac   1500
gctgttcagc ttgggattgt tgatagagac gcaatctgcg agagatactt gacgcaaggc   1560
gaaggggctg atgaaggaca tgttaagttc ttgtctgtca aggaattaga ctaa         1614

<210> 12
<211> 1614
<212> DNA
<213> Artificial Sequence

<220>
<223> CK4 codon optimised

<400> 12
atggataccc tggccaccga aaaaccgcgt agccgtagcc gcagcatgag ccgtagccgt     60
agtcgcagta atattcgccc ggttctgacc ccgctgctga gcagtaatca ggttaaacag    120
gtgccgcgcc gtcgcagcca tagccgtcgt cctagtctga gtcgtcagag tagcggtagt    180
gtggatgaag ttccgcatat taaagccaat ctggataata gtctgccgct ggacttcatg    240
aaagaagaaa ttatgattat cgtgaaggca ctgcgcatta aacattggca taaactgccg    300
gaaagcgccg caagccgtat taaagtgaat cgcattagtg gtgcactgac caatagtatc    360
tataaactga atctggatga gtgcccggcc ctgctgctgc gtgtgtatgg caaaaatgtt    420
gatgaaatta tcgaccgtga agcagaactg attattctga aacgcctgag cagtaaacgc    480
attggtccgc gtctgctggg taccttcacc aatggtcgct tcgaacagtt cctggatggc    540
ttcattaccc tgaataaaga tcagctgcgc aataaatata tcagccagat gattgccaaa    600
cgtatgaaag aactgcatgt gaatatggaa ctggaagcca aagataccca tccgatgagc    660
tgggccctga ttgataaatg gttcccgctg gccgaagaag ttgtgaaaag ctatgaagca    720
aatccggatg tgagtgaagc cgacttcctg ctgaccaact tcgccacctt caaaaaaaat    780
gttcaggcat atcgtacctg gctgatgaat aaatatggta aagcagagtt cccgcgcgaa    840
gttctgtgct tctgtcataa tgatacccag tatggcaatc tgctgctgca tagcagtctg    900
ctggaagata gcaaaaccga agtggccaaa gtgattgaaa aaatggaaag tctgagcctg    960
gacttcgata gcgataaact ggccgcagcc agccatagta atctggtggt tattgatctg   1020
gaatatagtg gcccgaattg tccgccgttc gagttcgcca atcacttcag cgaatggatg   1080
gcagattatc tggatgcaac caatagccat tatctggatg aacgtaaata tccgaccacc   1140
gaagaacagc tgaacttctt ccgcgtgtat accgagttca gtggtcgtgc caccgatccg   1200
gccgatagca cccgtccgga tgaagccgcc accaaaaaac tgttcaatga aaccatctgg   1260
tggcgcggca ccgttagtgt gtattggtgc ctgtggggca ttgttcagaa tggcccgtgg   1320
aaaccgaccc cgaccccgga agccgcaacc ggtgaaggct tcctgggtac ctataaattc   1380
agtaccgaaa ccgaagaagg cgatgatcag ggtgccgaag tggaaattac cgaaagtagt   1440
gatgatgcct tcagctatat tcgttatagc cagcagaaag ccgccatgtt ctatggtgat   1500
gccgtgcagc tgggtattgt ggatcgtgat gcaatctgtg aacgttatct gacccagggc   1560
gaaggcgcag atgaaggtca tgttaaattc ctgagtgtga aagaactgga ttga         1614

<210> 13
<211> 405
<212> PRT
<213> Kazachstania naganishii

<220>
<223> CK5

<400> 13
Met Lys Arg Leu His Val Thr Ile Pro Leu Asp Ala Pro Asp Asn Leu
1               5                   10                  15
Val Ser Leu Leu Thr Asp Asp Cys Gln Asn Tyr Glu Ile Val Lys Leu
            20                  25                  30
Lys Gly Ala Leu Thr Asn Val Ile Tyr Lys Leu Ser Ile Arg Asp Ser
        35                  40                  45
Ser Gly Thr Ser Thr Ser Tyr Leu Val Arg Ile Phe Gly Ala Lys Leu
    50                  55                  60
Glu Ser Leu Val Asp Arg Val Glu Glu Phe Asn Asn Ile Thr Arg Val
65                  70                  75                  80
Pro Pro Val Val Gly Tyr Val Asn Val Leu Tyr Val Phe Asp Asn Gly
                85                  90                  95
Arg Val Glu Tyr Phe Leu Glu Gly Phe Lys Ser Val Ser Ala Lys Gln
            100                 105                 110
Met Val Gln Gln Asn Val Tyr Arg Val Leu Ala Gln Lys Phe Lys Ala
        115                 120                 125
Leu His Cys Leu Val Ser Ile Thr Asp Lys Glu Ile Ala His His Arg
    130                 135                 140
Asp Gly Met Cys Trp Tyr Lys Leu Gly Gln Trp Ile Glu Ile Ile Glu
145                 150                 155                 160
Asn Ile Asn Gly Gly Glu Trp Ile Asp Ser Arg Asp His Gln Asn Val
                165                 170                 175
Thr Glu Ile Leu Leu Cys Arg Asp Trp Ala Thr Phe Lys Lys Thr Val
            180                 185                 190
Leu Asn Tyr Lys Asn Trp Leu Leu Glu Glu Asp Ala Glu Ser Phe Gln
        195                 200                 205
Gln Met Lys Phe Cys His Asn Asp Ala Gln Gln Gly Asn Ile Leu Leu
    210                 215                 220
Asp Ser Lys Thr Lys Asp Asp Asp Ile Pro Asn Leu Asn Leu Ile Asp
225                 230                 235                 240
Tyr Glu Tyr Ser Gly Val Asn Ala Ile Gln Phe Asp Leu Ala Asn Phe
                245                 250                 255
Leu Thr Glu Cys Met His Asp Tyr Glu Ile Asp Glu Ser Tyr Lys Cys
            260                 265                 270
His Gly Glu Gln Tyr Pro Ser Lys Glu Lys Val Leu Asp Phe Leu Tyr
        275                 280                 285
His Tyr Ser Thr His Leu His His Gly Asp Ser Lys Gly Glu Ala Ser
    290                 295                 300
Ile Val Lys Leu Tyr Asn Ser Val Leu Lys Trp Arg Ala Ala Ser Gln
305                 310                 315                 320
Leu Phe Trp Ser Val Trp Ala Ile Leu Gln Ser Gly Gln Leu Glu Ala
                325                 330                 335
Ala Ser Ala Lys Ile Glu Ala His Val Pro Asp Lys Gly Ser Asn Arg
            340                 345                 350
Val Thr Ser Ser Asp Asp Pro Asn Glu Glu Val Phe Asp Tyr Met Gly
        355                 360                 365
Phe Cys Asn Glu Lys Leu Ser Tyr Phe Trp Gly Asp Met Ile Lys Phe
    370                 375                 380
Asn Leu Ala Ser Lys Glu Asp Cys Ile Val Ser Lys Val Arg Tyr Leu
385                 390                 395                 400
Asp Thr Glu Phe Ile
                405

<210> 14
<211> 1218
<212> DNA
<213> Kazachstania naganishii

<220>
<223> CK5 wild type

<400> 14
atgaagagat tgcatgtcac aataccgcta gatgctccag ataatctggt atcactcctg     60
actgatgatt gtcaaaatta tgaaattgtc aagctgaaag gtgcattgac taatgtcatt    120
tacaagttgt cgatccggga ttcttctggg acttctacaa gctacctagt tcggatattt    180
ggagcaaaat tggaatcgct tgttgataga gttgaagagt ttaataacat aacgagagta    240
cctccagtgg ttggatatgt aaatgtgctt tacgtttttg acaacggaag agtggaatac    300
ttcttagaag ggttcaaaag tgtgtctgct aagcaaatgg tgcagcaaaa tgtataccga    360
gttcttgctc aaaaatttaa agcgttacac tgtttggttt cgattacaga taaggaaatt    420
gctcaccatc gtgacggtat gtgttggtat aaattgggcc aatggattga gatcattgag    480
aatataaatg gaggagaatg gatagacagt agggatcacc agaacgtgac tgagattttg    540
ctatgtagag attgggctac attcaaaaag acggtactaa attataaaaa ctggcttttg    600
gaagaagatg ccgaaagttt ccagcaaatg aaattttgcc ataacgatgc tcaacagggg    660
aatatcctgt tggattcaaa aacgaaggat gatgatattc caaatcttaa tctgatcgat    720
tacgagtatt ccggggtcaa cgctatccag tttgatcttg caaatttttt gactgaatgc    780
atgcatgact atgaaattga cgaatcttat aagtgccatg gtgaacagta tccaagtaaa    840
gaaaaagttt tggacttttt ataccactac tcgacacact tgcatcacgg tgattccaag    900
ggagaagctt ctattgtcaa actgtacaac tctgttttga aatggagagc tgcatctcag    960
ttgttctggt cggtatgggc catactacag agcggtcaac tggaggcagc atcagcaaaa   1020
attgaggcgc acgtcccaga taagggaagc aatagagtga cctcctctga tgatcccaac   1080
gaagaggtgt ttgattacat ggggttctgt aatgaaaaat tgtcttactt ctggggggat   1140
atgataaaat tcaacctagc tagtaaggaa gattgcattg tttcaaaagt tagatatctg   1200
gatacagagt tcatatag                                                 1218

<210> 15
<211> 1218
<212> DNA
<213> Artificial Sequence

<220>
<223> CK5 codon optimised

<400> 15
atgaagcgtc tgcatgttac cattccgctg gatgccccgg ataatctggt tagcctgctg     60
accgatgatt gtcagaatta tgaaattgtt aagctgaagg gcgccctgac caatgttatc    120
tataaactga gtattcgcga tagcagtggt accagtacca gttatctggt tcgtatcttc    180
ggcgccaaac tggaaagtct ggtggatcgt gttgaagagt tcaataatat tacccgtgtt    240
ccgccggttg tgggttatgt taatgttctg tatgtgttcg ataacggtcg cgttgaatac    300
ttcctggaag gcttcaaaag cgtgagtgca aaacagatgg tgcagcagaa tgtgtatcgc    360
gttctggccc agaaattcaa agccctgcat tgcctggtga gcattaccga taaagaaatt    420
gcacatcatc gcgatggtat gtgttggtat aaactgggcc agtggattga aattattgaa    480
aatattaacg gtggtgagtg gattgatagc cgtgatcatc agaatgtgac cgaaattctg    540
ctgtgccgtg attgggccac cttcaaaaaa accgtgctga attataaaaa ctggctgctg    600
gaagaagatg ccgaatcatt ccagcagatg aaattctgcc ataatgatgc acagcagggc    660
aatattctgc tggatagcaa aaccaaagat gatgatattc cgaatctgaa tctgattgat    720
tatgaatata gcggtgttaa tgccattcag ttcgatctgg caaacttcct gaccgaatgt    780
atgcatgatt atgaaattga tgagagttac aagtgccacg gtgaacagta tccgagtaaa    840
gaaaaagtgc tggacttcct gtatcattat agcacccatc tgcatcatgg cgatagtaaa    900
ggtgaagcca gtattgtgaa actgtataat agcgttctga aatggcgtgc agccagccag    960
ctgttctgga gcgtgtgggc aattctgcaa agcggccagc tggaagccgc aagtgccaaa   1020
attgaagccc atgttccgga taaaggtagc aatcgcgtga ccagtagtga tgatccgaat   1080
gaagaagtgt tcgattatat gggcttctgt aatgaaaaac tgagctactt ctggggtgat   1140
atgattaaat tcaatctggc cagcaaagaa gattgcattg tgagcaaagt tcgctatctg   1200
gataccgagt tcatttga                                                 1218

<210> 16
<211> 1181
<212> PRT
<213> Umbilicaria pustulata

<220>
<223> CK6

<400> 16
Met Ser Leu His Pro Ala Ser Ser Trp Glu Ala Pro Met Met Met Leu
1               5                   10                  15
Lys Glu Gly Ala Leu Pro Pro His Ser Ser His Pro Ser Phe Gly His
            20                  25                  30
Leu Ile Leu Leu Val Phe Glu Ala Val Leu Glu Val Val Cys Val Ser
        35                  40                  45
Leu Pro Gly Tyr Ile Val Ala Arg Gln Gly Leu Phe Ser Thr Glu Met
    50                  55                  60
Gln Lys Phe Val Ala Asn Leu Asn Val Met Leu Phe Thr Pro Cys Leu
65                  70                  75                  80
Ile Phe Thr Lys Leu Ala Ser Gln Leu Ser Ala Glu Lys Leu Gly Glu
                85                  90                  95
Leu Ala Val Ile Pro Val Ile Phe Ile Val Gln Thr Leu Val Ser Tyr
            100                 105                 110
Leu Cys Ser Ile Gly Met Ser Arg Leu Leu Gly Leu Gln Lys Arg Pro
        115                 120                 125
Arg Asn Phe Val Val Ala Met Gly Val Phe Gly Asn Ser Asn Ser Leu
    130                 135                 140
Pro Ile Ser Leu Val Ile Ser Leu Ser Gln Thr Leu Lys Gly Leu His
145                 150                 155                 160
Trp Asp Lys Val Pro Gly Asp Asn Asp Asp Glu Val Ala Ala Arg Gly
                165                 170                 175
Ile Leu Tyr Leu Met Ile Phe Gln Gln Leu Gly Gln Leu Val Arg Trp
            180                 185                 190
Ser Trp Gly Tyr His Val Leu Leu Ala Ala Pro Glu Lys Tyr Lys Val
        195                 200                 205
Glu Asp His Tyr Thr Asp Ser Gln Leu Glu Gln Gly Gln Ser Ile Tyr
    210                 215                 220
Arg Asp Asp Pro Asp Ser Glu Asp Glu Ala Leu Leu Arg Asn Pro Leu
225                 230                 235                 240
Thr Glu Ser Asp Arg Asp Ser Ile Asp Val Ser Lys Asp Ser Ser Asp
                245                 250                 255
Thr Asn Val Glu Gly Arg Asp Pro Asn Ser Ala Ser Ala Thr Gly Ser
            260                 265                 270
Gln Thr Gly Phe Glu Ile Gly Ser Gly Arg Gln Thr Pro Val Thr His
        275                 280                 285
Gln Gln Tyr Ala Ser Ser Val Ser Ser Arg Gln Leu Glu Asn Ala Gly
    290                 295                 300
Pro Thr Glu Ser Glu Pro Ser Ser Gly Leu Leu Pro Thr Pro Thr Asn
305                 310                 315                 320
Gly Asn Val Lys Pro His Ser Ser Glu Val Asp Tyr Thr Gly Ser Val
                325                 330                 335
His Glu Met His Ser Pro Thr Ala Pro His Asp Glu His Val Pro Ala
            340                 345                 350
Gly Leu Trp Gly Leu Pro Val Arg Ala Met Leu Ala Val Lys Arg Gly
        355                 360                 365
Val Gln Gln Met Ser Val Thr Val Ser Thr Phe Ser Arg Ser Ile Tyr
    370                 375                 380
Glu Thr Leu Pro Glu Ser Val Gln Ser Ile Met Met Lys Ile Tyr Met
385                 390                 395                 400
Gly Leu Arg Arg Phe Val Leu Gly Ile Trp Glu Phe Met Asn Pro Pro
                405                 410                 415
Leu Trp Ala Met Leu Ala Ala Ile Ile Val Ala Ser Val Pro Ser Leu
            420                 425                 430
Gln His Leu Phe Phe Ser Glu Gly Thr Phe Ile Arg Asn Ser Val Thr
        435                 440                 445
Arg Ala Ile Ser Gln Ser Gly Gly Val Ala Val Pro Leu Ile Leu Val
    450                 455                 460
Val Leu Gly Ala Asn Leu Ala Arg Asn Thr Leu Pro Glu Asp Pro His
465                 470                 475                 480
His Ser Val Glu Asp Asp Arg Ile Glu Lys Lys Leu Leu Ile Ala Ser
                485                 490                 495
Leu Val Ser Arg Met Leu Leu Pro Ile Ile Val Met Ala Pro Leu Leu
            500                 505                 510
Ala Val Thr Ala Lys Tyr Val Pro Val Ser Ile Leu Asp Asp Pro Ile
        515                 520                 525
Phe Val Ile Val Cys Phe Leu Leu Thr Gly Ala Pro Ser Ala Leu Gln
    530                 535                 540
Leu Ala Gln Ile Cys Gln Ile Asn Gly Val Tyr Met Gly Ala Met Ser
545                 550                 555                 560
Lys Leu Leu Phe Gln Ser Tyr Val Val Trp Ile Leu Pro Ser Thr Leu
                565                 570                 575
Ile Leu Val Met Glu Thr Asn Leu Ser Arg Arg Ala Ser Ser Ile Arg
            580                 585                 590
Lys Leu Arg Lys Gly Ser Thr Ala Ala Gly Ser Ser Asp Thr Glu Tyr
        595                 600                 605
His Glu Gly Asp Ala Leu Val Pro Thr Ala Glu Val Phe Leu Asp Asn
    610                 615                 620
Ser Lys Thr Leu Ser Tyr Phe Gly Gly Thr Ala Glu Ser Glu Pro Asp
625                 630                 635                 640
Leu Ser Asn Ser Ser Lys Arg Ala Ala Lys Glu Gly Trp Leu Ile Phe
                645                 650                 655
Lys Asn Glu Ile Leu Arg Leu Thr His Thr Leu Arg Leu Lys Gly Trp
            660                 665                 670
Arg Met Leu Pro Leu Glu Arg Gly Gly Asp Ile Asp Val Glu Arg Leu
        675                 680                 685
Ser Gly Ala Leu Thr Asn Ala Val Tyr Val Val Ser Pro Pro Lys Asp
    690                 695                 700
Leu Glu Gln Thr Pro Ser Asp Lys Asp Gly Gly Thr Ala Pro Leu Ala
705                 710                 715                 720
Pro Lys Lys Pro Pro Pro Lys Leu Leu Leu Arg Ile Tyr Gly Pro Gln
                725                 730                 735
Val Glu His Leu Ile Asp Arg Glu Asn Glu Leu Gln Ile Leu Arg Arg
            740                 745                 750
Leu Ala Arg Lys Lys Ile Gly Pro Arg Leu Leu Gly Thr Phe Val Asn
        755                 760                 765
Gly Arg Phe Glu Glu Phe Phe His Ala Arg Thr Leu Thr Ala His Asp
    770                 775                 780
Leu Arg Val Pro Ala Thr Ser Lys Gln Ile Ala Lys Arg Met Arg Glu
785                 790                 795                 800
Leu His Glu Gly Ile Glu Leu Leu Glu Glu Glu Arg Ala Ala Gly Pro
                805                 810                 815
Phe Val Trp Arg Asn Trp Asp Lys Trp Val Asp Arg Cys Glu Glu Val
            820                 825                 830
Ile Ser Trp Leu Asp Asn Gln Ile Leu Ser Gly Asn Gln Ser Ser Ala
        835                 840                 845
Arg Thr Pro Thr Glu Met Trp Lys Gln Arg Gly Leu Ile Cys Gly Val
    850                 855                 860
Glu Trp Ser Val Phe Arg Glu Thr Val Asn Arg Tyr Arg Ala Trp Leu
865                 870                 875                 880
Asp Lys Gln Tyr Gly Gly Arg Asp Ala Leu Arg Gln Gln Leu Val Phe
                885                 890                 895
Ala His Asn Asp Thr Gln Tyr Gly Asn Leu Leu Arg Leu Glu Pro Ser
            900                 905                 910
Gly Glu Ser Pro Leu Leu Leu Pro Val Asn Glu His Lys Gln Leu Ile
        915                 920                 925
Val Ile Asp Phe Glu Tyr Ala Ser Ala Asn Leu Pro Gly Leu Glu Phe
    930                 935                 940
Ala Asn His Phe Thr Glu Trp Cys Tyr Asn Tyr His Asp Ala Glu Lys
945                 950                 955                 960
Pro Tyr Ser Leu Thr Glu Ala Asn Tyr Pro Thr Lys Asp Glu Gln Arg
                965                 970                 975
Thr Phe Leu Lys Ala Tyr Val Gln His Arg Pro Gln Phe His Ala Arg
            980                 985                 990
Ala Ser Ala Thr Phe Ala Thr Ala Pro Gly Pro Gly Pro Ser Ser Ser
        995                 1000                1005
Ile Ser Ser Phe Met Leu Asp Ser Arg Ala Pro Pro Ala Gln Leu Val
    1010                1015                1020
Glu Glu Glu Lys Glu Arg Asp Glu Ala Thr Glu Arg Glu Val Lys Arg
1025                1030                1035                1040
Leu Met Arg Glu Ala His Leu Trp Arg Val Ala Asn Thr Ala Gln Trp
                1045                1050                1055
Val Ala Trp Gly Ile Val Gln Ala Lys Val Pro Gly Met Asp Glu Ala
            1060                1065                1070
Leu Glu Asp Gln Lys Asn Pro Ser Pro Glu Ser Glu Arg Thr Ala Arg
        1075                1080                1085
Arg Gly Ser Ser Thr Ser Val Thr Leu Phe Gly Ser Asp Pro Leu Asp
    1090                1095                1100
Pro Asp Ile Ala Glu Asp Ile His Asn Arg Arg Pro Glu Arg Leu Asp
1105                1110                1115                1120
Ala Asp Ala Met Gly Gly Ala Thr Glu Val Pro Thr Lys Glu Glu Asp
                1125                1130                1135
Thr Asp Glu Phe Asp Tyr Leu Gly Tyr Ala His Glu Arg Ala Met Phe
            1140                1145                1150
Phe Trp Gly Asp Ala Leu Ser Leu Gly Leu Val Arg Lys Glu Asp Leu
        1155                1160                1165
Pro Val Asp Val Leu Arg Lys Val Lys Met Val Glu Tyr
    1170                1175                1180

<210> 17
<211> 3546
<212> DNA
<213> Umbilicaria pustulata

<220>
<223> CK6 wild type

<400> 17
atgtccttac atccagcatc atcatgggaa gcaccaatga tgatgctgaa ggaaggcgcg     60
ctgcctccac actcctcaca cccatccttc ggccacctca tcctactcgt gtttgaagca    120
gtgttggagg tagtatgtgt cagtcttccc ggatatatcg tggcccggca gggcttgttc    180
agtaccgaga tgcagaaatt cgttgccaat ctgaacgtca tgctcttcac accctgcctg    240
atcttcacga aactagcctc ccagttgtcg gcggaaaagc tcggagaact ggcagtcatc    300
ccagtcatct ttatagtgca gactctggta tcatacctgt gttcaatagg catgtcgagg    360
cttctgggat tacagaagcg gccacgaaac ttcgtcgttg ccatgggtgt ttttggaaac    420
tcaaactctc tcccgatctc attagtcata tctttgtcgc agactctcaa gggcctgcac    480
tgggataaag tacctggtga caacgacgac gaagtcgcag cccgaggcat tctttacctc    540
atgatcttcc aacaactggg ccaacttgtg aggtggagtt ggggttatca cgtgctacta    600
gcagcaccag agaagtacaa agtggaggat cattacacgg attctcagct cgaacaaggg    660
caaagtatat atcgggatga tccagatagt gaagacgagg ctctgcttcg caaccccctt    720
acagaaagcg accgggattc tatagacgtc agcaaggact ctagcgacac gaacgtcgag    780
ggccgcgacc caaacagtgc gagcgcaacg ggatcacaga caggcttcga gataggatcc    840
gggagacaaa ctcctgtgac gcaccagcag tatgcttcat ctgtttcgtc tcgacagctt    900
gagaacgcag gaccgacgga gtcagaacca tcatcaggac tactaccaac gcctaccaat    960
ggtaatgtaa agcctcattc tagtgaagtc gactacaccg gctctgtgca cgagatgcat   1020
agccctaccg ccccccacga cgaacatgtc ccagctggat tatggggcct accagtgcgg   1080
gcaatgctgg ccgtcaagcg gggcgttcag caaatgtcgg taacggtctc tacgttctca   1140
cgaagcatat atgagactct accagaatct gtacagagta tcatgatgaa gatctatatg   1200
ggccttcgga gattcgtcct cggcatctgg gagtttatga atcccccact gtgggccatg   1260
ctggcagcaa tcattgtcgc ctctgtcccc tccctgcagc atctattctt cagcgaaggc   1320
acgttcatca ggaattccgt gacccgagcc atctctcaga gtgggggcgt cgcggtccct   1380
ctcattctcg tcgtccttgg agccaatttg gcgaggaaca ctcttcctga agatcctcat   1440
cactcggtcg aagatgacag gatagaaaag aaacttctca ttgcttctct agttagccgg   1500
atgctcctgc cgatcatcgt gatggctccc ctattagccg tcactgccaa gtacgtgccc   1560
gtgagcatct tggatgaccc tatcttcgtc atcgtctgct tcctcctcac gggcgcgcct   1620
agcgcgctac agttggcaca aatctgccag atcaatggag tatacatggg ggctatgtcg   1680
aaactactct tccaaagcta tgttgtgtgg atcctcccat caacactcat ccttgtcatg   1740
gagactaatt tgtctcggcg ggcgtcatcg attcgaaagc tgaggaaggg ttcaacagct   1800
gccggatctt cggacacgga gtatcatgag ggcgatgccc ttgtcccgac ggccgaagtg   1860
ttcctagata actcgaagac actaagttac ttcggtggca ccgccgagtc agaaccagat   1920
ctcagcaatt cgagtaagcg cgcggccaaa gaagggtggc tgatcttcaa gaacgagata   1980
ctgcgtttga cacacacctt gaggctgaaa ggttggagga tgctaccact cgaacgtggc   2040
ggagacattg atgtggaacg gctcagtggg gcactgacaa acgcagttta cgtggtgtcg   2100
ccacccaagg atttagagca gaccccgtct gacaaagatg gcggtacggc gccgctcgca   2160
cctaagaagc caccaccgaa gcttctcctc cgtatctatg gcccccaggt ggaacatctc   2220
atcgaccggg aaaacgagct gcaaatcctc cgccgtctgg ctcgaaagaa gattggcccg   2280
cgcttactag gcacctttgt caacggtaga tttgaagagt ttttccatgc gcgcacactt   2340
accgcgcatg acttgcgtgt acccgcgact tcaaagcaga ttgcaaagcg tatgcgtgaa   2400
ttacatgagg gcattgagtt acttgaggag gaaagagcag cgggcccatt tgtctggaga   2460
aactgggata agtgggtgga caggtgtgaa gaggtcattt cctggctcga caaccagatc   2520
ctttcgggta accaatcttc cgcaagaact cccacagaga tgtggaagca gcgtggccta   2580
atctgcggtg tcgagtggtc cgtttttcga gagacggtca atcgatatcg cgcgtggctt   2640
gataaacaat atggcggcag ggatgctctc aggcaacagc tcgtgttcgc acataatgat   2700
acgcaatatg gtaacctcct ccgtctggaa cctagcggag aatccccgct cctgctaccg   2760
gtcaacgagc ataagcaatt aattgttatc gatttcgagt acgcatcggc taatcttcct   2820
ggcctggagt ttgccaacca ctttactgaa tggtgctaca actatcatga cgcagaaaag   2880
ccctattccc tcacagaagc gaactaccca acaaaggatg aacagcgcac atttttgaaa   2940
gcttacgtcc agcatcgtcc ccaattccac gcacgcgctt cggccacatt tgccacagcg   3000
ccgggtcctg gtccgtcaag ctcaatttct tctttcatgc tcgacagtcg agctccacca   3060
gctcaactag tcgaagagga aaaggagagg gacgaggcca ccgagcgaga agtaaagcga   3120
ctgatgcgtg aggcacacct gtggagggtg gccaacactg cgcaatgggt tgcgtggggc   3180
attgtacagg ccaaagtgcc cgggatggat gaagcgcttg aagaccaaaa gaacccctcc   3240
cctgaatcgg aacgtacagc aaggcgcggt agtagtactt cggtcaccct tttcggctcc   3300
gatccgctcg accccgacat tgccgaggat atccacaaca gaagaccgga aaggctggac   3360
gccgacgcca tgggtggcgc aactgaggtg cccaccaagg aagaggacac cgatgagttt   3420
gattatctgg gctatgctca tgagcgggcg atgttcttct ggggtgatgc tctctctctt   3480
gggctcgtga ggaaggagga ccttccggtg gacgttctgc ggaaggtaaa gatggttgag   3540
tattag                                                              3546

<210> 18
<211> 3546
<212> DNA
<213> Artificial Sequence

<220>
<223> CK6 codon optimised

<400> 18
atgagtctgc atccggccag cagttgggaa gcaccgatga tgatgctgaa agaaggcgca     60
ctgccgccgc atagcagtca tccgagtttt ggtcatctga ttctgctggt ttttgaagca    120
gtgctggaag tggtttgtgt tagcctgccg ggttatattg ttgcccgcca gggtctgttt    180
agtaccgaaa tgcagaaatt tgttgcaaat ctgaatgtga tgctgtttac cccgtgcctg    240
atttttacca aactggccag ccagctgagc gcagaaaaac tgggcgaact ggcagtgatt    300
ccggttattt ttattgtgca gaccctggtt agctatctgt gtagcattgg tatgagtcgt    360
ctgctgggcc tgcaaaaacg cccgcgcaat tttgttgttg caatgggtgt ttttggtaat    420
agtaatagcc tgccgattag cctggttatt agtctgagtc agaccctgaa aggcctgcat    480
tgggataaag ttccgggcga taatgatgat gaagttgcag cacgtggcat tctgtatctg    540
atgatttttc agcagctggg tcagctggtt cgttggagtt ggggctatca tgttctgctg    600
gcagcaccgg aaaaatataa agtggaagat cattataccg acagccagct ggaacagggt    660
cagagcatct atcgcgatga tccggatagt gaagatgaag cactgctgcg caatccgctg    720
accgaaagcg atcgcgatag tattgatgtg agcaaagata gcagcgatac caatgttgaa    780
ggccgcgatc cgaatagcgc aagtgcaacc ggtagtcaga ccggttttga aattggcagc    840
ggtcgtcaga ccccggttac ccatcagcag tatgccagca gcgttagcag tcgtcagctg    900
gaaaatgccg gtccgaccga aagtgaaccg agtagtggcc tgctgccgac cccgaccaat    960
ggcaatgtga aaccgcatag ctcagaagtt gattataccg gtagcgtgca tgaaatgcat   1020
agtccgaccg ccccgcatga tgaacatgtg ccggcaggtc tgtggggcct gccggttcgt   1080
gcaatgctgg ccgtgaaacg tggtgtgcag cagatgagcg tgaccgtgag tacctttagc   1140
cgcagtatct atgaaaccct gccggaaagt gttcagagca ttatgatgaa aatctatatg   1200
ggtctgcgcc gctttgttct gggtatttgg gagtttatga atccgccgct gtgggcaatg   1260
ctggcagcca ttattgtggc aagtgtgccg agcctgcaac atctgttttt cagcgaaggt   1320
acctttattc gtaatagtgt tacccgcgca attagccaga gtggtggcgt tgccgttccg   1380
ctgattctgg ttgtgctggg tgccaatctg gcccgtaata ccctgccgga ggaccctcat   1440
catagcgttg aagatgatcg tattgaaaag aaactgctga ttgccagcct ggtgagccgc   1500
atgttactgc cgattattgt tatggcaccg ctgctggccg tgaccgcaaa atatgttccg   1560
gttagcattc tggatgatcc gatttttgtg attgtttgtt ttctgctgac cggcgcaccg   1620
agcgccctgc aactggcaca gatttgtcag attaatggcg tttatatggg tgccatgagc   1680
aaactgctgt ttcagagcta tgttgtgtgg attctgccga gcaccctgat tctggttatg   1740
gaaaccaatc tgagtcgtcg tgcaagcagt attcgtaaac tgcgtaaagg tagcaccgcc   1800
gcaggtagca gcgatacaga atatcatgaa ggtgacgcac tggtgccgac cgccgaagtt   1860
tttctggata atagcaaaac cctgagctat tttggtggta ccgccgaaag tgaacctgat   1920
ctgagcaata gcagtaaacg cgcagccaaa gaaggctggc tgatttttaa aaatgaaatt   1980
ctgcgcctga cccataccct gcgtctgaaa ggttggcgca tgttaccgct ggaacgcggt   2040
ggcgatattg atgttgaacg tctgagcggt gcactgacca atgccgttta tgttgtgagc   2100
ccgccgaaag atctggaaca gaccccgagc gataaagatg gcggtaccgc accgctggca   2160
ccgaaaaaac cgccgccgaa actgctgctg cgcatctatg gtccgcaggt ggaacatctg   2220
attgatcgtg aaaatgaact gcaaattctg cgtcgcctgg cacgcaaaaa gattggtccg   2280
cgtctgctgg gtacctttgt taatggccgt tttgaagaat ttttccatgc ccgtaccctg   2340
accgcccatg atctgcgcgt tccggcaacc agcaaacaga ttgcaaaacg tatgcgtgaa   2400
ctgcatgaag gcattgaact gctggaagaa gaacgtgcag ccggcccgtt tgtgtggcgc   2460
aattgggata aatgggtgga tcgttgcgaa gaagttatta gctggctgga taatcagatt   2520
ctgagcggca atcagagcag tgcccgcacc ccgaccgaaa tgtggaaaca gcgcggtctg   2580
atttgcggcg ttgaatggag cgtttttcgc gaaaccgtga atcgctatcg cgcctggctg   2640
gataaacagt atggcggccg tgatgccctg cgccagcagt tagtgtttgc acataatgat   2700
acccagtatg gtaatctgct gcgtctggaa ccgagcggtg aaagcccgct gctgctgccg   2760
gttaatgaac ataaacagct gattgtgatt gattttgaat acgcaagcgc caatctgccg   2820
ggtctggaat ttgcaaatca ttttaccgaa tggtgttata attaccatga tgcagaaaaa   2880
ccgtatagtc tgaccgaagc aaattatccg accaaagatg aacagcgtac ctttctgaaa   2940
gcatacgttc agcatcgccc gcagtttcat gcccgcgcca gcgcaacctt tgcaaccgcc   3000
cctggtccgg gcccgagcag cagcattagc agttttatgc tggatagtcg tgccccgccg   3060
gcccagctgg tggaagaaga aaaagaacgc gatgaagcaa ccgaacgtga agttaaacgt   3120
ctgatgcgcg aagcacatct gtggcgcgtt gcaaataccg cacagtgggt tgcctggggc   3180
attgttcagg ccaaagtgcc gggcatggat gaagcattag aagatcagaa aaatccgagt   3240
ccggaaagcg aacgtaccgc acgccgtggt agtagcacca gcgttaccct gtttggcagt   3300
gatccgctgg accctgatat tgcagaagat attcataatc gtcgtccgga acgtctggat   3360
gcagatgcaa tgggtggtgc aaccgaagtg ccgaccaaag aagaagatac cgatgaattt   3420
gattacctgg gttatgccca tgaacgcgcc atgtttttct ggggtgacgc attaagtctg   3480
ggtctggtgc gcaaagaaga tctgccggtt gatgttctgc gtaaagttaa aatggttgaa   3540
tattga                                                              3546

<210> 19
<211> 534
<212> PRT
<213> Saccharomyces cerevisiae

<220>
<223> CK7

<400> 19
Met Tyr Thr Asn Tyr Ser Leu Thr Ser Ser Asp Ala Met Pro Arg Thr
1               5                   10                  15
Tyr Leu Val Gly Thr Ala Ser Pro Glu Met Ser Lys Lys Lys Arg Gln
            20                  25                  30
Ser Ala Asn Cys Asp Lys Pro Thr Arg Arg Val Ile His Ile Ile Asp
        35                  40                  45
Thr Asn Glu His Ser Glu Val Asp Leu Lys Asn Glu Leu Pro Ile Thr
    50                  55                  60
Cys Thr Asn Glu Asp Gly Glu Met Thr Ser Ser Ser Trp Thr Ser Gln
65                  70                  75                  80
Thr Ala Asn Asp Phe Leu Lys Leu Ala Tyr Val Asn Ala Lys Leu Asp
                85                  90                  95
Pro Ser Leu Pro Ser Gln Tyr Phe Lys Gln Asp Ile Ile Asn Val Leu
            100                 105                 110
Gln Ser Leu Glu Ile Pro Gly Trp Ser Val Pro Gly Ser Lys Glu Ser
        115                 120                 125
Ser Leu Asn Lys Asn Leu Leu Thr Leu Thr Gln Ile Lys Gly Ala Leu
    130                 135                 140
Thr Asn Val Ile Tyr Lys Ile Arg Tyr Pro Asn Leu Pro Pro Leu Leu
145                 150                 155                 160
Met Arg Ile Phe Gly Asp Ser Ile Asp Ser Val Ile Asp Arg Glu Tyr
                165                 170                 175
Glu Leu Lys Val Ile Ala Arg Leu Ser Phe Tyr Asp Leu Gly Pro Lys
            180                 185                 190
Leu Glu Gly Phe Phe Glu Asn Gly Arg Phe Glu Lys Tyr Ile Glu Gly
        195                 200                 205
Ser Arg Thr Ser Thr Gln Ala Asp Phe Ile Asp Arg Asp Thr Ser Ile
    210                 215                 220
Lys Ile Ala Lys Lys Leu Lys Glu Leu His Cys Thr Val Pro Leu Thr
225                 230                 235                 240
His Lys Glu Ile Thr Asp Gln Pro Ser Cys Trp Thr Thr Phe Asp Gln
                245                 250                 255
Trp Ile Lys Leu Ile Asp Ser His Lys Glu Trp Val Ser Asn Asn Val
            260                 265                 270
Asn Ile Ser Glu Asn Leu Arg Cys Ser Ser Trp Asn Phe Phe Leu Lys
        275                 280                 285
Ser Phe Lys Asn Tyr Lys Arg Trp Leu Tyr Asn Asp Ser Ala Phe Thr
    290                 295                 300
Ser Lys Leu Leu Arg Glu Asp Asp Lys Asp Ser Met Ile Asn Ser Gly
305                 310                 315                 320
Leu Lys Met Val Phe Cys His Asn Asp Leu Gln His Gly Asn Leu Leu
                325                 330                 335
Phe Lys Ser Lys Gly Lys Asp Asp Ile Ser Val Gly Asp Leu Thr Ile
            340                 345                 350
Ile Asp Phe Glu Tyr Ala Gly Pro Asn Pro Val Val Phe Asp Leu Ser
        355                 360                 365
Asn His Leu Asn Glu Trp Met Gln Asp Tyr Asn Asp Val Gln Ser Phe
    370                 375                 380
Lys Ser His Ile Asp Lys Tyr Pro Lys Glu Glu Asp Ile Leu Val Phe
385                 390                 395                 400
Ala Gln Ser Tyr Ile Asn His Met Asn Glu Asn His Val Lys Ile Ala
                405                 410                 415
Ser Gln Glu Val Arg Ile Leu Tyr Asn Leu Ile Ile Glu Trp Arg Pro
            420                 425                 430
Cys Thr Gln Leu Phe Trp Cys Leu Trp Ala Leu Leu Gln Ser Gly Arg
        435                 440                 445
Leu Pro Gln Arg Pro Leu Ile Glu Gly Glu Lys Leu Met Ser Glu Lys
    450                 455                 460
Ala Gly Leu Gly Asp Glu Thr His Leu Met Glu His Lys Asn Lys Glu
465                 470                 475                 480
Asn Gly Lys Tyr Asp Cys Ser Glu Asp Asp Ser Phe Asn Tyr Leu Gly
                485                 490                 495
Phe Cys Lys Glu Lys Met Ser Val Phe Trp Gly Asp Leu Ile Thr Leu
            500                 505                 510
Gly Val Ile Asp Lys Asp Cys Pro Asp Ile Gly Lys Thr His Tyr Leu
        515                 520                 525
Asp Thr Lys Leu Ile Phe
    530

<210> 20
<211> 1605
<212> DNA
<213> Saccharomyces cerevisiae

<220>
<223> CK7 wild type

<400> 20
atgtacacca attattcact tacaagcagt gacgcaatgc cccgaactta cttggtcggc     60
acagcctcac cagaaatgtc taagaaaaag cgccagtctg caaattgtga caagccaacg    120
aggagggtga tccacatcat cgatactaat gagcactcag aagtcgactt gaaaaatgaa    180
ctgccaataa catgcacgaa tgaagacggt gagatgacat caagctcatg gacttcacaa    240
accgctaatg attttctcaa actggcatac gtaaacgcga aattagaccc gtctttgcca    300
tctcaatatt tcaaacaaga tatcataaat gttttacaaa gcttggagat tcctggatgg    360
tctgtgcccg gctctaaaga atcctcgttg aataagaatt tattaacttt gacacaaatc    420
aagggggccc ttacaaatgt tatttataag attcgctatc caaacttacc ccctttactg    480
atgagaattt ttggtgatag catagattct gtaattgata gagagtatga attaaaggtt    540
attgcgagat tatcatttta tgatttagga cccaaactag aagggttttt tgaaaatggc    600
agatttgaaa aatacattga gggttcgagg acatctactc aagccgactt tatagatcgg    660
gatacttcaa taaaaattgc taaaaaattg aaagagttgc attgtactgt tccattaacg    720
cataaagaaa taacggatca accgtcatgc tggacaacct ttgatcagtg gatcaaatta    780
atagactcgc ataaagagtg ggtttccaat aatgtaaata taagtgaaaa tttacgctgt    840
tcaagttgga attttttttt aaagagtttc aaaaattata agcgttggtt gtataatgat    900
tctgctttta cctcaaaatt attaagagag gatgataaag atagtatgat aaattctggg    960
ctaaaaatgg tattttgcca taacgactta cagcatggta atttactttt taaaagtaag   1020
ggtaaggatg acatctcagt gggcgattta acaattattg actttgagta cgcaggccct   1080
aaccccgttg tatttgattt atcaaatcat ttgaatgaat ggatgcaaga ctataatgat   1140
gtgcagtctt tcaagtctca tattgataag tatccaaaag aagaagatat tttagttttt   1200
gcacaaagtt atataaatca catgaatgag aaccatgtca aaattgcttc tcaagaggtt   1260
aggattcttt acaatctaat catcgaatgg aggccttgta cacaattatt ctggtgcctt   1320
tgggctcttt tgcaaagcgg aaggctacca caacgaccac tgatagaagg tgaaaaacta   1380
atgagcgaaa aggctggact aggtgatgaa acgcacctta tggagcacaa aaataaagag   1440
aatggaaaat acgattgttc tgaagatgat tcgttcaact atttaggatt ttgtaaagaa   1500
aaaatgtctg tcttttgggg cgatttaatt acattgggag ttatcgacaa agactgccca   1560
gatattggaa aaacacatta cttagacact aaacttattt tttaa                   1605

<210> 21
<211> 1605
<212> DNA
<213> Artificial Sequence

<220>
<223> CK7 codon optimised

<400> 21
atgtacacca attacagtct gaccagtagt gatgccatgc cgcgcaccta tctggttggc     60
accgcaagtc cggaaatgag caaaaaaaaa cgtcagagcg ccaattgtga taaaccgacc    120
cgccgtgtta ttcatattat tgataccaat gagcacagtg aagtggatct gaaaaatgaa    180
ctgccgatta cctgcaccaa tgaagatggt gaaatgacca gcagtagttg gaccagccag    240
accgcaaatg acttcctgaa actggcatac gttaatgcca aactggaccc tagcctgccg    300
agtcagtact tcaaacagga tattattaac gtgctgcaaa gcctggaaat tccgggctgg    360
agcgttccgg gcagcaaaga aagcagcctg aataaaaatc tgctgaccct gacccagatt    420
aaaggtgccc tgaccaatgt tatctataaa attcgctatc cgaacctgcc gccgctgctg    480
atgcgtatct tcggtgatag tattgatagt gtgattgatc gcgaatatga actgaaagtg    540
attgcccgtc tgagcttcta tgatctgggc ccgaaactgg aaggcttctt cgaaaatggc    600
cgcttcgaaa aatatattga aggtagtcgc accagtaccc aggccgactt cattgatcgc    660
gataccagta ttaaaattgc aaaaaaactg aaggagctgc attgtaccgt tccgctgacc    720
cataaagaaa ttaccgatca gccgagctgc tggaccacct tcgatcagtg gattaaactg    780
attgatagcc ataaagaatg ggttagtaat aatgttaaca tcagtgaaaa cctgcgttgt    840
agcagttgga acttcttcct gaaatcattc aaaaattaca agcgttggct gtataatgat    900
agtgcattca ccagtaaact gctgcgcgaa gatgataaag atagcatgat taatagcggc    960
ctgaaaatgg tgttctgcca taatgatctg caacatggca atctgctgtt caaaagcaaa   1020
ggcaaagatg atattagcgt gggtgatctg accattattg acttcgaata tgccggcccg   1080
aatccggttg tgttcgatct gagtaatcat ctgaatgaat ggatgcagga ttataatgat   1140
gtgcagagct tcaaaagcca tattgataaa tatccgaagg aagaagatat tctggtgttc   1200
gcacagagct atattaatca catgaatgaa aaccacgtga aaattgccag tcaggaagtt   1260
cgtattctgt ataatctgat tatcgaatgg cgcccgtgta cccagctgtt ctggtgcctg   1320
tgggcactgc tgcaaagcgg ccgtctgccg cagcgtccgc tgattgaagg cgaaaaactg   1380
atgagtgaaa aagccggtct gggtgatgaa acacatctga tggaacataa aaataaggaa   1440
aacggcaaat acgattgtag tgaagatgat agcttcaatt atctgggctt ctgcaaagaa   1500
aaaatgagcg tgttctgggg tgatctgatt accctgggtg ttattgataa agattgtccg   1560
gatattggta aaacacatta tctggatacc aaactgatct tctga                   1605

<210> 22
<211> 439
<212> PRT
<213> Diaporthe ampelina

<220>
<223> CK8

<400> 22
Met Ile His Gln Asn Gly Asp Asn Ser Pro Arg Ala Gly Ala Gly Ser
1               5                   10                  15
Asp Gln His Pro Pro Val Arg Phe Ile Ser Gln Ala Tyr Asp Gly Ser
            20                  25                  30
Asp Ser Glu Asn Ser Ala Arg Gln Leu Ile Leu Ala Leu Arg Pro Glu
        35                  40                  45
Trp Ser Ala Asp Asp Ser Asn Ile Glu Phe Val Arg Phe Thr Asp Gly
    50                  55                  60
Ile Thr Asn Thr Leu Leu Lys Ala Val Tyr Lys Arg Pro Gly Leu Ser
65                  70                  75                  80
Lys Asp Asp Val Asp Arg Glu Ala Ile Leu Leu Arg Ala Tyr Gly Pro
                85                  90                  95
Gly Thr Asp Leu Ile Ile Asp Arg Ala Arg Glu Thr Gln Asn His Glu
            100                 105                 110
Leu Leu Ser Gln His Gly Leu Ala Pro Glu Leu Leu Ala Arg Phe Asn
        115                 120                 125
Asn Gly Met Met Tyr Arg Phe Ile Arg Gly Ser Val Thr His Pro Glu
    130                 135                 140
Asp Leu Arg Arg Pro Glu Ile Tyr Leu Ala Val Ala Arg Arg Leu Ala
145                 150                 155                 160
Gln Trp His Ala Thr Val Pro Cys Leu Pro Gly Lys Thr His Ile Ser
                165                 170                 175
Asp Lys Met Asp Val Arg Cys Leu Asp Ala Leu Asn Gly Ala Ala Lys
            180                 185                 190
Lys His Thr Thr Leu Gln Glu Ala Val Asp Ala Ala Ala Pro Gly Lys
        195                 200                 205
Gln Ala Pro Asn Val Trp Thr Val Met Gln Lys Trp Ile Phe Ala Leu
    210                 215                 220
Pro Thr Lys Thr Ala Ala Gln Arg Glu Arg Gln Gln Leu Leu Gln Val
225                 230                 235                 240
Glu Leu Ser Lys Leu Val Ser Glu Leu Ser His Arg Pro Gly Leu Gly
                245                 250                 255
Lys Asp Gly Leu Val Phe Ala His Cys Asp Leu Leu Ser Gly Asn Val
            260                 265                 270
Ile Val Leu Pro Lys Gly Ser Asn Gln Asp Ser Gly Arg Ile Ser Asn
        275                 280                 285
Gly Thr Ala Asp Ser Ala Gly Glu Thr Val Thr Phe Ile Asp Tyr Glu
    290                 295                 300
Tyr Ala Val Pro Ser Pro Ala Ala Phe Asp Leu Cys Asn His Phe Ala
305                 310                 315                 320
Glu Trp Gly Gly Phe Asp Cys Asp Tyr Asn Val Leu Pro Thr Lys Ser
                325                 330                 335
Gln Arg Arg Glu Phe Ile Thr Glu Phe Val Arg Ser Tyr Phe Ser Leu
            340                 345                 350
Leu Pro Gly Gln Pro Glu His Asp Glu Ala Ser Glu Ile Gln Lys Leu
        355                 360                 365
Ala Asp Glu Val Asp Leu Tyr Arg Gly Val Pro Gly Leu Tyr Trp Gly
    370                 375                 380
Ile Trp Ala Leu Ile Gln Ala Thr Ile Ser Asp Ile Asp Phe Asp Tyr
385                 390                 395                 400
Ala Ser Tyr Ala Glu Thr Arg Leu Gly Glu Tyr Trp Ala Trp Lys Ala
                405                 410                 415
Glu Val Asp Gly Ser Arg Ile Ala Glu Asp Lys Glu Leu Pro Leu Arg
            420                 425                 430
Glu Arg Arg Trp Ala Glu Gln
        435

<210> 23
<211> 1320
<212> DNA
<213> Diaporthe ampelina

<220>
<223> CK8 wild type

<400> 23
atgatacacc aaaacggcga caacagccct cgcgccggtg ccgggtctga ccagcaccct     60
ccggtccgct ttatttccca ggcctacgac ggctccgact ccgaaaactc ggcccgccag    120
ctcatcctgg ccctgcgccc ggaatggtct gcagacgact ccaacataga gtttgtgcgc    180
ttcaccgatg gcatcacaaa caccctgctc aaggccgtct acaagcgccc cggcctgtcc    240
aaggacgacg tcgaccggga ggccatcctg ctgcgtgcat acggccccgg caccgacctg    300
atcatcgacc gcgcccgtga gacacaaaac cacgagttgc tgtcacagca tggcctggca    360
ccagagcttc tggcgcggtt taacaacggc atgatgtaca ggttcatcag gggtagtgtc    420
acgcatcctg aggacctgcg gaggcccgag atatacctgg ccgtggcgag acggctggct    480
caatggcatg ccacggtgcc ctgtctcccg ggcaagacgc acatcagcga caagatggac    540
gtgcgctgtc ttgatgcgct caacggggct gcaaagaagc acacaactct acaggaggcc    600
gtagatgcag ctgcccctgg gaagcaggcc cccaacgtgt ggaccgtcat gcagaagtgg    660
atctttgccc tgcccaccaa gacggctgca cagagggagc ggcaacaact gctgcaagtc    720
gagctgtcca agttggtgtc cgagctgagc caccggccag gcctgggaaa ggacggtctg    780
gtcttcgcgc attgcgacct tctcagcgga aatgttattg tactgcccaa gggcagcaac    840
caagactctg gcagaatcag caacggcacc gcagacagtg ccggcgagac cgtgaccttt    900
atcgactatg aatacgccgt gccctctccg gcagcattcg acctgtgcaa tcactttgcc    960
gaatggggcg gtttcgactg tgattacaat gtgctgccca ccaagtccca acggcgcgag   1020
ttcataaccg agttcgtccg gtcgtacttc agcctcctcc ctgggcaacc cgagcacgac   1080
gaggcgtctg agatccagaa actggccgac gaggtcgact tgtaccgcgg agtcccggga   1140
ttgtactggg gcatttgggc gctcatccag gcgaccatct ctgacatcga cttcgactac   1200
gcatcgtacg ccgagaccag gctaggcgag tactgggctt ggaaggccga ggttgacggg   1260
agccgcattg ccgaggacaa ggagctgccg ctgcgggaac ggcggtgggc tgagcagtga   1320

<210> 24
<211> 1320
<212> DNA
<213> Artificial Sequence

<220>
<223> CK8 codon optimised

<400> 24
atgatccatc agaatggcga taatagcccg cgcgccggcg caggtagcga tcaacatccg     60
ccggttcgct tcattagtca ggcctatgat ggcagtgata gcgaaaatag tgcacgtcag    120
ctgattctgg cactgcgccc ggaatggagt gcagatgata gcaatattga gttcgtgcgc    180
ttcaccgatg gtattaccaa taccctgctg aaagccgtgt ataaacgccc tggtctgagc    240
aaagatgatg tggatcgtga agccattctg ctgcgtgcat acggtccggg caccgatctg    300
attattgatc gcgcccgcga aacacaaaat catgaactgc tgagtcagca tggtctggcc    360
ccggaactgc tggcacgctt caataatggc atgatgtatc gcttcattcg cggcagcgtg    420
acccatccgg aagatctgcg tcgtccggaa atctatctgg ccgtggcacg ccgtctggca    480
cagtggcacg ctaccgttcc gtgtctgccg ggtaaaacac atattagtga taaaatggac    540
gtgcgctgtc tggatgccct gaatggcgcc gcaaaaaaac ataccaccct gcaagaagcc    600
gtggatgccg ccgcaccggg caaacaggcc cctaatgtgt ggaccgtgat gcagaaatgg    660
atcttcgcac tgccgaccaa aaccgccgca cagcgtgaac gtcagcagct gctgcaagtt    720
gaactgagta aactggtgag cgaactgagc catcgtccgg gtctgggcaa agatggcctg    780
gtgttcgccc attgtgatct gctgagtggc aatgtgattg tgctgccgaa aggtagtaat    840
caggatagtg gccgcattag caatggtacc gccgatagtg caggtgaaac cgtgaccttc    900
attgattatg aatatgccgt gccgagcccg gcagcattcg atctgtgcaa tcacttcgca    960
gaatggggtg gcttcgattg tgattataat gtgctgccga ccaagagcca gcgccgcgag   1020
ttcattaccg agttcgtgcg tagctacttc agcctgctgc cgggtcagcc ggaacatgat   1080
gaagcaagtg aaattcagaa actggcagat gaagttgatc tgtatcgcgg tgtgccgggc   1140
ctgtattggg gtatctgggc actgattcag gccaccatta gcgatattga cttcgattat   1200
gccagctatg cagaaaccag actgggcgaa tattgggcct ggaaagcaga agtggatggt   1260
agtcgtattg ccgaagataa agaactgccg ctgcgcgaac gtcgttgggc cgaacagtga   1320

<210> 25
<211> 1374
<212> PRT
<213> Bipolaris sorokiniana

<220>
<223> CK9

<400> 25
Met Ser Phe Phe Ser Ala Pro Thr Ala Leu Arg Leu Arg Ser Leu Ser
1               5                   10                  15
Pro Ser Ser Ser Pro Phe Val Ala Leu Ala Ser Gln Ser Gln Asp Ser
            20                  25                  30
Ile Thr Gly Thr Met Phe Thr Ser Leu Glu Asn Ala His Pro Ser His
        35                  40                  45
Pro Asp Phe Ala Asn Leu Ala Leu Leu Val Phe Glu Ala Val Met Glu
    50                  55                  60
Val Val Cys Val Ser Ala Pro Gly Tyr Val Val Ala Arg Met Gly Gln
65                  70                  75                  80
Phe Asp Ala Glu Ser Gln Lys Phe Leu Ala Asn Leu Asn Thr Gln Leu
                85                  90                  95
Phe Thr Pro Phe Phe Thr Lys Leu Ala Ser Gln Leu Thr Ala Glu Lys
            100                 105                 110
Leu Ala Glu Leu Ala Val Ile Pro Val Ile Phe Val Val Gln Thr Leu
        115                 120                 125
Ile Ser Tyr Ile Ala Ala Leu Ala Val Ser Arg Ile Phe Lys Phe Asn
    130                 135                 140
Lys Arg Ala Ser Asn Phe Val Val Ala Met Ala Val Phe Gly Asn Ser
145                 150                 155                 160
Asn Ser Leu Pro Ile Ser Leu Val Ile Ser Leu Ser Lys Thr Leu Arg
                165                 170                 175
Gly Leu His Trp Asp Arg Ile Pro Gly Asp Asn Asp Asn Glu Val Gly
            180                 185                 190
Ala Arg Gly Ile Leu Tyr Leu Leu Ile Phe Gln Gln Leu Gly Gln Leu
        195                 200                 205
Val Arg Trp Thr Trp Gly Phe Asn Val Leu Leu Ala Pro Ala Ser Ala
    210                 215                 220
Tyr Lys Asp Asp Glu Gly Arg Asn His Ala Leu Glu Ser Gly Glu Tyr
225                 230                 235                 240
Ser Asp Asp Glu Thr Gln Arg Leu Leu Asp Asp Ser His Ser Asp Tyr
                245                 250                 255
Glu Ser Gly Asn Val Thr Ser Tyr Ala Thr Ser Ala Asp Cys Ser Asp
            260                 265                 270
Ser Asp Ser Asp Ser Ile Phe Asn Arg Gly Gln Ala Gln Ala Ala Ala
        275                 280                 285
Leu Phe Ile Thr Pro Thr Asn Gly Asn Ala Thr Val Pro Gly Ala Gly
    290                 295                 300
Asp Met Ser Gly Ser Pro Lys Gly Thr Phe Ala Asn Gly His Leu Asn
305                 310                 315                 320
Gly Val Leu Ala Ala His Lys Lys Gln Gln Asp Thr Pro Lys Gly Ile
                325                 330                 335
Lys Gly Val Pro Thr Arg Ala Arg Leu Ala Leu Gln Arg Ser Ala Thr
            340                 345                 350
Ser Val Ser Val Ser Thr Thr Arg Ala Gly Asn Arg Ile Phe Asn Ser
        355                 360                 365
Leu Pro Lys Trp Leu Gln Gly Pro Leu Ser Lys Ile Gly Ser Gly Leu
    370                 375                 380
Ser Arg Phe Ala Lys Gly Val Trp Asp Phe Met Asn Pro Pro Leu Trp
385                 390                 395                 400
Ala Met Leu Ile Ala Ile Phe Val Ala Ser Ile Pro Pro Leu Gln Arg
                405                 410                 415
Leu Phe Phe Thr Pro Gly Thr Phe Ile Asn Thr Ser Val Thr Arg Ala
            420                 425                 430
Val Asn Gln Ser Gly Gln Val Ala Val Pro Leu Ile Leu Val Val Leu
        435                 440                 445
Gly Ala Asn Leu Ala Arg Asn Thr Leu Pro Lys Glu Asp Gln Asn Ser
    450                 455                 460
Ile Glu Asp Pro Ser Val Glu Arg Lys Leu Val Ile Ala Ser Leu Ile
465                 470                 475                 480
Ser Arg Met Leu Ile Pro Thr Leu Leu Met Ala Pro Met Leu Ala Leu
                485                 490                 495
Thr Ala Lys Tyr Val Pro Val Ser Ile Leu Asp Asp Pro Ile Phe Ile
            500                 505                 510
Ile Val Cys Phe Leu Leu Ser Gly Ala Pro Ser Ala Leu Gln Leu Ala
        515                 520                 525
Gln Ile Cys Gln Ile Asn Asn Val Tyr Met Gly Ala Met Ser Arg Ile
    530                 535                 540
Leu Phe Gln Ser Tyr Val Gln Thr Met Ala Thr Ser Ser Asn Trp Gln
545                 550                 555                 560
Ala Gln Gln Pro Ala Gly Pro Gln Ser Ser Asp Ser Leu Ala Leu Ser
                565                 570                 575
Arg Asn Asn Thr Leu Asp Asp Gly Ala Ser Pro Arg Ser Ser Lys Ala
            580                 585                 590
Val Ser Phe Pro Asp Asp Ser Thr Ile Ser Pro Leu Ile Ile Gly Lys
        595                 600                 605
Asn Lys Glu Leu Asp Gln Lys Asp Tyr Leu Asp Leu Asp Lys Pro Pro
    610                 615                 620
Arg His Phe Pro Ala Ser Val Ser Lys Lys Arg Leu Ser Gly Arg Pro
625                 630                 635                 640
Ser Tyr Glu Arg Glu Gly Ser Ser Lys Ser Gly Ala Ala Asp Ala Ser
                645                 650                 655
Thr Ala Leu Thr Ser Leu Leu Pro Glu Gly Ser Ile Asp Ala Ser Ser
            660                 665                 670
His Ser His Gln Ala His Glu Asn Leu Leu Lys Gln Val Gly Thr Trp
        675                 680                 685
Leu Lys Gln Glu Arg Ser Arg Arg His Ala Arg Arg Ala Arg Arg Lys
    690                 695                 700
Ala Ala Arg Ala Ser Thr Val Asp His Glu Ser Glu Ser Ala Ala Ala
705                 710                 715                 720
Glu Ala Leu Glu Lys Ser Ile Ser His His Arg Ser Asp Ser Asp Ser
                725                 730                 735
Ser His Gly Glu Asp Ala Leu Thr Gln Leu Ala Gln Ile Leu Glu Lys
            740                 745                 750
Asn Met Thr Leu Lys Leu Ser Glu Ala Lys Lys Arg His His Leu His
        755                 760                 765
Arg Ser Ser Ile Gly Leu Lys Arg His Ser Ala Ile Ser Leu Asp Ser
    770                 775                 780
Asp Tyr Phe Glu Ser Val Asp Gln Leu Val Pro Ser Cys Glu Ala Thr
785                 790                 795                 800
Leu Asp Asn Ser Lys Thr Met Ala Tyr Asn Val Asp Glu Pro Gly Ala
                805                 810                 815
Glu Ser Asn Leu Asp Val Ala Asp Lys Glu Lys Glu Ala Trp Ser Lys
            820                 825                 830
Phe Arg Ala Glu Ile Leu Arg Leu Thr His Thr Leu Lys Leu Lys Gly
        835                 840                 845
Trp Arg Lys Val Pro Ser Glu Leu Ser Asn Glu Ile Ser Val Gln Arg
    850                 855                 860
Leu Ser Gly Ala Leu Thr Asn Ala Val Tyr Val Val Ser Pro Pro Lys
865                 870                 875                 880
Asn Leu Pro Val Pro Glu Gln Ser Glu Asp Gly Pro Pro Lys Pro Arg
                885                 890                 895
Asn Pro Pro Pro Lys Leu Leu Leu Arg Ile Tyr Gly Pro Gln Val Glu
            900                 905                 910
His Leu Ile Asp Arg Glu Ser Glu Leu Gln Ile Leu Thr Arg Leu Ala
        915                 920                 925
Arg Lys Arg Ile Gly Pro Arg Leu Leu Gly Thr Phe Gly Asn Gly Arg
    930                 935                 940
Phe Glu Glu Phe Leu His Ala Gln Pro Leu Thr Ser Lys Glu Leu Arg
945                 950                 955                 960
Asn Pro Glu Thr Ser Val Gln Ile Ala Lys Arg Met Arg Glu Leu His
                965                 970                 975
Glu Gly Ile Asp Leu Leu Lys Lys Glu Arg Glu Ala Gly Pro Phe Val
            980                 985                 990
Trp Gln Asn Trp Asp Lys Trp Val Asn Arg Cys Glu His Ile Val Thr
        995                 1000                1005
Trp Leu Asp Gln Gln Val Arg Glu Ser Ser Gln Gly Leu Ser Arg Ala
    1010                1015                1020
Ser Ser Asp Lys Trp Lys Lys Arg Gly Tyr Val Cys Gly Val Glu Trp
1025                1030                1035                1040
Pro Val Phe Lys Gln Met Ile Tyr Lys Tyr Arg Lys Trp Leu Glu Asp
                1045                1050                1055
Gln Tyr Gly Gly Leu Asp Lys Ile Asn Glu Arg Met Val Phe Ala His
            1060                1065                1070
Asn Asp Thr Gln Tyr Gly Asn Ile Leu Arg Met Met Pro Glu Gly Glu
        1075                1080                1085
Ser Pro Leu Met Leu Pro Ala Asn Gln His Lys Gln Leu Val Val Ile
    1090                1095                1100
Asp Phe Glu Tyr Ala Asn Ala Asn Leu Pro Gly Leu Glu Phe Ala Asn
1105                1110                1115                1120
His Phe Thr Glu Trp Ala Tyr Asn Tyr His Asp Ala Glu Ala Pro Trp
                1125                1130                1135
Arg Cys Asn Thr Lys Tyr Tyr Pro Thr Ile Glu Glu Gln His Arg Phe
            1140                1145                1150
Ile Arg Ala Tyr Leu Met His Asn Pro Ser Tyr Lys Ala Ser Gly Gly
        1155                1160                1165
Tyr Thr Ser Asn Pro Ala Thr Pro His Leu Gly Pro Leu Pro Ser Ser
    1170                1175                1180
Gly Ser Thr Thr Ala Leu Ala Ala Thr Ala Ala Pro Ser Ser Ile Ser
1185                1190                1195                1200
Ala Phe Met Leu Asp Ser Arg Ala Pro Pro Gly Glu Lys Tyr Gln Glu
                1205                1210                1215
Gln Glu Ala Gln Tyr Glu Arg Gln Ile Glu Glu Glu Ala Arg Arg Leu
            1220                1225                1230
Leu Ala Glu Thr Lys Leu Trp Arg Leu Ala Asn Ser Ala Met Trp Val
        1235                1240                1245
Ala Trp Gly Ile Val Gln Ala His Ile Pro Gly Leu Pro Asp Phe Asp
    1250                1255                1260
Glu Glu Ser Glu Asp Asn Lys Thr Ser Ala Asn Pro Ser Ala Glu Ala
1265                1270                1275                1280
Ala Thr Leu Asp Ser Ala Thr Ala Glu Leu Glu Ala Ala Ala Lys Ala
                1285                1290                1295
Glu Gln Lys Ser Thr Gly Thr Val Ser Glu Glu Thr Ala Ala Lys Ile
            1300                1305                1310
Gln Ala Gln Ala Gln Met Glu Asn Asp Ala Asp Leu Phe Lys Pro Gln
        1315                1320                1325
Asp Glu Glu Glu Phe Asp Tyr Leu Gln Tyr Ala Asn Asp Arg Ala Met
    1330                1335                1340
Phe Val Trp Gly Asp Ala Leu Arg Met Gly Ile Val Ser Gln Ser Glu
1345                1350                1355                1360
Leu Pro Glu Glu Phe Leu Gln Arg Ile Lys Leu Val Glu Tyr
                1365                1370

<210> 26
<211> 4125
<212> DNA
<213> Bipolaris sorokiniana

<220>
<223> CK9 wild type

<400> 26
atgtccttct tctccgcgcc cacggccttg cgcctgcgct ccctctcccc ttcctcctct     60
ccctttgtcg ccctcgcctc ccagtcgcaa gactccatca ccggcaccat gttcacctcg    120
ctcgagaatg cccatccctc gcatcccgac tttgccaatt tggctctcct cgtcttcgag    180
gccgttatgg aggtcgtctg cgtgagcgct ccgggctatg ttgtcgcccg aatgggtcaa    240
ttcgacgcag agagccaaaa gtttcttgca aacctcaaca cccaattgtt tacgcctttc    300
ttcaccaagc tcgcctctca attgacggct gagaaactgg cagagcttgc agtcattccc    360
gttatttttg tcgtccagac gctcatatct tacattgccg cccttgccgt ctcacgcata    420
ttcaagttca acaaaagggc ctccaacttt gtcgtcgcca tggcagtttt tggtaactcc    480
aactcgctgc ccatctctct cgtcatctcc ctctccaaaa cactccgcgg cctgcactgg    540
gacagaatac cgggtgataa cgacaatgaa gttggtgccc gcggtatcct ctacctcctc    600
atcttccagc agctcggtca gctcgtgcgg tggacttggg gcttcaacgt gctcttggcc    660
ccagccagcg cttacaagga cgatgaggga agaaaccatg ccctcgagag tggagagtac    720
agcgatgacg agacgcagcg tctgctcgac gactcgcatt ctgactatga atctggaaat    780
gtcacaagct acgccacctc tgccgactgc agcgattcag actcggattc cattttcaac    840
cgtggtcagg cccaagcggc ggccctattc attacgccca ccaatggcaa tgcaacagtc    900
cctggagctg gcgacatgag cggaagcccc aagggcactt tcgcaaacgg acaccttaat    960
ggcgtgcttg cagctcacaa gaaacaacaa gacacaccca agggcatcaa gggcgtgccc   1020
acgcgtgctc ggctggcact gcaacgctcg gcgacttccg tatctgtctc aacgacacgc   1080
gctggcaacc gcatcttcaa cagcttgccc aagtggctcc aaggacccct gtctaaaatt   1140
ggttcaggct tgtccaggtt cgccaagggt gtttgggact ttatgaaccc tccgctctgg   1200
gccatgttga ttgccatctt cgtcgcctct attccaccgc tacaacgctt gttcttcact   1260
cctggaacat tcatcaacac gtcagttacg cgcgctgtaa atcaaagcgg acaagtggcc   1320
gtgcccctga ttctggttgt cctgggcgcc aatcttgctc gcaacacgct accaaaggag   1380
gaccaaaact caatcgagga ccccagcgta gagaggaagt tggtcattgc ctcgcttatc   1440
agtcgcatgc tcatcccaac gcttctcatg gcaccaatgc tggctctgac tgccaagtac   1500
gtgcccgtca gcatcctcga cgacccgatt ttcattattg tctgcttcct tctcagcggc   1560
gcacccagcg cgcttcaact ggcgcagatc tgccagatca acaatgtgta catgggcgcc   1620
atgtcgcgga tcctgtttca gagctacgtg cagaccatgg ccacttcttc caattggcag   1680
gcccagcagc ccgctggccc acagtcaagc gactctctag ccctctcgcg caacaacact   1740
ttggacgacg gtgcatcacc gcgctcaagc aaagccgtat cctttcccga tgacagcaca   1800
atttccccat tgatcatcgg caagaacaag gagctcgacc aaaaagacta cttagatctt   1860
gacaagcctc ccaggcactt tccagcaagt gtcagtaaaa agcgcctgtc cggccgtcct   1920
tcctatgaac gcgaggggtc atccaaatct ggagcggccg acgccagcac tgccttgacc   1980
tcgctacttc cagaaggctc aattgatgcg tcctctcata gccaccaagc ccacgagaac   2040
ctgttgaaac aggtcggcac ctggctcaag caggagcgaa gccggcgcca tgcacgaagg   2100
gcaagacgca aagctgccag agctagtacc gttgaccacg agtccgagag cgctgcagca   2160
gaggcactcg agaagtccat ctctcaccac aggagcgatt ctgactcctc ccatggcgag   2220
gatgcactca ctcaactggc acaaattctc gaaaagaaca tgaccttgaa gctttccgaa   2280
gcgaaaaaga ggcatcactt gcacaggtcg tccattggct taaagcgaca ctctgcaatt   2340
tctcttgatt cggactactt tgagtcggta gatcagcttg tccctagttg tgaagccact   2400
ttggacaatt caaagaccat ggcatacaac gtcgatgagc ctggggccga gtccaacctc   2460
gatgtagcag acaaggaaaa agaggcttgg tccaaatttc gagctgaaat ccttcgtctt   2520
actcacacgt tgaaattgaa gggatggagg aaggtgccca gtgagctgag caacgaaatc   2580
agtgtacaaa gactgagcgg tgcacttaca aacgccgttt acgtcgtctc tcctcccaag   2640
aatctaccag ttcccgaaca gagcgaagat ggtccaccaa agcccaggaa ccctccaccg   2700
aagctcctac ttcgtatcta tggtccacag gtagaacatc tgatcgaccg agagtcggaa   2760
ttacagatct tgacgcgact tgcccgcaaa cgtattggcc cacgcctgct cggtactttt   2820
ggcaatggcc gatttgaaga gtttctgcat gcccaaccac ttacttccaa ggagctacgc   2880
aacccggaaa cctcagtaca gatcgcaaag cgtatgcgag aacttcacga aggtattgat   2940
ttgctgaaga aagagcgaga ggctggcccg tttgtttggc aaaattggga caagtgggtc   3000
aatcgctgcg aacacattgt gacctggttg gatcaacagg tccgcgagag tagccagggc   3060
ctctccagag cctcttcgga caaatggaag aagcgaggat atgtgtgcgg cgtcgagtgg   3120
cccgtgttca agcaaatgat ttacaagtac cgaaagtggc tagaggatca gtatggcggt   3180
cttgataaga tcaacgagcg catggtcttt gcccataatg atacccagta tggaaatatc   3240
cttcgcatga tgccagaagg cgagtcacca ctgatgctac cggcaaacca gcacaagcag   3300
ctagtagtta ttgatttcga atacgcaaat gccaatcttc ctgggttgga atttgccaat   3360
cacttcacgg aatgggcgta caactaccac gatgcagaag ctccctggcg ttgcaacact   3420
aagtattatc caacaatcga agagcagcac cgttttatcc gggcgtactt gatgcacaac   3480
ccctcgtaca aggccagtgg cggatacacg tccaatcccg ctacaccaca ccttggaccg   3540
cttccctctt cgggtagcac aactgccctg gcagcaacag ccgccccaag cagcatctcc   3600
gcctttatgc tcgactcgcg agctcctcca ggcgaaaagt atcaagagca ggaagcccag   3660
tacgagcgac agattgaaga ggaagcgcgc cgcctcctgg cagaaaccaa actctggcgt   3720
ctcgccaact ccgccatgtg ggttgcctgg ggcatagtcc aagcacacat cccgggcctc   3780
cctgacttcg acgaggagag cgaagacaac aagacaagcg ccaacccaag cgcagaagca   3840
gcaacgctag acagcgcaac tgccgagctg gaagccgcag ctaaagcaga gcaaaagagc   3900
acggggacag tgagtgaaga aacggcggca aagatacagg cgcaggcgca gatggaaaat   3960
gatgctgacc ttttcaagcc gcaggacgag gaggaattcg attacttgca gtatgctaat   4020
gaccgggcga tgtttgtctg gggggatgcg cttaggatgg ggattgtgag tcaaagcgag   4080
ttgcccgagg agtttctgca aaggattaag cttgtggagt attga                   4125

<210> 27
<211> 4125
<212> DNA
<213> Artificial Sequence

<220>
<223> CK9 codon optimised

<400> 27
atgagctttt tcagcgcccc gaccgccctg cgcctgagat cactgagtcc gagtagcagt     60
ccgtttgtgg cactggcaag tcagagccag gatagcatta ccggtaccat gtttaccagt    120
ctggaaaatg cacatccgag ccatccggat tttgccaatc tggccctgct ggtttttgaa    180
gccgttatgg aagttgtgtg cgttagcgca ccgggctatg ttgtggcacg tatgggtcag    240
tttgatgcag aaagtcagaa atttctggca aatctgaata cccagctgtt taccccgttt    300
ttcaccaaac tggcaagcca gctgaccgcc gaaaaactgg cagaactggc agtgattccg    360
gttatttttg tggttcagac cctgattagc tatattgcag ccctggccgt gagccgtatt    420
tttaaattca ataagcgtgc aagtaacttc gttgttgcaa tggcagtttt tggcaatagc    480
aatagtctgc cgattagcct ggttattagc ctgagtaaaa ccctgcgcgg cctgcattgg    540
gatcgcattc cgggtgacaa tgataatgaa gtgggtgcac gcggcattct gtatctgctg    600
atttttcagc agctgggcca gctggtgcgc tggacctggg gttttaatgt tctgctggcc    660
ccggcaagtg cctataaaga tgatgaaggt cgcaatcatg cactggaaag tggtgaatat    720
agtgatgatg aaacccagcg cctgctggat gatagccata gtgattatga aagcggcaat    780
gtgaccagtt atgccaccag cgccgattgc agcgatagtg atagtgatag catttttaat    840
cgcggccagg cacaggcagc agccctgttt attaccccga ccaatggtaa tgccaccgtt    900
ccgggtgcag gtgacatgag tggcagcccg aaaggtacct ttgccaatgg tcatctgaat    960
ggcgtgctgg cagcccataa aaaacagcag gataccccga aaggcattaa gggcgtgccg   1020
acccgcgcac gtctggcttt acagcgtagc gccaccagtg tgagtgtgag taccacccgc   1080
gcaggtaatc gcatttttaa tagcctgccg aaatggctgc aaggtccgct gagtaaaatt   1140
ggtagtggcc tgagtcgctt tgcaaaaggt gtttgggatt ttatgaatcc gccgctgtgg   1200
gccatgctga ttgcaatttt tgttgcaagt attccgccgc tgcaacgcct gtttttcacc   1260
cctggtacct ttattaatac cagtgtgacc cgcgccgtga atcagagcgg tcaggttgcc   1320
gtgccgctga ttctggtggt gctgggcgcc aatctggcgc gtaataccct gccgaaagaa   1380
gatcagaata gcattgaaga tccgagtgtg gaacgtaaac tggttattgc cagcctgatt   1440
agccgtatgc tgattccgac cctgctgatg gccccgatgc tggcactgac cgcaaaatat   1500
gtgccggtta gcattctgga tgatccgatt tttattattg tgtgctttct gctgagcggc   1560
gcaccgagcg ccctgcaact ggcacagatt tgtcagatta ataatgttta catgggcgca   1620
atgagtcgta ttctgtttca gagttatgtt cagacaatgg ctaccagtag caattggcag   1680
gcacagcagc cggccggccc gcagagcagc gacagcttag cactgagccg caataatacc   1740
ctggatgatg gcgcaagtcc gcgcagtagc aaagccgtta gttttccgga tgatagtacc   1800
attagtccgc tgattattgg taaaaataag gaactggatc agaaagatta cctggatctg   1860
gataaaccgc cgcgccattt tccggccagc gtgagcaaaa aacgtctgag cggtcgcccg   1920
agctatgaac gtgaaggcag cagcaaaagc ggcgcagccg atgcaagtac cgcactgacc   1980
agcctgctgc cggaaggcag cattgatgcc agtagtcata gccatcaggc ccatgaaaat   2040
ctgctgaaac aggttggtac ctggctgaaa caggaacgca gccgtcgcca tgcacgccgt   2100
gccagacgca aagccgcacg cgcttcaacc gtggatcatg aaagtgaaag tgccgcagcc   2160
gaagcactgg aaaaaagtat tagtcatcat cgtagcgata gcgatagtag tcatggtgaa   2220
gatgccctga cccagctggc ccagattctg gaaaagaata tgaccctgaa actgagtgaa   2280
gcaaaaaagc gtcatcatct gcatcgtagt agtattggcc tgaaacgcca tagtgcaatt   2340
agtctggata gcgattattt tgaaagtgtt gatcagctgg tgccgagctg cgaagcaacc   2400
ctggataata gcaaaacaat ggcttataat gtggatgaac cgggcgccga aagtaatctg   2460
gatgttgccg ataaagaaaa agaagcatgg agtaaattcc gtgccgaaat tctgcgcctg   2520
acccataccc tgaaactgaa aggctggcgt aaagtgccga gcgaactgag taatgaaatt   2580
agcgtgcagc gtctgagcgg cgcgctgacc aatgccgtgt atgtggttag cccgccgaaa   2640
aatctgccgg ttccggaaca gagtgaagat ggtccgccga aaccgcgtaa tccgccgccg   2700
aaactgctgc tgcgcatcta tggcccgcag gttgaacatc tgattgatcg tgaaagtgaa   2760
ctgcaaattc tgacccgcct ggcacgtaaa cgtattggtc cgcgtctgct gggcaccttt   2820
ggtaatggtc gttttgaaga atttctgcac gctcagccgc tgaccagcaa agaactgcgc   2880
aatccggaaa ccagcgtgca gattgcaaaa cgtatgcgtg aactgcatga aggcattgat   2940
ctgctgaaaa aagaacgcga agccggcccg tttgtttggc agaattggga taaatgggtg   3000
aatcgttgcg aacatattgt gacctggctg gatcagcagg ttcgcgaaag cagtcagggc   3060
ctgagtcgtg caagtagcga taaatggaaa aaacgtggct atgtgtgtgg cgttgaatgg   3120
ccggtgttta aacagatgat ctataaatat cgcaagtggc tggaagatca gtatggcggt   3180
ctggataaaa ttaatgaacg catggttttt gcccataatg atacccagta tggcaatatt   3240
ctgcgcatga tgccggaagg cgaaagcccg ctgatgctgc cggcaaatca gcataaacag   3300
ctggtggtta ttgattttga atatgcaaat gcaaacctgc cgggcctgga atttgccaat   3360
cattttaccg aatgggccta taattatcat gatgcagaag ccccgtggcg ttgcaatacc   3420
aaatattatc cgaccattga agaacagcat cgctttattc gcgcctatct gatgcataat   3480
ccgagctata aagccagcgg tggctatacc agcaatccgg ccaccccgca tctgggtccg   3540
ctgcctagca gcggtagtac caccgcactg gccgcaaccg ccgcacctag cagtattagc   3600
gcctttatgc tggatagccg tgcaccgccg ggtgaaaaat atcaggaaca ggaagcccag   3660
tatgaacgcc agattgaaga agaagcccgc cgtctgctgg cagaaaccaa actgtggcgt   3720
ctggccaata gtgccatgtg ggttgcctgg ggcattgtgc aggcccatat tccgggtctg   3780
ccggattttg atgaagaaag cgaagataat aagaccagcg caaatccgag cgcagaagca   3840
gcaaccctgg acagtgcaac cgcagaactg gaagccgccg ccaaagcaga acagaaaagt   3900
accggcaccg tgagtgaaga aaccgccgca aaaattcagg cacaggcgca gatggaaaat   3960
gatgccgatc tgtttaaacc gcaggatgaa gaagaatttg attatctgca atacgcaaat   4020
gatcgcgcca tgtttgtttg gggtgacgca ctgcgcatgg gtattgtgag ccagagtgaa   4080
ctgccggaag aatttttaca gcgcattaag ctggtggaat attga                   4125

<210> 28
<211> 369
<212> PRT
<213> Paramicrosporidium saccamoebae

<220>
<223> CK10

<400> 28
Met Ile Phe Ser Val Leu Gln Ser Glu Asn Ala Cys Glu Val Ala Glu
1               5                   10                  15
Ala Val Leu Ser Leu Val Gln Ser Val Leu Gly Ile Trp Leu His Ser
            20                  25                  30
Ser Ala Gln Asp Ile Gln Val Tyr Arg Ile Ser Ser Ala Met Thr Asn
        35                  40                  45
Met Val Phe Ser Val Thr Leu Ser Cys Ala Gly Pro Ser Glu Glu Asp
    50                  55                  60
Thr Glu Ser Ala Thr Trp His Pro Glu Arg Leu Leu Leu Arg Val Tyr
65                  70                  75                  80
Gly Asn Ser Asn Trp Met Phe Gln Arg Asp Leu Glu Glu Ser Thr Ala
                85                  90                  95
Leu Val Leu Thr Glu His Gly Ile Ile Pro Gln Trp Tyr Gly Val Phe
            100                 105                 110
Gly Asn Gly Arg Phe Glu Asp Tyr Ile Pro Ser Thr Ser Val Ser Ala
        115                 120                 125
Arg Glu Phe Gln Ser Pro Glu Leu Cys Ala Glu Ile Ser Lys Cys Leu
    130                 135                 140
Gly Arg Ile His Asn Met Leu Pro Asn Val Val Glu Ala Thr Thr Trp
145                 150                 155                 160
Glu Asn Arg Asp Tyr Met Leu Glu Arg Leu Glu Ser Trp Arg Leu Ala
                165                 170                 175
Ala Cys Leu Ser Met Ser Asn Leu Leu Gln Arg Lys Leu Ser Ser Asp
            180                 185                 190
His Ala Glu Ile Leu Arg Lys Ile Glu Leu Trp Asp Ala Phe Ser Pro
        195                 200                 205
Glu Phe Ile Pro Thr Leu Arg Cys Arg Ile Ala Gln Val Asp Ser Pro
    210                 215                 220
Val Val Phe Ala His Cys Asp Leu His His Gly Asn Val Leu Arg Phe
225                 230                 235                 240
His Ala Lys Asp Gly Val Ile Ala Ile Asp Phe Glu Tyr Gly Met Pro
                245                 250                 255
Thr Phe Arg Gly Phe Asp Leu Ala Asn Phe Leu Ser Glu Phe Cys Tyr
            260                 265                 270
Asp Tyr Asn Ser Pro Thr Pro Glu Val Pro Asp Trp Ser Asn Tyr Pro
        275                 280                 285
Ser Arg Asp Thr Ile Val Arg Ile Leu Gln Asn Tyr Leu Gly Glu Ser
    290                 295                 300
Pro Pro Glu Asp Val Tyr Lys Ile Met Asp Glu Ile Ser Val Phe Ala
305                 310                 315                 320
Ala Ala Val Gln Leu Phe Trp Gly His Trp Cys Leu Ile Lys Ala Val
                325                 330                 335
Asp Met Val Asp Asp Gly Tyr Lys Gly Phe Asp Tyr Ile Thr Ser Ala
            340                 345                 350
Phe Glu Arg Tyr Lys Arg Phe Ile Gln Leu Thr Glu Gln Leu Ala Ile
        355                 360                 365
Val

<210> 29
<211> 1110
<212> DNA
<213> Paramicrosporidium saccamoebae

<220>
<223> CK10 wild type

<400> 29
atgattttta gtgtcctaca gtctgaaaac gcctgcgagg tcgctgaagc tgtgctctcg     60
ctcgttcaga gcgtgttggg gatttggctc cattcaagtg cccaagacat tcaagtctat    120
cgcatcagct cggcaatgac taacatggtg ttttctgtga cgctttcgtg tgccgggccc    180
tctgaggagg atacagaatc tgccacttgg catcccgaga ggctgctctt acgagtctat    240
ggaaattcaa attggatgtt tcagcgcgac ttggaagagt cgacggcgtt agtgctaacc    300
gagcacggca taattccgca atggtatggg gtgtttggaa atggccgttt cgaggactat    360
atcccgagca cttcagtgtc ggcccgagag tttcagtccc cggaactatg tgcagagatc    420
tcaaagtgtc tggggagaat tcacaacatg ctgccaaacg tcgtcgaggc cacgacgtgg    480
gagaatcggg attacatgct cgagcgctta gagtcctggc gtcttgcggc ttgtctctca    540
atgtcgaacc tcttgcaacg aaagttgagc agtgatcatg ctgaaattct aaggaagatt    600
gaactgtggg atgcattttc tcctgaattt atccccacgc ttagatgcag gattgcacaa    660
gttgactctc ccgtcgtctt tgctcattgt gatttgcacc atggcaatgt gctgaggttt    720
catgcaaaag atggggtcat tgccattgac tttgagtatg ggatgccgac attcagaggg    780
ttcgatttgg ccaatttttt gagcgagttt tgctacgact acaactcacc caccccagaa    840
gtacccgatt ggtccaacta tccatcaaga gacactattg tccggattct ccaaaactac    900
ctcggggaga gtccccctga agatgtttac aaaattatgg acgagattag tgtctttgca    960
gcagctgtgc agctgttttg ggggcattgg tgcctgatca aggctgtgga catggtcgat   1020
gatggatata aaggattcga ctatatcact tctgcatttg aacggtacaa gagatttatt   1080
cagctcaccg agcaacttgc gattgtgtaa                                    1110

<210> 30
<211> 1110
<212> DNA
<213> Artificial Sequence

<220>
<223> CK10 codon optimised

<400> 30
atgatcttca gcgtgctgca aagtgaaaat gcatgtgaag ttgccgaagc cgttctgagt     60
ctggttcaga gtgttctggg catctggctg catagcagcg cccaggatat tcaggtgtat    120
cgtattagta gcgcaatgac caatatggtg ttcagcgtga ccctgagttg tgccggcccg    180
agcgaagaag ataccgaaag cgcaacctgg catccggaac gcctgctgct gcgcgtgtat    240
ggcaatagca attggatgtt ccagcgcgat ctggaagaaa gcaccgccct ggtgctgacc    300
gaacatggta ttattccgca gtggtatggc gtgttcggta atggccgctt cgaagattat    360
attccgagca ccagcgttag tgcccgcgag ttccagagcc cggaactgtg cgcagaaatt    420
agtaaatgtc tgggtcgtat tcataatatg ctgccgaatg tggtggaagc aaccacctgg    480
gaaaatcgcg attatatgct ggaacgcctg gaaagttggc gcctggcagc atgtctgagc    540
atgagcaatc tgctgcaacg taaactgagt agtgatcatg cagaaattct gcgtaaaatt    600
gaactgtggg atgccttcag cccggagttc attccgaccc tgcgctgccg cattgcacag    660
gttgatagtc cggtggtgtt cgcacattgt gatctgcatc atggtaatgt gctgcgcttc    720
catgccaaag atggcgtgat tgcaattgac ttcgaatatg gtatgccgac cttccgtggc    780
ttcgatctgg ccaacttcct gagtgagttc tgttatgatt ataatagtcc gaccccggaa    840
gtgccggatt ggagtaatta tccgagtcgc gataccattg tgcgcattct gcaaaattat    900
ctgggcgaaa gcccgccgga agatgtgtat aaaattatgg atgaaatcag cgtgttcgcc    960
gcagcagttc agctgttctg gggtcattgg tgtctgatta aagcagttga tatggttgat   1020
gatggctata aaggcttcga ttatattacc agcgccttcg aacgctataa acgcttcatt   1080
cagctgaccg aacagctggc cattgtttga                                    1110

<210> 31
<211> 397
<212> PRT
<213> Catenaria anguillulae

<220>
<223> CK11

<400> 31
Met Thr Ala Pro Thr Ile Ser Val Arg Arg Ile Asp Tyr Thr Val Asp
1               5                   10                  15
Ser Asn Asn Pro Gln Ala Leu Asp Ser Ser Ala Arg His Leu Gly Val
            20                  25                  30
Val Leu Gly Leu Cys Ser Glu Ala Asp Ala Gln Ala Ala Lys Val Thr
        35                  40                  45
Arg Cys Lys Gln Gly Ile Thr Asn Lys Leu Leu Lys Val Ser Leu Pro
    50                  55                  60
Ser Gly Asn Lys Tyr Leu Met Arg Val Tyr Gly His Gly Thr Ser Thr
65                  70                  75                  80
Leu Ile Asp Arg Asp Ala Glu Val Arg Asn Met Ala Tyr Leu Ala Ser
                85                  90                  95
His Gly Leu Ala Pro Pro Leu His Ala Arg Phe Asn Asn Gly Leu Val
            100                 105                 110
Tyr Gly Phe Val Lys Gly Thr Ala Ala His Pro Asp Ala Leu Ala His
        115                 120                 125
Pro Gln Val Trp Pro Ala Ile Ala Lys His Leu Ala Glu Trp His Ser
    130                 135                 140
Leu Pro Leu Pro Ser Pro Ser Ser Pro Ser Asn Asp Gly Ala Gln Ala
145                 150                 155                 160
Pro Pro Pro Ala Ser Gln Leu Phe Val Thr Leu Asp Arg Trp Leu Gly
                165                 170                 175
Met Val Thr Gln Ala Ala Gln Ala Arg Asp Gly Pro Thr Ala Thr Gln
            180                 185                 190
Phe Glu Gly Ile Ala Leu Ala Asp Leu Gly Ala Glu Arg Asp Arg Leu
        195                 200                 205
Phe Ala Thr Leu Pro Ala Ser Pro Leu Thr Phe Asn His Asn Asp Leu
    210                 215                 220
Leu Ser Gly Asn Val Ile Leu Gln His Asp Gln Ala His Ala Asp Leu
225                 230                 235                 240
Asp Ala Ile Asp Leu Ser Gly Asp Ala Asn Glu Val Asp Pro Ser Asp
                245                 250                 255
Thr Pro Asp Ala Leu Ala Ser Val Val Ser Ala Lys Phe Ile Asp Tyr
            260                 265                 270
Glu Tyr Gly Ala Leu Gly Pro Ala Ala Phe Asp Val Ala Asn His Trp
        275                 280                 285
Cys Glu Trp Ala Gly Phe Glu Cys Glu Tyr Trp Arg Tyr Pro Ala Thr
    290                 295                 300
Glu Thr Gln Arg Ala Trp Leu Thr Thr Tyr Leu Thr Ala Leu Asn Lys
305                 310                 315                 320
Glu Ala Lys Pro Pro Thr Val Ala Glu Val Asp Thr Trp Val Glu His
                325                 330                 335
Val Lys Glu Tyr Thr Pro Ala Ser His Phe Phe Trp Ile Leu Trp Ala
            340                 345                 350
Leu Val Gln Ala Thr Val Ser Asp Ile Asp Phe Asp Tyr Ala Gly Tyr
        355                 360                 365
Ala Arg Leu Arg Trp Ser Glu Leu Lys Arg Trp Cys Glu Ala Arg Cys
    370                 375                 380
Arg Arg Pro Ser Arg Ile Ser Ser Ser Arg Ile Arg Pro
385                 390                 395

<210> 32
<211> 1194
<212> DNA
<213> Catenaria anguillulae

<220>
<223> CK11 wild type

<400> 32
atgaccgcac ctaccatcag cgtccgccgc atcgactaca cagtcgactc caacaaccca     60
caggccctcg actcgtctgc ccgccacctc ggcgtcgtgc tgggcctgtg ctccgaggcc    120
gacgcccagg ccgccaaggt gacccgatgc aagcagggca tcaccaacaa gctcctcaag    180
gtgtcgctgc cctcaggcaa caagtacctt atgcgcgtct acggccacgg cacctcgacc    240
ctcattgacc gcgacgccga ggtccgaaac atggcctact tggcatccca cggcctcgcc    300
ccgccgctcc atgcacgctt caacaatggc ctcgtgtacg gctttgtcaa gggcacggcc    360
gcgcacccgg atgctctggc gcacccgcaa gtgtggccag caattgcaaa gcacttggcc    420
gaatggcact cgctgcctct gccctcgccg tcgtcgccgt caaacgatgg cgcacaggcg    480
ccgccgcctg catcgcagtt gtttgtcact ctggaccgct ggctcggtat ggtcacgcaa    540
gccgctcaag cccgcgacgg cccgacggca acgcaatttg agggcattgc cctggctgac    600
ctcggcgccg aacgtgaccg cctctttgca acgcttcctg cgtcgccgct caccttcaac    660
cacaatgacc tgctctcggg caatgtcatt ttgcaacacg accaagcgca cgccgacttg    720
gacgccattg acctgtcggg cgacgccaac gaggtggacc cttcggatac gcccgatgct    780
ttggccagtg tagtgtctgc caaattcatt gattacgagt acggcgccct tggccctgcg    840
gcgttcgatg tggccaacca ctggtgcgag tgggccgggt tcgagtgcga gtactggcgg    900
taccccgcaa cagagacgca gcgcgcgtgg ttgacgacct acttgacggc gctcaacaag    960
gaggccaagc cgcccacggt ggccgaggtc gacacatggg tggaacatgt caaggagtac   1020
acgccggcat cgcacttctt ttggattctg tgggcgctgg tgcaagccac ggtgagcgac   1080
attgattttg attatgccgg gtatgcacgg ttgaggtgga gcgagctcaa gcgctggtgc   1140
gaggccagat gccgccggcc aagccgcatc tcgtcgagtc ggattcggcc gtga         1194

<210> 33
<211> 1194
<212> DNA
<213> Artificial Sequence

<220>
<223> CK11 codon optimised

<400> 33
atgaccgccc cgaccattag cgtgcgccgt attgattata ccgtggatag caataatccg     60
caggcactgg atagcagcgc acgccatctg ggcgttgtgc tgggcctgtg cagtgaagca    120
gatgcacagg ccgcaaaagt gacccgctgt aaacagggta ttaccaataa actgctgaaa    180
gttagcctgc cgagcggtaa taaatatctg atgcgtgtgt atggtcatgg caccagtacc    240
ctgattgatc gtgatgccga agtgcgtaat atggcatatc tggcaagtca tggtctggca    300
ccgccgctgc acgctcgctt caataatggc ctggtgtatg gcttcgttaa aggcaccgcc    360
gcacatccgg atgccctggc acatccgcag gtgtggccgg ctattgccaa acatctggcc    420
gaatggcata gcctgccgct gccgagtccg agtagtccga gtaatgatgg cgcccaggca    480
ccgccgccgg caagccagtt attcgtgacc ctggatcgct ggctgggcat ggtgacccag    540
gcagcccagg cacgtgatgg tccgaccgca acccagttcg aaggcattgc actggcagat    600
ctgggcgccg aacgtgatcg cctgttcgca accctgccgg caagtccgct gaccttcaat    660
cataatgatc tgctgagcgg caatgtgatt ctgcaacatg atcaggcaca tgcagatctg    720
gatgcaattg atctgagtgg tgatgcaaat gaagtggacc ctagcgatac cccggatgca    780
ctggccagtg ttgttagcgc aaaattcatt gattatgaat acggtgcact gggtccggcc    840
gccttcgatg tggcaaatca ttggtgcgaa tgggccggct tcgaatgcga atattggcgt    900
tatccggcaa ccgaaacaca acgcgcatgg ctgaccacct atctgaccgc actgaataaa    960
gaagccaaac cgccgaccgt ggcagaagtg gatacctggg tggaacatgt taaagaatat   1020
accccggcca gccacttctt ctggattctg tgggcactgg ttcaggcaac cgttagtgat   1080
attgacttcg attatgcagg ctatgcacgc ctgcgttgga gcgaactgaa acgttggtgc   1140
gaagcccgtt gtcgtcgtcc gagtcgtatt agtagtagcc gtattcgtcc gtga         1194

<210> 34
<211> 369
<212> PRT
<213> Mucor circinelloides

<220>
<223> CK12

<400> 34
Met Ala Ser Pro Glu Gly Glu Thr Leu Pro Ser Ile Pro Gly Cys Asp
1               5                   10                  15
Thr Ile Ile Asp Leu Ala Val Leu Lys Gly Asp Glu Leu Thr Asn Lys
            20                  25                  30
Val Leu Lys Leu Ile Gln Val Leu Phe Pro Asp Tyr Ala Glu Asn Leu
        35                  40                  45
Glu Lys Ile Glu Leu Asn Arg Val Ser Gly Ala Leu Thr Asn Ala Val
    50                  55                  60
Phe Phe Val Asn Ala Pro Asn Lys Arg Arg Leu Leu Leu Arg Val Tyr
65                  70                  75                  80
Gly Asn Gly Val Asp Gln Ile Ile Asp Arg Glu Asn Glu Leu Ala Trp
                85                  90                  95
Leu Ala Arg Leu Ser Ser Leu Asn Ile Gly Pro Ser Leu Leu Gly Ile
            100                 105                 110
Phe Gly Asn Gly Arg Phe Glu Glu Tyr Leu Pro Ser Thr Thr Leu Thr
        115                 120                 125
His His Asp Ile Arg Asp Pro Glu Thr Ser Lys Gly Ile Ala Ala Cys
    130                 135                 140
Ile Arg Glu Leu His Asp Ile Val Ala Val Tyr Pro Phe Ser Pro Glu
145                 150                 155                 160
Lys Asn His Leu Glu Ile Trp Ala Asn Ile Asp Lys Trp Tyr Gln Val
                165                 170                 175
Val Met Ser Leu Leu Pro Val Leu Tyr Lys Lys Ser Asp Gly Trp Thr
            180                 185                 190
Gln Val Leu Thr Thr Phe Asn Leu Glu Arg Leu Thr Phe Glu Ile Glu
        195                 200                 205
Glu Cys Lys Gln Ile Leu Glu Ala Ala Lys Ser Pro Ile Val Phe Gly
    210                 215                 220
His Asn Asp Thr Gln Tyr Gly Asn Val Leu Lys Leu Glu Lys Thr Asn
225                 230                 235                 240
Glu Leu Val Ile Val Asp Phe Glu Tyr Ala Gly Tyr Asn Pro Arg Gly
                245                 250                 255
Phe Asp Ile Ala Asn His Phe Cys Glu Trp Thr Tyr Asp Tyr His Ser
            260                 265                 270
Glu Gln Pro Ala Ser Met Asp Thr Ser Gln Tyr Pro Thr Tyr Glu Glu
        275                 280                 285
Gln Ile Arg Phe Leu Asn Ala Tyr Leu Glu Thr Lys Ser Lys Asn Asp
    290                 295                 300
Asn Pro Asp Ile Leu Asp Lys Ala Val Thr Ala Glu Cys Leu Gln Lys
305                 310                 315                 320
Glu Ala Ala Met Trp Leu Met Ala Ser His Leu Ser Trp Gly Leu Trp
                325                 330                 335
Gly Leu Ile Gln Ala Ser Gln Ser Glu Ile Asp Phe Asp Tyr Phe Leu
            340                 345                 350
Phe Ser Thr Gln Arg Leu Asn Ala Phe Arg Glu Glu Phe Ala Lys Trp
        355                 360                 365
Lys

<210> 35
<211> 1110
<212> DNA
<213> Mucor circinelloides

<220>
<223> CK12 wild type

<400> 35
atggcatcac ccgaaggaga aacgctgcct tctattccag gctgtgatac aatcattgat     60
ctagctgtgc ttaaaggaga cgaattaaca aacaaagtgc tcaagttgat tcaagtactt    120
ttcccggatt atgctgaaaa cctcgagaaa atcgaactga atagagtgag tggtgccttg    180
acaaatgccg tcttttttgt caacgcaccc aacaagcgcc gtttattgtt gagagtatac    240
ggtaatggag tggatcagat tattgaccga gagaatgaat tagcttggct tgctcgcctc    300
tctagcttga atattggacc tagtttatta ggcatatttg gtaacggtcg atttgaggag    360
tatttgcctt ccaccacttt gacgcaccat gacattcgtg accccgagac ttccaaggga    420
attgctgctt gtattcgcga attgcacgac attgtagcag tatatccatt ctcaccagaa    480
aagaatcatt tggagatttg ggccaacatt gataaatggt atcaggttgt catgtcgttg    540
ttaccagtac tttacaaaaa gagcgatggc tggacacaag tgctgacgac ctttaatttg    600
gagcgtttga catttgaaat tgaagagtgt aaacaaatct tggaagctgc taaatcgccc    660
attgtatttg ggcataatga tacacaatat ggcaatgtgc tcaagttgga aaagacaaac    720
gaattggtga ttgttgattt cgaatacgca ggctataatc cccgtggatt tgacattgct    780
aaccactttt gtgaatggac atacgactat cactctgaac agcctgcctc aatggacaca    840
agtcaatatc ctacctacga agagcaaatc agattcctga acgcctatct cgaaaccaag    900
tctaagaacg acaatccaga cattttagac aaagcagtta cagctgaatg cttacagaaa    960
gaggctgcca tgtggctcat ggccagccat ctgtcctggg gtttgtgggg tttgatccaa   1020
gcaagtcaaa gcgagattga tttcgactac tttttgttct ctactcaacg tttgaatgct   1080
ttccgcgaag agtttgctaa atggaaatga                                    1110

<210> 36
<211> 1110
<212> DNA
<213> Artificial Sequence

<220>
<223> CK12 codon optimised

<400> 36
atggcaagcc cggaaggcga aaccttaccg agcattccgg gctgcgatac cattattgat     60
ctggcagttc tgaaaggtga tgaactgacc aataaagttc tgaaactgat tcaggttctg    120
ttcccggatt atgcagaaaa tctggaaaaa attgagctga atcgcgtgag cggcgcactg    180
accaatgccg tgttcttcgt gaatgccccg aataaacgcc gcctgctgct gcgcgtgtat    240
ggcaatggcg ttgatcagat tattgatcgc gaaaatgaac tggcctggct ggcacgcctg    300
agtagcctga atattggtcc gagtctgctg ggcatcttcg gtaatggccg cttcgaagaa    360
tatctgccga gtaccaccct gacccatcat gatattcgtg atccggaaac cagtaaaggc    420
attgccgcat gtattcgcga actgcatgat attgttgccg tgtatccgtt cagcccggaa    480
aaaaatcatc tggaaatctg ggcaaatatt gataaatggt atcaggtggt tatgagcctg    540
ctgccggtgc tgtataaaaa aagtgatggt tggacccagg tgctgaccac cttcaatctg    600
gaacgtctga ccttcgaaat tgaagaatgt aaacagattc tggaagccgc caaaagtccg    660
attgtgttcg gccataatga tacccagtat ggtaatgttc tgaaattaga aaagaccaat    720
gagctggtta ttgttgactt cgaatatgcc ggctataatc cgcgcggctt cgatattgca    780
aatcacttct gtgaatggac ctatgattat catagtgaac agccggccag catggatacc    840
agccagtatc cgacctatga agaacagatt cgcttcctga atgcctatct ggaaaccaaa    900
agtaaaaatg ataacccgga tattctggat aaagcagtta ccgcagaatg tctgcaaaaa    960
gaagcagcca tgtggctgat ggcaagccat ctgagctggg gtctgtgggg tctgattcag   1020
gcaagccaga gtgaaattga cttcgattac ttcctgttca gcacccagcg cctgaatgca   1080
ttccgcgaag agttcgcaaa atggaaatga                                    1110

<210> 37
<211> 262
<212> PRT
<213> Escherichia coli

<220>
<223> HTK1

<400> 37
Met Gln Val Asp Leu Leu Gly Ser Ala Gln Ser Ala His Ala Leu His
1               5                   10                  15
Leu Phe His Gln His Ser Pro Leu Val His Cys Met Thr Asn Asp Val
            20                  25                  30
Val Gln Thr Phe Thr Ala Asn Thr Leu Leu Ala Leu Gly Ala Ser Pro
        35                  40                  45
Ala Met Val Ile Glu Thr Glu Glu Ala Ser Gln Phe Ala Ala Ile Ala
    50                  55                  60
Ser Ala Leu Leu Ile Asn Val Gly Thr Leu Thr Gln Pro Arg Ala Gln
65                  70                  75                  80
Ala Met Arg Ala Ala Val Glu Gln Ala Lys Ser Ser Gln Thr Pro Trp
                85                  90                  95
Thr Leu Asp Pro Val Ala Val Gly Ala Leu Asp Tyr Arg Arg His Phe
            100                 105                 110
Cys His Glu Leu Leu Ser Phe Lys Pro Ala Ala Ile Arg Gly Asn Ala
        115                 120                 125
Ser Glu Ile Met Ala Leu Ala Gly Ile Ala Asn Gly Gly Arg Gly Val
    130                 135                 140
Asp Thr Thr Asp Ala Ala Ala Asn Ala Ile Pro Ala Ala Gln Thr Leu
145                 150                 155                 160
Ala Arg Glu Thr Gly Ala Ile Val Val Val Thr Gly Glu Met Asp Tyr
                165                 170                 175
Val Thr Asp Gly His Arg Ile Ile Gly Ile His Gly Gly Asp Pro Leu
            180                 185                 190
Met Thr Lys Val Val Gly Thr Gly Cys Ala Leu Ser Ala Val Val Ala
        195                 200                 205
Ala Cys Cys Ala Leu Pro Gly Asp Thr Leu Glu Asn Val Ala Ser Ala
    210                 215                 220
Cys His Trp Met Lys Gln Ala Gly Glu Arg Ala Val Ala Arg Ser Glu
225                 230                 235                 240
Gly Pro Gly Ser Phe Val Pro His Phe Leu Asp Ala Leu Trp Gln Leu
                245                 250                 255
Thr Gln Glu Val Gln Ala
            260

<210> 38
<211> 789
<212> DNA
<213> Escherichia coli

<220>
<223> HTK1 wild type

<400> 38
atgcaagtcg acctgctggg ttcagcgcaa tctgcgcacg cgttacacct ttttcaccaa     60
cattcccctc ttgtgcactg catgaccaat gatgtggtgc aaacctttac cgccaatacc    120
ttgctggcgc tcggtgcatc gccagcgatg gttatcgaaa ccgaagaggc cagtcagttt    180
gcggctatcg ccagtgcctt gttgattaac gttggcacac tgacgcagcc acgcgctcag    240
gcgatgcgtg ctgccgttga gcaagcaaaa agctctcaaa caccctggac gcttgatcca    300
gtagcggtgg gtgcgctcga ttatcgccgc catttttgtc atgaactttt atcttttaaa    360
ccggcagcga tacgtggtaa tgcttcggaa atcatggcat tagctggcat tgctaatggc    420
ggacggggag tggataccac tgacgccgca gctaacgcga tacccgctgc acaaacactg    480
gcacgggaaa ctggcgcaat cgtcgtggtc actggcgaga tggattatgt taccgatgga    540
catcgtatca ttggtattca cggtggtgat ccgttaatga ccaaagtggt aggaactggc    600
tgtgcattat cggcggttgt cgctgcctgc tgtgcgttac caggcgatac gctggaaaat    660
gtcgcatctg cctgtcactg gatgaaacaa gccggagaac gcgcagtcgc cagaagcgag    720
gggccaggca gttttgttcc acatttcctt gatgcgctct ggcaattgac gcaggaggtg    780
caggcatga                                                            789

<210> 39
<211> 789
<212> DNA
<213> Artificial Sequence

<220>
<223> HTK1 codon optimised

<400> 39
atgcaggtgg atctgctggg tagcgcacag agcgcccatg ccctgcatct gttccatcag     60
catagcccgc tggttcattg tatgaccaat gatgtggtgc agaccttcac cgccaatacc    120
ctgctggcac tgggcgcaag tccggcaatg gtgattgaaa ccgaagaagc aagtcagttc    180
gccgcaattg caagcgccct gctgattaat gtgggcaccc tgacccagcc gcgcgcacaa    240
gcaatgcgcg ccgcagtgga acaggccaaa agcagtcaga ccccgtggac cctggaccct    300
gtggcagttg gtgcactgga ttatcgtcgc cacttctgtc atgaactgct gagcttcaaa    360
ccggccgcca ttcgcggcaa tgcaagtgaa attatggccc tggccggtat tgccaatggc    420
ggccgtggtg ttgataccac cgatgcagca gcaaatgcca ttccggcagc ccagaccctg    480
gcacgcgaaa ccggcgctat tgttgttgtg accggtgaaa tggattatgt gaccgatggt    540
catcgcatta ttggcattca tggtggtgat ccgctgatga ccaaagttgt tggtaccggt    600
tgtgcactga gtgccgttgt ggccgcatgt tgtgcactgc cgggcgatac cctggaaaat    660
gtggccagcg catgtcattg gatgaaacag gcaggtgaac gtgcagttgc ccgcagcgaa    720
ggccctggta gcttcgtgcc gcacttcctg gatgccctgt ggcagctgac ccaggaagtt    780
caggcctga                                                            789

<210> 40
<211> 224
<212> PRT
<213> Bacillus licheniformis

<220>
<223> HTK2

<400> 40
Met Thr Arg Val Ser Glu Glu Ala Met Lys Asp Leu Leu Ser Val Tyr
1               5                   10                  15
Phe Ile Met Gly Ser Asn Asn Thr Ala Gly Asp Pro Leu Thr Val Ile
            20                  25                  30
Glu Lys Ala Leu Lys Gly Gly Ala Thr Leu Phe Gln Phe Arg Glu Lys
        35                  40                  45
Gly Glu Gly Ala Leu Lys Ala Gly Asp Gln Thr Ala Phe Ala Arg Gln
    50                  55                  60
Val Gln Ala Leu Cys Lys Gln Phe Asn Val Pro Phe Ile Ile Asn Asp
65                  70                  75                  80
Asp Val Glu Leu Ala Leu Glu Leu Asp Ala Asp Gly Val His Ile Gly
                85                  90                  95
Gln Asp Asp Asp Lys Ala Ala Asp Val Arg Ala Arg Ile Gly Asp Lys
            100                 105                 110
Ile Leu Gly Val Ser Ala His Thr Leu Glu Glu Val Leu Lys Ala Glu
        115                 120                 125
Lys Asp Gly Ala Asp Tyr Ile Gly Ala Gly Pro Val Tyr Pro Thr Glu
    130                 135                 140
Thr Lys Arg Asp Thr Lys Ala Val Gln Gly Val Ser Leu Ile Gln Glu
145                 150                 155                 160
Ile Arg Arg Gln Gly Ile Gly Ile Pro Val Val Gly Ile Gly Gly Ile
                165                 170                 175
Thr Val Glu Asn Cys Val Pro Val Ile Glu Ala Gly Ala Asp Gly Ile
            180                 185                 190
Ser Val Ile Ser Ala Ile Ser Lys Ala Ala Asp Pro Lys Gln Ala Ala
        195                 200                 205
Glu Ala Phe Ser Glu Lys Val Gln Ala Thr Lys Gln Ser Ala His Ser
    210                 215                 220

<210> 41
<211> 675
<212> DNA
<213> Bacillus licheniformis

<220>
<223> HTK2 wild type

<400> 41
atgacgcgag tctcagaaga agcaatgaaa gatctgttgt cggtctattt tattatgggg     60
tcaaacaata ccgcagggga tcctttaact gttattgaaa aagctttaaa aggcggtgcg    120
accctttttc aattccgcga aaaaggcgag ggtgcgttga aagccggaga tcaaacggcg    180
tttgcccgac aggtgcaggc gctgtgcaaa cagttcaatg taccgtttat tatcaacgat    240
gatgtagaac tcgcgcttga acttgacgca gatggcgtgc atatcggcca ggatgatgat    300
aaggccgcag acgtcagagc gagaatcggg gacaaaatcc tcggcgtttc cgcgcataca    360
cttgaagaag tcttaaaggc ggaaaaggat ggagcggatt acatcggtgc ggggcctgtt    420
taccctactg aaacaaagcg ggatacaaaa gctgtgcagg gcgtctcgct cattcaagag    480
atccgcaggc agggaattgg cattcctgtg gtcggaatcg gcgggatcac ggtcgaaaac    540
tgcgtccccg tcatcgaggc cggggcggac ggcatcagtg tcatcagcgc catcagcaaa    600
gccgccgatc caaagcaggc cgccgaggcg tttagcgaga aagtccaagc taccaaacaa    660
agcgcacatt cctaa                                                     675

<210> 42
<211> 675
<212> DNA
<213> Artificial Sequence

<220>
<223> HTK2 codon optimised

<400> 42
atgacccgtg tgagcgaaga agcaatgaaa gatctgctga gcgtgtactt cattatgggc     60
agtaataata ccgcaggcga tccgctgacc gttattgaaa aagcactgaa aggcggtgcc    120
accctgttcc agttccgtga aaaaggtgaa ggcgccctga aagcaggcga tcagaccgca    180
ttcgcacgtc aggtgcaggc actgtgtaaa cagttcaatg ttccgttcat tattaacgat    240
gatgttgaac tggcactgga actggatgcc gatggtgttc atattggtca ggatgatgat    300
aaagccgccg atgtgcgcgc ccgtattggt gataaaattc tgggtgtgag cgcacatacc    360
ctggaagaag tgctgaaagc cgaaaaagat ggcgcagatt atattggtgc cggcccggtg    420
tatccgaccg aaaccaaacg cgataccaaa gccgtgcagg gtgttagtct gattcaggaa    480
attcgtcgtc agggcattgg cattccggtg gtgggtattg gtggcattac cgttgaaaat    540
tgtgtgccgg tgattgaagc cggtgcagat ggcattagtg ttattagcgc cattagcaaa    600
gcagcagatc cgaaacaggc cgcagaagca ttcagcgaaa aagttcaggc aaccaaacag    660
agcgcccata gctga                                                     675

<210> 43
<211> 275
<212> PRT
<213> Clostridium algidicarnis

<220>
<223> HTK3

<400> 43
Met Asn Tyr Ile Asp Lys Ser Ile Asp Leu Ile Arg Leu Thr Lys Glu
1               5                   10                  15
Lys Asn Pro Leu Val Asp Phe Ala Val Asn Tyr Val Thr Ala Asn Asp
            20                  25                  30
Ser Thr Ser Val Thr Ser Tyr Ile Gly Gly Ser Pro Val Met Thr Asp
        35                  40                  45
Asp Ser Ile Asp Ala Ala Asp Val Val Glu Tyr Gly Asn Val Asp Ala
    50                  55                  60
Leu Ile Phe Asn Ile Gly Thr Ile Thr Glu Lys Gln Tyr His Ser Met
65                  70                  75                  80
Met Glu Ala Gly Lys Arg Ala Thr Glu Arg Gly Ile Pro Ile Val Ile
                85                  90                  95
Asp Pro Val Ala Thr Ser Ile Thr Pro Phe Arg Thr Met Ile Ile Gln
            100                 105                 110
Arg Met Leu Asp Glu Leu Asn Val Ser Val Ile Lys Gly Asn Leu Gly
        115                 120                 125
Glu Ile Lys Ala Cys Leu Gly Leu Lys Thr Asn Ser Lys Gly Val Asp
    130                 135                 140
Ser Asn Glu Asn Pro Glu Gly Ala Glu Glu Phe Cys Ile Lys Leu Ala
145                 150                 155                 160
Arg Lys Arg Asn Leu Val Val Ala Met Thr Gly Pro Lys Asp Ile Ile
                165                 170                 175
Thr Asp Gly Glu Arg Ile Val Val Ile Glu Asn Gly Thr Asp Arg Leu
            180                 185                 190
Pro Lys Val Ile Gly Thr Gly Cys Ile Leu Gly Ala Met Val Ala Thr
        195                 200                 205
Tyr Cys Gly Ala Thr Asn Asp Tyr Val Leu Ala Ala Ser Thr Ala Ile
    210                 215                 220
Met Leu Met Gly Val Ala Gly Glu Leu Ala Ser Glu Ile Thr Lys Glu
225                 230                 235                 240
Asp Glu Gly His Tyr Lys Phe Lys Val Asn Leu Ile Asp Val Leu Ser
                245                 250                 255
Thr Ile Val Asp Asn Glu Asp Lys Ile Lys Ala Lys Ala Asn Met Lys
            260                 265                 270
Ile Ile Lys
        275

<210> 44
<211> 828
<212> DNA
<213> Clostridium algidicarnis

<220>
<223> HTK3 wild type

<400> 44
atgaactata tagataaatc aatagaccta attagactta caaaggagaa aaaccctctt     60
gtggattttg cggtaaacta tgttactgca aatgattcta ctagtgtaac ctcttacatt    120
ggaggtagcc ctgtgatgac agatgattct atagatgctg cggatgttgt ggaatacggt    180
aatgtagatg cattaatatt taacatagga actataacag aaaagcagta tcactctatg    240
atggaagctg gtaaaagggc tactgaaaga ggaattccta tagtaataga ccctgtagca    300
actagtatta caccttttag aactatgatt atccaaagaa tgcttgatga acttaatgtt    360
tctgtaataa agggaaacct tggggaaatt aaagcttgcc ttggtcttaa gacaaactca    420
aaaggtgttg attctaatga aaaccctgag ggtgcagaag aattctgcat taaacttgca    480
agaaaaagaa acttagtagt tgcaatgact ggtccaaagg atataataac ggacggtgaa    540
agaatagttg ttattgaaaa tggtacggac aggcttccaa aggttatagg aactggctgt    600
attctaggtg ctatggttgc tacctattgt ggtgctacta atgattacgt attagcagct    660
tctacagcta taatgcttat gggagtagct ggtgagcttg caagtgaaat tacaaaggaa    720
gatgaaggac actataagtt taaagttaat ttaatagatg tattatctac tatagtggac    780
aatgaagata aaataaaagc caaggctaac atgaaaatta ttaaatag                 828

<210> 45
<211> 828
<212> DNA
<213> Artificial Sequence

<220>
<223> HTK3 codon optimised

<400> 45
atgaactaca tcgataagag cattgatctg attcgcctga ccaaagaaaa aaatccgctg     60
gtggacttcg cagtgaatta tgtgaccgca aatgatagta ccagtgtgac cagttatatt    120
ggcggtagcc cggttatgac cgatgatagt attgatgccg cagatgtggt ggaatatggc    180
aatgttgatg cactgatctt caatattggt accattaccg aaaaacagta tcatagcatg    240
atggaagcag gcaaacgtgc aaccgaacgt ggtattccga ttgttattga tccggtggca    300
accagcatta ccccgttccg taccatgatt attcagcgta tgctggatga actgaatgtt    360
agcgttatta aaggcaatct gggcgaaatt aaagcatgtc tgggcctgaa aaccaatagc    420
aaaggcgttg atagtaatga aaatccggaa ggtgcagaag agttctgtat taaactggca    480
cgtaaacgta atctggtggt ggcaatgacc ggtccgaaag atattattac cgatggtgaa    540
cgcattgtgg tgattgaaaa tggcaccgat cgtctgccga aagtgattgg taccggctgc    600
attctgggtg caatggtggc cacctattgc ggtgcaacca atgattatgt gctggccgcc    660
agtaccgcca ttatgctgat gggtgttgcc ggtgaactgg ccagtgaaat taccaaagaa    720
gatgaaggtc attataagtt caaggtgaat ctgattgatg ttctgagcac cattgttgat    780
aatgaagata aaatcaaggc caaagcaaat atgaaaatca ttaagtga                 828

<210> 46
<211> 279
<212> PRT
<213> Aerococcus sp.

<220>
<223> HTK4

<400> 46
Met Ser Lys Lys Ile Ser Leu Pro Phe Ala Lys Tyr Ser Gln Gln Val
1               5                   10                  15
Lys Glu Lys Ala Pro Leu Ile Gln Val Leu Asn Asn Tyr Val Thr Ile
            20                  25                  30
His Asp Val Ala Asn Val Ile Leu Ala Ser Gly Gly Arg Pro Val Met
        35                  40                  45
Thr Asp Asn Leu Pro Asn Ser Gln Asp Val Val Lys Thr Ala Asp Leu
    50                  55                  60
Leu Leu Leu Asn Ala Ala Ser Pro Arg Pro Asn Gln Glu Leu Leu Asp
65                  70                  75                  80
Leu Ala Thr Thr Ala Lys Asn Asp His His Pro Val Val Leu Asp Pro
                85                  90                  95
Val Gly Val Ser Ala Met Pro Ser Lys Leu Lys Leu Cys Gln Asp Leu
            100                 105                 110
Ile Asp Gln Gly Leu Val Thr Ala Val Lys Gly Asn Ala Ser Glu Ile
        115                 120                 125
Arg Ser Leu Leu Phe Glu Lys Ser Gln Gly Ser Gly Val Asp Leu Gly
    130                 135                 140
Pro Gly Asp Glu Val Thr Leu Ala Asn Leu Ala Asp Phe Ala Pro Asp
145                 150                 155                 160
Phe Lys Ala Tyr Ala Gln Glu Lys Arg Ile Ile Leu Ala Met Ser Gly
                165                 170                 175
Pro Ile Asp Leu Val Thr Asp Gly Lys Arg Leu Ala Val Ile Glu Asn
            180                 185                 190
Gly His Pro Trp Met Ala Ser Tyr Thr Gly Ser Gly Cys Gln Leu Ser
        195                 200                 205
Gly Val Leu Ala Ser Phe Leu Ala Gly Asn Pro Asp Glu Asp Pro Phe
    210                 215                 220
Tyr Leu Ala Thr Ala Ala Val Ile Ser Tyr Gly Val Ala Gly Glu Ile
225                 230                 235                 240
Ala Ala Gln Val Leu Gln Pro Tyr Glu Gly Asn Ala Thr Tyr Ser Asn
                245                 250                 255
Arg Val Ile Asp Gln Val Phe Leu Leu Glu Ala Lys Glu Leu Glu Arg
            260                 265                 270
Arg Ala Lys Tyr Asp Ile Gln
        275

<210> 47
<211> 840
<212> DNA
<213> Aerococcus sp.

<220>
<223> HTK4 wild type

<400> 47
atgtctaaaa aaataagcct tccctttgcg aaatatagtc agcaggtcaa ggagaaagcc     60
ccgctgatcc aagtcttaaa taattatgtc accatacatg atgtggccaa tgtgattttg    120
gccagtggcg ggcgtcccgt gatgaccgat aacttaccta atagtcagga tgtagtcaag    180
acagctgacc ttttgctatt gaatgccgct agtcccagac ctaatcagga attgttggac    240
ctcgccacca cagcgaaaaa tgatcaccat cccgtggtct tagacccagt gggggtttca    300
gcgatgccgt ctaaattaaa gctctgtcaa gatctgattg accaggggtt agtgacagcg    360
gtgaagggga atgcttcgga aattcgaagt ctcctttttg aaaaaagcca aggatctggg    420
gtcgacctgg gtcctggaga cgaggtaacc ttagccaacc tggctgattt tgctccggac    480
tttaaagctt atgcccaaga aaaaagaatt atcctagcca tgtcgggtcc gattgactta    540
gtgacagacg gcaagcggct ggcagtcatt gaaaatggcc acccctggat ggcttcctat    600
acggggtcag gttgccagtt aagcggcgtc ctagctagct ttttagccgg taatccggat    660
gaggatcctt tttacctagc tacagcagcc gtgattagct atggggtggc gggagaaatt    720
gctgcccaag tcctccagcc ttatgagggc aatgccactt attccaaccg ggtgattgac    780
caggttttct tattagaggc caaagaatta gaaaggagag ccaagtatga cattcaataa    840

<210> 48
<211> 840
<212> DNA
<213> Artificial Sequence

<220>
<223> HTK4 codon optimised

<400> 48
atgagtaaga agatcagcct gccgttcgcc aaatatagcc agcaggttaa agaaaaagcc     60
ccgctgattc aggtgctgaa taattatgtg accattcatg atgtggcaaa tgttattctg    120
gccagcggtg gccgtccggt gatgaccgat aatctgccga atagtcagga tgtggttaaa    180
accgcagatc tgctgctgct gaatgccgca agtccgcgcc cgaatcagga actgctggat    240
ctggcaacca ccgcaaaaaa tgatcatcat ccggttgttc tggaccctgt tggcgttagt    300
gcaatgccga gtaaactgaa actgtgccag gatctgattg atcagggtct ggtgaccgcc    360
gttaaaggta atgcaagcga aattcgtagc ctgctgttcg aaaaaagtca gggcagcggt    420
gttgatctgg gtccgggtga tgaagtgacc ctggccaatc tggcagactt cgcaccggac    480
ttcaaagcct atgcacagga aaaacgtatt attctggcca tgagcggtcc gattgatctg    540
gtgaccgatg gtaaacgtct ggccgttatt gaaaatggcc atccgtggat ggcaagctat    600
accggtagtg gttgccagct gagcggcgtt ctggcatcat tcctggcagg taatccggat    660
gaagatccgt tctatctggc aaccgccgca gttattagtt atggtgtggc aggtgaaatt    720
gcagcacagg ttctgcaacc gtatgaaggt aatgcaacct atagcaatcg cgttattgat    780
caggtgttcc tgctggaagc caaagaactg gaacgccgtg ccaaatatga tattcagtga    840

<210> 49
<211> 280
<212> PRT
<213> Lactobacillus casei

<220>
<223> HTK5

<400> 49
Met Ser Lys Val Ile Thr Asp Val Phe Tyr Thr Ala Phe Lys Thr Ala
1               5                   10                  15
Leu Pro Leu Thr Ser Ser Pro Leu Val Gln Cys Ile Thr Asn Glu Ile
            20                  25                  30
Thr Val Glu Ser Met Ala Asn Ala Leu Leu Tyr Ile Asp Ala Lys Pro
        35                  40                  45
Val Met Ala Asp Asp Gln Arg Glu Phe Pro Glu Phe Phe Ala Gln Ser
    50                  55                  60
Asp Ala Leu Leu Leu Asn Leu Gly His Ile Ser Glu Val Arg Gln Gln
65                  70                  75                  80
Asn Leu Leu Ala Ala Gly Lys Phe Ala Gln Ala Thr Asn Gln Pro Thr
                85                  90                  95
Val Ile Asp Leu Val Gly Val Ser Ala Thr Gln Leu Arg Tyr Asp Leu
            100                 105                 110
Gly His Gln Leu Leu Ala Asn His Pro Asn Val Val Lys Gly Asn Ile
        115                 120                 125
Ser Glu Met Arg Arg Phe Ala Asp Leu Lys Ser Thr Gly Arg Gly Val
    130                 135                 140
Asp Gly Ser Gln Leu Asp Gln Ser Ala Thr Ala Leu Gly Glu Leu Ala
145                 150                 155                 160
Ala Ser Leu Gln Gln Leu Thr Gln Ala Phe Pro Thr Thr Thr Phe Leu
                165                 170                 175
Ala Thr Gly Lys Ile Asp Leu Val Val Ser Ala Lys Gly Thr Trp Tyr
            180                 185                 190
Leu Lys Asn Gly Val Pro Gln Leu Asp Arg Phe Thr Gly Thr Gly Asp
        195                 200                 205
Ile Val Gly Ala Leu Ile Ala Ala Leu Leu Gly Thr Gly Leu Asp Asn
    210                 215                 220
Asp Ala Ala Val Val Val Ala Val Ser Tyr Phe Asn Cys Cys Gly Glu
225                 230                 235                 240
Val Ala Ala Ala Gln Asn Arg Thr Gly Gly Leu Ala Ala Phe Arg Glu
                245                 250                 255
Gly Thr Leu Asn Gln Leu Ser Leu Leu Ala Ala Thr Ala Asp Trp Leu
            260                 265                 270
Gln Met Val Lys Gly Glu Ala Leu
        275                 280

<210> 50
<211> 843
<212> DNA
<213> Lactobacillus casei

<220>
<223> HTK5 wild type

<400> 50
atgtcaaaag taatcactga cgttttttac accgcgttca aaaccgccct gccattaaca     60
tcatccccac tggtgcaatg cattaccaac gaaataaccg tcgaatccat ggctaatgca    120
ttgctttaca ttgatgcgaa gccggtcatg gctgatgatc agcgcgagtt cccggaattc    180
tttgctcaaa gtgatgcgtt gctgttgaat cttggacaca tttccgaagt gcgccagcag    240
aatctattag ccgctggcaa gtttgcgcag gccaccaacc agccaacggt gattgatttg    300
gtcggcgttt ctgccaccca gttgcgctat gacttaggcc atcaattgtt agccaatcat    360
ccgaacgtgg tcaaaggtaa catttctgaa atgcgtcgat tcgctgatct aaaaagcaca    420
ggccgcggcg ttgatggaag ccagttagat caaagtgcga ccgccttggg agaactagcc    480
gcgagcttgc agcagctgac ccaagcgttt cccaccacta ccttcttggc aaccggcaag    540
attgatctcg ttgtgagtgc gaaggggact tggtatttga aaaatggggt gccgcagctg    600
gatcgtttca ccgggactgg tgacattgtc ggtgccttga ttgccgcgct gttggggaca    660
ggtttggaca acgacgcagc agtcgtcgtg gctgtgagtt acttcaactg ctgtggcgaa    720
gtagcagctg cgcagaatcg aaccggcgga ctggcggcat ttcgcgaagg taccttgaat    780
caactttcct tactggccgc caccgctgac tggcttcaaa tggtgaaagg ggaagcgtta    840
tga                                                                  843

<210> 51
<211> 843
<212> DNA
<213> Artificial Sequence

<220>
<223> HTK5 codon optimised

<400> 51
atgagcaaag ttatcaccga tgtgttctat accgccttca aaaccgccct gccgctgacc     60
agtagtccgc tggttcagtg tattaccaat gaaattaccg ttgaaagcat ggccaatgcc    120
ctgctgtata ttgatgccaa accggttatg gcagatgatc agcgcgagtt cccggagttc    180
ttcgcccaga gcgatgcact gctgctgaat ctgggccata ttagtgaagt gcgccagcag    240
aatctgctgg cagcaggtaa attcgcccag gcaaccaatc agccgaccgt gattgatctg    300
gtgggtgtta gtgcaaccca gctgcgttat gatctgggcc atcagctgct ggcaaatcat    360
ccgaatgtgg tgaaaggtaa tattagtgaa atgcgccgct tcgccgatct gaaaagtacc    420
ggtcgcggtg tggatggtag ccagctggat cagagtgcaa ccgcactggg cgaactggcc    480
gccagcctgc aacagctgac ccaggcattc ccgaccacca ccttcctggc aaccggtaaa    540
attgatctgg ttgtgagcgc aaaaggcacc tggtatctga aaaatggtgt tccgcagctg    600
gatcgcttca ccggcaccgg tgatattgtg ggtgccctga ttgccgcact gctgggtacc    660
ggcctggata atgatgcagc cgttgtggtg gcagtgagtt acttcaattg ctgtggtgaa    720
gttgcagccg cacagaatcg caccggcggt ctggcagcct tccgcgaagg taccctgaat    780
cagctgagtc tgctggcagc caccgccgat tggctgcaaa tggttaaagg tgaagcactg    840
tga                                                                  843

<210> 52
<211> 262
<212> PRT
<213> Shigella boydii

<220>
<223> HTK6

<400> 52
Met Gln Val Asp Leu Leu Ser Ser Ala Gln Ser Ala His Thr Leu His
1               5                   10                  15
Leu Phe His Gln His Ser Pro Leu Val His Cys Met Thr Asn Asp Val
            20                  25                  30
Val Gln Thr Phe Thr Ala Asn Thr Leu Leu Ala Leu Gly Ala Ser Pro
        35                  40                  45
Ala Met Val Ile Glu Thr Glu Glu Ala Ser Gln Phe Ala Ala Ile Ala
    50                  55                  60
Ser Ala Leu Leu Ile Asn Val Gly Thr Leu Thr Gln Pro Arg Ala Gln
65                  70                  75                  80
Ala Met Arg Ala Ala Val Glu Gln Ala Lys Ser Ser Gln Thr Pro Trp
                85                  90                  95
Thr Leu Asp Pro Val Ala Val Gly Ala Leu Asp Tyr Arg Arg Arg Phe
            100                 105                 110
Cys Leu Glu Leu Leu Ser His Lys Pro Thr Ala Ile Arg Gly Asn Ala
        115                 120                 125
Ser Glu Ile Met Ala Leu Ala Gly Val Ala Asn Gly Gly Arg Gly Val
    130                 135                 140
Asp Thr Thr Asp Ala Ala Ala Asn Ala Ile Pro Ala Ala Gln Thr Leu
145                 150                 155                 160
Ala Arg Glu Thr Gly Ala Ile Val Val Val Thr Gly Glu Val Asp Tyr
                165                 170                 175
Val Thr Asp Gly His Arg Ile Val Gly Ile His Gly Gly Asp Pro Leu
            180                 185                 190
Met Thr Lys Val Val Gly Thr Gly Cys Ala Leu Ser Ala Val Val Ala
        195                 200                 205
Ala Cys Cys Ala Leu Pro Gly Asp Met Leu Glu Asn Val Ala Ser Ala
    210                 215                 220
Cys His Trp Met Lys Gln Ala Gly Glu Arg Ala Val Ala Arg Ser Glu
225                 230                 235                 240
Gly Pro Gly Ser Phe Val Pro His Phe Leu Asp Ala Leu Trp Gln Leu
                245                 250                 255
Thr Gln Glu Val Gln Ala
            260

<210> 53
<211> 789
<212> DNA
<213> Shigella boydii

<220>
<223> HTK6 wild type

<400> 53
atgcaagtcg acctgctgag ttccgcgcaa tctgcgcaca cgttacacct ttttcaccaa     60
cattcccctc ttgtgcactg catgaccaac gatgtggtgc aaacctttac tgccaatacc    120
ttgctggcgc tcggtgcatc gccagcgatg gttatcgaaa ccgaagaggc cagtcagttt    180
gcggctattg ctagtgcgtt gctgattaac gttggcacgc tgacgcagcc acgcgcacag    240
gcgatgcgtg ccgccgttga gcaagcaaaa agctctcaaa ccccctggac gcttgatcct    300
gtagcggtgg gtgcactcga ttatcgccgc cgtttttgtc tggaacttct gtctcataag    360
ccaaccgcca tacgtggtaa tgcttcggaa atcatggcat tagctggcgt tgctaatggt    420
ggacggggag tggataccac tgacgccgca gctaacgcaa tacccgctgc acaaacactg    480
gcacgggaaa ctggcgcaat cgtcgtggtc acaggcgagg tggattatgt taccgatggg    540
catcgtatcg ttggcattca cggtggcgat ccgctaatga ctaaagtggt aggaactggc    600
tgtgcattat cggcggttgt tgctgcctgt tgtgcgttac caggcgatat gctggaaaat    660
gtcgcatctg cctgtcactg gatgaaacaa gccggagagc gcgcagtcgc cagaagcgag    720
gggccaggca gttttgttcc acatttcctt gatgcgctct ggcaattgac gcaggaggtg    780
caggcatga                                                            789

<210> 54
<211> 789
<212> DNA
<213> Artificial Sequence

<220>
<223> HTK6 codon optimised

<400> 54
atgcaggttg atctgctgag cagtgcacag agcgcccata ccctgcatct gttccatcag     60
catagcccgc tggtgcattg tatgaccaat gatgttgttc agaccttcac cgcaaatacc    120
ctgctggccc tgggcgcaag tccggcaatg gtgattgaaa ccgaagaagc aagtcagttc    180
gccgccattg caagcgccct gctgattaat gttggtaccc tgacccagcc gcgcgcccag    240
gcaatgcgtg cagcagtgga acaggcaaaa agcagtcaga ccccgtggac cctggaccct    300
gttgccgtgg gtgcactgga ttatcgtcgc cgcttctgtc tggaactgct gagtcataaa    360
ccgaccgcaa ttcgtggtaa tgccagcgaa attatggccc tggccggcgt ggcaaatggc    420
ggtcgtggcg ttgataccac cgatgccgca gccaatgcca ttccggcagc acagaccctg    480
gcccgcgaaa ccggtgcaat tgtggttgtt accggtgaag tggattatgt gaccgatggc    540
catcgtattg ttggtattca tggcggtgat ccgctgatga ccaaagtggt gggtaccggt    600
tgcgcactga gtgccgtggt ggccgcatgt tgtgccctgc cgggtgatat gctggaaaat    660
gtggcaagcg cctgccattg gatgaaacag gcaggcgaac gcgccgtggc acgtagtgaa    720
ggtccgggta gcttcgttcc gcacttcctg gatgcactgt ggcagctgac ccaggaagtt    780
caggcctga                                                            789

<210> 55
<211> 209
<212> PRT
<213> Bacteroides thetaiotaomicron

<220>
<223> HTK7

<400> 55
Met Val Ser Leu Gln Phe Ile Thr His Gln Thr Asp Arg Tyr Thr Tyr
1               5                   10                  15
Phe Glu Ser Ala Leu Met Ala Leu Glu Gly Gly Cys Lys Trp Ile Gln
            20                  25                  30
Leu Arg Met Lys Glu Ala Pro Cys Glu Glu Val Glu Ala Val Ala Leu
        35                  40                  45
Gln Leu Lys Pro Leu Cys Lys Glu Lys Glu Ala Ile Leu Leu Leu Asp
    50                  55                  60
Asp His Val Glu Leu Ala Lys Lys Leu Glu Val Asp Gly Val His Leu
65                  70                  75                  80
Gly Lys Lys Asp Met Pro Ile Asp Gln Ala Arg Gln Leu Leu Gly Glu
                85                  90                  95
Ala Phe Ile Ile Gly Gly Thr Ala Asn Thr Phe Glu Asp Val Val Gln
            100                 105                 110
His Tyr Arg Ala Gly Ala Asp Tyr Leu Gly Ile Gly Pro Phe Arg Phe
        115                 120                 125
Thr Thr Thr Lys Lys Asn Leu Ser Pro Val Leu Gly Leu Glu Gly Tyr
    130                 135                 140
Thr Ala Ile Leu Ser Gln Met Lys Glu Ala Asn Ile Glu Leu Pro Val
145                 150                 155                 160
Val Ala Ile Gly Gly Ile Thr Arg Glu Asp Ile Pro Ala Ile Leu Glu
                165                 170                 175
Thr Gly Val Asn Gly Ile Ala Leu Ser Gly Thr Ile Leu Arg Ala Glu
            180                 185                 190
Asp Pro Ala Ala Glu Thr Arg Lys Ile Leu Asn Met Lys Arg Ile Ile
        195                 200                 205
Lys

<210> 56
<211> 630
<212> DNA
<213> Bacteroides thetaiotaomicron

<220>
<223> HTK7 wild type

<400> 56
atggtcagtc tacaatttat cacccaccag accgatcggt atacttattt cgaatcggca     60
ctcatggcac ttgaaggagg atgtaagtgg attcagctac gcatgaagga agctccgtgt    120
gaagaggtgg aagctgttgc cctccaacta aagccgctct gcaaagaaaa agaagcgatc    180
ttacttctgg atgaccacgt cgaacttgcc aaaaagctgg aagtggacgg agtgcatctg    240
ggcaaaaaag acatgccgat agatcaggca cgacaattac ttggagaagc atttattatc    300
ggaggtacgg caaatacatt cgaagatgtc gtacagcact accgtgccgg agcggattac    360
ctcggcatcg gtcctttccg gtttaccact acaaagaaaa acctgagtcc tgtactggga    420
ctggaaggtt ataccgctat tttatctcag atgaaggaag cgaatatcga acttccggta    480
gtagccatcg gaggaatcac ccgtgaggat atacccgcca tactcgaaac cggagtgaac    540
ggaatcgcgc tttcaggaac gattcttcgg gcggaggatc cggcagcgga aacacgaaag    600
attttgaaca tgaaacgtat aatcaaataa                                     630

<210> 57
<211> 630
<212> DNA
<213> Artificial Sequence

<220>
<223> HTK7 codon optimised

<400> 57
atggttagcc tgcaattcat tacccatcag accgatcgtt atacctactt cgaaagcgcc     60
ctgatggcac tggaaggtgg ttgtaaatgg attcagctgc gtatgaaaga agccccgtgt    120
gaagaagtgg aagccgttgc cctgcaactg aaaccgctgt gtaaagaaaa agaagcaatt    180
ctgctgctgg atgatcatgt tgaactggca aaaaaactgg aagttgatgg cgtgcatctg    240
ggcaaaaaag atatgccgat tgatcaggca cgccagctgc tgggcgaagc cttcattatt    300
ggtggtaccg ccaatacctt cgaagatgtt gtgcagcatt atcgcgcagg tgccgattat    360
ctgggtattg gcccgttccg cttcaccacc accaaaaaaa atctgagtcc ggttctgggt    420
ctggaaggct ataccgccat tctgagtcag atgaaagaag cgaatattga actgccggtg    480
gtggcaattg gtggcattac ccgcgaagat attccggcaa ttctggaaac cggcgtgaat    540
ggcattgcac tgagtggtac cattctgcgc gccgaagatc cggcagccga aaccagaaaa    600
attctgaata tgaaacgcat tatcaagtga                                     630

<210> 58
<211> 351
<212> PRT
<213> Prochlorococcus marinus

<220>
<223> HTK8

<400> 58
Met Lys Asn Pro Asn Ile Ile Gln Pro Glu Asp Leu Arg Ile Ser Gln
1               5                   10                  15
Ile Ile Asp Ala Asn Leu Asp Arg Ala Arg Glu Gly Leu Arg Val Leu
            20                  25                  30
Glu Asp Trp Ala Arg Phe Gly Leu Gly Asn Glu Asp Phe Val Ile Arg
        35                  40                  45
Ile Lys Asn Phe Arg Gln Ile Leu Gly Lys Asn His Leu Glu Ile Tyr
    50                  55                  60
Lys Leu Ser Arg Asn His Ile Glu Asp Gln Cys Lys Gly Leu Ser His
65                  70                  75                  80
Val Glu Gln Ile Asn Arg Asn Ser Ser Ser Lys Ile Ile Ser Ser Asn
                85                  90                  95
Ser Ala Arg Val Gln Glu Ala Leu Arg Val Ile Glu Glu Phe Ser Arg
            100                 105                 110
Ile His Asn Ser Lys Leu Ser Lys Ile Ala Ser Glu Ile Arg Tyr Glu
        115                 120                 125
Ile Tyr Thr Leu Glu Ile Glu Ile Leu Asn Phe Asn Thr Arg Lys Arg
    130                 135                 140
Ala Gln Ser Ile Ile Ser Lys Asn Asn Leu Tyr Ser Ile Thr Asp Pro
145                 150                 155                 160
Arg Glu Asn Leu Leu Glu Ile Ile Glu Lys Ile Leu Leu Gly Gly Val
                165                 170                 175
Lys Ile Ile Gln His Arg Phe Lys Glu Gly Asn Asp Lys Asp His Leu
            180                 185                 190
Lys Glu Ala Ile Glu Ile Asn Lys Leu Cys Lys Lys Tyr Asn Ser Leu
        195                 200                 205
Phe Ile Val Asn Asn Arg Leu Asp Ile Ala Leu Ala Ser Lys Ala Asp
    210                 215                 220
Gly Val His Leu Gly Gln Asp Asp Leu Asp Ile Lys Thr Val Arg Lys
225                 230                 235                 240
Leu Leu Gly Ala Ser Lys Ile Ile Gly Val Ser Ala Asn Asn Ser Thr
                245                 250                 255
Asp Ile Asn Lys Ala Val Lys Asp Gly Cys Asp Tyr Ile Gly Val Gly
            260                 265                 270
Pro Val Phe Pro Thr Leu Thr Lys Lys Asn Lys Glu Pro Leu Gly Glu
        275                 280                 285
Glu Lys Ile Lys Ala Leu Thr Lys Glu Leu Asn Ile Pro Cys Phe Ala
    290                 295                 300
Ile Gly Gly Ile Asn Lys Leu Asn Ile Ser Ser Leu Lys Asn His Gly
305                 310                 315                 320
Ile Ser Lys Val Ala Ile Val Ser Gly Leu Leu Asn Ser Glu Asp Pro
                325                 330                 335
Lys Asp Glu Ala Met Ile Ile Ile Lys Glu Leu Ser His Glu Asn
            340                 345                 350

<210> 59
<211> 1056
<212> DNA
<213> Prochlorococcus marinus

<220>
<223> HTK8 wild type

<400> 59
atgaaaaacc caaacataat tcaacctgaa gatttacgaa tatctcaaat tattgacgct     60
aatttagata gagcaagaga aggtctaagg gttttggagg actgggccag atttggcttg    120
ggtaatgaag attttgttat aagaataaaa aacttccgac aaatattagg taaaaatcat    180
ttagaaattt ataaattatc aagaaaccat attgaagatc aatgcaaagg gttatctcat    240
gtcgaacaaa tcaacaggaa tagttcctct aaaataataa gttctaattc tgcaagagtt    300
caagaagcgc ttcgagttat tgaagaattt tcaaggattc ataatagtaa actttctaaa    360
atagcttccg agattagata tgaaatttac actttagaaa ttgaaatatt aaatttcaat    420
actcgtaaga gagcacaatc aataattagt aaaaacaatt tatattcgat aacagaccca    480
agagaaaact tattagaaat aattgaaaaa atattattag gaggggtaaa aataattcag    540
catcgattta aagaaggtaa tgataaagac catctcaaag aggcaattga aataaataaa    600
ttatgtaaga aatataattc tttgttcatc gttaataaca gattagatat agcattggca    660
tcaaaggcag atggtgttca tcttggtcaa gacgacctcg atataaaaac agtaagaaaa    720
ttacttggtg cctcaaaaat cattggagtt tcagccaaca attcaactga catcaataag    780
gctgtaaaag atggatgcga ttacattgga gtcgggccag tttttccaac tttgacaaag    840
aaaaataaag aacctctcgg tgaagagaaa attaaggcct taacaaaaga actaaatatt    900
ccttgttttg caataggagg aattaataaa ttaaatatct cttctctaaa aaatcatgga    960
attagtaagg ttgcaatagt ttctgggctg ctaaattcag aagatccaaa agatgaagct   1020
atgattatca taaaagaatt atcccatgaa aattag                             1056

<210> 60
<211> 1056
<212> DNA
<213> Artificial Sequence

<220>
<223> HTK8 codon optimised

<400> 60
atgaagaacc cgaatattat ccagccggaa gatctgcgca ttagtcagat tattgatgca     60
aatctggatc gtgcccgtga aggcctgcgt gtgctggaag attgggcccg cttcggtctg    120
ggcaatgaag acttcgttat tcgcattaaa aacttccgcc agattctggg taaaaatcat    180
ctggaaatct ataaactgag ccgcaatcat attgaagatc agtgtaaagg cctgagtcat    240
gttgaacaga ttaatcgtaa tagcagcagt aaaattatca gcagtaatag cgcacgtgtg    300
caggaagcac tgcgtgtgat tgaagagttc agccgcattc ataatagcaa actgagcaaa    360
attgcaagcg aaattcgtta tgaaatctat accctggaaa ttgaaatcct gaacttcaat    420
acccgcaaac gcgcccagag cattattagc aaaaataatc tgtacagcat caccgatccg    480
cgtgaaaatc tgctggaaat tattgaaaaa atcctgctgg gtggtgtgaa aattattcag    540
catcgcttca aagaaggtaa tgataaagat catctgaagg aagcaattga aattaataag    600
ctgtgcaaaa agtacaacag tctgttcatt gttaacaatc gcctggatat tgccctggca    660
agcaaagccg atggcgttca tctgggccag gatgatctgg atattaaaac cgttcgtaaa    720
ctgctgggtg ccagtaaaat tattggtgtt agtgcaaata acagcaccga tattaataaa    780
gcagttaaag atggttgcga ttatattggc gtgggtccgg tgttcccgac cctgaccaaa    840
aaaaataaag aaccgctggg cgaagaaaaa attaaagccc tgaccaaaga actgaatatt    900
ccgtgcttcg caattggcgg cattaataaa ctgaatatta gcagtctgaa gaaccacggt    960
attagcaaag ttgccattgt tagcggcctg ctgaatagtg aagatccgaa agatgaagcc   1020
atgattatta ttaaagagct gagccatgaa aactga                             1056

<210> 61
<211> 462
<212> PRT
<213> Peptoniphilus sp.

<220>
<223> HTK9

<400> 61
Met Lys Ile Lys Cys Asn Leu Arg Lys Leu Arg Asp Lys Ala Pro Leu
1               5                   10                  15
Val His Ile Ile Ser Asn Gly Val Thr Arg Gly Arg Val Ala Asp Phe
            20                  25                  30
Val Leu Ser Thr Gly Ala Ser Pro Met Met Ala Glu Tyr Ser Lys Glu
        35                  40                  45
Val Ser Glu Ile Thr Lys Lys Ala Ser Ala Leu Val Leu Asn Met Gly
    50                  55                  60
Met Leu Asn Glu Asp Lys Ile Glu Ala Ile Lys Ile Ala Gly Lys Thr
65                  70                  75                  80
Ala Lys Glu Asn Asn Ile Pro Thr Val Leu Asp Pro Val Gly Val Ala
                85                  90                  95
Ser Ser Lys Ile Arg Arg Asp Leu Ala Glu Tyr Leu Leu Asp Asn Phe
            100                 105                 110
Lys Phe Asn Val Ile Arg Gly Asn Phe Asn Glu Ile Asn Tyr Leu Val
        115                 120                 125
Gly Gly Gln Ala Phe Ala Gly Ile Asp Ser Arg Asp Lys Asn Leu Ser
    130                 135                 140
Glu Glu Asp Phe Lys Glu Leu Ala Val Lys Met Asn Glu Lys Ser Gly
145                 150                 155                 160
Ala Thr Val Val Val Ser Gly Lys Tyr Glu Val Ile Ala Asn Ser His
                165                 170                 175
Met Leu Ile Ser Ile Pro Gly Gly His Asp Asp Phe Arg Lys Ile Ser
            180                 185                 190
Gly Leu Gly Asp Met Glu Ser Ala Met Ile Gly Ser Leu Leu Ala Thr
        195                 200                 205
Pro Met Ser Asn Leu Lys Ala Cys Ala Ile Ser Ala Ile Phe Leu Arg
    210                 215                 220
Gln Leu Ala Arg Glu Val Ile Val Asp Gly Ser Ile Lys Ala Gln Asp
225                 230                 235                 240
Ile Ile Ser Lys Val Gln Lys Leu Glu Glu Ile Ser Gly Glu Ile Glu
                245                 250                 255
Ile Leu Ser Pro Ser Tyr Lys Phe Lys Lys Pro Ser Leu Tyr Gly Ile
            260                 265                 270
Ser Asp Gly Asn Asp Leu Met Lys Ile Lys Asn Ala Thr Arg Ala Gly
        275                 280                 285
Met Lys Ile Tyr Gln Leu Arg Asp Lys Thr Ser Glu Glu Ala Leu Leu
    290                 295                 300
Gly Glu Lys Ile Leu Lys Ile Lys Lys Glu Ile Glu Glu Asp Cys Leu
305                 310                 315                 320
Phe Ile Leu Asn Asp Asn Leu Lys Leu Ala Lys Glu Tyr Lys Thr Ser
                325                 330                 335
Leu His Leu Gly Gln Asp Asp Glu Glu Ile Ser Leu Ala Arg Arg Ile
            340                 345                 350
Leu Gly Arg Asp Pro Ile Ile Gly Ala Thr Ala Lys Thr Pro Glu Leu
        355                 360                 365
Ala Ile Glu Ala Glu Asn Met Gly Ala Ser Tyr Leu Gly Ser Gly Ala
    370                 375                 380
Phe Phe Glu Thr Glu Thr Lys Arg Asp Ala Ser Met Ile Asn Leu Glu
385                 390                 395                 400
Ile Tyr Glu Glu Ile Arg Asp Ser Ile Leu Ile Pro Ala Phe Pro Ile
                405                 410                 415
Gly Gly Ile Asn Leu Glu Asn Leu Asp Leu Phe Lys Gly Val Glu Ile
            420                 425                 430
Pro Gly Leu Cys Met Ser Ser Gly Ile Phe Ser Leu Glu Glu Asn Glu
        435                 440                 445
Val Glu Lys Asn Val Arg Glu Ile Ile Lys Lys Leu Gly Asp
    450                 455                 460

<210> 62
<211> 1389
<212> DNA
<213> Peptoniphilus sp.

<220>
<223> HTK9 wild type

<400> 62
atgaaaatta aatgcaattt aagaaaacta agggacaagg ctcccttggt acatattatt     60
tccaatggag ttaccagggg aagggtagct gactttgtcc tctctacagg tgcgagtcca    120
atgatggcag agtattcaaa agaagtatct gaaatcacaa aaaaagcttc agccctagtt    180
ttaaatatgg gaatgttaaa tgaagataaa attgaagcaa ttaaaattgc agggaaaaca    240
gcaaaagaaa ataatattcc cactgttctt gatcctgttg gtgttgcttc tagcaagata    300
agaagagatt tagcagaata tcttcttgat aattttaagt tcaatgtcat taggggaaat    360
tttaatgaaa tcaattattt ggttgggggt caagcctttg ctggcataga ttcaagagac    420
aaaaatttat cggaagagga ctttaaagaa cttgcagtga agatgaatga aaaaagtgga    480
gcaactgtag ttgtaagtgg caagtatgaa gtaattgcta actctcacat gcttatttct    540
atacctggtg gtcatgatga ttttagaaaa ataagtgggc ttggggatat ggaatcagcg    600
atgataggct cacttttggc gacacctatg tcaaacctaa aggcttgtgc catttcggca    660
atatttttaa gacagcttgc aagagaagta atagttgatg gaagtataaa ggctcaagac    720
ataataagta aagttcaaaa gcttgaggaa ataagtggag aaattgaaat tttatcgcca    780
agctataaat ttaaaaagcc aagtttatat ggaatttctg atggtaatga tttgatgaaa    840
ataaaaaatg ccacaagagc tgggatgaaa atttaccagc taagagataa gacatcagag    900
gaagctttgc ttggagaaaa aattttaaag ataaaaaagg aaattgaaga agattgtctc    960
ttcatcttaa acgataattt aaaacttgca aaagaatata agacatccct tcacctggga   1020
caagatgatg aagagatttc tcttgcaaga agaattctgg gtagagaccc aataattgga   1080
gcaactgcaa aaactcctga gcttgccata gaagcagaaa atatgggagc atcctacctt   1140
ggttccggtg ccttttttga aacggaaact aagagggatg catcaatgat taatctagaa   1200
atttacgagg aaataagaga tagcatttta ataccagcct tcccaatagg tggtataaat   1260
ttggagaacc tagacttgtt taagggggta gagatacctg gcctttgtat gtctagtgga   1320
attttttccc ttgaagagaa tgaagtggag aaaaatgtaa gagaaataat aaaaaagtta   1380
ggtgattaa                                                           1389

<210> 63
<211> 1389
<212> DNA
<213> Artificial Sequence

<220>
<223> HTK9 codon optimised

<400> 63
atgaagatca agtgcaatct gcgtaaactg cgtgataaag ccccgctggt tcatattatt     60
agtaatggcg ttacccgtgg tcgcgtggca gacttcgtgc tgagtaccgg tgcaagcccg    120
atgatggcag aatatagtaa agaagtgagt gaaattacca agaaagcaag cgccctggtt    180
ctgaatatgg gtatgctgaa tgaagataaa atcgaagcca ttaaaatcgc aggcaaaacc    240
gccaaagaaa ataatattcc gaccgttctg gaccctgttg gcgttgcaag cagcaaaatt    300
cgtcgtgatc tggccgaata tctgctggat aacttcaaat tcaatgttat tcgcggcaac    360
ttcaatgaaa ttaattatct ggtgggtggt caggcattcg ccggtattga tagccgcgat    420
aaaaatctga gtgaagaaga cttcaaagaa ctggccgtga aaatgaatga aaaaagcggt    480
gccaccgttg ttgttagtgg taaatatgaa gttatcgcaa atagccacat gctgattagt    540
attccgggcg gtcatgatga cttccgcaaa attagtggtc tgggcgatat ggaaagtgca    600
atgattggca gtctgctggc aaccccgatg agcaatctga aagcatgtgc aattagcgca    660
atcttcctgc gtcagctggc acgcgaagtg attgttgatg gtagtattaa agcccaggat    720
attattagta aggttcagaa actggaagaa attagcggtg aaattgaaat tctgagcccg    780
agctataaat tcaaaaaacc gagcctgtat ggcattagtg atggcaatga tctgatgaaa    840
attaaaaacg ccacccgtgc aggtatgaaa atctatcagc tgcgtgataa gaccagtgaa    900
gaagcactgc tgggcgaaaa aattctgaaa attaaaaagg agatcgagga agattgcctg    960
ttcattctga atgataatct gaaactggcc aaagaatata aaaccagtct gcatctgggc   1020
caggatgatg aagaaattag tctggcacgc cgtattctgg gccgcgatcc gattattggc   1080
gcaaccgcaa aaacacctga actggcaatt gaagcagaaa atatgggcgc cagctatctg   1140
ggcagcggtg cattcttcga aaccgaaacc aaacgtgatg ccagcatgat taatctggaa   1200
atctatgaag aaatccgtga tagcattctg attccggcct tcccgattgg tggtattaat   1260
ctggagaatc tggatctgtt caaaggtgtg gaaattccgg gcctgtgtat gagtagtggt   1320
atcttcagcc tggaagaaaa tgaagtggaa aaaaatgtgc gcgaaattat taaaaagctg   1380
ggtgattga                                                           1389

<210> 64
<211> 298
<212> PRT
<213> Clostridium gasigenes

<220>
<223> HTK10

<400> 64
Met Lys Ile Ser Ile Glu Ala Leu Thr Glu Val Leu Asn Leu Gln Lys
1               5                   10                  15
Glu Lys Gln Pro Leu Ile His Cys Ile Ser Ser Met Val Thr Met Asn
            20                  25                  30
Asp Leu Ala Gln Gly Ile Leu Ser Tyr Asn Gly Lys Pro Ile Met Ala
        35                  40                  45
Pro Gly Ile Asp Glu Val Gly Glu Ile Thr Ala Ser Ala Asn Ala Leu
    50                  55                  60
Leu Ile Asn Leu Gly Thr Leu Asp Ser Ser Arg Val Glu Ala Met Glu
65                  70                  75                  80
Lys Ser Ile Arg Ile Ala Ser Lys Lys Asn Lys Pro Ile Val Leu Asp
                85                  90                  95
Ala Ile Gly Val Asp Ile Ser Phe Phe Arg Arg Glu Ile Ala Leu Val
            100                 105                 110
Phe Leu Thr Arg Tyr Lys Ile Asp Val Ile Lys Gly Asn Val Ser Glu
        115                 120                 125
Ile Lys Ala Leu Leu Glu Lys Lys Pro Lys Lys Asn Lys Glu His Lys
    130                 135                 140
Glu Ile Ile Glu Ser Lys Glu Gln Asn Arg Asn Asn Glu Asn Glu Glu
145                 150                 155                 160
Phe Val Lys Asn Thr Ile Lys Asp Asp Tyr Glu Ile Arg Glu Gln Met
                165                 170                 175
Arg Glu Phe Ser Lys Lys Tyr Lys Ser Ile Leu Ile Ala Thr Gly Asn
            180                 185                 190
Glu Asp Tyr Ile Thr Asp Gly Phe Ser Glu Phe Phe Ile Asn Asn Gly
        195                 200                 205
Asn Asn Glu Phe Asp Arg Val Val Gly Val Asp Ser Leu Leu Gly Gly
    210                 215                 220
Leu Ile Ser Val Gly Val Ala Val Ala Arg Thr Asn Ala Glu Lys Val
225                 230                 235                 240
Gln Ala Val Leu Ile Ala Ile Met Thr Met Gly Val Ser Lys Glu Leu
                245                 250                 255
Ala Tyr Glu Lys Met Asp Lys Lys Gln Gly Leu Ile Ser Leu Lys Asn
            260                 265                 270
Ser Leu Ile Asp Glu Ile Ser Leu Ile Asn Asn Lys Lys Leu Glu Ala
        275                 280                 285
Met Gly Lys Ile Ser Tyr Ile Phe Lys Arg
    290                 295

<210> 65
<211> 897
<212> DNA
<213> Clostridium gasigenes

<220>
<223> HTK10 wild type

<400> 65
atgaaaataa gtatagaggc attgactgaa gttctaaatt tacagaagga aaagcaacct     60
cttatacatt gtatttcaag tatggttact atgaatgatt tagcacaggg aattttaagt    120
tataatggga aaccaattat ggcacctggc attgatgaag taggagaaat aacagctagc    180
gctaatgcgt tactaattaa tcttggaaca ttagatagta gtagggttga ggctatggag    240
aaatctatta gaatagcttc aaaaaaaaat aaacctatag ttttagatgc tataggagtt    300
gatatttctt tttttaggag agaaatagct ttggtgttct taaccagata taaaatagat    360
gttataaagg gcaatgtatc agaaattaaa gcattactag agaaaaagcc taagaaaaat    420
aaagaacaca aagaaattat agaatctaaa gaacaaaata gaaataatga aaatgaagaa    480
tttgttaaaa atactataaa agatgattat gaaattagag aacaaatgag agagttttct    540
aaaaaatata aaagcatatt aatagcaaca ggaaatgaag attatataac tgatggattt    600
agtgagtttt ttattaataa tggaaataat gaatttgata gagtagttgg agtggatagt    660
ttattaggtg ggttaatttc agtgggggtg gcagtagcca gaacaaatgc agaaaaagta    720
caagccgtgc taattgcaat aatgactatg ggtgtaagca aagaactagc ttatgaaaag    780
atggacaaaa aacagggact aatatcttta aaaaactctt taatagatga aatttccctt    840
ataaataata aaaaattaga agctatgggg aaaatatcat atatatttaa aaggtag       897

<210> 66
<211> 897
<212> DNA
<213> Artificial Sequence

<220>
<223> HTK10 codon optimised

<400> 66
atgaagatca gtatcgaagc cctgaccgaa gttctgaatc tgcaaaaaga aaaacagccg     60
ctgattcatt gcattagcag catggttacc atgaatgatc tggcccaggg tattctgagt    120
tataatggca aaccgattat ggcccctggt attgatgaag ttggcgaaat taccgcaagc    180
gccaatgccc tgctgattaa tctgggcacc ctggatagta gtcgtgttga agcaatggaa    240
aaaagtattc gtattgcaag caaaaagaac aaaccgattg ttctggatgc aattggcgtg    300
gatattagct tcttccgtcg tgaaattgcc ctggtgttcc tgacccgcta taaaattgat    360
gtgattaaag gtaacgtgag cgaaattaaa gcactgctgg aaaaaaaacc gaaaaaaaat    420
aaggagcaca aggaaattat cgaaagtaaa gaacagaacc gtaataatga aaacgaagag    480
ttcgttaaaa acaccattaa agatgattac gagatccgtg aacagatgcg tgagttcagt    540
aaaaaatata aaagcatcct gatcgcaacc ggtaatgaag attatattac cgatggcttc    600
agtgagttct tcattaataa tggtaacaac gagttcgatc gtgttgttgg tgtggatagt    660
ctgctgggtg gcctgattag tgtgggcgtg gcagttgcac gtaccaatgc agaaaaagtg    720
caggcagtgc tgattgcaat tatgacaatg ggtgtgagta aagaactggc ctatgaaaaa    780
atggataaaa aacagggtct gattagtctg aaaaatagtc tgattgatga gattagcctg    840
attaataata agaagctgga agcaatgggc aaaattagct atatcttcaa acgttga       897

<210> 67
<211> 264
<212> PRT
<213> Thermanaerovibrio acidaminovorans

<220>
<223> HTK11

<400> 67
Met Arg Val Ser His Leu Ala Ser Glu Val Ala Arg Leu Ser Pro Leu
1               5                   10                  15
Val Tyr His Val Thr Asn Trp Val Ser Gly Pro Leu Ser Ala Arg Val
            20                  25                  30
Cys Tyr Ala Leu Gly Gly Arg Ala Leu Met Thr Thr His Pro Glu Glu
        35                  40                  45
Ala Leu Glu Ala Ala Arg Met Ser Gln Ala Leu Leu Leu Asn Leu Gly
    50                  55                  60
Thr Pro Thr Glu Asp Arg Val Val Ser Ile Arg Arg Ala Leu Asp Gly
65                  70                  75                  80
Ala Gly Asp Arg Pro Ala Leu Leu Asp Pro Val Gly Val Gly Ser Phe
                85                  90                  95
Pro Gly Arg Leu Asp Leu Ala Met Glu Ile Leu Ser Arg Gly Ile Ser
            100                 105                 110
Ile Leu Lys Gly Asn Gly Ala Glu Ile Ser Ala Leu Leu Gly Glu Gly
        115                 120                 125
Lys Gly Gln Arg Gly Val Asp Ser Asp Leu Pro Gly Pro Pro Leu Gly
    130                 135                 140
Val Arg Arg Leu Ala Glu Asp His Arg Cys Cys Ala Val Met Thr Gly
145                 150                 155                 160
Glu Glu Asp His Val Ala Leu Gly Val Ser Trp Gly Leu Val Arg Leu
                165                 170                 175
Arg Gly Arg Glu Val Arg Gly Ala Val Pro Val Pro Gly Leu Gly Cys
            180                 185                 190
Ala Leu Gly Ser Ala Met Ala Cys Ala Leu Gly Val Gly Ala Asp Pro
        195                 200                 205
Phe Ser Ala Ala Leu Trp Gly Cys Ala Leu Phe Lys Gly Ala Leu Arg
    210                 215                 220
Arg Ala Leu Gly Ala Cys Cys Gly Pro Gly Ser Leu Val Glu Ala Leu
225                 230                 235                 240
Ile Asp Gln Leu His Arg Ala Arg Thr Gly Glu Leu Asp Gly Glu Asn
                245                 250                 255
Val Glu Val Ile Arg Ala Asp Gly
            260

<210> 68
<211> 795
<212> DNA
<213> Thermanaerovibrio acidaminovorans

<220>
<223> HTK11 wild type

<400> 68
ttgagggtct cccacctggc atcggaggtg gcccggctat cccctctggt ctatcacgtc     60
accaactggg tgtcgggccc cctcagcgcc cgggtctgtt acgccctggg gggcagggcg    120
ctgatgacca cccatccgga ggaggccctg gaggcggccc ggatgtccca ggcgctgctg    180
ctcaacctgg ggacccccac ggaggaccgg gtcgtgtcca tccggcgggc tctggacggg    240
gcgggggatc ggccggcgct gttggatccg gtgggggtcg ggtccttccc tgggcgcctg    300
gacctggcta tggagatcct ctcccggggg atctcgatcc tcaaggggaa cggggcggag    360
atctccgccc tgcttgggga ggggaaggga cagcgggggg tggactcgga cctgccggga    420
ccgccccttg gggtccggag gctggcggag gaccaccggt gttgcgcggt catgaccggg    480
gaggaggacc acgtggccct aggggtgagc tgggggctgg tgaggctccg ggggagggag    540
gttcgagggg cggtgccggt gcccgggctt ggatgcgccc tgggtagcgc catggcctgc    600
gccctggggg tgggggcgga ccccttctcc gccgccctgt ggggctgcgc cctcttcaag    660
ggggccctcc ggagggccct tggggcctgc tgtggccccg gcagcctggt ggaggccctc    720
atagaccagc tacaccgggc caggaccggg gagctggatg gggagaacgt ggaggtgatc    780
cgggcagatg gatga                                                     795

<210> 69
<211> 795
<212> DNA
<213> Artificial Sequence

<220>
<223> HTK11 codon optimised

<400> 69
atgcgtgtga gtcatctggc cagtgaagtg gcacgtctga gcccgctggt gtatcatgtg     60
accaattggg tgagcggtcc gctgagcgcc cgcgtgtgtt atgccctggg tggccgcgca    120
ctgatgacca cccatccgga agaagcactg gaagcagcac gtatgagcca ggcactgctg    180
ctgaatctgg gtaccccgac cgaagatcgc gtggtgagta ttcgtcgtgc cctggatggt    240
gcaggtgatc gtccggcact gctggaccct gttggcgttg gtagcttccc tggtcgtctg    300
gatctggcaa tggaaattct gagtcgtggc attagtattc tgaaaggtaa tggcgccgaa    360
attagcgcac tgctgggtga aggtaaaggc cagcgtggcg ttgatagtga tctgccgggt    420
ccgccgctgg gcgtgagaag actggcagaa gatcatcgtt gctgcgccgt gatgaccggc    480
gaagaagatc atgttgcact gggcgtgagc tggggtctgg ttcgtctgcg tggtcgtgaa    540
gttcgtggtg cagttccggt tccgggcctg ggctgtgcac tgggtagtgc aatggcttgt    600
gcactgggcg ttggtgccga tccgttcagc gcagccctgt ggggttgtgc actgttcaaa    660
ggcgccctgc gccgtgccct gggtgcttgc tgtggccctg gtagcctggt ggaagccctg    720
attgatcagc tgcatcgcgc acgtaccggt gaactggatg gcgaaaatgt ggaagtgatt    780
cgtgccgatg gctga                                                     795

<210> 70
<211> 223
<212> PRT
<213> Dictyoglomus turgidum

<220>
<223> HTK12

<400> 70
Met Asn Lys Lys Glu Lys Leu Glu Leu Leu Lys Asp Phe Asn Leu Tyr
1               5                   10                  15
Cys Leu Thr Cys Glu Glu Tyr Ser Ile Gly Arg Lys Asn Ile Asp Val
            20                  25                  30
Val Arg Glu Ile Leu Glu Ala Gly Val Lys Ile Ile Gln Tyr Arg Glu
        35                  40                  45
Lys Lys Lys Pro Met Arg Glu Lys Tyr His Glu Val Val Lys Ile Arg
    50                  55                  60
Asp Leu Thr Ala Lys Tyr Asn Ala Leu Leu Ile Val Asn Asp His Leu
65                  70                  75                  80
Asp Leu Thr Lys Ile Val Glu Ala Asp Gly Val His Ile Gly Gln Glu
                85                  90                  95
Asp Tyr Pro Ile Glu Val Ala Lys Glu Phe Leu Gly Glu Asn Phe Ile
            100                 105                 110
Ile Gly Leu Thr Thr His Thr Lys Glu Gln Val Met Glu Ala Leu Arg
        115                 120                 125
Lys Gly Ala Asp Tyr Ile Gly Leu Gly Pro Ile Phe Pro Ser Tyr Thr
    130                 135                 140
Lys Glu Lys Pro His Pro Pro Ile Gly Ile Glu Ile Leu Asp Trp Ala
145                 150                 155                 160
Ile Lys Asn Ile Ser Ile Pro Val Val Ala Ile Gly Gly Ile Lys Glu
                165                 170                 175
Ser Asn Ile His Glu Ile Leu Asn Leu Gly Ala Lys Cys Ile Ala Met
            180                 185                 190
Val Thr Glu Ile Val Ser Ser Pro Asn Ile Tyr Glu Lys Thr Arg Lys
        195                 200                 205
Ile Ile His Ile Leu Glu Gly Tyr Lys Asn Gly Lys Tyr Ile Ala
    210                 215                 220

<210> 71
<211> 672
<212> DNA
<213> Dictyoglomus turgidum

<220>
<223> HTK12 wild type

<400> 71
atgaataaaa aagaaaaatt agagctatta aaagatttca atctatactg cctaacctgt     60
gaagaatatt ccataggaag gaagaatatt gatgtggtaa gagaaattct tgaagcaggg    120
gtaaagatta tacaataccg agaaaagaaa aaacctatga gagaaaaata tcatgaagtc    180
gtaaaaataa gagatttaac tgctaagtat aatgctttac ttatagtaaa tgatcaccta    240
gatcttacaa aaatagttga agcagatggg gtacatattg gacaagaaga ttatcctata    300
gaggttgcta aagagttctt aggggaaaac ttcattatag gtctcactac tcatacaaag    360
gaacaagtta tggaggcatt acgaaaagga gctgattata ttggacttgg acctatattt    420
ccaagttata caaaagaaaa acctcaccca ccaatcggaa tagagattct tgattgggct    480
atcaagaata tcagtattcc cgttgttgcc attgggggta taaaagaatc caatatccat    540
gagatactaa atctgggagc caaatgtata gctatggtga ctgagattgt ttcttctcca    600
aatatttatg agaaaacgag aaaaatcatc cacatattgg agggatataa aaatggaaaa    660
tacattgctt ga                                                        672

<210> 72
<211> 672
<212> DNA
<213> Artificial Sequence

<220>
<223> HTK12 codon optimised

<400> 72
atgaacaaga aggaaaagct ggaactgctg aaagacttca atctgtattg tctgacctgt     60
gaagaatata gtattggccg taaaaatatc gatgttgtgc gcgaaattct ggaagcaggc    120
gtgaaaatta ttcagtatcg tgaaaaaaag aagccgatgc gcgaaaaata tcatgaagtt    180
gtgaaaattc gcgatctgac cgcaaaatat aatgcactgc tgattgtgaa tgatcatctg    240
gatctgacca aaattgttga agccgatggc gtgcatattg gtcaggaaga ttatccgatt    300
gaagtggcaa aagagttcct gggcgaaaac ttcattattg gtctgaccac ccataccaaa    360
gaacaggtga tggaagccct gcgtaaaggc gcagattata ttggcctggg tccgatcttc    420
ccgagttata ccaaagaaaa accgcatccg ccgattggca ttgaaattct ggattgggca    480
attaaaaaca ttagcattcc ggttgtggca attggtggta ttaaagaaag caatattcac    540
gaaatcctga atctgggtgc aaaatgtatt gcaatggtta ccgaaattgt gagtagtccg    600
aatatctatg aaaaaaccag aaaaatcatc cacatcctgg aaggctataa aaatggcaaa    660
tatattgcct ga                                                        672

<210> 73
<211> 565
<212> PRT
<213> Aspergillus niger

<220>
<223> IP1

<400> 73
Met Ser Ser Tyr Gly Gly Gly Gly Gly Tyr Gln Arg Asp Ser Tyr Arg
1               5                   10                  15
Ser Arg Asn Gly Gly Gly Gly Gly Gly Tyr Gly Asn Gly Asn Gly Tyr
            20                  25                  30
Ser Asn Gly Gly Gly Tyr Gly Gly Gly Gly Gly Gly Gly Gly Tyr Gly
        35                  40                  45
Gly Gly Gly Tyr Gly Gly Gly Gly Tyr Gly Gly Gly Gly Tyr Gly Gly
    50                  55                  60
Arg Gly Gly Gly Ala Gly Gly Ala Gly Gly Asp Arg Met Ser Asn Leu
65                  70                  75                  80
Gly Ala Gly Leu Lys Lys Gln Glu Trp Asp Leu Asp Ser Leu Pro Lys
                85                  90                  95
Phe Glu Lys Ser Phe Tyr Lys Glu His Ala Asp Val Ala Glu Arg Ser
            100                 105                 110
Gln Arg Asp Val Asp Glu Phe Arg Lys Lys His Glu Met Ala Val Gln
        115                 120                 125
Gly Arg Asn Val Pro Arg Pro Val Glu Thr Phe Asp Glu Ala Gly Phe
    130                 135                 140
Pro Gln Tyr Val Leu Ser Glu Val Lys Ala Gln Gly Phe Asp Arg Pro
145                 150                 155                 160
Thr Ala Ile Gln Ser Gln Gly Trp Pro Met Ala Leu Ser Gly Arg Asp
                165                 170                 175
Val Val Gly Ile Ala Glu Thr Gly Ser Gly Lys Thr Leu Thr Tyr Cys
            180                 185                 190
Leu Pro Ala Ile Val His Ile Asn Ala Gln Pro Leu Leu Ala Pro Gly
        195                 200                 205
Asp Gly Pro Ile Val Leu Ile Leu Ala Pro Thr Arg Glu Leu Ala Val
    210                 215                 220
Gln Ile Gln Ala Glu Ile Ser Lys Phe Gly Lys Ser Ser Arg Ile Arg
225                 230                 235                 240
Asn Thr Cys Val Tyr Gly Gly Val Pro Lys Gly Pro Gln Ile Arg Asp
                245                 250                 255
Leu Ser Arg Gly Val Glu Val Cys Ile Ala Thr Pro Gly Arg Leu Ile
            260                 265                 270
Asp Met Leu Glu Ala Gly Arg Thr Asn Leu Arg Arg Val Thr Tyr Leu
        275                 280                 285
Val Leu Asp Glu Ala Asp Arg Met Leu Asp Met Gly Phe Glu Pro Gln
    290                 295                 300
Ile Arg Lys Ile Ile Ser Gln Ile Arg Pro Asp Arg Gln Thr Cys Met
305                 310                 315                 320
Trp Ser Ala Thr Trp Pro Lys Glu Val Arg Gln Leu Ala Ser Asp Phe
                325                 330                 335
Leu Asn Asp Tyr Ile Gln Val Asn Ile Gly Ser Met Asp Leu Ser Ala
            340                 345                 350
Asn His Arg Ile Thr Gln Ile Val Glu Val Val Ser Asp Phe Glu Lys
        355                 360                 365
Arg Asp Lys Met Ile Lys His Leu Glu Lys Ile Met Glu Asn Arg Ala
    370                 375                 380
Asn Lys Cys Leu Ile Phe Thr Gly Thr Lys Arg Ile Ala Asp Glu Ile
385                 390                 395                 400
Thr Arg Phe Leu Arg Gln Asp Gly Trp Pro Ala Leu Ser Ile His Gly
                405                 410                 415
Asp Lys Gln Gln Gln Glu Arg Asp Trp Val Leu Asn Glu Phe Lys Thr
            420                 425                 430
Gly Lys Ser Pro Ile Met Val Ala Thr Asp Val Ala Ser Arg Gly Ile
        435                 440                 445
Asp Val Arg Asp Ile Thr His Val Leu Asn Tyr Asp Tyr Pro Asn Asn
    450                 455                 460
Ser Glu Asp Tyr Val His Arg Ile Gly Arg Thr Gly Arg Ala Gly Ala
465                 470                 475                 480
Lys Gly Thr Ala Ile Thr Phe Phe Thr Thr Asp Asn Ser Lys Gln Ala
                485                 490                 495
Arg Asp Leu Val Thr Ile Leu Thr Glu Ala Lys Gln Gln Ile Asp Pro
            500                 505                 510
Arg Leu Ala Glu Met Val Arg Tyr Ser Gly Gly Gly Gly His Gly His
        515                 520                 525
Gly Gly Tyr Gly Arg Trp Gly Gly Arg Gly Gly Gly Arg Gly Gly Gly
    530                 535                 540
Arg Gly Arg Gly Asn His Phe Thr Ala Ser Asn Ala Ala Pro Leu Gly
545                 550                 555                 560
Gly Asn Arg Arg Trp
                565

<210> 74
<211> 1698
<212> DNA
<213> Aspergillus niger

<220>
<223> IP1 wild type

<400> 74
atgtcttcct acggcggcgg cggcggctac cagcgcgact cctaccgctc cagaaacggt     60
ggcggcggcg gcggctacgg taacggtaac ggttactcca acggcggagg ctacggtggt    120
ggtggtggtg gtggtggtta cggcggtggt ggctatggcg gcggcggcta cggcggcggc    180
ggctatggtg gcagaggtgg tggtgccggc ggtgctggcg gagaccgcat gtccaacctg    240
ggcgccggtc tgaagaagca ggaatgggat cttgactccc tgcccaagtt cgagaagtcc    300
ttctacaagg aacacgccga tgtcgccgag cgctctcagc gtgatgttga cgagttccgt    360
aagaagcacg agatggctgt tcagggaaga aacgtccctc gccctgtcga gaccttcgac    420
gaggccggtt tccctcaata cgttctcagc gaggtcaagg cccagggctt cgaccgccct    480
accgctattc agtctcaggg ttggcccatg gccctctctg gtcgcgacgt tgtcggtatc    540
gctgagacgg gttccggaaa gaccctgacc tactgtcttc ctgccattgt tcacatcaac    600
gcccagcccc tcctcgcccc cggtgacggc cccattgtcc ttatcctcgc ccccacccgt    660
gaattggccg ttcagattca agccgaaatc tccaagttcg gaaagtcttc ccgtatccgc    720
aacacctgtg tctacggtgg tgtccccaag ggtcctcaga tccgtgacct gagccgtggt    780
gtggaagtct gcattgcgac tcccggtcgt ctgattgaca tgctcgaggc tggtcgcacc    840
aaccttcgtc gtgtcaccta cctcgttctg gatgaggccg atcgcatgct ggacatgggt    900
ttcgagcccc agatccgcaa gatcatctcc cagattcgcc ctgaccgtca gacctgcatg    960
tggtccgcta catggcccaa ggaggtccgt cagcttgcgt ctgacttcct caacgactac   1020
atccaggtta acattggttc catggatctg tcggccaacc accgtatcac tcagatcgtc   1080
gaggtcgtct cggacttcga gaagcgcgac aagatgatca agcacctcga gaagatcatg   1140
gagaaccgtg ccaacaagtg ccttatcttc accggcacca agcgcatcgc tgacgaaatc   1200
actcgcttcc tccgccagga cggatggccg gcactttcta ttcacggtga taagcaacag   1260
caagaaagag attgggtctt gaacgagttc aagacgggca agagcccaat catggtggct   1320
actgatgtgg cttcccgtgg tatcgatgtg cgcgacatca cacacgttct caactatgac   1380
taccccaaca actcggagga ctacgttcac cgtattggta gaactggtcg tgccggtgct   1440
aagggtaccg ccatcacctt ctttaccact gacaactcca agcaggctcg tgacttggtc   1500
accattctca ctgaggccaa gcagcagatt gacccccgtc tcgccgagat ggtccgctac   1560
agtggcggcg gtggtcatgg ccacggtggc tatggccgct ggggtggccg tggtggtggc   1620
cgtggtggtg gccgtggccg cggcaaccac ttcactgctt ccaatgctgc tcctcttggt   1680
ggcaaccgtc gctggtaa                                                 1698

<210> 75
<211> 1698
<212> DNA
<213> Artificial Sequence

<220>
<223> IP1 codon optimised

<400> 75
atgagtagtt acggtggcgg tggcggttat cagcgcgata gttatcgtag ccgcaatggc     60
ggcggtggcg gcggttatgg taatggcaat ggttatagta atggcggtgg ctatggtggt    120
ggcggtggtg gcggcggata tggtggtggt ggttatggcg gtggcggata tggcggcggt    180
ggttatggtg gccgtggcgg tggcgccggt ggtgcaggtg gtgatcgcat gagcaatctg    240
ggcgccggcc tgaaaaaaca ggaatgggat ctggatagcc tgccgaaatt cgaaaaatca    300
ttctataaag agcacgcaga tgttgccgaa cgtagtcagc gcgatgttga tgagttccgc    360
aaaaaacatg aaatggcagt tcagggccgt aatgtgccgc gcccggtgga aaccttcgat    420
gaagcaggct tcccgcagta tgttctgagc gaagtgaaag cacagggctt cgatcgtccg    480
accgccattc agagtcaggg ttggccgatg gccctgagtg gccgcgatgt tgtgggcatt    540
gcagaaaccg gtagtggcaa aaccttaacc tattgtctgc cggcaattgt gcatattaat    600
gcacagccgc tgctggcacc gggtgatggt ccgattgttc tgattctggc cccgacccgt    660
gaactggccg ttcagattca ggccgaaatt agcaaattcg gcaaaagcag ccgtattcgc    720
aatacctgcg tgtatggtgg tgttccgaaa ggtccgcaga ttcgcgatct gagtcgtggt    780
gtggaagtgt gtattgcaac ccctggtcgc ctgattgata tgctggaagc aggccgcacc    840
aatctgcgtc gcgttaccta tctggttctg gatgaagccg atcgtatgct ggatatgggc    900
ttcgaaccgc agattcgtaa aattattagt cagattcgtc cggatcgtca gacctgcatg    960
tggagcgcaa cctggccgaa agaagtgcgt cagctggcaa gtgacttcct gaatgattat   1020
attcaggtga atatcggtag tatggatctg agcgcaaatc atcgcattac ccagattgtg   1080
gaagttgtta gcgacttcga aaaacgcgat aaaatgatta aacacctgga aaaaatcatg   1140
gaaaaccgtg caaataaatg cctgatcttc accggtacca aacgcattgc cgatgaaatt   1200
acccgcttcc tgcgtcagga tggttggccg gccctgagta ttcatggtga taaacagcag   1260
caggaacgtg attgggtgct gaatgagttc aaaaccggca aaagtccgat tatggttgcc   1320
accgatgttg caagtcgcgg tattgatgtg cgcgatatta cccatgtgct gaattatgat   1380
tatccgaata atagtgagga ctatgttcat cgtattggtc gcaccggccg tgccggcgca   1440
aaaggtaccg ccattacctt cttcaccacc gataatagca aacaggcccg cgatctggtt   1500
accattctga ccgaagcaaa acagcagatt gatccgcgtc tggccgaaat ggttcgctat   1560
agcggcggcg gtggacatgg ccacggtggt tatggacgct ggggtggccg tggtggcggc   1620
cgtggtggtg gtagaggtcg cggtaatcac ttcaccgcaa gtaatgcagc cccgctgggt   1680
ggtaatcgtc gttggtga                                                 1698

<210> 76
<211> 266
<212> PRT
<213> Streptomyces viridochromogenes

<220>
<223> IP2

<400> 76
Met Thr Pro Asp Phe Leu Ala Ile Lys Val Gly Gly Ser Leu Phe Ser
1               5                   10                  15
Arg Lys Asp Glu Pro Gly Ser Leu Asp Asp Asp Ala Val Thr Arg Phe
            20                  25                  30
Ala Arg Asn Phe Ala Arg Leu Ala Glu Thr Tyr Arg Gly Arg Met Val
        35                  40                  45
Leu Ile Ser Gly Gly Gly Ala Phe Gly His Gly Ala Ile Arg Asp His
    50                  55                  60
Asp Ser Thr His Ala Phe Ser Leu Ala Gly Leu Thr Glu Ala Thr Phe
65                  70                  75                  80
Glu Val Lys Lys Arg Trp Ala Glu Lys Leu Arg Gly Ile Gly Val Asp
                85                  90                  95
Ala Phe Pro Leu Gln Leu Ala Ala Met Cys Thr Leu Arg Asn Gly Ile
            100                 105                 110
Pro Gln Leu Arg Ser Glu Val Leu Arg Asp Val Leu Asp His Gly Ala
        115                 120                 125
Leu Pro Val Leu Ala Gly Asp Ala Leu Phe Asp Glu His Gly Lys Leu
    130                 135                 140
Trp Ala Phe Ser Ser Asp Arg Val Pro Glu Val Leu Leu Pro Met Val
145                 150                 155                 160
Glu Gly Arg Leu Arg Val Val Thr Leu Thr Asp Val Asp Gly Ile Val
                165                 170                 175
Thr Asp Gly Ala Gly Gly Asp Thr Ile Leu Pro Glu Val Asp Ala Arg
            180                 185                 190
Ser Pro Glu Gln Ala Tyr Ala Ala Leu Trp Gly Ser Ser Glu Trp Asp
        195                 200                 205
Ala Thr Gly Ala Met His Thr Lys Leu Asp Ala Leu Val Thr Cys Ala
    210                 215                 220
Arg Arg Gly Ala Glu Cys Phe Ile Met Arg Gly Asp Pro Gly Ser Asp
225                 230                 235                 240
Leu Glu Phe Leu Thr Ala Pro Phe Ser Ser Trp Pro Ala His Val Arg
                245                 250                 255
Ser Thr Arg Ile Thr Thr Thr Ala Ser Ala
            260                 265

<210> 77
<211> 801
<212> DNA
<213> Streptomyces viridochromogenes

<220>
<223> IP2 wild type

<400> 77
atgacgcccg atttcttggc catcaaggtt ggcggcagcc tgttctcccg caaggacgaa     60
cccggcagcc tggacgacga cgcggtgacg cgcttcgcca ggaacttcgc ccggctcgcc    120
gagacctacc ggggccggat ggttctcatc agcggcggcg gcgccttcgg ccacggggcc    180
atccgtgacc acgacagcac gcacgcgttc tccctcgccg gcctgaccga ggccaccttc    240
gaggtgaaga agcggtgggc cgagaagctc cgcgggatcg gcgtggacgc cttcccgctc    300
cagctggcgg ccatgtgcac gctccgcaac ggcataccgc agctccggtc cgaggtcctc    360
cgggacgtcc tcgaccacgg cgcgctgccc gtcctcgccg gcgacgccct gttcgacgag    420
cacggaaagc tgtgggcgtt ctccagcgac cgcgtccccg aggtcctcct gcccatggtc    480
gaggggcgcc tccgggtcgt caccctgacc gacgtcgacg gcatcgtgac cgacggcgcc    540
ggcggcgaca cgatcctgcc cgaggtcgac gcccggtccc ccgagcaggc gtacgccgcg    600
ctctggggca gcagcgaatg ggacgccacc ggcgccatgc acaccaagct cgacgcactg    660
gtcacctgcg cccgccgcgg tgccgagtgc ttcatcatgc ggggcgaccc cggcagcgac    720
ctggagttcc tgaccgcccc cttctcctcc tggccggcgc acgtgcggtc caccaggatc    780
accacgactg cttctgcgta a                                              801

<210> 78
<211> 801
<212> DNA
<213> Artificial Sequence

<220>
<223> IP2 codon optimised

<400> 78
atgaccccgg acttcctggc cattaaagtg ggtggtagtc tgttcagccg caaagatgaa     60
ccgggcagcc tggatgatga tgcagtgacc cgcttcgccc gtaacttcgc acgtctggcc    120
gaaacctatc gcggccgtat ggtgctgatt agtggtggtg gtgccttcgg tcatggtgca    180
attcgtgatc atgatagcac ccatgccttc agtctggcag gtctgaccga agcaaccttc    240
gaagtgaaaa aacgctgggc agaaaaactg cgcggtattg gcgtggatgc attcccgctg    300
caactggccg caatgtgcac cctgcgtaat ggtattccgc agctgcgtag cgaagtgctg    360
cgtgatgttc tggatcatgg cgcactgccg gttctggcag gcgatgccct gttcgatgaa    420
catggtaaac tgtgggcatt cagcagcgat cgcgttccgg aagttctgct gccgatggtt    480
gaaggtcgtc tgcgtgtggt gaccctgacc gatgttgatg gtattgtgac cgatggcgcc    540
ggcggtgata ccattctgcc ggaagttgat gcacgcagtc cggaacaggc atacgctgcc    600
ctgtggggta gtagcgaatg ggatgcaacc ggtgcaatgc ataccaaact ggatgcactg    660
gtgacctgtg cacgccgtgg cgcagaatgc ttcattatgc gtggtgatcc gggcagtgat    720
ctggagttcc tgaccgcacc gttcagcagc tggccggccc atgtgcgtag tacccgcatt    780
accaccaccg caagtgcctg a                                              801

<210> 79
<211> 276
<212> PRT
<213> Pseudoalteromonas byunsanensis

<220>
<223> IP3

<400> 79
Met Val Thr Ser Ala Asp Leu Leu Ile Val Lys Val Gly Gly Ser Leu
1               5                   10                  15
Phe Ser Asp Lys Met Thr Asp Arg Gln Leu Asp Glu Gln Ala Leu Gln
            20                  25                  30
Thr Tyr Ala Gln Leu Met Ala Ser Leu Tyr Arg Asn Ala Pro Gly His
        35                  40                  45
Val Ile Met Ile Ser Gly Gly Gly Ser Tyr Gly His His Ala Val Arg
    50                  55                  60
Cys Ile Asp Glu Ser Asp Glu Leu Ser Leu Leu Ser Leu Gly Met Ile
65                  70                  75                  80
Asn Phe Glu Leu Lys Cys Val Trp His Glu Gln Leu Lys Arg Cys Gly
                85                  90                  95
Ile Lys Ser Tyr Pro Leu His Leu Ala Ser Met Thr Ser Cys Val Asn
            100                 105                 110
Arg Glu Asn Phe Asp Ser Ser Ala Lys Phe Val Asn Lys Leu Leu Tyr
        115                 120                 125
Ala Lys Tyr Leu Pro Leu Val Thr Gly Asp Ala Leu Leu Asn Glu Gln
    130                 135                 140
Gly Val Leu Glu Val Val Gly Ser Asp Tyr Val Ala Gly Ala Phe Lys
145                 150                 155                 160
Asp Leu Glu Phe Asn Lys Ile Arg Ile Val Ile Met Thr Asp Val Pro
                165                 170                 175
Gly Val Leu Gln Lys Ser Ala Thr Gly Gln Phe Glu Thr Ile Lys Glu
            180                 185                 190
Ile Asp Gln Phe Asn Asp Pro Ala Gln Trp Leu Trp Glu Thr Pro Glu
        195                 200                 205
Gly Asp Thr Ser Gly Ala Met Gln Gly Lys Ile Ala Ala Leu Leu Lys
    210                 215                 220
Gln Ala Lys Trp Gly Ala Glu Cys Phe Ile Val Glu Gly Gln Ala Cys
225                 230                 235                 240
Leu Lys Asn Pro Arg Trp Leu Phe Glu Glu His Ser Asp Trp Pro Glu
                245                 250                 255
Glu Phe Lys Ser Thr Gln Ile Ile Trp Arg Glu Asn Asn Ala Lys Asp
            260                 265                 270
Thr Glu Gly Tyr
        275

<210> 80
<211> 831
<212> DNA
<213> Pseudoalteromonas byunsanensis

<220>
<223> IP3 wild type

<400> 80
atggtgacca gcgctgattt attaattgta aaagtgggtg gtagtctttt ttctgacaaa     60
atgactgaca gacaactgga tgaacaagcc ctgcaaacat atgcgcaatt gatggcctct    120
ttgtatcgca atgcaccggg tcacgtgatt atgataagcg gtggcggctc ttatggacac    180
catgctgtgc gctgtataga tgaaagtgat gagctgtcgc tactatcttt aggcatgatt    240
aactttgagc ttaagtgcgt gtggcatgaa caattaaaac gctgtgggat taagtcttac    300
cctttgcatt tagcttctat gacatcctgt gtgaatagag aaaactttga ctcatccgca    360
aagtttgtca ataagttgtt gtatgcaaag tacttgcccc tagttacagg agatgcattg    420
cttaatgaac aaggggtgtt agaagttgtg ggcagcgatt atgtggctgg cgcctttaaa    480
gaccttgagt ttaacaaaat aagaatcgtg atcatgaccg atgtaccggg tgtgctgcaa    540
aagagtgcta ctggtcaatt tgaaacgatt aaagaaatag accagtttaa tgacccagct    600
cagtggctat gggagacacc agaaggggac acatcaggtg ctatgcaagg caaaattgca    660
gccttactta agcaagccaa atggggcgct gagtgcttta tcgtagaagg acaagcatgt    720
ttgaaaaacc cccgatggtt atttgaagaa cactccgatt ggcctgaaga gtttaagagc    780
acacaaatta tttggcgaga gaataacgca aaagacacag aaggatatta a             831

<210> 81
<211> 831
<212> DNA
<213> Artificial Sequence

<220>
<223> IP3 codon optimised

<400> 81
atggttacca gtgccgatct gctgattgtg aaagtgggcg gtagcctgtt cagtgataaa     60
atgaccgatc gtcagctgga tgaacaggcc ctgcaaacct atgcacagct gatggcaagc    120
ctgtatcgca atgcccctgg tcatgtgatt atgattagcg gcggtggcag ctatggccat    180
catgccgttc gttgcattga tgaaagtgat gaactgagcc tgctgagtct gggcatgatt    240
aacttcgaac tgaaatgtgt gtggcatgaa cagctgaaac gttgtggtat taaaagttat    300
ccgctgcatc tggccagcat gaccagctgc gtgaatcgtg aaaacttcga tagtagcgca    360
aaattcgtga ataaactgct gtatgccaaa tatctgccgc tggttaccgg tgatgccctg    420
ctgaatgaac agggcgttct ggaagttgtg ggcagtgatt atgtggcagg tgccttcaaa    480
gatctggagt tcaataaaat tcgcattgtt attatgaccg acgttccggg tgtgctgcaa    540
aaaagtgcaa ccggtcagtt cgaaaccatt aaagaaattg atcagttcaa cgatccggcc    600
cagtggctgt gggaaacacc tgaaggcgat accagcggcg caatgcaggg taaaattgca    660
gccctgctga aacaggccaa atggggcgcc gaatgcttca ttgttgaagg tcaggcatgt    720
ctgaaaaatc cgcgttggct gttcgaagaa catagcgatt ggccggaaga gttcaaaagt    780
acccagatta tctggcgcga aaataatgca aaagataccg aaggctattg a             831

<210> 82
<211> 248
<212> PRT
<213> Haloferax volcanii

<220>
<223> IP4

<400> 82
Met Ser Leu Val Val Leu Lys Leu Gly Gly Ser Val Val Thr Asp Lys
1               5                   10                  15
Asp Glu Pro Glu Thr Val Asp Glu Ala Gly Leu Ala Ala Ala Ala Asp
            20                  25                  30
Ala Val Ala Pro Leu Ala Glu Ser Arg Arg Val Val Val Val His Gly
        35                  40                  45
Gly Gly Ser Phe Gly His His His Ala Ala Glu His Gly Val Ser Ser
    50                  55                  60
Glu Ser Gly Ser His Asp Ala Arg Gly Val Arg Ala Ile His Asp Ala
65                  70                  75                  80
Met Lys Arg Leu Asn Asp Ala Val Leu Asp Ala Leu Glu Glu Arg Gly
                85                  90                  95
Val Ala Ala Leu Pro Val His Pro Leu Ser Ala Gly Ala Arg Glu Ala
            100                 105                 110
Asp Gly Ser Leu Ser Leu Pro Leu Ala Ala Thr Glu Thr Met Leu Asp
        115                 120                 125
Glu Gly Phe Val Pro Val Leu His Gly Asp Val Ile Ser His Ala Gly
    130                 135                 140
Lys Gly Ala Thr Ile Val Ser Gly Asp Asp Leu Val Val Ser Leu Ala
145                 150                 155                 160
Ser Gly Leu Gly Ala Asp Arg Val Gly Leu Cys Ser Thr Val Pro Gly
                165                 170                 175
Val Leu Asp Ala Asp Gly Asp Val Ile Pro Glu Ile Thr Ala Phe Ala
            180                 185                 190
Asp Ala Ala Asp Ala Leu Gly Gly Ser Asp Ser Thr Asp Val Thr Gly
        195                 200                 205
Gly Met Ala Ala Lys Val Arg Lys Leu Leu Ala Leu Gly Ala Pro Ala
    210                 215                 220
His Val Phe Gly Pro Glu Gly Leu Ser Ala Phe Val Ala Gly Glu Ser
225                 230                 235                 240
Pro Gly Thr Val Ile Arg Gly Glu
                245

<210> 83
<211> 747
<212> DNA
<213> Haloferax volcanii

<220>
<223> IP4 wild type

<400> 83
gtgagcctcg tcgtcctcaa actcggcggg agcgtcgtca ccgacaagga cgaaccggag     60
acggtcgacg aggcgggact ggcggccgcc gcggacgcgg ttgcgcccct cgctgaatcg    120
cgccgcgtcg tcgtcgtcca cggcggcggg agcttcggcc accaccacgc cgccgaacac    180
ggcgtctcct cggagtccgg gagccacgac gcccgcggcg tgcgcgccat ccacgacgcg    240
atgaagcgcc tcaacgacgc cgtcctcgac gccctcgaag agcgcggcgt cgcggccctg    300
ccggtccacc cgctttcggc cggcgcgcgc gaggccgacg gctcgctgtc gcttccgctc    360
gcggcgaccg agacgatgct cgacgagggc ttcgtcccgg tcctccacgg agatgtcatc    420
tcgcacgcgg gcaagggcgc gaccatcgtc agcggcgacg acctcgtggt gtcgctcgcg    480
tcggggctcg gcgcggaccg cgtcggcctc tgttcgaccg tccccggcgt gctcgacgcc    540
gacggcgacg tgattcccga gattacggcg ttcgcggacg cggccgacgc gctcggcggg    600
tctgactcca ccgacgtgac cggcgggatg gccgcgaagg tgcgaaaact gctcgcgctc    660
ggcgcgccgg cgcacgtctt cggccccgag ggactgtcgg cgttcgtcgc cggcgagtcg    720
ccgggaaccg tcatccgcgg agagtag                                        747

<210> 84
<211> 747
<212> DNA
<213> Artificial Sequence

<220>
<223> IP4 codon optimised

<400> 84
atgagtctgg tggtgctgaa actgggcggc agtgtggtga ccgataaaga tgaaccggaa     60
accgtggatg aagccggcct ggcagcagcc gcagatgcag ttgcaccgct ggccgaaagc    120
cgccgtgttg tggtggttca tggcggcggc agcttcggtc atcatcatgc cgccgaacat    180
ggtgtgagca gtgaaagcgg tagtcatgat gcccgtggcg tgcgcgccat tcatgatgca    240
atgaaacgcc tgaatgatgc cgtgctggat gcactggaag aacgcggtgt tgccgcactg    300
ccggttcatc cgctgagtgc cggcgcccgt gaagccgatg gtagcctgag cctgccgctg    360
gcagcaaccg aaaccatgct ggatgaaggc ttcgttccgg ttctgcatgg tgatgtgatt    420
agccatgcag gtaaaggtgc caccattgtg agtggcgatg atctggtggt tagcctggcc    480
agtggtctgg gcgcagatcg tgtgggtctg tgtagtaccg tgccgggtgt gctggatgcg    540
gatggcgatg tgattccgga aattaccgcc ttcgcagatg ccgcagatgc cctgggcggt    600
agtgatagca ccgatgtgac cggtggcatg gcagcaaaag ttcgcaaact gctggccctg    660
ggcgccccgg cacatgtgtt cggtccggaa ggcctgagcg ccttcgtggc aggcgaaagc    720
cctggtaccg ttattcgcgg cgaatga                                        747

<210> 85
<211> 266
<212> PRT
<213> Methanothermobacter thermautotrophicus

<220>
<223> IP5

<400> 85
Met Ile Ile Leu Lys Leu Gly Gly Ser Val Ile Thr Arg Lys Asp Ser
1               5                   10                  15
Glu Glu Pro Ala Ile Asp Arg Asp Asn Leu Glu Arg Ile Ala Ser Glu
            20                  25                  30
Ile Gly Asn Ala Ser Pro Ser Ser Leu Met Ile Val His Gly Ala Gly
        35                  40                  45
Ser Phe Gly His Pro Phe Ala Gly Glu Tyr Arg Ile Gly Ser Glu Ile
    50                  55                  60
Glu Asn Glu Glu Asp Leu Arg Arg Arg Arg Phe Gly Phe Ala Leu Thr
65                  70                  75                  80
Gln Asn Trp Val Lys Lys Leu Asn Ser His Val Cys Asp Ala Leu Leu
                85                  90                  95
Ala Glu Gly Ile Pro Ala Val Ser Met Gln Pro Ser Ala Phe Ile Arg
            100                 105                 110
Ala His Ala Gly Arg Ile Ser His Ala Asp Ile Ser Leu Ile Arg Ser
        115                 120                 125
Tyr Leu Glu Glu Gly Met Val Pro Val Val Tyr Gly Asp Val Val Leu
    130                 135                 140
Asp Ser Asp Arg Arg Leu Lys Phe Ser Val Ile Ser Gly Asp Gln Leu
145                 150                 155                 160
Ile Asn His Phe Ser Leu Arg Leu Met Pro Glu Arg Val Ile Leu Gly
                165                 170                 175
Thr Asp Val Asp Gly Val Tyr Thr Arg Asn Pro Lys Lys His Pro Asp
            180                 185                 190
Ala Arg Leu Leu Asp Val Ile Gly Ser Leu Asp Asp Leu Glu Ser Leu
        195                 200                 205
Asp Gly Thr Leu Asn Thr Asp Val Thr Gly Gly Met Val Gly Lys Ile
    210                 215                 220
Arg Glu Leu Leu Leu Leu Ala Glu Lys Gly Val Glu Ser Glu Ile Ile
225                 230                 235                 240
Asn Ala Ala Val Pro Gly Asn Ile Glu Arg Ala Leu Leu Gly Glu Glu
                245                 250                 255
Val Arg Gly Thr Arg Ile Thr Gly Lys His
            260                 265

<210> 86
<211> 801
<212> DNA
<213> Methanothermobacter thermautotrophicus

<220>
<223> IP5 wild type

<400> 86
atgatcattc tcaagcttgg tggaagtgta attaccagga aggactctga ggaacctgca     60
atagacaggg ataaccttga aaggatagcc tcagagatag ggaacgcttc gccatcatca    120
ttgatgatag tgcacggcgc agggtccttt ggccacccat ttgcaggtga gtacaggata    180
ggctcggaga tagagaatga ggaggacctc aggcgccgga ggtttggatt tgcactgacc    240
cagaactggg ttaaaaagct taacagccat gtatgcgacg cactccttgc tgagggaatt    300
ccagcagttt caatgcagcc atcagctttc ataagggccc atgctggccg cataagccac    360
gcggatatct cactgatcag atcctacctt gaggagggta tggtaccggt ggtctacggt    420
gacgttgtac ttgactcaga caggaggttg aaattttctg tcatatcagg agaccagctg    480
ataaaccact tctccctgag gctgatgccg gagagggtca tactcgggac agatgtggac    540
ggggtataca ccaggaaccc taagaagcac cccgatgcaa ggctccttga tgttatagga    600
tcactcgatg accttgaatc ccttgacggg acacttaaca ctgatgtgac gggtggaatg    660
gtcggtaaga taagggaact ccttctgctt gcagagaagg gtgtggaatc tgaaataatt    720
aatgctgcag tgcccggaaa tattgagagg gccctcctgg gagaggaggt acggggcaca    780
agaatcacag ggaaacattg a                                              801

<210> 87
<211> 801
<212> DNA
<213> Artificial Sequence

<220>
<223> IP5 codon optimised

<400> 87
atgatcatcc tgaaactggg cggtagcgtg attacccgca aagatagcga agaaccggca     60
attgatcgtg ataatctgga acgtattgca agcgaaattg gtaatgcaag cccgagtagc    120
ctgatgattg tgcatggtgc cggtagcttc ggtcatccgt tcgccggtga atatcgcatt    180
ggcagcgaaa ttgaaaatga agaagatctg cgccgtcgcc gcttcggctt cgccttaacc    240
cagaattggg tgaaaaaact gaatagccat gtgtgtgatg cactgctggc agaaggtatt    300
ccggccgtta gtatgcagcc gagtgccttc attcgcgccc atgcaggtcg cattagccat    360
gccgatatta gcctgattcg tagctatctg gaagaaggca tggttccggt ggtgtatggt    420
gatgtggttc tggatagcga tcgtcgcctg aaattcagcg tgattagcgg tgatcagctg    480
attaatcact tcagcctgcg cctgatgccg gaacgcgtta ttctgggtac cgatgtggat    540
ggcgtgtata cccgcaatcc gaaaaaacat ccggatgccc gtctgctgga tgttattggt    600
agtctggatg atctggaaag tctggatggc accctgaata ccgatgttac cggtggcatg    660
gtgggcaaaa ttcgtgaact gctgctgctg gcagagaaag gtgttgaaag cgaaattatt    720
aatgccgcag tgccgggtaa tattgaacgc gccctgctgg gtgaagaagt gcgtggcacc    780
cgcattaccg gtaaacattg a                                              801

<210> 88
<211> 260
<212> PRT
<213> Methanocaldococcus jannaschii

<220>
<223> IP6

<400> 88
Met Leu Thr Ile Leu Lys Leu Gly Gly Ser Ile Leu Ser Asp Lys Asn
1               5                   10                  15
Val Pro Tyr Ser Ile Lys Trp Asp Asn Leu Glu Arg Ile Ala Met Glu
            20                  25                  30
Ile Lys Asn Ala Leu Asp Tyr Tyr Lys Asn Gln Asn Lys Glu Ile Lys
        35                  40                  45
Leu Ile Leu Val His Gly Gly Gly Ala Phe Gly His Pro Val Ala Lys
    50                  55                  60
Lys Tyr Leu Lys Ile Glu Asp Gly Lys Lys Ile Phe Ile Asn Met Glu
65                  70                  75                  80
Lys Gly Phe Trp Glu Ile Gln Arg Ala Met Arg Arg Phe Asn Asn Ile
                85                  90                  95
Ile Ile Asp Thr Leu Gln Ser Tyr Asp Ile Pro Ala Val Ser Ile Gln
            100                 105                 110
Pro Ser Ser Phe Val Val Phe Gly Asp Lys Leu Ile Phe Asp Thr Ser
        115                 120                 125
Ala Ile Lys Glu Met Leu Lys Arg Asn Leu Val Pro Val Ile His Gly
    130                 135                 140
Asp Ile Val Ile Asp Asp Lys Asn Gly Tyr Arg Ile Ile Ser Gly Asp
145                 150                 155                 160
Asp Ile Val Pro Tyr Leu Ala Asn Glu Leu Lys Ala Asp Leu Ile Leu
                165                 170                 175
Tyr Ala Thr Asp Val Asp Gly Val Leu Ile Asp Asn Lys Pro Ile Lys
            180                 185                 190
Arg Ile Asp Lys Asn Asn Ile Tyr Lys Ile Leu Asn Tyr Leu Ser Gly
        195                 200                 205
Ser Asn Ser Ile Asp Val Thr Gly Gly Met Lys Tyr Lys Ile Asp Met
    210                 215                 220
Ile Arg Lys Asn Lys Cys Arg Gly Phe Val Phe Asn Gly Asn Lys Ala
225                 230                 235                 240
Asn Asn Ile Tyr Lys Ala Leu Leu Gly Glu Val Glu Gly Thr Glu Ile
                245                 250                 255
Asp Phe Ser Glu
            260

<210> 89
<211> 783
<212> DNA
<213> Methanocaldococcus jannaschii

<220>
<223> IP6 wild type

<400> 89
atgctaacca tattaaaatt aggagggagt attttatcag ataaaaatgt tccatactca     60
ataaaatggg acaacttgga gagaatagca atggagataa aaaacgcctt ggattattat    120
aaaaaccaaa ataaagagat aaaattaata ctcgtccatg gaggaggagc ttttggtcat    180
ccagtagcta aaaaatactt aaaaattgaa gatggcaaaa aaatatttat aaacatggag    240
aaaggatttt gggaaattca aagagcaatg agaagattta acaacatcat tatagacact    300
ctacagagct atgacatccc agctgtttct atacaaccat cttcgtttgt cgtttttggg    360
gataagttaa tttttgatac ctctgctata aaagagatgc ttaaaaggaa tttagttcca    420
gttattcatg gagatattgt aattgatgat aaaaacggct atagaataat ttctggagat    480
gacatagttc catatttggc aaatgaatta aaagctgatt taattctcta tgctacagat    540
gttgatggtg ttttaataga taataagcca ataaagagga ttgataaaaa taatatctat    600
aaaattttga attatttaag tggttctaat agtatagatg ttactggtgg aatgaagtat    660
aagatagaca tgattaggaa aaataagtgt agaggttttg tatttaatgg aaataaagct    720
aataatatat acaaagcttt attgggggag gttgaaggaa cagaaattga tttttcagaa    780
taa                                                                  783

<210> 90
<211> 783
<212> DNA
<213> Artificial Sequence

<220>
<223> IP6 codon optimised

<400> 90
atgctgacca ttctgaaact gggtggtagc attctgagtg ataaaaatgt gccgtatagt     60
attaagtggg ataatctgga acgcattgca atggaaatta aaaatgcact ggattattac    120
aagaaccaga ataaagaaat caagctgatt ctggtgcatg gtggtggtgc cttcggccat    180
ccggttgcaa aaaaatatct gaaaattgaa gacggtaaga agatcttcat taatatggaa    240
aagggcttct gggaaattca gcgtgccatg cgtcgcttca ataatattat tattgacacc    300
ctgcaaagct atgatattcc ggccgtgagt attcagccga gtagcttcgt ggtgttcggc    360
gataaactga tcttcgatac cagtgccatt aaagaaatgc tgaaacgtaa tctggtgccg    420
gtgattcatg gcgatattgt gattgatgat aaaaatggtt accgtattat cagtggtgat    480
gatattgttc cgtatctggc caatgaactg aaagccgatc tgattctgta tgcaaccgat    540
gttgatggcg tgctgattga taataaaccg attaaacgca ttgacaaaaa caatatctat    600
aagatcctga actacctgag tggtagcaat agtattgatg tgaccggtgg tatgaaatat    660
aaaattgata tgatccgcaa gaacaagtgt cgcggcttcg tgttcaatgg caataaagca    720
aataatatct acaaggccct gctgggcgaa gttgaaggca ccgaaattga cttcagtgaa    780
tga                                                                  783

<210> 91
<211> 290
<212> PRT
<213> Streptomyces regensis

<220>
<223> IP7

<400> 91
Met Thr Gly Pro Gly Thr Arg Ala Gly Glu Pro Gly Leu Leu Val Leu
1               5                   10                  15
Lys Val Gly Gly Ser Leu Leu Ser Asp Lys Arg His Thr Gly Glu Thr
            20                  25                  30
Asp Tyr Thr Ala Ile Asp Asp Tyr Ala Gly Leu Leu Ala Asp Leu Val
        35                  40                  45
Ala Ala Phe Pro Gly Arg Val Val Leu Val Thr Gly Gly Gly Ala Leu
    50                  55                  60
Cys His Pro Val Gly Leu Arg Ile Lys Ala Ala Lys Asp Asp Pro Tyr
65                  70                  75                  80
Ala Ala Val Ala Leu Thr Glu Pro Ala Phe Arg Met Arg Trp Ala Trp
                85                  90                  95
Thr Thr Arg Leu Arg Ala Lys Gly Val Arg Ala Val Pro Leu Gln Ala
            100                 105                 110
Thr Ser Leu Leu Glu Glu Arg Pro Asp Gly Thr Arg Ala Cys His Thr
        115                 120                 125
Ala Val Val Gly Arg Leu Leu Ala Gln Gly Ala Leu Pro Val Leu Ser
    130                 135                 140
Ser Asp Cys Tyr Leu Thr Ala Glu Gly Thr Leu Arg Ile Leu Ser Ser
145                 150                 155                 160
Asp Asp Val Pro Ala Val Leu Leu Asp Leu Gly Leu Gly Pro Val Arg
                165                 170                 175
Val Val Ala Leu Thr Asp Val Asp Gly Ile His Thr Gly Gly Pro Gly
            180                 185                 190
Ser Pro Val Leu Ala His Leu Asp Pro Asp Asp Leu Ala Ala Ala Arg
        195                 200                 205
Ala Leu Phe Trp Thr Asp Ala Trp Asp Ala Thr Gly Ala Met Glu Gly
    210                 215                 220
Lys Val Glu Ala Leu Ala Asp Ser Ala Arg Arg Gly Ala Glu Cys Val
225                 230                 235                 240
Ile Thr Arg Gly Asp Arg Thr Ala Ala Asp Leu Arg His Leu Phe Ala
                245                 250                 255
Pro Leu Pro Asp Trp Pro Arg Asn Ala Pro Arg Thr Leu Ile Ser Arg
            260                 265                 270
Arg Ala Pro Thr Asp Ser Ala Leu Ser Pro Ala Leu Ser Leu Glu Asn
        275                 280                 285
Ala Pro
    290

<210> 92
<211> 873
<212> DNA
<213> Streptomyces regensis

<220>
<223> IP7 wild type

<400> 92
gtgaccggcc ccggcacgcg ggccggtgag ccggggctgc tcgtcctgaa ggtcggcggc     60
agcctcctct ccgacaagcg gcacaccggc gagaccgact acacggcgat cgacgactac    120
gccgggctgc tcgcggacct ggtcgccgcc ttccccggcc gggtggtcct ggtgaccggg    180
ggcggagcgc tctgccaccc ggtcggactg cggatcaaag cggccaagga cgacccctac    240
gccgccgtcg ccctgaccga acccgcgttc cggatgcgct gggcgtggac gacccggctg    300
cgcgccaagg gcgtgcgggc cgtcccgctc caggcgacct ccctgctcga ggagcgcccc    360
gacggcaccc gggcctgcca caccgcggtc gtcggccgac tgctcgccca gggcgcgctg    420
cccgtgctgt ccagcgactg ctacctcacc gccgagggca ccctgcgcat cctcagcagc    480
gacgacgtgc ccgccgtcct gctggacctg ggcctcggcc cggtccgggt ggtggccctc    540
accgatgtcg acggcatcca caccgggggc cccggctccc cggtgctcgc ccacctcgat    600
cccgacgacc tggccgccgc acgcgccctg ttctggacgg acgcatggga tgccaccggt    660
gccatggagg gcaaggtcga ggcgctggcc gactccgccc ggcgcggcgc cgagtgcgtc    720
atcacccgcg gcgaccgcac ggcggccgac ctcaggcacc tgttcgcccc gctgccggac    780
tggccgcgaa acgccccgcg cacgctgatc tcgcggcgcg ccccgaccga cagcgccctc    840
tcacccgccc tctccctgga gaacgcccca tga                                 873

<210> 93
<211> 873
<212> DNA
<213> Artificial Sequence

<220>
<223> IP7 codon optimised

<400> 93
atgaccggtc cgggcacccg tgcaggcgaa cctggtctgc tggtgctgaa agtgggtggt     60
agcctgctga gtgataaacg ccataccggc gaaaccgatt ataccgcaat tgatgattat    120
gcaggtctgc tggcagatct ggttgccgca ttccctggtc gtgtggttct ggttaccggt    180
ggcggcgcac tgtgccatcc ggtgggtctg cgtattaaag ccgcaaaaga tgatccgtat    240
gcagcagtgg cactgaccga accggccttc cgcatgagat gggcctggac cacccgcctg    300
cgtgcaaaag gcgtgcgcgc cgttccgctg caagccacaa gtctgctgga agaacgcccg    360
gatggtaccc gtgcatgtca taccgcagtt gttggtcgcc tgctggccca gggcgcctta    420
cctgtgctga gcagcgattg ctatctgacc gcagaaggta ccctgcgtat tctgagtagc    480
gatgatgtgc cggccgtgct gctggatctg ggcctgggtc cggttcgcgt ggtggcactg    540
acagatgttg atggcattca taccggcggt ccgggcagcc cggttctggc acatctggac    600
cctgatgatc tggccgccgc ccgcgctctg ttctggacag atgcctggga tgcaaccggc    660
gcaatggaag gtaaagtgga agccctggcc gatagcgccc gtcgcggtgc agaatgcgtt    720
attacccgtg gtgatcgcac cgcagccgat ctgcgtcatc tgttcgcacc gctgccggat    780
tggccgcgca atgcccctcg taccctgatt agccgccgcg ccccgaccga tagtgcctta    840
tctccggccc tgagtctgga aaatgcaccg tga                                 873

<210> 94
<211> 299
<212> PRT
<213> Sphaerisporangium sp.

<220>
<223> IP8

<400> 94
Met Glu Ala Met Gly Leu Ser Ser Pro Ala Val Asp Arg Thr Trp Arg
1               5                   10                  15
Ser Thr Gln Arg Ala Arg Glu Ser Ser Ala Ser Asp Arg Val Leu Ala
            20                  25                  30
Val Lys Ile Gly Gly Ser Leu Phe Ser Asp Lys Ser Val Ala Gly Ser
        35                  40                  45
Leu Asp Lys Gly Arg Ile Ala Arg Phe Ala Arg Val Ile Ser Ser Leu
    50                  55                  60
His Glu Arg Phe Pro Gly Gln Val Val Leu Ile Thr Gly Gly Gly Ala
65                  70                  75                  80
Ile Gly His Gly Ala Leu Arg Gly Ile Asp Pro Ala Asp Pro Phe Ala
                85                  90                  95
Ala Ile Gly Leu Thr Lys Ala Leu Ala Asp Val Arg Trp Ala Trp Thr
            100                 105                 110
Gln Ala Leu Val Gly Leu Gly Val Arg Ala Phe Pro Leu Gln Leu Gly
        115                 120                 125
Ala Met Ala Thr Leu Asp Asp Asp Leu Ser Phe Arg Val Arg Ala Asp
    130                 135                 140
Ile Val Glu Arg Val Leu Ala Ser Gly Ala Leu Pro Ile Leu Ser Gly
145                 150                 155                 160
Asp Ser Val Leu Asp Ala Asn Gly Asn Leu His Gly Leu Ser Ser Asp
                165                 170                 175
Arg Val Pro Glu Phe Leu Val Arg Ala Leu Gln Thr Pro Leu Arg Val
            180                 185                 190
Ala Ser Phe Thr Asp Val Pro Gly Ile Val Leu Gly Gly Pro Gly Gly
        195                 200                 205
Lys Glu Thr Leu Arg Tyr Val Asp Pro Met Thr Pro Gln Ala Ala Tyr
    210                 215                 220
Glu Ala Leu Trp Thr Asn Ser Glu Trp Asp Thr Thr Gly Gly Phe Lys
225                 230                 235                 240
Thr Lys Val Asp Ala Leu Ile Arg Cys Ala Ala Glu Gly Ala Glu Cys
                245                 250                 255
Phe Ile Leu Glu Gly Val Ala Gln Asp Ser Glu Trp Ala Tyr Leu Leu
            260                 265                 270
Ser Pro Tyr Ser Gly Trp Ser Asn Lys Leu His Cys Thr Arg Ile Ala
        275                 280                 285
Arg Ser Pro Ala Ala Ala Val Ser Thr Ala Glu
    290                 295

<210> 95
<211> 900
<212> DNA
<213> Sphaerisporangium sp.

<220>
<223> IP8 wild type

<400> 95
ttggaggcga tggggttgag tagtccggct gtcgatagaa catggcgctc cacgcagcgc     60
gcccgtgaga gctcggcgtc agatcgcgtg cttgcggtga agatcggtgg cagtcttttc    120
tctgataaga gtgtcgccgg cagcttggat aaaggtcgta ttgctaggtt cgctcgagtg    180
atctccagcc ttcatgagcg ttttcccggc caggtcgtcc ttatcacagg aggcggtgcc    240
atcgggcacg gcgcactacg aggaatcgat ccggcagacc ccttcgccgc aatcggcttg    300
accaaggcgc tcgcggacgt ccggtgggca tggacgcaag ctttggtagg tctcggggta    360
agggcgtttc cgctccagct tggcgcgatg gcgacattgg atgacgatct gtcgttccgc    420
gtgagggccg atatcgtcga gcgggtcctg gcgagcggcg cgcttccgat cctatcgggc    480
gacagcgtct tggacgccaa tggaaacctg cacggcctgt cgagtgaccg cgtaccagaa    540
ttccttgtac gtgccttgca gacacccctg cgggtcgcga gcttcactga tgtcccgggc    600
attgtgctgg gaggaccggg cggaaaggaa acgctccgct acgtggaccc catgaccccc    660
caggcagcct atgaagcact gtggaccaac agtgaatggg acaccacagg gggcttcaag    720
accaaggtgg acgcgctcat tcgatgtgca gcggagggcg cggagtgctt cattcttgag    780
ggagttgcgc aagactccga gtgggcatat ctgctctcac cgtacagcgg gtggtccaac    840
aagctccact gtacccggat cgcgcgctct ccggccgcag cagtcagcac cgccgagtag    900

<210> 96
<211> 900
<212> DNA
<213> Artificial Sequence

<220>
<223> IP8 codon optimised

<400> 96
atggaagcaa tgggtctgag cagtccggcc gtggatcgta cctggcgcag tacccagcgt     60
gcacgcgaaa gcagcgccag cgatcgtgtg ctggcagtta aaattggtgg cagtctgttc    120
agcgataaaa gcgttgcagg tagcctggat aaaggccgta ttgcacgctt cgcccgtgtt    180
attagtagcc tgcatgaacg cttccctggt caggtggtgc tgattaccgg tggcggtgcc    240
attggtcatg gcgcactgcg tggcattgat ccggcagatc cgttcgcagc aattggcctg    300
accaaagcac tggcagatgt gcgttgggcc tggacccagg cactggttgg cctgggtgtt    360
cgtgcattcc cgctgcaact gggcgcaatg gctaccctgg atgatgatct gagcttccgt    420
gttcgcgcag atattgttga acgtgttctg gcaagtggcg cactgccgat tctgagcggc    480
gatagcgtgc tggatgcaaa tggtaatctg catggcctga gcagtgatcg tgttccggag    540
ttcctggttc gcgcactgca aacacctctg cgcgttgcca gcttcaccga tgtgccgggc    600
attgtgctgg gtggtccggg cggtaaagaa accttacgtt atgtggaccc tatgaccccg    660
caggccgcct atgaagcact gtggaccaat agtgaatggg ataccaccgg tggcttcaaa    720
accaaagttg atgccctgat tcgctgtgcc gcagaaggtg ccgaatgctt cattctggaa    780
ggtgttgccc aggatagtga atgggcctat ctgctgagtc cgtatagcgg ctggagtaat    840
aaactgcatt gcacccgcat tgcccgcagc ccggccgcag cagttagcac agccgaatga    900

<210> 97
<211> 269
<212> PRT
<213> Actinoalloteichus sp.

<220>
<223> IP9

<400> 97
Met Val Asp Asp Val Val Asp Leu Leu Val Val Lys Val Gly Gly Ser
1               5                   10                  15
Leu Val Ser Glu Lys Ala Arg Arg Asp His Leu Asp His Asp Ala Leu
            20                  25                  30
Ala Gly Tyr Ala Ala Gln Ile Ala Asp Leu His Ala Ala Ala Pro Gly
        35                  40                  45
Arg Val Val Leu Val Val Gly Gly Gly Ser Ile Gly His Gly Ala Val
    50                  55                  60
Arg His Leu Asp Ala Asp Asp Pro Leu Ala Pro Leu Pro Leu Thr Arg
65                  70                  75                  80
Ala Thr Phe Asp Val Lys Trp Ser Trp Val Arg Ala Leu Arg Asp Leu
                85                  90                  95
Gly Ser Arg Cys Phe Pro Val Gln Val Ala Ala Ile Cys Val Leu Gly
            100                 105                 110
Pro Arg Gly Pro Glu Val Ser Phe Gly Thr Val Arg Arg Leu Leu Asp
        115                 120                 125
His Gly Ile Leu Pro Val Leu Ala Gly Asp Ser Val Leu Cys Ala Asp
    130                 135                 140
Gly Ala Leu Arg Val Phe Gly Ser Asp His Val Pro Ala Val Ala Val
145                 150                 155                 160
Arg Gly Thr Pro Gly Arg Thr Arg Val Ala Val Leu Thr Asp Val Pro
                165                 170                 175
Gly Val Leu Ala Gly Gly Pro Gly Ser Gln Glu Val Ile Pro Glu Ile
            180                 185                 190
Thr Pro Gly Ser Ser Ala Glu Ala Phe Arg Arg Ile Trp Pro Ala Ala
        195                 200                 205
Ala His Asp Thr Ser Gly Ser Met Gly Gly Lys Leu Thr Ala Leu Leu
    210                 215                 220
Asp His Ala Arg Asp Gly Ala Glu Cys Phe Val Leu Arg Gly Asp Pro
225                 230                 235                 240
Thr Ala Pro Asp Leu Arg Phe Leu Leu Glu Gly Arg Gly Arg Trp Pro
                245                 250                 255
Asp Val Pro His Thr Arg Ile Val Ala Asp Thr Thr Gly
            260                 265

<210> 98
<211> 810
<212> DNA
<213> Actinoalloteichus sp.

<220>
<223> IP9 wild type

<400> 98
gtggtggatg acgtggtcga cctgctggtg gtgaaggtgg gcgggagcct ggtctcggag     60
aaggcccggc gcgaccacct ggaccacgac gccctggcgg ggtacgcggc gcagatcgcg    120
gacctgcacg ccgccgcccc cgggcgcgtg gtcctcgtcg tgggcggcgg atccatcggg    180
cacggggccg tgcggcacct cgacgccgac gaccccctcg ccccgttgcc cctcacccgt    240
gcgaccttcg acgtcaagtg gtcgtgggtg cgggcgttgc gggacctcgg ctcgcggtgc    300
ttccccgtcc aggtcgccgc catctgcgtg ctcggccccc gggggccgga ggtcagcttc    360
gggaccgtgc gacgcctgct cgaccacggc atcctcccgg tgctggccgg tgacagcgtg    420
ctctgcgccg acggtgcgct gcgggtgttc ggcagcgacc acgtcccggc cgtggccgtg    480
cgtggcacgc cgggtcggac gcgggtcgcc gtgctcaccg acgtcccggg agtgctggct    540
ggcggtcccg gcagccagga ggtgatcccg gagatcacgc cggggtcctc ggccgaggcg    600
ttccggcgga tctggccggc ggcggcgcac gacaccagcg gatcgatggg gggcaagctg    660
acggcgctcc tcgaccacgc ccgcgacggc gccgagtgct tcgtgcttcg aggcgatccg    720
acggcgccgg acctgcggtt cctgctcgaa gggcgcggca ggtggcccga cgtgccgcac    780
acccggatcg tcgccgacac caccggttga                                     810

<210> 99
<211> 810
<212> DNA
<213> Artificial Sequence

<220>
<223> IP9 codon optimised

<400> 99
atggtggatg atgttgttga tctgctggtg gttaaagtgg gtggtagtct ggttagtgaa     60
aaagcacgcc gtgatcatct ggatcatgat gccctggcag gttatgccgc acagattgcc    120
gatctgcacg ctgccgcccc tggtcgtgtg gtgctggtgg ttggcggcgg tagcattggt    180
catggcgcag tgcgtcatct ggatgcagat gatccgctgg caccgctgcc gctgacccgt    240
gcaaccttcg atgttaaatg gagttgggtg cgcgcactgc gcgatctggg cagccgttgc    300
ttcccggttc aggttgcagc catctgtgtg ctgggtccgc gcggtccgga agttagcttc    360
ggcaccgtgc gtcgtctgct ggatcatggt attctgccgg tgctggcagg tgatagcgtt    420
ctgtgtgcag atggcgcact gcgcgtgttc ggcagcgatc atgtgccggc cgttgcagtg    480
cgtggtaccc ctggtcgcac ccgtgttgca gtgctgaccg atgtgccggg tgttctggcc    540
ggtggtccgg gtagccagga agtgattccg gaaattaccc ctggtagtag cgccgaagcc    600
ttccgccgta tctggccggc cgccgcacat gataccagcg gtagtatggg tggcaaactg    660
accgcactgc tggatcacgc ccgtgatggc gccgaatgct tcgttctgcg tggcgatccg    720
accgccccgg atctgcgctt cctgctggaa ggtcgcggcc gttggccgga tgtgccgcat    780
acccgtattg tggccgatac caccggctga                                     810

<210> 100
<211> 302
<212> PRT
<213> Bifiguratus adelaidae

<220>
<223> IP10

<400> 100
Met Thr Phe Glu Leu Cys Leu Val Lys Val Gly Gly Ala Ala Leu Thr
1               5                   10                  15
Asp Lys Ser Arg Pro Ser Thr Leu Ala Asp Pro Lys Ile Leu Thr Thr
            20                  25                  30
Ile Ala Ser Gln Leu Gly Thr Ala Phe Ser Thr Phe Ser Gly Thr Arg
        35                  40                  45
Lys Arg Leu Val Ile Val His Gly Val Gly Ser Phe Gly His Pro Gln
    50                  55                  60
Ala Lys Lys Tyr Asn Leu Ser Ser Gly Tyr Asp Met Glu Asn Ala Ser
65                  70                  75                  80
Glu Asp Glu Lys Glu Tyr Lys Val Asp Gly Val Val Glu Thr Arg Gln
                85                  90                  95
Ser Val Met Thr Leu His Gln Lys Val Cys Asp Leu Phe Ile Ala Gln
            100                 105                 110
Gly Ile Pro Ala Ile Ser Met Ser Pro Phe His Tyr Val Arg Thr Leu
        115                 120                 125
His Thr Pro Lys Ser Thr Lys Pro Asp Ala Tyr Ile Arg Leu Val Glu
    130                 135                 140
Ala Val Asp Arg Ala Leu Thr Leu Gly Tyr Val Pro Val Leu His Gly
145                 150                 155                 160
Asp Ala Val Leu Asp Asp Ala Gln Gly Cys Ala Ile Leu Ser Gly Asp
                165                 170                 175
Val Val Ile Arg Glu Leu Ala Arg Gly Leu Arg Ala Gly Pro Ser Pro
            180                 185                 190
Lys Tyr Ser Leu His Asn Cys Thr Phe Leu Thr Asp Val Asn Gly Val
        195                 200                 205
Phe Asp Arg Asp Pro Lys Leu Thr Met Asp Glu Pro Pro His Leu Ile
    210                 215                 220
Gln Ser Ile Lys Ile Ser Lys Tyr Gln Val Glu Arg Met Val Asn His
225                 230                 235                 240
Ser Ser Ser Ile Asp Val Thr Gly Ala Met Thr Gly Lys Leu Gln Cys
                245                 250                 255
Ala Met Asn Ile Val Lys Asp Ala Leu Glu Ala Gly Ile Glu Pro Ile
            260                 265                 270
Gly Gln Val Ile Ile Cys Arg Ala Ala Ser Ala Asp Ala Met His Val
        275                 280                 285
Leu Cys Gly Gln Glu Ala Asp Asn Arg Thr Val Val Glu Pro
    290                 295                 300

<210> 101
<211> 909
<212> DNA
<213> Bifiguratus adelaidae

<220>
<223> IP10 wild type

<400> 101
atgacctttg agctgtgcct tgtgaaggtg ggaggtgccg cgttgacgga caaaagcagg     60
ccgtcaacgt tggccgaccc caaaatactc accaccatag catcacaact cggcacagca    120
ttctcgacct ttagtggaac gcgcaaacga ctcgtcatag ttcacggtgt gggctcgttt    180
ggtcacccgc aagctaaaaa gtacaatctc tccagtggat atgatatgga gaatgcctcc    240
gaggacgaga aagagtacaa ggtagacgga gttgtggaga cgcggcagtc tgtcatgaca    300
ttgcatcaaa aggtatgcga tctattcatt gctcagggca ttccagcaat cagtatgtcg    360
ccttttcatt atgtgcgaac cctgcatacc cctaagtcga caaagccaga cgcgtatata    420
cgattggtgg aggcggttga ccgagcattg acgctcggat atgtgccagt gttgcacggc    480
gatgcagtcc tcgatgatgc gcaaggttgc gccatcctca gtggcgatgt ggtgatccgg    540
gagcttgccc gtgggctacg cgctggacca tcacctaaat acagtcttca caattgtaca    600
ttcttaacgg atgtgaacgg tgtctttgac cgcgatccaa agttaactat ggatgaacca    660
ccgcatctga tccagagcat caagatcagc aaataccaag tagagaggat ggtgaatcat    720
tcatcttcca tcgatgtcac tggtgccatg acgggaaagc ttcaatgcgc aatgaacatt    780
gtcaaagatg cgcttgaggc cggtattgag cctatagggc aagtcattat atgtagagca    840
gcgtctgcgg acgccatgca tgtgctttgt ggtcaagagg ctgacaatag gaccgtcgtt    900
gagccttaa                                                            909

<210> 102
<211> 909
<212> DNA
<213> Artificial Sequence

<220>
<223> IP10 codon optimised

<400> 102
atgaccttcg aactgtgtct ggtgaaagtt ggtggtgcag cactgaccga taaaagtcgt     60
ccgagtaccc tggcagatcc gaaaattctg accaccattg caagccagct gggcaccgca    120
ttcagtacct tcagcggcac ccgcaaacgt ctggtgattg ttcatggtgt tggcagcttc    180
ggtcatccgc aggccaaaaa atataatctg agcagtggct atgatatgga aaatgccagt    240
gaagatgaaa aagaatataa ggttgacggc gtggtggaaa ccagacagag cgttatgacc    300
ctgcatcaga aagtgtgcga tctgttcatt gcccagggta ttccggccat tagcatgagt    360
ccgttccatt atgttcgtac cctgcatacc ccgaaaagta ccaaaccgga tgcctatatt    420
cgcctggttg aagccgtgga tcgcgccctg accctgggct atgttccggt gctgcatggc    480
gatgcagtgc tggatgatgc acagggctgt gcaattctga gcggcgatgt tgttattcgt    540
gaactggccc gtggcctgcg tgccggtcct agtcctaaat atagtctgca taattgtacc    600
ttcctgaccg atgtgaatgg cgtgttcgat cgtgatccga aactgacaat ggatgaaccg    660
ccgcatctga ttcagagcat taaaattagc aaataccagg tggaacgcat ggtgaatcat    720
agtagcagta ttgatgttac cggtgccatg accggtaaac tgcaatgcgc catgaatatt    780
gtgaaagatg ccctggaagc aggcattgaa ccgattggtc aggtgattat ctgtcgtgca    840
gcaagcgccg atgccatgca tgttctgtgt ggtcaggaag ccgataatcg caccgtggtg    900
gaaccgtga                                                            909

<210> 103
<211> 245
<212> PRT
<213> Thermoplasma acidophilum

<220>
<223> IP11

<400> 103
Met Met Ile Leu Lys Ile Gly Gly Ser Val Ile Thr Asp Lys Ser Ala
1               5                   10                  15
Tyr Arg Thr Ala Arg Thr Tyr Ala Ile Arg Ser Ile Val Lys Val Leu
            20                  25                  30
Ser Gly Ile Glu Asp Leu Val Cys Val Val His Gly Gly Gly Ser Phe
        35                  40                  45
Gly His Ile Lys Ala Met Glu Phe Gly Leu Pro Gly Pro Lys Asn Pro
    50                  55                  60
Arg Ser Ser Ile Gly Tyr Ser Ile Val His Arg Asp Met Glu Asn Leu
65                  70                  75                  80
Asp Leu Met Val Ile Asp Ala Met Ile Glu Met Gly Met Arg Pro Ile
                85                  90                  95
Ser Val Pro Ile Ser Ala Leu Arg Tyr Asp Gly Arg Phe Asp Tyr Thr
            100                 105                 110
Pro Leu Ile Arg Tyr Ile Asp Ala Gly Phe Val Pro Val Ser Tyr Gly
        115                 120                 125
Asp Val Tyr Ile Lys Asp Glu His Ser Tyr Gly Ile Tyr Ser Gly Asp
    130                 135                 140
Asp Ile Met Ala Asp Met Ala Glu Leu Leu Lys Pro Asp Val Ala Val
145                 150                 155                 160
Phe Leu Thr Asp Val Asp Gly Ile Tyr Ser Lys Asp Pro Lys Arg Asn
                165                 170                 175
Pro Asp Ala Val Leu Leu Arg Asp Ile Asp Thr Asn Ile Thr Phe Asp
            180                 185                 190
Arg Val Gln Asn Asp Val Thr Gly Gly Ile Gly Lys Lys Phe Glu Ser
        195                 200                 205
Met Val Lys Met Lys Ser Ser Val Lys Asn Gly Val Tyr Leu Ile Asn
    210                 215                 220
Gly Asn His Pro Glu Arg Ile Gly Asp Ile Gly Lys Glu Ser Phe Ile
225                 230                 235                 240
Gly Thr Val Ile Arg
                245

<210> 104
<211> 738
<212> DNA
<213> Thermoplasma acidophilum

<220>
<223> IP11 wild type

<400> 104
atgatgatac tgaagatagg cggaagcgtg atcaccgata aatccgctta ccggactgcc     60
aggacgtacg ccataaggag catagttaag gttctatccg gaatcgagga tctggtgtgc    120
gtggttcacg gcggcggttc cttcggccat ataaaggcga tggaatttgg actgccaggc    180
ccaaagaacc caagatccag cattggatac agcatagttc acagagacat ggagaatctt    240
gatctcatgg tcatcgatgc aatgatagag atgggcatgc gaccgatatc cgtgccaata    300
agcgccctgc gctatgacgg ccgcttcgat tacacccctc ttatcaggta catagatgct    360
ggttttgttc ctgtatcata cggtgacgta tatataaagg atgaacattc atatggtata    420
tactctggag acgacataat ggccgacatg gcagaactgc tgaaaccaga tgtagctgta    480
ttccttaccg atgttgatgg catctacagc aaggatccga agagaaatcc tgatgcggtg    540
cttctcaggg atatagatac aaatatcacc ttcgatcggg tgcagaacga tgttaccggc    600
ggtataggga agaagttcga atccatggtg aagatgaaga gcagcgtgaa aaacggagtt    660
tacctgatca acgggaacca tccagagagg atcggagaca ttggaaagga atcattcata    720
ggtacggtga taagatga                                                  738

<210> 105
<211> 738
<212> DNA
<213> Artificial Sequence

<220>
<223> IP11 codon optimised

<400> 105
atgatgatcc tgaaaatcgg cggtagcgtg attaccgata aaagcgccta tcgtaccgcc     60
cgtacctatg ccattcgtag tattgttaaa gttctgagtg gcattgaaga tctggtgtgt    120
gttgttcatg gtggtggcag cttcggtcat attaaagcaa tggagttcgg cctgccgggt    180
ccgaaaaatc cgcgtagtag cattggctat agtattgttc atcgtgatat ggaaaacctg    240
gatctgatgg tgattgatgc catgattgaa atgggcatga gaccgattag tgtgccgatt    300
agtgccctgc gttatgatgg tcgcttcgat tataccccgc tgattcgcta tattgatgcc    360
ggcttcgtgc cggtgagtta tggtgatgtg tatattaaag atgagcatag ctatggtatc    420
tatagcggtg atgatattat ggccgatatg gccgaactgc tgaaaccgga tgttgccgtg    480
ttcctgaccg atgttgatgg tatctattca aaagatccga aacgtaatcc ggatgccgtg    540
ctgctgcgcg atattgatac caatattacc ttcgatcgtg tgcagaatga tgtgaccggt    600
ggtattggta aaaaattcga aagtatggtt aagatgaaga gtagcgttaa aaatggtgtg    660
tatctgatta atggtaacca tccggaacgt attggcgata ttggcaaaga atcattcatt    720
ggtaccgtga ttcgctga                                                  738

<210> 106
<211> 270
<212> PRT
<213> Tumebacillus algifaecis

<220>
<223> IP12

<400> 106
Met Asn Val Val Lys Ile Gly Gly Ser Leu Leu Thr Asp Lys Asp Gly
1               5                   10                  15
Tyr Cys Ala Pro Asn Gln Glu Met Val Arg Gln Tyr Ala Arg Thr Ile
            20                  25                  30
Ala Lys Glu Trp Glu Arg Leu Arg Gly Asn Leu Ile Leu Ile Val Gly
        35                  40                  45
Gly Gly Ser Tyr Gly Asn Ala Val Pro Val Arg Tyr His Leu Lys Asp
    50                  55                  60
Ala Ser Leu Pro Trp Lys Asp Thr Asp Leu Ser Met Met Thr Val Lys
65                  70                  75                  80
Met Phe Glu Trp Leu Ser Leu Val Thr Gln Ile Phe Arg Glu Glu Gly
                85                  90                  95
Val Pro Cys Tyr Pro Phe Gln Thr Ser Gly Tyr Val Val Thr Lys Asn
            100                 105                 110
Lys Arg Pro Gln Arg Phe Phe Val Glu Pro Val Glu His Val Leu Ser
        115                 120                 125
Met Gly Val Leu Pro Val Phe Ser Gly Asp Leu Val Phe Asp Glu Glu
    130                 135                 140
Gln Gln Phe Ile Ile Phe Ser Ser Asp Asn Leu Pro Glu Leu Phe Val
145                 150                 155                 160
Glu Arg Met Ser Leu Arg Arg Met Val Met Leu Thr Asp Val Glu Gly
                165                 170                 175
Val Met Gln Ile Gly Thr Asp Gly Gln Gln Thr Val Ile Pro Glu Val
            180                 185                 190
Thr Arg Ala Asn Phe Gln Glu Val Leu Arg Cys Ala Gly Pro Ser Gln
        195                 200                 205
Lys Pro Asp Ile Thr Gly Gly Met Lys Asn Lys Leu Glu Ala Leu Leu
    210                 215                 220
Arg Leu Ala Glu Gln Gly Val Glu Gly Val Ile Thr Ser Gly Arg Lys
225                 230                 235                 240
Ala Glu Ala Leu Leu Pro Ala Leu Phe Glu Pro Glu Pro Val Gly Thr
                245                 250                 255
Met Ile Arg Pro Trp Ala Gln Glu Asn Arg Gly Gly Leu Leu
            260                 265                 270

<210> 107
<211> 813
<212> DNA
<213> Tumebacillus algifaecis

<220>
<223> IP12 wild type

<400> 107
atgaatgtag tcaagattgg agggagtttg ctgaccgata aggacggcta ctgtgcgccg     60
aatcaggaga tggtgcggca gtatgcgcgc acgatcgcca aggagtggga gcggctgcgt    120
ggcaacttga tcctgatcgt cggcggaggc tcgtatggga atgcagtgcc ggttcgctat    180
cacttgaagg atgcttccct gccgtggaaa gatacggacc tgtcgatgat gacggtgaag    240
atgtttgagt ggttgtcgct ggtgacccag atttttcggg aagagggagt gccttgctat    300
ccgttccaga ccagcgggta tgtggtgacg aaaaacaaac ggccccagcg tttttttgtg    360
gagccggtgg agcacgtgct gtcgatgggg gtgctgcccg ttttctcggg cgacctggtc    420
tttgatgagg agcagcagtt tatcattttt tcgagtgata atttgcccga gttgtttgtc    480
gagcggatgt ctctccggcg gatggtgatg ctgaccgatg ttgagggagt gatgcaaatc    540
ggaacggatg ggcagcagac ggtgattcct gaagtgacgc gtgcaaattt tcaggaggtg    600
ctgcgctgtg cggggccttc gcaaaagccg gatatcacag gcggaatgaa aaataagctg    660
gaagctctgt tgcgcttggc ggagcaggga gtggaaggtg tgatcaccag cggaaggaag    720
gcggaggcat tgctgccagc gttgtttgag cccgagcctg taggcacgat gattcggcct    780
tgggcacaag agaatagagg gggattgctg tag                                 813

<210> 108
<211> 813
<212> DNA
<213> Artificial Sequence

<220>
<223> IP12 codon optimised

<400> 108
atgaacgtgg tgaaaattgg tggcagcctg ctgaccgata aagatggcta ttgcgcaccg     60
aatcaggaaa tggttcgcca gtatgcccgc accattgcaa aagaatggga acgtctgcgt    120
ggtaatctga ttctgattgt tggcggtggc agctatggca atgcagttcc ggttcgctat    180
catctgaaag atgccagcct gccgtggaaa gataccgatc tgagcatgat gaccgttaaa    240
atgttcgaat ggctgagtct ggttacccag atcttccgtg aagaaggtgt gccgtgttat    300
ccgttccaga ccagcggtta tgtggtgacc aaaaataaac gtccgcagcg cttcttcgtt    360
gaaccggtgg aacatgtgct gagtatgggt gtgctgccgg tgttcagcgg tgatctggtg    420
ttcgatgaag aacagcagtt cattatcttc agtagcgata atctgccgga actgttcgtg    480
gaacgcatga gcctgcgtcg tatggttatg ctgaccgatg ttgaaggtgt tatgcagatt    540
ggcaccgatg gccagcagac cgtgattccg gaagtgaccc gcgcaaactt ccaggaagtg    600
ctgcgctgcg ccggtccgag tcagaaaccg gatattaccg gtggcatgaa aaataaactg    660
gaagccctgc tgcgcctggc agaacagggt gttgaaggtg tgattaccag cggtcgcaaa    720
gccgaagcac tgctgccggc cctgttcgaa ccggaaccgg tgggcaccat gattcgcccg    780
tgggcccagg aaaatcgtgg tggtctgctg tga                                 813

<210> 109
<211> 5365
<212> DNA
<213> Artificial Sequence

<220>
<223> Plasmid

<400> 109
cgatcaccac aattcagcaa attgtgaaca tcatcacgtt catctttccc tggttgccaa     60
tggcccattt tcctgtcagt aacgagaagg tcgcgaattc aggcgctttt tagactggtc    120
gtaatgaaca attcttaaga aggagatata catatgcaga caagaaaaat cgtccgggca    180
gccgccgtac aggccgcctc tcccaactac gatctggcaa cgggtgttga taaaaccatt    240
gagctggctc gtcaggcccg cgatgagggc tgtgacctga tcgtgtttgg tgaaacctgg    300
ctgcccggat atcccttcca cgtctggctg ggcgcaccgg cctggtcgct gaaatacagt    360
gcccgctact atgccaactc gctctcgctg gacagtgcag agtttcaacg cattgcccag    420
gccgcacgga ccttgggtat tttcatcgca ctgggttata gcgagcgcag cggcggcagc    480
ctttacctgg gccaatgcct gatcgacgac aagggcgaga tgctgtggtc gcgtcgcaaa    540
ctcaaaccca cgcatgtaga gcgcaccgta tttggtgaag gttatgcccg tgatctgatt    600
gtgtccgaca cagaactggg acgcgtcggt gctctatgct gctgggagca tttgtcgccc    660
ttgagcaagt acgcgctgta ctcccagcat gaagccattc acattgctgc ctggccgtcg    720
ttttcgctat acagcgaaca ggcccacgcc ctcagtgcca aggtgaacat ggctgcctcg    780
caaatctatt cggttgaagg ccagtgcttt accatcgccg ccagcagtgt ggtcacccaa    840
gagacgctag acatgctgga agtgggtgaa cacaacgccc ccttgctgaa agtgggcggc    900
ggcagttcca tgatttttgc gccggacgga cgcacactgg ctccctacct gcctcacgat    960
gccgagggct tgatcattgc cgatctgaat atggaggaga ttgccttcgc caaagcgatc   1020
aatgaccccg taggccacta ttccaaaccc gaggccaccc gtctggtgct ggacttgggg   1080
caccgagacc ccatgactcg ggtgcactcc aaaagcgtga ccagggaaga ggctcccgag   1140
caaggtgtgc aaagcaagat tgcctcagtc gctatcagcc atccacagga ctcggacaca   1200
ctgctagtgc aagagccgtc cttgaggatc cgtcgacctg cagccaagct tggctgtttt   1260
ggcggatgag agaagatttt cagcctgata cagattaaat cagaacgcag aagcggtctg   1320
ataaaacaga atttgcctgg cggcagtagc gcggtggtcc cacctgaccc catgccgaac   1380
tcagaagtga aacgccgtag cgccgatggt agtgtggggt ctccccatgc gagagtaggg   1440
aactgccagg catcaaataa aacgaaaggc tcagtcgaaa gactgggcct ttcgttttat   1500
ctgttgtttg tcggtgaacg ctctcctgag taggacaaat ccgccgggag cggatttgaa   1560
cgttgcgaag caacggcccg gagggtggcg ggcaggacgc ccgccataaa ctgccaggca   1620
tcaaattaag cagaaggcca tcctgacgga tggccttttt gcgtttctac aaactctttt   1680
gtttattttt ctaaatacat tcaaatatgt atccgctcat gagacaataa ccctgataaa   1740
tgcttcaata atattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta   1800
ttcccttttt tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag   1860
taaaagatgc tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca   1920
gcggtaagat ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta   1980
aagttctgct atgtggcgcg gtattatccc gtgttgacgc cgggcaagag caactcggtc   2040
gccgcataca ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc   2100
ttacggatgg catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca   2160
ctgcggccaa cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc   2220
acaacatggg ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca   2280
taccaaacga cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac   2340
tattaactgg cgaactactt actctagctt cccggcaaca attaatagac tggatggagg   2400
cggataaagt tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg   2460
ataaatctgg agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg   2520
gtaagccctc ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac   2580
gaaatagaca gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc   2640
aagtttactc atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct   2700
aggtgaagat cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc   2760
actgagcgtc agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc   2820
gcgtaatctg ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg   2880
atcaagagct accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa   2940
atactgtcct tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc   3000
ctacatacct cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt   3060
gtcttaccgg gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa   3120
cggggggttc gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc   3180
tacagcgtga gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc   3240
cggtaagcgg cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct   3300
ggtatcttta tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat   3360
gctcgtcagg ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc   3420
tggccttttg ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg   3480
ataaccgtat taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc   3540
gcagcgagtc agtgagcgag gaagcggaag agcgcctgat gcggtatttt ctccttacgc   3600
atctgtgcgg tatttcacac cgcatatatg gtgcactctc agtacaatct gctctgatgc   3660
cgcatagtta agccagtata cactccgcta tcgctacgtg actgggtcat ggctgcgccc   3720
cgacacccgc caacacccgc tgacgcgccc tgacgggctt gtctgctccc ggcatccgct   3780
tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc accgtcatca   3840
ccgaaacgcg cgaggcagct gcggtaaagc tcatcagcgt ggtcgtgaag cgattcacag   3900
atgtctgcct gttcatccgc gtccagctcg ttgagtttct ccagaagcgt taatgtctgg   3960
cttctgataa agcgggccat gttaagggcg gttttttcct gtttggtcac tgatgcctcc   4020
gtgtaagggg gatttctgtt catgggggta atgataccga tgaaacgaga gaggatgctc   4080
acgatacggg ttactgatga tgaacatgcc cggttactgg aacgttgtga gggtaaacaa   4140
ctggcggtat ggatgcggcg ggaccagaga aaaatcactc agggtcaatg ccagcgcttc   4200
gttaatacag atgtaggtgt tccacagggt agccagcagc atcctgcgat gcagatccgg   4260
aacataatgg tgcagggcgc tgacttccgc gtttccagac tttacgaaac acggaaaccg   4320
aagaccattc atgttgttgc tcaggtcgca gacgttttgc agcagcagtc gcttcacgtt   4380
cgctcgcgta tcggtgattc attctgctaa ccagtaaggc aaccccgcca gcctagccgg   4440
gtcctcaacg acaggagcac gatcatgcgc acccgtggcc aggacccaac gctgcccgag   4500
atgcgccgcg tgcggctgct ggagatggcg gacgcgatgg atatgttctg ccaagggttg   4560
gtttgcgcat tcacagttct ccgcaagaat tgattggctc caattcttgg agtggtgaat   4620
ccgttagcga ggtgccgccg gcttccattc aggtcgaggt ggcccggctc catgcaccgc   4680
gacgcaacgc ggggaggcag acaaggtata gggcggcgcc tacaatccat gccaacccgt   4740
tccatgtgct cgccgaggcg gcataaatcg ccgtgacgat cagcggtcca atgatcgaag   4800
ttaggctggt aagagccgcg agcgatcctt gaagctgtcc ctgatggtcg tcatctacct   4860
gcctggacag catggcctgc aacgcgggca tcccgatgcc gccggaagcg agaagaatca   4920
taatggggaa ggccatccag cctcgcgtcg cgaacgccag caagacgtag cccagcgcgt   4980
cggccgccat gccggcgata atggcctgct tctcgccgaa acgtttggtg gcgggaccag   5040
tgacgaaggc ttgagcgagg gcgtgcaaga ttccgaatac cgcaagcgac aggccgatca   5100
tcgtcgcgct ccagcgaaag cggtcctcgc cgaaaatgac ccagagcgct gccggcacct   5160
gtcctacgag ttgcatgata aagaagacag tcataagtgc ggcgacgata gtcatgcccc   5220
gcgcccaccg gaaggagctg actgggttga aggctctcaa gggcatcggt cgacgctctc   5280
ccttatgcga ctcctgcatt aggaagcagc ccagtagtag gttgaggccg ttgagcaccg   5340
ccgccgcaag gaatggtgca tgcat                                         5365

<210> 110
<211> 5016
<212> DNA
<213> Artificial Sequence

<220>
<223> Plasmid IPK11 in pDHE165

<400> 110
catatgatga tcctgaaaat cggcggtagc gtgattaccg ataaaagcgc ctatcgtacc     60
gcccgtacct atgccattcg tagtattgtt aaagttctga gtggcattga agatctggtg    120
tgtgttgttc atggtggtgg cagcttcggt catattaaag caatggagtt cggcctgccg    180
ggtccgaaaa atccgcgtag tagcattggc tatagtattg ttcatcgtga tatggaaaac    240
ctggatctga tggtgattga tgccatgatt gaaatgggca tgagaccgat tagtgtgccg    300
attagtgccc tgcgttatga tggtcgcttc gattataccc cgctgattcg ctatattgat    360
gccggcttcg tgccggtgag ttatggtgat gtgtatatta aagatgagca tagctatggt    420
atctatagcg gtgatgatat tatggccgat atggccgaac tgctgaaacc ggatgttgcc    480
gtgttcctga ccgatgttga tggtatctat tcaaaagatc cgaaacgtaa tccggatgcc    540
gtgctgctgc gcgatattga taccaatatt accttcgatc gtgtgcagaa tgatgtgacc    600
ggtggtattg gtaaaaaatt cgaaagtatg gttaagatga agagtagcgt taaaaatggt    660
gtgtatctga ttaatggtaa ccatccggaa cgtattggcg atattggcaa agaatcattc    720
attggtaccg tgattcgctg actgaaagct tggctgtttt ggcggatgag agaagatttt    780
cagcctgata cagattaaat cagaacgcag aagcggtctg ataaaacaga atttgcctgg    840
cggcagtagc gcggtggtcc cacctgaccc catgccgaac tcagaagtga aacgccgtag    900
cgccgatggt agtgtggggt ctccccatgc gagagtaggg aactgccagg catcaaataa    960
aacgaaaggc tcagtcgaaa gactgggcct ttcgttttat ctgttgtttg tcggtgaacg   1020
ctctcctgag taggacaaat ccgccgggag cggatttgaa cgttgcgaag caacggcccg   1080
gagggtggcg ggcaggacgc ccgccataaa ctgccaggca tcaaattaag cagaaggcca   1140
tcctgacgga tggccttttt gcgtttctac aaactctttt gtttattttt ctaaatacat   1200
tcaaatatgt atccgctcat gagacaataa ccctgataaa tgcttcaata atattgaaaa   1260
ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt   1320
gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt   1380
tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt   1440
ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg   1500
tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga   1560
atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa   1620
gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga   1680
caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa   1740
ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca   1800
ccacgatgcc tgtagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta   1860
ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac   1920
ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc   1980
gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag   2040
ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga   2100
taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt   2160
agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata   2220
atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag   2280
aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa   2340
caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt   2400
ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc   2460
cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa   2520
tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa   2580
gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc   2640
ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa   2700
gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa   2760
caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg   2820
ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc   2880
tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg   2940
ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg   3000
agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg   3060
aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc   3120
gcatatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac   3180
actccgctat cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct   3240
gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc   3300
tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg   3360
cggtaaagct catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg   3420
tccagctcgt tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg   3480
ttaagggcgg ttttttcctg tttggtcact tgatgcctcc gtgtaagggg gaatttctgt   3540
tcatgggggt aatgataccg atgaaacgag agaggatgct cacgatacgg gttactgatg   3600
atgaacatgc ccggttactg gaacgttgtg agggtaaaca actggcggta tggatgcggc   3660
gggaccagag aaaaatcact cagggtcaat gccagcgctt cgttaataca gatgtaggtg   3720
ttccacaggg tagccagcag catcctgcga tgcagatccg gaacataatg gtgcagggcg   3780
ctgacttccg cgtttccaga ctttacgaaa cacggaaacc gaagaccatt catgttgttg   3840
ctcaggtcgc agacgttttg cagcagcagt cgcttcacgt tcgctcgcgt atcggtgatt   3900
cattctgcta accagtaagg caaccccgcc agcctagccg ggtcctcaac gacaggagca   3960
cgatcatgcg cacccgtggc caggacccaa cgctgcccga gatgcgccgc gtgcggctgc   4020
tggagatggc ggacgcgatg gatatgttct gccaagggtt ggtttgcgca ttcacagttc   4080
tccgcaagaa ttgattggct ccaattcttg gagtggtgaa tccgttagcg aggtgccgcc   4140
ggcttccatt caggtcgagg tggcccggct ccatgcaccg cgacgcaacg cggggaggca   4200
gacaaggtat agggcggcgc ctacaatcca tgccaacccg ttccatgtgc tcgccgaggc   4260
ggcataaatc gccgtgacga tcagcggtcc agtgatcgaa gttaggctgg taagagccgc   4320
gagcgatcct tgaagctgtc cctgatggtc gtcatctacc tgcctggaca gcatggcctg   4380
caacgcgggc atcccgatgc cgccggaagc gagaagaatc ataatgggga aggccatcca   4440
gcctcgcgtc gcgaacgcca gcaagacgta gcccagcgcg tcggccgcca tgccggcgat   4500
aatggcctgc ttctcgccga aacgtttggt ggcgggacca gtgacgaagg cttgagcgag   4560
ggcgtgcaag attccgaata ccgcaagcga caggccgatc atcgtcgcgc tccagcgaaa   4620
gcggtcctcg ccgaaaatga cccagagcgc tgccggcacc tgtcctacga gttgcatgat   4680
aaagaagaca gtcataagtg cggcgacgat agtcatgccc cgcgcccacc ggaaggagct   4740
gactgggttg aaggctctca agggcatcgg tcgacgctct cccttatgcg actcctgcat   4800
taggaagcag cccagtagta ggttgaggcc gttgagcacc gccgccgcaa ggaatggtgc   4860
atgcatcgat caccacaatt cagcaaattg tgaacatcat cacgttcatc tttccctggt   4920
tgccaatggc ccattttcct gtcagtaacg agaaggtcgc gaattcaggc gctttttaga   4980
ctggtcgtaa tgaacaattc ttaagaagga gatata                             5016

<210> 111
<211> 273
<212> PRT
<213> Streptomyces silvensis

<400> 111
Met Ser Gly Pro Ala Gly Asp Gly Leu Ile Ala Thr Ile Arg Glu Arg
1               5                   10                  15
Arg Pro Leu Val His Met Ile Thr Asn Leu Val Ser Met Ala Ala Cys
            20                  25                  30
Ala Gln Thr Val Lys Ser Leu Gly Ala Ala Thr Ile Phe Ala His Ala
        35                  40                  45
Ala Glu Glu Ala Ala Glu Ile Ala Gly Thr Ala Asp Ala Val Val Leu
    50                  55                  60
Asn Val Gly Thr Ser Val Pro Gly Met Asp Arg Thr Ala Val Gln Val
65                  70                  75                  80
Ala Glu Ala Cys Ala Ala Arg Ser Ile Pro Val Val Leu Asp Pro Leu
                85                  90                  95
Gly Ser Gly Ala Ser Arg Phe Arg Ser His Leu Ala Arg Ala Leu Leu
            100                 105                 110
Asp Thr Gly Ala Val Arg Met Val Ser Gly Asn Val Ala Glu Leu Ala
        115                 120                 125
Asp Leu Cys Gly Val Pro Ser Val Ile Arg Gly Ala Asp Ala Val Ser
    130                 135                 140
Ala Thr Ala Pro Ala Asp Glu Val Cys Met Lys Leu Ala Glu Ser Ala
145                 150                 155                 160
Gln Val Ile Ala Ala Val Ser Gly Arg Thr Asp Tyr Val Gly Asp Gly
                165                 170                 175
Arg Gln Leu Ala Ala Ile Thr Asn Gly His Pro Val Met Gly Gln Val
            180                 185                 190
Val Gly Thr Gly Ser Ala Arg Ser Ala Val Leu Gly Ala Phe Ala Ala
        195                 200                 205
Val Ala Gly Ala Asp Met Phe Thr Ala Thr Val Thr Gly Val Cys Ala
    210                 215                 220
Tyr Gly Ile Ala Gly Glu Leu Ala Ala Ala Thr Gly Arg Gly Pro Gly
225                 230                 235                 240
Tyr Leu Leu Pro Glu Val Cys Asn Gln Leu Ser Val Met Asp Asp Glu
                245                 250                 255
Met Val Ala Thr Arg Ser Arg Val Thr Thr Ser Ala Pro Arg Asp Pro
            260                 265                 270
Ser

<210> 112
<211> 822
<212> DNA
<213> Streptomyces silvensis

<400> 112
atgagcgggc cggcaggtga cgggctgatc gccacgatcc gcgagcgccg cccgctcgta     60
cacatgatca ccaatctggt gtcgatggct gcctgtgcgc agaccgtgaa gtcgctgggc    120
gccgcgacca tcttcgcgca cgcggccgag gaggcggcgg agatcgccgg gacggcggac    180
gccgtggtgc tcaacgtcgg gacctccgtg ccggggatgg accggaccgc ggtacaggtc    240
gccgaggcct gtgcggcacg gtcgatcccc gtcgtcctcg acccgttggg atccggggcc    300
agccggttcc gctcgcacct cgccagggcg ctgctcgaca ccggagccgt ccgcatggtg    360
tccggcaatg tcgccgagct ggcggacctg tgtggtgtcc cgtcggtgat acgtggcgcc    420
gacgccgtca gcgccaccgc accagcggac gaggtgtgca tgaagctggc ggagtccgcc    480
caggtcatcg ccgccgtctc cggccgcacc gactacgtgg gcgacggacg gcagctggcc    540
gcgatcacca acggacatcc ggtcatggga caggtggtcg gcaccggcag cgcacggtcc    600
gcggtgctcg gcgccttcgc ggccgtcgca ggtgcggaca tgttcaccgc gacggtcacc    660
ggggtctgcg cctacggcat cgccggtgag ctggcggcgg ccaccggcag gggacccggc    720
tatctcctgc ccgaggtctg caatcagctg tccgtcatgg atgacgagat ggtcgcgacc    780
cggtcgcggg tcaccacgtc cgcccccagg gatccctcgt ga                       822

<210> 113
<211> 832
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 113
atgagcggtc cggccggtga tggtctgatt gcaaccattc gcgaacgtcg tccgctggtt     60
cacatgatta ccaatctggt tagtatggca gcctgcgccc agaccgttaa aagtctgggc    120
gcagcaacca tcttcgccca tgcagcagaa gaagccgcag aaattgccgg taccgcagat    180
gccgttgttc tgaatgtggg caccagtgtg ccgggcatgg atcgcaccgc cgtgcaggtg    240
gcagaagcct gcgcagcacg tagcattccg gtggttctgg accctctggg tagtggtgca    300
agccgcttcc gtagtcatct ggcccgtgca ctgctggata ccggcgcagt gcgcatggtg    360
agtggtaatg ttgcagaact ggccgatctg tgtggtgtgc cgagcgttat tcgcggtgca    420
gatgcagtga gtgccaccgc accggcagat gaagtgtgca tgaaactggc agaaagcgca    480
caggtgattg ccgccgttag tggtcgtacc gattatgttg gtgatggtcg tcagctggcc    540
gccattacca atggccatcc ggttatgggt caggttgttg gcaccggtag cgcacgtagc    600
gccgttctgg gtgccttcgc cgcagtggcc ggcgcagata tgttcaccgc caccgtgacc    660
ggcgtgtgtg catacggtat tgccggtgaa ctggccgcag caaccggtcg tggtccgggt    720
tatctgctgc cggaagtgtg caatcagctg agcgtgatgg atgatgaaat ggttgccacc    780
cgtagccgtg ttaccaccag cgcaccgcgt gatccgagtt gactgaaagc tt            832

<210> 114
<211> 277
<212> PRT
<213> Halarchaeum acidiphilum

<400> 114
Met Thr Thr Asp Phe Thr Val Ala Glu Ser Leu Glu Arg Ile Ala Ala
1               5                   10                  15
Glu Gln Pro Leu Ile Asn Cys Val Thr Asn Ala Val Thr Val Asn Asp
            20                  25                  30
Val Ala Asn Val Thr Leu His Trp Gly Gly Leu Pro Val Met Ser Asp
        35                  40                  45
Asp Ala Arg Glu Val Gly Asp Met Val Ala Gly Ala Gln Gly Cys Leu
    50                  55                  60
Leu Asn Met Gly Thr Val Ser Glu Ala Gly Glu Glu Ala Met Leu Thr
65                  70                  75                  80
Ala Gly Asn Ala Ala Asn Asp His Gly Val Pro Leu Val Val Asp Pro
                85                  90                  95
Val Gly Val Gly Ala Thr Pro Thr Arg Asp Arg Val Ala Glu Ala Leu
            100                 105                 110
Val Thr Asp Leu Asp Pro Thr Ile Val Lys Gly Asn Tyr Gly Glu Ile
        115                 120                 125
Thr Ala Leu Ala Gly Ala Asp Ala Glu Val Arg Gly Val Glu Ser Val
    130                 135                 140
Gly Asp Tyr Ala Asp Val Ala Glu Thr Ala Val Ala Leu Ala Arg Asp
145                 150                 155                 160
Thr Gly Ala Val Val Val Ala Ser Gly Glu Thr Asp Val Val Ala Ser
                165                 170                 175
Ala Asp Ala Ala Tyr Glu Val Glu Asn Gly Asp Ala Met Leu Gly Thr
            180                 185                 190
Val Val Gly Thr Gly Cys Met Leu Gly Val Thr Leu Ala Val Phe Ala
        195                 200                 205
Ala Ala Leu Asp Asp Ala Glu Thr Ala Ala Leu Ala Gly Thr Leu Ala
    210                 215                 220
Phe Gly Val Ala Gly Glu Ala Ala Ala Lys Gly Asp Phe Gly Asp Tyr
225                 230                 235                 240
Ala Gly Pro Ala Ser Tyr Arg Val Ala Phe Leu Asp Ala Val Ala Gly
                245                 250                 255
Leu Asp Gly Val Glu Val Glu His Pro Asp Pro Asp Ala Arg Ile Thr
            260                 265                 270
Arg Val Leu Asp Ala
        275

<210> 115
<211> 834
<212> DNA
<213> Halarchaeum acidiphilum

<400> 115
atgaccaccg acttcaccgt cgccgagagc ctcgaacgca tcgccgccga acagccgctc     60
atcaactgcg tgacgaacgc cgtcaccgtc aacgacgtcg cgaacgtcac cctccactgg    120
ggcggtctcc ccgtgatgtc cgacgacgcc cgcgaagtcg gcgacatggt cgcgggcgcg    180
cagggctgtc tcctcaacat gggcaccgtg agcgaggccg gcgaggaagc gatgctgacg    240
gccggcaacg ccgcgaacga ccacggcgtc ccgctcgtcg tcgatcccgt cggcgtcggc    300
gcgacgccga cccgcgatcg cgtcgccgaa gccctcgtca ccgacctcga tccgacgatc    360
gtgaagggga actacggcga gataacggcg ctcgcgggcg cggatgcaga ggttcgcggc    420
gtcgagtccg tcggcgacta cgcggacgtc gcggagacgg ccgtcgcgct cgcccgcgac    480
accggcgccg tcgtcgtcgc gtcgggtgag acggacgtcg tcgcgagcgc cgatgcggcc    540
tacgaggtcg agaacggcga cgcgatgctc ggaacggtcg tcgggacggg ctgtatgctc    600
ggcgtgacgc tcgccgtgtt cgccgcggcg ctcgacgacg cggagacggc cgcgctcgcc    660
ggcacgctcg cgttcggcgt cgcgggcgag gccgccgcga agggcgactt cggcgactac    720
gcagggccgg cgagctaccg cgtcgcgttc ctcgacgccg tcgccggact cgacggggtc    780
gaggtcgagc accccgatcc cgacgcgcgg atcacgcgcg tcctcgacgc gtag          834

<210> 116
<211> 844
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 116
atgaccaccg acttcaccgt ggccgaaagc ctggaacgca ttgcagccga acagccgctg     60
attaattgtg ttaccaatgc cgttaccgtt aatgatgttg caaatgtgac cctgcattgg    120
ggtggtctgc cggtgatgag cgatgatgca cgtgaagttg gcgatatggt tgccggcgcc    180
cagggttgtc tgctgaatat gggcaccgtt agtgaagcag gcgaagaagc catgctgacc    240
gccggcaatg cagccaatga tcatggtgtg ccgctggtgg tggaccctgt gggtgttggc    300
gccaccccga cccgtgatcg tgtggcagaa gcactggtta ccgatctgga ccctaccatt    360
gtgaaaggta attatggcga aattaccgcc ctggccggtg ccgatgcaga agttcgtggc    420
gtggaaagcg ttggtgatta tgccgatgtg gccgaaaccg ccgttgccct ggcccgtgat    480
accggcgcag tggttgtggc aagcggtgaa accgatgtgg tggcaagtgc cgatgccgca    540
tacgaagtgg aaaatggtga tgccatgctg ggcaccgttg ttggtaccgg ttgtatgctg    600
ggtgtgaccc tggcagtgtt cgcagcagcc ctggatgatg ccgaaaccgc agcactggca    660
ggtaccctgg cattcggtgt ggcaggcgaa gccgcagcca aaggtgactt cggcgattat    720
gccggcccgg ccagctatcg tgttgccttc ctggatgcag ttgccggcct ggatggtgtt    780
gaagttgaac atccggaccc tgatgcccgt attacccgtg tgctggatgc atgactgaaa    840
gctt                                                                 844

<210> 117
<211> 258
<212> PRT
<213> Legionella rubrilucens

<400> 117
Met Leu Tyr Gln Leu Asp Thr Arg Leu Lys Arg Leu Arg Ala Glu Lys
1               5                   10                  15
Pro Leu Val Leu Cys Leu Thr Asn Ala Val Thr Met Asn Phe Val Ala
            20                  25                  30
Asn Ser Leu Leu Ala Leu Gly Ala Ala Pro Ile Met Ser Gln Ala Asp
        35                  40                  45
Asp Glu Leu Glu Ala Leu Ile Thr Ile Ser Arg Ala Leu Tyr Val Asn
    50                  55                  60
Ile Gly Thr Leu Asp Arg Gln Phe Ile Glu Arg Ile Asp Lys Ala Cys
65                  70                  75                  80
Arg Leu Ala Ala Ile His Gln Lys Pro Ile Ile Leu Asp Pro Val Gly
                85                  90                  95
Ala Gly Ala Ser Gln Ile Arg Thr Leu Thr Ala Arg Gln Leu Ser Pro
            100                 105                 110
Gln Ala Ala Ile Ile Arg Gly Asn Ala Ser Glu Ile Ile Ala Leu Thr
        115                 120                 125
Ala Asp Lys Gly Ile Ser Lys Gly Val Glu Thr Leu His Pro Val Glu
    130                 135                 140
Ala Ala Leu Thr Ala Ala Leu Thr Leu Ser Gln Gln Thr Ser Ala Ile
145                 150                 155                 160
Val Val Val Ser Gly Pro Val Asp Phe Ile Thr Asp Gly Arg Gln Cys
                165                 170                 175
Cys His Leu Ser Tyr Gly Ser Pro Leu Met Pro Leu Val Thr Gly Met
            180                 185                 190
Gly Cys Ala Leu Thr Ala Ile Ile Ala Ala Phe Ala Ala Met Glu Thr
        195                 200                 205
Pro Phe Tyr Glu Ala Ala Ala Ala Gly Thr Ala Tyr Val Gly Leu Cys
    210                 215                 220
Gly Gln Ser Ala His Arg Ser Ala Ser Gly Pro Ala Ser Phe Gln Ser
225                 230                 235                 240
Ala Phe Ile Asp Ala Leu Tyr Gln Met Pro Phe Glu Glu Leu Asp Asp
                245                 250                 255
Ala Leu

<210> 118
<211> 777
<212> DNA
<213> Legionella rubrilucens

<400> 118
atgctttatc aactggatac ccggcttaag cgcttgcgcg cggagaagcc tttggtgctc     60
tgtttaacga atgcagtgac catgaacttc gtagccaaca gcctgctggc gctgggggct    120
gcaccgatca tgtcacaggc agatgacgag cttgaggcac tgattaccat atcacgggct    180
ctctatgtca atattggtac cttggacaga cagtttattg agcggattga caaggcttgc    240
cgccttgccg ccattcacca gaaacccatc atactcgatc ccgtgggggc gggagccagt    300
caaatcagga cgttgacagc ccggcaactt tcaccccaag ccgccattat tcgcggtaac    360
gccagtgaaa ttattgcttt gacggcagat aaaggcatca gcaagggagt ggaaacgctt    420
catccggtgg aggcggcatt gactgctgca ttgactctat cacagcagac atcggcgatt    480
gtcgttgtca gcggccccgt ggattttatt acggatggcc gccaatgttg ccatttgtcc    540
tatggttcgc cgctgatgcc tttggtcacg ggaatggggt gtgccctgac cgccatcatt    600
gcggcctttg ccgcgatgga aacgcctttt tatgaagccg ccgctgcggg tacagcctat    660
gtcgggctat gcggccagtc agctcatcgt tcagccagtg gtccggccag ctttcagagc    720
gcgttcattg atgcccttta tcaaatgcca tttgaggagc tggatgatgc gctataa       777

<210> 119
<211> 787
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 119
atgctgtatc agctggatac ccgtctgaaa cgtctgcgtg cagaaaaacc gctggttctg     60
tgtctgacca atgccgtgac catgaacttc gttgcaaata gtctgctggc actgggtgcc    120
gccccgatta tgagccaggc cgatgatgaa ctggaagccc tgattaccat tagccgcgca    180
ctgtatgtta atattggtac cctggatcgc cagttcattg aacgcattga taaagcatgc    240
cgtctggcag ccattcatca gaaaccgatt attctggacc ctgtgggtgc aggcgccagc    300
cagattcgca ccctgaccgc acgtcagctg agtccgcagg ccgccattat tcgcggcaat    360
gcaagcgaaa ttattgcact gaccgccgat aaaggcatta gcaaaggcgt ggaaacctta    420
catccggtgg aagcagcact gaccgcagca ctgacactga gccagcagac cagcgcaatt    480
gttgttgtta gtggcccggt tgacttcatt accgatggtc gccagtgctg tcatctgagc    540
tatggtagcc cgctgatgcc gctggtgacc ggcatgggct gcgccttaac cgcaattatt    600
gccgcattcg ccgcaatgga aacaccgttc tatgaagcag cagcagcagg taccgcatac    660
gttggtctgt gcggtcagag tgcccatcgc agcgccagtg gcccggctag cttccagagt    720
gccttcattg atgccctgta tcagatgccg ttcgaagaac tggatgatgc actgtgactg    780
aaagctt                                                              787

<210> 120
<211> 273
<212> PRT
<213> Leuconostoc mesenteroides

<400> 120
Met Lys Asn Glu Leu Ile Lys Ile Lys Ser Ile Leu Pro Leu Gln Lys
1               5                   10                  15
Ala Pro Leu Val His Cys Ile Thr Asn Asp Ile Thr Leu Glu Thr Val
            20                  25                  30
Ala Asn Thr Ile Leu Tyr Leu Gly Gly Lys Pro Ile Met Ser Ser Asp
        35                  40                  45
Thr Arg Glu Phe Ser Ser Leu Phe Gln Ser Thr Asp Ala Leu Leu Leu
    50                  55                  60
Asn Met Gly Arg Leu Asn Glu Ser His Glu Gln Ser Leu Ser Gln Ala
65                  70                  75                  80
Ser Ser Leu Ala Asp Met Thr Lys Lys Pro Thr Val Val Asp Leu Val
                85                  90                  95
Gly Tyr Gly Ile Thr Asn Glu Arg Thr Lys Leu Gly Met Ala Met Ala
            100                 105                 110
Arg Asn His Pro Thr Val Ile Lys Gly Asn Thr Ser Glu Ile Arg Arg
        115                 120                 125
Phe Val Gly Leu Pro Ser Leu Ala Lys Gly Ile Asp Gly Ala Ser Ser
    130                 135                 140
Asp Gln His Asp Gln Ala Leu Lys Asp Leu Ile Leu Ser Leu Lys Gln
145                 150                 155                 160
Ile Thr Thr Glu Tyr Ala Asp Thr Val Phe Val Ala Thr Gly Lys Lys
                165                 170                 175
Asp Val Ile Val Gln Asn Asp Lys His Leu Ile Leu Ser Asn Gly Val
            180                 185                 190
Asp Glu Leu Asp Lys Phe Val Gly Thr Gly Asp Met Val Gly Ala Ile
        195                 200                 205
Ile Thr Thr Leu Leu Ala Val Gly Glu Asp Pro Trp Val Ala Ser Gln
    210                 215                 220
Phe Ala Ile Ser Tyr Leu Asn Val Ala Ala Glu Lys Ala Leu Ser Leu
225                 230                 235                 240
Thr Asn Gly Met Glu Asn Phe Arg Arg Glu Val Leu Asn Gln Ile Asp
                245                 250                 255
Leu Leu Gly Arg Asn Gln Gln Trp Ala Thr Lys Ile Lys Tyr Ser Asn
            260                 265                 270
Phe

<210> 121
<211> 822
<212> DNA
<213> Leuconostoc mesenteroides

<400> 121
atgaaaaatg aattaataaa aattaaatca attttacctt tgcaaaaggc accgttagta     60
cattgcataa ccaatgatat tacgttagaa acggtggcta atactatatt gtatcttggt    120
ggtaaaccta tcatgagtag tgatacccgt gaattttcgt cattatttca gtcaacagat    180
gctctactac taaacatggg ccgacttaat gagtcacacg aacaaagtct cagtcaagct    240
agctctttag cagatatgac taaaaaaccg actgttgtgg atctggtagg ctatggtatt    300
actaatgaac gaacaaagtt aggaatggct atggcgcgta atcatccaac agttatcaaa    360
ggaaatactt cagaaatcag aagatttgtc ggcttaccat ctttagcaaa aggtattgat    420
ggtgctagct ctgatcagca tgatcaggcg ttaaaagatc tcattttgtc tttgaaacaa    480
ataacaactg agtatgccga tactgtgttt gtggcaacag ggaaaaagga tgtcattgtt    540
caaaatgata aacaccttat tttgagcaat ggcgtcgatg aattggataa gtttgttggg    600
acaggagaca tggttggcgc aattataact acacttctcg cggtcggtga agatccgtgg    660
gttgctagtc aatttgcaat tagctacctc aatgtagctg ctgaaaaagc attgtcatta    720
acaaacggca tggaaaactt tagacgagaa gttcttaacc aaattgattt gctagggaga    780
aatcaacagt gggcaacgaa aattaagtat tcgaattttt ga                       822

<210> 122
<211> 832
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 122
atgaagaacg aactgattaa gatcaagagt attctgccgc tgcaaaaagc cccgctggtt     60
cattgcatta ccaatgatat taccctggaa accgttgcca ataccattct gtatctgggt    120
ggtaaaccga ttatgagtag tgatacccgt gagttcagta gtctgttcca gagcaccgat    180
gcactgctgc tgaatatggg ccgtctgaat gaaagtcatg aacagagcct gagccaggca    240
agcagcctgg ccgatatgac caaaaaaccg accgttgtgg atctggttgg ttatggtatt    300
accaatgaac gtaccaaact gggtatggca atggcacgca atcatccgac cgtgattaaa    360
ggtaatacca gcgaaattcg tcgcttcgtg ggcctgccga gtctggccaa aggtattgat    420
ggtgccagta gtgatcagca tgatcaggca ctgaaagact taattctgag tctgaaacag    480
attaccaccg aatatgcaga taccgtgttc gttgcaaccg gtaaaaaaga tgtgattgtg    540
cagaatgata agcatctgat tctgagtaat ggcgtggatg aactggataa attcgtgggt    600
accggtgata tggttggcgc aattattacc accctgctgg cagtgggtga agatccgtgg    660
gtggccagcc agttcgccat tagctatctg aatgtggcag cagaaaaagc cctgagcctg    720
accaatggta tggaaaactt ccgtcgtgaa gttctgaatc agattgatct gctgggccgc    780
aatcagcagt gggcaaccaa aattaaatat agcaacttct gactgaaagc tt            832

<210> 123
<211> 252
<212> PRT
<213> Leptospira species

<400> 123
Met Ile Thr Asn Asn Pro Leu Ile Leu Asn Ile Thr Asn Gln Val Thr
1               5                   10                  15
Thr His Phe Ile Ala Ser Ser Leu Ile Ala Leu Gly Ala Ser Pro Val
            20                  25                  30
Met Ser Asp Asp Pro Ser Asp Ala Tyr Asp Leu Val Asp Ile Thr Asn
        35                  40                  45
Gly Ile Cys Leu Asn Ile Gly Thr Ile Ser Ser His Gln Met Asp Ile
    50                  55                  60
Met Arg Asn Val Leu Ser Asn Pro Lys Ala Lys Asn Ile Val Leu Asp
65                  70                  75                  80
Pro Val Gly Ala Gly Ala Ser Ser Ile Arg Ser Ser Ser Cys Lys Glu
                85                  90                  95
Ile Leu Glu Ser Gly Lys Ile Asp Leu Ile Arg Gly Asn Ala Ser Glu
            100                 105                 110
Ile Ser Ser Ile Ala Gly Leu Ser Ser Thr Thr Arg Gly Val Asp Ser
        115                 120                 125
Thr Met Glu Thr Lys Ser Val Glu Ile Thr Ala Asp Lys Leu Ala Lys
    130                 135                 140
Asp Arg Ser Cys Ile Val Val Val Ser Gly Glu Val Asp Tyr Ile Thr
145                 150                 155                 160
Asn Gly Thr Asp Lys Tyr Arg Val Asn Asn Gly Ala Ser Ile Met Ala
                165                 170                 175
Lys Ile Thr Gly Thr Gly Cys Val Leu Ser Ser Tyr Leu Ala Ala Val
            180                 185                 190
Leu Ala Ser Gly Asn Lys Ser Ile Glu Ser Ile Ala Phe Ala Val Ala
        195                 200                 205
Tyr Tyr Gly Val Leu Gly Glu Lys Ala Ala Leu Asp Asn Thr Gly Leu
    210                 215                 220
Gly Asn Tyr Arg Glu Arg Phe Leu Asp Ala Met Ser Thr Ile His Phe
225                 230                 235                 240
Asp Ser Val Lys Pro Ser Leu Arg Ile Ser Lys Leu
                245                 250

<210> 124
<211> 759
<212> DNA
<213> Leptospira species

<400> 124
atgataacca ataatcctct catattaaat ataaccaacc aggtgactac gcattttatt     60
gcgagttctt tgattgcatt aggcgcttca ccagtaatgt cagatgatcc cagcgatgct    120
tacgatcttg ttgacattac aaatggtatt tgtttaaata taggaacaat atcatctcac    180
caaatggata taatgcgaaa tgttctttct aaccctaagg caaagaatat tgttttagac    240
cctgttggtg cgggagcaag cagtattcga tcttcatcat gtaaagaaat tctagaatca    300
ggtaaaatcg atttaattcg aggaaatgct tctgagattt ctagtatcgc tggcttatct    360
tccacaacca gaggagtaga ctcaacaatg gaaactaaat ctgtggaaat aacagccgat    420
aaattagcga aagatcgtag ttgtatagtg gttgtaagcg gggaggttga ttatattaca    480
aatggcaccg ataagtatcg agttaacaat ggagcctcga ttatggcaaa gattacagga    540
acaggctgcg ttctctcctc ttatctagct gcagttttag cttctggaaa taaatcaatt    600
gaaagcattg ctttcgcagt cgcatactat ggagtattag gtgagaaagc tgcattagac    660
aacacaggct taggaaatta tagagaaaga ttcttagacg ctatgagtac aatccatttt    720
gattctgtga aaccttcatt aaggatttca aaactatga                           759

<210> 125
<211> 769
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 125
atgatcacca ataacccgct gattctgaat attaccaatc aggttaccac ccacttcatt     60
gccagtagtc tgattgccct gggcgccagt ccggtgatga gcgatgatcc gagtgatgca    120
tacgatctgg tggatattac caatggtatc tgtctgaata ttggcaccat tagtagccat    180
cagatggata ttatgcgtaa tgtgctgagc aatccgaaag caaaaaatat tgtgctggac    240
cctgtgggtg caggtgccag tagcattcgt agtagtagct gcaaagaaat tctggaaagt    300
ggcaaaattg atctgattcg cggtaatgca agcgaaatta gtagcattgc aggcctgagc    360
agcaccaccc gcggtgtgga tagcaccatg gaaaccaaaa gcgttgaaat taccgcagat    420
aaactggcaa aagatcgtag ctgtattgtg gtggttagtg gcgaagtgga ttatattacc    480
aatggcaccg ataaatatcg cgtgaataat ggcgcaagta ttatggccaa aattaccggt    540
accggctgcg ttctgagtag ttatctggca gccgtgctgg caagcggcaa taaaagcatt    600
gaaagtattg cattcgccgt tgcatattat ggcgttctgg gcgaaaaagc agcactggat    660
aataccggtc tgggtaatta tcgtgaacgc ttcctggatg ccatgagtac cattcacttc    720
gatagtgtga aaccgagtct gcgcattagt aaactgtgac tgaaagctt                769

<210> 126
<211> 277
<212> PRT
<213> Johnsonella ignava

<400> 126
Met Ile Phe Asn Glu Lys Lys Glu Ile Tyr Gly Val Asp Ile Lys Gly
1               5                   10                  15
Phe Tyr Val Lys Ile Ala Lys Lys Asn Pro Gly Ile His Cys Ile Thr
            20                  25                  30
Asn Ile Val Ser Ala Asn Asp Cys Ala Asn Ile Leu Leu Ala Met Gly
        35                  40                  45
Ala Ser Pro Val Met Ala Gln His Val Leu Asp Ala Gly Glu Ile Ser
    50                  55                  60
Gly Gly Cys Asp Ala Leu Val Cys Asn Phe Gly Ala Thr Gly Ala Tyr
65                  70                  75                  80
Asp Ala Met Tyr Glu Ala Ala Lys Ser Ala Ala Leu Leu Lys His Pro
                85                  90                  95
Ile Val Ala Asp Pro Val Gly Val Gly Ala Ser Ala Tyr Arg Arg Ser
            100                 105                 110
Cys Phe Leu Asp Phe Ile Ser Lys Phe Lys Val Ser Cys Ile Arg Gly
        115                 120                 125
Asn Ile Ser Glu Ile Arg Ala Leu His Glu Lys Arg Pro Thr Ala Arg
    130                 135                 140
Gly Val Asp Val Ser Glu Tyr Glu Leu Lys Asn Asn Ser Gly Asp Glu
145                 150                 155                 160
Ser Val Leu Phe Lys Asn Ala Glu Trp Ile Arg Glu Phe Ser Leu Lys
                165                 170                 175
Val Asn Cys Ile Val Val Cys Ser Gly Glu Thr Asp Ile Val Thr Asp
            180                 185                 190
Gly Lys Asn Thr Val Phe Val Thr Asp Gly Cys Ile Leu Met Ser Arg
        195                 200                 205
Val Ser Ala Thr Gly Cys Met Ala Ser Ala Val Thr Ala Ala Phe Leu
    210                 215                 220
Ser Val Glu Asn Ser Phe Ile Ser Ala Ala Ala Ser Ile Ser Phe Met
225                 230                 235                 240
Gly Arg Cys Gly Glu Tyr Ala Leu Lys His Leu Glu Arg Gly Thr Ser
                245                 250                 255
Gly Ile Gly Ser Gly Ser Phe Arg Val Gly Leu Ile Asp Ala Ala Gly
            260                 265                 270
Leu Ile Phe Asn Glu
        275

<210> 127
<211> 834
<212> DNA
<213> Johnsonella ignava

<400> 127
atgatattta atgagaaaaa agaaatatat ggggttgata taaagggatt ttatgtaaaa     60
attgcgaaaa aaaatcccgg tatacattgt ataacaaata tagtcagtgc aaatgactgt    120
gctaatatac tgcttgctat gggggcatca cctgttatgg cacagcatgt acttgatgcg    180
ggagagataa gtggaggctg tgatgctctt gtatgtaatt tcggtgctac aggagcttat    240
gatgcgatgt atgaggctgc aaaaagtgca gccttactta agcatccaat agtggcagat    300
cccgtaggag tgggagcttc ggcttacaga agaagctgct ttcttgattt tataagtaaa    360
tttaaggttt catgtataag gggaaacata tcggaaatac gggcattgca tgaaaaacgt    420
ccgactgcaa ggggagttga tgtatctgaa tatgagctta aaaataattc aggtgatgag    480
tcagtacttt ttaaaaatgc cgaatggata agggaatttt cattaaaggt gaattgtatt    540
gtagtatgtt caggcgaaac tgatatagta actgacggga aaaataccgt ttttgtaact    600
gacggctgta ttcttatgtc aagagtaagt gctacgggct gtatggcatc tgcggttacg    660
gcggcttttt taagtgtgga aaatagtttt atttcagctg cggcatctat atcttttatg    720
ggaagatgcg gagaatatgc tcttaagcat ttagagagag ggacttcagg catagggagc    780
ggcagtttca gagtaggact gattgatgcg gcaggactta tatttaatga gtag          834

<210> 128
<211> 844
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 128
atgatcttca acgaaaagaa ggaaatctat ggtgttgata ttaagggctt ctatgttaaa     60
atcgccaaaa aaaatccggg cattcattgc attaccaata ttgttagtgc caatgattgc    120
gcaaatattc tgctggccat gggtgcaagc ccggtgatgg cccagcatgt tctggatgcc    180
ggtgaaatta gcggcggttg tgatgccctg gtgtgtaact tcggtgcaac cggtgcctat    240
gatgccatgt atgaagcagc aaaaagtgca gccctgctga aacatccgat tgtggccgat    300
ccggtgggtg tgggtgccag tgcatatcgt cgtagttgct tcctggactt cattagcaaa    360
ttcaaagtga gttgtatccg tggtaatatt agtgaaattc gtgcactgca tgaaaaacgt    420
ccgaccgcac gcggtgtgga tgttagcgaa tatgaactga aaaataacag cggcgatgaa    480
agcgtgctgt tcaaaaatgc cgaatggatt cgtgagttca gtctgaaagt gaattgtatt    540
gttgtgtgta gcggcgaaac cgatattgtg accgatggta aaaataccgt gttcgtgacc    600
gatggctgca ttctgatgag tcgcgttagt gccaccggtt gcatggccag cgccgtgacc    660
gccgcattcc tgagcgttga aaatagcttc attagtgccg ccgccagcat tagcttcatg    720
ggccgctgcg gtgaatatgc cctgaaacat ctggaacgtg gtaccagtgg tattggtagc    780
ggcagcttcc gcgtgggtct gattgatgcc gccggtctga tcttcaatga atgactgaaa    840
gctt                                                                 844

<210> 129
<211> 272
<212> PRT
<213> Dichelobacter nodosus

<400> 129
Met Thr Glu Ser Leu Glu Met Gly Lys Ser Ile Asp Phe Asn Val Gln
1               5                   10                  15
Thr Phe Asp Phe Lys Ser Ala Gln Phe Tyr Leu Asp Arg Ala Tyr Ala
            20                  25                  30
Asn Ala Pro Phe Ile His Cys Leu Thr Asn Asn Thr Thr Lys Phe Phe
        35                  40                  45
Val Ala Asn Ala Leu Leu Ala Ile Gly Ala Lys Pro Ala Met Val Glu
    50                  55                  60
Ser Trp Gln Glu Val Val Glu Phe Ser Gln Arg Ala Ala Asn Val Val
65                  70                  75                  80
Met Asn Leu Asp Ser Leu Thr Asp Glu Arg Leu Arg Ser Leu Ser Met
                85                  90                  95
Ser Ala Gln Val Ala His Asp His Gly Lys Trp Trp Val Phe Asp Pro
            100                 105                 110
Ala Ala Val Ser Asp Ile Leu Ser Tyr Arg Ser Gly Phe Ala Arg Glu
        115                 120                 125
Leu Leu Arg Tyr Tyr Pro Arg Val Ile Arg Gly Asn Ala Ser Glu Ile
    130                 135                 140
Ser Tyr Leu Asn Asp Thr Tyr Gly Arg Arg Ser Phe Glu Asn Val Met
145                 150                 155                 160
Ser Ser Ser Glu Ala Ile Glu Ala Ala Val Lys Leu Ala Ile His Gln
                165                 170                 175
Arg Ala Val Val Val Val Thr Gly Glu Ile Asp Tyr Val Thr Asp Gly
            180                 185                 190
Glu Thr Ile Leu Ala Val Arg Gly Gly His Pro Phe Leu Gly Arg Val
        195                 200                 205
Cys Gly Thr Gly Cys Val Leu Ser Ala Met Ile Ala Ser Thr Val Leu
    210                 215                 220
Cys Gly Asp Val Leu Tyr Gly Ala Ala Ser Ala Cys Ala Leu Met Lys
225                 230                 235                 240
Arg Ala Gly Glu Arg Ala Gly Leu Thr Thr Ser Gly Leu Gly Ser Phe
                245                 250                 255
Tyr Val Ala Leu Leu Asp Asn Leu Thr Phe Pro Met Arg Tyr Gln Asp
            260                 265                 270

<210> 130
<211> 819
<212> DNA
<213> Dichelobacter nodosus

<400> 130
atgacggaat cattggaaat gggtaaaagt attgatttta atgtgcaaac ttttgatttt     60
aaatcggcac aattttatct cgatcgggct tatgcaaatg cgccgtttat tcattgttta    120
acgaataata cgactaaatt ttttgtggca aatgcattgt tggcaattgg cgcgaaaccg    180
gcaatggtgg aatcgtggca ggaagtcgtt gaattttctc aacgtgcggc aaatgtggtg    240
atgaatttgg attcgttaac cgatgagcgg ttgcgttctt tatcgatgag tgcgcaagta    300
gcgcatgatc acggcaaatg gtgggttttt gatccggcag cggtcagcga tattttgtct    360
taccgcagcg gttttgcgcg cgaattattg cgttattatc cgcgggtgat tcgtggcaat    420
gcctcggaaa tttcttattt gaatgatacg tacggacgcc gtagttttga aaatgtgatg    480
agttcttcgg aagcaattga agcggcagtg aaattagcca ttcatcaacg tgcggttgtc    540
gtcgttacgg gagaaattga ttatgtgacc gacggcgaga cgattttagc agtgcgcggc    600
ggacaccctt ttttagggcg cgtttgcggc acgggttgcg ttttatcggc gatgattgcg    660
tcaacggtat tatgtggtga cgttttgtat ggcgcggctt ctgcgtgcgc tttgatgaaa    720
cgcgccggtg aacgggcggg tttaacgact tcgggattag gcagttttta tgttgcgtta    780
ttggataatt taacgtttcc aatgcgttat caagattaa                           819

<210> 131
<211> 829
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 131
atgaccgaaa gcctggaaat gggcaaaagt attgacttca atgttcagac cttcgacttc     60
aaaagtgccc agttctatct ggatcgtgcc tatgcaaatg ccccgttcat tcattgtctg    120
accaataata ccaccaaatt cttcgtggca aatgcactgc tggcaattgg tgcaaaaccg    180
gcaatggtgg aaagctggca ggaagtggtt gagttcagcc agcgcgccgc caatgttgtt    240
atgaatctgg atagtctgac cgatgaacgt ctgcgtagtc tgagtatgag tgcacaggtg    300
gcacatgatc atggcaaatg gtgggtgttc gatccggccg cagtgagtga tattctgagc    360
tatcgtagtg gcttcgcccg cgaactgctg cgctattatc cgcgtgttat tcgtggtaat    420
gccagtgaaa ttagttatct gaatgatacc tacggtcgcc gcagcttcga aaatgtgatg    480
agcagcagtg aagccattga agcagccgtg aaactggcca ttcatcagcg tgccgtggtt    540
gttgtgaccg gcgaaattga ttatgtgacc gatggtgaaa ccattctggc cgtgcgtggt    600
ggtcatccgt tcctgggccg cgtgtgtggt accggctgcg tgctgagtgc aatgattgcc    660
agtaccgttc tgtgtggtga tgtgctgtat ggtgccgcaa gtgcctgcgc actgatgaaa    720
cgcgcaggcg aacgcgccgg tctgaccaca agcggtctgg gtagcttcta tgttgccctg    780
ctggataatc tgaccttccc gatgcgctat caggattgac tgaaagctt                829

<210> 132
<211> 284
<212> PRT
<213> Butyrivibrio species

<400> 132
Met Ser Glu Leu Ile His Cys Ile Thr Asn Pro Ile Ser Met Met Gln
1               5                   10                  15
Cys Ala Asn Ala Ile Leu Ala Leu Gly Ala Lys Pro Ile Met Ala Glu
            20                  25                  30
His Pro Leu Glu Val Met Glu Ile Thr Glu Ser Ala Ser Ala Leu Val
        35                  40                  45
Ile Asn Leu Gly Asn Ile Ser Asp Thr Arg Met Glu Ser Met Glu Ile
    50                  55                  60
Ser Phe Glu Thr Ala Leu Lys Lys Asn Ile Pro Val Val Ile Asp Ala
65                  70                  75                  80
Val Gly Val Ala Cys Ser Lys Leu Arg Arg Asp Phe Val Met Arg Leu
                85                  90                  95
Leu Lys Met Arg Ser Lys Lys Thr Glu Leu Ser Leu Arg Glu Lys Gly
            100                 105                 110
Ile Leu Leu Leu Lys Gly Asn Tyr Ser Glu Ile Lys Ala Ile Phe Asp
        115                 120                 125
Glu Ser Tyr Arg Gly Val Gly Val Asp Ala Asp Glu Ser Leu Gly Ala
    130                 135                 140
Ser Glu Ile Ala Asp Ile Val Arg Val Leu Ala Leu Asn Leu Gly Val
145                 150                 155                 160
Ile Val Leu Ala Ser Gly Glu Lys Asp Ile Val Ser Asp Val Ser Arg
                165                 170                 175
Thr Phe Phe Ile Ser Asn Gly Asn Pro Ile Met Gly Val Ile Thr Gly
            180                 185                 190
Thr Gly Cys Met Leu Gly Ala Ile Cys Gly Val Phe Leu Ala Arg Asp
        195                 200                 205
Ala Ser Ile Glu Ala Val Leu Arg Ala Ala Gly Phe Phe Gly Ile Ala
    210                 215                 220
Gly Glu Ile Ala Tyr Glu Arg Ala Glu Ala Val Thr Glu Arg Leu Thr
225                 230                 235                 240
Asn Gly Lys Ala Ser Glu Ser Met Val Gly Ser Gly Ser Phe Leu Val
                245                 250                 255
Glu Leu Leu Asn Ala Ile Ser Met Ile Asp Glu Glu Thr Val Lys Ser
            260                 265                 270
Leu Leu Arg Cys Ala Glu Glu Lys His Ser Ser Asn
        275                 280

<210> 133
<211> 855
<212> DNA
<213> Butyrivibrio species

<400> 133
atgtctgagc taattcactg tataacaaat ccaatatcta tgatgcaatg cgcaaatgca     60
attttagcac ttggcgcgaa gccgataatg gctgagcatc cgttggaagt catggagatc    120
actgagagcg cttctgctct tgtaattaat ctgggtaata tatcggacac cagaatggag    180
tcgatggaga tatcttttga aactgctctg aagaaaaata taccggttgt gatagatgcg    240
gtgggcgtag cctgctccaa gcttcgcagg gatttcgtga tgaggcttct aaagatgagg    300
tcgaagaaaa cggaattgtc tttgagggaa aaggggatcc ttttacttaa gggaaactac    360
tccgagatta aggcgatttt tgatgagtcc tacaggggag ttggcgtcga tgccgacgaa    420
agtctgggag cttcagagat agctgatatt gtaagagtgc tggctttgaa tctgggagtg    480
atcgtgctcg ccagcggtga gaaggatatc gtatcagatg taagtcgtac attttttata    540
agtaatggga atccgataat gggggtgatt acgggaacgg gatgcatgct aggggctata    600
tgcggcgtgt ttctggcaag ggatgcgagc atagaagctg tcctccgcgc agccggattt    660
ttcgggattg ctggagaaat tgcatatgag agagcggaag cagttacgga gaggcttaca    720
aatggcaaag catctgaaag catggttgga agcggaagtt ttcttgttga acttttaaat    780
gcaatttcaa tgatcgatga agaaactgta aaaagtttgc ttagatgcgc tgaagaaaaa    840
cattcatcta actga                                                     855

<210> 134
<211> 865
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 134
atgagcgaac tgattcattg tattaccaat ccgattagta tgatgcagtg tgcaaatgca     60
attctggcac tgggcgcaaa accgattatg gcagaacatc cgctggaagt gatggaaatt    120
accgaaagcg caagtgcact ggtgattaat ctgggcaata ttagtgatac ccgcatggaa    180
agtatggaaa ttagcttcga aaccgccctg aaaaaaaata ttccggtggt tattgatgcc    240
gtgggcgtgg cctgctctaa actgcgtcgc gacttcgtta tgcgcctgct gaaaatgcgt    300
agcaaaaaaa ccgaactgag cctgcgcgaa aaaggcattc tgctgctgaa aggtaattat    360
agcgaaatta aagcaatctt cgacgaaagt tatcgtggtg ttggcgtgga tgcagatgaa    420
agcctgggtg caagcgaaat tgccgatatt gtgcgtgtgc tggcactgaa tctgggcgtg    480
attgtgctgg ccagtggtga aaaagatatt gtgagcgatg tgagtcgtac cttcttcatt    540
agcaatggta atccgattat gggtgttatt accggcaccg gctgtatgct gggcgcaatc    600
tgtggcgtgt tcctggcacg cgatgccagt attgaagccg ttctgcgcgc agcaggcttc    660
ttcggcattg ccggcgaaat tgcctatgaa cgtgcagaag ccgttaccga acgtctgacc    720
aatggcaaag ccagtgaaag catggtgggc agcggcagct tcctggtgga actgctgaat    780
gcaattagca tgattgatga agaaaccgtt aaaagtctgc tgcgttgtgc cgaagaaaaa    840
catagtagca attgactgaa agctt                                          865

<210> 135
<211> 260
<212> PRT
<213> Agrilactobacillus composti

<400> 135
Met Gln Leu Asp Leu Leu Asn Gln Leu Arg Ala Gln Val Pro Leu Val
1               5                   10                  15
Val Asn Tyr Ala Asn Tyr Val Thr Pro Asn Phe Val Ala Asn Gly Leu
            20                  25                  30
Asn Ala Leu Gly Ala Ser Pro Ile Met Thr Ser Glu Val Asp Glu Ala
        35                  40                  45
Asp Asp Leu Val Lys Ile Thr Asn Thr Val Val Ile Asn Leu Gly Thr
    50                  55                  60
Ile Asn His Tyr Glu Thr Asp Leu Val Trp Gln Leu Cys Thr Ser Ala
65                  70                  75                  80
Val Lys Tyr His Lys Pro Ile Val Leu Asp Pro Val Ala Val Gly Ala
                85                  90                  95
Thr Ala Tyr Arg Leu Asp Ile Ala Gln Lys Leu Leu Gln Asp Phe Pro
            100                 105                 110
Ile Ala Val Ile Arg Gly Asn Val Gly Glu Ile Ala Ala Leu Ala Gln
        115                 120                 125
Val Asp Trp Ala Thr Lys Gly Ile Asp Ala Gly Thr Gly Asp Ala Asp
    130                 135                 140
Pro Ala Ala Ile Ala Lys Ala Cys Ala Thr Arg Tyr His Asn Val Val
145                 150                 155                 160
Ala Leu Ser Gly Ile Thr Asp Tyr Ile Thr Asp Gly Gln Gln Leu Phe
                165                 170                 175
Lys Val Gly Asn Gln Thr Pro Leu Leu Pro Leu Thr Val Gly Ser Gly
            180                 185                 190
Asp Leu Leu Ser Ser Ile Ile Gly Ala Phe Val Gly Ile Thr Asp Asn
        195                 200                 205
Tyr Tyr Glu Ala Ala Gln Val Gly Cys Ala Val Leu Ala Cys Thr Gly
    210                 215                 220
Glu Ile Ala Ala Gln Pro Leu His Ser His Glu Gly Gly Thr Phe Ala
225                 230                 235                 240
Ala Arg Leu Leu Asp Lys Leu Thr Ile Val Asp Lys Glu Asp Ile Leu
                245                 250                 255
Glu Ile Leu Lys
            260

<210> 136
<211> 783
<212> DNA
<213> Agrilactobacillus composti

<400> 136
atgcaattag atttattgaa tcagctccgt gcccaagtcc cactggtggt caactacgcc     60
aattatgtca cccccaactt tgtggctaac ggtttgaacg ccttgggggc ctcgcccatt    120
atgacgtccg aagtcgatga agccgatgac ttggttaaaa tcaccaatac agttgtcatc    180
aacttgggga ccatcaacca ttatgaaacg gacttagttt ggcagctttg caccagtgcc    240
gtgaaatatc acaagcccat cgtgttagat cccgttgccg tgggggcaac ggcctatcgt    300
ttagatatcg cccaaaaatt actgcaagat ttcccgattg ccgttatccg gggcaatgtg    360
ggtgaaattg cggctttagc ccaggtggat tgggccacta aaggtattga cgctgggacc    420
ggggatgcgg atccggccgc tattgccaag gcttgtgcca cccgttatca caatgtggtg    480
gccttaagtg gtattaccga ttacatcacg gatggccagc agcttttcaa agtgggcaac    540
caaaccccac tgctcccctt gaccgtgggt tctggggact tattatccag catcatcggg    600
gcttttgtgg gcattactga caattactac gaagctgccc aggtgggctg tgctgtacta    660
gcttgtaccg gtgaaattgc cgcccaaccg ctacattccc acgagggtgg tacctttgcc    720
gcccgattat tggacaaatt gaccatcgtt gacaaagaag atattctcga aattttgaaa    780
tag                                                                  783

<210> 137
<211> 793
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 137
atgcagctgg atctgctgaa tcagctgcgt gcacaggttc cgctggtggt taattatgcc     60
aattatgtga ccccgaactt cgttgcaaat ggcctgaatg ccctgggtgc aagcccgatt    120
atgaccagtg aagtggatga agcagatgat ctggtgaaaa ttaccaatac cgttgttatt    180
aacctgggca ccattaatca ttatgaaacc gatctggtgt ggcagctgtg taccagtgca    240
gttaaatatc ataaaccgat tgtgctggac cctgttgccg tgggcgccac cgcatatcgt    300
ctggatattg cacagaaact gctgcaagac ttcccgattg ccgtgattcg tggcaatgtt    360
ggcgaaattg cagcactggc acaggtggat tgggccacca aaggtattga tgccggcacc    420
ggtgatgcag atccggcagc aattgcaaaa gcatgcgcaa cccgttatca taatgttgtt    480
gcactgagtg gtattaccga ttatattacc gatggtcagc agctgttcaa agttggcaat    540
cagaccccgc tgctgccgct gaccgttggc agcggtgatc tgctgagtag cattattggt    600
gccttcgtgg gcattaccga taattattat gaagcagcac aggttggctg tgcagttctg    660
gcctgcaccg gcgaaattgc ggcccagccg ctgcatagtc atgaaggtgg taccttcgca    720
gcccgcctgc tggataaact gaccattgtt gataaagaag atatcctgga aatcctgaaa    780
tgactgaaag ctt                                                       793

<210> 138
<211> 276
<212> PRT
<213> Proteiniclasticum ruminis

<400> 138
Met Asn Leu Arg Asn His Val Glu Lys Leu Tyr Ala Gln Lys Phe Glu
1               5                   10                  15
Glu Lys Tyr Pro Leu Val Cys Phe Leu Thr Asn Tyr Val Thr Val Leu
            20                  25                  30
Asp Leu Val Asp Met Cys Ile His Ser Gly Gly Ser Pro Val Leu Thr
        35                  40                  45
Asp Glu Ile Ser Glu Ala His Glu Met Val Glu Tyr Ser Lys Ser Gln
    50                  55                  60
Ala Val Val Met Asn Phe Gly Thr Ile Asn Arg Glu Tyr Leu Asp Ile
65                  70                  75                  80
Met Thr Leu Thr Gly Lys Thr Ala Asn Arg Val His Val Pro Val Ile
                85                  90                  95
Leu Asp Pro Ala Ala Ile Thr Ala Ser Ser Phe Arg Lys Tyr Ala Ile
            100                 105                 110
Glu His Leu Leu Lys Glu Val Lys Val Asp Ile Leu Lys Gly Asn Leu
        115                 120                 125
Gly Glu Ile Lys Phe Ile Leu Gly Tyr Glu Thr Lys Asn Lys Gly Ile
    130                 135                 140
Asp Ser Phe Glu Asp Glu Asn Gly Ala Glu Lys Tyr Cys Ile Glu Leu
145                 150                 155                 160
Ala Glu Lys Leu Gly Ala Val Val Val Met Thr Gly Lys Thr Asp Ile
                165                 170                 175
Ile Thr Asp Gly Lys Arg Met Ala Lys Val Ser Asn Gly Asp Ala Lys
            180                 185                 190
Leu Lys Lys Ile Cys Gly Ala Gly Ser Ser Val Ala Ala Ile Met Ala
        195                 200                 205
Thr Tyr Ser Gly Leu Thr Lys Asp Tyr Phe Leu Ser Ala Thr Val Gly
    210                 215                 220
Cys Ala Val Met Gly Val Ala Ser Glu Met Ala Glu Glu Arg Met Lys
225                 230                 235                 240
Glu Arg Glu Gly Ile Arg Thr Phe Lys Thr Tyr Val His Asp Ala Val
                245                 250                 255
Ser Met Met Glu Thr Lys Glu Leu Met Asn Arg Leu Asn Leu Val Glu
            260                 265                 270
Val Glu Leu Asp
        275

<210> 139
<211> 831
<212> DNA
<213> Proteiniclasticum ruminis

<400> 139
atgaatctga gaaatcatgt agagaagctc tacgcacaga aattcgagga gaaatatcct     60
ctggtctgct ttctgaccaa ttatgtgacc gtactggatt tggtggatat gtgcattcat    120
tccggaggtt caccggtgct gacggatgaa atctcggaag ctcatgaaat ggtggagtat    180
tccaagtccc aagcggtggt catgaacttt ggcaccatca acagggagta tctggacatt    240
atgacgttga caggaaaaac tgccaataga gttcatgtgc ctgtaattct ggatcctgca    300
gccatcaccg cttcttcttt cagaaagtat gccattgagc atcttctgaa agaggtgaag    360
gtggatattc tcaaagggaa ccttggtgaa atcaagttta ttttaggata tgaaacaaaa    420
aacaagggca tcgactcttt tgaggatgaa aatggagccg agaagtactg catcgagctt    480
gcagaaaagc tgggcgcggt ggtggtgatg acagggaaaa cagatatcat caccgatgga    540
aagagaatgg ccaaagtatc caatggagat gccaaactga agaaaatctg cggagctgga    600
tcttctgtgg ctgccatcat ggcgacctat tcaggtctca caaaggacta cttcttaagt    660
gctactgttg ggtgcgctgt catgggcgta gcctctgaga tggcagaaga aagaatgaag    720
gaaagagaag gcatccgcac gttcaagacc tatgtacatg atgcggtttc tatgatggaa    780
acgaaggagc tcatgaacag gctgaacctt gtggaagttg agttagatta g             831

<210> 140
<211> 841
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 140
atgaacctgc gtaatcatgt tgaaaaactg tatgcacaga aattcgaaga aaaatatccg     60
ctggtgtgct tcctgaccaa ttatgtgacc gtgctggatc tggtggatat gtgtattcat    120
agcggcggta gtccggtgct gaccgatgaa attagcgaag cccatgaaat ggttgaatat    180
agcaaaagcc aggcagtggt gatgaacttc ggtaccatta atcgtgaata tctggatatt    240
atgaccctga ccggtaaaac cgccaatcgt gttcatgttc cggtgattct ggaccctgcc    300
gccattaccg ccagcagctt ccgtaaatat gcaattgaac atctgctgaa agaagtgaaa    360
gttgatattc tgaaaggcaa tctgggcgaa attaaattca ttctgggtta tgaaaccaag    420
aataaaggca ttgatagctt cgaagatgaa aatggcgccg aaaaatattg cattgaactg    480
gccgaaaaac tgggtgcagt ggtggttatg accggcaaaa ccgatattat taccgatggt    540
aaacgtatgg caaaagttag caatggtgat gcaaaactga aaaaaatctg tggcgcaggt    600
agtagcgtgg ccgcaattat ggcaacctat agcggtctga ccaaagatta cttcctgagc    660
gccaccgtgg gctgcgcagt tatgggcgtg gcaagcgaaa tggccgaaga acgtatgaaa    720
gaacgcgaag gcattcgtac cttcaaaacc tatgtgcatg atgcagttag catgatggaa    780
accaaagaac tgatgaatcg cctgaatctg gtggaagtgg aactggattg actgaaagct    840
t                                                                    841

<210> 141
<211> 277
<212> PRT
<213> Oxobacter pfennigii

<400> 141
Met Gly Asn Lys Val Leu Ile Asp Ile Leu Lys Cys Tyr Asn Glu Arg
1               5                   10                  15
Glu Asn Lys Thr Pro Leu Val His Phe Ile Thr Asn Phe Val Thr Met
            20                  25                  30
Asn Asp Val Ala Asn Ala Cys Leu Tyr Met Gly Gly Lys Pro Val Met
        35                  40                  45
Ala His Trp Glu Gln Glu Ile Asn Glu Ile Thr Ser Ala Ala His Ser
    50                  55                  60
Leu Val Leu Asn Leu Gly Thr Pro Asp Glu Ala Arg Ile Asp Ala Ile
65                  70                  75                  80
Lys Lys Ala Ala Arg Ile Ala Glu Ala Lys Asp Ile Pro Val Ile Leu
                85                  90                  95
Asp Pro Val Gly Ile His Val Phe Ser Val Arg Leu Asp Leu Ala Arg
            100                 105                 110
Tyr Leu Leu Glu Asn Arg Gln Val Asn Val Leu Lys Gly Asn Tyr Ser
        115                 120                 125
Glu Val Met Ala Phe Leu Asn Met Lys Ser Asn Phe Ile Gly Ile Asp
    130                 135                 140
Ser Leu Glu Glu Gly Phe Lys Arg Asp Val Ile Glu Lys Ile Lys Glu
145                 150                 155                 160
Phe Ser Glu Ile Asn Lys Leu Tyr Ile Val Ile Thr Gly Lys Glu Asp
                165                 170                 175
Tyr Val Phe Tyr Lys Asp Asn Ala Val Arg Ile Thr Asn Gly Thr Pro
            180                 185                 190
Leu Leu Ser Lys Ile Thr Gly Ser Gly Cys Ile Leu Ser Ala Ile Leu
        195                 200                 205
Gly Thr Leu Cys Ala Lys Gly Asn Lys Lys Asp Ile Phe Ser Leu Cys
    210                 215                 220
Val Met Gly Thr Leu Ile Asn Ser Ile Ala Gly Glu Lys Ala Gln Asp
225                 230                 235                 240
Lys Ile Lys Lys Ser His Glu Gly Phe His Thr Phe Lys Asn Tyr Tyr
                245                 250                 255
Leu Asp Glu Leu Ser Leu Val Asn Asp Asp Asp Ile Leu Ser Arg Gly
            260                 265                 270
Arg Val Phe Tyr Val
        275

<210> 142
<211> 834
<212> DNA
<213> Oxobacter pfennigii

<400> 142
atggggaata aggttctgat tgatatttta aaatgctata atgaaagaga aaacaaaact     60
cccctggttc attttataac taattttgta accatgaatg atgtcgccaa tgcctgcctg    120
tatatggggg gaaagcccgt aatggcacac tgggagcagg agattaatga aataacttca    180
gctgctcatt ctctggtttt aaatttaggc acacctgatg aagcaaggat tgatgctatt    240
aaaaaagcag ccaggattgc agaggctaag gatattcctg taattttaga ccctgtagga    300
atacacgttt tttctgtaag attggacctt gcaagatatc ttttggagaa caggcaagtg    360
aatgtattaa agggaaacta ttcggaagtt atggcttttt taaatatgaa aagcaatttt    420
ataggcatag attcccttga agaaggcttt aaaagagatg tcattgaaaa aataaaagag    480
ttctcagaga ttaataaatt gtatatagtt attaccggta aagaagatta tgttttttat    540
aaggacaacg ctgtaagaat aactaacggc actccccttc tttccaaaat aacaggttcc    600
ggctgtattt taagcgccat cctcggcacc ctttgtgcaa aaggcaataa aaaagatatt    660
ttttcattat gtgttatggg cactcttatt aactccatag ccggtgaaaa agcgcaggat    720
aaaattaaaa aatcacatga gggatttcat acctttaaaa attattactt ggatgagctc    780
tctcttgtga atgatgatga tattttaagc agagggagag ttttttatgt ctag          834

<210> 143
<211> 844
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 143
atgggtaata aggttctgat tgatatcctg aaatgttata acgaacgtga aaataagacc     60
ccgctggttc acttcattac caacttcgtt accatgaatg atgtggccaa tgcatgtctg    120
tatatgggcg gtaaaccggt tatggcccat tgggaacagg aaattaatga aattaccagc    180
gcagcacata gcctggttct gaatctgggt accccggatg aagcacgtat tgatgcaatt    240
aaaaaagcag cccgtattgc cgaagccaaa gatattccgg tgattctgga ccctgtgggt    300
attcatgtgt tcagcgtgcg cctggatctg gcccgttatc tgctggaaaa tcgtcaggtg    360
aatgttctga aaggtaatta tagcgaagtg atggcattcc tgaatatgaa aagcaacttc    420
attggtattg acagtctgga agaaggcttc aaacgtgatg ttattgaaaa aattaaggag    480
ttcagcgaaa tcaataagct gtatattgtt atcaccggca aagaagatta tgtgttctat    540
aaagataacg cagttcgtat taccaatggc accccgctgc tgagcaaaat taccggcagc    600
ggttgtattc tgagtgcaat tctgggcacc ctgtgtgcaa aaggtaataa aaaagatatc    660
ttcagcctgt gtgttatggg caccctgatt aatagcattg ccggcgaaaa agcacaggat    720
aaaattaaaa aaagccacga aggcttccat accttcaaaa attattatct ggacgaactg    780
agcctggtta atgatgatga tattctgagc cgcggtcgtg tgttctatgt ttgactgaaa    840
gctt                                                                 844

<210> 144
<211> 270
<212> PRT
<213> Planomicrobium species

<400> 144
Met Ile Ser Lys Ile Arg Ser Glu Asn Pro Ile Ile His Cys Ile Thr
1               5                   10                  15
Asn His Val Val Ser Asn Phe Gln Ala Asn Gly Leu Leu Ala Ile Gly
            20                  25                  30
Ala Ser Pro Ile Met Gly Glu Ala Gln Glu Glu Val Glu Glu Leu Val
        35                  40                  45
Ala Ile Ser Arg Ala Leu Ser Leu Asn Ile Gly Thr Leu Asn Lys Glu
    50                  55                  60
Thr Leu His Ser Met Leu Leu Ala Gly Lys Arg Ala Asn Lys Glu Lys
65                  70                  75                  80
Ile Pro Val Ile Leu Asp Pro Val Gly Ala Gly Ala Thr Ala Phe Arg
                85                  90                  95
Lys Asp Ala Ile Gln Lys Ile Leu Thr Asp Ile Asp Val Ser Val Leu
            100                 105                 110
Arg Cys Asn Ala Gly Glu Leu Ala Ala Ile Gly Gly Val Arg Trp Ala
        115                 120                 125
Ser Lys Gly Val Asp Ala Gly Glu Gly Asn Val Asp Leu Glu Glu Leu
    130                 135                 140
Ala Thr Arg Val Ala Ile Glu Tyr Ser Leu Val Val Ala Val Thr Gly
145                 150                 155                 160
Glu Thr Asp Ile Val Ala Asp Gly Ser Arg Val Glu Lys Ile Thr Gly
                165                 170                 175
Gly Asp Arg Met Met Ser Ser Val Thr Gly Met Gly Cys Leu Leu Ser
            180                 185                 190
Ala Val Thr Ala Ala Phe Met Ala Val Ser Pro Asp Asn Pro Thr Ala
        195                 200                 205
Ala Ala Ile Glu Ala Leu Lys Phe Tyr Gly Thr Ala Gly Glu Lys Ala
    210                 215                 220
Ala Ala Val Ser Glu Gly Pro Gly Ser Phe Arg Asp Thr Phe Leu Asp
225                 230                 235                 240
Val Leu Phe Gly Met Glu Ile Glu Glu Thr Gly Phe Asp Phe Glu Lys
                245                 250                 255
Gly Glu Gly Val Asp Val Leu Trp Gln Arg Ser Ser Arg Tyr
            260                 265                 270

<210> 145
<211> 813
<212> DNA
<213> Planomicrobium species

<400> 145
atgatcagta aaatccgcag tgaaaatcca atcatccatt gcatcaccaa tcacgtcgtg     60
tcgaattttc aagcgaacgg gctgctggca atcggtgctt cgcccatcat gggcgaagca    120
caagaagaag tggaagaact ggtcgccatt tcgcgcgcct tgtcattgaa tatcggaacg    180
ttgaataaag aaacgcttca tagcatgctg ctcgccggta aacgggcgaa taaggaaaaa    240
attccggtga tcctggatcc agtgggagca ggcgcgaccg cttttcggaa agatgccata    300
cagaagattt taacggatat cgatgtcagt gtgctgcgct gcaacgcggg tgaactggcg    360
gcaatcggcg gagtgagatg ggcctctaaa ggcgtcgatg ccggtgaagg caatgtcgac    420
ttggaggaac tggcaactcg agtagcaata gaatacagct tggtggtggc agtgacaggc    480
gagaccgata tcgtggccga cggttcgcgg gtggaaaaga ttacgggcgg cgaccggatg    540
atgagttcag tgaccggcat gggctgtctg ctcagcgctg tgacagccgc tttcatggca    600
gtaagtccgg acaatccaac agctgcggca atcgaagctc tcaaatttta cgggacggcc    660
ggtgaaaagg cagcggcagt ttcagaagga cccggcagtt tccgcgacac attcctggat    720
gttttattcg gcatggagat cgaggaaacc ggcttcgact ttgaaaaagg ggaaggggtg    780
gatgtactgt ggcagcgatc aagcaggtat tga                                 813

<210> 146
<211> 823
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 146
atgatcagca aaatccgcag cgaaaatccg attattcatt gtattaccaa ccatgtggtt     60
agcaacttcc aggccaatgg tctgctggcc attggtgcca gcccgattat gggcgaagcc    120
caggaagaag tggaagaact ggtggccatt agccgcgccc tgagcctgaa tattggcacc    180
ctgaataaag aaaccttaca tagtatgctg ctggccggta aacgcgcaaa taaagaaaaa    240
attccggtga ttctggaccc tgtgggtgca ggcgcaaccg cattccgcaa agatgccatt    300
cagaaaattc tgaccgatat tgatgttagc gtgctgcgct gtaatgccgg tgaactggcc    360
gcaattggtg gcgttcgttg ggcaagtaaa ggtgttgatg caggcgaagg taatgttgat    420
ctggaagaac tggccacccg cgttgcaatt gaatatagtc tggttgtggc cgttaccggc    480
gaaaccgata ttgttgcaga tggcagccgt gtggaaaaaa ttaccggcgg tgatcgcatg    540
atgagtagtg ttaccggcat gggctgcctg ctgagtgcag ttaccgccgc attcatggcc    600
gtgagtccgg ataatccgac cgccgcagca attgaagccc tgaaattcta tggtaccgcc    660
ggtgaaaaag ccgccgcagt gagcgaaggc ccgggcagct tccgtgatac cttcctggat    720
gtgctgttcg gcatggaaat tgaagaaacc ggcttcgact tcgaaaaagg cgaaggtgtg    780
gatgtgctgt ggcagcgcag cagccgttat tgactgaaag ctt                      823

<210> 147
<211> 259
<212> PRT
<213> Lachnospiraceae bacterium

<400> 147
Met Gln Glu Glu Leu Ile Glu Gln Ile Arg Val Asn Arg Pro Leu Val
1               5                   10                  15
His Cys Ile Thr Asn Gln Val Thr Val Asn Tyr Val Val Asn Met Leu
            20                  25                  30
Leu Gly Leu Gly Ala Arg Ala Glu Gly Thr Asp Ala Pro Glu Glu Ala
        35                  40                  45
Ala Glu Ile Ala Gly Arg Ser Gln Ala Leu Met Leu Asn Val Gly Ala
    50                  55                  60
Pro Thr Glu Ser Leu Ala Asn Ser Met Ile Glu Ala Gly Arg Arg Ala
65                  70                  75                  80
Asn Glu Met Gly Val Pro Val Val Leu Asp Pro Asp Gly Val Gly Lys
                85                  90                  95
Ser Ser Phe Arg Leu Glu Ile Val Asn Glu Ile Leu Asn Ser Val Tyr
            100                 105                 110
Val Thr Cys Ile Arg Gly Thr Ala Thr Asp Leu Ala Ala Leu Asn Gly
        115                 120                 125
Trp Glu Leu Glu Glu Asn Ala Thr Leu Ser Leu Asp Asp Leu Gln Ile
    130                 135                 140
Ile Ala Asp Lys Tyr Asn Val Cys Val Val Met Thr Gly Gln Glu Asp
145                 150                 155                 160
Leu Val Val Tyr His Ala Ser Gln Ala Arg Ile Ser Asn Asn Ile Pro
                165                 170                 175
Phe Met Lys Arg Val Ala Gly Ser Gly Ala Ala Leu Thr Ala Val Ile
            180                 185                 190
Ala Ala Phe Leu Ala Val Gly Gly Val Glu Asn Ile Phe Asp Ser Val
        195                 200                 205
Val Thr Ala Val Ala Ala Phe Asp Val Ala Gly Gln Lys Ser Glu Ala
    210                 215                 220
Lys Asn Ala Tyr Val Gly Thr Ala Ser Phe Ala Glu Gly Val Ile Asp
225                 230                 235                 240
Ser Leu Ser Ile Leu Gln Ala Ser Glu Leu Arg Thr Glu Ala Lys Ile
                245                 250                 255
Glu Glu Arg

<210> 148
<211> 780
<212> DNA
<213> Lachnospiraceae bacterium

<400> 148
atgcaggaag aacttattga gcaaattcgc gtaaatagac cacttgtaca ttgtattaca     60
aatcaggtta cagttaacta tgttgtgaac atgcttctcg ggcttggcgc tcgtgctgaa    120
ggtacagacg caccggaaga agctgctgag atcgcaggca gaagccaggc tcttatgctt    180
aatgtaggtg cacccacaga gtcgcttgca aattctatga tagaagcagg acgacgcgca    240
aatgaaatgg gtgttccggt tgtactcgat cctgatggag taggaaagtc aagcttcaga    300
cttgagattg taaacgagat acttaacagt gtatatgtta catgcatacg tggtacagcg    360
actgatcttg ctgcattaaa cggctgggaa cttgaagaaa atgcaacact ttccttggat    420
gatcttcaga ttattgccga taaatataat gtatgtgtgg tcatgacagg tcaggaagat    480
ctggttgttt accatgctag tcaggcacgt ataagcaata atataccttt tatgaagaga    540
gttgccggaa gcggcgcagc actcacagcg gtcatcgctg cattcctggc tgttggcgga    600
gttgagaata tatttgattc tgtcgttaca gctgttgcag catttgatgt tgctggacag    660
aagtcagaag caaagaacgc ttacgtcggt acagcatcat ttgctgaggg cgttattgat    720
agtctgagca ttcttcaggc ctctgaactt agaacagaag caaagataga ggaacgttaa    780

<210> 149
<211> 790
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 149
atgcaggaag aactgattga acagattcgt gtgaatcgtc cgctggttca ttgtattacc     60
aatcaggtta ccgtgaatta tgtggtgaat atgctgctgg gcctgggtgc ccgcgcagaa    120
ggtaccgatg caccggaaga agccgccgaa attgccggtc gtagccaggc actgatgctg    180
aatgttggtg ccccgaccga aagcctggca aatagcatga ttgaagcagg ccgccgcgcc    240
aatgaaatgg gtgttccggt ggtgctggac cctgatggcg tgggcaaaag cagcttccgc    300
ctggaaattg tgaatgaaat tctgaatagt gtgtacgtga cctgtattcg tggcaccgcc    360
accgatctgg cagcactgaa tggttgggaa ctggaagaaa atgcaaccct gagcctggat    420
gatctgcaaa ttattgccga taaatataac gtgtgcgtgg tgatgaccgg ccaggaagac    480
ttagtggtgt atcatgccag ccaggcccgt attagcaata atattccgtt catgaaacgc    540
gttgcaggta gtggtgcagc cctgaccgcc gttattgcag ccttcctggc agtgggcggt    600
gtggaaaata tcttcgatag cgtggtgacc gcagttgcag cattcgatgt ggccggtcag    660
aaaagtgaag caaaaaatgc ctatgtgggc accgcatcat tcgccgaagg tgtgattgat    720
agcctgagca ttctgcaagc cagcgaactg cgtaccgaag caaaaattga agaacgttga    780
ctgaaagctt                                                           790

<210> 150
<211> 283
<212> PRT
<213> Globicatella sulfidifaciens

<400> 150
Met His Glu Gln Met Thr Phe Tyr Leu Asn Gln Ile Val Glu Gln Asn
1               5                   10                  15
Pro Leu Ile Asn Cys Leu Thr Asn Lys Val Thr Thr Asn Phe Gln Ala
            20                  25                  30
Asn Ala Leu Leu Ala Ile Gly Ala Ser Pro Ile Met Thr Asp Glu Pro
        35                  40                  45
Asp Ala Ser Pro Leu Val Ser Ala Gln Ser Gln Ala Ile Val Ile Asn
    50                  55                  60
Ile Gly Ser Pro Phe Asn Gln Asp Lys Met Glu Ala Ile Glu Leu Ser
65                  70                  75                  80
Ile Lys Ala Ala Ile Asp Lys Gln Ile Pro Val Ile Ile Asp Pro Val
                85                  90                  95
Gly Val Ala Ala Leu Ser Asn Arg Leu Ala Tyr Ile Glu His Leu Leu
            100                 105                 110
Ser Glu Tyr Glu Ile Ala Ala Val Cys Gly Asn Tyr Ser Glu Ile Ala
        115                 120                 125
Ala Leu Ala Gly Ala Lys Ser Asn Gly Lys Gly Val Asp Gly Gly His
    130                 135                 140
Pro Glu Gly Glu Met Thr Asp His Leu Leu Lys Val Ala Asn Leu Tyr
145                 150                 155                 160
Gln Thr Val Val Val Ala Thr Gly Lys Thr Asp Tyr Ile Ala Asn Gln
                165                 170                 175
Thr Ala Val Tyr Ala His Gln Tyr Gly Asp Ala Leu Leu Gly Tyr Val
            180                 185                 190
Thr Gly Thr Gly Cys Val Ala Thr Thr Ile Val Ala Ala Phe Ile Ser
        195                 200                 205
Gln Ala Pro Thr Pro Ala Asp Tyr Leu Thr Ala Ala Thr Leu Ala Thr
    210                 215                 220
Gly Phe Tyr Ala Trp Cys Gly Gly Arg Ala Val Gln Leu Thr Thr Gly
225                 230                 235                 240
Pro Gly Asp Leu Pro Ile His Leu Leu Asn Gln Leu Tyr Glu His Ser
                245                 250                 255
Val Lys Ala Asn Asn Lys Lys Thr Ser Asp Lys Glu Leu Thr Asn Leu
            260                 265                 270
Thr Ile Thr Gln Glu Arg Met Ser Ser Asn Asp
        275                 280

<210> 151
<211> 852
<212> DNA
<213> Globicatella sulfidifaciens

<400> 151
atgcatgagc aaatgacgtt ttatctcaat caaatcgtcg aacaaaaccc tttaatcaat     60
tgtttaacta ataaagtgac caccaatttc caggccaatg ccctattagc cattggtgct    120
tcaccgatta tgaccgatga accggatgct agtcctttag tatcggcaca atctcaagcg    180
attgtcatta atatcggttc cccatttaat caagataaaa tggaggcgat tgagctttcc    240
ataaaagcgg ccattgacaa acaaattccg gtcatcatcg atccggtcgg agtagcggca    300
ctttccaatc gtcttgctta tattgaacat ttactaagcg aatacgagat cgcggctgtt    360
tgcggtaatt attccgaaat tgctgcctta gctggagcga aaagtaatgg taaaggtgtg    420
gatggtggtc atcctgaagg tgaaatgacc gatcatttac taaaagtagc caacctttat    480
caaacagtgg tggtggccac tggcaaaact gattatatcg ctaatcaaac tgctgtctat    540
gcccatcaat atggcgacgc tttactcggt tatgtgaccg gcaccggctg tgtcgcgacg    600
accatcgtcg ctgcttttat cagtcaagcg cctacgccag ctgactattt aacggctgcc    660
actttagcga ccggatttta tgcttggtgt ggaggccggg cagttcaatt gactaccgga    720
cctggtgatt taccaattca tttattaaac caattatatg aacactcagt caaagcaaat    780
aataaaaaaa catctgataa agaattaaca aacttaacaa ttacacaaga aaggatgtca    840
agcaatgact aa                                                        852

<210> 152
<211> 862
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 152
atgcatgaac agatgacctt ctatctgaat cagattgtgg aacagaatcc gctgattaat     60
tgcctgacca ataaagttac caccaacttc caggccaatg ccctgctggc aattggtgcc    120
agcccgatta tgaccgatga accggatgcc agtccgctgg tgagtgccca gagtcaggcc    180
attgttatta atattggcag tccgttcaat caggataaaa tggaagcaat tgaactgagt    240
attaaggcag ccattgataa acagattccg gtgattattg atccggtggg tgtggcagca    300
ctgagtaatc gtctggcata tattgaacat ctgctgagcg aatatgaaat tgcagcagtg    360
tgcggcaatt atagcgaaat tgccgccctg gcaggtgcca aaagtaatgg taaaggtgtt    420
gatggtggcc atccggaagg cgaaatgacc gatcatctgc tgaaagtggc aaatctgtat    480
cagaccgttg tggttgcaac cggcaaaacc gattatattg caaatcagac cgcagtgtat    540
gcccatcagt atggcgatgc cctgctgggc tatgttaccg gtaccggttg cgtggccacc    600
accattgttg cagcattcat tagtcaggcc ccgaccccgg ccgattatct gaccgccgca    660
accctggcca ccggcttcta tgcctggtgc ggcggccgtg cagttcagct gaccaccggt    720
ccgggcgatc tgccgattca tctgctgaat cagctgtatg aacatagcgt taaagcaaat    780
aataagaaga ccagtgataa ggaactgacc aatctgacca ttacccagga acgtatgagc    840
agtaatgatt gactgaaagc tt                                             862

<210> 153
<211> 279
<212> PRT
<213> Aerococcus urinae

<400> 153
Met Ser Lys Lys Thr Ser Ile Pro Phe Ala Lys Tyr Ser Gln Gln Val
1               5                   10                  15
Lys Ala Lys Ala Pro Leu Ile Gln Val Leu Asn Asn Tyr Val Thr Ile
            20                  25                  30
His Asp Val Ala Asn Val Ile Leu Ala Ser Gly Gly Arg Pro Val Met
        35                  40                  45
Thr Asp Lys Leu Pro Asn Ser Gln Asp Val Val Lys Ser Ala Asp Leu
    50                  55                  60
Leu Leu Leu Asn Ala Ala Ser Pro Arg Pro Asn Gln Glu Leu Leu Asp
65                  70                  75                  80
Leu Ala Thr Thr Ala Lys Asn Asp His His Pro Val Val Leu Asp Pro
                85                  90                  95
Val Gly Val Ser Ala Met Pro Ser Lys Leu Lys Leu Cys Gln Asp Leu
            100                 105                 110
Ile Asp Arg Gly Leu Val Thr Ala Val Lys Gly Asn Ala Ser Glu Ile
        115                 120                 125
Arg Ser Leu Leu Phe Glu Lys Ser Gln Gly Ser Gly Val Asp Leu Gly
    130                 135                 140
Pro Gly Asp Glu Val Thr Leu Ala Asn Leu Ala Asp Phe Ala Pro Asp
145                 150                 155                 160
Phe Lys Ala Tyr Ala Gln Glu Lys Glu Ile Ile Leu Ala Met Ser Gly
                165                 170                 175
Pro Ile Asp Leu Val Thr Asp Gly Glu Arg Leu Ala Ile Ile Lys Asn
            180                 185                 190
Gly His Pro Trp Met Ala Ser Tyr Thr Gly Ser Gly Cys Gln Leu Ser
        195                 200                 205
Gly Val Leu Ala Ser Phe Leu Ala Gly Asn Pro Asp Glu Asp Pro Phe
    210                 215                 220
Tyr Leu Ala Thr Ala Ala Met Ile Ser Tyr Gly Val Ala Gly Glu Ile
225                 230                 235                 240
Ala Ala Gln Val Leu Gln Pro Tyr Glu Gly Asn Ala Thr Tyr Ser Asn
                245                 250                 255
Arg Val Ile Asp Gln Val Ser Leu Leu Glu Ala Lys Glu Leu Glu Arg
            260                 265                 270
Arg Ala Lys Tyr Asp Ile Gln
        275

<210> 154
<211> 840
<212> DNA
<213> Aerococcus urinae

<400> 154
atgtctaaaa aaacaagcat tccctttgcg aaatatagtc agcaggtcaa ggccaaggcc     60
ccgctgatcc aagtcttaaa taattatgtc accatccatg atgtggccaa tgtgattttg    120
gccagtggcg ggcgtcccgt gatgaccgat aaattaccta atagccagga tgtggtcaag    180
tcggctgacc ttttgctcct aaatgctgct agccccagac ctaatcagga attgttggac    240
ctcgccacca cagcaaaaaa tgaccaccat cccgtggtct tagacccagt gggggtttca    300
gctatgccgt ctaaattaaa gctctgtcag gatctgattg accggggtct agtgacggcg    360
gtgaagggga atgcttcgga aattcgtagc ctcctctttg aaaaaagcca aggatctggg    420
gtcgacctgg gtcctggaga cgaggtgacc ttagctaatt tggctgattt tgctccggac    480
tttaaagctt atgcccaaga aaaagagatt atcctagcca tgtcaggtcc gattgactta    540
gtgactgacg gcgaacggct ggcaatcatt aaaaatggtc acccctggat ggcctcctat    600
acgggatcag gttgccagtt aagcggcgtc ctagctagct ttttagccgg taatccggat    660
gaggatcctt tttacctagc tacagcagcc atgattagct atggggtggc gggagaaatc    720
gctgctcaag tgctccagcc ctatgagggt aatgccactt attccaaccg ggtgattgac    780
caggtctcct tattagaggc caaagaatta gaaaggagag ccaagtatga cattcaataa    840

<210> 155
<211> 850
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 155
atgagtaaga agaccagcat tccgttcgcc aaatatagtc agcaggtgaa agccaaagcc     60
ccgctgattc aggtgctgaa taattatgtt accattcatg atgtggcaaa tgtgattctg    120
gcaagcggtg gtcgtccggt tatgaccgat aaactgccga atagccagga tgttgtgaaa    180
agtgcagact tactgctgct gaatgcagca agcccgcgtc cgaatcagga actgctggat    240
ctggccacca ccgccaaaaa tgatcatcat ccggtggtgc tggaccctgt tggcgtgagc    300
gccatgccga gcaaactgaa actgtgtcag gatctgattg atcgtggcct ggttaccgcc    360
gttaaaggta atgcaagtga aattcgcagc ctgctgttcg aaaaaagtca gggtagcggc    420
gttgatctgg gcccgggtga tgaagttacc ctggccaatc tggccgactt cgcaccggac    480
ttcaaagcct atgcccagga aaaagaaatt attctggcaa tgagcggccc gattgatctg    540
gttaccgatg gtgaacgtct ggccattatt aaaaatggtc atccgtggat ggcaagttat    600
accggcagtg gttgtcagct gagtggtgtg ctggccagct tcctggccgg caatccggat    660
gaagatccgt tctatctggc caccgcagca atgattagct atggcgttgc aggcgaaatt    720
gccgcccagg tgctgcaacc gtatgaaggt aatgccacct atagcaatcg tgtgattgat    780
caggtgagtc tgctggaagc caaagaactg gaacgccgtg caaaatatga tattcagtga    840
ctgaaagctt                                                           850

<210> 156
<211> 273
<212> PRT
<213> Blautia obeum

<400> 156
Met Ser Leu Asn Asn Ile Glu Glu Ile Ser Phe Cys Ile His Gln Asp
1               5                   10                  15
Ala Pro Lys Ile His Cys Leu Thr Asn Pro Val Thr Met Gln Asp Val
            20                  25                  30
Ala Asn Leu Leu Leu Ala Ala Gly Gly Ser Ala Val Met Gly Gln Asp
        35                  40                  45
Glu Gln Glu Val Glu Glu Ile Thr Ser Phe Cys His Gly Ala Leu Leu
    50                  55                  60
Asn Thr Gly Val Pro Asp Ile Ala Lys Ile Gln Ala Cys Ile Leu Ala
65                  70                  75                  80
Gly Gln Lys Ala Asn Ala Leu Asp His Pro Val Val Leu Asp Pro Val
                85                  90                  95
Gly Ala Gly Ala Ser Thr Phe Arg Arg Lys Glu Leu Gln Lys Leu Leu
            100                 105                 110
Gln Ala Val His Pro Thr Ala Val Arg Cys Asn Gln Glu Glu Ala Val
        115                 120                 125
Val Leu Cys Ser Leu Leu Ser Asp Thr Asp Ser Pro Glu Lys His Gly
    130                 135                 140
Gly Val Glu Ser Ser Leu Gln Met Ala Glu Arg Asp Val Cys Leu Ile
145                 150                 155                 160
Ala Gly Gln Ala Ala Ser Leu Leu Asn Cys Thr Val Leu Ile Thr Gly
                165                 170                 175
Arg Glu Asp Val Val Ser Asp Gly Lys Gln Thr Gln Ile Leu Thr Gly
            180                 185                 190
Gly Asp Ser Arg Ile Arg Arg Ile Thr Gly Gly Gly Cys Met Leu Ser
        195                 200                 205
Ala Leu Cys Thr Leu Phe Leu Cys Thr Asp Thr Ser Ala Phe Asp Ala
    210                 215                 220
Val Arg Ala Ala Gly Ala Leu Trp Arg Glu Thr Ala Leu Glu Ala Gly
225                 230                 235                 240
Arg Arg Thr Asp Ala Glu Lys Ser Gly Ile Gly Ser Phe His Val His
                245                 250                 255
Leu Phe Asp Val Leu Glu Glu Lys Leu Met Tyr Thr Ser Lys His Lys
            260                 265                 270
Phe

<210> 157
<211> 822
<212> DNA
<213> Blautia obeum

<400> 157
atgtcactta ataatataga agaaatttct ttctgcattc accaggatgc tcccaagatt     60
cattgtctca ctaatccggt aaccatgcag gatgttgcca atctactgct tgcagccggc    120
ggaagtgccg taatgggaca ggatgaacag gaagtcgaag aaatcacttc tttctgtcat    180
ggagcccttc tgaataccgg agtaccggac attgccaaaa tacaggcctg catccttgcc    240
ggacagaagg caaatgccct tgaccacccg gtcgtcctgg atccggtcgg cgccggtgcc    300
agcacatttc gaagaaaaga gctgcaaaaa cttcttcaag ccgttcatcc gactgctgtt    360
cgctgtaatc aggaagaagc cgttgttcta tgttctcttc tttcagatac cgacagtccc    420
gagaaacacg gcggtgtaga aagttccctg cagatggcgg aacgtgatgt ctgtctgatt    480
gcagggcagg cggcatctct tctcaactgc acagttctga tcacgggcag agaggatgtg    540
gtatctgatg gaaagcagac acagattctg accggcggtg actccaggat tcgacgaatc    600
accggtggcg gatgtatgct ttctgcactc tgcacattat ttctgtgtac ggacacttct    660
gcatttgatg cggtccgtgc cgccggagca ctctggcgtg aaactgccct tgaagcaggc    720
agacgaacag acgcagaaaa atccggtatt ggaagttttc atgtacatct ttttgatgta    780
ctggaagaga aattaatgta cacctcgaaa cataaattct ga                       822

<210> 158
<211> 832
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 158
atgagtctga ataacatcga agaaatcagc ttctgtattc atcaggatgc accgaaaatt     60
cattgcctga ccaatccggt gaccatgcag gatgtggcaa atctgctgct ggcagccggt    120
ggtagtgcag ttatgggtca ggatgaacag gaagtggaag aaattaccag cttctgccat    180
ggcgccctgc tgaataccgg tgtgccggat attgccaaaa ttcaggcatg cattctggca    240
ggccagaaag caaatgccct ggatcatccg gtggttctgg accctgtggg cgccggtgcc    300
agcacattcc gccgtaaaga actgcaaaaa ctgctgcaag ccgtgcatcc gaccgcagtt    360
cgttgcaatc aggaagaagc agttgttctg tgtagcctgc tgagtgatac cgatagcccg    420
gaaaaacatg gtggcgttga aagcagtctg caaatggcag aacgcgatgt gtgtctgatt    480
gccggtcagg ccgccagtct gctgaattgt accgtgctga ttaccggtcg cgaagatgtg    540
gttagtgatg gcaaacagac ccagattctg accggcggtg atagtcgtat tcgccgcatt    600
accggcggcg gttgcatgct gagcgcactg tgcaccctgt tcctgtgcac cgataccagt    660
gcattcgatg ccgtgcgtgc cgccggtgca ctgtggagag aaaccgccct ggaagccggt    720
cgccgtaccg atgccgaaaa aagcggcatt ggcagcttcc atgtgcatct gttcgatgtt    780
ctggaagaaa aactgatgta taccagcaaa cataaattct gactgaaagc tt            832

<210> 159
<211> 281
<212> PRT
<213> Rhodococcus erythropolis

<400> 159
Met Ser Asp Thr Val Ser Val Glu Thr Val Ala Ser Ala Ile Asp Ala
1               5                   10                  15
Leu Arg Asp Gln Val Pro Leu Val Gln Ser Leu Thr Asn Ile Val Ser
            20                  25                  30
Ala Asn Phe Leu Thr Asn Val Leu Leu Ala Ala Gly Ala Ser Asn Ala
        35                  40                  45
His Ile Asp Asn Val His Glu Ala Gly Gly Phe Ala Ala Val Ala Gly
    50                  55                  60
Gly Val Leu Val Asn Leu Gly Thr Pro Asp Asp Gly Thr Ala Glu Ala
65                  70                  75                  80
Phe Leu Ile Ser Ala Glu Ala Ala Arg Thr Ala Gly Thr Pro Trp Val
                85                  90                  95
Leu Asp Pro Val Gly Val Gly Gly Leu Pro Trp Arg Ser Gly Ile Ala
            100                 105                 110
Val Asp Leu Leu Arg Phe His Pro Ser Ala Ile Arg Gly Asn Ala Ser
        115                 120                 125
Glu Ile Ile Ala Leu Ala Gly Leu Gly Gly Asp Thr Arg Gly Val Asp
    130                 135                 140
Ser Ala Ser Asp Ser Ala Asp Ala Val Pro Ala Ala Leu Ser Leu Leu
145                 150                 155                 160
Thr His Ala Asp Ala Val Ser Ala Ser Gly Pro Val Asp Tyr Ile Val
                165                 170                 175
Gly Arg Asp Gly Gly Gly Asp Val Arg Gly Ile Arg Val Ser Gly Gly
            180                 185                 190
Ser Ala Leu Leu Pro Arg Val Thr Ser Thr Gly Cys Ser Leu Gly Gly
        195                 200                 205
Leu Val Ala Ala Tyr Leu Ala Val Thr Pro Thr Ala Leu Asp Gly Leu
    210                 215                 220
Val Ala Ala His Thr His Val Ala Val Ala Ser Glu Ile Ala Glu Glu
225                 230                 235                 240
Asn Ala Ser Gly Pro Gly Ser Phe Ala Val Ala Tyr Leu Asp Ala Leu
                245                 250                 255
Tyr Thr Val Asn Ala Asp Thr Ile Arg Ser Arg Ala Arg Ile Glu Ser
            260                 265                 270
Phe Asp Leu Pro Ala Gly Val Gln Asn
        275                 280

<210> 160
<211> 846
<212> DNA
<213> Rhodococcus erythropolis

<400> 160
atgtctgaca ctgtctctgt cgaaaccgtc gcgtcggcga tcgacgcact ccgcgatcag     60
gtaccgctcg tccagtcgct gacgaacatc gtttcggcga acttcttgac caatgttctg    120
ctcgctgccg gcgcaagcaa cgctcacatc gacaacgttc atgaagcggg tgggttcgct    180
gccgtcgccg ggggagtgct cgtcaacctc ggaacgccgg acgacgggac agccgaagcc    240
ttcctgattt ccgcagaagc ggcccgaact gccggtacgc catgggtttt ggatccagtc    300
ggtgtcggcg gactaccctg gcggagcggt atcgctgtgg atctgctgcg ctttcacccc    360
agcgccatcc gcgggaacgc atcggagatc atcgctctgg ccggactcgg gggtgacacc    420
cgcggcgtcg acagtgcctc cgactccgcc gacgcagtgc ccgccgctct gtcactcctg    480
acgcatgcgg atgccgtgtc ggcatcaggc ccggtcgact acatcgtcgg ccgcgatggt    540
ggcggcgacg ttcggggcat ccgagtctcc ggtggcagcg ctctgttgcc gcgcgtgacc    600
agtaccggtt gctctctggg aggcctggtc gctgcctacc tcgccgttac gccgaccgcg    660
ctggacggat tggttgccgc gcatacgcac gttgccgtgg catcggagat cgccgaggaa    720
aatgcctcag gtcctggttc tttcgcggtc gcgtacctcg acgcgctcta caccgtgaat    780
gcggacacca tccgctcacg tgcccgaatc gaatcgttcg acctgccggc tggagtgcag    840
aactga                                                               846

<210> 161
<211> 856
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 161
atgagcgata ccgtgagtgt tgaaaccgtg gccagtgcaa ttgatgcact gcgcgatcag     60
gttccgctgg ttcagagtct gaccaatatt gttagtgcaa acttcctgac caatgttctg    120
ctggccgccg gcgcaagtaa tgcccatatt gataatgttc atgaagcagg cggcttcgca    180
gcagttgcag gtggcgttct ggtgaatctg ggcaccccgg atgatggtac cgcagaagcc    240
ttcctgatta gcgccgaagc cgcacgtacc gcaggcaccc cttgggtgct ggaccctgtt    300
ggtgttggcg gtctgccgtg gcgtagtggt attgccgtgg atctgctgcg cttccatccg    360
agtgccattc gtggtaatgc aagcgaaatt attgccctgg caggtctggg cggcgatacc    420
cgtggtgtgg atagtgccag tgatagcgca gatgccgttc cggccgccct gagtctgctg    480
acccatgcag atgccgtgag cgccagcggt ccggtggatt atattgtggg ccgtgatggt    540
ggtggtgatg tgcgcggcat tcgcgtgagc ggtggcagtg cactgctgcc gcgcgttacc    600
agtaccggct gtagcctggg tggcctggtt gccgcatatc tggccgtgac cccgaccgcc    660
ctggatggtc tggtggccgc acatacccat gtggccgttg caagtgaaat tgccgaagaa    720
aatgcaagtg gcccgggcag cttcgccgtg gcttatctgg atgccctgta taccgttaat    780
gcagatacca ttcgcagccg cgcccgcatt gaatcattcg atctgccggc aggcgtgcag    840
aattgactga aagctt                                                    856

<210> 162
<211> 274
<212> PRT
<213> Synergistales bacterium

<400> 162
Met Ile Arg Pro Ala Asp Pro Gly Lys Val Trp Glu Ser Ile Arg Lys
1               5                   10                  15
Lys Arg Pro Leu Val Tyr Gln Leu Thr Asn Thr Val Ala Ala Ser Phe
            20                  25                  30
Gln Ala Glu Val Thr Ala Ala Ala Gly Ala Ala Val Val Met Ser Ser
        35                  40                  45
His Pro Gly Glu Ala Arg Val Ile Ala Ser Gly Ala Asp Ala Leu Leu
    50                  55                  60
Leu Asn Thr Gly Thr Pro Gly Gly Thr Ser Gln Glu Ala Phe Ile Glu
65                  70                  75                  80
Ala Leu Gly Gly Leu Arg Lys Gly Lys Pro Cys Leu Leu Asp Ala Val
                85                  90                  95
Gly Tyr Gly Leu Thr Pro Phe Arg Thr Gly Trp Ile Asn Ser Leu Leu
            100                 105                 110
Glu Gly Gly Arg Val Thr Ala Val Lys Gly Asn Ala Ala Glu Met Ala
        115                 120                 125
Arg Leu Gly Gly Gly Ser Gly Ser Met Lys Gly Val Glu Ser Ser Arg
    130                 135                 140
Ala His Gly Val Glu Lys Ala Leu Lys Glu Ile Thr Lys Ser Glu Thr
145                 150                 155                 160
Ala Pro Val Val Ala Val Ala Thr Gly Lys Val Asp Lys Ile Ala Cys
                165                 170                 175
Gly Gly Ser Leu Trp Lys Val Arg Gly Gly Ala Gly Leu Leu Pro Gln
            180                 185                 190
Val Pro Ala Ser Gly Cys Ala Leu Gly Ser Val Met Ala Ala Cys Met
        195                 200                 205
Ala Val Thr Asp Pro Leu Ser Ala Ala Thr Ala Ala Leu Leu Ala Phe
    210                 215                 220
Arg Met Ala Ala Glu Arg Ala Pro Gly Ala Ala Gly Pro Ala Ser Trp
225                 230                 235                 240
Arg Asn Ala Phe Val Asp Ala Leu Ala Ala Leu Glu Pro Glu Lys Leu
                245                 250                 255
Ser Ser Gly Met Lys Glu Arg Val Glu Gly Pro Phe Pro Leu Glu Val
            260                 265                 270
Leu Pro

<210> 163
<211> 825
<212> DNA
<213> Synergistales bacterium

<400> 163
atgatccgcc cggccgatcc gggaaaagtc tgggagagta tcagaaaaaa acgtccgttg     60
gtctatcagc ttaccaatac agtggccgca tcctttcagg ccgaggtgac cgccgccgcg    120
ggcgccgctg tggtcatgtc gtcccacccc ggggaagccc gggtgatcgc gtccggggcg    180
gatgctctcc ttttgaacac gggaacgccc ggggggacct ctcaggaggc tttcattgaa    240
gctttaggcg gattgcggaa ggggaagccc tgtctgctgg atgccgtggg gtacggtctc    300
acccccttcc ggacgggctg gatcaattca ctcctggaag gtggccgagt cacagccgtt    360
aagggaaacg cggctgaaat ggcacgcctc ggcggcggat cgggttccat gaaaggcgtg    420
gagagttccc gagcccacgg ggtggaaaag gccctgaaag agataacgaa aagcgaaacg    480
gcacccgttg tggccgttgc gacaggaaag gtcgacaaaa tagcctgtgg cgggtcgctt    540
tggaaagtcc ggggaggggc ggggttgctt ccgcaagttc cggcaagcgg ctgtgccctt    600
gggagcgtca tggcggcctg catggccgtg acagaccccc tttcggccgc caccgcggcc    660
cttctggcct tccggatggc ggctgagcgg gcacccggcg ctgcagggcc ggcttcctgg    720
agaaacgcct ttgtcgacgc ccttgccgcc ctggagcctg aaaagctttc cagcggcatg    780
aaagaacgtg tggagggccc gttccctctg gaggttttgc catga                    825

<210> 164
<211> 835
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 164
atgatccgcc cggcagatcc gggcaaagtg tgggaaagta ttcgcaaaaa acgtccgctg     60
gtgtatcagc tgaccaatac cgttgcagcc agcttccagg cagaagttac cgcagccgca    120
ggtgcagccg tggtgatgag cagtcatccg ggtgaagccc gcgtgattgc cagcggtgca    180
gatgccctgc tgctgaatac cggtaccccg ggcggcacca gtcaggaagc attcattgaa    240
gcactgggtg gtctgcgcaa aggcaaaccg tgtctgctgg atgcagtggg ctatggtctg    300
accccgttcc gtaccggctg gattaatagt ctgctggaag gtggccgcgt taccgcagtg    360
aaaggcaatg ccgcagaaat ggcacgtctg ggcggtggta gtggcagcat gaaaggtgtg    420
gaaagcagtc gcgcacatgg tgttgaaaaa gcactgaaag aaattaccaa aagcgaaacc    480
gccccggttg tggcagtggc caccggtaaa gttgataaaa ttgcatgtgg cggtagtctg    540
tggaaagttc gcggtggcgc aggtctgctg ccgcaggttc cggctagtgg ctgtgcactg    600
ggtagtgtta tggccgcatg tatggccgtg accgatccgc tgagcgccgc cacagcagcc    660
ctgttagcct tccgtatggc agccgaacgt gccccgggtg cagcaggtcc tgccagttgg    720
cgtaatgcct tcgtggatgc cctggccgcc ctggaaccgg aaaaactgag tagcggcatg    780
aaagaacgtg ttgaaggccc gttcccgctg gaagttctgc cgtgactgaa agctt         835

<210> 165
<211> 299
<212> PRT
<213> Pelagibaculum spongiae

<400> 165
Met Leu Val Lys Gln Glu Leu Lys Val Glu Met Glu Ile Arg Arg Gly
1               5                   10                  15
Ala Met Ser Thr Val Ser Thr Ser Pro Gln Gln Ser Phe Glu Ser Pro
            20                  25                  30
Val Glu Gln Gln Leu Ser Glu Leu Leu Ile Ala Val Arg Thr Lys Lys
        35                  40                  45
Pro Leu Val His Asn Ile Thr Asn Tyr Leu Ala Met Asn Ile Ser Ala
    50                  55                  60
Asn Ala Leu Leu Ala Leu Gly Ala Ser Pro Ile Met Ala His Ser Arg
65                  70                  75                  80
Glu Glu Ala Ser Glu Leu Cys Arg Ile Ser Gln Ala Leu Val Ile Asn
                85                  90                  95
Ile Arg Thr Leu Ser Ser Gly Trp Ala Glu Ala Met Val Asp Thr Ala
            100                 105                 110
Met Thr Ala Arg Ala His Asn Ile Pro Trp Val Leu Asp Pro Asp Gly
        115                 120                 125
Ala Asp Ile Ser Ser Tyr Arg Met Asp Thr Cys Gln Glu Leu Ala Gly
    130                 135                 140
Leu Ser Pro Lys Val Ile Arg Gly Asn Leu Lys Glu Ile Ala Ala Leu
145                 150                 155                 160
Cys Ala Asp Cys Glu Pro Glu Leu Thr Pro Ala Gln Met Ala Lys Ala
                165                 170                 175
Asp Leu Asp Gln Leu Leu Pro Ala Ile Leu Ser Cys Ala Ser Arg Arg
            180                 185                 190
Ser Ser Val Leu Cys Ile Ser Gly Leu Thr Asp Asn Ile His Leu Val
        195                 200                 205
Thr Asp Gly Glu Arg Val Leu Lys Val Ala Asn Gly Asp Ala Leu Ser
    210                 215                 220
Ser Gln Val Ala Ala Met Gly Cys Thr Ala Ser Ala Leu Val Gly Ala
225                 230                 235                 240
Phe Leu Thr Val Thr Asp Asp Ala Trp Leu Ala Thr Ala Ala Ala Ile
                245                 250                 255
Ala Leu Leu Gly Val Ala Cys Glu Leu Ala Ala Ser Gln Ala Lys Gly
            260                 265                 270
Pro Gly Ser Phe Gln Ala Glu Leu Met Asp Gln Leu Tyr Leu Ile Gln
        275                 280                 285
Ser Asp Gln Leu Ala Ala Arg Leu Arg Leu Leu
    290                 295

<210> 166
<211> 900
<212> DNA
<213> Pelagibaculum spongiae

<400> 166
ttgcttgtaa agcaagaatt aaaagtagaa atggagatcc ggaggggcgc tatgtcgaca     60
gtttctactt caccgcagca atcatttgaa tcgccggttg agcagcagtt atcggaattg    120
ttaattgctg tgagaacaaa aaagccattg gttcataata ttaccaacta tttggcgatg    180
aatatttctg ctaatgcact gttggcattg ggtgctagcc cgattatggc gcatagccgt    240
gaagaagctt cggagctttg ccgaatttcc caagcgttag tcatcaacat tcgaaccctg    300
tcttcaggtt gggccgaagc gatggttgat actgccatga ccgctcgagc gcataatatt    360
ccctgggtat tagatcctga tggtgcagat atcagtagct atcggatgga cacttgccaa    420
gagttagcag gactttctcc taaggtaatt cgtggcaacc tgaaagaaat tgctgcactt    480
tgtgctgact gcgagcctga attaacgcca gcgcaaatgg ctaaggccga tcttgatcag    540
ctattacccg caatattatc ttgtgccagt cgtcgttcca gcgtgctttg cattagtggc    600
cttactgata atattcattt ggtcactgat ggtgaacgag tgttgaaagt tgccaatggt    660
gatgcgcttt caagccaagt tgcggcgatg ggatgtaccg ccagtgcatt ggtaggtgct    720
ttcctaacgg tcaccgatga cgcatggtta gcgactgcag cggcaattgc cttgctcgga    780
gttgcctgtg agttggctgc cagtcaggca aaagggccgg gtagttttca agctgagtta    840
atggatcaat tgtatctaat tcagtcagac caattagccg cgcgtttgcg gctgttgtaa    900

<210> 167
<211> 910
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 167
atgctggtga aacaggaact gaaagtggaa atggaaattc gtcgcggtgc aatgagcacc     60
gttagtacca gcccgcagca gagcttcgaa agcccggttg aacagcagct gagcgaactg    120
ctgattgccg ttcgtaccaa aaaaccgctg gttcataata ttaccaatta tctggcaatg    180
aacatcagcg caaatgccct gctggccctg ggcgcaagtc cgattatggc acatagtcgt    240
gaagaagcca gtgaactgtg tcgcattagt caggcactgg tgattaatat tcgcaccctg    300
agcagtggtt gggcagaagc catggtggat accgccatga ccgcccgcgc ccataatatt    360
ccgtgggtgc tggaccctga tggcgcagat attagcagct atcgcatgga tacctgccag    420
gaactggccg gcctgagccc gaaagttatt cgtggcaatc tgaaagaaat tgcagcactg    480
tgtgccgatt gtgaaccgga actgaccccg gcccagatgg ccaaagccga tctggatcag    540
ctgctgccgg caattctgag ctgcgccagc cgtcgtagca gtgtgctgtg tattagtggt    600
ctgaccgata atattcatct ggtgaccgat ggtgaacgtg tgctgaaagt ggccaatggt    660
gatgcactga gcagtcaggt ggccgcaatg ggttgtaccg ccagcgcact ggtgggtgcc    720
ttcctgaccg ttaccgatga tgcctggctg gcaaccgcag cagcaattgc actgctgggt    780
gtggcctgcg aactggccgc aagccaggca aaaggcccgg gcagcttcca ggccgaactg    840
atggatcagc tgtatctgat tcagagtgat cagctggccg cccgcctgcg tctgctgtga    900
ctgaaagctt                                                           910

<210> 168
<211> 261
<212> PRT
<213> Firmicutes bacterium

<400> 168
Met Ile Lys Asp Ile Ile Gln Asn Val Tyr Ser Lys Arg Pro Leu Val
1               5                   10                  15
His Asn Ile Thr Asn Tyr Val Ala Ala Thr Asp Cys Ala Asn Ile Thr
            20                  25                  30
Leu Thr Ile Gly Ala Ser Pro Ile Met Ala Asp Glu Pro Lys Glu Val
        35                  40                  45
Gly Glu Val Thr Gln Ile Ala Asp Gly Leu Val Leu Asn Cys Gly Thr
    50                  55                  60
Ile Ser Glu Ser Arg Leu Asn Ala Met Leu Ile Ser Gly Lys Thr Ala
65                  70                  75                  80
Lys Ser Arg Glu Ile Pro Ile Val Leu Asp Pro Val Gly Val Gly Ile
                85                  90                  95
Ser Lys Phe Arg Thr Ile Ala Val His Lys Ile Ile Thr Glu Val Lys
            100                 105                 110
Pro Asp Ile Ile Arg Leu Asn Ala Ser Glu Leu Lys Ser Ile Cys Leu
        115                 120                 125
Asn Ile Lys Asn Met Ser Gly Val Asp Ala Val Asn Ile Asp Ser Phe
    130                 135                 140
Asp Asp Thr Val Glu Leu Ala Lys Asn Leu Ser Leu Lys Thr Asn Ala
145                 150                 155                 160
Ile Ile Gly Val Ser Gly Ile Ser Asp Ile Val Thr Asp Gly Lys Asn
                165                 170                 175
Thr Ala Val Ile Ser Gly Gly His Ala Met Met Lys Lys Ile Thr Gly
            180                 185                 190
Ser Gly Cys Met Leu Ser Ser Val Ile Gly Ala Phe Ala Ala Ala Asn
        195                 200                 205
Pro Asn Asn Leu Phe Tyr Ala Leu Ser Val Ala Phe Gly Leu Tyr Ala
    210                 215                 220
Ser Cys Gly Arg Asn Ala Tyr Lys Glu Asn Ile Gly Ile Ala Thr Tyr
225                 230                 235                 240
Lys Asn Asn Phe Phe Asp Glu Met Thr Asn Pro Asp Leu Glu Gly Ile
                245                 250                 255
Glu Ile Glu Tyr Arg
            260

<210> 169
<211> 786
<212> DNA
<213> Firmicutes bacterium

<400> 169
atgataaaag acattataca aaatgtttat tcaaaacgtc cgcttgttca taacataacc     60
aactacgttg ccgcaacgga ttgtgcaaat ataacattga caattggtgc gtcaccaatt    120
atggctgatg agcctaagga agtcggtgaa gtaacacaaa ttgccgacgg acttgtttta    180
aactgcggaa caatttccga aagtcgatta aacgctatgc ttatatccgg taaaacagca    240
aagtcaagag aaatacctat cgttcttgac ccagtcggag taggcatttc caaattcaga    300
acaattgccg ttcataaaat aattacggaa gtaaaacccg atattataag acttaatgca    360
tcggaactta aaagcatttg tcttaacatt aaaaatatgt cgggtgtgga cgccgttaat    420
attgacagtt ttgacgatac tgttgagctt gcaaaaaatc tttcgcttaa gaccaacgcg    480
attataggtg taagcggtat atcggatatt gttacagacg gcaaaaatac cgccgttatt    540
tcaggcggac acgcaatgat gaaaaaaatt acaggctccg gctgtatgct ttcatctgtt    600
atcggtgcat ttgccgccgc aaatccgaat aatctttttt atgcgctatc tgttgcattc    660
ggtttatatg caagttgcgg aagaaacgca tacaaagaaa atataggtat cgcaacatat    720
aaaaacaact tttttgatga aatgacaaat cctgatttgg aggggattga aattgaatat    780
agataa                                                               786

<210> 170
<211> 796
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 170
atgatcaagg atatcatcca gaatgtgtat agtaaacgcc cgctggttca taatattacc     60
aattatgtgg ccgccaccga ttgtgccaat attaccctga ccattggtgc aagtccgatt    120
atggcagatg aaccgaaaga agtgggcgaa gttacccaga ttgccgatgg cctggttctg    180
aattgcggta ccattagtga aagccgtctg aatgccatgc tgattagcgg caaaaccgca    240
aaaagtcgtg aaattccgat tgtgctggac cctgttggcg ttggcattag taaattccgt    300
accattgcag tgcataaaat tattaccgaa gttaaaccgg atatcattcg tctgaatgcg    360
agtgaactga aaagtatctg tctgaatatt aagaacatga gtggtgttga tgcagttaat    420
attgatagct tcgatgatac cgttgaactg gccaaaaatc tgagtctgaa aaccaatgca    480
attattggcg ttagcggcat tagtgatatt gttaccgatg gcaaaaatac cgcagtgatt    540
agtggtggcc atgccatgat gaaaaaaatt accggcagcg gttgcatgct gagcagcgtt    600
attggtgcat tcgccgcagc caatccgaat aatctgttct atgccctgag cgtggcattc    660
ggtctgtatg ccagctgtgg tcgtaatgcc tataaagaaa atattggtat cgccacctat    720
aaaaataact tcttcgatga aatgaccaac ccggatctgg aaggcattga aattgaatat    780
cgctgactga aagctt                                                    796

<210> 171
<211> 263
<212> PRT
<213> Euryarchaeota archaeon

<400> 171
Met Pro Arg His Gly Ala Ala Val Asn Leu Val Lys Trp Gly Gly Ser
1               5                   10                  15
Leu Ile Thr Asp Lys Ala Ala Ala Lys Pro Thr Pro Gln Arg Glu Arg
            20                  25                  30
Ile Thr Ala Leu Ala Ala Ala Leu Ala Ala Asn Asp Ala Pro Ala Val
        35                  40                  45
Leu Val His Gly Ala Gly Ser Phe Gly His Pro Leu Ala Lys Arg Phe
    50                  55                  60
Gly Leu Ala Gln Gly Ser Asp Gly Ser Pro Glu Gln Ala Ala Ala Val
65                  70                  75                  80
Ala Arg Thr Arg Gln Gln Val Arg Thr Leu Asn Ala Leu Val Cys Glu
                85                  90                  95
Ala Leu Ala Thr Ala Gly Leu Glu Pro Val Pro Ile Leu Pro Ser Gln
            100                 105                 110
Ala Leu Arg Thr Ala Gly Pro Gln Asn Ile Val Asp Phe Pro Ala Ser
        115                 120                 125
Ser Phe Glu Ala Ala Leu Glu Ala Gly Arg Ile Pro Val Thr Cys Gly
    130                 135                 140
Asp Val Thr Asp Asp Asp Ser Gln Gly Ile Ala Ile Leu Ser Gly Asp
145                 150                 155                 160
Thr Leu Met Leu Ala Leu Ala Arg Ala Leu Arg Pro Gln Arg Ala Leu
                165                 170                 175
Phe Val Ile Asn His Ser Gly Val Met Asp Arg Asp Pro Ala Glu Pro
            180                 185                 190
Gly Ala Lys Leu Ile Ala His Leu Asn Gly Asp Ala Arg Thr Glu Met
        195                 200                 205
Arg Ala Gln Arg Met Asp Val Pro Gly Ala Asp Val Thr Gly Gly Met
    210                 215                 220
Trp Gly Lys Leu Glu Ala Ala Ala Ala Ile Ala Arg Glu Cys Glu Cys
225                 230                 235                 240
Arg Ile Ile Gly Ala Gly Gly Phe Ala Ala Ala Leu Thr Gly Asp Pro
                245                 250                 255
Ala Gly Thr Leu Val Leu Pro
            260

<210> 172
<211> 792
<212> DNA
<213> Euryarchaeota archaeon

<220>
<221> misc_feature
<222> (80)..(80)
<223> n means a or g or c or t

<220>
<221> misc_feature
<222> (171)..(171)
<223> n means a or g or c or t

<220>
<221> misc_feature
<222> (252)..(252)
<223> n means a or g or c or t

<220>
<221> misc_feature
<222> (518)..(518)
<223> n means a or g or c or t

<400> 172
ctgccccgcc atggagctgc cgtgaacctc gtcaagtggg gcggctcgct catcaccgac     60
aaggcggccg cgaagccgan gccacaacgc gagcgcatca cggcgctggc tgcggcgctc    120
gccgccaacg acgcgccggc ggtgctggtg cacggcgccg gttctttcgg ncacccgctc    180
gcaaagcggt tcgggctggc gcagggaagc gacggctcac ccgagcaggc ggcggcggtc    240
gcgcggacgc gncagcaggt gcgcacgctc aacgcgctgg tttgcgaagc gctcgccacg    300
gcgggactgg aacctgttcc aatcctgccg tcgcaggcgc tgcgcaccgc tggcccgcaa    360
aacatcgttg atttccccgc gagcagcttc gaggcggcgc tggaggcggg ccgtatccca    420
gtcacctgcg gcgacgtcac cgacgacgat tcacagggca tcgccatttt gagcggcgat    480
acgctgatgc tcgcgctggc gcgcgcgctg cgaccgcngc gcgcgttgtt cgtcatcaac    540
cactcaggag ttatggatcg cgacccggcc gagccgggcg cgaaactgat cgcgcatctc    600
aacggggacg cccgcacgga gatgcgcgcg cagcggatgg atgtgccggg cgccgacgtc    660
acaggcggga tgtggggcaa gctcgaggcg gcggccgcca tcgcccgcga atgcgagtgc    720
cgcatcatcg gcgcgggcgg cttcgcagca gcactcacag gcgacccagc tgggacgctg    780
gtgctgccgt ga                                                        792

<210> 173
<211> 802
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 173
atgccgcgtc atggtgccgc agttaatctg gttaaatggg gtggtagcct gattaccgat     60
aaagccgccg ccaaaccgac cccgcagcgc gagagaatta ccgccctggc agcagccctg    120
gcagccaatg atgcaccggc cgtgctggtt catggcgcag gcagcttcgg tcatccgctg    180
gccaaacgct tcggtctggc ccagggcagt gatggcagtc cggaacaggc agcagccgtt    240
gcccgtaccc gtcagcaggt tcgcaccctg aatgcactgg tgtgtgaagc cctggcaacc    300
gccggtctgg aaccggttcc gattctgccg agtcaggcac tgcgcaccgc aggccctcag    360
aatattgttg acttcccggc cagcagcttc gaagccgcac tggaagccgg ccgcattccg    420
gtgacctgtg gcgatgtgac cgatgatgat agtcagggca ttgccattct gagcggtgat    480
accctgatgc tggcactggc acgcgcactg cgtccgcagc gtgctctgtt cgttattaat    540
catagcggtg tgatggatcg cgatccggca gaaccgggtg ccaaactgat tgcacatctg    600
aatggcgatg cccgtaccga aatgcgtgcc cagcgtatgg atgttccggg tgccgatgtt    660
accggtggta tgtggggcaa actggaagcc gccgccgcaa ttgcccgcga atgcgaatgt    720
cgcattattg gcgccggtgg cttcgcagcc gcactgacag gtgatccggc cggtaccctg    780
gtgctgccgt gactgaaagc tt                                             802

<210> 174
<211> 251
<212> PRT
<213> Methanocorpusculum parvum

<400> 174
Met Asn Glu Pro Val Ile Ile Lys Leu Gly Gly Ser Ile Val Thr Lys
1               5                   10                  15
Lys Ser Glu Asp Gly Val Val Asp Ser Ala Lys Ile Lys Leu Leu Ala
            20                  25                  30
Glu Gln Ile Ala Pro Phe Ala Gly Lys Phe Pro Leu Ile Ile Val His
        35                  40                  45
Gly Ala Gly Ser Cys Gly His Pro Glu Ala Lys Ala Tyr Asp Ile Pro
    50                  55                  60
Gly Gly Val Thr Lys Ala Asn Ala Ala Gly Ile Phe Val Thr His Thr
65                  70                  75                  80
Ala Val Ser Arg Leu Asn Arg Ser Val Val Ala Ser Leu Arg Glu Ala
                85                  90                  95
Gly Met Glu Ala Val Ser Leu His Pro Phe Gly Cys Cys Leu Ala Glu
            100                 105                 110
Asn Gly Arg Leu Val Ser Ala Gly Val Ser Gln Ile Lys Glu Met Leu
        115                 120                 125
Ser Leu Ser Leu Ile Pro Val Leu His Gly Asp Val Val Met Asp Thr
    130                 135                 140
Lys Arg Gly Ala Cys Ile Ile Ser Gly Asp Gln Ile Val Pro Tyr Leu
145                 150                 155                 160
Ala Val Lys Leu Gly Ala Lys Arg Val Gly Ile Ala Thr Asp Val Gly
                165                 170                 175
Gly Val Leu Glu Asn Gly Glu Val Ile Pro Glu Ile Asn Arg Lys Asn
            180                 185                 190
Val Gly Glu Ile Asp Leu Gly Gly Ser Ser Ser Thr Asp Ile Thr Gly
        195                 200                 205
Gly Met Arg Gly Lys Ile Asp Glu Leu Leu Leu Leu Ala Asp Glu Gly
    210                 215                 220
Ile Asp Ser His Ile Phe Ala Ala Asn Arg Val Ala Asp Phe Leu Leu
225                 230                 235                 240
Gly Lys Asn Tyr Gly Gly Thr Leu Val Arg Lys
                245                 250

<210> 175
<211> 756
<212> DNA
<213> Methanocorpusculum parvum

<400> 175
atgaatgaac cagtcattat caaactcggc ggaagcatcg ttacgaaaaa atccgaagac     60
ggggtcgtcg attcggcaaa gatcaaactc cttgccgagc agatcgcacc gtttgccgga    120
aaattccccc tcatcatcgt gcacggggca ggttcctgcg ggcatcccga ggcaaaagcc    180
tatgatatcc cgggcggcgt gacgaaagca aacgccgcag gtatttttgt aacgcacacg    240
gcagtgtccc ggctcaaccg ctcggtcgta gcctcgcttc gtgaagccgg catggaagcc    300
gtctcgcttc atccgttcgg ctgctgtctt gcggagaacg gccgccttgt ctcggcaggt    360
gtttcgcaga tcaaagagat gctttcccta agtctcatcc cggtcctgca cggggatgtg    420
gtcatggaca caaaacgcgg cgcctgtatt atctccggcg atcagatcgt cccctacctt    480
gccgtcaaac tcggcgcaaa acgcgttggg atcgcgaccg atgtcggggg cgttcttgaa    540
aacggtgaag tcatcccgga gatcaaccgg aaaaacgtcg gtgagatcga tctcggcggc    600
tcttcgagca ccgacataac cggcggcatg cgtggaaaga tcgatgaact tctccttctc    660
gccgatgaag ggatcgactc gcacattttc gccgcgaacc gtgtggcgga tttcctcttg    720
ggaaaaaatt acggcgggac actggtaaga aaatga                              756

<210> 176
<211> 766
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 176
atgaacgaac cggttattat taagctgggt ggcagcattg ttaccaaaaa aagtgaagat     60
ggtgttgttg atagtgcaaa aattaaactg ctggccgaac agattgcacc gttcgccggt    120
aaattcccgc tgattattgt gcatggcgcc ggtagttgtg gccatccgga agccaaagca    180
tacgatattc cgggtggtgt taccaaagca aatgcagccg gtatcttcgt gacccatacc    240
gcagtgagtc gcctgaatcg cagcgttgtt gccagtctgc gtgaagcagg tatggaagca    300
gttagtctgc atccgttcgg ttgttgtctg gcagaaaatg gccgtctggt tagtgcaggt    360
gttagccaga ttaaagaaat gctgagtctg agcctgattc cggtgctgca tggtgatgtg    420
gtgatggata ccaaacgtgg tgcctgtatt attagtggtg atcagattgt gccgtatctg    480
gccgttaaac tgggcgcaaa acgcgttggc attgccaccg atgtgggtgg cgtgctggaa    540
aatggtgaag tgattccgga aattaatcgc aaaaatgtgg gcgaaattga tctgggcggc    600
agcagcagca ccgatattac cggcggcatg cgcggcaaaa ttgatgaact gctgctgctg    660
gcagatgaag gcattgatag ccatatcttc gcagcaaatc gcgtggcaga cttcctgctg    720
ggcaaaaatt atggtggtac cctggtgcgt aaatgactga aagctt                   766

<210> 177
<211> 267
<212> PRT
<213> Ardenticatena maritima

<400> 177
Met Asn Cys Ile Phe Val Lys Leu Gly Gly Ser Leu Leu Thr Asp Lys
1               5                   10                  15
Thr Ala Arg Tyr Ala Ala Arg His Asp Val Leu Ala Arg Leu Ala Arg
            20                  25                  30
Glu Ile Ala Ala Ala Arg His Ala Asn Pro Asp Leu Ala Leu Val Leu
        35                  40                  45
Ala His Gly Ser Gly Ser Tyr Gly His Val Ala Ala Arg Glu Thr Gly
    50                  55                  60
Tyr Asp Arg Glu Arg Gly His Arg Asp Val Leu Ala Tyr Ala Arg Val
65                  70                  75                  80
Ala Ala Ala Ala Ala Thr Leu Asn Ser Leu Val Arg Ala Ala Leu Leu
                85                  90                  95
Ala Cys Asp Ile Pro Ala Val Ser Leu Pro Pro Ser Ala Ser Ala Leu
            100                 105                 110
Val Glu Gly Gly Arg Leu Val Arg Met Ala Trp Asp Pro Phe Ala Arg
        115                 120                 125
Ile Leu Ala Trp Gly Gly Val Pro Leu Thr Tyr Gly Asp Val Ala Leu
    130                 135                 140
Thr Glu Thr Gly Gly Thr Ile Val Ser Thr Glu Thr Val Leu Leu Ala
145                 150                 155                 160
Leu Ala Glu Gln Leu Pro Pro Thr Arg Leu Leu Leu Leu Thr Asp Val
                165                 170                 175
Pro Gly Val Phe Ala His Pro Pro Thr Gly Asp Thr Thr Pro Pro Leu
            180                 185                 190
Leu Glu Arg Ile Thr Pro Ala Thr Trp Pro Glu Gln Arg Ala Gly Val
        195                 200                 205
Gln Gly Ala Arg Gly Thr Asp Val Thr Gly Gly Met Val Arg Lys Val
    210                 215                 220
Glu Gln Met Leu Ala Leu Val Glu Arg Leu Pro Gln Val Glu Val Ile
225                 230                 235                 240
Ile Ala Ser Gly Arg Thr Pro Gly Leu Leu Gln Arg Ala Leu Leu Gly
                245                 250                 255
Glu Asp Val Pro Gly Thr Arg Ile Val Arg Ala
            260                 265

<210> 178
<211> 804
<212> DNA
<213> Ardenticatena maritima

<400> 178
atgaattgca tcttcgtcaa acttgggggc tcgctcctga ccgacaaaac cgcccgctac     60
gccgcccgcc atgacgtctt ggcgcggctg gcgcgtgaaa tcgccgcggc gcgccacgcc    120
aaccccgacc tggcgctggt gctcgcccat ggcagcggct cttacgggca tgtcgccgcg    180
cgcgaaacag gctacgaccg cgagcggggg catcgcgatg tgctggcgta tgcacgggtt    240
gccgccgccg ccgcgacgct caacagcctg gtgcgggcgg cgttgctggc gtgcgatatt    300
cccgccgtct cgttgccgcc ttcggcgtcg gcgctggtgg aaggggggcg gctggtgcgc    360
atggcgtggg accccttcgc ccgcattctg gcgtgggggg gcgtcccgct cacctatggc    420
gatgtggcgc tgacggagac gggcggcacc atcgtttcca ccgaaaccgt attgctggcg    480
ctcgccgagc agttgccgcc gacgcgcctg ctcctgctga ctgacgtgcc cggcgttttt    540
gcgcatcccc ccacgggcga cacgacgccc cctctgctgg aacgcattac gcccgccacc    600
tggcccgaac agcgcgccgg tgtgcagggc gcccgcggca ccgacgtgac ggggggcatg    660
gtgcgcaagg tggagcagat gttggcgctg gtggaacgct tgccgcaggt ggaagtcatc    720
atcgcgtccg ggcggacgcc cggcctgttg cagcgggcgt tgctggggga agatgtgccc    780
ggcacgcgca tcgtgcgggc gtga                                           804

<210> 179
<211> 814
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 179
atgaactgta tcttcgtgaa actgggcggc agcctgctga ccgataaaac cgcacgttat     60
gcagcccgtc atgatgttct ggcacgtctg gcccgtgaaa ttgccgcagc acgtcatgca    120
aatccggatc tggccctggt tctggcccat ggtagtggca gttatggtca tgttgccgcc    180
cgtgaaaccg gctatgatcg tgaacgtggc catcgcgatg tgctggccta tgcccgtgtt    240
gcagccgccg cagcaaccct gaatagtctg gtgcgtgccg cactgctggc ctgtgatatt    300
ccggcagtga gcctgccgcc gagcgcatca gcactggttg aaggcggtcg cctggtgcgt    360
atggcatggg acccgttcgc acgcattctg gcatggggcg gcgttccgct gacctatggt    420
gatgtggccc tgaccgaaac cggtggcacc attgtgagta ccgaaaccgt gctgctggca    480
ctggcagaac agctgccgcc gacccgcctg ctgctgttaa ccgatgtgcc gggcgtgttc    540
gcacatccgc cgaccggtga taccaccccg ccgttactgg aacgcattac cccggcaacc    600
tggccggaac agcgcgcagg tgtgcagggt gcccgtggta ccgatgttac cggtggtatg    660
gtgcgtaaag ttgaacagat gctggccctg gtggaacgtc tgccgcaggt tgaagttatt    720
attgcaagcg gtcgcacccc gggcctgctg caacgtgcac tgctgggtga agatgtgccg    780
ggtacccgta ttgttcgcgc ctgactgaaa gctt                                814

<210> 180
<211> 242
<212> PRT
<213> Thermoplasmatales archaeon

<400> 180
Met Ile Ala Ile Lys Leu Gly Gly Ser Phe Ile Thr Asp Lys Ser Lys
1               5                   10                  15
Tyr Arg Thr Phe Arg Lys Tyr Glu Thr Glu Arg Ala Leu Lys Gly Ile
            20                  25                  30
Ile Lys Phe Gly Glu Pro Phe Val Leu Val His Gly Ala Gly Ser Phe
        35                  40                  45
Gly His Ile Leu Cys Lys Gln Ser Gly Phe Pro Gly Thr Tyr Lys Gly
    50                  55                  60
Lys Glu Ser Gln Leu Ser Arg Val Lys Tyr Asp Thr Cys Ser Leu Asn
65                  70                  75                  80
Ser Met Ile Thr Glu Ile Leu Leu Asp Leu Gly Met Ala Pro Met Ser
                85                  90                  95
Phe Ser Pro Phe His Leu Arg Arg Lys Asp Thr Phe Asp Tyr Ser Ser
            100                 105                 110
Val Leu Arg Ser Val Glu Gly Gly Phe Leu Pro Val Met Tyr Gly Asp
        115                 120                 125
Ile Tyr Ile Asp Gly Asn Asp Val Lys Ile Tyr Ser Gly Asp Ser Ile
    130                 135                 140
Met Phe Asp Ile Cys Asn Leu Leu Asn Pro Thr Asp Ala Ile Phe Met
145                 150                 155                 160
Gly Asp Val Asp Gly Ile Phe Asp Arg Asp Pro Lys Ile Tyr Pro Glu
                165                 170                 175
Ser Lys Leu Leu Lys Thr Val Lys Lys Gln Gln Asp Phe Asn Thr Ile
            180                 185                 190
Leu Asn Asp Val Thr Gly Gly Met Gly Gly Lys Tyr Ile Ala Met Lys
        195                 200                 205
Lys Ile Ala Ser Leu Gly Ile Arg Thr Ser Met Met Asn Gly Leu Tyr
    210                 215                 220
Pro Glu Arg Leu Ser Asp Leu His Asn Asp Asn Phe Tyr Gly Ser Val
225                 230                 235                 240
Ile Glu

<210> 181
<211> 729
<212> DNA
<213> Thermoplasmatales archaeon

<400> 181
atgatcgcaa taaagcttgg aggaagtttc ataacagata agtcaaaata tagaactttc     60
aggaagtatg aaacggaacg ggcattaaag ggaataataa aatttggcga accatttgtt    120
ctggtacacg gagccggttc ctttggtcac atattatgca agcagagtgg tttccccggt    180
acttataaag gtaaagaatc tcaactttcc agagttaagt atgatacatg ctctctgaat    240
agcatgataa cagaaatcct tcttgatctg ggtatggctc caatgagctt ttcacctttt    300
catctaagga ggaaggatac ctttgattac tcatctgttc tcaggtctgt ggaaggaggt    360
ttccttcctg taatgtatgg cgatatttac attgacggga atgatgttaa gatctattcc    420
ggtgacagca taatgtttga tatttgtaac ctgttgaatc ctacagatgc aattttcatg    480
ggtgacgttg acggaatatt cgacagggat ccaaagatat atcctgaaag caaactcctg    540
aagacagtga aaaagcagca ggattttaac accatattga atgatgtaac cggcggcatg    600
ggtggaaagt acatagccat gaagaaaatt gcatcacttg gaataagaac ctccatgatg    660
aacggtttat atccggaaag gcttagtgat cttcataacg ataactttta tggttcggtg    720
attgaatga                                                            729

<210> 182
<211> 739
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 182
atgatcgcaa ttaagctggg cggcagcttc attaccgata aaagtaaata tcgcaccttc     60
cgtaaatatg aaaccgaacg cgccctgaaa ggcattatta aattcggtga accgttcgtg    120
ctggtgcatg gtgccggtag cttcggccat attctgtgta aacagagtgg cttcccgggt    180
acctataaag gcaaagaaag tcagctgagt cgcgtgaaat atgatacctg tagcctgaat    240
agtatgatta ccgaaattct gctggatctg ggcatggccc cgatgagctt cagtccgttc    300
catctgcgcc gtaaagatac cttcgattat agcagtgtgc tgcgcagtgt tgaaggcggc    360
ttcctgccgg ttatgtatgg cgatatctat attgatggca atgatgttaa gatatatagc    420
ggtgatagca ttatgttcga tatctgtaat ctgctgaatc cgaccgatgc aatcttcatg    480
ggcgatgttg atggtatctt cgatcgtgat ccgaaaatct atccggaaag taaactgctg    540
aaaaccgtga aaaaacagca ggacttcaat accattctga atgatgtgac cggtggtatg    600
ggcggtaaat atattgccat gaaaaaaatt gcgagcctgg gtattcgcac cagtatgatg    660
aatggtctgt atccggaacg cctgagcgat ctgcataatg ataacttcta tggcagcgtt    720
attgaatgac tgaaagctt                                                 739

<210> 183
<211> 282
<212> PRT
<213> Aciduliprofundum boonei

<400> 183
Met Ala Trp Ser Thr Tyr Gly Ile Ser Gly Val Leu Ala Ile Val Arg
1               5                   10                  15
Asp Arg Met Gly Leu Ser Val Ile Lys Leu Gly Gly Ser Leu Leu Thr
            20                  25                  30
Asp Lys Ser Lys Pro Tyr Thr Met Arg Lys Glu Lys Phe Arg Glu Ile
        35                  40                  45
Ala Arg Glu Leu Lys Glu Ser Met Asp Glu Met Ile Ile Val His Gly
    50                  55                  60
Val Gly Ser Tyr Gly His Pro Pro Val Lys Glu Tyr Lys Leu Tyr Arg
65                  70                  75                  80
Gly Tyr Thr Gly Lys Glu Asn Leu Leu Asn Leu Ala Lys Thr Gln Ser
                85                  90                  95
Ile Val Phe Glu Leu Arg Leu Glu Phe Val Arg Ala Leu Gln Glu Glu
            100                 105                 110
Gly Ile Asn Ala Met Ile Phe Leu Pro Ser Ser Gln Ile Val Ala Glu
        115                 120                 125
Gly Met Lys Ile Lys Lys Ile Cys Ile Glu Pro Ile Lys Arg Phe Leu
    130                 135                 140
Glu Met Gly Met Thr Pro Val Phe Gly Gly Asp Ile Val Val Asp Thr
145                 150                 155                 160
Lys Met Gly Tyr Ser Val Cys Ser Gly Asp Leu Ile Ala Ala His Leu
                165                 170                 175
Ala Ser Glu Leu Asn Ala Glu Arg Leu Ile Phe Ala Thr Asp Val Asp
            180                 185                 190
Gly Ile Tyr Thr Lys Asp Pro Lys Lys Asp Lys Asn Ala Lys Leu Leu
        195                 200                 205
Lys Glu Ile Asn Leu Glu Asn Met Asp Glu Leu Ala Lys Leu Thr Gly
    210                 215                 220
Ser Ala Phe Thr Asp Val Thr Ser Gly Met Tyr Gly Lys Ile Glu Thr
225                 230                 235                 240
Ile Arg Lys Tyr Lys Asn Asp Leu Lys Asn Thr Glu Ile Val Ile Leu
                245                 250                 255
Ser Met Leu Lys Glu Gly Asn Leu Lys Ala Tyr Met Arg Asn Met Lys
            260                 265                 270
Asp Ala Lys Tyr Thr Lys Ile Lys Ile Lys
        275                 280

<210> 184
<211> 849
<212> DNA
<213> Aciduliprofundum boonei

<400> 184
atggcctgga gtacatacgg tatttcaggc gttctggcaa ttgtgaggga tagaatggga     60
ttgagtgtaa taaaattggg tggctcgctg ctcacagata aatcgaagcc ttacacaatg    120
aggaaagaaa aatttagaga aattgctaga gagttgaaag agagtatgga tgagatgata    180
atagttcatg gtgtgggctc ttatggacac ccgcctgtta aggagtacaa actttacagg    240
ggttatacag gaaaggagaa tcttttaaat ttagcaaaga cgcagagcat agtgtttgaa    300
ttgcgtttgg aatttgttag agctttacag gaagagggaa taaatgccat gattttcctg    360
ccgagtagcc agatagttgc agaaggaatg aagataaaaa agatttgtat tgaaccaata    420
aagagatttt tagagatggg aatgactcca gtgtttggtg gagacattgt ggttgatact    480
aagatgggtt attcagtttg ttcgggagat cttattgcag cacatcttgc ctctgagcta    540
aatgctgaga gattgatttt tgcaacagat gtggacggaa tttatacaaa ggacccgaaa    600
aaagataaaa atgcaaaatt gctgaaagaa ataaatctag aaaatatgga tgaattggct    660
aaattaacgg gttctgcttt cactgatgtg acctcaggta tgtacggcaa aatagagacc    720
attagaaaat acaagaatga tttaaagaat acagaaatag tcattctttc tatgttaaaa    780
gagggcaatt taaaagcata tatgagaaat atgaaggatg caaaatacac caaaataaaa    840
ataaagtaa                                                            849

<210> 185
<211> 859
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 185
atggcatgga gtacctatgg tattagtggt gttctggcca ttgttcgcga tcgcatgggt     60
ctgagtgtta ttaaactggg tggtagcctg ctgaccgata aaagtaaacc gtataccatg    120
cgcaaagaaa aattccgtga aattgcccgt gaactgaaag aaagtatgga tgaaatgatt    180
atcgtgcatg gtgttggtag ttatggtcat ccgccggtga aagaatataa actgtatcgt    240
ggctataccg gtaaagaaaa tctgctgaat ctggcaaaaa cacaaagcat tgtgttcgaa    300
ctgcgcctgg agttcgttcg tgcactgcaa gaagaaggta ttaatgcaat gatcttcctg    360
ccgagtagcc agattgttgc cgaaggcatg aaaattaaaa aaatctgtat cgagccgatc    420
aaacgcttcc tggaaatggg catgaccccg gtgttcggtg gtgatattgt tgtggatacc    480
aaaatgggtt atagtgtgtg tagtggtgat ctgattgcag cacatctggc aagtgaactg    540
aatgccgaac gcctgatctt cgccaccgat gttgatggta tctataccaa agatccgaaa    600
aaagataaga acgcaaaact gctgaaagaa attaatctgg aaaacatgga tgagctggcc    660
aaactgaccg gtagcgcctt caccgatgtg accagcggta tgtatggcaa aattgaaacc    720
attcgtaaat acaaaaacga cctgaaaaac accgaaattg ttattctgag catgctgaaa    780
gaaggtaatc tgaaagcata tatgcgtaat atgaaggatg caaaatacac caaaatcaaa    840
atcaagtgac tgaaagctt                                                 859

<210> 186
<211> 252
<212> PRT
<213> Methanohalarchaeum thermophilum

<400> 186
Met Lys Ile Ile Lys Ile Gly Gly Ser Leu Ile Thr Asp Lys Asp Ser
1               5                   10                  15
Tyr Lys Thr Pro Asp Thr His Glu Ile Asn Arg Ile Ala Arg Glu Ile
            20                  25                  30
Ser Lys Gly Ile Asn Ser Asp Arg Leu Ile Leu Ile His Gly Ala Gly
        35                  40                  45
Ser Phe Gly His Pro Leu Val Lys Lys Phe Lys Leu Asn Lys Lys Ser
    50                  55                  60
Thr Asn Lys Asp Leu Phe Ser Ile Leu Lys Val Gln Asp Ser Val Arg
65                  70                  75                  80
Glu Leu Asn Arg Leu Phe Lys Asp Ser Leu Asn Lys Glu Lys Ile Pro
                85                  90                  95
Ala Tyr Thr Ile His Pro Ser Ser Ile Thr Arg Thr Glu Asn Gly Glu
            100                 105                 110
Ile Ile Asp Leu Glu Leu Asn Thr Ile Arg Gln Ala Leu Lys Glu Gly
        115                 120                 125
Tyr Ile Pro Leu Leu Tyr Gly Asp Met Val Leu Asp Thr Lys Asn Arg
    130                 135                 140
Ala Ser Val Leu Ser Gly Asp Arg Leu Val Ser Phe Leu Ala Gln Glu
145                 150                 155                 160
Leu Lys Pro Asn Lys Val Gly Met Ala Thr Thr Thr Pro Val Leu Asp
                165                 170                 175
Lys Asn Asn Gln Lys Ile Asp Leu Ile Thr Gln Thr Asp Leu Glu Asn
            180                 185                 190
Ile Gly Glu Ser Asn Ser Thr Asp Val Thr Gly Gly Met Leu Asn Lys
        195                 200                 205
Val Asn Glu Leu Leu Lys Thr Arg Ala Lys Ser Tyr Ile Phe Asn Ala
    210                 215                 220
Lys Lys Arg Lys Ala Leu Glu Lys Phe Ile Lys Gly Lys Asn Ile Gly
225                 230                 235                 240
Thr Glu Val Glu Tyr Asp Asp Lys Gly Lys Lys Thr
                245                 250

<210> 187
<211> 759
<212> DNA
<213> Methanohalarchaeum thermophilum

<400> 187
atgaaaataa ttaaaatcgg tggaagctta ataacagata aggacagtta taaaacccca     60
gatacccacg agataaatcg tatagcaaga gagataagta agggaatcaa ctcagatcga    120
ttaatactaa ttcatggagc tgggtccttc ggacacccat tagtcaagaa atttaaatta    180
aataaaaaat caacaaacaa ggatttattt tctattctaa aggtacaaga ttcagttagg    240
gaattaaata ggttatttaa ggactcttta aataaagaaa aaataccagc ttacacgatt    300
catccttcat caattacaag aacggaaaat ggagagataa tagatctaga attaaataca    360
ataagacaag cactcaagga gggctatata cctctcctgt atggtgacat ggttttagat    420
accaaaaata gagcaagtgt attatctggt gatcgattag tttccttctt agctcaggaa    480
cttaaaccaa ataaagttgg tatggctaca actactcccg tattagataa aaacaatcaa    540
aagatcgatt taatcaccca aaccgattta gaaaatatag gagaatctaa ttcaactgat    600
gtcacgggag gaatgttaaa caaagtaaat gagcttttaa aaacccgagc caaatcatat    660
atcttcaatg ctaaaaaaag aaaagcactt gaaaaattca taaaaggcaa aaacattggg    720
acagaggtcg aatatgacga taaaggaaag aaaacttga                           759

<210> 188
<211> 769
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 188
atgaagatca tcaagatcgg cggcagtctg attaccgata aagatagcta taaaacacct     60
gatacccatg aaattaaccg cattgcacgt gaaattagta aaggtattaa cagcgatcgc    120
ctgattctga ttcatggcgc aggtagcttc ggccatccgc tggttaaaaa attcaaactg    180
aataaaaaga gcaccaacaa agacttattc agcattctga aagttcagga tagcgttcgc    240
gaactgaatc gcctgttcaa agatagcctg aataaagaaa aaatcccggc atataccatt    300
catccgagca gtattacccg taccgaaaat ggtgaaatta ttgatctgga actgaatacc    360
attcgccagg ccctgaaaga aggttatatt ccgctgctgt atggcgatat ggtgctggat    420
accaaaaatc gcgcaagtgt tctgagtggt gatcgtctgg tgagcttcct ggcacaggaa    480
ctgaaaccga ataaagttgg catggcaacc accaccccgg tgctggataa aaataatcag    540
aaaattgacc tgatcaccca gaccgatctg gaaaatattg gcgaaagcaa tagcaccgat    600
gttaccggtg gcatgctgaa taaagttaat gaactgctga aaaccagagc aaaaagctat    660
atcttcaatg caaaaaagcg caaagcactg gaaaaattca ttaaaggtaa gaatatcggc    720
accgaagttg aatatgatga taaaggtaaa aagacctgac tgaaagctt                769

<210> 189
<211> 243
<212> PRT
<213> Acidiplasma cupricumulans

<400> 189
Met Ile Ile Leu Lys Leu Gly Gly Ser Ile Ile Thr Asp Lys Lys Thr
1               5                   10                  15
Tyr Arg Lys Phe Glu Glu Asp Arg Thr Arg Lys Ile Ile Ala Glu Ile
            20                  25                  30
Ser Lys Ile Lys Asp Lys Phe Ile Ile Ile His Gly Gly Gly Ser Phe
        35                  40                  45
Gly His Ile Met Ala Lys Glu Tyr Asn Ile Pro Gly Arg Leu Asn Lys
    50                  55                  60
Arg Ser Leu Tyr Tyr Met Ser Leu Ile His Tyr Asp Met Ser Asp Leu
65                  70                  75                  80
Asn Met Arg Val Ser Lys Ile Leu Ser Glu Tyr Gly Met Gly Asn Ile
                85                  90                  95
Pro Val Pro Pro Ser Thr Tyr Ile Tyr Gly Lys Lys Lys Asn Tyr Asp
            100                 105                 110
Ile Phe Arg Tyr Tyr Val Lys Asn Asn Ile Met Pro Val Ser Tyr Gly
        115                 120                 125
Asp Val Tyr Ile Lys Asn Arg Asn Tyr Ile Gly Ile Tyr Ser Gly Asp
    130                 135                 140
Asp Ile Ile Tyr Asp Leu Ser Arg Ile Phe Met Pro Glu Lys Val Ile
145                 150                 155                 160
Phe Phe Ser Asp Val Asp Gly Ile Phe Asp Lys Asn Pro Lys Ile His
                165                 170                 175
Lys Asp Ala Lys Leu Leu Lys Thr Val Asn Lys Asp Phe Asn Phe Glu
            180                 185                 190
Asn Asp Ser Ile Asp Val Thr Gly Gly Ile Ile Asn Lys Tyr Asn Ser
        195                 200                 205
Met Val Lys Ile Ser Lys Leu Gly Ile Lys Val Tyr Leu Ile Asn Gly
    210                 215                 220
Leu Tyr Pro Glu Arg Ile Lys Asp Ile Gly Lys Asp Asn Phe Tyr Gly
225                 230                 235                 240
Thr Val Val

<210> 190
<211> 732
<212> DNA
<213> Acidiplasma cupricumulans

<400> 190
atgataatat taaagcttgg tggcagtata attacagaca aaaaaacata cagaaaattt     60
gaggaggata ggacaagaaa aattattgct gaaatatcga aaattaaaga taaatttatt    120
attatacatg gaggcggatc ttttggccac ataatggcca aggaatataa cataccgggc    180
aggcttaata aaaggtcatt atattacatg agccttatac attatgatat gtcagatctc    240
aatatgaggg tttcaaaaat tctttcagag tacggcatgg gaaatatacc ggtgccacca    300
tcaacatata tttatggcaa aaaaaagaat tatgacatct ttaggtatta tgtaaaaaat    360
aatattatgc ctgtgagtta tggagatgtt tacataaaaa atagaaatta cattggcata    420
tattctggag atgatataat ttatgatcta tcaagaatat tcatgccgga aaaggttata    480
ttttttagtg atgttgatgg tatatttgat aaaaatccaa aaatacataa ggatgcaaaa    540
ctgctaaaaa cagtgaataa agattttaat tttgaaaatg attcaataga tgttactggt    600
ggaattatta ataagtacaa ttcaatggtt aaaatttcaa aacttggcat aaaagtttat    660
ttaataaatg gattatatcc agagagaata aaagatatag gaaaagataa tttttatgga    720
acggtggtat aa                                                        732

<210> 191
<211> 742
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 191
atgatcatcc tgaaactggg tggtagtatt attaccgata aaaaaaccta ccgtaagttc     60
gaagaagatc gtacccgtaa aattattgcc gaaattagta aaatcaagga caaattcatc    120
atcatccatg gtggcggcag cttcggccat attatggcaa aagaatataa tatcccgggc    180
cgcctgaata aacgcagtct gtattatatg agcctgattc attatgatat gagcgatctg    240
aatatgcgtg tgagcaaaat tctgagtgaa tatggtatgg gtaatattcc ggttccgccg    300
agtacctata tctatggtaa aaaaaaaaac tacgacatct tccgttacta tgttaaaaat    360
aacatcatgc cggttagtta tggcgatgtg tatattaaaa accgcaatta tatcggcatc    420
tatagtggtg atgatattat ctatgacctg agtcgtatct tcatgccgga aaaagtgatc    480
ttcttcagcg atgttgatgg tatcttcgat aaaaatccga aaattcataa ggacgccaaa    540
ctgctgaaaa ccgttaataa agacttcaac ttcgaaaatg acagcattga tgtgaccggc    600
ggtattatta ataaatataa tagcatggtg aagatcagca aactgggcat taaagtgtat    660
ctgattaatg gtctgtaccc ggaacgtatt aaagatattg gcaaagataa cttctacggt    720
accgtggtgt gactgaaagc tt                                             742

<210> 192
<211> 244
<212> PRT
<213> Aciduliprofundum species

<400> 192
Met Leu Leu Val Lys Met Gly Gly Ser Val Ile Thr Asp Lys Arg Val
1               5                   10                  15
Tyr Arg Arg Phe Arg Glu Asp Val Met Glu Arg Ile Val Lys Tyr Leu
            20                  25                  30
Pro Lys Glu Asp Leu Ile Ile Val His Gly Gly Gly Ser Phe Gly His
        35                  40                  45
Pro Leu Ala Lys Lys Tyr Gly Ile Thr Glu Gly Phe Ser Glu Glu Lys
    50                  55                  60
Thr Met Gly Phe Ala Glu Ile Gly Arg Asp Met Glu Asp Leu Asn Leu
65                  70                  75                  80
Arg Ile Ile Glu Ile Leu Ile Glu Asn Asp Ile Pro Ala Val Ser Ile
                85                  90                  95
Ala Pro His Ser Phe His Ile Phe Gly Glu Glu Met Asp Leu His Ile
            100                 105                 110
Phe Glu Arg Phe Leu Ser Leu Gly Leu Val Pro Val Thr Tyr Gly Asp
        115                 120                 125
Ile Ile Leu Asp Ser Ser Gln Gly Ile Asn Ile Cys Ser Gly Asp Tyr
    130                 135                 140
Leu Met Leu Gln Leu Ala Arg Glu Phe Arg Pro Glu Lys Val Ile Phe
145                 150                 155                 160
Leu Thr Asp Val Asp Gly Ile Tyr Asp Arg Asp Pro Ser Glu Gln Gly
                165                 170                 175
Ala Glu Leu Ile Glu Val Leu Arg Arg Asp Ser Lys Val Glu Thr Ile
            180                 185                 190
Ile Lys Val Asp Asp Val Thr Gly Gly Val Ala Tyr Lys Ile Ser Ile
        195                 200                 205
Met Arg Lys Ile Ala Arg Tyr Ser Arg Val Tyr Val Leu Asn Gly Phe
    210                 215                 220
His Pro Glu Arg Ile Glu Asn Val Leu Asn Asp Glu Asp Phe Val Gly
225                 230                 235                 240
Thr Val Val Glu

<210> 193
<211> 735
<212> DNA
<213> Aciduliprofundum species

<400> 193
atgctcctgg tgaagatggg gggaagcgtg ataacagata agagggttta cagaaggttc     60
cgggaagatg tgatggagag gatagtcaaa tatcttccaa aggaggattt gataattgtg    120
catggtggag gttcctttgg tcatcctctg gcaaagaaat acgggataac ggagggattc    180
tcagaggaaa agaccatggg atttgcagag ataggaaggg atatggagga tcttaatctg    240
aggatcatag aaattttgat agagaatgac attcctgccg tatccattgc gccccattcc    300
ttccatattt tcggagagga aatggatctc cacatatttg agagattcct ctctctgggt    360
ttggtgcccg ttacctacgg ggatataatc cttgattctt cgcagggtat aaacatatgc    420
tctggagatt acctgatgct tcaacttgcc agagaattca ggcccgagaa ggttatattc    480
ctcacagatg tggacggcat atacgatagg gatccctcag aacagggagc agaactcata    540
gaggtgctga ggagggatag caaggtagag acgatcataa aggtggatga tgttactgga    600
ggggttgcct acaaaatatc catcatgagg aaaattgcga ggtacagcag ggtatatgtg    660
cttaatggat ttcatcccga gaggatagag aatgttttaa atgatgaaga ttttgtaggg    720
acggtggtgg aatga                                                     735

<210> 194
<211> 745
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 194
atgctgctgg ttaaaatggg cggtagcgtt attaccgata aacgcgtgta tcgtcgcttc     60
cgcgaagatg ttatggaacg tattgttaaa tacctgccga aagaagactt aattattgtt    120
catggtggcg gtagcttcgg ccatccgctg gcaaaaaaat atggtattac cgaaggcttc    180
agcgaagaaa aaaccatggg cttcgcagaa attggccgtg atatggaaga cttaaatctg    240
cgtattattg aaatcctgat cgaaaatgat atcccggcag tgagtattgc cccgcatagc    300
ttccatatct tcggcgaaga aatggatctg catatcttcg aacgcttcct gagcctgggc    360
ctggttccgg ttacctatgg tgatattatt ctggatagta gccagggtat taatatctgt    420
agcggcgatt atctgatgct gcaactggcc cgtgagttcc gcccggaaaa agtgatcttc    480
ctgaccgatg tggatggcat ctatgatcgc gatccgagtg aacagggcgc cgaactgatt    540
gaagttctgc gtcgtgatag caaagttgaa accattatta aagtggatga cgttaccggt    600
ggtgtggcat ataaaattag cattatgcgc aaaatcgccc gttatagccg tgtgtatgtt    660
ctgaatggct tccatccgga acgtattgaa aatgttctga atgatgaaga cttcgtgggt    720
accgttgttg aatgactgaa agctt                                          745

<210> 195
<211> 268
<212> PRT
<213> Euryarchaeota archaeon

<400> 195
Met Gly Ser Glu Met Gly Leu Arg Val Ala Ile Lys Leu Gly Gly Gly
1               5                   10                  15
Leu Ile Thr Asp Lys Gly Ser Met Lys Lys Phe Asp Gln Lys Ala Val
            20                  25                  30
Glu Lys Val Val Asp Ser Leu Ser Ser Val Ser Glu Leu Gly Ala Ser
        35                  40                  45
Ile Val Leu Val His Gly Ala Gly Ser Phe Gly His Leu Leu Ala Lys
    50                  55                  60
Lys Trp Gly Ile Ala Asn Gly Phe Asn Ile Gln Leu Glu Lys Glu Gln
65                  70                  75                  80
Leu Glu Ala Val Arg Glu Ile Arg Ser Asp Met Arg Glu Leu Asn Ala
                85                  90                  95
Leu Ile Ile Gly Lys Met Glu Glu Arg Gly Leu Glu Cys Ile Gly Tyr
            100                 105                 110
Pro Pro Ser Asp Trp Ala Arg Gly Thr Gly Ala Leu Phe Thr Gly Asp
        115                 120                 125
Val Ser Ile Phe Glu Arg Gly Ser Arg Gln Pro Ile Pro Val Thr Phe
    130                 135                 140
Gly Asp Val Val Asp Thr Glu Asp Glu Ser Arg Phe Gly Ile Leu Ser
145                 150                 155                 160
Gly Asp Asp Leu Met Leu Arg Leu Ser Thr Glu Leu Glu Val Thr His
                165                 170                 175
Ser Ile Phe Leu Ile Gly Asp Ser Glu Gly Val Leu Thr Gly Pro Pro
            180                 185                 190
Ala Glu Arg Asp Ser Glu Leu Ile Thr His Leu Gly Ser Glu Thr Lys
        195                 200                 205
Ile Lys Gly Glu His Asp Ala Glu Ile Asp Val Thr Gly Gly Ile Gly
    210                 215                 220
Leu Lys Ile Glu Arg Ala Leu Glu Ile Ala Lys Val Val Asp Glu Val
225                 230                 235                 240
Trp Ile Ile Asp Gly Arg Glu Pro Asp Arg Val Leu Glu Leu Leu Thr
                245                 250                 255
Ser Gly Glu Thr Thr Gly Thr Lys Ile Leu Asn Gly
            260                 265

<210> 196
<211> 807
<212> DNA
<213> Euryarchaeota archaeon

<400> 196
ttggggtcgg aaatgggact cagggtagca ataaaattag gagggggcct aattactgac     60
aagggttcta tgaagaaatt cgaccaaaag gccgttgaaa aggttgtgga ctcgctaagt    120
tctgtttctg aattgggggc ttcaatcgtg ctggttcacg gagctggctc ttttggacat    180
ctgctggcaa agaaatgggg gattgctaat ggattcaata ttcaactgga gaaggagcag    240
ttggaagcag taagagaaat tcgttcagat atgagggaat taaatgccct gatcatagga    300
aagatggaag aaagaggttt ggaatgcatt ggctatcccc cctctgattg ggccaggggc    360
actggagcac tttttactgg ggatgtttca atattcgaaa ggggtagtag acagccaatt    420
cctgtaacat ttggtgatgt tgttgatact gaggacgagt ctaggtttgg tatattgtca    480
ggagacgatt tgatgctacg tttgtctact gaattagaag tcacgcatag catttttctc    540
attggggatt ctgagggagt tttgactggc cctccagcag aaagagattc tgaattgatt    600
acgcatttag gatctgaaac caaaattaag ggggagcatg atgcagaaat cgatgttact    660
ggggggattg gattgaaaat agaaagggct ctcgagattg caaaggtagt ggatgaagtg    720
tggataattg acggtagaga accagataga gttcttgaat tattgacatc cggggaaact    780
acagggacta agattctgaa cggttga                                        807

<210> 197
<211> 817
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 197
atgggcagtg aaatgggcct gcgtgttgcc attaaactgg gtggcggcct gattaccgat     60
aaaggcagta tgaaaaaatt cgatcagaaa gccgttgaaa aagttgtgga tagcctgagc    120
agcgttagcg aactgggtgc aagcattgtt ctggtgcatg gtgcaggcag cttcggccat    180
ctgctggcaa aaaaatgggg tattgcaaat ggcttcaata ttcagctgga aaaagaacag    240
ctggaagccg tgcgcgaaat tcgcagcgat atgcgcgaac tgaatgccct gattattggt    300
aaaatggaag aacgcggtct ggaatgcatt ggttatccgc cgagcgattg ggcccgtggt    360
accggtgcac tgttcaccgg tgatgtgagc atcttcgaac gtggcagccg tcagccgatt    420
ccggtgacct tcggtgatgt tgtggatacc gaagatgaaa gccgcttcgg cattctgagc    480
ggtgatgatc tgatgctgcg tctgagtacc gaactggaag tgacccatag catcttcctg    540
attggcgata gcgaaggtgt tctgaccggc ccgccggcag aacgtgatag cgaactgatt    600
acccatctgg gcagtgaaac caaaattaaa ggcgaacatg atgccgaaat tgatgtgacc    660
ggtggcattg gtctgaaaat tgaacgtgca ctggaaattg caaaagttgt tgatgaagtg    720
tggattattg atggtcgtga accggatcgc gtgctggaac tgctgaccag cggcgaaacc    780
accggtacca aaattctgaa tggctgactg aaagctt                             817

<210> 198
<211> 260
<212> PRT
<213> Nitrososphaera evergladensis

<400> 198
Met Gln Lys Leu Ala Leu Val Lys Leu Gly Gly Ser Val Ile Thr Phe
1               5                   10                  15
Lys Asp Lys Ala Leu Thr Ala Asn Thr Gly Ala Ile Asp Gly Ile Ser
            20                  25                  30
Gly Ala Leu Val Gln Leu Asp Met Pro Val Ile Val Val His Gly Gly
        35                  40                  45
Gly Ser Phe Gly His His Trp Ser Val Gln Tyr Asp Met His Thr Lys
    50                  55                  60
Pro Ala Pro Tyr Asp Pro His Gly Val Ala Val Val His Glu Ser Met
65                  70                  75                  80
Ile Ala Leu Asn Gln Ile Ile Val Asn Ser Met Ile Lys Ala Gly Ala
                85                  90                  95
Asn Pro Tyr Ala Val Ala Pro Cys Met Phe Thr Thr Gly His Lys Ala
            100                 105                 110
Ile Ala Ala Lys Val Arg Gln Leu Tyr Glu Met Ala Lys Ala Asn Asn
        115                 120                 125
Val Ile Pro Val Thr Phe Gly Asp Val Val His Met Gly Gly Arg Lys
    130                 135                 140
Tyr Ser Ile Leu Ser Gly Asp Ala Leu Met Ser Ile Ile Ala Lys Val
145                 150                 155                 160
Leu Lys Pro Ser Arg Val Ile Phe Ala Thr Asn Val Asp Gly Ile Tyr
                165                 170                 175
Arg Asp Met Lys Thr Arg Glu Leu Val Gln Glu Leu Lys Ser Ala Arg
            180                 185                 190
Arg Asn Gly Asp Pro Val Glu Phe Ser Lys Thr Ala Gly Ala Asp Val
        195                 200                 205
Thr Gly Gly Met Gln Arg Lys Val Arg Glu Ala Phe Lys Ile Ala Ser
    210                 215                 220
Met Gly Met Asp Val Val Leu Val Asn Gly Leu Tyr Pro Glu Arg Ile
225                 230                 235                 240
Val Gln Ala Ala His Gly Glu Val Gln Thr Gly Thr Val Val Val Lys
                245                 250                 255
Lys Gly Arg Lys
            260

<210> 199
<211> 783
<212> DNA
<213> Nitrososphaera evergladensis

<400> 199
atgcagaaac ttgcacttgt caagctggga ggctctgtca tcaccttcaa ggacaaggca     60
cttacagcaa acactggcgc aatagatggc atatccggcg cacttgtaca gcttgacatg    120
cctgtcattg ttgtccacgg cggggggtct tttggccacc actggtctgt gcagtacgac    180
atgcacacca agcctgcgcc ctacgacccg cacggagtgg ccgtggtgca cgaatcgatg    240
attgcgttaa accagataat cgtaaattcc atgataaagg caggcgcaaa cccgtacgcc    300
gttgcaccct gcatgttcac gacgggccat aaagcgattg cagcaaaggt aaggcagctg    360
tacgaaatgg ccaaggcaaa caacgtaatc cctgtcacgt ttggcgacgt ggtccacatg    420
ggcggccgga aatactctat tctttccggc gacgcgctga tgtccatcat cgcaaaggtg    480
ctaaagccgt caagggtcat atttgccacc aacgtcgacg gcatttaccg ggacatgaag    540
acgagggaac tcgtacagga gctaaagtcg gcaaggcgga atggtgatcc cgtggagttc    600
tcaaagacgg ccggcgccga cgtgacgggc ggcatgcagc gcaaggtaag ggaagcattt    660
aagattgcat ctatgggtat ggatgtcgtg ctggtaaatg ggctttatcc tgagcgcata    720
gtccaggcgg cgcacggcga ggttcaaaca ggcaccgtag tagtaaagaa ggggaggaaa    780
taa                                                                  783

<210> 200
<211> 793
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 200
atgcagaaac tggccctggt gaaactgggt ggtagcgtta ttaccttcaa agataaagcc     60
ctgaccgcca ataccggtgc aattgatggt attagcggtg cactggtgca gctggatatg    120
ccggttattg tggtgcatgg tggtggcagc ttcggtcatc attggagcgt gcagtatgat    180
atgcatacca aaccggcccc gtatgatccg catggtgtgg ccgttgtgca tgaaagcatg    240
attgcactga atcagattat tgtgaatagc atgattaagg caggtgccaa tccgtatgca    300
gttgccccgt gtatgttcac caccggtcat aaagccattg cagccaaagt tcgccagctg    360
tatgaaatgg ccaaagcaaa taatgttatc ccggttacct tcggcgatgt ggtgcacatg    420
ggtggtcgta aatatagcat tctgagtggt gatgcactga tgagcattat tgcaaaagtg    480
ctgaaaccga gtcgcgttat cttcgcaacc aatgttgatg gtatctatcg cgatatgaaa    540
accagagaac tggttcagga actgaaaagc gcccgtcgta atggtgatcc ggtggagttc    600
agtaaaaccg ccggtgccga tgttaccggc ggcatgcagc gcaaagtgcg cgaagccttc    660
aaaattgcca gcatgggtat ggatgtggtg ctggtgaatg gtctgtatcc ggaacgcatt    720
gtgcaggccg cccatggtga agttcagacc ggtaccgtgg tggtgaaaaa aggtcgtaaa    780
tgactgaaag ctt                                                       793

<210> 201
<211> 256
<212> PRT
<213> Methanosalsum zhilinae

<400> 201
Met Ser Gln His Lys Asn Ile Thr Ile Leu Lys Ile Gly Gly Ser Val
1               5                   10                  15
Ile Thr Asp Lys Ser Ser Asp Ile Gly Lys Val Gln Ile Glu Glu Ile
            20                  25                  30
Glu Arg Ile Cys Gln Glu Ile Ser Gly Tyr Ser Asn Asp Leu Ile Ile
        35                  40                  45
Val His Gly Ala Gly Ser Tyr Gly His Pro Leu Ala Lys Lys Tyr Asp
    50                  55                  60
Leu Asp Asn Ile Pro Asp Pro Lys Gly Ala Ile Ile Thr His Ser Ser
65                  70                  75                  80
Val Lys Ser Leu Asn Glu Ile Met Val Ser Ser Leu Gln Lys Ala Gly
                85                  90                  95
Ile Asp Ala Val Ser Val His Pro Leu Asn Asn Thr Val Ser Asn Asp
            100                 105                 110
Gly Arg Ile Ser Asp Met Phe Leu Ser Asn Ile His Ile Met Leu Glu
        115                 120                 125
Asn Gly Leu Val Pro Val Ile His Gly Asp Val Val Met Asp Ile Thr
    130                 135                 140
Asn Thr Phe Ser Val Ile Ser Gly Asp Gln Ile Val Ser Tyr Leu Ala
145                 150                 155                 160
Asn Lys Leu Lys Ala Ser Arg Val Gly Ile Gly Ser Ile Glu Asp Gly
                165                 170                 175
Val Met Asp Asn Lys Gly Lys Thr Leu Thr Lys Ile Thr Ser Ser Asn
            180                 185                 190
Phe Lys Glu Ile Glu Lys Phe Leu Gly Val Ser Lys Asn Thr Asp Val
        195                 200                 205
Thr Gly Gly Met Leu Gly Lys Val Asn Glu Leu Leu Gln Leu Cys Glu
    210                 215                 220
Ile Thr Gly Ala Thr Ser Tyr Ile Phe Asn Ala Lys Lys Pro Asn Asn
225                 230                 235                 240
Ile Ser Tyr Phe Leu Ser Gly His Asn Ile Gly Thr Ala Ile Lys Lys
                245                 250                 255

<210> 202
<211> 771
<212> DNA
<213> Methanosalsum zhilinae

<400> 202
atgagtcaac ataaaaacat tacaattctg aaaataggtg gaagtgtaat tactgataag     60
agttctgata ttgggaaggt tcaaatcgaa gaaatcgaac gtatatgtca ggaaatatct    120
ggatatagca atgatcttat aattgtccat ggtgccggct cctatggtca cccccttgca    180
aaaaaatatg atctggacaa tattccagac ccaaaaggtg caattataac acatagttct    240
gtaaaatcgt taaatgagat tatggtaagc tcacttcaaa aagctggaat cgatgcagtt    300
tctgtccatc cgttaaataa cacagtctca aatgatggac gaatttctga catgttttta    360
tccaatatcc atatcatgct ggagaacggt ttagtcccgg tgatacatgg tgatgtggta    420
atggacataa caaatacttt ttctgttatt tccggggatc aaattgtttc gtatcttgca    480
aataaactga aagcttcccg ggttggaatt ggaagtattg aagatggggt tatggacaat    540
aaaggaaaaa ctcttaccaa aatcaccagt tccaatttta aagaaataga aaaatttcta    600
ggcgtatcca aaaatactga tgtcaccggt ggtatgcttg gaaaagttaa tgaactactt    660
cagctttgtg aaataaccgg tgccacttcc tacattttca atgcgaaaaa accaaataac    720
atttcgtatt ttttaagtgg acataacata ggaactgcaa taaaaaaata g             771

<210> 203
<211> 781
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 203
atgagccagc ataaaaatat caccattctg aaaatcggcg gcagcgttat taccgataaa     60
agcagcgata ttggtaaagt tcagattgaa gaaatcgaac gcatctgtca ggaaattagt    120
ggctatagta atgatctgat tatcgttcat ggcgccggta gttatggcca tccgctggcc    180
aaaaaatatg atctggataa tattccggac ccgaaaggtg caattattac ccatagcagt    240
gtgaaaagcc tgaatgaaat tatggttagt agcctgcaaa aagcaggcat tgatgcagtt    300
agtgtgcatc cgctgaataa taccgtgagt aatgatggtc gcattagtga tatgttcctg    360
agtaatattc acattatgct ggaaaatggc ctggtgccgg ttattcatgg cgatgttgtg    420
atggatatta ccaatacctt cagcgtgatt agcggtgatc agattgtgag ctatctggca    480
aataaactga aagcaagtcg tgttggcatt ggcagtattg aagatggcgt gatggataat    540
aaaggcaaaa ccttaaccaa aatcaccagt agcaacttca aagaaattga aaaattcctg    600
ggtgttagca aaaataccga tgtgaccggc ggtatgctgg gcaaagttaa tgaactgctg    660
caactgtgtg aaattaccgg tgccaccagc tatatcttca atgccaaaaa accgaataac    720
attagttact tcctgagtgg tcataatatt ggtaccgcca ttaaaaaatg actgaaagct    780
t                                                                    781

<210> 204
<211> 260
<212> PRT
<213> Euryarchaeota archaeon

<400> 204
Met Leu Leu Val Lys Phe Gly Gly Ser Val Ile Thr Val Lys Ser Lys
1               5                   10                  15
Tyr Arg Thr Leu Arg Gly Ala Asp Leu Ser Arg Leu Ala Arg Glu Leu
            20                  25                  30
Ala Ala Ala His Asp Pro Glu Ala Gly Thr Val Leu Val His Gly Ala
        35                  40                  45
Gly Ser Tyr Gly His Ile Leu Ala Ala Lys His Arg Leu Lys Glu Gly
    50                  55                  60
Phe Arg Asp Asp Ala Gln Leu Thr Ala Val Ala Gln Val Gln Arg Asp
65                  70                  75                  80
Val Arg Ala Leu Asp Leu Lys Val Leu Asp Ala Leu Leu Arg Ala Arg
                85                  90                  95
Leu Arg Pro Ile Ala Ile Pro Pro Gly Thr Asp Ala Val Val Asp Lys
            100                 105                 110
Asp Gly Arg Phe His Leu Asp Thr Ala Pro Phe Glu Asp Tyr Arg Met
        115                 120                 125
Arg Gly Phe Leu Pro Val Ser Phe Gly Asp Val Val Arg Asp Glu Gly
    130                 135                 140
Arg Leu Phe Thr Ile Ala Ser Gly Asp Asp Val Val Leu Glu Leu Ala
145                 150                 155                 160
Arg Phe Tyr Arg Pro Glu Arg Val Leu Phe Val Ala Asp Val Asp Gly
                165                 170                 175
Val Phe Thr Ala Asp Pro Lys Arg Asp Arg Ala Ala Thr Leu Leu Asp
            180                 185                 190
Val Val Asp Gly Pro Ala Leu Glu Arg Ile Ala Phe Ser Asp Ala Ala
        195                 200                 205
Gly Arg Asp Val Thr Gly Gly Leu Arg Ala Lys Leu Glu Arg Met Arg
    210                 215                 220
Glu Ile Ala Gly Val Ala Lys Asp Val Arg Ile Ile Asn Gly Leu Ala
225                 230                 235                 240
Lys Gly Arg Leu Glu Arg Ala Ala Lys Gly Gly Asp Val Pro Gly Thr
                245                 250                 255
Arg Val Val Ala
            260

<210> 205
<211> 783
<212> DNA
<213> Euryarchaeota archaeon

<400> 205
atgcttctcg tgaagttcgg cggctccgtg atcacggtga agtcgaagta ccgcaccctg     60
cggggggcgg acctctcccg cctcgcgcgg gagctcgcgg cggcccacga cccggaggcg    120
ggaacggtcc tcgtccacgg cgcgggctcg tacggccaca tcctcgcggc gaagcaccgc    180
ctgaaggagg gcttccgcga cgacgcccag ctcaccgcgg tcgcgcaggt ccagcgggac    240
gtgcgggccc tcgacctcaa ggtcctcgac gcgctcttgc gggcccgcct ccgcccgatc    300
gcgatccccc cgggcacgga cgcggtcgtc gacaaggacg gccgcttcca cctcgacacc    360
gcgcccttcg aggactaccg gatgcggggg ttcctccccg tctccttcgg ggacgtcgtg    420
cgggacgagg ggcggctctt cacgatcgcc tccggggacg acgtcgtcct cgagctcgcg    480
aggttctacc gcccggagcg ggtcctcttc gtcgcggacg tggacggcgt gttcacggcg    540
gacccgaagc gggaccgggc cgcgaccctc ctcgacgtcg tcgacggtcc cgccctcgag    600
cggatcgcgt tctcggacgc ggcggggagg gacgtgacgg gggggctgcg cgccaagctc    660
gagaggatgc gggagatcgc gggggtcgcg aaggacgtgc ggatcatcaa cggcctcgcg    720
aagggccgcc tcgagagggc ggcgaagggc ggggacgtgc cgggcacgcg ggtggtggcg    780
tag                                                                  783

<210> 206
<211> 793
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 206
atgctgctgg tgaaattcgg tggtagtgtt attaccgtga aaagcaaata tcgtaccctg     60
cgtggcgccg atctgagccg cctggcacgt gaactggcag cagcccatga tccggaagca    120
ggtaccgttc tggttcatgg tgcaggtagc tatggtcata ttctggcagc aaaacatcgt    180
ctgaaagaag gcttccgtga tgatgcacag ctgaccgcag tggcccaggt tcagcgtgat    240
gtgcgcgccc tggatctgaa agtgctggat gccctgctgc gtgcccgtct gcgtccgatt    300
gccattccgc cgggtaccga tgcagtggtt gataaagatg gtcgcttcca tctggatacc    360
gccccgttcg aagattatcg tatgcgtggc ttcctgccgg ttagcttcgg cgatgttgtg    420
cgtgatgaag gtcgcctgtt caccattgcc agcggcgatg atgttgtgct ggaactggca    480
cgcttctatc gtccggaacg cgtgctgttc gttgcagatg ttgatggcgt gttcaccgca    540
gatccgaaac gtgatcgtgc cgcaaccctg ctggatgtgg ttgatggtcc ggccctggaa    600
cgtattgcct tcagtgatgc agcaggccgc gatgttaccg gcggtctgcg tgccaaactg    660
gaacgcatgc gtgaaattgc cggtgtggcc aaagatgtgc gtattattaa tggcctggca    720
aaaggccgtc tggaacgtgc agccaaaggc ggtgatgtgc cgggcacccg cgtggttgca    780
tgactgaaag ctt                                                       793

<210> 207
<211> 264
<212> PRT
<213> Archaeoglobus sulfaticallidus

<400> 207
Met Arg Asp Asp Glu Ile Ile Ile Leu Lys Ile Gly Gly Ser Ile Ile
1               5                   10                  15
Thr Asp Lys Ser Lys Gly Ser Phe Glu Lys Ala Lys Phe Asp Val Ile
            20                  25                  30
Glu Arg Ile Ser Arg Glu Ile Ser Gln Phe Leu Met Glu Asn Arg Lys
        35                  40                  45
Lys Ile Ile Leu Val His Gly Ala Gly Ser Phe Gly His Pro His Val
    50                  55                  60
Glu Lys Tyr Asn Leu Lys Glu Lys Lys Glu Leu Arg Gly Val Leu Thr
65                  70                  75                  80
Thr His Phe Ala Cys Lys Arg Leu Asn Ser Ile Val Cys Asp Lys Leu
                85                  90                  95
Leu Glu Asn Gly Val His Ala Leu Gly Ile His Pro Leu Thr Ser Phe
            100                 105                 110
Phe Leu Asp Glu Lys Leu Asn Ile Asn Ile Asp Leu Phe Leu Asp Met
        115                 120                 125
Leu Thr Glu Asp Ile Ile Pro Val Thr His Gly Asp Met Ile Tyr Asn
    130                 135                 140
Arg Lys Arg Lys Phe Phe Glu Val Leu Ser Gly Asp Ser Ile Ile Ser
145                 150                 155                 160
Ala Leu Met Gly Glu Leu Ser Asp Arg Lys Leu Arg Val Gly Leu Ala
                165                 170                 175
Thr Asp Val Asp Gly Val Ile Tyr Asp Gly Arg Val Val Lys Glu Ile
            180                 185                 190
Asn Ala Asp Asn Phe Glu Glu Val Leu Ser Ala Ile Asp Lys Ser Ala
        195                 200                 205
Met Asp Ala Glu Arg Lys Ser Asp Val Thr Gly Gly Met Lys Gly Lys
    210                 215                 220
Ile Gly Ala Leu Phe Arg Ser Ile His Gly Ser Glu Val Arg Ile Phe
225                 230                 235                 240
Asn Gly Ala Ile Glu Gly Asn Ile Ile Lys Phe Leu Lys Gly Glu Ala
                245                 250                 255
Leu Gly Thr Leu Ile Arg Gly Lys
            260

<210> 208
<211> 795
<212> DNA
<213> Archaeoglobus sulfaticallidus

<400> 208
ttgagggatg atgagataat tatactcaag atcggtggat cgatcataac ggataaatca     60
aaggggagct tcgagaaagc aaaattcgat gtgatagaga ggatttccag ggaaatcagc    120
cagtttttga tggagaacag gaaaaagatt atactggtcc atggagcagg ttctttcggg    180
catccgcatg tcgagaaata caacctgaaa gagaagaaag agttgagagg agttttaaca    240
actcattttg cctgcaagag actgaactcg attgtatgcg ataaactttt agagaatgga    300
gttcatgctc tgggaatcca tcccttaacg agctttttct tggatgaaaa gctgaacatc    360
aacatcgatc tctttctgga tatgctgaca gaggacataa ttcctgttac tcatggcgac    420
atgatctaca acagaaagcg aaagttcttc gaggttcttt caggagattc aataatctcc    480
gccttgatgg gagagctttc tgacagaaag ctgagagttg gattagcaac tgatgtcgat    540
ggagtgattt atgatggcag ggttgttaag gaaatcaacg cagataactt tgaagaggtt    600
ttatccgcaa tagacaaatc tgcaatggat gcagagagaa aatcggatgt taccggtgga    660
atgaagggaa agatcggagc cctgttcaga tccattcatg gttctgaggt gaggatattc    720
aatggagcga ttgagggaaa cattattaag tttctgaaag gtgaggcatt aggcacactt    780
atcaggggga aatga                                                     795

<210> 209
<211> 805
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 209
atgcgtgatg atgaaattat catcctgaaa attggtggta gtattattac cgataagagc     60
aaaggcagct tcgaaaaagc caaattcgat gtgattgaac gtattagccg cgaaattagt    120
cagttcctga tggaaaatcg caaaaaaatt atcctggttc atggtgcagg tagcttcggc    180
catccgcatg ttgaaaaata taatctgaaa gaaaagaagg agctgcgtgg cgttctgacc    240
acccacttcg cctgtaaacg cctgaatagc attgtgtgtg ataaactgct ggaaaatggt    300
gttcatgccc tgggcattca tccgctgacc agcttcttcc tggatgaaaa actgaatatt    360
aacatcgatc tgttcctgga tatgctgacc gaagatatta ttccggtgac ccatggtgat    420
atgatctata atcgcaaacg taaattcttc gaagtgctga gcggtgatag cattattagc    480
gcactgatgg gtgaactgag cgatcgtaaa ctgcgcgtgg gcctggcaac cgatgtggat    540
ggtgtgatct atgatggtcg cgtggtgaaa gaaattaatg cagataactt cgaagaggtt    600
ctgagtgcca ttgataaaag cgccatggat gcagaacgta aaagtgatgt gaccggtggc    660
atgaaaggta aaattggcgc cctgttccgt agtattcatg gcagtgaagt tcgcatcttc    720
aatggtgcca ttgaaggcaa tattattaaa ttcctgaagg gtgaagccct gggtaccctg    780
attcgcggca aatgactgaa agctt                                          805

<210> 210
<211> 242
<212> PRT
<213> Thermoplasmatales archaeon

<400> 210
Met Asp Ile Ile Lys Ile Gly Gly Ser Leu Leu Thr Asp Lys Thr Val
1               5                   10                  15
Tyr Arg Lys Phe Tyr Gln Lys Lys Thr Ser Thr Ile Ile Gln Arg Leu
            20                  25                  30
Ser Arg Leu Glu Ser Phe Ile Leu Val His Gly Gly Gly Ser Phe Gly
        35                  40                  45
His Tyr Ile Ser Glu Lys Tyr Gly Leu Pro Gly Glu Val Ser Glu Glu
    50                  55                  60
Arg Ile Lys Ala Ala Ala Ile Val Lys Tyr Asp Met Ala Asp Leu Asn
65                  70                  75                  80
Gln Arg Ile Val Lys Met Leu Asn Asn Met Gly Arg Pro Ala Ile Gly
                85                  90                  95
Ile Ser Pro Phe Phe Leu Asp Arg Asn Asn Ser Phe Asn Tyr Asp Leu
            100                 105                 110
Val Lys Lys Val Leu Glu Met Asn Phe Ile Pro Val Leu Tyr Gly Asp
        115                 120                 125
Val Tyr Leu Arg Asn His Glu Ile Gly Ile Leu Ser Gly Asp His Ile
    130                 135                 140
Met Val Ser Leu Ala Glu Met Phe Lys Pro Glu Arg Ala Ile Phe Leu
145                 150                 155                 160
Ser Asp Val Asp Gly Val Phe Asp Met Asp Pro Lys Lys Tyr Arg Asn
                165                 170                 175
Ala Ala Met Ile Arg Lys Tyr Ser Lys Glu Ile Val Asn Phe Gly Ala
            180                 185                 190
Ile Ser Asn Asp Val Thr Gly Gly Met Glu Leu Lys Phe Arg Ser Met
        195                 200                 205
Ile Asn Cys Lys Lys Ala Gly Val Lys Thr Tyr Leu Leu Asn Gly Asn
    210                 215                 220
Phe Pro Glu Arg Ile Glu Asn Ile Asp Lys Glu Asp Phe Val Gly Thr
225                 230                 235                 240
Glu Phe

<210> 211
<211> 729
<212> DNA
<213> Thermoplasmatales archaeon

<400> 211
atggatataa tcaagatcgg tggtagcctg cttacagaca agacagtata caggaagttt     60
taccagaaaa aaacatcaac catcatccag agactctcaa gattggaaag tttcatactt    120
gttcatgggg gaggttcctt cggtcattac atcagtgaaa aatacggttt gccgggggag    180
gtatcagagg agagaataaa ggctgcagcc atagtcaaat atgatatggc tgatctgaac    240
cagaggatag tgaaaatgct taacaacatg ggaagacctg caattggcat ctcaccattt    300
ttccttgata gaaataactc attcaattac gatctggtga agaaggtact ggaaatgaat    360
ttcattcctg tactttatgg ggatgtgtat ttaaggaatc atgaaattgg aatcctgtca    420
ggagatcata ttatggtttc acttgctgaa atgttcaaac ctgagagggc aatcttcctc    480
agtgatgttg atggtgtttt tgacatggat ccaaagaaat acagaaatgc agcaatgatc    540
aggaagtatt caaaggaaat agtgaatttt ggtgccatat ccaatgatgt tactggtggc    600
atggaactca agttcagatc aatgataaac tgcaaaaaag caggtgttaa aacatattta    660
ctcaacggga attttccaga aagaattgaa aatatagata aagaagattt tgtggggact    720
gaattttaa                                                            729

<210> 212
<211> 739
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 212
atggatatca tcaagatcgg tggtagtctg ctgaccgata aaaccgtgta tcgcaaattc     60
tatcagaaaa aaaccagcac cattattcag cgcctgagtc gcctggaatc attcattctg    120
gttcatggcg gcggtagctt cggccattat attagtgaaa aatatggtct gccgggtgaa    180
gtgagcgaag aacgtattaa agcagcagcc attgttaaat atgatatggc agacttaaac    240
cagcgtattg ttaaaatgct gaataatatg ggccgcccgg ccattggcat tagcccgttc    300
ttcctggatc gcaataatag cttcaattat gatctggtta agaaggttct ggaaatgaac    360
ttcattccgg ttctgtatgg cgatgtgtat ctgcgtaatc atgaaattgg cattctgagc    420
ggtgatcata ttatggtgag cctggcagaa atgttcaaac cggaacgcgc aatcttcctg    480
agcgatgttg atggcgtgtt cgatatggac cctaaaaaat atcgtaatgc agcaatgatt    540
cgtaaatata gcaaagaaat cgtgaacttc ggtgcaatta gcaatgatgt gaccggcggc    600
atggaactga aattccgcag tatgattaat tgcaaaaagg ccggtgttaa aacctatctg    660
ctgaatggta acttcccgga acgtattgaa aatattgata aagaagactt cgtgggtacc    720
gagttctgac tgaaagctt                                                 739

<210> 213
<211> 264
<212> PRT
<213> Thermoplasmatales archaeon

<400> 213
Met Arg Lys Arg Val Leu Val Glu Gly Leu Ile Phe Tyr Ile His Val
1               5                   10                  15
Thr Ile Arg Pro Met Met Gln Leu Val Lys Leu Gly Gly Ser Val Ile
            20                  25                  30
Thr Val Lys Ser Arg Tyr Arg Tyr Phe Leu Gln Gln Thr Thr Arg Lys
        35                  40                  45
Ile Val His Glu Leu Lys Lys Ile Asp Asp Glu Ile Ile Leu Val His
    50                  55                  60
Gly Gly Gly Ser Phe Gly His Ile Lys Ala Ser Glu Tyr Gln Leu Ser
65                  70                  75                  80
Gly Ser Pro Ala Ser Ser Ser Arg Ser Gly Ile Ser Ile Val His Arg
                85                  90                  95
Asp Met Met Glu Leu Asp Gln Arg Ile Ile Gly Val Met Leu Ser Glu
            100                 105                 110
Ser Met Pro Gly Ile Gly Met Ala Pro Ser Ser Phe Pro Asp Pro Phe
        115                 120                 125
Ile Pro Pro Phe Glu Leu Ile Glu Ser Tyr Met Lys Ala Gly Leu Phe
    130                 135                 140
Pro Val Thr Phe Gly Asp Val Tyr Ile Arg Asn Gly Asn Ser Gly Ile
145                 150                 155                 160
Val Ser Gly Asp Asp Leu Met Leu Ala Leu Ala Leu His Phe Lys Pro
                165                 170                 175
Thr Arg Val Met Phe Leu Ser Asp Val Asp Gly Ile Phe Asp Arg Asn
            180                 185                 190
Pro Lys Thr His Pro Asp Ala Glu Leu Arg Lys Glu Val Lys Gly Asn
        195                 200                 205
Glu Ala Phe Glu Leu Asn Arg Glu Asp Val Thr Gly Gly Met Gly Lys
    210                 215                 220
Lys Leu Gly Ile Met Lys Lys Ile Ala Glu Thr Gly Thr Thr Val Tyr
225                 230                 235                 240
Leu Leu Asn Gly Arg His Pro Glu Arg Ile Trp Asn Met Gly Thr Arg
                245                 250                 255
Asp Phe Ile Gly Thr Val Ile His
            260

<210> 214
<211> 795
<212> DNA
<213> Thermoplasmatales archaeon

<400> 214
atgaggaaga gagtcttggt tgagggtctt atattttata tccatgtaac aattcgccca     60
atgatgcagc tggtaaaact cggagggagc gtcataaccg tcaaatcaag atatcgttat    120
tttctccagc agacaacaag aaaaattgtg cacgaactta aaaaaattga tgacgaaata    180
attcttgttc acgggggcgg atcattcggt cacataaagg ccagcgaata ccagttgagc    240
ggaagcccgg catcctcatc acgtagtgga atttcaatag tgcacaggga tatgatggaa    300
cttgatcaga ggattattgg tgtgatgcta tccgaaagca tgccaggcat tggaatggcc    360
ccctcatctt ttccggaccc tttcattccc cccttcgagc tgattgaatc ctacatgaag    420
gcagggctgt ttccggtgac ctttggtgat gtatatatcc ggaatgggaa ttcaggtatt    480
gtatccggtg acgacctgat gctcgccctt gcgcttcatt tcaagccaac aagagtgatg    540
ttcctgagcg atgttgacgg aatattcgac agaaatccaa agactcaccc agatgccgaa    600
ctcaggaaag aggtgaaggg aaatgaggcg tttgagctga acagagagga cgtgacaggc    660
ggaatgggta aaaaattggg aataatgaag aagatagcag aaactggaac aactgtatac    720
ctcctcaacg ggaggcatcc ggagaggatc tggaatatgg ggacgcggga tttcatcggt    780
acggtgatac attga                                                     795

<210> 215
<211> 805
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 215
atgcgcaaac gcgtgctggt ggaaggtctg atcttctata ttcatgttac cattcgtccg     60
atgatgcagc tggttaaact gggcggtagt gttattaccg tgaaaagtcg ttatcgttac    120
ttcctgcaac agaccacccg caaaattgtg catgaactga aaaaaattga cgatgaaatc    180
atcctggttc atggcggtgg tagcttcggc catattaaag ccagcgaata tcagctgagc    240
ggcagcccgg ccagcagctc acgtagtggc attagcattg tgcatcgcga tatgatggaa    300
ctggatcagc gtattattgg tgtgatgctg agcgaaagta tgccgggtat tggcatggcc    360
ccgagcagct tcccggaccc gttcattccg ccgttcgaac tgattgaaag ctatatgaaa    420
gcaggtctgt tcccggttac cttcggcgat gtgtatattc gtaatggtaa tagcggtatt    480
gtgagcggcg atgatctgat gctggccctg gcactgcact tcaaaccgac ccgcgtgatg    540
ttcctgagtg atgttgatgg tatcttcgat cgtaatccga aaacacatcc ggatgcagaa    600
ctgcgcaaag aagttaaagg taatgaagcc ttcgaactga atcgcgaaga tgtgaccggt    660
ggcatgggta aaaaactggg cattatgaaa aaaatcgccg aaaccggcac caccgtgtat    720
ctgctgaatg gccgtcatcc ggaacgtatc tggaatatgg gcacccgcga cttcattggc    780
accgttattc attgactgaa agctt                                          805

<210> 216
<211> 259
<212> PRT
<213> Archaeoglobales archaeon

<400> 216
Met Asp Thr Leu Ile Ile Lys Leu Gly Gly Ser Ala Ile Thr Asn Lys
1               5                   10                  15
Glu Arg Arg Tyr Glu Val Arg Arg Lys Val Val Glu Arg Ile Ala Lys
            20                  25                  30
Glu Val Lys Ile Leu Ile Lys Asn Tyr Lys Leu Val Leu Val His Gly
        35                  40                  45
Gly Gly Ser Phe Gly His Pro Thr Ala Lys Glu Tyr Asn Ile His Leu
    50                  55                  60
Gly Tyr Phe Ser Asn His Gln Leu Ile Gly Tyr Ser Lys Val Arg Tyr
65                  70                  75                  80
Phe Met Thr Gln Leu Asn Gln Ile Ile Leu Glu Tyr Phe Ile Lys Ser
                85                  90                  95
Gly Val Pro Ala Val Thr Leu His Thr Ser Asn Ile Leu Lys Ala Asn
            100                 105                 110
Asp Gly Lys Ile Ser Ser Phe Asn Ile Gly Leu Leu Ile Glu Tyr Thr
        115                 120                 125
Lys Met Gly Phe Thr Pro Val Val Tyr Gly Asp Ala Val Leu Asp Glu
    130                 135                 140
Lys Arg Gly Phe Ser Ile Ile Ser Gly Asp Gln Ile Val Ser Tyr Leu
145                 150                 155                 160
Ala Ile Arg Leu Lys Pro Ile Lys Val Ile Leu Gly Thr Asp Val Asp
                165                 170                 175
Gly Ile Tyr Thr Gly Asn Pro Lys Lys Asp Ser Asn Ala Lys Leu Val
            180                 185                 190
Lys Thr Leu Lys Ile Ser Lys Leu Ile Thr Ile Lys Ala Asp Lys Pro
        195                 200                 205
Met Ile Asp Val Thr Gly Gly Ile Val Ala Lys Ile Asp Glu Met Arg
    210                 215                 220
Lys Val Val Lys Ala Gly Ile Pro Val Ile Ile Gly Asn Ile Val Ser
225                 230                 235                 240
Gly Asn Leu Ile Asp Leu Val Glu Glu Lys Thr Pro Lys Tyr Thr Lys
                245                 250                 255
Ile Ile Met

<210> 217
<211> 780
<212> DNA
<213> Archaeoglobales archaeon

<400> 217
atggatacgt taataattaa gcttggcgga tcggctataa caaataaaga aagaagatac     60
gaagttagac gtaaggtcgt tgaaagaata gcaaaggaag ttaaaatcct tatcaaaaac    120
tacaaattag tattggtgca tggaggaggg agtttcggtc acccaacggc taaagaatac    180
aacatacacc taggatattt ttctaatcat cagcttattg gctatagcaa ggtaagatat    240
ttcatgacgc agcttaatca gataattcta gaatatttta taaaatctgg agttcctgca    300
gttacattgc atacttcgaa tatacttaag gcaaatgatg gaaaaatatc aagttttaac    360
atagggcttt taatagagta tacgaagatg ggatttacgc cggtagttta tggagacgct    420
gttttagacg aaaaaagagg attcagcata atttctggag atcagatagt ttcttatcta    480
gctataagat tgaagccaat taaggtcatt ttaggaacag atgtggatgg catatacacg    540
ggtaatccca aaaaagattc aaatgcaaaa ctcgtcaaaa cattaaaaat atccaagctt    600
ataactataa aagctgataa gccaatgatc gatgtaactg gtggaatagt agcaaaaata    660
gatgaaatga gaaaagttgt aaaagctgga ataccggtta taatcgggaa catcgtttca    720
ggaaacttga ttgatcttgt tgaagagaaa acacctaaat atacgaaaat cataatgtaa    780

<210> 218
<211> 790
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 218
atggataccc tgattattaa gctgggcggt agcgcaatta ccaataaaga acgtcgttat     60
gaagttcgtc gcaaagttgt tgaacgcatt gcaaaagaag tgaaaattct gattaagaac    120
tacaagctgg tgctggttca tggcggtggt agcttcggcc atccgaccgc aaaagaatat    180
aatattcatc tgggctactt cagcaatcat cagctgattg gctatagtaa agttcgctac    240
ttcatgaccc agctgaatca gattattctg gaatacttca ttaagagcgg tgtgccggca    300
gtgaccctgc ataccagcaa tattctgaaa gccaatgatg gtaaaatcag tagcttcaat    360
atcggcctgc tgattgaata taccaaaatg ggcttcaccc cggttgtgta tggcgatgcc    420
gttctggatg aaaaacgcgg cttcagcatt attagtggcg atcagattgt gagctatctg    480
gccattcgtc tgaaaccgat taaagttatt ctgggcaccg atgttgatgg tatctatacc    540
ggtaatccga aaaaagatag taatgcaaaa ctggttaaga ccctgaaaat tagcaaactg    600
attaccatta aggccgataa accgatgatt gatgtgaccg gtggcattgt ggccaaaatt    660
gatgaaatgc gtaaagtggt taaggccggt attccggtga ttattggtaa tattgtgagt    720
ggtaatctga ttgatctggt ggaagaaaaa acacctaaat ataccaaaat catcatgtga    780
ctgaaagctt                                                           790

<210> 219
<211> 238
<212> PRT
<213> Pyrobaculum species

<400> 219
Met Phe Ile Val Lys Phe Gly Gly Ser Ala Ile Thr Asp Lys Thr Lys
1               5                   10                  15
Pro Tyr Thr Phe Leu Arg Gly Arg Ile Ala Gln Ala Ala Pro Ala Leu
            20                  25                  30
Arg Gly Arg Arg Ala Val Leu Ile His Gly Ala Gly Ser Phe Ala His
        35                  40                  45
Pro His Val Lys Ala Phe Gly Leu Thr Pro Thr Gly Ile Ala Leu Thr
    50                  55                  60
Lys Ala Thr Leu Arg Arg Leu Thr Ala Leu Val Ala Glu Glu Leu Leu
65                  70                  75                  80
Glu Ala Gly Leu Pro Ala Met Pro Val Glu Pro Ser Asp Val Phe Trp
                85                  90                  95
Gly Arg Ser Leu Val Arg Arg Glu Val Ile Thr His Ala Leu Glu Arg
            100                 105                 110
Gly Leu Tyr Pro Leu Leu His Gly Asp Ile Val Pro Ser Asp Glu Gly
        115                 120                 125
Tyr Val Val Val Ser Gly Asp Asp Ile Ala Val Glu Leu Ala Arg Leu
    130                 135                 140
Tyr Lys Pro Ser Ala Val Ile Phe Leu Met Asn Val Asp Gly Ile Tyr
145                 150                 155                 160
Thr Ala Ser Pro Gly Ser Pro Asn Ala Glu Lys Ile Arg Arg Leu Lys
                165                 170                 175
Ser Asn Val Tyr Leu Glu Gly Thr Ala Gly Val Asp Val Thr Gly Gly
            180                 185                 190
Ile Arg Lys Lys Val Glu Ala Gly Leu Ala Ile Ala Ala Leu Gly Thr
        195                 200                 205
Pro Val Phe Tyr Cys Ser Ile Ser Asp Arg Glu Ser Leu Glu Ile Ile
    210                 215                 220
Ala Arg Gly Gly Ala Pro Glu Asn Cys Thr Ser Val Glu Pro
225                 230                 235

<210> 220
<211> 717
<212> DNA
<213> Pyrobaculum species

<400> 220
atgtttatcg tgaaattcgg cggctccgcc atcaccgaca agacgaagcc ctacaccttc     60
ctccgggggc ggatcgcgca agccgcgccg gcgctccgcg ggaggagggc cgtcttaatc    120
cacggcgcgg gctccttcgc acacccccac gtcaaggcct tcggactgac gccaaccggc    180
atagccctga ccaaggccac gttgaggcgc ctcacagccc tagtggccga ggagctacta    240
gaggctggcc tgccggccat gccggtggaa cccagcgacg tgttctgggg cagatcgctg    300
gtccggcgtg aggtcatcac acatgcgctg gagagggggc tgtaccccct cctccacggc    360
gacatagtcc cctccgacga gggctacgtg gtggtcagcg gcgacgacat cgcggttgag    420
ctcgccaggc tctacaagcc cagcgccgtg atcttcctca tgaacgtaga cggcatctac    480
acggcgtctc ccggaagccc caacgccgaa aaaataagaa ggctaaaaag taacgtatat    540
ttagagggga ccgccggtgt agacgtcacg gggggtataa ggaagaaggt ggaggctgga    600
ttggccatag cggctctggg aacccccgtg ttctactgtt caatatctga cagagaatcc    660
ctcgagataa tcgcaagagg cggagcccca gagaactgca cctctgtgga gccctag       717

<210> 221
<211> 727
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 221
atgttcatcg ttaagttcgg tggcagtgca attaccgata aaaccaaacc gtataccttc     60
ctgcgcggcc gtattgcaca ggccgcccct gcactgcgtg gtcgtcgtgc agttctgatt    120
catggcgcag gtagcttcgc ccatccgcat gtgaaagcat tcggcctgac cccgaccggt    180
attgccctga ccaaagcaac cctgcgtcgc ctgaccgccc tggttgccga agaactgctg    240
gaagccggcc tgccggccat gcctgtggaa cctagtgatg tgttctgggg tcgcagcctg    300
gtgcgccgcg aagtgattac ccatgccctg gaacgcggcc tgtatccgct gctgcatggt    360
gatattgttc cgagtgatga aggctatgtt gttgttagcg gcgatgatat tgcagtggaa    420
ctggcacgtc tgtataaacc gagtgcagtg atcttcctga tgaatgtgga tggtatctat    480
accgcaagtc cgggtagccc gaatgccgaa aaaattcgtc gtctgaaaag taatgtgtat    540
ctggaaggca ccgccggcgt ggatgtgacc ggcggtattc gtaaaaaagt ggaagccggc    600
ttagcaattg cagcactggg taccccggtg ttctattgta gtattagcga tcgtgaaagc    660
ctggaaatta ttgcacgtgg cggtgccccg gaaaattgta ccagtgtgga accgtgactg    720
aaagctt                                                              727

<210> 222
<211> 249
<212> PRT
<213> Methanosaeta species

<400> 222
Met Lys Val Leu Lys Ile Gly Gly Ser Ile Leu Thr Asp Lys Arg Arg
1               5                   10                  15
Ile Gly Ala Ala Arg Leu Gly Glu Ile Gln Arg Val Ala Asp Glu Ile
            20                  25                  30
Ala Val Asn Pro Glu Asp Leu Val Leu Val His Gly Ala Gly Ser Phe
        35                  40                  45
Gly His Ile Pro Ala Arg Arg Tyr Gly Leu Pro Glu Asn Phe Asn Pro
    50                  55                  60
Glu Gly Leu Arg Ala Thr His Ser Ser Val Val Lys Leu Asn Asp Leu
65                  70                  75                  80
Val Ile Asp Ala Leu Cys Lys Ala Gly Val Phe Ala Met Pro Val His
                85                  90                  95
Pro Phe Ser Cys Val Leu Leu Arg Asp Gly Arg Ile Asp Ser Phe Ala
            100                 105                 110
Leu Lys Pro Ile Glu Glu Met Val Arg Asp Gly Leu Leu Pro Val Leu
        115                 120                 125
His Gly Asp Val Ala Met Asp Ala Thr Arg Lys Ala Gly Ile Val Ser
    130                 135                 140
Gly Asp Gln Ile Val Thr Tyr Val Ala Arg Ala Leu Gln Ala Glu Val
145                 150                 155                 160
Val Ala Val Gly Cys Asn Val Asp Gly Val Leu Phe Ser Gly Glu Pro
                165                 170                 175
Leu Arg Glu Leu Thr Arg Lys Asp Leu Pro Ser Ile Glu Gly Ala Leu
            180                 185                 190
Gly Gly Ser Ala Gly Val Asp Val Thr Gly Gly Met Arg Gly Lys Leu
        195                 200                 205
Leu Glu Met Leu Asp Leu Ala Asp Ala Gly Ile Met Ser Val Ile Phe
    210                 215                 220
Asn Ala Gly Thr Glu Gly Asn Val Leu Arg Ala Leu Arg Gly Glu Pro
225                 230                 235                 240
Ile Gly Thr Met Val Arg Arg Pro Asn
                245

<210> 223
<211> 750
<212> DNA
<213> Methanosaeta species

<400> 223
atgaaggttc tgaagatcgg cggaagtatt ctgacggaca agagaaggat cggcgccgct     60
cgtctcgggg agattcagcg cgttgccgac gagatcgccg tcaatcctga ggatctggtg    120
ctggtccacg gcgcaggttc ctttgggcac atccctgcaa gaagatatgg acttccagag    180
aacttcaacc cagagggact cagggcgact catagctcag tggtcaagct caacgacctt    240
gtcatagacg ctctatgcaa agccggcgtc ttcgcgatgc cagtccatcc attctcctgc    300
gttctcctaa gagacggcag gatagacagc tttgccttga agcccatcga ggagatggtc    360
cgcgacggac tgcttcctgt tttgcatggc gatgtggcca tggatgcgac gcgaaaggct    420
ggaatcgtct ccggagatca gattgtgacg tacgttgcca gggccctgca ggcggaggtc    480
gtggctgtgg gatgtaacgt tgacggcgtg ctattctccg gagagccgct gagggagctg    540
acccgcaagg atctgccttc gatcgagggg gctcttggag ggagcgctgg ggtcgatgtg    600
accggcggca tgcgtggaaa actgctggag atgctggatc ttgccgatgc cggaataatg    660
tcagtgatct tcaatgcagg cactgaaggc aacgttctgc gggcactgcg tggcgagccc    720
attggcacaa tggtgcggag gccgaattga                                     750

<210> 224
<211> 760
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 224
atgaaggttc tgaaaatcgg cggcagtatt ctgaccgata aacgtcgtat tggtgccgcc     60
cgcctgggtg aaattcagcg tgtggccgat gaaattgcag ttaatccgga agacttagtg    120
ctggtgcatg gtgcaggtag cttcggtcat attccggcac gccgttatgg tctgccggaa    180
aacttcaatc cggaaggcct gcgcgccacc catagcagtg ttgtgaaact gaatgatctg    240
gtgattgatg ccctgtgtaa agcaggtgtg ttcgccatgc cggttcatcc gttcagctgt    300
gttctgctgc gtgatggccg tattgatagc ttcgcactga aaccgattga agaaatggtg    360
cgtgatggcc tgctgccggt tctgcatggt gatgttgcaa tggatgcaac ccgtaaagca    420
ggtattgtta gcggcgatca gattgttacc tatgtggcac gcgcactgca agcagaagtg    480
gtggcagtgg gctgtaatgt tgatggcgtt ctgttcagtg gtgaaccgct gcgcgaactg    540
acccgcaaag acttaccgag tattgaaggc gcactgggtg gcagcgcagg tgtggatgtt    600
accggtggta tgcgtggcaa actgctggaa atgctggatc tggcagatgc aggtattatg    660
agtgttatct tcaatgcagg taccgaaggc aatgttctgc gtgccctgcg tggcgaaccg    720
attggcacca tggtgcgtcg tccgaattga ctgaaagctt                          760

<210> 225
<211> 261
<212> PRT
<213> Methanococcus aeolicus

<400> 225
Met Leu Ala Ile Leu Lys Leu Gly Gly Ser Ile Leu Cys Asp Lys Asn
1               5                   10                  15
Thr Pro Phe Ser Val Lys Thr Asp Asp Leu Lys Arg Met Ser Leu Glu
            20                  25                  30
Ile Lys Lys Ala Ile Glu Tyr Tyr Lys Asn Lys Gly Glu Ile Leu Asn
        35                  40                  45
Leu Ile Ile Val His Gly Gly Gly Ser Phe Gly His Pro Val Ala Lys
    50                  55                  60
Lys Tyr Ile Lys Thr Asn Glu Asn Gly Glu Lys Val Phe Phe Asn Met
65                  70                  75                  80
Glu Lys Gly Phe Trp Asp Ile Gln Asn Ala Met Arg Lys Phe Asn Asn
                85                  90                  95
Ile Val Ile Glu Glu Leu His Gln Gln Glu Val Pro Ala Val Ser Ile
            100                 105                 110
Gln Pro Ser Ser Phe Ile Leu Phe Asp Glu Lys Gly Glu Leu His Phe
        115                 120                 125
Asp Thr Tyr Ala Ile Glu Gly Met Leu Lys Arg Asn Leu Ile Pro Val
    130                 135                 140
Ile His Gly Asp Ile Val Leu Lys Gly Glu Asn Asn Tyr Lys Ile Phe
145                 150                 155                 160
Ser Gly Asp His Ala Leu Pro Tyr Leu Ser Lys Lys Leu Asn Pro Asp
                165                 170                 175
Leu Ser Leu His Ala Ser Asp Val Asp Gly Val Tyr Asp Leu Asp Lys
            180                 185                 190
Lys Thr Ile Lys Lys Ile Asn Ser Asp Asn Ile Asn Asp Val Leu Lys
        195                 200                 205
Cys Leu Lys Pro Ser Asn Lys Gln Asp Ile Thr Gly Gly Met Tyr Leu
    210                 215                 220
Lys Val Met Glu Cys Tyr Asn Leu Gly Ile Lys Thr Ile Ile Phe Asn
225                 230                 235                 240
Gly Ser Lys Lys Asp Asn Ile Tyr Lys Ser Leu Ile Gly Glu Val Asn
                245                 250                 255
Gly Thr Lys Ile Asn
            260

<210> 226
<211> 786
<212> DNA
<213> Methanococcus aeolicus

<400> 226
atgttggcca ttttaaaact tggagggagc atattatgcg ataaaaatac tcctttttca     60
gtaaaaacag atgatttaaa aagaatgtcc ttagagataa aaaaggcaat tgaatattat    120
aaaaataaag gagagatatt aaatttaata atcgttcatg gcggcggttc atttggacat    180
cctgttgcaa aaaaatacat taaaacaaat gaaaatggag aaaaagtatt ttttaacatg    240
gaaaaaggat tttgggatat tcaaaatgcc atgcgaaaat ttaataatat tgttattgag    300
gagctccatc aacaagaagt ccctgctgta tcaattcaac cttcttcatt tatattattt    360
gatgagaagg gggagctcca ctttgatacc tatgcaatag aaggaatgtt aaaaagaaat    420
ttaattcctg ttatacacgg cgatattgtg ttgaagggag aaaataacta taaaatattt    480
tccggggacc atgccctacc atatttatca aaaaaattaa atcctgattt aagcctacat    540
gcctccgatg tagatggcgt ttatgatttg gataaaaaaa caataaaaaa aattaattca    600
gataatataa acgatgtttt aaaatgttta aaaccatcaa ataaacagga cattacaggg    660
gggatgtatt taaaagtaat ggaatgttat aatttaggca taaaaaccat aatatttaat    720
ggtagtaaaa aagataatat atataagtca ttaattggag aggttaatgg aactaaaatt    780
aattaa                                                               786

<210> 227
<211> 796
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 227
atgctggcca ttctgaaact gggtggtagc attctgtgtg ataaaaatac cccgttcagc     60
gttaaaaccg atgatctgaa acgcatgagt ctggaaatta aaaaagcaat tgagtactac    120
aagaacaagg gtgaaattct gaatctgatt attgtgcatg gcggcggcag cttcggtcat    180
ccggtggcta aaaaatatat taaaaccaat gagaacggtg agaaagtgtt cttcaatatg    240
gaaaaaggct tctgggatat tcagaatgca atgcgtaaat tcaataacat tgtgattgag    300
gaactgcatc agcaggaagt tccggcagtt agtattcagc cgagtagctt cattctgttc    360
gatgaaaaag gcgaactgca cttcgatacc tatgccattg aaggcatgct gaaacgcaat    420
ctgattccgg tgattcatgg cgatattgtt ctgaaaggtg aaaataatta caagatattc    480
agcggtgatc atgcactgcc gtatctgagc aaaaaactga atccggatct gagcctgcat    540
gccagcgatg ttgatggtgt gtatgatctg gataaaaaaa ccattaagaa gatcaacagc    600
gataatatta acgatgtgct gaaatgtctg aaaccgagta ataaacagga tattaccggt    660
ggcatgtatc tgaaagttat ggaatgctat aacctgggca ttaaaaccat tatcttcaat    720
ggtagcaaga aagataacat ctataagagc ctgattggcg aagtgaatgg taccaaaatt    780
aattgactga aagctt                                                    796

<210> 228
<211> 281
<212> PRT
<213> Heimdallarchaeota archaeon

<400> 228
Met Asn Ser Asp Glu Leu Cys Leu Ile Lys Leu Gly Gly Ala Ala Ile
1               5                   10                  15
Thr Asp Lys Thr Lys Asp Tyr Thr Met Arg Pro Glu Ile Ile Ser Ser
            20                  25                  30
Val Leu Asn Glu Ile Ser Tyr Ser Asp Lys Lys Ala Ile Ile Ile His
        35                  40                  45
Gly Ala Gly Ser Phe Ala His Asn Ile Ala Lys Glu Tyr Lys Leu Val
    50                  55                  60
His Gly Leu Asp Ser Ala Val Ala Arg Asp Leu Gln Tyr Arg Gly Val
65                  70                  75                  80
Ser Ile Thr Arg Arg Ser Leu Leu Asn Leu His Thr Ala Val Leu Asp
                85                  90                  95
Ser Ala Leu Glu Ala Asn Leu Leu Pro Phe Ser Phe Pro Val Ser Ala
            100                 105                 110
Ile Phe Val Ser Asp Gly Glu Gln Gln Leu Tyr Ser Lys Tyr Leu Asp
        115                 120                 125
Gly Val Val Glu Ala Leu Asp Lys Gly Phe Thr Pro Ile Leu Tyr Gly
    130                 135                 140
Asp Ile Ser Phe Asp Ile Lys Thr His Phe Arg Val Ile Ser Gly Asp
145                 150                 155                 160
Arg Ile Leu Arg Val Leu Val Lys His Leu Lys Gly Leu Lys His Lys
                165                 170                 175
Asp Asn Ser Leu Tyr Phe Asn Asn Ile Lys Val Phe Phe Gly Ser Asn
            180                 185                 190
Val Asp Gly Leu Tyr Asp Lys Asp Pro Lys Asn Glu Asp Ala Lys Leu
        195                 200                 205
Ile Asp Tyr Ile Thr Asn Asn Gln Ile Ala Glu Leu Ile Glu Thr Ala
    210                 215                 220
Gly Glu Ser Ala Gly Thr Asp Val Thr Gly Gly Met Lys Gly Lys Leu
225                 230                 235                 240
Met Glu Ile Lys Gln Ile Ser Asp Leu Gly Ser Glu Val Gln Ile Met
                245                 250                 255
Asn Ile Met Glu Glu Asp Arg Met Tyr His Ala Leu Val Asn Glu Pro
            260                 265                 270
Asp Ile Arg Thr Val Ile Thr Pro Asn
        275                 280

<210> 229
<211> 846
<212> DNA
<213> Heimdallarchaeota archaeon

<400> 229
atgaatagcg acgaattgtg tctgatcaaa cttggtggtg cagctatcac cgataaaacc     60
aaggattaca ccatgagacc agaaattatt tctagtgtct taaacgaaat tagttatagt    120
gacaaaaaag cgatcattat ccatggagca gggtcatttg ctcataatat agcaaaagaa    180
tataagctag ttcacggact tgattccgca gttgctcgag atttacaata tagaggcgtt    240
agcattacgc gtagatcatt attgaattta cataccgctg ttttggactc tgctttggag    300
gctaatcttc tcccattctc ttttcctgtg tcagcaattt ttgtttcaga cggagaacaa    360
caattgtatt cgaaatattt ggatggagta gttgaagcac tggataaagg ttttacaccc    420
attctttatg gggatatttc atttgacatt aaaactcatt tcagagtgat ttctggtgat    480
cgaatcctcc gagtcttggt taaacatcta aaagggttaa aacataaaga taatagcttg    540
tatttcaata atattaaagt attttttgga tctaatgttg atggattata tgataaagat    600
cctaaaaatg aggatgcaaa actaattgat tacattacaa ataaccaaat tgctgaatta    660
attgagactg ctggagaaag tgcaggaact gatgttactg ggggaatgaa aggtaaacta    720
atggaaatta aacaaattag tgatttgggg tctgaagtac aaataatgaa tattatggaa    780
gaagacagaa tgtatcatgc ccttgtaaat gaacctgata tcagaaccgt aattactccg    840
aattag                                                               846

<210> 230
<211> 856
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 230
atgaacagtg atgaactgtg cctgattaaa ctgggtggtg cagccattac cgataaaacc     60
aaagattata ccatgcgccc ggaaattatt agcagtgttc tgaatgaaat cagttatagt    120
gataagaagg caattatcat ccatggtgcc ggtagcttcg cccataatat tgccaaagaa    180
tataaactgg tgcatggtct ggatagcgca gttgcacgcg atctgcaata tcgcggcgtg    240
agtattaccc gtcgcagcct gctgaatctg cataccgccg tgctggatag tgcactggaa    300
gccaatctgc tgccgttcag cttcccggtt agtgccatct tcgttagcga tggtgaacag    360
cagctgtata gcaaatatct ggatggcgtg gttgaagccc tggataaagg cttcaccccg    420
attctgtatg gcgatattag cttcgatatt aaaacacact tccgcgttat tagtggcgat    480
cgcattctgc gtgtgctggt taaacatctg aaaggtctga aacataaaga taatagtctg    540
tacttcaaca acatcaaagt gttcttcggt agtaatgttg atggtctgta tgataaagat    600
ccgaaaaatg aagatgccaa actgattgat tacattacca ataatcagat cgcagaactg    660
attgaaaccg caggtgaaag cgcaggtacc gatgttaccg gtggcatgaa aggtaaactg    720
atggaaatta aacagatcag cgatctgggt agcgaagttc agattatgaa tattatggaa    780
gaggatcgta tgtatcatgc actggttaat gaaccggata ttcgtaccgt gattaccccg    840
aattgactga aagctt                                                    856

<210> 231
<211> 276
<212> PRT
<213> Uryarchaeota archaeon

<400> 231
Met Thr Leu Tyr Leu Phe Asp Asp Leu Ser Ile Ser Thr Lys Arg Ile
1               5                   10                  15
Met Ser Lys Glu Lys Ile Leu Leu Lys Phe Gly Gly Ser Leu Ile Thr
            20                  25                  30
Glu Lys Met Ser Asp Thr Pro Lys Ile Asn Thr Ile Asn Leu Asp Arg
        35                  40                  45
Ile Gly Lys Val Leu Asn Asn Lys Glu Tyr Asp Ile Ile Val Val His
    50                  55                  60
Gly Ala Gly Ser Phe Gly His Pro Ile Ala Gln Lys Phe Asn Leu Ile
65                  70                  75                  80
Asp Gly Leu Asn Glu Ser Pro Glu Gln Lys Lys Ser Ile Ala Glu Ile
                85                  90                  95
Arg Glu Gln Met Glu Lys Leu Asn His Val Leu Cys Ser Ile Ile Glu
            100                 105                 110
Lys Asn Gly Met Lys Thr Lys Ser Val Ile Pro Ser Lys Thr Met Ile
        115                 120                 125
Thr Lys Gly Ala Arg Asn Ile Ala Lys Phe Pro Thr Glu Ile Phe Asp
    130                 135                 140
Lys Cys Ile Glu Glu Gly Asn Ile Pro Ile Thr Phe Gly Asp Ala Thr
145                 150                 155                 160
Asp Asp Glu Leu Gln Gly Ile Asn Ile Leu Ser Gly Asp Val Ile Met
                165                 170                 175
Met Glu Leu Ala Arg Ile Tyr Lys Pro Ala Phe Ser Val Phe Val Met
            180                 185                 190
Asp Leu Pro Gly Val Met Asp Gly Asp Pro Lys Ser Lys Asp Ser Lys
        195                 200                 205
Val Ile Pro Arg Val Asp Ala Lys Ile Ile Arg Glu Leu Lys Glu Lys
    210                 215                 220
Thr Phe Ser Asn Gly Asn Thr Asp Val Thr Gly Gly Leu Ile Gly Lys
225                 230                 235                 240
Leu Glu Cys Ala Leu Glu Ile Ala Gln His Ser Gln Cys Trp Ile Thr
                245                 250                 255
Asn Leu Asp Ser Leu Glu Met Val Leu Thr Gly Asn Pro Arg Gly Ser
            260                 265                 270
Glu Val Val Leu
        275

<210> 232
<211> 831
<212> DNA
<213> Uryarchaeota archaeon

<400> 232
atgacactct acctttttga cgacttatct atatctacaa agagaatcat gtctaaagaa     60
aaaattcttt taaaatttgg gggatcttta attaccgaaa aaatgagtga cacaccaaag    120
attaatacca taaatttaga cagaattgga aaagttttga ataacaaaga atatgatatt    180
attgtggtcc atggagccgg atcattcgga catccaattg ctcaaaaatt taatctaatt    240
gatggattaa acgaaagtcc agagcaaaaa aaatctattg ctgagataag agaacaaatg    300
gaaaaattaa atcatgtttt atgtagtatt attgaaaaaa atggaatgaa aactaaatct    360
gttatcccct ctaaaactat gatcactaag ggtgcccgaa atatcgctaa attccctact    420
gaaatatttg ataagtgtat tgaagaaggt aatataccca ttacatttgg tgatgctaca    480
gatgatgaat tacaaggaat taatatacta agtggtgatg ttataatgat ggaattagca    540
cgaatataca aacctgcttt ctcagtattt gtaatggatc tgcctggagt aatggatggc    600
gatccaaaat ctaaggatag taaagttatt cctagagttg acgcaaaaat cattagagaa    660
ttgaaagaaa aaacatttag caatggaaac acggatgtta ccggtggatt aattggaaaa    720
ttggaatgtg cactcgaaat agctcagcac agccaatgtt ggataactaa tttagactcc    780
ttggagatgg tattaactgg taatccaaga ggaagtgagg ttgttttatg a             831

<210> 233
<211> 841
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 233
atgaccctgt atctgttcga tgatctgagc attagtacca aacgcattat gagcaaagaa     60
aaaattctgc tgaagttcgg cggcagcctg attaccgaaa aaatgagtga taccccgaaa    120
attaatacca ttaatctgga tcgtatcggt aaagtgctga ataataaaga atacgatatc    180
atcgtggtgc atggtgcagg tagcttcggc catccgattg cccagaaatt caatctgatt    240
gatggcctga atgaaagccc ggaacagaaa aaaagtattg ccgaaattcg cgaacagatg    300
gaaaaactga atcatgttct gtgcagtatt attgaaaaga atggcatgaa aaccaagagc    360
gttattccga gcaaaaccat gattaccaaa ggcgcacgta atattgccaa attcccgacc    420
gaaatcttcg ataaatgtat tgaagaaggc aatatcccga ttaccttcgg tgatgcaacc    480
gatgatgaac tgcaaggtat taatattctg agtggtgatg ttattatgat ggaactggca    540
cgcatctata aaccggcctt cagcgtgttc gtgatggatc tgccgggtgt gatggatggc    600
gatccgaaaa gtaaagatag taaagttatc ccgcgtgtgg atgcaaaaat tattcgtgaa    660
ctgaaagaaa agaccttcag taatggtaat accgatgtta ccggtggcct gattggtaaa    720
ctggaatgtg ccctggaaat tgcccagcat agtcagtgtt ggattaccaa tctggatagt    780
ctggaaatgg tgctgaccgg caatccgcgc ggtagtgaag ttgtgctgtg actgaaagct    840
t                                                                    841

<210> 234
<211> 289
<212> PRT
<213> Archaeon species

<400> 234
Met Arg Lys Ser Asn Ile Asn Lys Lys Glu Ile Lys Glu Ile Pro Ile
1               5                   10                  15
Asn Asn Lys Lys Asn Ile Gly Leu Ser Asp Leu Val Val Ile Lys Phe
            20                  25                  30
Gly Gly Ser Ser Ile Thr Lys Lys Ala Asp Asn Gln Phe Glu Met Asn
        35                  40                  45
Tyr Glu Val Leu Asn Gln Ser Ala Glu Glu Leu Tyr Arg Ala Ile Lys
    50                  55                  60
Lys Thr Lys Tyr Lys Val Ala Leu Ile Cys Gly Val Gly Pro Phe Gly
65                  70                  75                  80
His Thr Asn Val Lys Lys Leu Asn Leu Asn Asp Gly Ile Lys Thr Arg
                85                  90                  95
Glu Gln Glu Glu Gly Thr Gln Lys Thr Ile Val Asp Cys Asn Phe Val
            100                 105                 110
Ala Gln Glu Thr Ser Thr Ala Leu Glu Lys Phe Gly Leu Lys Thr Lys
        115                 120                 125
Ile Ile Pro Gly Tyr Leu Val Cys Lys Gln Asp Asn Arg Lys Ala Ile
    130                 135                 140
Ser Phe Asp Thr Lys Glu Tyr Val Lys Ala Ile His Gly Gly Phe Ile
145                 150                 155                 160
Pro Ile Thr Thr Gly Thr Met Val Lys Asp Lys Thr Leu Lys Trp Ser
                165                 170                 175
Val Met Ser Gly Asp Thr Ala Val Ala Glu Leu Cys Lys Gln Leu Arg
            180                 185                 190
Pro Arg Lys Val Ile Met Gly Thr Asp Val Asp Gly Ile Tyr Thr Ala
        195                 200                 205
Asp Pro Lys Val Asn Pro Lys Ala Lys Leu Ile Glu Ser Ile Thr Lys
    210                 215                 220
Glu Asn Val Pro Lys Ile Leu Glu Met Val Gly Glu Ser Asn Ser Val
225                 230                 235                 240
Asp Val Thr Gly Gly Met Lys Gly Lys Leu Glu Lys Leu Ala Leu Thr
                245                 250                 255
Leu Asn Gly Val Pro Gly Glu Ile Phe Asn Leu Phe Thr Lys Gly Asn
            260                 265                 270
Leu Glu Lys Ala Phe Ile Gly Glu Glu Ile Lys Asp Thr Lys Ile Arg
        275                 280                 285
Leu

<210> 235
<211> 870
<212> DNA
<213> Archaeon species

<400> 235
atgcgtaaat ctaatattaa caaaaaagaa ataaaagaaa ttcctattaa taataaaaaa     60
aatattgggt tgagtgattt agtagtaata aaatttgggg gttcttcaat aactaaaaaa    120
gcggataatc aatttgaaat gaattacgaa gtgctaaatc aatcagcaga agaactttat    180
agagcaataa aaaaaacaaa atataaagtt gcacttattt gtggggtagg cccatttgga    240
catactaatg taaaaaaact taatctaaat gatgggataa aaacaagaga acaagaagaa    300
ggaacacaaa aaacaattgt tgattgtaat tttgttgcac aagaaacttc tactgctcta    360
gaaaaattcg gactcaaaac aaaaataatt ccagggtatt tagtatgcaa gcaagacaat    420
agaaaagcaa tttcatttga tactaaagaa tatgtgaaag caattcatgg aggatttatt    480
ccgataacca ctgggacaat ggttaaagac aaaactttga aatggagtgt aatgagtgga    540
gacacagcag tagcagaact ctgcaaacaa ttaaggccaa gaaaagtaat aatggggact    600
gatgtggatg gaatttatac tgcggaccca aaagttaatc caaaagcaaa attaattgaa    660
agtattacaa aagaaaatgt tccaaaaatt ttggagatgg tgggagaatc aaattcagtt    720
gatgtaactg ggggaatgaa aggaaaatta gaaaaactag ccctaacact aaatggagtg    780
cctggagaga tatttaattt attcacaaaa ggaaatttag aaaaagcatt tattggagaa    840
gaaataaaag acacaaaaat aagattataa                                     870

<210> 236
<211> 880
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 236
atgcgtaaaa gcaatatcaa caagaaggaa attaaggaaa tcccgattaa taacaagaag     60
aatatcggcc tgagcgatct ggtggtgatt aaattcggtg gcagtagcat taccaaaaaa    120
gccgataatc agttcgaaat gaattatgaa gtgctgaatc agagtgccga agaactgtat    180
cgcgcaatta aaaaaaccaa atataaggtt gcgctgatct gtggcgttgg cccgttcggt    240
cataccaatg ttaaaaaact gaatctgaac gatggcatta aaaccagaga acaggaagaa    300
ggcacccaga aaaccattgt tgattgcaac ttcgttgcac aggaaaccag caccgcactg    360
gaaaaattcg gtctgaaaac caaaattatc ccgggttatc tggtgtgtaa acaggataat    420
cgtaaagcaa ttagcttcga taccaaagaa tatgtgaaag ccattcatgg cggcttcatt    480
ccgattacca ccggtaccat ggtgaaagat aaaaccttaa aatggagtgt gatgagtggc    540
gataccgcag tggcagaact gtgtaaacag ctgcgtccgc gcaaagtgat tatgggtacc    600
gatgttgatg gtatctatac cgcagatccg aaagtgaatc cgaaagccaa actgattgaa    660
agtattacca aagaaaacgt tccgaaaatt ctggaaatgg ttggcgaaag taatagtgtg    720
gatgtgaccg gtggtatgaa aggcaaactg gaaaaactgg cactgaccct gaatggtgtt    780
ccgggcgaaa tcttcaatct gttcaccaaa ggcaatctgg aaaaagcatt cattggtgaa    840
gaaattaagg ataccaaaat ccgtctgtga ctgaaagctt                          880

<210> 237
<211> 268
<212> PRT
<213> Thorarchaeota archaeon

<400> 237
Met Ile Gln Leu Lys His Leu Thr Ile Val Lys Leu Gly Gly Ser Val
1               5                   10                  15
Ile Thr His Lys Asp Ser Thr Pro Pro Lys Val Asn Glu Ala Asn Leu
            20                  25                  30
Ser Arg Ile Ala Asn Glu Leu Lys Val His Thr Arg Gly Glu Leu Ile
        35                  40                  45
Ile Ile Leu Gly Gly Gly Ala His Gly His Gln Ala Ala His Ser His
    50                  55                  60
Gly Phe Ala Asp Pro Thr Ser Pro Lys Glu Leu Leu Val Lys Gly Ile
65                  70                  75                  80
Pro Pro Ile Arg His Asn Met Ser Ala Leu Ala Ser Ser Val Glu Thr
                85                  90                  95
Ser Leu Ser Glu Glu Gly Ile Pro Ala Ile Val Ile Pro Pro Phe Val
            100                 105                 110
Ser Val Thr Leu Arg Asn Gly Leu Ile His Asp Tyr Pro Thr Asn Ile
        115                 120                 125
Ile Arg Lys Ser Leu Asp Asn Gly Leu Val Val Ile Thr His Gly Asp
    130                 135                 140
Val Cys Phe Asp Glu Glu Asn Ile Val Ser Ile Leu Ser Gly Asp Thr
145                 150                 155                 160
Ile Ala Val Tyr Leu Ala Lys Glu Leu Asp Ala Lys Thr Ile Leu Ile
                165                 170                 175
Gly Thr Asp Val Asp Gly Val Leu Asp Asp Asn Pro Lys Thr Asn Pro
            180                 185                 190
Ser Ala Lys His Ile Pro Val Ile Asn Gln Glu Asn Lys Asp Thr Ile
        195                 200                 205
Leu Ser Lys Thr Gly Pro Ser Thr Asn Thr Asp Val Thr Gly Gly Met
    210                 215                 220
Ser Lys Lys Val Thr Glu Leu Leu Glu Ile Ser Arg Gln Asn Arg Glu
225                 230                 235                 240
Ile Ile Ile Phe Asn Leu Thr Val Pro Asp Arg Leu Lys Phe Leu Leu
                245                 250                 255
Gln Asn Lys Thr Thr Ile Cys Thr Arg Ile Gln Ser
            260                 265

<210> 238
<211> 807
<212> DNA
<213> Thorarchaeota archaeon

<400> 238
gtgatacaat tgaaacacct cacaattgtg aaactaggcg ggtcagtgat aacacacaaa     60
gactcgactc ctcctaaagt caatgaggca aacttgtccc gaattgccaa cgaactaaaa    120
gtacatacta gaggtgaact aatcattatt cttggcggcg gagctcacgg tcaccaagcc    180
gcccattcac atggctttgc tgatcctaca agtcccaaag agcttcttgt aaaaggaatt    240
ccaccaatac ggcataatat gtccgcactt gcatcctctg ttgaaacaag tctaagcgaa    300
gaaggaatcc ctgcaatcgt cattcctccc tttgtgtctg taacactaag aaatggtttg    360
atacatgatt atccaaccaa cataattaga aagtcattgg acaatgggct tgttgtaata    420
acccatggag atgtttgttt tgatgaagaa aacattgtgt ctattctcag tggggataca    480
attgcagttt accttgcaaa agagctagat gcaaaaacca tactcattgg aaccgatgtt    540
gacggagtct tagacgataa tccaaaaaca aacccatcag ccaaacacat acctgtaata    600
aatcaagaaa acaaggacac cattctatcc aaaacaggac cttcgaccaa taccgatgta    660
acaggcggca tgtcgaaaaa agtgaccgag cttctcgaaa tttctagaca gaacagagag    720
attatcatct tcaacctcac agttcccgat cgtcttaagt ttctattaca aaataaaaca    780
accatctgca caagaattca gtcataa                                        807

<210> 239
<211> 817
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 239
atgatccagc tgaaacatct gaccattgtt aaactgggcg gcagcgtgat tacccataaa     60
gatagcaccc cgccgaaagt gaatgaagca aatctgagtc gcattgccaa tgaactgaaa    120
gttcataccc gcggcgaact gattattatt ctgggcggtg gcgcccatgg ccatcaggct    180
gctcatagtc atggcttcgc cgatccgacc agcccgaaag aactgctggt gaaaggcatt    240
ccgccgattc gtcataatat gagtgcactg gcaagcagcg tggaaaccag cctgagcgaa    300
gaaggcattc cggccattgt tattccgccg ttcgtgagtg ttaccctgcg caatggtctg    360
attcatgatt atccgaccaa tattattcgt aaaagtctgg ataacggcct ggtggttatt    420
acccatggtg atgtgtgctt cgatgaagaa aatattgtta gtatcctgag cggtgatacc    480
attgcagtgt atctggcaaa agaactggat gccaaaacca ttctgattgg taccgatgtg    540
gatggtgttc tggatgataa tccgaaaacc aatccgagcg ccaaacatat tccggtgatt    600
aatcaggaaa ataaagatac catcctgagt aaaaccggcc cgagcaccaa taccgatgtg    660
accggtggta tgagtaaaaa agttaccgaa ctgctggaaa ttagccgcca gaatcgtgaa    720
attattatct tcaatctgac cgttccggat cgcctgaaat tcctgctgca aaataaaacc    780
accatctgta cccgtattca gagttgactg aaagctt                             817

<210> 240
<211> 324
<212> PRT
<213> Streptomyces thermolilacinus

<400> 240
Met Thr Ala Asp Val Ser Ala Ala Pro Arg Thr Ala Pro Gly Ala Ala
1               5                   10                  15
Pro Ala Thr Ala Ala Ala Gly Glu Pro Gly Leu Leu Val Leu Lys Val
            20                  25                  30
Gly Gly Ser Leu Leu Ser Asp Lys Arg His Ser Gly Glu Thr Asp His
        35                  40                  45
Ala Thr Ile Asp Ala Tyr Ala Ser Gln Val Ala Glu Leu Val Thr Ala
    50                  55                  60
His Pro Gly Arg Ile Val Leu Val Thr Gly Gly Gly Ala Leu Cys His
65                  70                  75                  80
Pro Val Gly Leu Arg Ile Lys Ala Ala Lys Asp Asp Pro Tyr Ala Ala
                85                  90                  95
Val Ala Leu Thr Glu Pro Ala Phe Arg Met Arg Trp Ala Trp Thr Thr
            100                 105                 110
Ala Leu Arg Ala His Gly Val Arg Ala Val Pro Leu Gln Thr Thr Ser
        115                 120                 125
Met Leu Asn Glu Leu Ala Asp Gly Thr Thr Val Thr Glu Thr Gly Val
    130                 135                 140
Val Ser Arg Leu Leu Ala Glu Gly Ala Leu Pro Val Leu Ser Ser Asp
145                 150                 155                 160
Cys Val Val Thr Ala Thr Gly Thr Leu Arg Ile Leu Ser Ser Asp Asp
                165                 170                 175
Val Pro Gly Val Val Leu Asp Ala Ala Val Ala Pro Gly Pro Val Arg
            180                 185                 190
Val Val Ala Leu Thr Asp Val Ala Gly Ile His Leu Ala Arg Asp Pro
        195                 200                 205
Asp Ser Pro Val Leu Pro His Leu Asp Pro Asp Asp Leu Gly Ala Val
    210                 215                 220
Arg Arg Leu Phe Trp Asp Asp Ala Trp Asp Ala Thr Gly Ala Met Glu
225                 230                 235                 240
Gly Lys Val Glu Ala Leu Ala Ala His Ala Arg Arg Gly Ala Glu Cys
                245                 250                 255
Val Ile Thr Arg Gly Asp His Arg Pro Gly Gly Leu Arg His Leu Phe
            260                 265                 270
Ala Pro Leu Asp Ala Trp Pro Gly Asp Val Pro Arg Thr Leu Ile Ser
        275                 280                 285
Arg Arg Thr Pro Ala Gly Pro Ala Ala Val Thr Ala Gly Pro Thr Asp
    290                 295                 300
Pro Ala Ala Thr Pro Ala Thr Val Thr Thr Ala Ala Gln Pro Pro Gln
305                 310                 315                 320
Glu Asp Pro Thr

<210> 241
<211> 975
<212> DNA
<213> Streptomyces thermolilacinus

<400> 241
gtgaccgccg acgtgtccgc ggcgccccgc accgcaccgg gcgccgcgcc cgcgacggcg     60
gcggccgggg agccggggct gctcgtcctc aaggtcggcg gcagcctcct ctccgacaag    120
cggcacagcg gggagaccga ccacgcgacc atcgacgcct acgcgtccca ggtcgccgag    180
ctggtcaccg cgcaccccgg ccggatcgtc ctggtcaccg ggggcggcgc gctgtgccac    240
ccggtcggcc tgcggatcaa ggccgccaag gacgacccgt acgcggcagt cgccctcacc    300
gaaccggcct tccgcatgcg ctgggcgtgg accaccgcgc tgcgcgccca cggggtgcgg    360
gccgtgccgc tccagaccac gtcgatgctg aacgagctgg ccgacggcac caccgtcacc    420
gagaccggtg tggtgtcccg gctcctcgcc gagggcgcgc tgccggtgct gtccagcgac    480
tgcgtggtca ccgccaccgg caccctgcgc atcctcagca gcgacgacgt ccccggtgtg    540
gtgctggacg ccgccgtggc gccggggccc gtccgggtcg tggccctcac cgacgtggcc    600
ggcatccacc tcgcccggga cccggacagc cccgtgctgc cgcacctcga ccccgacgac    660
ctcggggccg tacggcggct gttctgggac gacgcgtggg acgccaccgg ggccatggag    720
ggcaaggtcg aggcgctcgc cgcgcacgcc cgccgcggcg ccgagtgcgt catcacccgc    780
ggcgaccacc ggccgggcgg gctgcgccac ctgttcgccc cgctcgacgc ctggcccggc    840
gacgtccccc gcacgctgat cagccgccgt acccccgccg ggcccgcagc cgtcaccgcg    900
ggcccgacgg acccggccgc cacccccgcc accgtcacca ccgccgccca accgccccag    960
gaggacccga catga                                                     975

<210> 242
<211> 985
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 242
atgaccgccg atgtgagcgc cgcaccgcgt accgcaccgg gtgcagcacc tgctaccgcc     60
gcagcaggcg aaccgggtct gctggtgctg aaagtgggtg gcagtctgct gagtgataaa    120
cgtcatagcg gtgaaaccga tcatgccacc attgatgcat acgctagtca ggtggccgaa    180
ctggtgaccg cacatccggg tcgtattgtg ctggttaccg gtggtggtgc cctgtgccat    240
ccggtgggtc tgcgcattaa agcagccaaa gatgatccgt atgcagcagt tgccctgacc    300
gaaccggcat tccgtatgcg ttgggcatgg accaccgcac tgcgtgccca tggtgttcgc    360
gccgtgccgc tgcaaaccac cagcatgctg aatgaactgg cagatggtac caccgtgacc    420
gaaaccggcg ttgtgagtcg cctgctggca gaaggtgcac tgccggttct gagcagcgat    480
tgcgttgtta ccgcaaccgg caccctgcgc attctgagca gcgacgatgt gccgggcgtg    540
gtgctggatg cagccgtggc tccgggtccg gttcgtgtgg tggccctgac cgatgttgcc    600
ggcattcatc tggcacgcga tccggatagt ccggttctgc cgcatctgga ccctgatgat    660
ctgggtgccg ttcgtcgcct gttctgggat gatgcctggg atgccaccgg cgcaatggaa    720
ggcaaagttg aagcactggc agcacatgca cgtcgcggtg cagaatgtgt gattacccgt    780
ggtgatcatc gcccgggtgg cctgcgccat ctgttcgctc cgctggatgc ctggccgggt    840
gatgtgccgc gtaccctgat tagccgtcgt accccggccg gcccggctgc cgttaccgca    900
ggtcctaccg atccggcagc caccccggca accgttacca ccgccgctca gccgccgcag    960
gaagatccga cctgactgaa agctt                                          985

<210> 243
<211> 166
<212> PRT
<213> Solanum tuberosum

<400> 243
Met Thr Glu Ala Asp Ile Ser Met Val Ile Lys Ala Ile Asp Ala Gly
1               5                   10                  15
Phe Ile Pro Val Leu His Gly Asp Ala Val Leu Asp Thr Leu Gln Glu
            20                  25                  30
Cys Thr Ile Leu Ser Gly Asp Met Ile Ile Arg His Leu Ala Ala Glu
        35                  40                  45
Leu Lys Pro Glu Phe Val Val Phe Leu Thr Asp Val Leu Gly Val Tyr
    50                  55                  60
Asp Arg Pro Pro Val Glu Pro Gly Ala Val Leu Ile Arg Glu Ile Ala
65                  70                  75                  80
Val Arg Glu Asp Gly Ser Trp Ser Val Val Lys Pro Lys Leu Glu Asp
                85                  90                  95
Thr Ser Lys Pro Val Glu Phe Thr Val Ala Ala His Asp Thr Thr Gly
            100                 105                 110
Gly Met Val Thr Lys Ile Thr Glu Ala Ala Met Ile Ala Lys Leu Gly
        115                 120                 125
Ile Asp Val Tyr Ile Thr Lys Ala Gly Thr Asp His Ser Val Lys Ala
    130                 135                 140
Leu Ser Gly Phe Leu Lys Gly Gly Ile Pro Asp Asp Trp Leu Gly Thr
145                 150                 155                 160
Ala Ile Arg Tyr Met Ser
                165

<210> 244
<211> 501
<212> DNA
<213> Solanum tuberosum

<400> 244
atgacggagg ctgacatttc catggtcatt aaagctattg atgctggttt tatacctgtt     60
ctgcatggag atgcagttct ggatacatta caggagtgca ctattctgag tggagacatg    120
ataatacgtc atttagcagc tgaactaaag ccagagtttg ttgtttttct tacagatgtt    180
cttggtgtat atgatcgtcc accagtagaa cctggcgctg tacttatccg ggaaatagct    240
gtacgtgaag atggaagctg gtcggtagtg aaacctaaac tagaagatac aagcaagcct    300
gttgaattca cggtagctgc acatgataca actggtggga tggtaacaaa aataacagaa    360
gctgccatga ttgcgaagct tgggattgat gtctacataa ctaaggcagg aacagaccat    420
tcagtgaaag cccttagtgg attcttgaag ggtggcatac ctgatgactg gctcggaaca    480
gccattcgtt acatgagctg a                                              501

<210> 245
<211> 511
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 245
atgaccgaag cagatattag tatggtgatt aaagccattg atgccggctt cattccggtt     60
ctgcatggcg atgccgtgct ggataccctg caagaatgca ccattctgag cggtgatatg    120
attattcgcc atctggccgc cgaactgaaa ccggagttcg ttgtgttcct gaccgatgtg    180
ctgggcgtgt atgatcgccc gccggttgaa ccgggcgccg ttcttattcg cgaaattgcc    240
gtgcgcgaag atggtagctg gagtgttgtt aaaccgaaac tggaagatac cagtaaaccg    300
gtggagttca ccgtggcagc ccatgatacc accggcggta tggtgaccaa aattaccgaa    360
gccgcaatga ttgccaaact gggcattgat gtgtatatta ccaaagcagg caccgatcat    420
agtgttaaag cactgagcgg cttcctgaaa ggtggtattc cggatgattg gctgggcacc    480
gcaattcgct atatgagttg actgaaagct t                                   511

<210> 246
<211> 252
<212> PRT
<213> Euryarchaeota archaeon

<400> 246
Met Lys Val Leu Lys Ile Gly Gly Ser Val Leu Thr Glu Lys His Thr
1               5                   10                  15
Glu Glu Lys Lys Val Arg Met Glu Val Leu Asp Arg Ile Ala Ser Glu
            20                  25                  30
Ile Ala Pro Arg Ala Asp Gly Leu Ile Leu Val His Gly Ala Gly Ser
        35                  40                  45
Phe Gly His Pro Glu Ala Ile Arg His Gly Val Gly Arg Arg Phe Ser
    50                  55                  60
Thr Glu Gly Val Leu Lys Thr His Gln Ser Val Cys Leu Leu Asn Arg
65                  70                  75                  80
Ile Val Val Ser Ala Leu Val Arg Gly Gly Val Pro Ala Val Pro Val
                85                  90                  95
Ser Pro Leu Gly Cys Ala Ile Ala Asp Gly Gly Arg Leu Val Ser Met
            100                 105                 110
Glu Met Thr Pro Ile Leu His Met Val Glu Arg Gly Leu Val Pro Val
        115                 120                 125
Leu His Gly Asp Val Val Met Asp Arg Thr Leu Gly Ala Ala Val Val
    130                 135                 140
Ser Gly Asp Ala Leu Val Ala His Ile Ala Lys Gly Leu Gly Ala His
145                 150                 155                 160
His Val Gly Met Gly Thr Ser Ala Gln Gly Val Leu Asp Ala His Gly
                165                 170                 175
Arg Thr Ile Pro Glu Val Thr Glu His Asn Ile Glu Gln Val Arg Gln
            180                 185                 190
Trp Val Arg Pro Ser Gly Gly Gly Asp Ala Thr Gly Gly Met Val Gly
        195                 200                 205
Lys Val Glu Glu Leu Trp Arg Leu Ala Gly Glu Gly Ile Glu Ser Trp
    210                 215                 220
Val Phe Ser Ala Leu Asp Glu Gly Ala Val Ala Ala Phe Leu Asp Gly
225                 230                 235                 240
His Pro Val Gly Thr Arg Val Arg Asn Val Glu Leu
                245                 250

<210> 247
<211> 759
<212> DNA
<213> Euryarchaeota archaeon

<400> 247
gtgaaggtgc tgaagatagg cggaagcgtg ctcacagaga agcacacaga ggaaaagaag     60
gtaaggatgg aggtgctcga ccgcattgct tctgagattg ccccacgggc agatgggctg    120
atactggtgc acggagctgg ctcattcggg catcccgagg cgataaggca cggggtgggc    180
aggaggttca gcaccgaggg tgtgctaaag acccaccaga gcgtgtgcct tctcaacagg    240
atagtggtga gcgcactcgt gagagggggc gtgcccgccg tcccggtgtc cccactgggc    300
tgtgccatcg cagatggcgg aaggctcgta tccatggaga tgacgcccat actccacatg    360
gtggagcgtg gactcgtgcc tgtgctgcac ggcgatgtgg tgatggatcg cacccttgga    420
gcggcagtgg tgtctggaga cgcgctcgtg gcgcacatcg caaagggact gggtgcgcac    480
cacgtgggca tgggcacgag tgcacagggg gtgctggatg cccatggacg caccattccg    540
gaagtaacgg agcacaacat cgagcaggtt agacaatggg tgcgtccctc tggaggcggg    600
gatgccactg gaggaatggt cggaaaggtg gaggagctgt ggaggctggc gggcgagggc    660
atagagtcgt gggtgttcag cgccctcgat gagggggcgg tggcggcgtt tctagatggg    720
catccagtgg gcacgagggt gagaaatgtt gaactatga                           759

<210> 248
<211> 769
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 248
atgaaggttc tgaaaatcgg tggcagcgtt ctgaccgaaa aacataccga agaaaaaaaa     60
gtgcgcatgg aagttctgga tcgcattgcc agtgaaattg ccccgcgtgc cgatggcctg    120
attctggtgc atggtgcagg tagcttcggt catccggaag ccattcgcca tggtgtgggt    180
cgtcgcttca gcaccgaagg tgtgctgaaa acacatcaga gcgtgtgtct gctgaatcgt    240
attgttgtga gcgcactggt gcgcggtggc gtgcctgccg ttcctgtgag tccgctgggt    300
tgtgcaattg cagatggtgg ccgcctggtt agcatggaaa tgaccccgat tctgcacatg    360
gttgaacgcg gtctggtgcc ggtgctgcat ggtgatgtgg ttatggatcg taccctgggc    420
gcagcagttg tgagcggcga tgccctggtg gcacatattg caaaaggcct gggtgcccat    480
catgttggca tgggcaccag tgcccagggc gtgctggatg cccatggtcg taccattccg    540
gaagtgaccg aacataatat tgaacaggtg cgccagtggg ttcgtccgag tggtggcggc    600
gatgcaaccg gtggcatggt gggtaaagtg gaagaactgt ggcgtctggc aggtgaaggt    660
attgaaagtt gggtgttcag tgcactggat gaaggcgcag ttgcagcctt cctggatggc    720
catccggtgg gcacccgtgt gcgcaatgtt gaactgtgac tgaaagctt                769

<210> 249
<211> 263
<212> PRT
<213> Archaeon species

<400> 249
Met Asn Pro Leu Ile Val Lys Leu Gly Gly Ser Val Ile Thr Asp Lys
1               5                   10                  15
Ala Lys Glu Phe Ser Met Lys Arg Gly Glu Ile Glu Arg Leu Ala Lys
            20                  25                  30
Glu Leu Thr Ser Val Asp Gly Pro Leu Val Val Val His Gly Gly Gly
        35                  40                  45
Ser Phe Gly His Pro Leu Ala Ser Glu Tyr Glu Ile Asp Ser Gly Tyr
    50                  55                  60
Lys Asp Asp Leu Gln Leu Met Gly Phe Thr Leu Thr His His Ala Met
65                  70                  75                  80
Gln Lys Leu Asn Phe Glu Val Val Asp Ser Leu His Gly Ala Asn Leu
                85                  90                  95
Leu Ala Val Ser Ile Gln Pro Ser Ala Cys Thr Ile Val Arg Asn Gly
            100                 105                 110
Arg Ile Ile Ser Ile Glu Leu Glu Pro Leu Arg Lys Leu Leu Asp Leu
        115                 120                 125
Gly Phe Val Pro Val Leu His Gly Asp Ser Val Pro Asp Leu Asp Lys
    130                 135                 140
Gly Met Ser Ile Leu Ser Gly Asp Gln Leu Val Val Phe Leu Ala Arg
145                 150                 155                 160
Glu Leu Lys Ala Asp Ser Val Ile Leu Gly Val Asp Thr Asp Gly Val
                165                 170                 175
Cys Thr Gly Asp Pro Lys Gln Gly Glu Lys Val Glu Leu Ile Pro Lys
            180                 185                 190
Ile Thr Pro Lys Ser Trp Pro Arg Ile Ala Asp Ser Leu Thr Pro Ser
        195                 200                 205
Pro Val Phe Asp Val Thr Gly Gly Met Arg Arg Lys Val Glu Glu Leu
    210                 215                 220
Met Lys Leu Pro Glu Ile Gly Ile Glu Ala Gln Ile Val Asn Ala Ser
225                 230                 235                 240
Lys Pro Lys Ile Leu Glu Lys Ala Ile Asn Gly Asp Lys Ser Leu Gly
                245                 250                 255
Thr Arg Ile Val Glu Gly Ser
            260

<210> 250
<211> 792
<212> DNA
<213> Archaeon species

<400> 250
atgaatcctt tgattgtaaa acttggcgga agcgtgataa cggataaggc caaggaattt     60
tcaatgaaac ggggggaaat tgaacgtctt gcaaaggagt tgacctcggt tgatggccca    120
cttgtggtgg ttcatggtgg gggttcgttt ggtcaccccc tcgcctctga atacgaaatt    180
gattcaggat ataaagatga ccttcagctt atgggcttca ccttgaccca ccatgcgatg    240
cagaagctca acttcgaggt ggtagattct ctgcatgggg ccaatctgct agcagtctcg    300
attcaaccat cagcatgcac gatcgttagg aacgggagaa ttatttccat agagcttgaa    360
cctcttcgta agttactgga tttgggcttt gttcccgtat tgcatggcga tagcgttccg    420
gatctcgata agggcatgag tatactgtcc ggcgatcagc ttgtcgtttt tttggcccga    480
gaattaaaag cagatagtgt gatacttgga gttgacacgg atggggtttg tacgggcgat    540
cctaagcaag gcgagaaggt agaactcata ccaaagataa cgcccaaaag ctggccaaga    600
atcgctgact ccctcacgcc atctcccgtt tttgacgtta ccggggggat gagaaggaag    660
gtcgaagagt taatgaaatt acccgaaata ggtatcgagg cacagatagt aaatgcatct    720
aagcctaaaa tccttgaaaa agcaatcaac ggcgataaaa gcttgggaac aaggatagtt    780
gagggtagct aa                                                        792

<210> 251
<211> 802
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 251
atgaacccgc tgattgttaa actgggcggc agtgttatta ccgataaagc aaaagagttc     60
agtatgaaac gcggcgaaat tgaacgtctg gccaaagaac tgaccagcgt tgatggtccg    120
ctggttgtgg ttcatggcgg tggtagcttc ggtcatccgc tggccagtga atatgaaatt    180
gatagcggct ataaagatga tctgcaactg atgggcttca ccctgaccca tcatgcaatg    240
cagaaactga acttcgaagt tgttgatagt ctgcatggtg caaatctgct ggcagtgagt    300
attcagccga gtgcatgtac cattgttcgc aatggtcgca ttattagtat tgaactggaa    360
ccgctgcgta aactgctgga tctgggcttc gtgccggtgc tgcatggtga tagcgttccg    420
gatctggata aaggcatgag cattctgagt ggcgatcagc tggttgtgtt cctggcacgt    480
gaactgaaag cagatagcgt tattctgggt gtggataccg atggtgtgtg caccggcgat    540
ccgaaacagg gtgaaaaagt tgaactgatt ccgaaaatta ccccgaaaag ctggccgcgt    600
attgccgata gcctgacccc gagtccggtg ttcgatgtta ccggtggcat gcgtcgtaaa    660
gttgaagaac tgatgaaact gccggaaatt ggtattgaag cacagattgt gaatgccagt    720
aaaccgaaaa ttctggaaaa agccattaat ggcgataaaa gcctgggcac ccgtattgtg    780
gaaggtagtt gactgaaagc tt                                             802

<210> 252
<211> 260
<212> PRT
<213> Nethanofastidiosum methylthiophilus

<400> 252
Met Ile Ile Ile Lys Ala Gly Gly Ser Ala Ile Thr Lys Lys Ser Glu
1               5                   10                  15
Asp Phe Thr Pro Asn Met Glu Val Ile Ser Asn Leu Ala Gln Glu Ile
            20                  25                  30
Lys Glu Ala Gly Arg Val Ser Ile Leu Val His Gly Ala Gly Ser Tyr
        35                  40                  45
Gly His Pro Ile Ala Lys Lys Tyr Ser Leu Gly Lys Gly Tyr Phe Asp
    50                  55                  60
Asp Tyr Gln Leu Lys Gly Phe Ser Glu Thr Arg Ala Ser Val Ser Glu
65                  70                  75                  80
Leu Asp Ser Ile Val Leu Lys Ser Leu Met Lys Asn Gly Leu Thr Pro
                85                  90                  95
Val Lys Ile Gly Thr Phe Ser Asn Phe Ile Thr Ser Asn Gly Arg Ile
            100                 105                 110
Val Glu Phe His Lys Glu Pro Leu Leu Arg Ala Ile Glu Leu Gly Leu
        115                 120                 125
Leu Pro Val Phe Thr Gly Asp Leu Val Phe Asp Arg Thr Arg Val Phe
    130                 135                 140
Ser Ile Leu Ser Gly Asp Gln Ile Val Ser Tyr Leu Ser Arg Leu Leu
145                 150                 155                 160
Lys Pro Ser Arg Val Val Phe Gly Thr Asp Val Asp Gly Ile Tyr Thr
                165                 170                 175
Gly Asp Pro Lys Lys Glu Asn Val Lys Leu Ile Asp Thr Val Thr Glu
            180                 185                 190
Glu Asn Ile Lys Glu Val Phe Lys Phe Ala Lys Asp Thr Gly Asp Ala
        195                 200                 205
Ser Gly Gly Met Glu Gly Lys Leu Ser Glu Ile Leu Pro Ile Phe Asp
    210                 215                 220
Met Gly Ile Glu Ile Asp Val Ile Asn Leu Thr Lys Lys Gly Asn Leu
225                 230                 235                 240
Ala Glu Thr Leu Arg Gly Asn Val Lys Gly Thr Val Ile Lys Lys Lys
                245                 250                 255
Asn Ile Ser Lys
            260

<210> 253
<211> 783
<212> DNA
<213> Methanofastidiosum methylthiophilus

<400> 253
atgattatta taaaagcagg cggttcggca attacaaaaa aaagcgaaga ttttacccca     60
aatatggaag taatttctaa tcttgctcaa gagatcaaag aggcaggaag agtatccatc    120
cttgttcatg gtgctggatc ttatgggcac ccgattgcaa agaaatattc tctagggaaa    180
ggttattttg atgattacca acttaaagga ttttctgaaa caagagccag tgttagtgaa    240
cttgacagca tagttctaaa atctcttatg aaaaatggac taactcctgt taaaataggc    300
actttttcaa attttatcac atctaatggt agaattgttg aattccataa agaacctcta    360
ctaagagcta ttgaacttgg gctattacct gtgtttactg gcgatttagt ctttgataga    420
actagagttt tctcaatatt atcgggagat caaatagtct catatctatc aagattgtta    480
aaaccttcaa gagttgtatt tgggactgat gtcgatggca tatatacagg agatccaaag    540
aaagaaaatg tcaagttgat tgatacagtt acagaagaaa atattaaaga agtatttaag    600
tttgcgaaag atactggaga cgcttctggc ggtatggaag gtaaactatc tgagatatta    660
cccatatttg atatgggtat tgagatagat gttattaatt taactaaaaa aggtaattta    720
gctgaaacac ttaggggaaa tgtaaaagga actgtaataa aaaagaagaa tatctcaaaa    780
taa                                                                  783

<210> 254
<211> 793
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 254
atgatcatca tcaaggccgg cggtagtgcc attaccaaaa aaagtgaaga cttcaccccg     60
aatatggaag tgattagtaa tctggcacag gaaattaaag aagcaggccg cgttagtatt    120
ctggtgcatg gtgccggcag ttatggccat ccgattgcaa aaaaatatag cctgggcaaa    180
ggctacttcg atgattatca gctgaaaggc ttcagtgaaa ccagagcaag cgtgagcgaa    240
ctggatagta ttgttctgaa aagcctgatg aaaaatggcc tgaccccggt taaaattggc    300
accttcagca acttcattac cagcaatggt cgcattgttg agttccataa agaaccgctg    360
ctgcgtgcaa ttgaactggg cctgctgccg gtgttcaccg gcgatctggt gttcgatcgc    420
acccgtgtgt tcagcattct gagcggtgat cagattgtta gttatctgag tcgtctgctg    480
aaaccgagtc gtgttgtgtt cggtaccgat gttgatggca tctataccgg cgatccgaaa    540
aaagaaaatg ttaaactgat cgataccgtg accgaagaaa atattaaaga agtgttcaag    600
ttcgcaaagg ataccggtga tgcaagtggc ggcatggaag gtaaactgag cgaaattctg    660
ccgatcttcg atatgggtat tgaaattgat gtgattaacc tgaccaaaaa aggtaatctg    720
gccgaaacct tacgtggtaa tgttaaaggt accgttatta aaaagaagaa catcagtaag    780
tgactgaaag ctt                                                       793

<210> 255
<211> 255
<212> PRT
<213> Syntrophoarchaeum butanivorans

<400> 255
Met Gly Lys Gly Leu Thr Ile Leu Lys Ile Gly Gly Ser Val Ile Thr
1               5                   10                  15
Cys Lys Asn Asp Glu Lys Arg Leu Arg Glu Asp Thr Ile Asn Gln Val
            20                  25                  30
Val Arg Glu Ile Ala Asp Ala Arg Thr Asp His Leu Ile Leu Val His
        35                  40                  45
Gly Ala Gly Ser Phe Gly His Pro Gln Ala Ala Lys His Ser Glu Phe
    50                  55                  60
Gly Glu Asp Leu Val Lys Asn Ala Phe Ala Val Phe Asp Ile Asn Thr
65                  70                  75                  80
Thr Val Met Glu Leu Asn Thr Ile Leu Val Ser Ser Met Ile Ser Gln
                85                  90                  95
Gly Leu Pro Ala Val Ala Leu His Pro Met Asn Phe Thr Ile Leu Glu
            100                 105                 110
Asp Gly Arg Ile Tyr Ser Met Met Thr Thr Gln Ile Glu Glu Met Leu
        115                 120                 125
Asp Lys Gly Phe Val Pro Val Leu His Gly Asp Ile Val Phe Asp Arg
    130                 135                 140
Lys Lys Gly Tyr Ala Ile Leu Ser Gly Asp Gln Ile Val Thr Tyr Leu
145                 150                 155                 160
Ala Arg Asp Leu Gln Ala Gln Arg Val Gly Leu Gly Val Asp Val Asp
                165                 170                 175
Gly Val Ile Gly Ser Asp Gly Ser Val Met Glu Val Ile Thr Pro Gln
            180                 185                 190
Asn Val Asp Glu Ile Ile Phe Asp Lys Gly Ala Asp Leu Asp Val Thr
        195                 200                 205
Gly Ala Met Glu Gly Lys Val Arg Glu Leu Leu Glu Leu Ala Ser Tyr
    210                 215                 220
Gly Ile Ser Ser Cys Ile Phe Asn Gly Thr Lys Lys Gly Tyr Ile Arg
225                 230                 235                 240
Arg Trp Leu Lys Gly Glu Lys Ile Pro Ser Thr Ile Ile Ser Glu
                245                 250                 255

<210> 256
<211> 768
<212> DNA
<213> Syntrophoarchaeum butanivorans

<400> 256
atggggaagg gtctgacgat tctcaagatc ggaggaagtg tgattacctg taagaacgat     60
gaaaagcggc tcagagagga taccatcaat caggttgtga gagagatcgc agatgccagg    120
accgatcatc taattctcgt gcatggtgca ggctcgttcg gtcacccaca ggctgcaaag    180
cactcagagt tcggggagga tctggtaaaa aatgcgtttg cagtctttga tataaatacg    240
accgtgatgg agcttaacac catcctcgtc tcaagtatga tctcacaggg tcttcctgcg    300
gttgcgcttc acccgatgaa cttcacaatc cttgaggacg ggaggattta ctcgatgatg    360
actactcaga tcgaggagat gcttgataag ggtttcgtcc cggttctgca tggtgatatt    420
gttttcgatc gtaagaaggg gtatgccatc ctctcagggg atcagatcgt gacatatctt    480
gcgagagatc ttcaggcaca gagggtaggg cttggagtcg atgttgatgg ggtcatcggg    540
agtgatggga gtgttatgga ggtcatcaca ccgcagaatg tggatgagat aatcttcgac    600
aagggggcag atcttgatgt taccggggca atggagggaa aggtcagaga gcttcttgag    660
cttgcctcgt atggaataag ctcgtgtatc ttcaatggga caaagaaagg ctacatcaga    720
cgatggctta aaggggagaa gataccatcc acaatcatat cagagtag                 768

<210> 257
<211> 778
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 257
atgggtaaag gcctgaccat tctgaaaatt ggcggtagcg tgattacctg taaaaatgat     60
gaaaaacgcc tgcgtgaaga taccattaat caggttgtgc gcgaaattgc agatgcccgc    120
accgatcatc tgattctggt gcatggcgca ggcagcttcg gtcatccgca ggcagcaaaa    180
catagcgagt tcggtgaaga cttagttaaa aatgcattcg ccgtgttcga tattaatacc    240
accgtgatgg aactgaatac cattctggtg agtagcatga ttagccaggg cctgccggca    300
gtggccctgc atcctatgaa cttcaccatt ctggaagatg gccgtatcta tagtatgatg    360
accacccaga ttgaagaaat gctggataaa ggcttcgtgc cggttctgca tggtgatatt    420
gtgttcgatc gcaaaaaagg ctatgcaatt ctgagcggcg atcagattgt tacctatctg    480
gcccgtgatc tgcaagccca gcgtgttggt ctgggcgtgg atgttgatgg tgttattggt    540
agcgatggca gtgtgatgga agtgattacc ccgcagaatg tggatgaaat tatcttcgat    600
aaaggcgcag acttagatgt taccggtgcc atggaaggta aagtgcgtga actgctggaa    660
ctggccagtt atggtattag cagttgcatc ttcaatggca ccaaaaaagg ttatattcgt    720
cgttggctga aaggtgaaaa aattccgagt accattatta gtgaatgact gaaagctt      778

<210> 258
<211> 254
<212> PRT
<213> Euryarchaeota archaeon

<400> 258
Met Met Leu Ile Lys Leu Gly Gly Ser Ala Ile Thr Asp Lys Ser Lys
1               5                   10                  15
Pro Leu Thr Ser Arg Ala Gly Asp Ile Lys Arg Leu Ala Met Glu Ile
            20                  25                  30
Ala Gly Ala Glu Gly Thr Lys Met Ile Val His Gly Gly Gly Ser Phe
        35                  40                  45
Gly His Ile Lys Ala Ala Glu Phe Lys Leu Asn Glu Gly Phe Val Asp
    50                  55                  60
Asp Ser Gln Arg Glu Gly Ile Cys Leu Val Gln Lys Asp Met Arg Lys
65                  70                  75                  80
Leu Asn Ala Ile Val Val Asp Ala Phe Arg Glu Ala Gly Val Pro Val
                85                  90                  95
Ala Ser Val Pro Ala Gly Ala Ile Thr Leu Phe Asp Asn Gly Gln Met
            100                 105                 110
Val Lys Phe Pro Ser Glu Val Phe Ile His Tyr Val Lys Leu Gly Ile
        115                 120                 125
Val Pro Ile Thr Phe Gly Asp Val Val Val Asp Arg Ala Arg Gly Ile
    130                 135                 140
Ser Ile Cys Ser Gly Asp Asp Ile Met Leu Gln Leu Ala Lys Asp Thr
145                 150                 155                 160
Asp Ala Val Lys Cys Val Phe Val Thr Ser Val Asp Gly Ile Phe Glu
                165                 170                 175
Ser Tyr Pro Pro Gly Lys Asp Glu Glu Pro Leu Ser Glu Val Gly Pro
            180                 185                 190
Asp Thr Val Ile Arg Phe Ser Ser Glu Asp Val Asp Val Thr Gly Ser
        195                 200                 205
Met Lys Arg Lys Leu Asp Leu Met Ile Glu Met Ala Ser Ser Gly Lys
    210                 215                 220
Glu Val Ala Val Val Asn Gly Leu Val Pro Asp Arg Leu Thr Asp Ala
225                 230                 235                 240
Leu Lys Gly Asn Asp Phe Ile Gly Thr Arg Val Lys Gly Asp
                245                 250

<210> 259
<211> 765
<212> DNA
<213> Euryarchaeota archaeon

<400> 259
atgatgctca tcaagctggg tggcagcgcc atcacagata aatccaagcc cctgacctca     60
cgtgccgggg acatcaagag gctggcgatg gagatcgccg gcgccgaggg tacaaagatg    120
atcgtgcacg gcgggggctc cttcggccac atcaaggctg ccgagttcaa gctcaacgag    180
ggctttgtcg acgacagcca gagagagggc atctgcctag tccagaagga tatgcgcaag    240
ttgaacgcca tcgtcgtcga tgcattcagg gaggcgggtg tccccgtggc atccgtcccg    300
gccggggcca taacgctctt cgacaacggc caaatggtta aattcccctc cgaggtcttt    360
atccattacg tgaagctggg catcgttccg attaccttcg gcgacgttgt cgtcgacagg    420
gctaggggca tatcgatatg ctcgggagac gacataatgc tccagcttgc caaggatacc    480
gatgctgtaa aatgcgtgtt cgtgacctcg gtggacggca tcttcgaatc ctatccccca    540
gggaaggacg aggagcctct cagcgaggtc ggtccggaca ccgtcatcag gttcagtagc    600
gaggacgtgg acgtgacggg cagcatgaag cgcaaactcg acctgatgat cgagatggcc    660
tcgtcaggta aggaggtggc cgtcgtgaac ggtcttgtgc ctgacagatt gaccgatgca    720
ttgaagggta atgacttcat tggcacccgg gtgaagggtg attga                    765

<210> 260
<211> 775
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 260
atgatgctga ttaagctggg cggtagcgca attaccgata aaagtaaacc gctgaccagt     60
cgcgcaggcg atattaaacg cctggccatg gaaattgcag gcgccgaagg caccaaaatg    120
attgtgcatg gtggtggcag cttcggtcat attaaagcag cagagttcaa actgaatgaa    180
ggcttcgtgg atgatagtca gcgcgaaggc atctgtctgg tgcagaaaga tatgcgtaaa    240
ctgaatgcca ttgtggtgga tgccttccgc gaagcaggcg tgccggttgc cagcgtgccg    300
gcaggtgcta ttaccctgtt cgataatggt cagatggtta aattcccgag tgaagtgttc    360
attcattatg ttaaactggg tattgtgccg attaccttcg gtgatgtggt ggttgatcgt    420
gcccgtggca ttagcatctg tagcggtgat gatattatgc tgcaactggc aaaagatacc    480
gatgccgtta aatgcgtgtt cgttaccagt gtggatggta tcttcgaaag ctatccgccg    540
ggcaaagatg aagaaccgct gagtgaagtg ggcccggata ccgtgattcg cttcagtagc    600
gaagatgtgg atgttaccgg tagtatgaaa cgtaaactgg atctgatgat tgaaatggca    660
agcagtggca aagaagttgc cgtggttaat ggcctggttc cggatcgcct gaccgatgca    720
ctgaaaggta atgacttcat tggcacccgc gttaaaggtg attgactgaa agctt         775

<210> 261
<211> 239
<212> PRT
<213> Archaeoglobus profundus

<400> 261
Met Ile Val Val Lys Ile Gly Gly Ser Ala Ile Thr Asp Lys Lys Gly
1               5                   10                  15
Phe Lys Ile Val Lys Ile Asp Ser Ile Glu Arg Val Ala Lys Asp Ile
            20                  25                  30
Ala Glu Val Arg Pro Arg Lys Leu Ile Leu Val His Gly Val Gly Ser
        35                  40                  45
Phe Gly His Pro Phe Val Val Lys Tyr Arg Leu Lys Glu Glu Lys Asn
    50                  55                  60
Leu Glu Gly Val Val Arg Ala His Met Ser Cys Lys Glu Leu Asn Ala
65                  70                  75                  80
Met Ile Cys Glu Ala Met Leu Met Tyr Gly Leu Lys Pro Phe Pro Val
                85                  90                  95
His Pro Leu Leu Thr Phe Lys Leu Arg Gly Gly Lys Ile Thr Phe Asp
            100                 105                 110
Ile Asp Ile Phe Glu Lys Ala Leu Glu Glu Gly Phe Ile Pro Val Thr
        115                 120                 125
His Gly Asp Met Val Tyr Asp Val Glu Asp Arg Phe Phe Lys Val Leu
    130                 135                 140
Ser Gly Asp Asp Ile Thr Leu Lys Leu Ala Lys Ala Phe Lys Ala Glu
145                 150                 155                 160
Lys Ile Gly Phe Ala Thr Asp Val Glu Gly Val Tyr Val Asp Gly Lys
                165                 170                 175
Leu Ala Asp Val Val Thr Trp Lys Asp Leu Asp Lys Ile Gly Phe Ser
            180                 185                 190
Lys Gly Val Asp Val Thr Gly Gly Met Arg Ser Lys Val Glu Lys Ile
        195                 200                 205
Leu Arg Ser Gly Val Asn Ala Arg Ile Phe Ser Ile Ser Lys Phe Lys
    210                 215                 220
Gly Phe Leu Ser Cys Glu Glu Val Gly Thr Leu Val Lys Ser Asp
225                 230                 235

<210> 262
<211> 720
<212> DNA
<213> Archaeoglobus profundus

<400> 262
atgatagtag tcaagattgg tggatctgca ataacagata agaaggggtt caaaatagtt     60
aagatcgatt ctatagagag agttgctaaa gatattgccg aagtaagacc cagaaaactt    120
attcttgttc acggtgtggg ttcctttggt cacccctttg tagttaagta caggcttaag    180
gaggagaaaa acttggaggg tgttgttaga gcgcatatgt catgcaagga gctgaacgct    240
atgatttgtg aggctatgct catgtatggc ttgaaacctt ttccagttca tccccttcta    300
acgttcaaac ttcgtggggg caagataaca tttgacatcg acatcttcga aaaggcactt    360
gaggaaggat tcatacccgt aactcatggg gacatggtct acgatgtgga ggacagattt    420
ttcaaggttc tttccggcga tgacataacg ctgaagcttg ccaaggcttt taaggctgaa    480
aagattggtt ttgctactga tgttgaagga gtttacgttg acgggaaatt agctgatgtg    540
gtgacttgga aagatttgga caagattgga ttctccaaag gagttgatgt gaccggtggg    600
atgagaagca aggtcgagaa gattttgagg agcggtgtga atgctagaat ttttagcatt    660
tcaaagttta aggggtttct aagctgtgaa gaagtaggaa cgctcgttaa atctgactga    720

<210> 263
<211> 730
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 263
atgatcgtgg tgaaaattgg tggcagcgcc attaccgata aaaaaggctt caaaattgtg     60
aaaatcgaca gcattgaacg cgttgcaaaa gatattgcag aagttcgtcc gcgtaaactg    120
attctggttc atggtgtggg tagcttcggc catccgttcg tggttaaata tcgtctgaaa    180
gaagaaaaaa acctggaagg tgttgtgcgc gcacacatga gttgcaaaga actgaatgca    240
atgatctgtg aagccatgct gatgtatggc ctgaaaccgt tcccggttca tccgctgctg    300
accttcaaac tgcgcggtgg caaaattacc ttcgatattg atatcttcga gaaagcactg    360
gaagaaggct tcattccggt gacccatggt gatatggtgt atgatgtgga agatcgcttc    420
ttcaaagtgc tgagtggcga tgatattacc ctgaaactgg ccaaagcctt caaagcagaa    480
aaaattggct tcgccaccga tgtggaaggc gtgtatgtgg atggcaaact ggccgatgtt    540
gtgacctgga aagacttaga taaaattggc ttcagcaaag gcgttgatgt taccggtggc    600
atgcgcagca aagtggaaaa aattctgcgt agcggcgtta atgcacgtat cttcagtatt    660
agcaaattca aaggcttcct gagctgtgaa gaagttggca ccctggtgaa aagtgattga    720
ctgaaagctt                                                           730

<210> 264
<211> 253
<212> PRT
<213> Halopenitus malekzadehii

<400> 264
Met Thr Gly Glu Thr Val Val Cys Lys Leu Gly Gly Ser Val Ile Thr
1               5                   10                  15
Glu Lys Asp Arg Ser Glu Thr Leu Asp Ala Pro Ala Leu Asp Ala Ala
            20                  25                  30
Cys Asp Ala Ile Ala Gly Val Leu Ala Asp Asp Ala Ile Asp Arg Leu
        35                  40                  45
Val Val Val His Gly Gly Gly Ser Phe Gly His His His Ala Ser Ala
    50                  55                  60
His Gly Met Thr Thr Thr Ala Gly Thr His Asn Val Asp Ala Val Met
65                  70                  75                  80
Asp Val His Gly Ala Met Thr Thr Leu Asn Arg Phe Val Leu Asp Arg
                85                  90                  95
Leu His Glu Arg Asn Val Pro Ala Leu Pro Val His Pro Leu Ser Val
            100                 105                 110
Gly Ala Arg Thr Gly Gly Pro Asp Gly Glu Leu Thr Leu Pro Ser Glu
        115                 120                 125
Pro Ala Ala Thr Leu Leu Ala Glu Gly Phe Val Pro Val Leu His Gly
    130                 135                 140
Asp Gly Val Ala Thr Ala Ser Glu Gly Val Thr Val Ile Ser Gly Asp
145                 150                 155                 160
Glu Leu Val Val Glu Leu Ala Ala Asp Ile Asp Ala Asp Arg Val Gly
                165                 170                 175
Leu Cys Ser Thr Val Pro Gly Val Leu Asp Gly Asp Gly Asp Val Val
            180                 185                 190
Pro Arg Ile Asp Ala Phe Glu Asp Val Ala Asp Leu Leu Gly Ala Ser
        195                 200                 205
Glu Ser Thr Asp Val Ser Gly Gly Met Ala Ala Lys Val Glu Glu Leu
    210                 215                 220
Leu Gly Leu Gly Ser Pro Ala Phe Ile Phe Gly Pro Asp Asp Leu Glu
225                 230                 235                 240
Gly Phe Leu Arg Gly Asp Ser Pro Gly Thr Arg Ile Gly
                245                 250

<210> 265
<211> 762
<212> DNA
<213> Halopenitus malekzadehii

<400> 265
atgacgggtg agacggtcgt ctgcaagctc ggcggcagcg tgatcaccga gaaggatcgg     60
tccgagacgc tcgatgcgcc cgcgctcgat gctgcctgcg acgcgatcgc cggggtgctg    120
gcggacgacg ccatcgaccg actcgtcgtc gttcacggtg gcgggagctt cggccaccat    180
cacgccagcg cacacgggat gaccacgacc gcggggaccc ataacgtcga cgccgtgatg    240
gacgttcacg gcgcgatgac gacgctcaac cggttcgtcc tcgaccggct gcacgaacgg    300
aacgttccgg cgcttccagt ccacccgcta tcggtcggcg cacggacggg cgggccggac    360
ggggagctga cgctgccgtc ggagcccgcc gcgacgctgc ttgcggaggg gtttgttccg    420
gtgttacacg gtgacggggt cgccaccgcg agcgagggcg tgacggtgat ctcgggcgac    480
gaactggtcg tcgagctggc tgccgacatc gatgcggacc gggtcggcct ctgttcgacg    540
gttcccggcg tcctcgacgg cgacggtgac gtggttcccc ggatcgacgc cttcgaagac    600
gtcgccgacc tgctcggcgc gagcgagtcg acggacgtct cgggcgggat ggcggcgaag    660
gtcgaggagc tgctcgggct cggatcgccg gcgttcatct tcggcccgga cgacctcgag    720
ggatttctgc gaggcgactc gcccgggacg cggatcgggt ag                       762

<210> 266
<211> 772
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 266
atgaccggcg aaaccgttgt gtgtaaactg ggcggtagcg ttattaccga aaaagatcgc     60
agtgaaacct tagatgcccc ggccctggat gccgcatgtg atgcaattgc aggtgtgctg    120
gcagatgatg ccattgatcg tctggtggtg gttcatggcg gtggtagctt cggccatcat    180
catgcaagcg cacatggcat gaccaccacc gcaggtaccc ataatgttga tgccgttatg    240
gatgtgcatg gtgcaatgac caccctgaat cgcttcgttc tggatcgtct gcatgaacgt    300
aatgttccgg ccctgccggt gcatccgctg agcgttggtg cacgtaccgg cggtccggat    360
ggtgaactga ccctgccgag tgaaccggcc gccaccctgc tggcagaagg cttcgttccg    420
gtgctgcatg gcgatggtgt tgcaaccgca agcgaaggcg tgaccgttat tagcggcgat    480
gaactggttg tggaactggc cgccgatatt gatgccgatc gtgttggtct gtgtagtacc    540
gttccgggtg ttctggatgg tgatggtgat gtggtgccgc gtattgatgc cttcgaagat    600
gttgcagact tactgggtgc cagtgaaagc accgatgtta gtggtggcat ggcagccaaa    660
gtggaagaac tgctgggcct gggtagtccg gccttcatct tcggtccgga tgatctggaa    720
ggcttcctgc gcggcgatag cccgggcaca cgcattggtt gactgaaagc tt            772

<210> 267
<211> 247
<212> PRT
<213> Methanosarcinales archaeon

<400> 267
Met Lys Ser Ile Leu Leu Lys Ile Gly Gly Ser Val Leu Thr Asp Lys
1               5                   10                  15
Glu Arg Glu Cys Thr Leu Arg Glu Ser Glu Ile Glu Arg Ile Ala Gly
            20                  25                  30
Glu Ile Lys Gly Ala Arg Ala Ser Val Val Ile Val His Gly Ala Gly
        35                  40                  45
Ser Phe Gly His Pro Gln Ala Arg Glu His Asn Leu Gly Glu Asp Pro
    50                  55                  60
Thr Thr Arg Gly Leu Ile Glu Thr His Arg Ala Val Met Lys Leu Asn
65                  70                  75                  80
Asn Ala Phe Ile Glu Ala Leu Asn Arg Ala Asp Val Asp Ala Ile Gly
                85                  90                  95
Val His Pro Leu Asp Phe Ile Met Val Glu Glu Arg Arg Val Thr His
            100                 105                 110
Leu Asp Cys Arg Val Leu Glu Ser Met Ile Glu Phe Gly Leu Thr Pro
        115                 120                 125
Val Leu His Gly Asp Val Val Ile Asp Ser Lys Arg Gly Ala Ser Val
    130                 135                 140
Ile Ser Gly Asp Gln Ile Leu Arg Glu Leu Gly Ile Arg Leu Asp Val
145                 150                 155                 160
Ser Met Val Gly Ala Gly Thr Asn Val Asp Gly Val Phe Asp Glu Thr
                165                 170                 175
Gly Arg Thr Ile Pro Lys Leu Asn Arg Ile Pro Gln Gln Ile His Pro
            180                 185                 190
Ser Val Ile Glu Asp Val Thr Gly Glu Met Lys Gly Lys Val Lys Glu
        195                 200                 205
Leu Ile Glu Leu Ala Glu Asn Gly Val Glu Ser Val Ile Phe Asn Ala
    210                 215                 220
Ser Met Lys Arg Lys Val Tyr Asp Phe Leu Arg Gly Lys Asp Val Gly
225                 230                 235                 240
Gly Thr Gln Ile Thr Leu Gln
                245

<210> 268
<211> 744
<212> DNA
<213> Methanosarcinales archaeon

<400> 268
gtgaagtcta tcctattgaa gatcggtgga agcgtgctga ccgataagga gcgtgagtgc     60
acgcttcgag aatctgagat tgagcggatt gcaggcgaaa ttaaaggtgc cagagccagt    120
gttgtgatcg ttcacggtgc tggctcgttc ggacacccac aggcaagaga gcacaacctc    180
ggggaagatc cgacaacgag gggtttgatt gagacgcaca gagcggtcat gaagctgaac    240
aacgccttca tcgaggctct caaccgagca gacgtcgatg caatcggcgt acacccactt    300
gacttcataa tggttgaaga gaggcgtgtt acacaccttg attgcagggt tcttgagagc    360
atgatcgagt tcgggctcac accagttctc cacggagatg ttgttataga ctcgaagagg    420
ggtgcatctg tgatctcagg agaccagatc ctaagagagc ttggaattag actggatgtc    480
tcaatggtcg gcgccgggac caacgttgat ggcgtgttcg acgaaacggg cagaacgatt    540
ccgaagttga atcggatccc tcagcagata cacccctcag ttatagagga tgtgacaggc    600
gagatgaaag gaaaagtaaa agaactcata gaacttgcag aaaacggcgt agaatctgtt    660
atattcaacg catccatgaa gaggaaggtt tacgattttc tcagaggaaa agatgttggc    720
ggaacacaga taactctaca gtaa                                           744

<210> 269
<211> 754
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 269
atgaagagca ttctgctgaa aattggcggt agcgtgctga ccgataaaga acgtgaatgc     60
accctgcgtg aaagcgaaat tgaacgcatt gcaggcgaaa ttaaaggtgc acgcgccagc    120
gttgtgattg ttcatggcgc cggcagcttc ggccatccgc aggcacgtga acataatctg    180
ggtgaagatc cgaccacccg tggtctgatt gaaacacatc gtgcagtgat gaaactgaat    240
aatgccttca ttgaagcact gaatcgcgca gatgttgatg caattggtgt tcatccgctg    300
gacttcatta tggttgaaga acgccgcgtg acccatctgg attgtcgcgt tctggaaagc    360
atgattgagt tcggcctgac cccggttctg catggtgatg ttgtgattga tagtaaacgt    420
ggtgccagcg tgattagtgg cgatcagatt ctgcgcgaac tgggcattcg cctggatgtg    480
agcatggttg gtgcaggtac caatgttgat ggcgtgttcg atgaaaccgg ccgcaccatt    540
ccgaaactga atcgtattcc gcagcagatt catccgagtg ttattgaaga tgttaccggc    600
gaaatgaaag gtaaagtgaa agaactgatt gaactggcag aaaatggtgt tgaaagtgtg    660
atcttcaatg caagtatgaa acgcaaagtg tatgacttcc tgcgtggtaa agatgtgggc    720
ggcacccaga ttaccctgca atgactgaaa gctt                                754

<210> 270
<211> 248
<212> PRT
<213> Euryarchaeota archaeon

<400> 270
Met Met Leu Val Lys Leu Gly Gly Ser Val Ile Thr Asp Lys Ser Lys
1               5                   10                  15
Lys Tyr Val Phe Arg Glu Lys Thr Val Arg Arg Leu Ala Glu Glu Ile
            20                  25                  30
Lys Asn Ser Gly Glu Lys Val Ile Val Val His Gly Ala Gly Ser Phe
        35                  40                  45
Gly His Ile Leu Ala Lys Lys Tyr Arg Leu Asp Lys Gly Leu Ile His
    50                  55                  60
Asp Arg Gln Val Lys Gly Val Ala Val Val Gln Arg Asp Val Lys Asn
65                  70                  75                  80
Leu Asn Leu Lys Val Val Asn Cys Leu Ile Gly Ala Gly Met Asn Pro
                85                  90                  95
Val Ser Ile Pro Pro Ser Ser Val Ala Glu Cys Arg Asn Lys Lys Ile
            100                 105                 110
Glu Lys Ile Asn Leu Asp Val Phe Lys Lys Tyr Leu Asn Leu Gly Leu
        115                 120                 125
Thr Pro Val Thr Phe Gly Asp Val Ala Leu Asp Arg Lys Leu Ser Phe
    130                 135                 140
Cys Ile Val Ser Gly Asp Leu Leu Met Leu Glu Leu Ala Arg Val Phe
145                 150                 155                 160
Lys Pro Lys Lys Ser Ile Phe Val Ser Asp Val Asp Gly Val Tyr Ile
                165                 170                 175
Asn Gly Lys Met Val Glu Lys Val Asp Arg Lys Ile Ile Gln Asn Leu
            180                 185                 190
Glu Arg Lys Lys Thr Lys Ile Ser Asp Val Thr Gly Ser Met Tyr Glu
        195                 200                 205
Lys Val Lys Ile Ala Leu Lys Met Ser Glu Phe Thr Lys Thr Val Ile
    210                 215                 220
Ile Asn Gly Asn Val Lys Gly Arg Leu Gly Asp Thr Leu Lys Gly Lys
225                 230                 235                 240
Lys Val Val Gly Thr Val Val Gly
                245

<210> 271
<211> 747
<212> DNA
<213> Euryarchaeota archaeon

<400> 271
atgatgcttg ttaaacttgg cggctctgtt ataacagaca aatcaaaaaa atatgttttc     60
agagaaaaaa ctgtgagaag actggcagag gagataaaaa actcaggtga aaaggttatt    120
gttgttcacg gtgctggctc ttttgggcac atacttgcaa aaaaatacag gctggataaa    180
ggcctcattc atgataggca ggttaagggt gttgctgttg ttcaaagaga tgtgaagaat    240
ttgaatctaa aggttgttaa ctgcctgata ggtgcaggta tgaaccctgt ttctatccct    300
ccatcatctg tcgctgaatg cagaaacaaa aagattgaaa aaataaatct tgatgttttc    360
aaaaaatatc ttaaccttgg tttgacacct gtcacgttcg gtgatgtcgc gcttgacagg    420
aaactcagtt tctgcatagt ctcaggtgat ttactcatgc ttgaacttgc cagagtgttt    480
aaaccaaaaa aatctatttt tgtttctgat gttgacggtg tttacataaa tgggaagatg    540
gtggaaaagg ttgacagaaa aattattcag aatctggaaa gaaaaaaaac aaaaatttct    600
gatgttacag gctcaatgta tgaaaaggta aaaatagcat taaaaatgtc agaatttaca    660
aaaacagtaa taatcaatgg aaacgttaaa ggcagattag gggatacatt aaagggaaaa    720
aaggttgttg gtacggtggt aggatga                                        747

<210> 272
<211> 757
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 272
atgatgctgg ttaaactggg cggtagtgtt attaccgata aaagcaaaaa atacgtgttc     60
cgtgaaaaaa ccgttcgtcg tctggccgaa gaaattaaaa atagtggtga aaaagtgatc    120
gtggtgcatg gtgcaggcag cttcggccat attctggcca aaaaatatcg cctggataaa    180
ggcctgattc atgatcgtca ggtgaaaggt gtggcagttg tgcagcgcga tgttaaaaat    240
ctgaatctga aagttgtgaa ctgcctgatt ggcgcaggca tgaatccggt tagtattccg    300
ccgagcagtg tggcagaatg ccgcaataaa aaaattgaaa aaatcaacct ggacgtgttc    360
aaaaaatatc tgaatctggg cctgaccccg gttaccttcg gcgatgtggc cctggatcgt    420
aaactgagct tctgcattgt gagcggtgat ctgctgatgc tggaactggc ccgcgtgttc    480
aaaccgaaaa aaagcatctt cgtgagtgat gttgatggtg tgtatattaa tggcaaaatg    540
gtggaaaaag tggatcgcaa aattattcag aatctggaac gtaaaaagac caaaattagc    600
gatgtgaccg gcagtatgta tgaaaaagtt aaaattgccc tgaagatgag cgagttcacc    660
aaaaccgtta ttattaatgg caatgtgaag ggccgcctgg gcgataccct gaaaggtaaa    720
aaagtggttg gtaccgttgt gggctgactg aaagctt                             757

<210> 273
<211> 251
<212> PRT
<213> Methanothermus fervidus

<400> 273
Met Ile Ile Ile Lys Ile Gly Gly Ser Val Ile Thr Asp Lys Asn Ser
1               5                   10                  15
Lys Tyr Pro Lys Leu Asn Lys Lys Asn Leu Lys Arg Val Cys Lys Glu
            20                  25                  30
Ile Cys Glu Val Met Pro Phe Pro Leu Ile Leu Val His Gly Ala Gly
        35                  40                  45
Ser Phe Gly His Pro Ile Val Lys Lys Tyr Asp Ile Ile Ala Asn Pro
    50                  55                  60
Asn Lys Lys Gly Phe Cys Ile Val His Tyr Trp Val Lys Lys Leu Asn
65                  70                  75                  80
Leu Tyr Val Cys Arg Tyr Leu Leu Lys Tyr Gly Met Asp Val Val Ser
                85                  90                  95
Ile Gln Pro Ser Ser Cys Ile Ile Ala Ser Asp Gly Phe Ile Asp Tyr
            100                 105                 110
Phe Asn Val Lys Ile Ile Glu Arg Tyr Leu Glu Lys Glu Ile Val Pro
        115                 120                 125
Val Leu Tyr Gly Asp Ile Val Leu Asp Lys Ser Leu Glu Phe Ser Val
    130                 135                 140
Ile Ser Gly Asp Gln Ile Val Arg Tyr Leu Gly Glu Lys Met Lys Ala
145                 150                 155                 160
Asn Lys Ile Ile Leu Ala Thr Asp Val Asp Gly Val Tyr Asp Lys Asp
                165                 170                 175
Pro Lys Lys His Lys Asp Ala Lys Leu Ile Lys Arg Ile Lys Pro Glu
            180                 185                 190
Asp Lys Ile Lys Leu Lys Asp Phe Lys Glu Asp Val Thr Gly Gly Met
        195                 200                 205
Ala Gly Lys Val Ser Glu Leu Leu Lys Leu Ala Glu Lys Gly Val Lys
    210                 215                 220
Ser Glu Ile Val Asn Ala Lys Lys Lys Asn Arg Leu Lys Lys Leu Leu
225                 230                 235                 240
Leu Gly Lys Arg Val Arg Arg Thr Ile Ile Gly
                245                 250

<210> 274
<211> 756
<212> DNA
<213> Methanothermus fervidus

<400> 274
atgattatta ttaagatagg tggtagtgtt atcactgata aaaattcaaa atatcctaaa     60
ttaaataaaa aaaatttgaa aagggtttgt aaggaaattt gtgaagttat gcccttccct    120
ttaattttgg tacatggggc aggatcattt ggccatccaa tcgtaaaaaa atatgatatt    180
attgctaatc ccaataaaaa aggtttttgt attgtccact attgggtaaa aaagttaaat    240
ctttatgttt gtaggtatct acttaaatat ggtatggatg tagtatccat acaaccttct    300
tcatgtatta tagcatctga tgggtttata gattatttta atgtaaaaat tattgagcga    360
tatttagaaa aagaaatcgt tcctgtgtta tatggggata tagtattaga taaatcatta    420
gagttttctg ttatttctgg agatcaaatt gtaagatatt taggagaaaa aatgaaagct    480
aataaaatta tattggctac agatgttgat ggtgtatatg ataaggatcc aaaaaaacat    540
aaagatgcaa aattaataaa aaggataaaa cctgaagata aaatcaaatt aaaagatttt    600
aaagaagacg taacaggagg tatggcaggc aaagtttcag aacttttaaa attagcagaa    660
aaaggtgtaa aatctgagat tgtaaatgca aaaaagaaaa acagattgaa aaaattacta    720
cttgggaaaa gagtccgtag aacaataata ggttaa                              756

<210> 275
<211> 766
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 275
atgatcatca tcaagatcgg tggtagcgtt attaccgata aaaatagtaa atacccgaag     60
ctgaataaga aaaatctgaa acgcgtgtgt aaagaaatct gtgaagtgat gccgttcccg    120
ctgattctgg tgcatggcgc aggcagcttc ggccatccga ttgtgaaaaa atatgatatt    180
atcgccaacc cgaataaaaa aggcttctgt attgttcatt actgggtgaa aaaactgaat    240
ctgtatgtgt gtcgttatct gctgaaatat ggtatggatg tggtgagtat tcagccgagc    300
agctgtatta ttgcaagcga tggcttcatt gattacttca atgtgaaaat tatcgagcgc    360
tatctggaaa aagaaattgt tccggtgctg tatggtgata ttgtgctgga taaaagcctg    420
gagttcagtg ttattagtgg cgatcagatt gttcgttatc tgggtgaaaa gatgaaagca    480
aataaaatta tcctggccac cgatgttgat ggcgtgtatg ataaagatcc gaaaaaacat    540
aaggacgcaa aactgattaa gcgcattaaa ccggaagata aaattaaact gaaggacttc    600
aaggaagatg ttaccggtgg catggcaggt aaagttagcg aactgctgaa actggcagaa    660
aaaggtgtga aaagcgaaat tgtgaatgca aaaaaaaaga accgcctgaa aaaactgctg    720
ctgggtaaac gcgtgcgtcg caccattatt ggttgactga aagctt                   766

<210> 276
<211> 270
<212> PRT
<213> Tumebacillus algifaecis

<400> 276
Met Asn Val Val Lys Ile Gly Gly Ser Leu Leu Thr Asp Lys Asp Gly
1               5                   10                  15
Tyr Cys Ala Pro Asn Gln Glu Met Val Arg Gln Tyr Ala Arg Thr Ile
            20                  25                  30
Ala Lys Glu Trp Glu Arg Leu Arg Gly Asn Leu Ile Leu Ile Val Gly
        35                  40                  45
Gly Gly Ser Tyr Gly Asn Ala Val Pro Val Arg Tyr His Leu Lys Asp
    50                  55                  60
Ala Ser Leu Pro Trp Lys Asp Thr Asp Leu Ser Met Met Thr Val Lys
65                  70                  75                  80
Met Phe Glu Trp Leu Ser Leu Val Thr Gln Ile Phe Arg Glu Glu Gly
                85                  90                  95
Val Pro Cys Tyr Pro Phe Gln Thr Ser Gly Tyr Val Val Thr Lys Asn
            100                 105                 110
Lys Arg Pro Gln Arg Phe Phe Val Glu Pro Val Glu His Val Leu Ser
        115                 120                 125
Met Gly Val Leu Pro Val Phe Ser Gly Asp Leu Val Phe Asp Glu Glu
    130                 135                 140
Gln Gln Phe Ile Ile Phe Ser Ser Asp Asn Leu Pro Glu Leu Phe Val
145                 150                 155                 160
Glu Arg Met Ser Leu Arg Arg Met Val Met Leu Thr Asp Val Glu Gly
                165                 170                 175
Val Met Gln Ile Gly Thr Asp Gly Gln Gln Thr Val Ile Pro Glu Val
            180                 185                 190
Thr Arg Ala Asn Phe Gln Glu Val Leu Arg Cys Ala Gly Pro Ser Gln
        195                 200                 205
Lys Pro Asp Ile Thr Gly Gly Met Lys Asn Lys Leu Glu Ala Leu Leu
    210                 215                 220
Arg Leu Ala Glu Gln Gly Val Glu Gly Val Ile Thr Ser Gly Arg Lys
225                 230                 235                 240
Ala Glu Ala Leu Leu Pro Ala Leu Phe Glu Pro Glu Pro Val Gly Thr
                245                 250                 255
Met Ile Arg Pro Trp Ala Gln Glu Asn Arg Gly Gly Leu Leu
            260                 265                 270

<210> 277
<211> 813
<212> DNA
<213> Tumebacillus algifaecis

<400> 277
atgaatgtag tcaagattgg agggagtttg ctgaccgata aggacggcta ctgtgcgccg     60
aatcaggaga tggtgcggca gtatgcgcgc acgatcgcca aggagtggga gcggctgcgt    120
ggcaacttga tcctgatcgt cggcggaggc tcgtatggga atgcagtgcc ggttcgctat    180
cacttgaagg atgcttccct gccgtggaaa gatacggacc tgtcgatgat gacggtgaag    240
atgtttgagt ggttgtcgct ggtgacccag atttttcggg aagagggagt gccttgctat    300
ccgttccaga ccagcgggta tgtggtgacg aaaaacaaac ggccccagcg tttttttgtg    360
gagccggtgg agcacgtgct gtcgatgggg gtgctgcccg ttttctcggg cgacctggtc    420
tttgatgagg agcagcagtt tatcattttt tcgagtgata atttgcccga gttgtttgtc    480
gagcggatgt ctctccggcg gatggtgatg ctgaccgatg ttgagggagt gatgcaaatc    540
ggaacggatg ggcagcagac ggtgattcct gaagtgacgc gtgcaaattt tcaggaggtg    600
ctgcgctgtg cggggccttc gcaaaagccg gatatcacag gcggaatgaa aaataagctg    660
gaagctctgt tgcgcttggc ggagcaggga gtggaaggtg tgatcaccag cggaaggaag    720
gcggaggcat tgctgccagc gttgtttgag cccgagcctg taggcacgat gattcggcct    780
tgggcacaag agaatagagg gggattgctg tag                                 813

<210> 278
<211> 823
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 278
atgaacgttg ttaagatcgg cggcagtctg ctgaccgata aagatggcta ttgtgccccg     60
aatcaggaaa tggtgcgtca gtatgcccgc accattgcaa aagaatggga acgcctgcgt    120
ggtaatctga ttctgattgt tggcggcggc agctatggta atgcagttcc ggttcgctat    180
catctgaaag atgccagtct gccgtggaaa gataccgatc tgagcatgat gaccgtgaaa    240
atgttcgaat ggctgagtct ggttacccag atattccgcg aagaaggcgt tccgtgttat    300
ccgttccaga ccagcggcta tgtggtgacc aaaaataaac gcccgcagcg cttcttcgtt    360
gaaccggtgg aacatgtgct gagtatgggt gttctgccgg tgttcagtgg tgatctggtg    420
ttcgatgaag aacagcagtt cattatcttc agtagcgata atctgccgga actgttcgtg    480
gaacgcatga gtctgcgccg tatggtgatg ctgaccgatg tggaaggtgt gatgcagatt    540
ggtaccgatg gccagcagac cgttattccg gaagttaccc gtgccaactt ccaggaagtt    600
ctgcgttgtg ccggcccgag tcagaaaccg gatattaccg gtggcatgaa aaataaactg    660
gaagccctgc tgcgcctggc agaacagggt gttgaaggtg ttattaccag tggtcgtaaa    720
gcagaagccc tgttaccggc actgttcgaa ccggaaccgg tgggtaccat gattcgtccg    780
tgggcccagg aaaatcgtgg cggcctgctg tgactgaaag ctt                      823

<210> 279
<211> 259
<212> PRT
<213> methanogenic archaeon

<400> 279
Met Ile Leu Ile Lys Phe Gly Gly Ser Val Ile Thr Asp Lys Ala Glu
1               5                   10                  15
Tyr Arg Lys Phe Asn Lys Glu Thr Val Ala Arg Leu Ala Asp Glu Ile
            20                  25                  30
Lys Arg Ser Gly Gln Glu Val Ile Ile Val His Gly Ala Gly Ser Phe
        35                  40                  45
Gly His Val Val Ser Lys Gln Tyr Asn Leu Gln Lys Gly Tyr Glu Asn
    50                  55                  60
Asp Ser Gln Ile Pro Ala Met Ala Arg Val Met Cys Asp Thr Arg Glu
65                  70                  75                  80
Leu Ser Ser Met Val Val Glu Glu Leu Leu Ala Lys Asn Ile Pro Ala
                85                  90                  95
Val Ser Val Pro Ile Gly Ser Cys Phe Val Ala Asp Gly Gly Lys Leu
            100                 105                 110
Val Val Asp Asn Glu Glu Pro Leu Arg Arg Leu Thr Asp Leu Gly Ile
        115                 120                 125
Met Pro Val Met Phe Gly Asp Val Ile Thr Asp Arg Lys Thr Arg Phe
    130                 135                 140
Cys Ile Val Ser Gly Asp Gln Val Met Glu Leu Leu Cys Trp Met Tyr
145                 150                 155                 160
Asn Pro Glu Lys Val Val Phe Val Ser Asp Ile Asp Gly Leu Tyr Asp
                165                 170                 175
Arg Asn Pro Lys Thr Asp Lys Ala Ala Arg Met Ile Gly Thr Val Thr
            180                 185                 190
Lys Glu Lys Met Ala Ser Ile Ala Thr Asp Ser Asn Val Asp Asp Val
        195                 200                 205
Thr Gly Gly Val Arg Asn Lys Met Glu Ala Met Leu Arg Met Thr Asp
    210                 215                 220
Gly Ser Arg Lys Cys Tyr Leu Val Asn Gly Asn Ala Pro Asn Arg Leu
225                 230                 235                 240
Tyr Ser Leu Leu Lys Gly Glu Thr Val Thr Cys Thr Ile Ala Lys Gly
                245                 250                 255
Gly Leu Glu

<210> 280
<211> 780
<212> DNA
<213> methanogenic archaeon

<400> 280
atgatactca taaaattcgg cggaagcgta attaccgaca aggccgagta tcgcaaattc     60
aacaaggaga ccgtcgccag actggctgac gaaatcaaac gctccggaca ggaggtcatc    120
atcgtccacg gagcaggttc tttcggacac gtggtctcca aacaatacaa ccttcagaag    180
ggttacgaga acgattccca gattcccgcc atggcaaggg tcatgtgcga caccagggaa    240
ctcagctcca tggtcgtgga ggaactcctc gccaagaata tccccgccgt atcggtgccc    300
atcggatcct gtttcgtcgc ggacggaggc aaactggtgg tcgacaacga ggagcccctc    360
aggaggctca ccgacctcgg gatcatgcct gtcatgttcg gagatgtgat taccgacagg    420
aagacccgtt tctgcatagt ttctggcgac caggttatgg agctcctgtg ctggatgtac    480
aatcccgaga aagtcgtgtt cgtatcggac atcgacggac tctacgaccg caaccccaag    540
acagacaagg ccgcacgcat gatcggcacc gtcaccaagg agaagatggc gagtatcgcc    600
accgattcca acgtggatga cgtcacaggt ggggtgcgca acaagatgga ggccatgctc    660
cgcatgaccg acggcagcag gaaatgctac ctggtcaacg gcaacgcccc caaccgcctg    720
tactcgctgc tgaagggcga gacagtcaca tgcacaatcg caaaaggagg cctggaatga    780

<210> 281
<211> 790
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 281
atgatcctga ttaagttcgg cggcagtgtt attaccgata aagcagaata tcgcaaattc     60
aataaggaaa ccgttgcccg cctggcagat gaaattaaac gcagcggcca ggaagtgatt    120
attgttcatg gtgccggtag cttcggccat gttgtgagta aacagtataa tctgcaaaaa    180
ggctatgaaa atgacagcca gattccggcc atggcacgcg tgatgtgtga tacccgtgaa    240
ctgagtagta tggttgttga agaactgctg gcaaaaaata ttccggcagt gagcgtgccg    300
attggcagct gcttcgttgc agatggtggt aaactggttg tggataatga agaaccgctg    360
cgtcgtctga ccgatctggg tattatgccg gtgatgttcg gcgatgtgat taccgatcgc    420
aaaaccagat tctgtattgt gagtggcgat caggtgatgg aactgctgtg ctggatgtat    480
aatccggaaa aagtggtgtt cgttagcgat attgatggtc tgtatgatcg caatccgaaa    540
accgataaag ccgcccgtat gattggtacc gtgaccaaag aaaaaatggc cagcattgcc    600
accgatagta atgtggatga tgttaccggt ggcgtgcgta ataaaatgga agcaatgctg    660
cgtatgaccg atggtagtcg caaatgctat ctggttaatg gtaatgcacc gaatcgcctg    720
tatagtctgc tgaaaggcga aaccgttacc tgtaccattg caaaaggcgg cctggaatga    780
ctgaaagctt                                                           790

<210> 282
<211> 260
<212> PRT
<213> Thermoplasmatales archaeon

<400> 282
Met Phe Ile Ile Lys Leu Gly Gly Ser Val Ile Thr Asp Lys Ser Lys
1               5                   10                  15
Glu Asn Phe Tyr Lys Gln Val Ile Val Asp Asn Leu Val Lys Ala Ile
            20                  25                  30
Lys Lys Ala Asn Lys Lys Thr Ile Ile Ile His Gly Ala Gly Ser Phe
        35                  40                  45
Gly His Ile Ile Ala Asp Lys Tyr Lys Leu Asn Asp Gly Tyr Asn Ser
    50                  55                  60
Asp Asp Gln Leu Leu Gly Phe Ser Leu Thr His Gly Met Val Gln Lys
65                  70                  75                  80
Leu Ser Ser Leu Val Leu Glu Ser Phe His Lys Asn Gly Ile Pro Ala
                85                  90                  95
Val Ala Val Pro Pro His Ser Asn Leu Ile Leu Asn Asn His Thr Pro
            100                 105                 110
Leu Ser Phe Asp Tyr Lys Ile Phe Lys Glu Tyr Ile Asp Met Asn Phe
        115                 120                 125
Thr Pro Ile Thr Phe Gly Asp Val Val Ile Asp Lys Lys Leu Gly Phe
    130                 135                 140
Ser Ile Cys Ser Gly Asp Leu Leu Met Leu Leu Leu Ala Lys Asn Leu
145                 150                 155                 160
Lys Pro Glu Lys Thr Ile Phe Val Ile Asp Glu Asp Gly Leu Tyr Thr
                165                 170                 175
Ser Asn Pro Lys Lys Asp Asn Asn Ala Glu Leu Ile Glu Ser Ala Thr
            180                 185                 190
Lys Glu Gln Leu Asn Lys Tyr Val Thr Ser Met Asp Lys His Ala Asp
        195                 200                 205
Val Thr Gly Gly Met Gln Gly Lys Ile Glu Thr Ile Lys Lys Ile Ala
    210                 215                 220
Asp Ile Gly Ile Asp Thr Ile Leu Leu Asn Gly Asn Lys Pro Asp Arg
225                 230                 235                 240
Leu Phe Lys Val Leu Asn Gly Glu Lys Thr Lys Ser Thr Ile Ile Thr
                245                 250                 255
Gly Gly Tyr Lys
            260

<210> 283
<211> 783
<212> DNA
<213> Thermoplasmatales archaeon

<400> 283
atgtttatta tcaaacttgg cggtagtgta attacagata aatcaaagga aaatttctac     60
aaacaggtaa ttgttgataa tctagtaaag gctattaaaa aggcaaacaa aaaaacaata    120
attattcatg gggcaggttc ttttgggcat attattgcag ataaatacaa attaaatgat    180
ggttataatt cagatgatca actgcttggt ttttctctta ctcatggtat ggtacaaaaa    240
ttaagcagtt tggtcttgga atcttttcat aaaaatggga tacctgctgt tgctgttcct    300
cctcactcta atttaatatt aaacaatcat actcctttaa gctttgatta taaaattttc    360
aaagaatata ttgatatgaa ctttactcca ataacttttg gagatgttgt aattgataaa    420
aaattgggtt tttcaatttg ttcaggcgat cttttaatgc tcctacttgc aaaaaatctc    480
aaacctgaaa agacaatctt tgtaattgat gaggatggcc tttatacctc taatccaaaa    540
aaagataata atgcagaact aatagaatct gcaacaaaag aacaattaaa taaatatgtt    600
acctcaatgg acaagcatgc agatgtaact ggaggaatgc aaggaaaaat tgaaacaatt    660
aaaaaaattg cagatattgg tattgataca atcctgctaa atggtaataa acctgataga    720
ttattcaagg tcttaaatgg agaaaaaaca aaatcaacaa taatcactgg aggatataaa    780
tga                                                                  783

<210> 284
<211> 793
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 284
atgttcatca tcaagctggg cggcagcgtt attaccgata aaagtaaaga aaacttctac     60
aagcaggtga ttgtggataa tctggttaaa gccattaaaa aagccaataa aaagaccatt    120
atcatccatg gtgccggtag cttcggtcat attattgccg ataaatataa gctgaacgat    180
ggctataata gtgatgatca gctgctgggc ttcagcctga cccatggtat ggttcagaaa    240
ctgagtagtc tggttctgga atcattccat aaaaatggca ttccggcagt tgccgttccg    300
ccgcatagta atctgattct gaataatcat accccgctga gcttcgatta taaaatcttc    360
aaagaataca tcgacatgaa cttcaccccg attaccttcg gcgatgttgt tattgataaa    420
aaactgggct tcagtatctg tagcggtgat ctgctgatgc tgctgctggc aaaaaatctg    480
aaaccggaaa aaaccatctt cgtgattgat gaagatggtc tgtataccag caatccgaaa    540
aaagataata acgccgaact gattgaaagc gcaaccaaag aacagctgaa taaatatgtg    600
accagcatgg ataaacatgc cgatgtgacc ggtggtatgc agggtaaaat tgaaaccatt    660
aaaaagatcg cagacattgg tattgatacc attctgctga atggtaataa accggatcgc    720
ctgttcaaag tgctgaatgg cgaaaaaacc aaaagtacca ttattaccgg cggttataaa    780
tgactgaaag ctt                                                       793

<210> 285
<211> 247
<212> PRT
<213> Thermoplasmatales archaeon

<400> 285
Met Val Ile Val Leu Lys Leu Gly Gly Ser Ile Ile Thr Glu Lys Asp
1               5                   10                  15
Ser Tyr Arg Lys Ile Asn Glu Asp Ala Ile Val Lys Leu Phe Asp Val
            20                  25                  30
Leu Ser Lys Ser Arg Glu Lys Met Val Leu Ile His Gly Ala Gly Ser
        35                  40                  45
Phe Gly His Ile Leu Ala Leu Lys His Gly Leu Glu Lys Pro Gly Pro
    50                  55                  60
Ser Lys Gly Arg Glu Ala Ser Ile Ser Arg Val Met Ser Asp Val Leu
65                  70                  75                  80
Ala Leu Asp Ser Ala Ile Val Asp Lys Leu Asn Glu Lys Gly Val Arg
                85                  90                  95
Gly Val Ala Val Pro Pro His Ala Ile Tyr His Gly Ser Leu Pro Asp
            100                 105                 110
Phe Lys Ile Val Glu Thr Leu Leu Ala Asn Gly Phe Ile Pro Val Leu
        115                 120                 125
Tyr Gly Asp Ile Ile Val Tyr Arg Gly Lys Tyr Arg Ile Ile Ser Gly
    130                 135                 140
Asp Glu Ile Ala Leu Asp Leu Ser Arg Arg Phe Arg Pro Arg Ser Val
145                 150                 155                 160
Val Phe Val Thr Asp Val Asp Gly Leu Tyr Asp Ser Asp Pro Lys Val
                165                 170                 175
Asn Lys Arg Ala Lys Phe Ile Pro Lys Ile Arg Ala Ser Glu Ile Glu
            180                 185                 190
Val Ile Asp Thr Lys Arg Asp Ala Thr Gly Ser Met Ala Gly Lys Met
        195                 200                 205
Glu Arg Ile Lys Lys Ile Val His Tyr Thr Gly Arg Val Ile Ile Ile
    210                 215                 220
Asn Gly Lys Arg Pro Asp Arg Leu Ser Asp Ser Leu Glu Gly Lys Glu
225                 230                 235                 240
Thr Lys Ser Thr Val Ile Thr
                245

<210> 286
<211> 744
<212> DNA
<213> Thermoplasmatales archaeon

<400> 286
atggtgattg ttttaaaact cggtggaagc atcataactg aaaaagacag ttacaggaag     60
atcaacgaag acgccatagt aaaattgttt gatgttctct ctaaaagcag ggagaaaatg    120
gtgctcatcc acggtgcggg atccttcggc cacatacttg cactcaaaca tggtcttgaa    180
aaacctggac catcaaaggg gagggaggca tcgatctcaa gggttatgag tgacgtcctt    240
gcactcgatt ctgcaatcgt agataagctc aacgagaagg gtgtcagagg agtggcagta    300
ccaccacacg caatctacca tgggagtttg cccgatttca agatagttga aacgctcctg    360
gccaacggat tcattcctgt gctttacgga gacataattg tctatagggg aaaatacagg    420
ataatctcag gggacgagat agctctggat ttatcaagaa ggttcaggcc aaggtcagtt    480
gtctttgtca ctgacgtcga cggactttat gattccgacc cgaaggtcaa caagagggct    540
aaatttatcc caaagatcag ggcaagtgag atcgaagtga tagacactaa aagagatgct    600
actggttcca tggcggggaa gatggaaaga attaagaaaa tagtacacta tacaggcaga    660
gtgataataa tcaacggaaa gagacctgac aggcttagtg attccctgga ggggaaggag    720
acgaagtcca cggtgatcac atga                                           744

<210> 287
<211> 754
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 287
atggttatcg ttctgaaact gggcggtagc attattaccg aaaaagatag ttatcgcaag     60
attaatgagg atgccattgt gaaactgttc gatgtgctga gcaaaagtcg tgaaaaaatg    120
gtgctgattc atggcgcagg tagcttcggt catattctgg ccctgaaaca tggtctggaa    180
aaaccgggcc cgagtaaagg tcgtgaagca agtattagtc gtgttatgag cgatgtgctg    240
gcactggata gcgcaattgt ggataaactg aatgaaaaag gcgttcgtgg cgttgccgtg    300
ccgccgcatg ccatctatca tggtagcctg ccggacttca aaattgttga aaccttactg    360
gcaaatggct tcattccggt tctgtatggc gatattattg tgtatcgtgg caaatatcgt    420
attattagcg gcgatgaaat tgcactggat ctgagtcgcc gcttccgccc gcgtagtgtg    480
gtgttcgtta ccgatgtgga tggtctgtat gatagtgatc cgaaagtgaa taaacgtgcc    540
aaattcattc cgaaaattcg cgccagcgaa attgaagtga ttgataccaa acgtgatgca    600
accggtagta tggccggtaa aatggaacgt attaaaaaaa ttgtgcacta caccggccgc    660
gtgattatta ttaatggtaa acgcccggat cgcctgagcg atagcctgga aggtaaagaa    720
accaaaagta ccgtgattac ctgactgaaa gctt                                754

<210> 288
<211> 270
<212> PRT
<213> Odinarchaeota archaeon

<400> 288
Met Asn Asn Lys Leu Thr Ile Ile Lys Leu Gly Gly Ser Val Ile Thr
1               5                   10                  15
Asp Lys Ser Val Pro Phe Ser Ile Asp Glu Lys Val Ile Lys Asn Ile
            20                  25                  30
Ile Ser Glu Met Glu Gln Ile Lys Lys Glu Lys Thr Ile Ile Val His
        35                  40                  45
Gly Gly Gly Ala Phe Gly His Pro Ile Ala Lys Lys Tyr Met Leu Ala
    50                  55                  60
Thr Gly Leu Lys Val Lys Glu Gln Ile Arg Gly Val Ile Glu Thr Ser
65                  70                  75                  80
Gln Ala Met Leu Thr Leu Asn Lys Ile Ile Leu Asp Met Phe Ile Gln
                85                  90                  95
Ala Asp Tyr Pro Val Ile Ser Phe Ser Pro His Asp Ile Phe Ile Thr
            100                 105                 110
Lys His Gly Arg Ile Tyr Lys Thr Phe Leu Asn Ser Leu Lys Asn Val
        115                 120                 125
Leu Glu Ile Gly Phe Ile Pro Val Leu Phe Gly Asp Val Val Tyr Asp
    130                 135                 140
Thr Ala Gln Gly Val Ala Ile Leu Ser Gly Asp Gln Ile Ile Ser Tyr
145                 150                 155                 160
Leu Ser Ile Lys Leu Lys Ala Ser Lys Val Ile Leu Gly Thr Asp Ile
                165                 170                 175
Asn Gly Ile Tyr Ser Ser Asp Pro Lys Ile Asn Pro Gly Ala Gln Leu
            180                 185                 190
Ile Pro Glu Val Thr Pro Asp Asn Tyr Arg Arg Ile Leu Lys Ile Leu
        195                 200                 205
Lys Ser Asn Thr Lys Asn Ser Leu Asp Val Thr Gly Gly Met Tyr Gly
    210                 215                 220
Lys Val Arg Glu Leu Ile Lys Val Ala Lys His Gly Ile Asp Ile Tyr
225                 230                 235                 240
Ile Leu Asn Ala Arg Thr Pro Gly Asn Ile Ser Lys Ile Leu Asn Asn
                245                 250                 255
Ser Glu Ile Asn Cys Thr Gln Phe Lys Asn Trp Arg Lys Gln
            260                 265                 270

<210> 289
<211> 813
<212> DNA
<213> Odinarchaeota archaeon

<400> 289
ttgaataata aattaactat aattaaactc ggcggaagtg tgatcacaga taaaagtgtc     60
ccgttctcaa tagatgagaa agtgattaag aatattatca gtgaaatgga gcagattaaa    120
aaagaaaaaa cgattatagt tcacggcggc ggagcgttcg ggcaccctat cgccaaaaaa    180
tacatgctag caacaggtct taaagttaaa gagcagataa gaggcgtaat cgaaacaagt    240
caagcgatgc ttacactgaa taaaatcata ctcgatatgt ttattcaagc agattacccg    300
gttatctctt tcagcccaca tgatattttc atcactaaac acggtagaat atataaaaca    360
tttcttaatt ctctaaaaaa tgttttagaa ataggcttca tacctgtttt attcggcgac    420
gttgtatatg atacagccca aggtgttgcg atactttcag gggaccagat tataagctat    480
ttatcaatca agcttaaagc tagtaaggtg attctcggaa cagatataaa tggaatatac    540
tcaagcgatc ctaaaataaa ccccggcgcg cagttaatcc cagaagtaac tcctgataac    600
tatcgcagga ttctaaaaat tttaaagtca aacactaaaa acagtttaga tgtcacaggt    660
ggaatgtacg gtaaggttag agagctgata aaggtcgcta aacatggaat agacatatat    720
attttgaacg ctagaacacc aggtaatatt agtaaaatat taaataactc tgaaattaat    780
tgtacacaat tcaaaaactg gcgtaaacag taa                                 813

<210> 290
<211> 823
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 290
atgaacaaca agctgaccat tattaagctg ggcggcagcg ttattaccga taaaagcgtg     60
ccgttcagca ttgatgaaaa agttattaaa aacatcatca gcgagatgga acagattaaa    120
aaagaaaaaa ccatcatcgt gcacggcggt ggcgcattcg gtcatccgat tgcaaaaaaa    180
tatatgctgg caaccggcct gaaagtgaaa gaacagattc gtggcgttat tgaaaccagt    240
caggccatgc tgaccctgaa taaaattatt ctggatatgt tcatccaggc cgattatccg    300
gttattagct tcagcccgca tgatatcttc attaccaaac atggccgtat ctataaaacc    360
ttcctgaata gcctgaaaaa tgttctggaa attggcttca ttccggttct gttcggcgat    420
gttgtgtatg ataccgcaca gggtgttgcc attctgagcg gtgatcagat tattagctat    480
ctgagcatta aactgaaggc aagtaaagtt attctgggca ccgatattaa tggcatctat    540
agcagcgatc cgaaaattaa tccgggtgcc cagctgattc cggaagtgac cccggataat    600
tatcgccgca ttctgaaaat tctgaaaagt aataccaaga acagcctgga tgtgaccggc    660
ggcatgtatg gtaaagttcg tgaactgatt aaagttgcaa aacatggtat tgatatctac    720
attctgaacg cccgtacccc gggcaatatt agcaaaattc tgaataatag cgagatcaat    780
tgcacccagt tcaaaaattg gcgcaaacag tgactgaaag ctt                      823

<210> 291
<211> 137
<212> PRT
<213> Shigella flexneri

<400> 291
Met Pro Met Asp Leu Arg Asp Asn Lys Gln Ser Gln Lys Lys Trp Lys
1               5                   10                  15
Asn Arg Thr Leu Thr Ser Ser Leu Glu Phe Ala Leu Thr Gly Ile Phe
            20                  25                  30
Thr Ala Phe Lys Glu Glu Arg Asn Met Lys Lys His Ala Val Ser Ala
        35                  40                  45
Leu Leu Ala Val Ile Ala Gly Leu Val Phe Lys Val Ser Val Ile Glu
    50                  55                  60
Trp Leu Phe Leu Leu Leu Ser Ile Phe Leu Val Ile Thr Phe Glu Ile
65                  70                  75                  80
Val Asn Ser Ala Ile Glu Asn Val Val Asp Leu Ala Ser Asp Tyr His
                85                  90                  95
Phe Ser Met Leu Ala Lys Asn Ala Lys Asp Met Ala Ala Gly Ala Val
            100                 105                 110
Leu Val Ile Ser Gly Phe Ala Ala Leu Thr Gly Leu Ile Ile Phe Leu
        115                 120                 125
Leu Lys Ile Trp Phe Leu Leu Phe His
    130                 135

<210> 292
<211> 414
<212> DNA
<213> Shigella flexneri

<400> 292
atgcctatgg acttaagaga taataagcaa agccaaaaga aatggaaaaa tagaacttta     60
acttccagcc ttgaattcgc tttaacgggg atttttacgg cttttaaaga agagcgtaat    120
atgaaaaaac atgcagtgtc agctctttta gctgtcattg ctggtttggt ttttaaagta    180
tcagtcattg agtggctttt tcttttatta agtatttttt tggttattac ctttgaaatt    240
gtcaattcgg ctattgaaaa tgtggttgat ttggccagtg actatcattt ttccatgttg    300
gctaaaaatg ctaaggatat ggctgcagga gctgttcttg tcatttcagg ttttgctgcc    360
ttgacaggct tgattatttt tttattaaaa atttggtttt tgctttttca ttaa          414

<210> 293
<211> 424
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 293
atgccgatgg atctgcgtga taataaacag agtcagaaaa aatggaagaa ccgcaccctg     60
accagtagcc tggagttcgc cctgaccggt atcttcaccg cattcaaaga agaacgcaat    120
atgaaaaaac acgcagttag tgcactgctg gcagtgattg caggtctggt gttcaaagtt    180
agcgttattg aatggctgtt cctgctgctg agcatcttcc tggttattac cttcgaaatt    240
gttaatagcg ccattgaaaa tgtggtggat ctggcaagtg attatcactt cagcatgctg    300
gcaaaaaatg ccaaagatat ggcagccggc gcagttctgg ttattagtgg cttcgcagcc    360
ctgaccggcc tgattatctt cctgctgaaa atctggttcc tgctgttcca ttgactgaaa    420
gctt                                                                 424

<210> 294
<211> 123
<212> PRT
<213> Roizmanbacteria bacterium

<400> 294
Met Ile Arg Gly His Arg Ile Ser Ile Arg His Ala Ile Asp Gly Ile
1               5                   10                  15
Ile Trp Ala Val Arg Thr Gln Pro Asn Tyr Arg Ile His Phe Thr Leu
            20                  25                  30
Ser Ile Leu Ser Leu Val Gly Gly Leu Ile Phe Lys Ile Ser Tyr Glu
        35                  40                  45
Glu Phe Leu Ala Ile Tyr Val Leu Ile Phe Val Gly Leu Ala Ile Glu
    50                  55                  60
Thr Val Asn Thr Ser Leu Glu Lys Thr Ser Asp Ala Ile Thr Arg Glu
65                  70                  75                  80
Tyr Asn His Asp Ile Lys Thr Ala Lys Asp Val Ala Ala Gly Ala Met
                85                  90                  95
Leu Phe Phe Ala Ile Gly Ala Leu Ala Val Ala Cys Ala Ile Phe Ile
            100                 105                 110
Pro Lys Ile Trp Leu Leu Phe Ile Asn Ala Ser
        115                 120

<210> 295
<211> 372
<212> DNA
<213> Roizmanbacteria bacterium

<400> 295
atgattcgtg gacacagaat ttctattaga catgcaattg acggcataat ctgggccgtt     60
cggacacagc ctaactatcg gattcatttt actttatcaa ttctgtctct ggttggtggc    120
ctgattttca aaatttccta tgaagaattt ttagcaatct acgtcctgat ttttgtgggg    180
ctggcaattg agacagtcaa tacttcactt gagaaaactt cagacgccat cacccgtgaa    240
tacaatcacg atatcaaaac cgccaaagat gtggcggccg gagcaatgct tttttttgcg    300
attggtgctt tggccgtggc ctgcgcgatc tttatcccca aaatatggct gttatttatt    360
aacgcgtctt aa                                                        372

<210> 296
<211> 382
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 296
atgatccgtg gtcatcgtat tagcattcgc catgcaattg atggcattat ctgggcagtg     60
cgtacccagc cgaattatcg cattcacttc accctgagca ttctgagcct ggttggtggc    120
ctgatcttca aaattagtta tgaagagttc ctggccatct atgttctgat cttcgtgggc    180
ctggcaattg aaaccgtgaa taccagtctg gaaaaaacca gcgatgccat tacccgtgaa    240
tataatcatg atattaagac cgcaaaggat gttgccgcag gtgccatgct gttcttcgcc    300
attggcgccc tggcagttgc atgcgccatc ttcattccga aaatctggct gctgttcatt    360
aatgcaagct gactgaaagc tt                                             382

<210> 297
<211> 127
<212> PRT
<213> Bacteroidales bacterium

<400> 297
Met Glu Ser Gly Asn Arg Leu Gly Leu His Ser Arg Tyr Gln Ser Ile
1               5                   10                  15
Arg Phe Ala Val Asn Gly Ile Lys Thr Leu Phe Arg Glu Glu Lys Asn
            20                  25                  30
Ala Val Ile Gln Leu Leu Ile Phe Ala Met Val Ile Ile Ala Gly Phe
        35                  40                  45
Phe Ser Arg Leu Ser Asp Thr Glu Trp Ile Leu Ile Thr Thr Val Ser
    50                  55                  60
Met Phe Val Phe Ala Cys Glu Cys Phe Asn Thr Ala Leu Glu Asp Leu
65                  70                  75                  80
Ser Asp Phe Val Thr Gly Glu Lys Asn Glu Lys Ile Arg Lys Ile Lys
                85                  90                  95
Asp Leu Ala Ala Gly Gly Val Leu Ile Ser Ala Leu Gly Ala Ala Ile
            100                 105                 110
Thr Gly Met Ile Ile Phe Phe Pro Arg Phe Leu Asp Leu Phe Asn
        115                 120                 125

<210> 298
<211> 384
<212> DNA
<213> Bacteroidales bacterium

<400> 298
atggaatccg gtaacaggtt gggtttgcat agccggtatc agagtatcag gtttgctgta     60
aatggcataa aaactctctt cagagaagag aaaaatgcgg taattcagtt gctgattttt    120
gctatggtaa tcatagcggg atttttttcc agactgtcag atacagaatg gattttaata    180
acaacagtaa gtatgtttgt atttgcttgt gaatgtttta atactgcact ggaagatctt    240
tctgatttcg tgaccggaga aaagaatgaa aagatcagga agatcaagga cctggcagcg    300
ggtggagtcc tgatctcagc tttaggtgct gctattacag gtatgataat tttttttcca    360
cgatttttag acctgtttaa ttga                                           384

<210> 299
<211> 394
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 299
atggaaagtg gtaatcgtct gggtctgcat agtcgttatc agagtattcg cttcgcagtt     60
aatggtatta aaaccttatt ccgtgaagaa aaaaacgccg ttattcagct gctgatcttc    120
gcaatggtta ttattgcagg cttcttcagt cgcctgagcg ataccgaatg gattctgatt    180
accaccgtga gcatgttcgt gttcgcatgt gaatgcttca ataccgccct ggaagactta    240
agcgacttcg tgaccggtga aaaaaatgaa aaaattcgta agatcaagga cctggcagcc    300
ggtggtgtgc tgattagcgc cctgggtgca gccattaccg gcatgattat cttcttcccg    360
cgcttcctgg atctgttcaa ttgactgaaa gctt                                394

<210> 300
<211> 168
<212> PRT
<213> Methylomirabilis oxyfera

<400> 300
Met Glu Gly Ser Ser His Pro Phe Arg Cys Ala Leu Lys Gly Val Glu
1               5                   10                  15
Asp Ala Ile Ser Thr Gln Arg His Leu Arg Ala His Ile Val Val Ala
            20                  25                  30
Gly Phe Val Ala Leu Phe Gly Leu Leu Leu Glu Leu Pro His Val Asp
        35                  40                  45
Leu Val Leu Leu Leu Met Ala Ile Ala Leu Val Ile Ile Thr Glu Leu
    50                  55                  60
Leu Asn Thr Ala Val Glu Leu Thr Val Asp Leu Val Ser Pro Thr Phe
65                  70                  75                  80
His Pro Ile Ala Gly Arg Ala Lys Asp Ile Ala Ala Gly Ala Val Leu
                85                  90                  95
Ile Ala Ala Leu Val Ala Ala Thr Val Gly Ile Ile Val Leu Ala Pro
            100                 105                 110
Pro Leu Phe Gly Ala Leu Thr Thr Arg Pro Leu Ser Ala Lys Ser Ala
        115                 120                 125
Leu Leu Val Ala Thr Thr Leu Gly Leu Val Gly Ser Ile Ile Ala Ala
    130                 135                 140
Leu Leu Pro Arg Ser Ser Ser Ser Lys Arg Asp Gln Leu Leu Thr Val
145                 150                 155                 160
Ser Lys Lys Leu Asn Thr Asp His
                165

<210> 301
<211> 507
<212> DNA
<213> Methylomirabilis oxyfera

<400> 301
gtggaaggct cgtcacatcc gtttcgctgt gcgttgaaag gggtcgaaga cgcgatctcc     60
acacaacgcc acctgcgcgc tcacatcgtc gtggccgggt tcgtcgcgct gttcgggctg    120
ttgctggagt tgccgcatgt cgatctggtg ctgctgctca tggccattgc gcttgttatc    180
atcacagaac tgctgaatac tgcagtggag ttgaccgtgg atcttgtatc gccgaccttt    240
cacccgatcg ccggacgggc aaaggacatt gccgccggcg cggtgctgat cgccgcgttg    300
gttgcggcta ccgtcggtat tatcgtactt gcgccccctt tgttcggcgc gctcaccaca    360
cgtccgcttt cagcgaagtc agctctgctg gtggcgacca ccctcgggct ggtcggaagc    420
attattgctg cactcttgcc acgttcttcc agctcgaagc gcgaccagct tttaactgtc    480
agcaagaagt tgaatactga ccactga                                        507

<210> 302
<211> 517
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 302
atggaaggca gcagccatcc gttccgttgt gccctgaaag gcgttgaaga tgcaattagc     60
acccagcgtc atctgcgcgc ccatattgtt gttgccggct tcgtggccct gttcggcctg    120
ctgctggaac tgccgcatgt ggatctggtt ctgctgctga tggcaattgc cctggttatt    180
attaccgaac tgctgaatac cgcagttgaa ctgaccgtgg atctggtgag cccgaccttc    240
catccgattg ccggtcgcgc aaaagatatt gccgccggcg ccgtgctgat tgcagcactg    300
gtggccgcca ccgttggtat tattgtgctg gcaccgccgc tgttcggtgc actgaccacc    360
cgcccgctga gtgccaaaag tgcactgctg gttgcaacca ccctgggcct ggttggcagt    420
attattgcag cattactgcc gcgtagcagc agtagcaaac gtgatcagct gctgaccgtt    480
agtaaaaaac tgaataccga tcattgactg aaagctt                             517

<210> 303
<211> 132
<212> PRT
<213> Eremococcus coleocola

<400> 303
Met Asp Leu Lys Asp Arg Lys Asp Leu Asn Pro Phe Lys Arg Trp Leu
1               5                   10                  15
Leu Ser Cys Gly Phe Ala Leu Glu Gly Leu Arg Phe Thr Trp Lys Gly
            20                  25                  30
Glu Pro Asn Phe Lys Ile His Ile Ser Ile Leu Ile Leu Val Thr Ile
        35                  40                  45
Ala Gly Phe Phe Phe Gly Ile Ala Arg Trp Glu Trp Val Thr Leu Leu
    50                  55                  60
Ile Cys Phe Ala Phe Ile Leu Thr Leu Glu Leu Ile Asn Thr Ala Leu
65                  70                  75                  80
Glu Thr Leu Val Asn Trp Ile Ala Asp Lys Gln Trp His Pro Leu Ala
                85                  90                  95
Lys Ile Thr Lys Asp Val Ala Ala Gly Ala Val Leu Val Gly Ala Ile
            100                 105                 110
Ile Val Ala Gly Ile Gly Leu Ile Ile Phe Val Pro Tyr Ile Trp Gln
        115                 120                 125
Tyr Phe Leu Gly
    130

<210> 304
<211> 399
<212> DNA
<213> Eremococcus coleocola

<400> 304
atggacttga aagatagaaa agatttaaat ccctttaaac ggtggctact ttcctgtggc     60
tttgccctag agggattacg ctttacttgg aaaggtgagc ccaattttaa gattcatatt    120
tccattttga ttcttgtaac gattgctggg tttttctttg ggattgcccg ctgggaatgg    180
gtgactcttt taatttgttt tgcctttatt ttgaccttag agttaatcaa taccgcctta    240
gaaactctcg taaactggat agctgataaa cagtggcacc ctttggctaa gattaccaag    300
gatgtggcag ctggagctgt cttagtgggt gctattattg ttgcggggat tggtttgatt    360
atttttgtgc catatatctg gcaatatttt ctaggttag                           399

<210> 305
<211> 409
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 305
atggatctga aagatcgtaa agacttaaat ccgttcaaac gctggctgct gagctgcggc     60
ttcgcactgg aaggtctgcg cttcacctgg aaaggcgaac cgaacttcaa aattcatatt    120
agtatcctga tcctggttac cattgccggc ttcttcttcg gcattgcccg ctgggaatgg    180
gtgaccctgc tgatctgctt cgccttcatt ctgaccctgg aactgattaa taccgcactg    240
gaaaccttag tgaattggat tgcagataaa cagtggcatc cgctggcaaa aattaccaaa    300
gatgtggccg caggtgccgt tctggttggt gccattattg tggcaggtat tggcctgatt    360
atcttcgttc cgtatatctg gcagtacttc ctgggttgac tgaaagctt                409

<210> 306
<211> 128
<212> PRT
<213> Firmicutes bacterium

<400> 306
Met Leu Ser Arg Thr Lys Lys Arg Arg Lys Phe Lys Asp Ser Phe Lys
1               5                   10                  15
Asn Cys Ile Asp Gly Leu Arg Phe Ile Asn Ile Asn Glu Asp Asn Phe
            20                  25                  30
Lys Arg Glu Ile Leu Leu Gly Ile Ile Thr Leu Val Leu Ser Tyr Leu
        35                  40                  45
Leu Lys Ile Asp Lys Ile Glu Phe Ile Ile Ile Ile Ile Val Ile Gly
    50                  55                  60
Leu Val Leu Val Ser Glu Ile Ile Asn Thr Ala Ile Glu Arg Leu Val
65                  70                  75                  80
Asp Leu Val Ser Pro Lys Tyr Asn Lys Leu Ala Gly Glu Val Lys Asp
                85                  90                  95
Ile Ala Ala Ser Ser Val Leu Leu Met Cys Ile Phe Ser Leu Val Val
            100                 105                 110
Gly Val Ile Ile Phe Val Pro Lys Ile Ile Asn Leu Leu Gly Gly Phe
        115                 120                 125

<210> 307
<211> 387
<212> DNA
<213> Firmicutes bacterium

<400> 307
atgctttcac gaactaagaa aagacggaag tttaaagata gttttaagaa ttgtatagat     60
ggcttaagat ttattaatat taatgaagat aattttaaaa gagaaatact tttaggtatt    120
ataactttag ttttatcgta tcttcttaag atagataaaa tagaatttat aatcataatt    180
atagtaattg gactagtatt agtaagtgaa ataattaata cagcaataga aagattagtt    240
gatcttgtta gtccaaaata taataagtta gcaggagaag taaaagatat agcagcttct    300
tctgtccttc ttatgtgtat cttttcttta gtagtaggag taataatatt tgtaccaaaa    360
ataattaatt tactaggagg attttaa                                        387

<210> 308
<211> 397
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 308
atgctgagtc gtaccaaaaa acgtcgcaaa ttcaaagata gcttcaaaaa ttgtatcgac     60
ggcctgcgct tcattaatat taatgaagat aacttcaagc gtgagattct gctgggtatt    120
attaccctgg tgctgagcta tctgctgaaa attgataaaa ttgagttcat catcatcatc    180
attgtgattg gcctggtgct ggtgagcgaa attattaata ccgcaattga acgtctggtg    240
gatctggtta gcccgaaata taataaactg gccggcgaag ttaaagatat tgccgccagc    300
agcgtgctgc tgatgtgcat cttcagcctg gtggtgggcg tgattatctt cgtgccgaaa    360
attattaatc tgctgggtgg cttctgactg aaagctt                             397

<210> 309
<211> 156
<212> PRT
<213> Raphidiopsis brookii

<400> 309
Met Phe Pro Lys Ser Ser Ile Pro Pro Thr Pro Pro Lys Arg Leu Pro
1               5                   10                  15
Lys Ile Val Ser Ser Glu Arg Glu Phe Ser Trp Gln Ile Ala Ser Asn
            20                  25                  30
Leu Leu Ala Ser Phe Lys Tyr Ala Trp Ala Gly Ile Ser Tyr Gly Phe
        35                  40                  45
Gln Thr Gln Arg Asn Phe Arg Ile His Val Ala Ala Cys Ala Phe Val
    50                  55                  60
Ile Gly Leu Ser Ile Phe Leu His Leu Lys Pro Val Glu Ile Ala Ile
65                  70                  75                  80
Ile Ser Ile Thr Ser Gly Leu Val Leu Thr Leu Glu Leu Val Asn Thr
                85                  90                  95
Ala Ile Glu Ser Leu Val Asp Leu Thr Val Lys Gln Thr Tyr His Glu
            100                 105                 110
Leu Ala Lys Val Ala Lys Asp Cys Ala Ala Gly Ala Val Leu Val Ser
        115                 120                 125
Ala Met Val Ser Leu Ile Val Ala Thr Thr Leu Leu Leu Pro Pro Leu
    130                 135                 140
Leu Arg Leu Ile Thr Thr Thr Phe Leu Leu Glu Trp
145                 150                 155

<210> 310
<211> 471
<212> DNA
<213> Raphidiopsis brookii

<400> 310
atgttcccta agagttcaat accaccaaca ccaccaaagc gtttaccaaa aattgtatct     60
tcggaaaggg aattttcttg gcaaatagct tctaacttac ttgccagctt taaatacgct    120
tgggctggta tcagttatgg ttttcagacc cagcgtaact ttaggattca tgtagctgct    180
tgtgcttttg tgattggttt gagtattttt ctgcatctta aaccagtaga aatagccata    240
attagcatta caagtggttt agttttaaca ttggagttag ttaatacagc cattgagtct    300
ctggtggatt taactgttaa gcagacctat catgaattgg cgaaagtggc caaagactgt    360
gctgctggtg ctgtgcttgt ctcagcaatg gtatcactaa tagtagcaac tacactatta    420
cttcctcctt tactacgttt aatcacaact acattcttat tagaatggta g             471

<210> 311
<211> 481
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 311
atgttcccga aaagtagcat tccgccgacc ccgccgaaac gtctgccgaa aattgtgagt     60
agcgaacgtg agttcagttg gcagattgca agcaatctgc tggcatcatt caaatatgcc    120
tgggccggca ttagctatgg cttccagacc cagcgtaact tccgtattca tgtggccgca    180
tgcgcattcg ttattggtct gagtatcttc ctgcatctga aaccggttga aattgcaatt    240
attagcatta ccagcggtct ggttctgacc ctggaactgg ttaataccgc aattgaaagc    300
ctggtggatc tgaccgtgaa acagacctat catgaactgg caaaagttgc caaagattgt    360
gcagcaggtg cagtgctggt tagtgccatg gttagtctga ttgttgcaac caccctgctg    420
ctgccgccgc tgctgcgtct gattaccacc accttcctgc tggaatggtg actgaaagct    480
t                                                                    481

<210> 312
<211> 128
<212> PRT
<213> Bacterium species

<400> 312
Met Glu Thr Val Arg Lys Phe Leu Met Gly Phe Val Tyr Ala Trp His
1               5                   10                  15
Gly Ile Lys Arg Ala Leu Ser Gln Arg Asn Met Lys Phe His Ala Phe
            20                  25                  30
Val Ala Ala Met Val Val Leu Phe Gly Phe Phe Leu Gln Ile Ser Phe
        35                  40                  45
Val Glu Trp Val Val Val Ile Val Leu Ile Ala Leu Val Phe Ala Ala
    50                  55                  60
Glu Met Phe Asn Thr Ala Ile Glu Asn Glu Ala Asn Thr Met Arg Asp
65                  70                  75                  80
Lys Leu Gly Ala Pro Tyr Ser Leu Met Gly Ala Pro Lys Asp Leu Ala
                85                  90                  95
Ala Gly Ala Val Leu Val Leu Ala Ile Ala Ala Ala Ile Ile Gly Leu
            100                 105                 110
Ala Ile Phe Leu Pro Lys Leu Trp Val Leu Phe Gln Glu Ile Thr Met
        115                 120                 125

<210> 313
<211> 387
<212> DNA
<213> Bacterium species

<400> 313
atggagactg taagaaagtt tttaatgggt tttgtatacg cctggcatgg aataaaacgg     60
gctttaagtc agcgaaatat gaagtttcac gcttttgttg cggcgatggt ggttcttttt    120
ggctttttcc ttcaaatatc atttgtggaa tgggttgtgg tcatagtttt gattgcccta    180
gtttttgctg ccgaaatgtt taacaccgcg attgaaaacg aggccaatac tatgcgcgat    240
aagcttggcg cgccgtactc gttgatgggt gcccctaaag acttagctgc cggtgcggtt    300
ttagttttag caattgccgc tgcaataatt ggccttgcga tttttctgcc caaactttgg    360
gttctttttc aagaaatcac aatgtag                                        387

<210> 314
<211> 397
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 314
atggaaaccg tgcgtaaatt cctgatgggc ttcgtgtatg catggcatgg cattaaacgt     60
gccctgagcc agcgtaatat gaaattccat gccttcgttg cagcaatggt ggtgctgttc    120
ggcttcttcc tgcaaattag cttcgttgaa tgggttgtgg tgattgtgct gattgccctg    180
gtgttcgccg ccgaaatgtt caataccgcc attgaaaatg aagccaatac catgcgtgat    240
aaactgggcg ccccgtatag cctgatgggt gccccgaaag acttagcagc aggcgcagtt    300
ctggttctgg ccattgcagc agccattatt ggcctggcca tcttcctgcc gaaactgtgg    360
gttctgttcc aggaaattac catgtgactg aaagctt                             397

<210> 315
<211> 120
<212> PRT
<213> Saprospirales bacterium

<400> 315
Met Lys Lys Trp Asp Phe Ile Gly Leu Arg Phe Ala Leu Ser Gly Leu
1               5                   10                  15
Phe Ile Leu Leu Gln Gln Arg Asn Phe Arg Ile Glu Ala Leu Val Ala
            20                  25                  30
Leu Leu Ala Ile Val Leu Gly Phe Tyr Leu Gln Ile Asn Ala Gln Ala
        35                  40                  45
Trp Leu Trp Ile Ser Leu Ala Ile Thr Leu Val Leu Val Gly Glu Ala
    50                  55                  60
Ile Asn Thr Ala Ile Glu Leu Thr Leu Asp Arg Ile Gly Arg Asp Phe
65                  70                  75                  80
His Pro Thr Thr Lys Arg Ala Lys Asp Ile Ala Ala Gly Ala Val Val
                85                  90                  95
Leu Cys Cys Leu His Ala Ala Ile Ile Gly Phe Val Val Phe Gly Pro
            100                 105                 110
Lys Leu Trp Ala Leu Leu Phe Ala
        115                 120

<210> 316
<211> 363
<212> DNA
<213> Saprospirales bacterium

<400> 316
atgaaaaaat gggactttat cggcctccga tttgccctta gtggcctttt tattctttta     60
caacagcgaa actttcgcat tgaagcactg gttgcacttc ttgcaattgt tttggggttt    120
tatcttcaaa ttaacgcgca ggcctggcta tggattagcc ttgccattac tctcgttctt    180
gtaggagaag ccatcaatac cgctattgag ttaaccttag atagaatagg aagggatttc    240
catccgacaa caaaacgagc taaagacatt gccgcaggag cagttgtttt atgttgtctc    300
catgctgcta ttattggatt tgtagtcttt ggccctaagc tttgggcatt gctctttgca    360
taa                                                                  363

<210> 317
<211> 373
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 317
atgaagaagt gggacttcat tggtctgcgc ttcgcactga gtggtctgtt cattctgctg     60
caacagcgca acttccgtat tgaagcactg gtggccctgc tggccattgt tctgggcttc    120
tatctgcaaa ttaatgccca ggcctggctg tggattagcc tggccattac cctggtgctg    180
gttggtgaag ccattaatac cgcaattgaa ctgaccctgg atcgtattgg tcgcgacttc    240
catccgacca ccaaacgcgc caaagatatt gcagcaggcg cagtggtgct gtgctgtctg    300
catgcagcaa ttattggctt cgttgtgttc ggtccgaaac tgtgggccct gctgttcgca    360
tgactgaaag ctt                                                       373

<210> 318
<211> 114
<212> PRT
<213> Staphylococcus aureus

<400> 318
Met Lys Arg Phe Lys Tyr Ala Leu Asp Gly Leu Lys Ile Leu Ile Gln
1               5                   10                  15
Lys Asp Tyr Lys Phe Leu Leu His Val Phe Ala Met Ile Val Ala Ile
            20                  25                  30
Val Phe Gly Leu Val Leu Asn Ile Asn Arg Ile Glu Trp Ile Phe Ile
        35                  40                  45
Leu Ile Ala Ile Ala Leu Val Leu Thr Val Glu Ala Leu Asn Thr Ala
    50                  55                  60
Ile Glu Tyr Val Val Asp Leu Val Thr Val Glu Tyr His Asp Leu Ala
65                  70                  75                  80
Lys Tyr Ala Lys Asp Ile Ala Ala Phe Ser Val Leu Ile Val Ser Ile
                85                  90                  95
Leu Ala Phe Ile Ile Gly Leu Ile Val Phe Leu Pro His Phe Ile Ala
            100                 105                 110
Leu Phe

<210> 319
<211> 345
<212> DNA
<213> Staphylococcus aureus

<400> 319
atgaaaaggt ttaaatatgc acttgatggg ctgaaaatct taattcaaaa agactataaa     60
tttcttttac atgtgtttgc aatgattgtt gctattgtct ttggtctcgt actaaatatt    120
aatcggattg agtggatatt tatactcatt gctattgcat tagttctcac tgttgaagct    180
ttaaacactg ctattgaata tgttgtcgat ttagtgaccg ttgaatatca tgatttagct    240
aaatacgcta aagatattgc ggcttttagt gtacttatag tttcaatatt agcatttatt    300
ataggtttaa tagtattttt accacatttt atagcgttat tttag                    345

<210> 320
<211> 355
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 320
atgaagcgct tcaaatatgc cctggatggc ctgaaaattc tgattcagaa agattataag     60
ttcctgctgc atgtgttcgc aatgattgtt gcaattgtgt tcggcctggt gctgaatatt    120
aatcgcattg aatggatctt catcctgatt gccattgccc tggttctgac cgtggaagcc    180
ctgaataccg caattgaata tgtggttgat ctggttaccg ttgaatatca tgatctggcc    240
aaatatgcaa aagatattgc cgcattcagc gttctgattg ttagcattct ggccttcatt    300
attggcctga ttgtgttcct gccgcacttc attgcactgt tctgactgaa agctt         355

<210> 321
<211> 117
<212> PRT
<213> Sphingobacteriaceae bacterium

<400> 321
Met Gly Tyr Phe Lys Asn Arg Ile Asn Ala Phe Gly Tyr Ala Phe Ser
1               5                   10                  15
Gly Ile Tyr Gln Ala Phe Arg Gln Glu Thr His Leu Lys Ile His Ala
            20                  25                  30
Val Ile Ala Leu Leu Val Ile Gly Leu Ala Ala Phe Phe Glu Val Cys
        35                  40                  45
Asn Glu His Trp Ile Leu Leu Leu Leu Ala Ile Thr Leu Val Ile Ala
    50                  55                  60
Leu Glu Met Leu Asn Ser Ala Val Glu Lys Leu Cys Asn Ile Ile Lys
65                  70                  75                  80
Pro Glu Leu Asp Pro Arg Ile Lys Tyr Ile Lys Asp Val Ser Ala Gly
                85                  90                  95
Ala Val Leu Ile Val Cys Leu Phe Ala Val Ala Ala Gly Ile Ile Val
            100                 105                 110
Phe Ser His Tyr Phe
        115

<210> 322
<211> 354
<212> DNA
<213> Sphingobacteriaceae bacterium

<400> 322
atgggctact ttaagaatag aataaatgct tttggctacg cattttcagg catttaccaa     60
gcttttcggc aggaaacgca tcttaagata catgccgtaa ttgccctttt agtgatcgga    120
ctcgctgctt tttttgaagt ctgtaacgaa cattggatcc tactactttt agctatcaca    180
ttggtgattg ccctagagat gctaaattca gccgttgaga agctctgcaa tattattaag    240
cctgaacttg atccaagaat taaatatatt aaagacgttt ctgccggcgc ggttcttata    300
gtttgtttgt ttgctgttgc tgccggaata attgtttttt cacattattt ttga          354

<210> 323
<211> 364
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 323
atgggttact tcaaaaaccg tattaacgcc ttcggctatg ccttcagtgg tatctatcag     60
gcattccgcc aggaaacaca tctgaaaatt catgcagtga ttgccctgct ggttattggc    120
ctggccgcat tcttcgaagt gtgcaatgaa cattggattc tgctgctgct ggccattacc    180
ctggttattg cactggaaat gctgaatagt gcagttgaaa aactgtgtaa tatcattaag    240
ccggaactgg accctcgtat taaatatatt aaagatgtga gcgcaggcgc agtgctgatt    300
gtgtgtctgt tcgccgtggc cgcaggtatt attgtgttca gtcattactt ctgactgaaa    360
gctt                                                                 364

<210> 324
<211> 248
<212> PRT
<213> Clostridium species

<400> 324
Met Lys Glu Lys Lys Lys Glu Ser Lys Glu Ile Thr Asn His Asn Phe
1               5                   10                  15
Ile Asp Ala Trp Lys Asn Ala Phe Asn Gly Ile Ile Tyr Ala Thr Thr
            20                  25                  30
Thr Gln Lys Asn Ile Gln Lys Gln Leu Ile Ile Ala Val Ile Val Val
        35                  40                  45
Ile Val Ser Leu Phe Phe Asn Leu Asn Arg Ala Glu Phe Leu Cys Phe
    50                  55                  60
Leu Phe Thr Ile Val Leu Ile Ile Phe Ala Glu Met Val Asn Thr Ala
65                  70                  75                  80
Ile Glu Thr Val Val Asp Leu Tyr Val Asp Val Tyr His Pro Lys Ala
                85                  90                  95
Lys Ile Ala Lys Asp Val Ala Ala Gly Gly Val Val Ile Thr Thr Ile
            100                 105                 110
Asn Ala Ile Ile Val Ala Tyr Phe Leu Phe Phe Asp Lys Ile Ala Asp
        115                 120                 125
Ile Gly Leu Thr Phe Leu Lys Asn Val Thr Thr Asn Pro Met His Leu
    130                 135                 140
Ala Phe Ser Ile Met Ile Ile Ala Ile Ile Ala Val Leu Ala Leu Ile
145                 150                 155                 160
Ala Tyr Ala Lys Thr Asn Lys His Lys Gly Leu Asn Lys Lys Met Val
                165                 170                 175
Pro Ser Gly His Ala Thr Ile Gly Phe Ala Ala Asn Thr Leu Ile Trp
            180                 185                 190
Leu Leu Thr Asp Asn Ile Val Ile Leu Met Leu Ser Leu Leu Thr Ala
        195                 200                 205
Ile Leu Leu Ala Glu Ser Arg Ile Ala Ala Lys Glu His Thr Leu Ser
    210                 215                 220
Glu Ile Ile Phe Ser Gly Cys Phe Ala Thr Ile Leu Val Leu Ile Leu
225                 230                 235                 240
Tyr Gly Ile Ala Met Ala Ile Val
                245

<210> 325
<211> 747
<212> DNA
<213> Clostridium species

<400> 325
atgaaagaaa aaaagaagga aagcaaagaa ataacaaacc acaattttat agatgcgtgg     60
aaaaatgcat ttaatggaat aatatatgca acaacaactc aaaaaaacat acaaaaacaa    120
ttaattattg cagtaattgt tgttattgtt agcttgtttt ttaatttgaa tcgtgctgaa    180
tttctatgct ttctattcac aattgttcta ataatatttg ctgaaatggt taatacagca    240
atagaaacag ttgttgattt atatgttgat gtatatcatc caaaagctaa aatagcaaaa    300
gatgttgcag caggaggagt agtaattaca acaataaatg caataattgt ggcatatttc    360
ttattttttg ataaaattgc ggatatagga ctaacatttt tgaaaaatgt aacaacaaat    420
cctatgcatt tagctttttc tattatgatt attgctataa ttgcagtttt agcattaatt    480
gcttatgcaa aaacaaacaa acataaagga ttaaacaaaa aaatggttcc aagtggtcat    540
gcaactattg gctttgcagc taatactctt atatggcttt taactgataa tatagttatt    600
ttaatgttgt cattattaac tgcaatttta ttagcagaaa gtagaatagc agcaaaagaa    660
catacattgt cagaaataat atttagtggt tgttttgcaa caatattagt attaatatta    720
tatggaattg caatggcaat agtttaa                                        747

<210> 326
<211> 757
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 326
atgaaggaaa agaagaagga aagcaaagaa attaccaatc ataacttcat cgacgcctgg     60
aaaaatgcat tcaatggcat tatctatgcc accaccaccc agaaaaatat tcagaaacag    120
ctgattatcg ccgttattgt ggttattgtg agtctgttct tcaatctgaa tcgcgcagag    180
ttcctgtgct tcctgttcac cattgtgctg attatcttcg cagaaatggt taataccgca    240
attgaaaccg ttgttgatct gtatgtggat gtgtatcatc cgaaagccaa aattgccaaa    300
gatgttgcag caggtggcgt tgttattacc accattaatg ccattattgt tgcctacttc    360
ctgttcttcg ataaaattgc agatattggt ctgaccttcc tgaaaaatgt gaccaccaat    420
ccgatgcatc tggccttcag cattatgatt attgcaatta ttgcggtgct ggccctgatt    480
gcctatgcaa aaaccaataa acataagggc ctgaataaaa aaatggttcc gagtggccat    540
gccaccattg gcttcgcagc aaataccctg atctggctgc tgaccgataa tattgtgatt    600
ctgatgctga gcctgctgac cgcaattctg ctggccgaaa gtcgtattgc agccaaagaa    660
cataccctga gcgaaattat cttcagcggc tgcttcgcca ccattctggt tctgattctg    720
tatggcattg caatggccat tgtgtgactg aaagctt                             757

<210> 327
<211> 295
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 327
Met Thr Asn Ala Ala Thr Ser Ser Gly Leu Thr Val Ser Val Arg His
1               5                   10                  15
Pro Ala Ala Gly Leu Ala Arg Arg Asn Leu Leu Gly Val Ser Arg Thr
            20                  25                  30
Asn Pro Gly Leu Arg Gly Gly Arg Ala Asn Pro Trp Asn Leu Asn Arg
        35                  40                  45
Ile Ile Pro Ala Glu Glu Ser Met Glu Val Leu Ala Ser Ser Thr Val
    50                  55                  60
Leu Ala Gly Cys Ala Gln Val Ile Asp Arg Leu Arg Ala Asp Pro Pro
65                  70                  75                  80
Ala Val His Ala Ile Thr Ser Pro Val Ala Ala Glu Arg Thr Ala Asn
                85                  90                  95
Thr Leu Leu Ala Leu Gly Ile Arg Pro Ser Leu Thr Val Asn Pro Asp
            100                 105                 110
Glu Val Ala Ala Phe Val Ala Val Ser Asp Ala Leu Leu Val Asn Leu
        115                 120                 125
Gly Met Leu Asp Pro Val Arg Glu Ala Ala Ile Asp Arg Ala Val Ala
    130                 135                 140
Glu Ala Ala Arg Gln Ala Arg Pro Trp Val Leu Asp Pro Val Phe Ala
145                 150                 155                 160
Glu Val Ser Pro Ala Arg Ala Ala Leu Thr Arg Ala Leu Leu Ala Arg
                165                 170                 175
Gly Pro Ala Ala Leu Lys Ala Asn Ala Gln Glu Ala Gly Leu Ala Ala
            180                 185                 190
Asp Ala Pro Ile Thr Thr Val Ala Ile Val Thr Gly Ala Glu Asp Arg
        195                 200                 205
Leu Arg Leu Gly Ala Arg Asn Ile Gly Ile Ala Asn Gly His Pro Leu
    210                 215                 220
Ala Ala Ser Val Thr Ala Met Gly Cys Ala Leu Gly Ala Leu Val Ala
225                 230                 235                 240
Ala Cys Leu Cys Arg Ala Asp Asp Pro Leu Arg Leu Ser Pro Thr Ala
                245                 250                 255
Ser Pro Pro Asn Arg Arg Gln Pro Glu Gln Ala Ala Ala Glu Ser Glu
            260                 265                 270
Gly Pro Gly Ser Phe Ala Val Ala Phe Tyr Asp Cys Pro Ala Ile Asp
        275                 280                 285
Arg Gly Glu Thr Ile Val Lys
    290                 295

<210> 328

<400> 328
000

<210> 329
<211> 894
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 329
atgaccaatg ccgccaccag cagcggtctg accgttagcg tgcgtcatcc ggccgcaggt     60
ctggcacgtc gtaatctgct gggcgttagt cgtaccaatc cgggtctgcg cggtggtcgt    120
gcaaatccgt ggaatctgaa tcgcattatt ccggcagaag aaagtatgga agttctggca    180
agcagcaccg tgctggcagg ctgcgcacag gtgattgatc gtctgcgtgc agatccgccg    240
gccgtgcatg caattaccag cccggttgcc gcagaacgca ccgccaatac cctgctggcc    300
ctgggtattc gtccgagtct gaccgtgaat ccggatgaag ttgcagcctt cgttgccgtg    360
agcgatgccc tgctggttaa tctgggtatg ctggaccctg ttcgcgaagc agccattgat    420
cgtgccgtgg ccgaagcagc ccgccaggca agaccgtggg ttctggaccc tgtgttcgca    480
gaagttagtc cggcccgtgc cgccctgacc cgtgcattac tggcacgcgg tccggcagca    540
ctgaaagcca atgcccagga agccggcctg gccgcagatg ctccgattac caccgttgca    600
attgttaccg gcgccgaaga tcgtctgcgc ctgggtgccc gtaatattgg tattgcaaat    660
ggtcatccgc tggcagcaag cgttaccgca atgggttgtg cactgggcgc cctggttgcc    720
gcatgtctgt gtcgcgccga tgatccgctg cgcctgagtc cgaccgccag ccctcctaat    780
cgccgccagc cggaacaggc cgccgcagag tcagaaggcc cgggtagctt cgccgttgcc    840
ttctatgatt gtccggccat tgatcgcggc gaaaccattg ttaaataagg atcc          894

<210> 330
<211> 227
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 330
Met Asp Ala Val Arg Arg Arg Leu Ser Arg Ser Arg Arg Arg Ala Gln
1               5                   10                  15
Pro Ala Gly Met Gly Arg His Ala Ala Ala Leu Arg Gln Ala Ala Gly
            20                  25                  30
Gly Ala Arg Pro Arg Pro Leu Arg Pro Leu Arg Arg Leu His Gly Arg
        35                  40                  45
Gly Gly Ala Asp Pro Ala Gly Ala Gly Gly Arg Gly Asp Arg Gly Gly
    50                  55                  60
Thr Ala Val Ser Asp Ser Asp Gly Met Ile Met Ser Asp Ser Ala Leu
65                  70                  75                  80
Asp Thr Ala Ala Gly Leu Leu Glu Arg Leu Arg Thr Arg Arg Pro Arg
                85                  90                  95
Val His Cys Leu Met Asn Thr Val Val Gln Lys Leu Val Ala Asp Gly
            100                 105                 110
Leu Ser Ala Leu Gly Ala Ile Pro Ser Met Thr Ser Ser Arg Glu Glu
        115                 120                 125
Val Ala Ala Phe Val Arg Lys Ala Asp Ala Leu Cys Val Asn Leu Gly
    130                 135                 140
Thr Leu Asp Ala Ala Arg Arg Glu Ala Ile Asn Leu Ala Leu Glu Ala
145                 150                 155                 160
Ala Ser Glu Ala Gly Arg Pro Trp Ala Leu Asp Pro Ala His Cys Asp
                165                 170                 175
Tyr Ser Pro Pro Arg Ala Ala Phe Ala Gln Glu Leu Leu Ala Arg Gly
            180                 185                 190
Pro Ala Val Leu Arg Ala Asn Pro Ala Glu His Val Leu Leu Ala Val
        195                 200                 205
Pro Ala Asp Ile Val Gly Val Val Thr Val Leu Asp Arg Asp Arg Ala
    210                 215                 220
Leu Pro Lys
225

<210> 331

<400> 331
000

<210> 332
<211> 690
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 332
atggatgcag tgcgccgtcg tctgagccgc agtcgccgtc gtgcacagcc ggctggtatg     60
ggccgccatg cagccgctct gcgccaggca gcaggtggtg cacgtccgcg tccgctgcgt    120
cctctgcgtc gtctgcatgg tcgtggcggt gccgatccgg caggtgcagg tggtcgcggt    180
gatcgtggtg gtaccgccgt gagcgatagt gatggcatga ttatgagtga tagcgcactg    240
gataccgccg caggtctgct ggaacgtctg cgtacccgcc gcccgcgtgt tcattgcctg    300
atgaataccg ttgtgcagaa actggttgca gatggtctga gcgcactggg tgcaattccg    360
agtatgacca gcagccgtga agaagtggca gcattcgttc gcaaagcaga tgcactgtgc    420
gtgaatctgg gcaccctgga tgccgcacgt cgtgaagcaa ttaatctggc cctggaagca    480
gcaagtgaag caggccgtcc gtgggcactg gaccctgctc attgcgatta tagcccgccg    540
cgcgccgcct tcgcacagga gttactggca cgtggcccgg ccgttctgcg cgcaaatccg    600
gctgaacatg tgctgctggc agtgccggcc gatattgtgg gcgttgttac cgtgctggat    660
cgcgatcgtg ccctgccgaa ataaggatcc                                     690

<210> 333
<211> 288
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 333
Met Ser Gln Pro Phe Asp Val Leu Arg Ala Trp Arg Ile Leu Gln Arg
1               5                   10                  15
Val Arg Glu Gln Asn Pro Leu Ile His Cys Val Thr Asn Gln Val Val
            20                  25                  30
Met Asn Phe Thr Ala Asn Val Leu Tyr Ala Val Gly Ala Ser Pro Leu
        35                  40                  45
Met Ser Gln Ala Pro Glu Glu Ala Asp Glu Leu Ala Lys Ala Arg Ala
    50                  55                  60
Asn Leu Leu Val Asn Ile Gly Thr Leu Thr Arg Ala Trp Leu Val Asp
65                  70                  75                  80
Val Arg Glu Val Val Arg Ala Glu Gln Ser Leu Gly Arg Gly Arg Pro
                85                  90                  95
Arg Ala Val Leu Asp Pro Val Gly Ala Gly Phe Thr His Phe Arg Thr
            100                 105                 110
Glu Ala Ala His Glu Leu Leu Lys Thr Gly Val Phe Gly Thr Leu Arg
        115                 120                 125
Ala Asn Ala Phe Glu Val Met Lys Leu Ala Gly Val Ala Ala Arg Gly
    130                 135                 140
Gln Gly Val Asp Ser Asn Glu Ser Ser Leu Glu Ala Ala Gly Ala Ala
145                 150                 155                 160
Gly Gln Leu Ala Arg Lys Tyr Gly Ile Val Val Ala Val Ser Gly Val
                165                 170                 175
Val Asp Tyr Val Thr Asp Gly Arg Arg Glu Ile Trp Leu Arg Thr Gly
            180                 185                 190
His Pro Leu Leu Thr Arg Val Thr Gly Thr Gly Cys Ala Leu Asn Ala
        195                 200                 205
Val Ile Ala Ala Ala Thr Ala Val Asp Asp Asp Pro Leu Asp Ala Ala
    210                 215                 220
Ala Ala Ala Leu Ala Ile Phe Gly Thr Ala Ala Leu Lys Ala Ala Arg
225                 230                 235                 240
Asn Pro Glu Gly Glu Pro Gly Pro Gly Ser Phe Ala Ala Gly Phe Leu
                245                 250                 255
Asp Glu Leu Ala Arg Leu Glu His Ser Asp Leu Glu Ala Phe Trp Gln
            260                 265                 270
Val Glu Gln Thr Ser Ser Ser Ser Pro Asp Pro Met Glu Ser Ala Arg
        275                 280                 285

<210> 334

<400> 334
000

<210> 335
<211> 873
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 335
atgagtcagc cgttcgatgt gctgcgcgcc tggcgcattc tgcaacgtgt tcgtgaacag     60
aatccgctga ttcattgtgt gaccaatcag gttgtgatga acttcaccgc caatgtgctg    120
tatgccgttg gtgcaagtcc gctgatgagc caggcaccgg aagaagccga tgaactggcc    180
aaagcacgtg ccaatctgct ggttaatatt ggtaccctga cccgtgcctg gctggtggat    240
gttcgtgaag ttgttcgcgc cgaacagagt ctgggccgtg gccgtccgcg tgctgtgtta    300
gatccggtgg gtgccggctt cacccacttc cgtaccgaag ccgcccatga actgctgaaa    360
accggcgtgt tcggcaccct gcgtgccaat gcattcgaag tgatgaaact ggccggtgtt    420
gccgcccgcg gtcagggtgt tgatagcaat gaaagcagcc tggaagccgc cggtgcagca    480
ggtcagctgg cacgtaaata tggcattgtt gttgccgtta gcggtgtggt tgattatgtt    540
accgatggtc gtcgcgaaat ctggctgcgc accggccatc cgctgctgac ccgtgtgacc    600
ggtaccggtt gcgcactgaa tgcagtgatt gcagcagcaa ccgcagtgga tgatgatccg    660
ctggatgcag cagccgccgc actggcaatc ttcggcaccg cagcactgaa agcagcccgt    720
aatccggaag gtgaaccggg cccgggcagc ttcgcagctg gcttcctgga tgaactggca    780
cgtctggaac atagtgatct ggaagcattc tggcaggttg aacagaccag cagtagtagc    840
ccggacccta tggaaagtgc ccgctaagga tcc                                 873

<210> 336
<211> 223
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 336
Met Phe Phe Lys Gln Lys Thr Ala Tyr Glu Ile Ser Leu Arg Leu Val
1               5                   10                  15
Gly Ser Glu Met Cys Ile Arg Asp Ser Leu Thr Pro Ala Ala Leu Asp
            20                  25                  30
Ala Met Arg Leu Ala Gly Arg Glu Ala Asn Arg Cys Gly Val Pro Val
        35                  40                  45
Val Leu Asp Pro Val Gly Ala Gly Ala Thr Pro Tyr Arg Thr Glu Ala
    50                  55                  60
Ala Arg Gln Leu Leu Glu Asp Ile Asn Val Ala Ile Val Arg Gly Asn
65                  70                  75                  80
Ser Gly Glu Val Ala Ala Ile Ile Gly Gln Gln Ala Val Val Arg Gly
                85                  90                  95
Val Glu Ser Leu Glu Thr Ala Leu Pro Ala Ala Glu Leu Gly Ala Gln
            100                 105                 110
Ala Ala Gln Gln Leu Gly Val Val Val Ala Leu Thr Gly Ala Arg Asp
        115                 120                 125
Ile Ile Ser Asp Gly Ser Val Ser Leu Ala Val Asp His Gly Ser Pro
    130                 135                 140
Trp Leu Lys Thr Ile Asp Gln Pro Thr Gln Pro Arg Lys Ala Ala Val
145                 150                 155                 160
Ala Cys Phe Ala Ala Val Ala Pro Ser Ser Leu Gln Ala Ala Ala Ala
                165                 170                 175
Ala Leu Ala Ala Tyr Gly Leu Ala Ala Glu Leu Ala His Lys Pro Gln
            180                 185                 190
Ile His Gly Pro Ala Ser Phe Lys Val Ala Leu Leu Asp Ala Val Tyr
        195                 200                 205
Gly Leu Thr Ala Glu Thr Leu Gln Arg Ala Lys Val Ser Val Leu
    210                 215                 220

<210> 337

<400> 337
000

<210> 338
<211> 678
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 338
atgttcttca agcagaaaac cgcatacgaa attagtctgc gcctggttgg tagcgaaatg     60
tgtattcgtg atagtctgac cccggcagca ctggatgcaa tgcgtctggc cggtcgtgaa    120
gccaatcgtt gtggtgttcc ggtggtgctg gaccctgttg gcgccggtgc aaccccgtat    180
cgtaccgaag cagcacgcca gctgctggaa gatattaatg tggcaattgt tcgtggtaat    240
agtggtgaag tggcagcaat tattggccag caggccgtgg tgcgtggtgt tgaaagtctg    300
gaaaccgccc tgccggccgc agaactgggt gctcaggccg ctcagcagct gggcgttgtt    360
gttgccctga ccggtgcacg tgatattatt agcgatggca gcgttagcct ggccgttgat    420
catggtagtc cgtggctgaa aaccattgat cagccgaccc agccgcgcaa agcagcagtt    480
gcctgcttcg ccgcagtggc cccgagtagc ctgcaagcag ccgccgctgc cctggcagct    540
tatggtctgg ccgcagaatt agcacataaa ccgcagattc atggcccggc cagcttcaaa    600
gtggccctgc tggatgcagt gtatggtctg accgccgaaa ccttacagcg cgccaaagtt    660
agcgttctgt aaggatcc                                                  678

<210> 339
<211> 234
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 339
Met Glu Ala Pro Phe Glu Pro Glu Pro Asp His Ala Gly Val Gly Ser
1               5                   10                  15
Leu Gln Gly Val Val Leu Pro Leu Pro Ser Ala Ser Ser Arg Arg Ala
            20                  25                  30
Glu Gly Glu Met Gln Asp Pro Gly Leu Tyr Leu Gln Arg Met Cys Glu
        35                  40                  45
Ala Ala Pro Leu Val Gln Asn Ile Thr Asn Phe Val Ala Met Thr Ile
    50                  55                  60
Met Ala Asn Val Leu Leu Ala Val Gly Ala Ser Pro Ala Met Val His
65                  70                  75                  80
Ala Arg Glu Glu Ala Ala Glu Phe Ala Gly Leu Ala Gln Ala Leu Thr
                85                  90                  95
Val Asn Ile Gly Thr Pro Asp Pro Ala Trp Ala Asp Ala Met Ala Glu
            100                 105                 110
Ala Ala Ala Val Ile Lys Ala Ala Gly Arg Pro Trp Val Leu Asp Pro
        115                 120                 125
Val Gly Val Gly Ala Thr Arg Phe Arg Gln Glu Ile Ala Ala Arg Leu
    130                 135                 140
Leu Asp Leu Gly Pro Ser Val Ile Arg Gly Asn Ala Ser Glu Ile Leu
145                 150                 155                 160
Ala Leu Ala Gly Leu Gly Gly Thr Gly Arg Gly Ala Asp Ala Ala Asp
                165                 170                 175
Ser Val Ala Ala Val Ser Tyr Thr His Leu Thr Leu Pro Thr Thr Arg
            180                 185                 190
Gly Ala Val Val Ala Ala Ser Gly Ala Val Asp Phe Val Thr Asp Gly
        195                 200                 205
Ser Arg Ala Phe Arg Val Ala Asn Gly His Pro Pro Val Tyr Tyr Thr
    210                 215                 220
His Leu Arg Ala His Glu Thr Lys Thr Ala
225                 230

<210> 340

<400> 340
000

<210> 341
<211> 711
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 341
atggaagccc cgttcgaacc ggaaccggat catgcaggtg tgggcagtct gcaaggcgtg     60
gttctgccgc tgccgagtgc cagtagccgc cgtgctgaag gtgaaatgca ggaccctggt    120
ctgtatctgc aacgcatgtg tgaagccgca ccgctggttc agaatattac caacttcgtg    180
gccatgacca ttatggcaaa tgttctgctg gccgttggtg ccagcccggc catggttcat    240
gcccgcgaag aagcagcaga gttcgccggt ctggcccagg ccctgaccgt gaatattggc    300
accccggacc ctgcatgggc cgatgcaatg gccgaagcag cagcagtgat taaagcagcc    360
ggtcgtccgt gggttctgga ccctgtgggt gtgggtgcaa cccgcttccg tcaggaaatt    420
gcagcccgtc tgctggatct gggtccgagt gtgattcgcg gtaatgccag tgaaattctg    480
gcactggccg gcctgggcgg caccggtcgt ggtgcagatg cagccgatag tgtggcagcc    540
gttagctata cccatctgac cctgccgacc acccgcggcg ctgttgttgc cgctagtggt    600
gccgttgact tcgtgaccga tggtagccgt gcattccgtg tggccaatgg tcatccgccg    660
gtgtattata cccatcttcg tgcccatgaa accaaaaccg cataaggatc c             711

<210> 342
<211> 271
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 342
Met Gly Leu Ser Gly Ser Ser Asn Arg Pro Ser Pro Ser Arg Arg Gln
1               5                   10                  15
Cys Pro Gly Ile Met Thr Arg Gly Gly Thr Val Thr Asp His Asp Asp
            20                  25                  30
Arg Ser Arg Leu Ala Arg Asp Thr Thr Arg Val Leu Ala Arg Leu Arg
        35                  40                  45
Ala Glu Arg Pro Arg Val His Cys Leu Thr Asn Lys Arg Gln Met Gln
    50                  55                  60
Leu Thr Ala Asn Leu Leu Leu Ala Ala Gly Ala Val Pro Ser Met Thr
65                  70                  75                  80
Met Asp Ala Arg Glu Met Pro Ala Phe Val Glu Thr Ser Arg Ala Leu
                85                  90                  95
Leu Val Asn Leu Gly Met Leu Asp Pro Trp Arg Glu Ala Ala Ile Pro
            100                 105                 110
Val Ala Ile Glu Ala Ala His Gly Leu Gly Arg Pro Trp Val Leu Asp
        115                 120                 125
Pro Val Lys Val Asp Arg Ala Pro Gly Arg Arg Ala Phe Ala Ser Ser
    130                 135                 140
Leu Leu Glu Arg Gly Pro Ala Val Leu Arg Cys Asn Ala Ala Glu Ala
145                 150                 155                 160
Glu Met Leu Glu Pro Gly Pro Gly Ile Val Thr Ala Val Thr Gly Ala
                165                 170                 175
Ala Asp Arg Ile Ser Gly Gly Gly Arg Glu Ile Gly Leu Gly Gly Gly
            180                 185                 190
Thr Ala Leu Met Asp Arg Val Thr Ala Met Gly Cys Ala Ala Ser Ala
        195                 200                 205
Leu Val Ala Ala Cys Leu Ala Val Glu Pro Asp Pro Phe Leu Ala Thr
    210                 215                 220
Val Ser Gly Leu Leu Val Met Lys Val Ala Gly Ala Ile Ala Ala Glu
225                 230                 235                 240
Ser Ala Ala Gly Pro Gly Ser Phe Val Pro Leu Phe Leu Asp Ala Val
                245                 250                 255
His Gly Leu Asp Ala Ala Thr Leu Gln Arg Arg Ala Glu Leu Ala
            260                 265                 270

<210> 343

<400> 343
000

<210> 344
<211> 822
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 344
atgggcctga gtggcagcag caatcgcccg agtccgagtc gccgccagtg ccctggtatt     60
atgacccgtg gcggtaccgt taccgatcat gatgatcgca gtcgcctggc acgcgatacc    120
acccgtgtgc tggcacgtct gcgtgccgaa cgtccgcgcg ttcattgtct gaccaataaa    180
cgtcagatgc agctgaccgc caatctgctg ctggccgcag gtgccgtgcc gtcaatgacc    240
atggatgcac gtgaaatgcc ggccttcgtt gaaaccagtc gcgcactgct ggttaatctg    300
ggtatgctgg acccttggcg tgaagccgca attccggttg ccattgaagc cgcacatggt    360
ctgggtcgtc cgtgggttct ggaccctgtt aaagtggatc gtgccccggg ccgtcgtgcc    420
ttcgcttcaa gtctgctgga acgcggcccg gccgtgctga gatgcaatgc agccgaagca    480
gaaatgctgg aaccgggccc gggcattgtt accgcagtta ccggtgccgc agatcgtatt    540
agtggtggtg gccgcgaaat tggtctgggc ggtggtaccg ccctgatgga tcgcgtgacc    600
gcaatgggct gcgccgcaag tgccctggtg gcagcttgtc tggcagtgga accggacccg    660
ttcctggcca ccgttagcgg cctgctggtg atgaaagttg caggcgcaat tgccgcagaa    720
agcgcagccg gtccgggtag cttcgttccg ctgttcctgg atgcagtgca tggcctggat    780
gcagcaaccc tgcaacgtcg tgccgaactg gcataaggat cc                       822

<210> 345
<211> 264
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 345
Met Lys Ala Ala Glu Ala Ile Trp Gln Ser Leu Ser Ala Val Arg Ala
1               5                   10                  15
Glu Ala Pro Leu Val His Asn Ile Thr Asn Phe Val Val Met Asn Pro
            20                  25                  30
Thr Ala Ser Lys Ala Leu Ala Val Gly Ala Ser Pro Val Met Val His
        35                  40                  45
Ala Ala Glu Glu Val Glu Gln Met Thr Ala Met Ala Arg Ala Leu Val
    50                  55                  60
Val Asn Leu Gly Thr Leu Ser Pro Pro Trp Ala Thr Ala Met Arg Leu
65                  70                  75                  80
Ala Val Ala Arg Ala Gly Thr Ala Gly Thr Pro Trp Val Leu Asp Pro
                85                  90                  95
Val Gly Val Gly Ala Leu Ala Trp Arg Thr Gly Ile Ala Ala Glu Leu
            100                 105                 110
Ile Arg Gln Gln Pro Ala Ala Val Arg Gly Asn Gly Ser Glu Ile Leu
        115                 120                 125
Ala Leu Ala Gly Ala Ala Gly Ala Ala Gly Lys Gly Val Asp Ser Arg
    130                 135                 140
His Asp Ser Ala Ala Ala Val Asp Ile Ala Arg Gly Leu Ala Arg Ala
145                 150                 155                 160
Thr Gly Ala Val Val Ala Ile Thr Gly Arg Val Asp Tyr Val Thr Asp
                165                 170                 175
Gly Glu Arg Gln Val Ala Ile Ala Asn Gly His Pro Leu Met Ala Arg
            180                 185                 190
Val Thr Gly Leu Gly Cys Thr Ala Thr Ala Leu Val Gly Ala Phe Cys
        195                 200                 205
Ala Val Glu Ala Asp Arg Leu Ala Ala Thr Ala His Ala Pro Ala Met
    210                 215                 220
Pro Ser Thr Ala Arg Ala Trp Ala Ala Glu Arg Ala Ala Gly Pro Gly
225                 230                 235                 240
Ser Leu Gln Val Gly Leu Leu Asp Ala Leu Tyr Thr Leu Asp Leu Asp
                245                 250                 255
Thr Ile Arg Gln Arg Glu Arg Ile
            260

<210> 346

<400> 346
000

<210> 347
<211> 801
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 347
atgaaggcag cagaagccat ctggcagagt ctgagcgccg ttcgtgccga agccccgctg     60
gttcataata ttaccaactt cgtggttatg aatccgaccg ccagcaaagc actggcagtg    120
ggtgccagtc cggtgatggt tcatgccgca gaagaagtgg aacagatgac cgcaatggca    180
cgtgcactgg ttgttaatct gggcaccctg agcccgccgt gggccaccgc aatgcgtctg    240
gctgtggcac gtgcaggcac cgccggtacc ccttgggttc tggaccctgt gggtgtgggt    300
gcactggcat ggcgtaccgg tattgcagca gaactgattc gccagcagcc ggcagcagtg    360
cgtggtaatg gtagcgaaat tctggccctg gcaggcgccg caggcgcagc aggtaaaggt    420
gtggatagtc gtcatgatag cgcagcagcc gttgatattg cccgcggtct ggcacgcgcc    480
accggtgcag ttgttgccat taccggccgc gtggattatg ttaccgatgg cgaacgccag    540
gttgcaattg caaatggtca tccgctgatg gcccgcgtta ccggtctggg ctgcaccgct    600
accgcactgg ttggtgcatt ctgtgcagtt gaagcagatc gcctggcagc caccgcccat    660
gcaccggcaa tgccgagcac cgcccgcgca tgggctgctg aacgcgcagc aggtccgggc    720
agtctgcaag ttggtctgct ggatgcactg tataccctgg atctggatac cattcgccag    780
cgcgaacgca tttaaggatc c                                              801

<210> 348
<211> 268
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 348
Met Gln Glu Pro Thr Ala Pro Leu Arg Gln Phe Val Ser Pro Ala Glu
1               5                   10                  15
Leu Val Ser Ala Ala Ala Ala Val Leu Ala Arg Val Arg Ala Lys Ser
            20                  25                  30
Pro Arg Val His Cys Ile Thr Asn Ser Val Ala Glu Asn Phe Thr Ala
        35                  40                  45
Asn Val Leu Leu Ala Leu Gly Ala Val Pro Ser Met Thr Leu Ser Pro
    50                  55                  60
Ile Glu Ile Gly Ala Phe Val Gly Arg Ala Asp Ala Leu Leu Val Asn
65                  70                  75                  80
Leu Gly Thr Phe Gly Arg Glu Arg Arg Glu Ala Thr Ser Ile Ala Val
                85                  90                  95
Asp Thr Ala Val Gln Gly Gly Leu Pro Trp Val Leu Asp Pro Val Phe
            100                 105                 110
Val Asp Arg Ala Pro Pro Arg Ala Thr Tyr Ala Gln Asp Leu Leu Phe
        115                 120                 125
Met Gly Pro Thr Ala Met Arg Leu Asn Ala Ala Glu Phe Ala Ala Leu
    130                 135                 140
Ala Gly Ala Gly Asp Glu Leu Asp Asp Val Arg Gly Tyr Ala Arg Glu
145                 150                 155                 160
Arg Gly Ile Ala Ile Gly Leu Ser Gly Ala Arg Asp Leu Ile Ala Asp
                165                 170                 175
Gly Arg Arg Ala Ala Ser Ile Ala Asn Gly Asp Ala Met Met Ala Arg
            180                 185                 190
Ile Thr Ala Met Gly Cys Ala Ala Ser Ala Met Val Gly Ala Cys Leu
        195                 200                 205
Ala Val Glu Lys Asp Ala Phe Val Ala Thr Ala Ala Ala Leu Leu Ile
    210                 215                 220
Val Gly Val Ala Gly Glu Met Ala Ala Glu Thr Ala Arg Gly Pro Gly
225                 230                 235                 240
Ser Phe Ala Val Ala Ile Leu Asp Ala Leu Tyr Ser Ile Asp Gly Asp
                245                 250                 255
Ala Leu Ala Ala Arg Ala Arg Ile Ser Leu His Glu
            260                 265

<210> 349
<211> 804
<212> DNA
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 349
atgcaggaac cgacagcccc attgcgccag ttcgtatcgc ccgcggagct tgtctccgcg     60
gcggccgccg tgctcgcgcg ggtgcgcgcg aaatcgcccc gtgtccactg catcaccaat    120
tcggtggcgg agaatttcac cgccaatgtg ctgctggcgc tgggcgccgt tccctccatg    180
actttgtcgc ccatcgagat cggggccttt gtgggccgtg ccgatgcgct gctggtcaat    240
ctcggtacct tcggccgcga gcggcgcgag gcgacgtcga tcgcggtcga tacggcggtg    300
cagggcgggc tgccttgggt tctcgatccg gtctttgtcg accgcgcacc gccgcgcgcg    360
acctacgccc aggatctgtt gtttatgggc ccgacggcga tgcggctcaa cgcggccgag    420
tttgccgcgc tggccggcgc cggcgatgag ctcgacgatg tgcgcggcta cgcacgcgag    480
cgcggcatcg cgatcggcct gtcgggcgcg cgcgacctga tcgcggacgg ccgccgcgcc    540
gcgtcgatcg ccaatggcga tgcgatgatg gcgcggatca cggcgatggg ctgcgccgcc    600
tcggcgatgg tcggcgcctg cctcgcggtc gagaaggacg ccttcgtcgc caccgcagcg    660
gcgctgctga tcgtcggcgt tgccggcgag atggcggcgg aaaccgcgcg cggccccggc    720
agcttcgcgg tggcgatcct cgatgcgctc tattccatcg acggcgacgc gctcgcggcg    780
cgggcccgca tttccctgca cgag                                           804

<210> 350
<211> 813
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 350
atgcaggaac cgaccgcccc gctgcgtcag ttcgttagcc cggccgaact ggttagcgcc     60
gcagccgctg tgctggcccg tgttcgtgcc aaaagcccgc gcgttcattg tattaccaat    120
agcgtggccg aaaacttcac cgcaaatgtg ctgctggccc tgggtgccgt tccgagtatg    180
accctgagcc cgattgaaat tggtgccttc gttggtcgtg ccgatgcact gctggttaat    240
ctgggcacct tcggtcgtga acgccgcgaa gcaaccagta ttgcagtgga taccgccgtg    300
cagggtggcc tgccgtgggt tctggaccct gtgttcgttg atcgtgcccc gccgcgcgca    360
acctatgccc aagacttact gttcatgggt ccgaccgcca tgcgcctgaa tgccgcagag    420
ttcgcagccc tggccggcgc aggtgatgaa ctggatgatg tgcgcggtta tgcccgcgaa    480
cgtggcattg caattggtct gagcggtgca cgcgatctga ttgccgatgg tcgtcgtgca    540
gccagcattg ccaatggcga tgcaatgatg gcacgtatta ccgccatggg ctgcgccgcc    600
agcgccatgg ttggcgcatg tctggccgtt gaaaaagatg ccttcgttgc caccgcagca    660
gccctgctga ttgttggtgt tgccggcgaa atggccgcag aaaccgcacg cggtccgggc    720
agcttcgcag tggcaattct ggatgcactg tatagcattg atggcgatgc cctggcagcc    780
cgtgcccgta ttagtctgca tgaataagga tcc                                 813

<210> 351
<211> 269
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 351
Met Ile Asp Ala Lys Ser Leu Ala Ala Asp Leu Ala Ala Ile Arg Glu
1               5                   10                  15
Lys Asn Pro Leu Val Leu Ser Val Thr Asn Asn Val Val Thr Asn Thr
            20                  25                  30
Thr Ala Asn Ala Leu Leu Ala Leu Gly Ala Ser Pro Ala Met Ser His
        35                  40                  45
Ala Gln Glu Glu Met Glu Glu Leu Pro Ala Phe Ala Gly Ala Ile Val
    50                  55                  60
Leu Asn Ile Gly Thr Pro Ala Arg Glu Tyr Val Glu Ala Met Ile Arg
65                  70                  75                  80
Ala Ala Ala Thr Ala Ser Arg Leu Asn Ile Pro Ile Ile Leu Asp Pro
                85                  90                  95
Val Ala Ala Gly Val Thr Arg His Arg Asn Arg Val Leu Lys Asn Leu
            100                 105                 110
Leu Asp Asp Phe Pro Met Ala Ile Ile Arg Gly Asn Ala Ser Glu Ile
        115                 120                 125
Met Ala Leu Ala Gly Glu Ala Ala Thr Ala Lys Gly Ala Asp Ser Ala
    130                 135                 140
His Gly Ser Ser Glu Ala Val Asp Ala Ala Met Arg Leu Ala Arg Glu
145                 150                 155                 160
Arg Lys Thr Val Val Cys Val Ser Gly Glu Gln Asp Gln Ile Thr Asp
                165                 170                 175
Gly Ala Arg Leu Ile Arg Val Ser Gly Gly His Val Met Met Thr Lys
            180                 185                 190
Val Thr Gly Leu Gly Cys Thr Ala Ser Ala Val Ala Gly Ala Tyr Ala
        195                 200                 205
Ala Val Asn Arg Asp Tyr Leu Ala Ala Ala Ala His Ala Ala Ala Thr
    210                 215                 220
Met Lys Ile Ala Gly Glu Leu Ala Ala Asp Ile Ser Ala Gly Pro Gly
225                 230                 235                 240
Ser Leu Gln Leu His Phe Tyr Asp Ala Leu Tyr Ala Leu Thr Pro Glu
                245                 250                 255
Thr Ile Ala Ala Arg Leu Arg Leu Glu Glu Val Val Ser
            260                 265

<210> 352

<400> 352
000

<210> 353
<211> 816
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 353
atgatcgatg ccaaaagtct ggcagcagac ttagcagcca ttcgtgaaaa aaatccgctg     60
gttctgagcg tgaccaataa tgtggtgacc aataccaccg caaatgcact gctggcactg    120
ggtgccagtc cggcaatgag ccatgcccag gaagaaatgg aagaactgcc ggcattcgcc    180
ggcgcaattg tgctgaatat tggcaccccg gcccgtgaat atgttgaagc catgattcgt    240
gcagccgcaa ccgcaagccg cctgaatatt ccgattattc tggaccctgt ggcagcaggc    300
gttacccgcc atcgtaatcg tgttctgaaa aatctgctgg atgacttccc gatggccatt    360
attcgtggca atgccagcga aattatggcc ctggcaggtg aagcagccac cgccaaaggt    420
gccgatagtg cacatggtag tagtgaagcc gttgatgccg ccatgcgcct ggcacgtgaa    480
cgcaaaaccg tggtgtgcgt gagcggtgaa caggatcaga ttaccgatgg cgcacgcctg    540
attcgtgtga gtggcggcca tgttatgatg accaaagtta ccggtctggg ttgtaccgca    600
agcgcagttg caggtgcata cgctgccgtt aatcgtgatt atctggcagc agccgcacat    660
gccgcagcaa ccatgaaaat tgcaggtgaa ctggccgcag atattagcgc aggcccgggc    720
agtctgcaac tgcacttcta tgatgccctg tatgcactga ccccggaaac cattgcagcc    780
cgtctgcgtc tggaagaagt ggtgagctaa ggatcc                              816

<210> 354
<211> 198
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 354
Met Ser Ala Ile Arg Glu Arg Arg Pro Leu Val His Asn Ile Thr Asn
1               5                   10                  15
Phe Val Val Met Asn Asn Ser Ala Asn Ala Leu Leu Ala Leu Gly Ala
            20                  25                  30
Ser Pro Ala Met Val His Ser Ser Asp Glu Val Glu Asp Phe Val Ala
        35                  40                  45
Leu Ser Gln Ala Leu Val Val Asn Ile Gly Thr Leu Tyr Ser Glu Gln
    50                  55                  60
Ile Ala Ala Gly Lys Leu Ala Ala Ile Arg Ala Lys Ala Ala Gly Ile
65                  70                  75                  80
Pro Trp Val Phe Asp Pro Val Gly Ala Gly Ala Thr Pro Tyr Arg Gln
                85                  90                  95
Ala Ala Ala Val Ala Leu Ala Arg Leu Gly Pro Ser Ala Ile Arg Gly
            100                 105                 110
Asn Gly Ser Glu Ile Leu Ala Leu Ala Gln Gln Ala Arg Ala Gly Gln
        115                 120                 125
Gly Val Asp Ser Leu His Gly Ser Glu Ala Ala Leu Asp Ala Ala Arg
    130                 135                 140
Lys Leu Ala Glu Asp Ser Ser Ala Ala Ile Ala Ile Thr Gly Glu Val
145                 150                 155                 160
Asp Tyr Val Thr Asp Gly Arg Arg Val Val Glu Ile His Asn Gly His
                165                 170                 175
Ala Leu Met Thr Arg Val Thr Gly Leu Gly Cys Ser Ala Thr Ala Ile
            180                 185                 190
Ile Gly Ala Phe Leu Ala
        195

<210> 355

<400> 355
000

<210> 356
<211> 603
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 356
atgagcgcca ttcgtgaacg ccgtccgctg gtgcataata ttaccaactt cgtggttatg     60
aataacagtg caaatgccct gctggcactg ggcgcaagtc cggcaatggt tcatagcagc    120
gatgaagtgg aagacttcgt tgcactgagc caggcactgg tggtgaatat tggtaccctg    180
tatagcgaac agattgccgc aggcaaactg gcagcaattc gcgcaaaagc cgcaggtatt    240
ccgtgggtgt tcgatccggt tggtgccggt gcaaccccgt atcgccaggc cgcagccgtg    300
gctctggccc gtcttggtcc gagtgccatt cgtggcaatg gtagtgaaat tctggcactg    360
gcccagcagg cccgtgcagg tcagggtgtg gatagcctgc atggcagtga agcagccctg    420
gatgccgcac gcaaactggc cgaagatagc agtgccgcca ttgcaattac cggcgaagtg    480
gattatgtta ccgatggtcg tcgtgttgtt gaaattcata atggccatgc actgatgacc    540
cgtgtgaccg gcctgggttg tagcgccacc gccattattg gcgccttcct ggcctaagga    600
tcc                                                                  603

<210> 357
<211> 264
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 357
Met Ala Leu His Ala Val Ala Gln Glu Tyr Arg Gln Gln Ser Pro Leu
1               5                   10                  15
Val Phe Cys Leu Thr Asn Thr Val Val Ala Asn Phe Thr Ala Asn Ala
            20                  25                  30
Leu Leu Ala Ser Gly Ala Ser Pro Ala Met Thr Asp Leu Pro Gly Glu
        35                  40                  45
Ala Gly Pro Phe Ala Lys Ala Ala Ser Ala Val Leu Val Asn Leu Gly
    50                  55                  60
Thr Pro Ser Thr Glu Gln Leu Ala Ala Met Glu Glu Ala Val Gln Ser
65                  70                  75                  80
Ala Ser Ala Ala Gly Thr Pro Trp Ile Leu Asp Pro Val Ala Val Gly
                85                  90                  95
Ala Leu Pro Val Arg Thr Asp Phe Ala Arg Arg Ile Ala Arg Gln Arg
            100                 105                 110
Pro Ala Leu Ile Arg Gly Asn Ala Ser Glu Ile Leu Ala Leu Ala Gly
        115                 120                 125
Arg Gln Ser Ala Ser Arg Gly Val Asp Ala Leu Asp Asp Val Ser Ala
    130                 135                 140
Ala Leu Ala Ala Gly Arg Glu Leu Ala Glu Arg His Asp Cys Val Val
145                 150                 155                 160
Ala Ile Ser Gly Gln Ser Asp Ala Ile Ile Asp Ala Thr Arg Thr Val
                165                 170                 175
Leu Val His Thr Asn Gly Ile Gly Leu Thr Arg Ile Thr Gly Gly Gly
            180                 185                 190
Cys Ala Leu Gly Ala Phe Cys Ala Gly Met Ile Ala Val His Asp Asp
        195                 200                 205
Pro Phe Glu Ala Ala Ile Ala Ala His Gly Phe Tyr Gly Leu Ala Ala
    210                 215                 220
Glu Lys Ala Leu Glu Asn Ser Thr Gly Pro Gly Ser Phe Ala Val Ala
225                 230                 235                 240
Phe Ile Asp Ala Leu Ser Ala Thr Asp Pro Glu Glu Leu Lys Thr Leu
                245                 250                 255
Lys His Glu Glu Leu Asn His Ala
            260

<210> 358
<211> 792
<212> DNA
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 358
atggctttgc atgcagttgc ccaagaatac cggcaacaat cacctttggt tttctgcctg     60
accaacaccg tcgtagcaaa tttcaccgcc aatgccctgc tggcatccgg cgcgtccccg    120
gccatgacgg atctgcccgg agaggccggc cccttcgcaa aggcggcttc cgccgtcttg    180
gtgaatctgg gaactccgag caccgagcag ctggccgcca tggaagaagc cgtgcaatcg    240
gcgagcgcag caggcacccc atggatcctt gatccggtgg cggtcggcgc cctgccggtg    300
cgcacggatt tcgcgcggcg cattgcccgg caacgcccgg ccctgattcg cggcaacgcc    360
tcggaaatcc tggccctggc cggacggcaa tcggcttcac gcggtgtaga tgccttggat    420
gacgtctctg ccgccttggc ggctggacgc gagctggccg agcggcatga ctgcgtggtg    480
gccatttccg gccagtccga tgccatcatc gatgccacgc ggacggtcct ggtgcacacc    540
aacggaatcg gcttgacgcg gatcaccggc ggcggttgcg cgttgggcgc gttctgcgcc    600
ggcatgattg cggtccacga tgacccgttc gaggccgcca tcgcggccca cggcttctac    660
gggctggccg ctgaaaaggc cctggagaac agcacaggtc cgggcagctt cgccgtcgca    720
ttcattgatg cgctgtccgc aaccgatcca gaagaactta agactttgaa gcacgaggaa    780
ctgaaccatg ca                                                        792

<210> 359
<211> 801
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 359
atggccctgc atgcagtggc acaggaatat cgtcagcaga gcccgctggt gttctgtctg     60
accaataccg tggttgccaa cttcaccgca aatgccctgc tggcaagcgg tgcaagtccg    120
gccatgaccg atctgccggg cgaagcaggc ccgttcgcaa aagccgccag cgcagtgctg    180
gttaatctgg gcaccccgag caccgaacag ctggccgcta tggaagaagc cgtgcagagc    240
gccagcgccg caggtacacc gtggattctg gaccctgttg ccgtgggcgc cctgcctgtg    300
cgtaccgact tcgcacgtcg tattgcacgc cagcgtccgg ccctgattcg cggtaatgca    360
agcgaaattc tggcactggc cggtcgccag agcgccagtc gtggtgtgga tgccctggat    420
gatgtgagtg cagcactggc agcaggtcgt gaactggccg aacgtcatga ttgcgttgtt    480
gcaattagtg gtcagagcga tgcaattatt gatgcaaccc gtaccgttct ggtgcatacc    540
aatggcattg gtctgacccg tattaccggc ggtggttgtg cactgggtgc attctgtgcc    600
ggtatgattg cagtgcatga tgatccgttc gaagccgcaa ttgcagccca tggcttctat    660
ggtctggcag cagaaaaagc cctggaaaat agtaccggcc cgggtagctt cgccgttgca    720
ttcattgatg cactgagcgc caccgatccg gaagaactga aaaccttaaa acatgaagaa    780
ctgaatcacg cataaggatc c                                              801

<210> 360
<211> 256
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 360
Met Gln Ala Arg Asp Asp Asp Val Arg Ser Ala Ala Ala Glu Val Leu
1               5                   10                  15
Glu Arg Leu Arg Ala Arg Arg Pro Arg Val His Cys Ile Thr Asn Ala
            20                  25                  30
Val Ala Gln Ala Phe Thr Ala Asn Leu Leu Leu Ala Val Gly Ala Val
        35                  40                  45
Pro Ser Met Thr Ile Ser Pro Glu Glu Val Gly Asp Phe Val Ala Arg
    50                  55                  60
Ala Asp Ala Leu Leu Val Asn Leu Gly Thr Leu Glu Lys Ser Arg Arg
65                  70                  75                  80
Ala Ala Ile Gly Ile Ala Val Asp Thr Ala Asn Gly Glu His Val Pro
                85                  90                  95
Trp Leu Val Asp Pro Val Phe Val Asp Arg Ser Pro Leu Arg Ala Ser
            100                 105                 110
Phe Ala Gln Ala Leu Ile Leu Leu Arg Pro His Ala Val Arg Leu Asn
        115                 120                 125
Gly Asp Glu Phe Ala Ala Leu Ala Gly Ala Glu Gly Ala Thr Glu Ala
    130                 135                 140
Val Ser His Phe Ala Arg Asp Asn Arg Thr Val Ile Ala Leu Thr Gly
145                 150                 155                 160
Glu Ile Asp Thr Val Gly Asp Gly Lys Arg Met Ala Arg Val Ala Asn
                165                 170                 175
Gly His Pro Trp Met Gly Lys Ile Thr Ala Met Gly Cys Ala Gly Ala
            180                 185                 190
Ala Leu Ala Ala Ala Cys Leu Ala Val Glu Ser Glu Pro Trp Leu Ala
        195                 200                 205
Val Ala Ala Gly Gln Ile Ile Val Gly Val Ala Gly Glu Val Ala Ala
    210                 215                 220
Glu Gln Ala Arg Gly Pro Gly Ser Leu Ala Val Ala Ile Leu Asp Thr
225                 230                 235                 240
Leu Asn Ser Leu Asp Arg Ala Thr Leu Met Ala Arg Ala Lys Val Thr
                245                 250                 255

<210> 361
<211> 768
<212> DNA
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 361
atgcaagcgc gtgacgatga tgtcagaagc gcggctgccg aagtcctcga acggttgcgg     60
gcgcggcgcc ctcgggtcca ctgcatcacg aatgcggtcg cgcaggcctt tacggcaaat    120
cttctgcttg ccgttggcgc ggttccatcg atgaccattt cgccggaaga ggtcggcgat    180
ttcgtcgcgc gtgccgatgc gctgctcgtc aatctcggca cgctggaaaa gtcgcggcgt    240
gccgcgatcg gaatcgctgt ggacactgca aacggggagc atgttccgtg gctggtcgat    300
ccggtgttcg ttgaccgctc gccattacgc gccagtttcg cgcaggccct gatcctgttg    360
cggccgcatg cggtgcggct gaacggcgac gaattcgctg cgctcgctgg agccgaaggc    420
gcgactgaag cggtctcgca tttcgcccgc gacaaccgaa cggtgatcgc gctcaccgga    480
gagatcgata ccgtcggcga tggcaagcgg atggcgcggg ttgcaaacgg ccacccctgg    540
atgggcaaga tcacggcaat gggctgcgcc ggcgcggcgt tggcggcggc ctgtcttgcg    600
gtagagagtg agccgtggct ggcggtcgcc gccggccaga ttatcgtcgg tgtcgccggc    660
gaggtggcgg ccgagcaggc gcgcggcccg ggcagtctcg cggtggcgat cctggatacg    720
ttgaacagcc tcgaccgggc gacgctgatg gcgcgggcga aggtgacg                 768

<210> 362
<211> 777
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 362
atgcaggcac gtgatgatga tgttcgtagt gcagccgccg aagttctgga acgtctgcgt     60
gcacgtcgtc cgcgcgttca ttgcattacc aatgccgttg cccaggcctt caccgccaat    120
ctgctgctgg cagttggtgc cgtgccgagc atgaccatta gcccggaaga agtgggtgac    180
ttcgttgccc gcgccgatgc actgctggtt aatctgggta ccctggaaaa aagtcgtcgc    240
gccgccattg gtattgcagt tgataccgcc aatggtgaac atgttccgtg gctggttgat    300
ccggtgttcg tggatcgcag tccgctgcgc gccagcttcg cccaagcact gattctgctg    360
cgcccgcatg cagtgcgcct gaatggtgat gagttcgccg cactggccgg cgcagaaggc    420
gcaacagaag ccgttagtca cttcgcccgt gataatcgta ccgttattgc cctgaccggc    480
gaaattgata ccgtgggtga tggtaaacgc atggcacgtg ttgcaaatgg ccatccgtgg    540
atgggcaaaa ttaccgccat gggctgtgca ggtgcagccc tggccgcagc ttgcctggca    600
gtggaaagcg aaccgtggct ggcagtggca gccggccaga ttattgtggg cgtggccggc    660
gaagttgcag cagaacaggc ccgcggtccg ggcagtctgg ctgttgcaat tctggatacc    720
ctgaatagcc tggatcgcgc caccctgatg gcacgcgcca aagtgaccta aggatcc       777

<210> 363
<211> 239
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 363
Met Glu Glu Glu Glu Glu Ala Gly Ala Ser Leu Met Thr Phe Pro Ser
1               5                   10                  15
Ser Ala Ser Pro Ser Ser Ser Ser Ser Arg Pro Ala Gly Pro Ser Glu
            20                  25                  30
Val Ser Gly Ala Asp Leu Leu His Glu Arg His Leu Gly Ala Gln Arg
        35                  40                  45
Leu Glu Arg Val Gln Cys Leu Thr Asn Ile Val Val Ala Gly Trp Ser
    50                  55                  60
Ala Asn Val Leu Leu Ala Ala Gly Ala Ala Pro Ala Met Val Asp Asn
65                  70                  75                  80
Pro Arg Glu Ala Gly Val Leu Gly Gly Val Ala Gly Gly Val Leu Val
                85                  90                  95
Asn Leu Gly Thr Pro Tyr Glu Glu Thr Val Lys Ala Met Tyr Ala Ala
            100                 105                 110
Ala Arg Ala Ala Arg Ala Gly Arg Arg Pro Trp Val Leu Asp Pro Val
        115                 120                 125
Ala Ala Gly Ala Leu Pro Trp Arg Thr Glu His Ala Val Ala Leu Leu
    130                 135                 140
Asp Glu Gln Pro Thr Val Val Arg Gly Asn Ala Ser Glu Val Leu Ala
145                 150                 155                 160
Pro Gly Glu Gly Glu Gly Gly Lys Gly Val Asp Ser Thr Asp Ser Pro
                165                 170                 175
Glu Ser Val Leu Glu Ala Ala Gln Gly Leu Ala Glu Arg Leu Gly Cys
            180                 185                 190
Val Val Ala Val Ser Gly Ala Val Asp His Val Thr Asp Gly Arg Arg
        195                 200                 205
Leu Val Arg Val His Asn Gly His Glu Trp Leu Thr Lys Val Thr Gly
    210                 215                 220
Val Gly Cys Ser Leu Gly Ala Leu Val Ala Gly Tyr Ala Ala Val
225                 230                 235

<210> 364

<400> 364
000

<210> 365
<211> 726
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 365
atggaagaag aagaagaggc cggtgcaagt ctgatgacct tcccgagtag tgccagtccg     60
agtagtagca gcagccgtcc ggcaggtccg agtgaagtga gtggcgcaga cttactgcat    120
gaacgtcatc tgggtgcaca gcgtctggaa cgtgtgcagt gcctgaccaa tattgttgtt    180
gcaggctgga gcgccaatgt gctgctggcc gcaggtgcag caccggcaat ggttgataat    240
ccgcgtgaag caggtgtgct gggtggtgtg gccggtggtg tgctggttaa tctgggtacc    300
ccgtatgaag aaaccgtgaa agccatgtat gccgccgcac gtgccgcccg tgccggtcgt    360
cgtccttggg ttctggaccc tgttgccgca ggcgcactgc cgtggcgtac cgaacatgca    420
gtggcactgc tggatgaaca gccgaccgtt gttcgtggca atgcaagcga agtgctggcc    480
ccgggcgaag gcgaaggcgg taaaggtgtg gatagtaccg atagtccgga aagtgttctg    540
gaagcagccc agggtctggc cgaacgcctg ggttgtgtgg ttgcagttag cggtgccgtg    600
gatcatgtta ccgatggccg ccgtctggtt cgcgttcata atggtcatga atggctgacc    660
aaagttaccg gcgttggttg tagcctgggc gcactggttg ccggttatgc agccgtgtaa    720
ggatcc                                                               726

<210> 366
<211> 296
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 366
Met Arg Cys Pro Pro Ser Ala Asp Pro Leu Arg Pro Leu Pro Trp Arg
1               5                   10                  15
Thr Thr Thr Met His Ala Asn Pro Ile Asp Ala Ala Ala Leu Trp Ala
            20                  25                  30
Asp Leu Gln Ala Val Arg Arg Gln Ala Pro Leu Val His Asn Ile Thr
        35                  40                  45
Asn Phe Val Val Met Asn Tyr Ser Ala Asn Ala Leu Leu Ala Val Gly
    50                  55                  60
Ala Ser Pro Val Met Ala His Ala Lys Glu Glu Val Arg Asp Met Val
65                  70                  75                  80
Gly Ile Ala Gln Ala Leu Val Leu Asn Ile Gly Thr Leu Gln Pro Glu
                85                  90                  95
Trp Met Glu Ala Met Gln Leu Ala Leu Ala Ala Ala Arg Ala Arg Gly
            100                 105                 110
Val Pro Val Val Leu Asp Pro Val Gly Ala Gly Ala Thr Pro Tyr Arg
        115                 120                 125
Asn Thr Ala Ile Ala Glu Leu Leu Arg Cys Gly Ala Pro Ser Val Val
    130                 135                 140
Arg Gly Asn Ala Ser Glu Val Met Ser Val Ala Gly Leu Ser Ala Ala
145                 150                 155                 160
Thr Arg Gly Val Asp Ser Ala Ala Ala Ser Gly Glu Ala Leu Asp Ala
                165                 170                 175
Ala Arg Ala Leu Ala Arg Gly Leu Gln Ala Thr Val Cys Ile Ser Gly
            180                 185                 190
Ala Asp Asp His Val Val Asp Ala Gly Ala Arg Trp Ala Thr Leu Ser
        195                 200                 205
Asn Gly His Pro Trp Met Thr Arg Val Thr Gly Val Gly Cys Ser Ala
    210                 215                 220
Ser Ala Leu Val Gly Ala Phe Cys Ala Val Gln Pro Asp Arg Trp Arg
225                 230                 235                 240
Ala Thr Val Ala Ala Met Ala Val Leu Gly Val Ala Gly Glu Met Ala
                245                 250                 255
Ala Glu Arg Thr Gln Ala Ala Gly Gly Ser Val Gly Arg Leu Gln Ile
            260                 265                 270
Glu Leu Leu Asp Gly Leu Gln Leu Leu Asp Glu Pro Ser Phe Ala Ala
        275                 280                 285
Arg Leu Lys Leu Arg Thr His Asp
    290                 295

<210> 367
<211> 888
<212> DNA
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 367
atgcgctgcc caccgtcggc cgacccgctg cgcccactgc cgtggaggac gacgacgatg     60
cacgcgaacc ccatcgacgc tgcagccctg tgggccgatc tgcaagccgt gcgccggcaa    120
gcgccgctgg tgcacaacat caccaacttc gtggtcatga actacagcgc caatgcgctg    180
ctggccgtgg gcgcgtcgcc ggtgatggcg cacgccaagg aagaagtgcg cgacatggtc    240
ggcatcgcgc aggcgctggt gctgaacatc ggcacgctgc agcccgagtg gatggaagcg    300
atgcagctgg cgctggccgc cgcccgcgcg cgcggcgtgc cggtggtgct ggacccggtg    360
ggtgccggcg ccacgcccta ccgcaatacc gccatcgccg agctgttgcg atgcggcgcg    420
cccagcgtgg tgcgcggcaa cgcctcggag gtgatgagcg tggccggact gagcgcagcg    480
acccgcggcg tcgacagcgc cgccgcgtcg ggcgaggcgc tcgatgcagc gcgtgcgctg    540
gcgcgcggcc tgcaagccac cgtgtgcatc agcggcgccg acgaccacgt ggtcgatgcc    600
ggcgctcgct gggccacgct gtccaacggc cacccgtgga tgacccgcgt caccggcgtg    660
ggctgctcgg ccagtgcgct ggtcggtgcc ttctgtgcgg tgcagccgga ccgctggcgc    720
gccaccgttg cggccatggc cgtgctgggt gtcgccggcg agatggcggc cgagcgcacc    780
caggccgccg gcggcagtgt cggccggctg cagatcgagc tgctcgacgg gctgcagctg    840
ctcgacgaac ccagcttcgc cgcgcggctg aagctgcgca cccatgac                 888

<210> 368
<211> 897
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 368
atgcgttgtc cgccgagcgc agatccgctg cgccctcttc cgtggcgtac caccaccatg     60
catgccaatc cgattgatgc agcagccctg tgggccgatc tgcaagcagt tcgccgccag    120
gcaccgctgg tgcataatat taccaacttc gttgttatga actacagtgc aaatgcactg    180
ctggccgttg gcgccagtcc ggttatggca catgccaaag aagaagttcg cgatatggtt    240
ggcattgcac aggccctggt gctgaatatt ggtaccctgc aaccggaatg gatggaagca    300
atgcagctgg cactggccgc cgcccgtgct agaggtgttc cggttgttct ggaccctgtg    360
ggcgccggtg ccacccctta tcgtaatacc gcaattgccg aactgctgcg ctgtggtgcc    420
ccgagtgtgg ttcgcggcaa tgcaagcgaa gttatgagtg tggcaggtct gagtgcagca    480
acccgtggtg ttgatagtgc cgcagcaagc ggcgaagccc tggatgcagc ccgtgcactg    540
gcacgtggcc tgcaagcaac cgtgtgtatt agcggcgcag atgatcatgt ggtggatgca    600
ggtgcccgct gggccaccct gagtaatggt catccgtgga tgacccgtgt taccggcgtg    660
ggttgcagtg caagcgcact ggttggtgca ttctgtgcag ttcagccgga tcgctggcgc    720
gccaccgtgg cagcaatggc agttctgggc gtggcaggcg aaatggcagc cgaacgcacc    780
caggccgccg gtggtagtgt gggtcgtctg caaattgaac tgctggatgg tctgcaactg    840
ctggatgaac cgagcttcgc cgcccgtctg aaactgcgta cccatgatta aggatcc       897

<210> 369
<211> 225
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 369
Met Gly Thr Leu Thr Pro Ala Ala Leu Asp Ala Met Arg Leu Ala Gly
1               5                   10                  15
Arg Glu Ala Asn Arg Cys Gly Val Pro Val Val Leu Asp Pro Val Gly
            20                  25                  30
Ala Gly Ala Thr Pro Tyr Arg Thr Glu Ala Ala Arg Gln Leu Leu Glu
        35                  40                  45
Asp Ile Asn Val Ala Ile Val Arg Gly Asn Ser Gly Glu Val Ala Ala
    50                  55                  60
Ile Ile Gly Gln Gln Ala Val Val Arg Gly Val Glu Ser Leu Glu Thr
65                  70                  75                  80
Ala Leu Pro Ala Ala Glu Leu Gly Ala Gln Ala Ala Gln Gln Leu Gly
                85                  90                  95
Val Val Val Ala Leu Thr Gly Ala Arg Asp Ile Ile Ser Asp Gly Ser
            100                 105                 110
Val Ser Leu Ala Val Asp His Gly Ser Pro Trp Leu Lys Thr Ile Thr
        115                 120                 125
Gly Ser Gly Cys Met Ala Ser Ala Ala Val Ala Cys Phe Ala Ala Val
    130                 135                 140
Ala Pro Ser Ser Leu Gln Ala Ala Ala Ala Ala Leu Ala Ala Tyr Gly
145                 150                 155                 160
Leu Ala Ala Glu Leu Ala His Lys Pro Gln Ile His Gly Pro Ala Ser
                165                 170                 175
Phe Lys Val Ala Leu Leu Asp Ala Val Tyr Gly Leu Thr Ala Glu Thr
            180                 185                 190
Leu Gln Arg Ala Lys Ala Val Ser Tyr Thr His Leu Thr Leu Pro Glu
        195                 200                 205
Thr Lys Thr Asp Leu Val Phe Arg Leu Leu Leu Glu Lys Lys Lys Leu
    210                 215                 220
Thr
225

<210> 370

<400> 370
000

<210> 371
<211> 684
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 371
atgggtaccc tgaccccggc agcactggat gccatgcgcc tggccggtcg tgaagcaaat     60
cgttgtggtg tgccggtggt gctggaccct gtgggtgcag gcgcaacccc gtatcgcacc    120
gaagcagccc gccagctgct ggaagatatt aatgttgcca ttgtgcgtgg caatagcggt    180
gaagtggcag caattattgg ccagcaggca gttgttcgtg gtgtggaaag cctggaaacc    240
gccctgccgg cagccgaact gggtgctcag gccgcacagc agctgggcgt ggtggtggct    300
ctgaccggcg ctcgtgatat tattagtgat ggtagtgtta gcctggcagt ggatcatggt    360
agtccgtggc tgaaaaccat taccggtagc ggctgtatgg ccagcgcagc cgttgcctgc    420
ttcgccgcag tggcccctag tagcctgcaa gcagccgcag ccgcactggc cgcatacggt    480
ctggcagccg aattagcaca taaaccgcag attcatggtc cggccagctt caaagttgcc    540
ctgctggatg ccgtgtatgg cctgaccgca gaaaccttac agcgtgccaa agcagttagc    600
tatacccatc tgaccctgcc ggaaaccaaa accgatctgg tgttccgcct gctgctggaa    660
aaaaaaaaac tgacctaagg atcc                                           684

<210> 372
<211> 193
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 372
Met Ile Asp Glu Ile Glu Lys Ser Leu Tyr Phe Leu Arg Lys Lys Lys
1               5                   10                  15
Pro Ile Val Leu Cys Leu Thr Asn Phe Val Thr Met Glu Phe Val Ala
            20                  25                  30
Asn Ser Leu Leu Ser Leu Gly Ala Ala Pro Ile Thr Ser Val Ser Glu
        35                  40                  45
Glu Glu Leu Ala Glu Leu Val Gln Ile Ala Ser Ser Val Tyr Ile Asn
    50                  55                  60
Ile Gly Thr Leu Asp Asp Lys Phe Ile Asn Leu Thr Lys Lys Ala Ile
65                  70                  75                  80
Glu Leu Ala Gln Glu Phe Asp Lys Pro Ile Ile Leu Asp Pro Val Gly
                85                  90                  95
Ala Gly Ala Thr Lys Val Arg Thr Gln Thr Ala Lys Tyr Ile Leu Pro
            100                 105                 110
Phe Ser Ser Ile Ile Arg Gly Asn Ala Ser Glu Ile Ile Ala Leu Gly
        115                 120                 125
Gln Asn Ala Gln Leu Thr Tyr Gly Val Glu Thr Ser Asn Thr Thr Asp
    130                 135                 140
Glu Ala Glu Glu Ile Ala Thr Arg Ile Ala Leu Glu Asn Asn Thr Thr
145                 150                 155                 160
Ile Ile Thr Ser Gly Pro Val Asp Tyr Ile Thr Asn Gly Asn His Ser
                165                 170                 175
Val Gln Val Pro Phe Gly Ser Ser Leu Met Gln Leu Val Thr Gly Met
            180                 185                 190
Gly

<210> 373

<400> 373
000

<210> 374
<211> 588
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 374
atgatcgatg aaatcgaaaa gagtctgtac ttcctgcgca aaaaaaaacc gattgttctg     60
tgcctgacca acttcgtgac catggagttc gtggccaata gcctgctgag cctgggtgcc    120
gcaccgatta ccagtgtgag tgaagaagaa ctggcagaac tggtgcagat tgcaagtagt    180
gtgtatatta atatcggcac cctggatgat aaattcatta atctgaccaa aaaggccatt    240
gaactggccc aggagttcga taaaccgatt attctggacc ctgttggtgc cggtgcaacc    300
aaagtgcgta cccagaccgc aaaatatatt ctgccgttca gtagcattat tcgcggcaat    360
gcaagtgaaa ttattgcact gggccagaat gcacagctga cctatggtgt tgaaaccagt    420
aataccaccg atgaagcaga agaaattgca acccgtattg ccctggaaaa taataccacc    480
attattacca gcggcccggt tgattatatt accaatggca atcatagtgt tcaggttccg    540
ttcggcagca gcctgatgca gctggttacc ggcatgggtt aaggatcc                 588

<210> 375
<211> 265
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 375
Met Ile Leu Asp Ala Ala Glu Val Leu Glu Arg Val Arg Arg Asp Lys
1               5                   10                  15
Pro Leu Val His His Leu Thr Asn Leu Val Thr Ile Tyr Asp Cys Ala
            20                  25                  30
Asn Ile Val Lys Val Phe Gly Ala Ser Pro Val Met Ala His Ala Arg
        35                  40                  45
Glu Glu Val Ala Asp Met Ala Lys Ile Ala Ser Ala Leu Val Leu Asn
    50                  55                  60
Ile Gly Thr Leu Thr Thr Glu Phe Val Gln Ala Met Leu Ile Ala Gly
65                  70                  75                  80
Lys Ser Ala Asn Glu Lys Gly Ile Pro Val Val Phe Asp Val Cys Gly
                85                  90                  95
Ala Gly Ala Thr Lys Phe Arg Asp Asp Lys Cys Leu Glu Ile Leu Asp
            100                 105                 110
Ala Val Asp Ile Ser Ile Ile Lys Gly Asn Ser Ser Glu Val Ala Arg
        115                 120                 125
Ile Ala Gly Glu Asp Val Lys Thr Arg Gly Val Asp Ala Ala Asp Ile
    130                 135                 140
Glu Ala Asn Leu Leu Glu Val Ala Gly Ser Leu Ala Glu Lys Arg Glu
145                 150                 155                 160
Cys Thr Val Val Ile Thr Gly Lys Asp Asp Ile Val Ala Asp Arg Lys
                165                 170                 175
Arg Ala Val Trp Val His Asn Gly His Pro Met Met Ala Asn Val Val
            180                 185                 190
Gly Thr Gly Cys Met Ala Ala Ser Val Ile Gly Ala Phe Ala Ala Val
        195                 200                 205
Glu Lys Asp Gln Leu Ala Ala Ser Val Ser Gly Leu Val Cys Tyr Glu
    210                 215                 220
Val Ala Ala Glu Val Ala Ala Leu Val Ser Ala Gly Pro Gly Ser Phe
225                 230                 235                 240
Lys Glu Asn Leu Tyr Asp Ala Val Tyr Asn Leu Asp Ser Gly Thr Ile
                245                 250                 255
Arg Ser Met Gln Lys Ile Glu Phe Glu
            260                 265

<210> 376

<400> 376
000

<210> 377
<211> 804
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 377
atgatcctgg atgcagcaga agtgctggaa cgtgttcgcc gtgataaacc gctggtgcat     60
catctgacca atctggttac catctatgat tgtgccaata ttgttaaagt gttcggcgca    120
agcccggtga tggcacatgc acgcgaagaa gtggccgata tggcaaaaat tgccagtgca    180
ctggtgctga atattggtac cctgaccacc gagttcgttc aggccatgct gattgccggc    240
aaaagtgcaa atgaaaaagg cattccggtt gtgttcgatg tgtgtggtgc cggtgccacc    300
aaattccgtg atgataaatg cctggaaatt ctggatgccg tggatattag cattattaaa    360
ggtaatagca gcgaagtggc ccgtattgcc ggtgaagatg tgaaaaccag aggtgtggat    420
gccgccgata ttgaagcaaa tctgctggaa gtggcaggca gcctggcaga aaaacgcgaa    480
tgtaccgttg ttattaccgg caaagatgat attgttgcag atcgtaaacg cgccgtgtgg    540
gttcataatg gccatccgat gatggcaaat gtggttggca ccggttgtat ggcagcaagc    600
gtgattggcg cattcgccgc agttgaaaaa gatcagctgg cagccagcgt gagcggcctg    660
gtgtgttatg aagttgccgc cgaagttgca gccctggtta gcgccggccc gggttcattc    720
aaagaaaatc tgtatgatgc cgtgtataat ctggatagtg gcaccattcg cagcatgcag    780
aaaattgagt tcgaataagg atcc                                           804

<210> 378
<211> 272
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 378
Met Asp Ala Gln Thr Ala Ala Gln His Leu Ser Lys Val Arg Asp Gln
1               5                   10                  15
Asn Pro Leu Val His Ser Ile Thr Asn Asn Val Val Thr Asn Phe Thr
            20                  25                  30
Ala Asn Gly Leu Leu Ala Leu Gly Ala Ser Pro Val Met Ala Tyr Ala
        35                  40                  45
Ile Glu Glu Ala Ala Asp Met Ala Lys Ile Ala Gly Ala Leu Val Leu
    50                  55                  60
Asn Ile Gly Thr Leu Ser Ser Ala Ser Val Glu Ala Met Ile Ala Ala
65                  70                  75                  80
Gly Lys Ser Ala Asn Glu Asn Gly Val Pro Val Ile Phe Asp Pro Val
                85                  90                  95
Gly Ala Gly Ala Thr Pro Phe Arg Thr Ala Ser Ala Arg Lys Ile Ile
            100                 105                 110
Gln Glu Val Arg Leu Ser Val Ile Arg Gly Asn Ala Ala Glu Ile Ala
        115                 120                 125
Asn Ile Ala Gly Ala Pro Asp Trp Lys Ile Lys Gly Val Asp Ala Gly
    130                 135                 140
Glu Ala Gly Gly Asp Val Val Gln Leu Ala Lys Thr Ala Ala Ser Arg
145                 150                 155                 160
Met Lys Thr Val Ile Ala Ile Thr Gly Lys Thr Asp Val Ile Ser Asp
                165                 170                 175
Gly Thr Asp Thr Tyr Ala Val His Asn Gly Asp Lys Leu Leu Thr Lys
            180                 185                 190
Val Thr Gly Ala Gly Cys Leu Leu Thr Ser Val Ile Gly Ala Phe Cys
        195                 200                 205
Ala Ala Glu Lys Asp Val Leu Gln Ala Ala Val Ser Ala Val Ser Val
    210                 215                 220
Tyr Gly Ser Ala Ala Gln Leu Ala Ala Leu Asp Thr Ser His Lys Gly
225                 230                 235                 240
Pro Gly Ser Phe Gln Ile Glu Leu Leu Asn Lys Leu Ala Ser Ile Ser
                245                 250                 255
Glu Gln Glu Ala Ala Glu Leu Ala Ala Ile Glu Arg Val Thr Glu Lys
            260                 265                 270

<210> 379
<211> 816
<212> DNA
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 379
atggacgcac aaaccgcagc tcaacacttg agtaaagtgc gggatcaaaa cccgctcgtt     60
cacagtatca ccaataatgt cgtcacgaac ttcaccgcta acggactgct cgcgctcggc    120
gcatcgcccg ttatggctta tgcaattgag gaggccgccg acatggcgaa aatcgcagga    180
gcgctcgtcc tgaatatagg caccctcagt tcggcgtcgg ttgaagctat gatcgcagcc    240
ggaaaatcgg ccaatgaaaa cggtgttccg gtcatttttg atccggtagg cgccggcgcc    300
acgccgtttc ggacggcatc agcccgcaag atcatacagg aggtccgctt gtccgtcatt    360
cgcggaaatg ccgcggaaat cgccaatatt gccggcgccc ctgattggaa gataaaaggc    420
gtggatgcgg gggaagccgg aggagatgtg gttcagctcg ctaagacggc tgcaagcagg    480
atgaagacgg tcattgcgat taccggaaaa accgacgtca tctcggacgg caccgacaca    540
tacgccgtac ataacggtga taaactgctg acgaaggtga cgggggcggg atgcctgctg    600
acgtctgtaa tcggagcctt ctgcgcagcg gagaaggacg ttctgcaagc ggccgtttca    660
gccgtttctg tatacggaag cgccgctcag cttgccgctc tggacacctc ccataagggg    720
ccgggcagct tccagattga gcttttaaac aagctggcaa gcatctcaga acaagaagcg    780
gccgagcttg ctgcaattga aagggtgaca gaaaaa                              816

<210> 380
<211> 819
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 380
atggatgcac agaccgccgc ccagcatctg agtaaagtgc gtgatcagaa tccgctggtt     60
catagcatta ccaataatgt tgtgaccaac ttcaccgcca atggcctgct ggcactgggt    120
gcaagtccgg tgatggccta tgccattgaa gaagccgccg atatggcaaa aattgccggt    180
gccctggtgc tgaatattgg taccctgagc agcgccagtg ttgaagccat gattgccgca    240
ggcaaaagcg caaatgaaaa tggcgttccg gttatcttcg atccggtggg cgcaggtgca    300
accccgttcc gtaccgcaag tgcccgtaaa attattcagg aagttcgtct gagcgtgatt    360
cgtggcaatg ccgcagaaat tgcaaatatt gccggcgcac cggattggaa aattaaaggt    420
gtggatgcag gtgaagcagg cggtgatgtg gtgcagctgg ccaaaaccgc agccagccgt    480
atgaaaaccg tgattgccat taccggtaaa accgatgtta ttagtgatgg caccgatacc    540
tatgccgttc ataatggcga taaactgctg accaaagtga ccggtgccgg ttgcctgctg    600
accagcgtta ttggtgcatt ctgcgccgca gaaaaagatg ttctgcaagc agccgttagc    660
gcagtgagcg tgtatggtag cgcagcacag ctggccgcac tggataccag tcataaaggc    720
ccgggcagct tccagattga actgctgaat aaactggcca gcattagtga acaggaagca    780
gccgaactgg cagcaattga acgtgttacc gaaaaataa                           819

<210> 381
<211> 249
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 381
Met His Gly Asn Glu Ile Gly Asp Val Ser Met Asp Ile Ala Ala Asn
1               5                   10                  15
Ala Leu Leu Ala Leu Gly Ala Ser Pro Ala Met Ile His Ala Pro Glu
            20                  25                  30
Glu Thr Asp Glu Phe Thr Ala Met Ala Asp Ala Leu Val Ile Asn Val
        35                  40                  45
Gly Thr Leu Ser Ala Gln Ala Ala Gln Gly Met Glu Arg Ala Ala Ala
    50                  55                  60
Ala Ala Arg Leu His Gly Lys Pro Trp Leu Leu Asp Pro Val Ala Ala
65                  70                  75                  80
Gly Leu Leu Thr Phe Arg Asp Asp Thr Ile Arg Lys Leu Leu Arg His
                85                  90                  95
Arg Pro Ser Leu Ile Arg Gly Asn Ala Ser Glu Ile Met Ala Val Ala
            100                 105                 110
Arg Ile Ala Gly Leu Thr Gln Asp Ala Ala Ala Pro Arg Gly Val Asp
        115                 120                 125
Asn Arg His Gln Ala Ser Asp Ala Glu Ala Leu Ala Val Lys Leu Ala
    130                 135                 140
Arg His Cys Phe Cys Ala Val Val Ala Thr Gly Ala Val Asp Val Val
145                 150                 155                 160
Ser Asp Gly Glu Thr Thr Val Arg Ile Ala Asn Gly Ser Pro Leu Ala
                165                 170                 175
Pro Arg Val Thr Ala Leu Gly Cys Ser Met Ser Ser Val Met Gly Ala
            180                 185                 190
Tyr Leu Ala Leu Thr Gly Pro Phe Glu Ala Ala Leu Ala Thr Thr Ile
        195                 200                 205
Leu Tyr Gly Val Ala Gly Asp Ile Ala Ala Glu Gly Ala Arg Gly Pro
    210                 215                 220
Ala Ser Phe Arg Thr Ala Phe Leu Asp Thr Leu Tyr Ser Ile Ala Arg
225                 230                 235                 240
Ser Glu Leu Ala Arg Arg Val Arg Ser
                245

<210> 382

<400> 382
000

<210> 383
<211> 750
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 383
atgcatggca atgaaattgg cgatgttagt atggatattg cagccaatgc actgctggca     60
ctgggtgcca gtccggccat gattcatgcc ccggaagaaa ccgatgagtt caccgccatg    120
gccgatgcac tggttattaa tgttggcacc ctgagcgccc aggcagccca gggtatggaa    180
cgcgcagcag ccgccgcacg cctgcatggt aaaccgtggc tgctggaccc tgttgcagca    240
ggcctgctga ccttccgcga tgataccatt cgtaaactgc tgcgccatcg cccgagcctg    300
attcgcggta atgcaagtga aattatggca gttgcacgca ttgccggcct gacccaggat    360
gcagcagccc ctcgcggtgt ggataatcgt catcaggcaa gtgatgccga agccctggcc    420
gtgaaactgg cacgtcattg cttctgtgca gttgtggcca ccggcgccgt tgatgttgtt    480
agtgatggtg aaaccaccgt gcgcattgca aatggtagtc cgctggcccc gcgtgttacc    540
gcactgggtt gtagtatgag tagcgttatg ggtgcctatc tggccctgac cggcccgttc    600
gaagccgctc tggcaaccac cattctgtat ggtgtggccg gcgatattgc cgcagaaggc    660
gcccgtggcc cggcatcatt ccgcacagca ttcctggata ccctgtatag cattgcccgt    720
agtgaactgg cacgccgtgt tcgtagttaa                                     750

<210> 384
<211> 221
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 384
Met Ile Arg Asp Ser Asp Glu Ala Gly Val Phe Ala Pro Val Ala Ser
1               5                   10                  15
Ala Val Leu Ile Asn Leu Gly Thr Leu Gly Gly Arg Gln Pro Glu Ala
            20                  25                  30
Met Arg Glu Ala Ala Arg Ala Ala Arg Leu Ala Gly Thr Pro Trp Val
        35                  40                  45
Leu Asp Pro Val Ala Val Gly Ala Leu Pro Val Arg Thr Ala Leu Ala
    50                  55                  60
Ala Glu Leu Leu Ser Ala Arg Pro Thr Ile Val Arg Gly Asn Ala Ser
65                  70                  75                  80
Glu Ile Ala Ala Val Ala Gly Ala Gly Ala Ala Gly Arg Gly Val Asp
                85                  90                  95
Ala Thr Gln Thr Val Asp Ala Ala Ala Ala Pro Ala Val Glu Leu Ala
            100                 105                 110
Thr Arg Leu Asp Ala Val Val Ala Val Ser Gly Pro Val Asp Leu Val
        115                 120                 125
Thr Asp Gly Arg Thr Arg Ile Arg Val Ala Gly Gly His Glu Leu Leu
    130                 135                 140
Thr Arg Val Thr Gly Ala Gly Cys Ala Leu Gly Ala Val Leu Gly Ala
145                 150                 155                 160
Phe Ala Ala Val Ala Asp Asp Pro Leu Thr Ala Ala Val Ala Ala His
                165                 170                 175
Cys Val Tyr Ala Val Ala Ala Glu Asp Ala Ala Ala Gly Ser Pro Gly
            180                 185                 190
Pro Gly Ser Phe Ala Val Ala Leu Leu Asp Ser Leu Ala Ala Val Ser
        195                 200                 205
Ala Asp Asp Val Ala Ala Arg Ala Arg Leu Glu Val Val
    210                 215                 220

<210> 385

<400> 385
000

<210> 386
<211> 666
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 386
atgatccgtg atagtgatga agccggcgtg ttcgcaccgg ttgcaagcgc agttctgatt     60
aatctgggta ccctgggtgg ccgccagccg gaagctatgc gtgaagcagc acgcgccgcc    120
cgtctggcag gtacaccttg ggttctggac cctgtggcag ttggtgccct gccggttcgc    180
accgcactgg cagctgaact gctgagcgcc cgtccgacca ttgtgcgtgg taatgcaagt    240
gaaattgcag ccgtggcagg cgcaggtgcc gcaggtagag gtgttgatgc cacccagacc    300
gttgatgcag cagccgcccc ggccgttgaa ctggctaccc gtctggatgc cgttgtggca    360
gttagtggtc cggttgatct ggtgaccgat ggtcgcaccc gtattcgcgt ggcaggtggc    420
catgaactgc tgacccgcgt taccggcgcc ggctgtgcac tgggtgccgt tctgggtgcc    480
ttcgcagcag ttgccgatga tccgctgacc gccgccgttg ccgcacattg cgtgtatgcc    540
gtggcagccg aagatgccgc cgccggtagt ccgggtccgg gtagcttcgc cgttgcactg    600
ctggatagtc tggcagccgt tagcgccgat gatgttgcag cccgcgcacg tctggaagtt    660
gtttaa                                                               666

<210> 387
<211> 205
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 387
Met Pro Thr Pro Ser Val Leu Phe Thr Arg Met Arg Asp Thr Ala Pro
1               5                   10                  15
Leu Val Gln Cys Ile Thr Asn Tyr Val Ala Met Asn Val Ala Ala Asn
            20                  25                  30
Val Leu Leu Ala Ala Gly Ala Ser Pro Ala Met Val His Ala Glu Glu
        35                  40                  45
Glu Ala Gly Glu Phe Ala Ala Leu Ala Gly Ala Leu Thr Val Asn Ile
    50                  55                  60
Gly Thr Leu Ser Pro Ala Trp Ile Gly Gly Met Lys Ala Ala Val Gly
65                  70                  75                  80
Gly Ala Ala Ser Ser Gly Arg Pro Trp Val Leu Asp Pro Val Ala His
                85                  90                  95
Phe Ala Thr Gly Leu Arg Arg Gln Ala Val Ala Glu Leu Leu Glu Leu
            100                 105                 110
Arg Pro Thr Ile Ile Arg Gly Asn Ala Ser Glu Ile Ile Ala Leu Ala
        115                 120                 125
Gly Gln Ala Thr Ala Gly Lys Gly Val Asp Ser Gly Asp Ser Val Ala
    130                 135                 140
Ala Ala Glu Ala Ala Ala Arg Ser Leu Ala Ala Lys His Ala Cys Thr
145                 150                 155                 160
Ile Ala Val Thr Gly Pro Val Asp Phe Val Thr Asp Gly Ala Arg Ser
                165                 170                 175
Ala Arg Ile Glu Gly Gly Ser Pro Leu Met Pro Leu Val Thr Ala Thr
            180                 185                 190
Gly Cys Ala Leu Thr Ala Leu Val Gly Ala Phe Ala Ala
        195                 200                 205

<210> 388
<211> 615
<212> DNA
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 388
atgccgacac cttcggtgct tttcacgcgg atgcgcgaca ccgccccgct ggtgcagtgc     60
atcaccaact acgtggcgat gaatgtcgcc gccaatgtgc tgctggctgc cggggcgtcg    120
cccgccatgg tgcatgccga ggaggaggcg ggcgagttcg cagccctcgc cggcgcgctg    180
accgtcaata tcggcacgct gtcgccggcc tggatcggcg gcatgaaggc ggcggtcggg    240
ggcgcggcat cctccggccg gccatgggtg ctcgatccgg tcgcgcattt cgcgaccggc    300
ctgcgccggc aggcggtcgc ggagctgctg gaactgcgcc cgacgatcat ccgcggcaac    360
gcctccgaga tcatcgccct cgccggtcag gccaccgccg gcaagggggt cgattccggc    420
gacagcgtcg ccgccgccga agccgccgcc cgcagccttg ccgccaagca tgcctgcacc    480
atcgcggtga ccggcccggt cgatttcgtc accgatggcg cccggtccgc ccggatcgag    540
ggcggctcgc ccctgatgcc gctggtgacc gccaccggct gcgccctgac cgcgctggtc    600
ggcgccttcg ccgcc                                                     615

<210> 389
<211> 618
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 389
atgccgaccc cgagcgtgct gttcacccgc atgcgcgata ccgccccgct ggttcagtgt     60
attaccaatt atgtggcaat gaatgtggcc gcaaatgtgc tgctggcagc cggcgcaagc    120
ccggcaatgg ttcatgcaga agaagaagca ggtgagttcg ccgcactggc cggtgccctg    180
accgttaata ttggcaccct gagcccggca tggattggcg gcatgaaagc cgcagttggc    240
ggtgcagcaa gcagcggccg cccttgggtt ctggaccctg ttgcacactt cgccaccggt    300
ctgcgccgcc aggcagtggc agaactgctg gaactgcgtc cgaccattat tcgcggcaat    360
gccagtgaaa ttattgccct ggccggccag gccaccgcag gtaaaggcgt tgatagtggt    420
gatagtgtgg cagcagccga agcagcagcc cgcagcctgg cagcaaaaca tgcctgtacc    480
attgcagtga ccggtccggt tgacttcgtg accgatggcg cacgtagcgc ccgcattgaa    540
ggcggcagcc cgctgatgcc gctggttacc gcaaccggct gtgccctgac cgcactggtg    600
ggtgcattcg cagcataa                                                  618

<210> 390
<211> 283
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 390
Met Arg Glu Val Pro Arg Gly Arg Arg Pro Leu His Ala Ala Gly Gly
1               5                   10                  15
Arg Glu Met Ser Pro Ala Thr Leu Leu Ala Ala Leu Arg Ala Glu Pro
            20                  25                  30
Pro Leu Val Gln Cys Ile Thr Asn Tyr Val Ala Met Asn Ile Ala Ala
        35                  40                  45
Asn Val Met Leu Ala Ala Gly Ala Ser Pro Ala Met Val Ser Asp Ala
    50                  55                  60
Glu Glu Ala Glu Glu Phe Ala Gly Ile Ala Gly Ala Leu Thr Val Asn
65                  70                  75                  80
Ile Gly Thr Leu Ser Ala Pro Phe Val Glu Gly Met Arg Ala Ala Ile
                85                  90                  95
Arg Gly Ala Gln Arg Ala Gly Arg Pro Trp Val Leu Asp Pro Val Ala
            100                 105                 110
Cys Gln Ala Thr Thr Tyr Arg Arg Arg Val Ser Ala Glu Leu Val Ala
        115                 120                 125
Leu Arg Pro Thr Ile Ile Arg Gly Asn Ala Ser Glu Val Leu Ser Leu
    130                 135                 140
Ala Gly Glu Ala Ser Arg Gly Gln Gly Val Asp Gly Arg Asp Ser Val
145                 150                 155                 160
Ala Ala Ala Glu Asp Gly Ala Arg Arg Leu Ala Gln Ala Ser Gly Ala
                165                 170                 175
Val Val Ala Val Thr Gly Glu Val Asp Phe Val Thr Asp Gly Arg Arg
            180                 185                 190
Gly Ala Arg Ile Glu Gly Gly Ser Pro Trp Met Pro Leu Asn Thr Ala
        195                 200                 205
Leu Gly Cys Ser Leu Thr Cys Leu Cys Gly Ala Tyr Ala Ala Val Gly
    210                 215                 220
Glu Asp Ala Phe Asp Ala Ala Val Ala Ala Leu Ala His Phe Ala Val
225                 230                 235                 240
Ala Gly Arg Trp Ala His Glu Gly Ala Glu Gly Pro Gly Ser Phe Ala
                245                 250                 255
Pro Arg Phe Leu Asp Ala Leu Arg Ala Val Thr Pro Arg Ala Leu Glu
            260                 265                 270
Ala Glu Ala Val Ile Arg Gly Ala Glu Val Val
        275                 280

<210> 391

<400> 391
000

<210> 392
<211> 852
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 392
atgcgcgaag tgccgcgtgg tcgccgccct cttcatgccg caggtggtcg cgaaatgagc     60
ccggcaaccc tgctggcagc cctgcgtgca gaaccgccgc tggtgcagtg tattaccaat    120
tatgtggcaa tgaatatcgc agcaaatgtg atgctggcag caggtgcaag tccggcaatg    180
gtgagcgatg ccgaagaagc cgaagagttc gcaggtattg ccggcgccct gaccgttaat    240
attggtaccc tgagtgcacc gttcgtggaa ggtatgcgtg cagccattcg cggcgcacag    300
cgcgcaggtc gtccgtgggt tctggaccct gttgcatgtc aggccaccac ctatcgccgc    360
cgtgttagcg ccgaactggt tgccctgcgc ccgaccatta ttcgcggtaa tgccagtgaa    420
gttctgagtc tggcaggcga agcaagccgc ggccagggtg tggatggccg tgatagcgtg    480
gccgcagccg aagatggtgc acgccgcctg gcacaggcca gcggtgcagt tgttgcagtt    540
accggtgaag ttgacttcgt gaccgatggt cgtcgcggtg cccgcattga aggcggcagt    600
ccgtggatgc cgctgaatac cgcactgggc tgctctctga cctgtctgtg cggtgcctat    660
gcagcagttg gtgaagatgc cttcgatgca gcagtggcag ccctggcaca cttcgcagtg    720
gccggtcgct gggcacatga aggtgcagaa ggcccgggca gcttcgcccc tcgcttctta    780
gatgcactgc gtgccgtgac cccgcgtgcc ttagaagcag aagccgtgat tcgtggcgca    840
gaagttgtgt aa                                                        852

<210> 393
<211> 261
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 393
Met Gln Glu Pro Leu Lys Lys Ile Lys Glu Thr Arg Pro Leu Ile His
1               5                   10                  15
His Ile Thr Asn Trp Val Thr Ile Tyr Asp Cys Ala Asn Ile Thr Arg
            20                  25                  30
Thr Phe Gly Ala Leu Pro Val Met Ala His Ala Pro Glu Glu Cys Ala
        35                  40                  45
Asp Met Thr Arg Ile Ser Ser Ala Leu Val Leu Asn Ile Gly Thr Leu
    50                  55                  60
Thr Asn Glu Leu Ile Asp Ala Met Ile Leu Ser Ala Lys Ala Ala Asn
65                  70                  75                  80
Glu Asn Lys Ile Pro Val Val Leu Asp Ala Val Gly Val Gly Ala Thr
                85                  90                  95
Lys Phe Arg Asp Tyr Met Ala Ser Lys Ile Ile Asp Ser Val His Val
            100                 105                 110
Asp Ile Ile Lys Gly Asn Tyr Ser Glu Ile Ala Lys Leu Ala Gly Glu
        115                 120                 125
Lys Ala Gln Thr Lys Gly Val Glu Ala Thr Ser Ile Asn Ala Asp Pro
    130                 135                 140
Arg Gln Ile Ala Arg Glu Leu Ala Ile Ser Lys Ser Cys Thr Ile Val
145                 150                 155                 160
Met Thr Gly Lys Glu Asp Ile Ile Ser Asn Gly Lys Lys Ile Phe Val
                165                 170                 175
Val Arg Asn Gly His Glu Leu Met Gly Ser Ile Val Gly Thr Gly Cys
            180                 185                 190
Met Ala Ala Ser Val Ile Gly Ser Phe Ala Ala Val Asn Thr Asp His
        195                 200                 205
Cys Asp Ala Ala Lys Asp Ala Leu Cys Tyr Phe Gly Ile Ala Gly Glu
    210                 215                 220
Leu Ala Ala Glu Ile Ser Arg Gly Pro Gly Ser Phe Lys Val Asn Leu
225                 230                 235                 240
Tyr Asp Glu Thr Phe Asp Leu Ser Asp Glu Arg Ala Glu Lys Met Met
                245                 250                 255
Asn Phe Glu Glu Tyr
            260

<210> 394
<211> 783
<212> DNA
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 394
atgcaggaac ctctaaaaaa aataaaggaa acaagaccac tgatacacca tatcacaaat     60
tgggttacta tatatgattg cgcgaatatt acacgcacct ttggtgctct tcctgtaatg    120
gcccatgcac cagaggaatg cgcggatatg acacgcattt catctgctct tgtgcttaac    180
ataggcaccc tgacaaatga gctgatcgat gcgatgatac tttctgcgaa agccgcgaac    240
gagaataaga tacctgtcgt actcgatgca gttggcgtgg gcgctacgaa gttcagagat    300
tacatggcct caaagatcat cgattcagtt catgtcgata tcatcaaagg gaactattcc    360
gagatagcaa aacttgcagg cgaaaaagct cagacaaaag gtgtcgaggc aacttctatc    420
aacgctgatc ccagacagat agcaagagaa cttgcaatat ctaagtcctg cactatagtc    480
atgacgggaa aagaggacat aatcagcaat ggtaaaaaga tattcgttgt caggaacgga    540
catgagctaa tggggtccat tgtaggaaca ggatgcatgg ccgcttcagt tataggatct    600
ttcgctgctg tcaatacaga ccattgtgat gctgcaaaag atgctctctg ttattttggg    660
atagcaggag aacttgcagc tgagatatcg cgtggtcctg ggagcttcaa ggtaaattta    720
tatgacgaaa catttgatct ttctgatgaa agggcagaaa agatgatgaa cttcgaagag    780
tac                                                                  783

<210> 395
<211> 786
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 395
atgcaggaac cgctgaaaaa aattaaagaa accagaccgc tgattcatca tattaccaat     60
tgggtgacca tctatgattg cgccaatatt acccgcacct tcggtgccct gccggttatg    120
gcccatgccc cggaagaatg tgcagatatg acccgcatta gcagcgcact ggttctgaat    180
attggcaccc tgaccaatga actgattgat gcaatgattc tgagtgccaa agccgccaat    240
gaaaataaaa ttccggtggt tctggatgcc gttggcgttg gcgccaccaa attccgtgat    300
tatatggcaa gcaaaattat tgatagcgtt catgtggata tcattaaagg caattatagt    360
gaaatcgcaa agctggccgg cgaaaaagcc cagaccaaag gcgtggaagc caccagcatt    420
aatgccgatc cgcgccagat tgcccgcgaa ctggcaatta gtaaaagttg taccattgtg    480
atgaccggta aagaagatat tattagtaac ggcaaaaaga tattcgtggt tcgcaatggt    540
catgaactga tgggtagcat tgttggcacc ggttgtatgg ccgcaagtgt tattggtagc    600
ttcgccgcag ttaataccga tcattgcgat gccgccaaag atgcactgtg ctacttcggc    660
attgcaggcg aactggcagc agaaattagt cgtggcccgg gtagcttcaa agttaatctg    720
tatgatgaaa ccttcgatct gagcgatgaa cgcgccgaaa aaatgatgaa cttcgaagaa    780
tattaa                                                               786

<210> 396
<211> 215
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 396
Met Asn Asp Thr Ala Asn Leu Thr Leu His Leu Gly Ala Leu Pro Val
1               5                   10                  15
Met Ala Gln Ala Ala Glu Glu Val Ala Ala Met Thr Arg Thr Ala Asp
            20                  25                  30
Ala Leu Leu Leu Asn Met Gly Thr Leu Thr Pro Ala Ala Leu Asp Ala
        35                  40                  45
Met Arg Leu Ala Gly Arg Glu Ala Asn Arg Cys Gly Val Pro Val Val
    50                  55                  60
Leu Asp Pro Val Gly Ala Gly Ala Thr Pro Tyr Arg Thr Glu Ala Ala
65                  70                  75                  80
Arg Gln Leu Leu Glu Asp Ile Asn Val Ala Ile Val Arg Gly Asn Ser
                85                  90                  95
Gly Glu Val Ala Ala Ile Ile Gly Gln Gln Ala Val Val Arg Gly Val
            100                 105                 110
Glu Ser Leu Glu Thr Ala Leu Pro Ala Ala Glu Leu Gly Ala Gln Ala
        115                 120                 125
Ala Gln Gln Leu Gly Val Val Val Ala Leu Thr Gly Ala Arg Asp Ile
    130                 135                 140
Ile Ser Asp Gly Ser Val Ser Leu Ala Val Asp His Gly Ser Pro Trp
145                 150                 155                 160
Leu Lys Thr Ile Thr Gly Ser Gly Cys Met Ala Ser Ala Ala Val Ala
                165                 170                 175
Cys Phe Ala Ala Val Ala Pro Ser Ser Leu Gln Ala Ala Ala Ala Ala
            180                 185                 190
Leu Ala Ala Tyr Gly Leu Ala Ala Glu Leu Ala His Lys Pro Gln Ile
        195                 200                 205
His Gly Pro Ala Ser Phe Lys
    210                 215

<210> 397
<211> 645
<212> DNA
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 397
atgaacgaca cggcgaacct tacgctgcat ttgggcgctc tgcctgttat ggcccaggcc     60
gctgaggagg ttgccgctat gacccgcacc gccgatgcgc tgctcttaaa catgggtacc    120
ctgacgccag cggcgctaga cgccatgcgc cttgccgggc gagaagccaa tagatgcggc    180
gttcctgtgg tactagaccc tgtgggcgct ggcgcaacgc cttaccgtac ggaggcagcg    240
cggcagcttt tagaagacat caacgttgct attgtgcgtg ggaattccgg agaagtcgcc    300
gccatcattg gacagcaagc cgtggtgcgc ggcgtggaaa gccttgagac tgcgctgcct    360
gccgcggagc tcggggcgca ggcggcgcag cagctaggtg tggttgtggc cctaacaggt    420
gcgcgcgaca tcatcagcga cggcagcgta agcctagcgg tggaccacgg ctcgccttgg    480
ctcaagacga tcacgggcag cggttgcatg gcaagcgctg cggtcgcttg ttttgcggcg    540
gtagcaccta gctctttgca ggcggccgcg gcggcgcttg cagcttatgg gctagcggct    600
gagctagccc ataagccgca aattcacggg ccagcgtcgt ttaag                    645

<210> 398
<211> 648
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 398
atgaacgata ccgcaaatct gaccctgcat ctgggcgcac tgccggtgat ggcacaggca     60
gcagaagaag tggccgccat gacccgtacc gcagatgccc tgctgctgaa tatgggtacc    120
ctgaccccgg ccgcactgga tgccatgcgc ctggcaggtc gtgaagccaa tcgttgcggt    180
gttccggtgg tgctggaccc tgtgggtgcc ggtgcaaccc cgtatcgtac cgaagcagca    240
cgccagctgc tggaagatat taatgttgcc attgttcgtg gtaatagcgg tgaagttgcc    300
gccattattg gtcagcaggc agtggttcgc ggcgtggaaa gcctggaaac cgccctgccg    360
gcagcagaac tgggtgcaca ggcagcccag cagctgggcg tggttgttgc cctgaccggt    420
gcccgcgata ttattagtga tggcagcgtt agcctggccg ttgatcatgg tagcccgtgg    480
ctgaaaacca ttaccggtag cggttgtatg gccagcgccg ccgtggcatg cttcgcagct    540
gttgccccga gcagtctgca agcagccgcc gcagcactgg cagcctatgg tctggcagca    600
gaattagcac ataaaccgca gattcatggc ccggcatcat tcaaataa                 648

<210> 399
<211> 203
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 399
Met Ala Ala Gly Met Ala Pro Ala Met Val Asp Asn Pro His Glu Ala
1               5                   10                  15
Gly Asp Phe Ala Arg Val Ala Ser Gly Val Leu Val His Leu Gly Pro
            20                  25                  30
Pro Gln Asp His Pro Val Ala Ala Met Lys Ala Ala Val Arg Ala Ala
        35                  40                  45
Asp Glu Ala Gly Thr Pro Trp Val Leu Asp Pro Val Ala Ala Gly Gly
    50                  55                  60
Leu Arg Trp Arg Thr Arg Val Ala Leu Glu Leu Leu Asp Ile Ser Arg
65                  70                  75                  80
Pro Ala Ile Ile Arg Gly Asn Ala Ser Glu Ile Ala Gly Leu Ala Gly
                85                  90                  95
Gly Ala Gly Gly Arg Gly Val Asp Ser Thr Thr Ser Pro Glu Glu Val
            100                 105                 110
Leu Asp Leu Ala Arg Ala Leu Ala Gln Arg His Arg Cys Ala Val Ala
        115                 120                 125
Val Ser Gly Pro Val Asp His Phe Val Asp Ala Glu Arg Val Val Thr
    130                 135                 140
Leu Ala Asn Gly His Arg Trp Leu Thr Arg Ile Thr Gly Val Gly Cys
145                 150                 155                 160
Ser Leu Gly Ala Leu Met Ala Gly Phe Ala Gly Val Thr Glu Asp Pro
                165                 170                 175
Leu Val Ala Ala Ala Gly Ala Thr Ala Val Ile Thr Val Ala Ala Asp
            180                 185                 190
Gln Ala Ala Gly Thr Thr Gly Leu Gly Gly Phe
        195                 200

<210> 400

<400> 400
000

<210> 401
<211> 612
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 401
atggccgcag gcatggcccc tgcaatggtg gataatccgc atgaagcagg tgacttcgcc     60
cgtgttgcaa gcggtgttct ggttcatctg ggcccgccgc aggatcatcc ggttgcagca    120
atgaaagcag cagtgcgtgc cgcagatgaa gccggtaccc cgtgggttct ggaccctgtg    180
gccgcaggtg gtctgcgctg gcgtacccgt gtggccctgg aactgctgga tattagccgc    240
ccggcaatta ttcgtggcaa tgcaagcgaa attgccggcc tggcaggtgg tgcaggcggc    300
cgtggtgttg atagtaccac cagtccggaa gaagtgctgg atctggcccg cgcactggca    360
cagcgtcatc gttgcgcagt tgcagttagc ggtccggttg atcacttcgt tgatgccgaa    420
cgcgttgtta ccctggccaa tggtcatcgc tggctgaccc gtattaccgg cgtgggttgc    480
agcctgggtg ccctgatggc aggcttcgca ggtgtgaccg aagatccgct ggttgcagca    540
gcaggcgcaa ccgcagttat taccgtggcc gcagatcagg cagcaggtac caccggcctg    600
ggtggcttct aa                                                        612

<210> 402
<211> 299
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 402
Met Ser Ala Leu Arg Lys Ile Val Ala Lys Val Gly Thr Asn Val Leu
1               5                   10                  15
Ala Arg Glu Asp Gly Leu Leu Asp Ile Thr Ser Ile Ser His Leu Val
            20                  25                  30
Asp Gln Ile Ala Ala Leu Lys Ala Gln Gly Val Glu Val Ile Leu Val
        35                  40                  45
Ser Ser Gly Ala Val Gly Ala Gly Arg Ser Leu Phe Pro Val Pro Glu
    50                  55                  60
Gly Ala Asn Lys Val Val Arg Arg Gln Val Leu Ser Ala Ile Gly Gln
65                  70                  75                  80
Val Arg Leu Met Glu Ile Tyr Arg Gln Leu Phe Ala Asn His Gly Leu
                85                  90                  95
Phe Cys Ala Gln Val Leu Ala Thr Lys Ala Asp Phe Gln Gly Lys Thr
            100                 105                 110
His Tyr Thr Asn Met Lys Ser Cys Phe Gln Ala Leu Leu Arg Asp Lys
        115                 120                 125
Val Val Pro Val Val Asn Glu Asn Asp Val Val Ser Val Asn Glu Leu
    130                 135                 140
Met Phe Thr Asp Asn Asp Glu Leu Ala Gly Leu Val Ala Ala Met Thr
145                 150                 155                 160
Asn Ala Gln Ala Leu Ile Ile Leu Ser Ser Val Asp Gly Val Leu Ser
                165                 170                 175
Gly Pro Pro Gly Glu Pro Gly Ser Glu Val Ile Pro Glu Ile Asp Pro
            180                 185                 190
Glu Asp Lys Gln Trp Leu Lys Leu Ile Leu Pro Ser Lys Ser Ser Phe
        195                 200                 205
Gly Arg Gly Gly Met His Thr Lys Phe Arg Ile Ala Gln Lys Ala Ala
    210                 215                 220
Lys Ala Gly Ile Thr Thr Tyr Ile Ala Asn Gly Arg Arg Pro Asn Ile
225                 230                 235                 240
Leu Leu Asp Ile Leu Lys Gly Gln Phe Thr Gly Thr Arg Phe Pro Ala
                245                 250                 255
Thr Gly Arg Leu Ser Asn Leu Lys Lys Arg Leu Ala Tyr Gly Glu Pro
            260                 265                 270
Glu Ser Lys Ala Ser Val His Ile Asn Arg Gly Ala Glu Ala Ala Leu
        275                 280                 285
Cys Ser Pro Asp Gln Ile Ser Ser Leu Leu Pro
    290                 295

<210> 403

<400> 403
000

<210> 404
<211> 906
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 404
atgagcgcac tgcgcaaaat tgtggccaaa gtgggcacca atgtgctggc acgcgaagat     60
ggcctgctgg atattaccag cattagtcat ctggttgatc agattgccgc actgaaagcc    120
cagggtgttg aagtgattct ggtgagcagt ggcgcagttg gtgccggtcg tagtctgttc    180
ccggttccgg aaggtgcaaa taaagttgtg cgtcgtcagg ttctgagtgc cattggtcag    240
gtgcgcctga tggaaatcta tcgccagctg ttcgccaatc atggtctgtt ctgcgcccag    300
gttctggcaa ccaaagcaga cttccagggt aaaacacatt ataccaatat gaaaagctgc    360
ttccaggcac tgctgcgtga taaagttgtt ccggtggtga atgaaaatga tgtggtgagc    420
gtgaatgaac tgatgttcac cgataatgat gaactggccg gcctggtggc cgccatgacc    480
aatgcccagg cactgattat tctgagcagt gttgatggcg ttctgagcgg cccgccgggt    540
gaacctggta gtgaagttat tccggaaatt gatccggaag ataaacagtg gctgaaactg    600
attctgccga gcaaaagtag cttcggccgc ggcggtatgc ataccaaatt ccgtattgcc    660
cagaaagccg ccaaagccgg cattaccacc tatattgcaa atggccgtcg tccgaatatt    720
ctgctggata tcctgaaagg ccagttcacc ggtacccgct tcccggcaac cggccgtctg    780
agtaatctga aaaaacgtct ggcatacggt gaaccggaaa gtaaagcaag cgttcatatt    840
aatcgcggcg cagaagccgc cctgtgcagt cctgatcaga ttagcagcct gctgccgtaa    900
ggatcc                                                               906

<210> 405
<211> 218
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 405
Met Arg Gly Leu Gly Asp Val Tyr Lys Arg Gln Asn Ser Arg Ser Ile
1               5                   10                  15
Leu Leu Ser Glu Leu Ile Leu Leu Lys Leu Gly Gly Ser Val Ile Thr
            20                  25                  30
Asp Lys Thr Arg Pro Phe Thr Ala Arg Thr Asp Val Ile Gln Arg Leu
        35                  40                  45
Ala Gln Glu Ile Lys Ser Ala Leu Ala Glu Arg Gly Asp Asp Leu Arg
    50                  55                  60
Leu Ile Ile Gly His Gly Ser Gly Ser Phe Gly His Glu Val Ala Asp
65                  70                  75                  80
Lys Tyr Gln Thr His Lys Gly Gly Thr Ser Ala Glu Ser Trp Leu Gly
                85                  90                  95
Phe Ala Glu Val Ala His Val Ala Ala Thr Leu Asn His Leu Val Val
            100                 105                 110
Asn Ala Leu Arg Glu Val Gly Val Pro Ala Met Arg Phe Gln Pro Ser
        115                 120                 125
Ala Ser Thr Arg Thr Arg Gly Glu Gln Leu Met Tyr Phe Glu Thr Phe
    130                 135                 140
Pro Leu Lys Glu Ala Leu Ser His Gly Leu Val Pro Val Val Tyr Gly
145                 150                 155                 160
Asp Val Ser Val Asp Ala Ala Gln Gly Met Ser Ile Val Ser Thr Glu
                165                 170                 175
Leu Leu Phe Asp Asn Leu Ala Arg Glu Leu Ser Pro Ser Arg Ile Val
            180                 185                 190
Leu Ala Gly Arg Val Asp Gly Val Tyr Glu Ala Asp Pro Asn Leu Asn
        195                 200                 205
Pro Leu Ser Leu Ile His Ile Ser Glu Pro
    210                 215

<210> 406

<400> 406
000

<210> 407
<211> 663
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 407
atgcgcggtc tgggtgatgt gtataaacgt cagaatagcc gcagcattct gctgagtgaa     60
ctgattctgc tgaaactggg cggtagcgtt attaccgata aaaccagacc gttcaccgca    120
cgcaccgatg ttattcagcg tctggcacag gaaattaaaa gcgccctggc agaacgtggc    180
gatgatctgc gtctgattat tggccatggt agtggcagct tcggtcatga agttgcagat    240
aaatatcaga cccataaagg tggtaccagt gcagaaagtt ggctgggctt cgccgaagtg    300
gcacatgttg cagcaaccct gaatcatctg gttgtgaatg cactgcgtga agtgggcgtg    360
ccggccatgc gcttccagcc tagtgcaagc acccgtaccc gtggtgaaca gctgatgtac    420
ttcgaaacct tcccgctgaa agaagccctg agtcatggtc tggttccggt tgtgtatggc    480
gatgttagcg tggatgccgc ccagggcatg agtattgtga gcaccgaact gctgttcgat    540
aatctggcac gtgaactgag cccgagccgt attgttctgg caggccgtgt ggatggcgtg    600
tatgaagcag atccgaatct gaatccgctg agtctgattc atattagcga accgtaagga    660
tcc                                                                  663

<210> 408
<211> 239
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 408
Met Ala Gly Gly His Ile His His Leu Ala Lys Lys Tyr Asn Leu Thr
1               5                   10                  15
Thr Ser Cys Ala Glu Asn Pro Gly Lys Leu Ala Gln Ala Leu Asp Val
            20                  25                  30
Gln Lys Thr Thr Lys Arg Leu Ser Asp Asp Ile Val Ala Leu Ala Gly
        35                  40                  45
Lys Thr Asn Phe Ser Leu Ser Gln Ile Pro Thr Cys Lys Ile Val Thr
    50                  55                  60
Asn Lys Asp Gly Lys Phe Leu Asn Ile Ala Thr Asp Gly Ile Gln Lys
65                  70                  75                  80
Thr Leu Ala Thr Ala Gly Val Pro Val Leu Tyr Gly Asp Met Val Pro
                85                  90                  95
Asp Glu Thr Phe Gly Leu Ser Ile Cys Ser Gly Asp Thr Leu Ile Thr
            100                 105                 110
Glu Glu Ala Pro Leu Ile Gly Ala Thr Arg Val Ile Tyr Val Ser Asp
        115                 120                 125
Ile Asp Gly Ile Tyr Thr Gly Asp Pro Tyr Gln Asn Glu Asp Ala Glu
    130                 135                 140
Leu Ile Glu Lys Ile Ser Val Ala Glu Leu Ser Asn Asn His Ile Ser
145                 150                 155                 160
Ile Gly Asn Ser His Asn Ile Asp Val Thr Gly Gly Leu Lys Asn Lys
                165                 170                 175
Leu Glu Pro Val Ala Lys Leu Phe Leu Ser Thr Pro Ser Leu Thr Lys
            180                 185                 190
Ile Glu Ile Cys Asn Gly Leu Lys Pro His Ile Leu Ser Ala Val Leu
        195                 200                 205
Arg Gly Gln Ala Val Pro His Thr Ala Gly Ser Gln Pro Pro Gln Leu
    210                 215                 220
Val Ile Arg Thr Thr Leu Met Ser Pro Val Val Leu Lys Ile Asn
225                 230                 235

<210> 409

<400> 409
000

<210> 410
<211> 726
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 410
atggccggtg gccatattca tcatctggca aaaaaatata acctgaccac cagctgtgcc     60
gaaaatccgg gtaaactggc ccaggccctg gatgttcaga aaaccaccaa acgtctgagc    120
gatgatattg tggcactggc cggtaaaacc aacttcagtc tgagtcagat tccgacctgc    180
aaaattgtga ccaataaaga tggtaaattc ctgaatatcg ccaccgatgg tattcagaaa    240
accttagcca ccgcaggcgt tccggtgctg tatggtgata tggttccgga tgaaaccttc    300
ggcctgagta tctgtagtgg tgataccctg attaccgaag aagccccgct gattggcgcc    360
acccgtgtta tctatgtgag tgatattgat ggtatctata ccggtgatcc gtatcagaat    420
gaagatgcag aactgattga aaaaatcagc gtggcagaac tgagtaataa tcatattagc    480
attggcaata gccataatat tgatgttacc ggcggcctga aaaataaact ggaaccggtg    540
gccaaactgt tcctgagtac cccgagcctg accaaaattg aaatctgtaa tggcctgaaa    600
ccgcatattc tgagcgccgt tctgcgtggc caggccgtgc ctcataccgc aggcagccag    660
ccgccgcagc tggttattcg caccaccctg atgagtccgg tggtgctgaa aattaattaa    720
ggatcc                                                               726

<210> 411
<211> 370
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 411
Met Met Leu Gly Asn Ala Lys Arg Ile Val Ile Lys Val Gly Ser Ser
1               5                   10                  15
Leu Leu Ile Asp Ala Glu Asn Gln Leu Arg Gly Asn Phe Leu Ala Gln
            20                  25                  30
Ile Ala Glu Ser Ile Ala Ala Leu Ile Ser Glu Gly Lys Gln Val Ile
        35                  40                  45
Val Val Thr Ser Gly Ala Val Ala Leu Gly Arg Pro Ser Leu Gly Tyr
    50                  55                  60
Gly Ala Arg Thr Leu Thr Leu Glu Glu Lys Gln Ala Ala Ala Ala Cys
65                  70                  75                  80
Gly Gln Ile Thr Leu Phe Ser Met Trp Asp Lys Ala Phe Ser Ala Phe
                85                  90                  95
Gln Leu Arg Pro Ala Gln Ile Leu Leu Thr Ala Asp Asp Ser Ile His
            100                 105                 110
Arg Arg Arg Tyr Leu Asn Ala Lys Asn Thr Leu Asp Thr Leu Leu Glu
        115                 120                 125
Asn Lys Ser Val Ile Pro Val Ile Asn Glu Asn Asp Thr Val Ala Thr
    130                 135                 140
Ala Glu Leu Arg Phe Gly Asp Asn Asp Arg Leu Ala Ala Arg Val Ala
145                 150                 155                 160
Gln Met Ala Glu Ala Asp Leu Leu Ile Ile Phe Ser Asp Ile Asp Gly
                165                 170                 175
Leu Tyr Ser Glu Asn Pro Asn His Asn Pro Asp Ala Arg Phe Ile Asp
            180                 185                 190
Glu Val Thr Glu Ile Thr Pro Glu Ile Glu Lys Met Ala Gly Gly Ala
        195                 200                 205
Ala Ser Ser Val Ser Ser Gly Gly Met Val Thr Lys Ile Ala Ala Ala
    210                 215                 220
Lys Ile Ala Thr Ala Ser Gly Cys His Val Ile Ile Ala Lys Gly Leu
225                 230                 235                 240
Gly Glu His Pro Leu Arg Ala Leu Ile Gln Gly Gly Lys His Thr His
                245                 250                 255
Phe Met Ala Lys Gly Thr Pro Arg Thr Ala Arg Lys Glu Trp Ile Ala
            260                 265                 270
Gly Ser Leu His Ala Thr Gly Glu Ile Trp Val Asp Ala Gly Ala Val
        275                 280                 285
Lys Ala Leu Gln Ser Gly Lys Ser Leu Leu Pro Ala Gly Val Thr Leu
    290                 295                 300
Val Ile Gly Ser Phe Asp Arg Gly Asp Ala Val Asn Ile Ile Asp Ala
305                 310                 315                 320
Ala Thr Thr Lys Val Ile Gly Lys Gly Leu Ile Ala Tyr Ala Ala Glu
                325                 330                 335
Asp Ala Ala Arg Ile Ala Gly Lys Lys Ser Gln Glu Ile Glu Gln Ile
            340                 345                 350
Leu Gly Phe Lys Arg Arg Asp Val Leu Ile His Arg Asp Asp Met Val
        355                 360                 365
Leu Glu
    370

<210> 412
<211> 1110
<212> DNA
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 412
atgatgttag gcaacgcaaa acgcattgtt atcaaagttg gctcatcgct cttgattgac     60
gcggaaaacc agctgcgcgg caattttctg gcgcagatag cagaaagcat tgccgccctc    120
atcagcgaag gcaagcaggt gattgtggtg acatccggtg cggtggcgct tggccgccct    180
tcgttgggat atggcgcacg tacgcttacg ctcgaagaaa aacaggcggc cgctgcctgc    240
gggcagatca cgctattttc catgtgggat aaagcgttct ccgcgtttca gcttagacct    300
gcgcagatat tgcttacggc cgatgacagc atccatcgcc gccggtatct gaatgcaaaa    360
aatacactcg atacattgct tgaaaacaaa tccgtcatcc ccgttatcaa tgaaaatgac    420
acagtggcga cggcagaatt acgcttcggt gataatgacc ggctggctgc gcgcgtcgcg    480
caaatggcgg aagcggatct gctgattatt ttctcggata ttgatgggtt atactcagaa    540
aatcctaacc ataatcctga tgcccgcttt attgacgaag taaccgaaat cacacctgaa    600
atcgaaaaaa tggcaggcgg tgcggcatcc agcgtgtcat ccggcggaat ggtgacgaaa    660
attgctgcag ctaaaatcgc aactgcttct ggctgccatg tgattattgc aaaaggtctt    720
ggagagcatc ccctgcgtgc acttatacaa ggcggcaaac atacgcattt catggccaag    780
ggaacaccac gcacagcgcg caaggaatgg atcgcagggt cgctgcatgc aacgggtgaa    840
atttgggtcg atgcaggtgc agtaaaagca ttacaatcag gtaaaagttt gttgcctgca    900
ggcgtaacgc tggtcatagg aagttttgat cgtggagatg ctgtaaatat tattgatgca    960
gcaaccacca aagtgatcgg caaaggctta atcgcttatg cagcggaaga tgctgcgcgt   1020
attgccggca aaaaaagcca ggaaattgaa cagattcttg gttttaaacg tcgtgatgtg   1080
ctcattcacc gcgacgatat ggtactggaa                                    1110

<210> 413
<211> 1119
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 413
atgatgctgg gtaatgcaaa acgtattgtg attaaagtgg gtagcagcct gctgattgat     60
gccgaaaatc agctgcgcgg caacttcctg gcacagattg cagaaagtat tgccgcactg    120
attagcgaag gcaaacaggt tattgttgtt accagtggtg ccgtggcact gggtcgcccg    180
agcttaggtt atggcgcacg caccctgacc ctggaagaaa aacaggccgc agcagcatgt    240
ggccagatta ccctgttcag tatgtgggat aaagccttca gcgccttcca gctgcgcccg    300
gctcagattc tgctgaccgc agatgatagc attcatcgtc gccgttatct gaatgcaaaa    360
aataccctgg ataccctgct ggaaaataaa agtgttattc cggtgattaa cgaaaatgat    420
accgttgcaa ccgccgaact gcgcttcggt gataatgatc gtctggccgc ccgtgtggca    480
cagatggcag aagccgatct gctgattatc ttcagtgata ttgatggtct gtatagtgaa    540
aatccgaatc ataatccgga tgcccgcttc attgatgaag tgaccgaaat taccccggaa    600
attgaaaaaa tggccggtgg tgccgcaagc agtgttagta gcggtggtat ggttaccaaa    660
attgccgccg ccaaaattgc caccgccagt ggttgccatg ttattattgc aaaaggtctg    720
ggtgaacatc cgctgcgcgc actgattcag ggcggcaaac atacccactt catggccaaa    780
ggtaccccgc gcaccgcacg taaagaatgg attgcaggca gtctgcatgc aaccggtgaa    840
atctgggttg atgccggcgc agttaaagcc ctgcaaagcg gtaaaagtct gctgccggcc    900
ggcgttaccc tggttattgg tagcttcgat cgtggcgatg ccgtgaatat tattgatgca    960
gcaaccacca aagtgattgg taaaggtctg attgcctatg cagccgaaga tgccgcccgt   1020
attgccggta aaaaaagcca ggaaattgaa cagattctgg gcttcaaacg ccgtgatgtg   1080
ctgattcatc gcgatgatat ggttctggaa taaggatcc                          1119

<210> 414
<211> 237
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 414
Met Ser Ser Leu Thr Ser Glu Ala Asp Gly Gln Leu Asp Pro Asn Arg
1               5                   10                  15
Ile Gln Ala Leu Val Asp Thr Leu Ala Ala Ala Arg Ser Ala Gly Arg
            20                  25                  30
Glu Ile Val Leu Leu Gly Asp Val Tyr Lys Arg Gln Gly Ile Gly Pro
        35                  40                  45
Leu Gly Leu Ser Arg Arg Pro Asn Asp Leu Pro Thr Gln Gln Ala Ala
    50                  55                  60
Ala Ser Val Gly Gln Gly Leu Leu Val Ala His Tyr Thr Arg Arg Phe
65                  70                  75                  80
His Glu Tyr Gly Trp Pro Val Gly Gln Val Leu Leu Thr Val Asp Asp
                85                  90                  95
Val Thr Arg Gln Gln His Tyr Arg Asn Ala Tyr Arg Thr Phe Glu Lys
            100                 105                 110
Leu Leu Gln Leu Gly Val Ile Pro Ile Val Asn Glu Asn Asp Thr Val
        115                 120                 125
Ala Thr His Glu Ile Arg Phe Gly Asp Asn Asp Arg Leu Ala Ala Leu
    130                 135                 140
Val Ala Gln Leu Val His Ala Asp Ala Leu Phe Leu Phe Ser Asp Val
145                 150                 155                 160
Asp Ala Leu Tyr Thr Asp His Pro Ser Thr Pro Asp Ala Arg Arg Ile
                165                 170                 175
Ala Leu Val Asn Ser Ser Ala Asp Leu Leu Gly Val Asp Thr Ser Arg
            180                 185                 190
Thr Gly Ser Arg Val Gly Thr Gly Gly Met Thr Thr Lys Ile Gln Ala
        195                 200                 205
Ala Gly Ile Ala Thr Ser Ala Gly Val Pro Val Arg Val Ala Ala Ala
    210                 215                 220
Ala Asn Ala Ser Ala Ala Pro Ala Gly Gly Ala Val Ala
225                 230                 235

<210> 415

<400> 415
000

<210> 416
<211> 720
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 416
atgagcagtc tgaccagtga agccgatggt cagctggacc ctaatcgtat tcaggccctg     60
gtggataccc tggcagcagc acgcagcgcc ggtcgtgaaa ttgtgctgct gggtgatgtg    120
tataaacgcc agggtattgg tccgctgggc ctgagccgcc gtccgaatga tctgccgacc    180
cagcaggccg cagcaagtgt tggtcagggc ctgctggttg cacattatac ccgccgcttc    240
catgaatatg gctggccggt tggtcaggtt ctgctgaccg tggatgatgt tacccgccag    300
cagcattatc gtaatgcata tcgcaccttc gaaaaactgc tgcaactggg tgttattccg    360
attgtgaatg aaaatgatac cgtggcaacc catgaaattc gcttcggtga taatgatcgc    420
ctggccgcac tggttgcaca gctggttcat gcagatgcac tgttcctgtt cagcgatgtg    480
gatgccctgt ataccgatca tccgagcacc ccggatgcac gccgcattgc cctggtgaat    540
agcagcgcag acttactggg tgttgatacc agccgtaccg gtagccgcgt tggtaccggt    600
ggcatgacca ccaaaattca ggcagcaggc attgccacca gcgcaggtgt gccggttcgc    660
gttgcagccg cagccaatgc aagtgccgcc ccggctggtg gtgcagttgc ataaggatcc    720

<210> 417
<211> 299
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 417
Met Asp Gly Ile Arg Asp Gln Ser Ser Ser Arg Gly Gly Trp Arg Cys
1               5                   10                  15
Met Thr Asn Pro Ala Leu Thr Asp Asp Ala Pro Val Val Phe Leu Lys
            20                  25                  30
Leu Gly Gly Ser Leu Ile Thr Asp Lys Thr Arg Pro Glu Thr Asp Arg
        35                  40                  45
Pro Asp Ile Leu Ser Arg Leu Ala Gly Glu Leu Ala Gln Ala Arg Ala
    50                  55                  60
Arg Ala Pro Arg Ile Arg Leu Leu Leu Gly His Gly Ser Gly Ser Phe
65                  70                  75                  80
Gly His Ala Ala Ala Ala Arg His Gly Thr Arg Ala Gly Val Ala Gly
                85                  90                  95
Pro Ala Gly Trp Leu Gly Phe Ala Glu Val Ala Asp Ala Ala Ala Arg
            100                 105                 110
Leu Asn Arg Val Val Ala Ala Gln Met Leu Ala Ala Gly Leu Pro Lys
        115                 120                 125
Arg Gln Val Gln Pro Ser Ala Gly Ala Leu Cys Arg Asp Gly Ser Leu
    130                 135                 140
Thr Ala Trp Gln Ile Asp Ala Val Glu Gly Ala Leu Ala Arg Gly Leu
145                 150                 155                 160
Ile Pro Leu Val Tyr Gly Asp Ala Val Leu Asp Thr Val Arg Gly Gly
                165                 170                 175
Thr Ile Ala Ser Thr Glu Glu Leu Phe Gly Trp Leu Thr Pro Arg Leu
            180                 185                 190
Gln Pro Val Arg Ile Val Leu Ala Gly Val Val Asp Gly Val Tyr Asp
        195                 200                 205
Ala Asp Pro Leu Ser Asn Pro Arg Ala Ala Arg Ile Gly Glu Ile Thr
    210                 215                 220
Pro Ala Thr Leu Pro Ser Leu Ala Arg Gln Leu Gly Gly Ser His Gly
225                 230                 235                 240
Val Asp Val Thr Gly Gly Met Leu Ser Lys Val Thr Glu Met Cys Arg
                245                 250                 255
Leu Val Ala Ala His Pro Arg Thr Glu Val Trp Leu Val Ser Gly Gln
            260                 265                 270
Arg Ala Gly Ala Val Leu Gln Ala Leu Leu Gly Glu Asp Ala Gly Gly
        275                 280                 285
Thr Arg Ile Ala Ala Glu Gly Lys Gly Ser Leu
    290                 295

<210> 418

<400> 418
000

<210> 419
<211> 906
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 419
atggatggta ttcgtgatca gagtagcagt cgcggtggtt ggcgctgcat gaccaatccg     60
gccctgaccg atgatgcccc ggttgtgttc ctgaaactgg gcggtagtct gattaccgat    120
aaaaccagac cggaaaccga tcgtccggat attctgagtc gtctggcagg cgaactggcc    180
caggcacgtg cacgcgcacc gagaattaga ctgctgctgg gtcatggtag cggtagcttc    240
ggtcatgcag ccgcagcacg ccatggtacc cgtgccggtg tggccggtcc ggcaggttgg    300
ttaggcttcg cagaagtggc agatgccgcc gcccgcctga atcgtgttgt tgcagcacag    360
atgctggcag caggcctgcc gaaacgtcag gttcagccga gtgccggtgc cctgtgtcgt    420
gatggtagcc tgaccgcctg gcagattgat gccgtggaag gcgcactggc acgcggtctg    480
attccgctgg tgtatggcga tgccgtgctg gataccgttc gcggcggcac cattgcaagt    540
accgaagaac tgttcggctg gctgaccccg cgcctgcaac cggttcgcat tgttctggcc    600
ggtgttgtgg atggcgtgta tgatgccgat ccgctgagta atccgcgtgc cgcacgcatt    660
ggtgaaatta ccccggccac cctgccgagc ctggcaagac agctgggcgg ttcacatggc    720
gttgatgtga ccggtggcat gctgagcaaa gttaccgaaa tgtgtcgtct ggttgcagca    780
catccgcgta ccgaagtgtg gctggttagc ggccagcgtg ccggtgccgt tctgcaagca    840
ctgctgggtg aagatgccgg tggcacccgt attgcagcag aaggtaaagg cagcctgtaa    900
ggatcc                                                               906

<210> 420
<211> 534
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 420
Met Ser Asp Val Tyr Lys Arg Gln Pro Glu Leu Arg His Arg Arg His
1               5                   10                  15
Pro Gly Pro Asp Arg Gly Gly Leu Gly Gly Gly Gly Ala Gly Arg Pro
            20                  25                  30
Leu Pro Arg Pro Tyr Arg Ala Leu Arg His Ala Asp Pro Pro Arg Arg
        35                  40                  45
Arg His Gly Arg Ala Ser Arg Arg Gly Leu Ala Asp Gly Ala Arg Arg
    50                  55                  60
Ala Arg Gly Leu Trp Pro Arg Ala Gly Gly Glu Ala Arg Asp Pro Leu
65                  70                  75                  80
Pro Gln Gln Asp Arg Arg Pro Asp Pro Gly Gly Asp Arg Gly Ala Gln
                85                  90                  95
Ala Pro Ala Arg Gly Gly Phe Gly Arg Ala Arg Ala Leu His Leu Gly
            100                 105                 110
Arg Gly Glu Gly Gly Ala Gly Gly Gly Ala Ala Arg Gly Ala Gly Ser
        115                 120                 125
Cys Arg Arg Pro Ala Cys Gly Arg Gly Gly Gly Arg Gly Ser Ala Ser
    130                 135                 140
Glu Val Tyr Lys Ser Gln Ala Leu Leu Ala Arg Arg Arg Val Thr Ile
145                 150                 155                 160
Lys Ile Gly Ser Ala Leu Leu Val Asp Gly Glu Gly Arg Ile Arg Arg
                165                 170                 175
Ala Trp Leu Asp Gly Leu Ala Ala Asp Ile Gly Glu Leu Ala Arg Arg
            180                 185                 190
Gly Ser Arg Val Ile Val Val Thr Ser Gly Ala Ile Ala Leu Gly Arg
        195                 200                 205
Arg Ala Leu Gly Leu Pro Gln Arg Ala Leu Arg Leu Glu Glu Lys Gln
    210                 215                 220
Ala Ala Ala Ala Ala Gly Gln Ile Leu Leu Ala Gly Ala Trp Ala Glu
225                 230                 235                 240
Ser Leu Ala Gly His Gly Leu Ile Ala Ala Gln Leu Leu Val Thr Leu
                245                 250                 255
Gly Asp Thr Glu Gly Arg Arg Arg Tyr Leu Asn Ala Arg Ala Thr Ile
            260                 265                 270
Glu Thr Leu Leu Lys Leu Gly Ala Val Pro Val Val Asn Glu Asn Asp
        275                 280                 285
Thr Val Ala Thr Thr Glu Ile Arg Tyr Gly Asp Asn Asp Arg Leu Ser
    290                 295                 300
Ala Arg Val Ala Val Met Ser Gly Ala Glu Thr Leu Val Leu Leu Ser
305                 310                 315                 320
Asp Val Asp Gly Leu Tyr Thr Ala Asp Pro Gly Arg Asp Pro Ala Ala
                325                 330                 335
Arg His Ile Pro Glu Val Gly Ala Ile Thr Ala Glu Ile Glu Ala Met
            340                 345                 350
Ala Thr Gly Ser Gly Ser Ala Val Gly Thr Gly Gly Met Ala Ser Lys
        355                 360                 365
Leu Val Ala Ala Thr Ile Ala Thr Gln Ser Gly Cys Ala Val Leu Leu
    370                 375                 380
Thr Ser Gly Lys Pro Asp Arg Pro Leu Glu Ala Leu Arg Ser Gly Ala
385                 390                 395                 400
Arg Cys Thr Val Phe Ala Ala Arg Ser Thr Pro Arg Arg Ala Arg Lys
                405                 410                 415
His Trp Leu Ala Ala Thr Leu Lys Pro Ala Gly Arg Leu Thr Val Asp
            420                 425                 430
Asp Gly Ala Leu Ala Ala Leu Arg Arg Gly Ser Ser Leu Leu Pro Ala
        435                 440                 445
Gly Val Ala Ala Val Glu Gly Arg Phe Glu Arg Gly Asp Ala Val Leu
    450                 455                 460
Val Gln Asn Leu Gln Gly Thr Val Val Ala Lys Gly Leu Val Ala Tyr
465                 470                 475                 480
Asp Ala Ala Asp Ala Glu Arg Leu Arg Gly Arg Arg Thr Ala Asp Ile
                485                 490                 495
Glu Ala Leu Leu Gly Tyr Arg Gly Arg Asp Glu Met Ile His Arg Asp
            500                 505                 510
Asp Leu Val Leu Val Gln Ala Pro Gly Glu Pro Ala Gly Glu Ala Ala
        515                 520                 525
Val Gly Ala Gly Ala Pro
    530

<210> 421

<400> 421
000

<210> 422
<211> 1611
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 422
atgagcgatg tgtataaacg tcagccggaa ctgcgccatc gtcgccatcc gggtccggat     60
cgcggtggtc tgggcggtgg tggtgccggt cgtcctctgc ctcgccctta tcgtgcactg    120
cgtcatgccg atccgccgcg ccgtcgtcat ggtcgtgcct cacgccgtgg tctggcagat    180
ggcgcacgtc gcgcccgtgg tctgtggcct cgtgcaggtg gcgaagcccg cgatccgctg    240
cctcagcagg atcgccgtcc ggaccctggc ggtgatcgcg gtgctcaggc ccctgctcgc    300
ggtggcttcg gtcgtgcacg cgcactgcat ctgggccgcg gtgaaggtgg cgccggtggt    360
ggtgctgccc gtggtgcagg tagctgtcgt cgcccggcat gcggccgtgg tggtggtaga    420
ggtagtgcaa gtgaagtgta taaaagtcag gcactgctgg cacgccgtcg cgttaccatt    480
aaaattggta gcgcactgct ggtggatggt gaaggtcgta ttcgccgcgc atggctggat    540
ggcctggcag cagatattgg tgaactggcc cgtcgtggca gccgtgttat tgtggttacc    600
agtggcgcaa ttgccctggg ccgccgtgcc ttaggcctgc ctcagcgtgc actgcgcctg    660
gaagaaaaac aggccgcagc agcagccggc cagattctgc tggccggcgc atgggccgaa    720
agtctggccg gtcatggcct gattgcagcc cagctgctgg tgaccctggg cgataccgaa    780
ggccgtcgtc gttatctgaa tgcacgcgca accattgaaa ccttactgaa actgggtgca    840
gtgccggttg tgaatgaaaa tgataccgtg gccaccaccg aaattcgcta tggtgataat    900
gatcgcctga gtgcacgcgt tgccgttatg agcggtgccg aaaccttagt gctgctgagc    960
gatgtggatg gcctgtatac cgccgatccg ggtcgtgatc cggcagcccg tcatattccg   1020
gaagttggtg ccattaccgc agaaattgaa gccatggcca ccggtagtgg tagcgccgtt   1080
ggtaccggtg gtatggccag taaactggtt gccgccacca ttgccaccca gagcggctgt   1140
gcagtgctgc tgaccagcgg taaaccggat cgcccgctgg aagccctgcg tagtggcgca   1200
cgttgtaccg tgttcgccgc acgtagcacc ccgcgccgtg ctagaaaaca ttggctggca   1260
gccaccctga aaccggccgg ccgtctgacc gtggatgatg gcgccctggc agcactgcgt   1320
cgcggttcaa gtctgctgcc ggcaggcgtg gcagccgttg aaggtcgctt cgaacgcggc   1380
gatgcagtgc tggtgcagaa tctgcaaggc accgttgttg ccaaaggtct ggtggcatac   1440
gatgccgccg atgccgaacg tctgcgcggt cgtcgtaccg ccgatattga agcactgctg   1500
ggttatcgcg gtcgcgatga aatgattcat cgcgatgatc tggttctggt tcaggcaccg   1560
ggcgaaccgg ccggtgaagc agcagtgggc gccggtgcac cgtaaggatc c            1611

<210> 423
<211> 267
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 423
Met Ala Asp Arg Lys Ile Ile Glu Ser Thr Leu Ala Arg Thr Asn Leu
1               5                   10                  15
Thr Asp His Asp Leu Ala Asn Val Ala Tyr Ser Pro Thr Ala Val Met
            20                  25                  30
Pro Asp Val Arg Val Val Lys Ile Gly Gly Gln Ser Val Met Asp Arg
        35                  40                  45
Gly Arg Ala Ala Leu Phe Pro Ile Leu Asp Glu Leu Val Ala Ala Arg
    50                  55                  60
Lys Glu Gly Ile Ala Val Val Val Leu Val Gly Gly Gly Thr Arg Ala
65                  70                  75                  80
Arg His Ile Tyr Ser Ile Ala Ser Glu Leu Glu Met Pro Val Gly Val
                85                  90                  95
Met Ala Thr Leu Gly Lys Tyr Ile Pro Met Gln Asn Ala Arg Met Val
            100                 105                 110
Gln Met Leu Leu Ala Lys His Gly Gly Leu Tyr Ile Leu Pro Asp Asp
        115                 120                 125
Phe Glu Lys Leu Pro Leu Tyr Leu Gln Leu Gly Cys Leu Pro Val Met
    130                 135                 140
Ser Gly Met Pro Pro Phe Gly Tyr Trp Glu Lys Arg Glu Glu Gly Ser
145                 150                 155                 160
Arg Ile Pro Pro His Arg Thr Asp Ala Gly Val Phe Leu Ser Ala Glu
                165                 170                 175
Phe Leu Gly Ala Arg Arg Ala Ile Phe Ile Lys Asp Glu Asp Gly Leu
            180                 185                 190
Tyr Glu Asp Asp Pro Lys Lys Asn Pro Ala Ala Lys His Ile Pro Arg
        195                 200                 205
Ile Thr Ala Lys Glu Leu Glu Ala Arg Gly Leu Pro Asp Leu Val Val
    210                 215                 220
Glu Arg Val Val Val Glu Tyr Leu Pro Arg Ala Arg Trp Cys Lys Gln
225                 230                 235                 240
Leu Gln Ile Val Asn Gly Leu Lys Lys Gly Gln Val Leu Ala Ala Leu
                245                 250                 255
Lys Gly Glu Asp Val Gly Thr Ile Ile Ser Ala
            260                 265

<210> 424

<400> 424
000

<210> 425
<211> 804
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 425
atggccgatc gtaaaattat tgaaagcacc ctggcacgca ccaatctgac cgatcatgat     60
ctggccaatg tggcatatag tccgaccgca gttatgccgg atgttcgcgt ggttaaaatt    120
ggcggtcaga gcgttatgga tcgtggccgt gccgccctgt tcccgattct ggatgaactg    180
gttgccgccc gcaaagaagg tattgccgtg gttgtgctgg tgggtggtgg cacccgtgcc    240
cgtcatatct atagcattgc aagtgaactg gaaatgccgg tgggtgtgat ggccaccctg    300
ggcaaatata ttccgatgca gaatgcacgc atggttcaga tgctgctggc aaaacatggc    360
ggtctgtata ttctgccgga tgacttcgaa aaactgccgc tgtatctgca actgggctgc    420
ctgccggtta tgagtggtat gccgccgttc ggttattggg aaaaacgcga agaaggcagt    480
cgcattccgc cgcatcgcac cgatgcaggt gtgttcctga gtgccgagtt cctgggcgca    540
cgccgcgcaa tcttcattaa agatgaagat ggtctgtatg aagatgatcc gaaaaaaaat    600
ccggccgcaa aacatattcc gcgcattacc gcaaaagaac tggaagcacg tggtctgccg    660
gatctggttg ttgaacgcgt tgtggttgaa tatctgccgc gcgcacgctg gtgtaaacag    720
ctgcaaattg tgaatggtct gaaaaaaggc caggttctgg cagccctgaa aggtgaagat    780
gttggtacca ttattagcgc ctaa                                           804

<210> 426
<211> 263
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 426
Met Ser Asp Leu Thr Phe Leu Lys Leu Gly Gly Ser Leu Leu Thr Asp
1               5                   10                  15
Lys Pro Glu Arg Glu Ala Leu Arg Ala Asp Val Leu Ala Arg Leu Thr
            20                  25                  30
Asn Glu Ile Ala Ala Ala Arg Tyr Ala Arg Pro Asp Met Lys Leu Val
        35                  40                  45
Met Gly His Gly Ser Gly Ser Phe Gly His Thr Ala Ala Ala Arg His
    50                  55                  60
Gly Thr Arg Ser Gly Val Ser Gly Pro Glu Gln Trp Arg Gly Phe Ala
65                  70                  75                  80
Glu Val Ser Asp Ala Ala Ala Arg Leu Asn Arg Ala Val Ile Ala Ala
                85                  90                  95
Leu Leu Ser Ala Gly Val Pro Ala Val Gly Leu Pro Pro Ser Ala Ser
            100                 105                 110
Ala Val Val Thr Asp Gly Val Ile Gln Ala Met Ala Thr Ala Pro Ile
        115                 120                 125
Arg Ala Ala Leu Glu Ala Gly Leu Leu Pro Val Val Phe Gly Asp Val
    130                 135                 140
Ala Phe Asp Thr Val Arg Gly Gly Thr Ile Val Ser Thr Glu Glu Val
145                 150                 155                 160
Met Asp Tyr Leu Ala Leu Pro Met Arg Pro Ala Arg Leu Leu Leu Ala
                165                 170                 175
Gly Glu Thr Ala Gly Val Leu Asp Thr Thr Gly Gln Val Val Pro His
            180                 185                 190
Ile Ser Pro Ala Asn Tyr Glu Thr Ile Arg Pro Ala Leu Gly Gly Ser
        195                 200                 205
Arg Gly Ala Asp Val Thr Gly Gly Met Ser Ser Lys Val Ser Ala Met
    210                 215                 220
Leu Asp Leu Ala Ala Lys Ile Pro Gly Leu Thr Ile His Ile Phe Ser
225                 230                 235                 240
Gly Leu Glu Pro Gly Leu Leu Glu Gln Leu Leu Ile Ser Pro Ser Leu
                245                 250                 255
Ser Ser Gly Thr Arg Ile Gly
            260

<210> 427

<400> 427
000

<210> 428
<211> 792
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 428
atgagcgatc tgaccttcct gaaactgggc ggtagcctgc tgaccgataa accggaacgt     60
gaagcactgc gcgcagatgt tctggcccgt ctgaccaatg aaattgcagc agcacgttat    120
gcacgtccgg atatgaaact ggttatgggt catggcagcg gtagcttcgg tcataccgca    180
gccgcccgcc atggtacccg tagcggtgtg agcggtccgg aacagtggcg tggcttcgca    240
gaagtgagcg atgccgccgc acgtctgaat cgcgccgtta ttgcagccct gctgagtgcc    300
ggcgttccgg ctgttggcct gccgcctagt gccagcgcag tggttaccga tggtgttatt    360
caggcaatgg ccaccgcccc gattcgtgcc gcactggaag ccggtctgct gccggtggtg    420
ttcggcgatg ttgccttcga taccgtgcgt ggcggtacca ttgtgagcac cgaagaagtg    480
atggattatc tggccctgcc gatgcgcccg gcccgtctgt tactggcagg cgaaaccgca    540
ggcgtgctgg ataccaccgg tcaggttgtg ccgcatatta gcccggcaaa ttatgaaacc    600
attcgcccgg ccctgggcgg cagtcgtggt gcagatgtta ccggcggtat gagtagcaaa    660
gtgagcgcaa tgctggatct ggcagccaaa attccgggtc tgaccattca tatcttcagc    720
ggcctggaac cgggcctgct ggaacagctg ctgattagcc cgagtctgag cagcggtacc    780
cgtattggct aa                                                        792

<210> 429
<211> 182
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 429
Met Glu Ile Arg Gly Thr Leu Glu Glu Asn Pro Asp Ile Arg Leu Ile
1               5                   10                  15
Ile Ile His Gly Ala Gly Ala Gly Gly His Gln Leu Ala Lys Lys Tyr
            20                  25                  30
Arg Leu Thr Glu Asn Leu Gly Asn Asp Pro Glu Arg Trp Thr Gly Ala
        35                  40                  45
Phe Leu Thr Arg Gln Ala Asn Gln Leu Leu Asn Leu Glu Leu Phe Lys
    50                  55                  60
Ile Phe Ser Lys Ala Asn Leu Arg Val Val Pro Ile His Thr Ala Ser
65                  70                  75                  80
Ile Ile Thr Gln Arg Gln Lys Glu Ile Ala Ser Cys Ser Phe Glu Ser
                85                  90                  95
Ile Asn Gln Thr Leu Ala His Asn Cys Ile Pro Leu Leu Tyr Gly Glu
            100                 105                 110
Leu Val Phe Asp Glu Thr Leu Gly Met Ser Ile Leu Ser Gly Asp Thr
        115                 120                 125
Ser Ala Phe Ile Leu Ala Glu Lys Tyr Gln Ala Glu Arg Val Leu Phe
    130                 135                 140
Ala Ser Asp Ile Asp Gly Ile Phe Asn Lys Asp Pro His Lys Asn Lys
145                 150                 155                 160
Asp Ala Lys Leu Ile Gln Val Thr Thr Leu Lys Glu Leu Leu Glu Asn
                165                 170                 175
Lys Asn Val Ser Leu Ser
            180

<210> 430
<211> 546
<212> DNA
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 430
atggaaatac gcggcactct cgaagagaat cccgatatac gtctcattat aattcacggt     60
gccggtgccg gcgggcatca gcttgccaag aaatatcgcc tcacggaaaa cctcgggaat    120
gatcccgagc gatggaccgg cgcctttctc accagacaag ccaatcagtt actcaatctc    180
gaattgttca agatattctc aaaagcaaac cttcgagtcg ttccaattca cactgcatcc    240
attattactc aacgacagaa agaaatagcg tcttgttctt tcgaatccat caatcagacg    300
ctcgcacaca attgcattcc acttctctat ggcgaacttg tgttcgacga gacacttggc    360
atgtccatac tctccggaga caccagcgct ttcatcttgg cagaaaaata tcaagctgaa    420
cgggttttgt ttgcatcaga catagacggc atcttcaaca aagaccccca caaaaacaaa    480
gacgcaaaac tcatacaagt tactacactc aaagagttgc tggaaaacaa aaacgtctca    540
ctttcc                                                               546

<210> 431
<211> 549
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 431
atggaaatcc gtggtaccct ggaagaaaat ccggatattc gcctgattat tattcatggc     60
gcaggcgcag gtggtcatca gctggccaaa aaatatcgcc tgaccgaaaa tctgggtaat    120
gatccggaac gctggaccgg cgcattcctg acccgtcagg caaatcagct gctgaatctg    180
gaactgttca aaatcttcag taaagcaaat ctgcgtgtgg ttccgattca taccgcaagc    240
attattaccc agcgtcagaa agaaattgcc agctgctcat tcgaaagtat taatcagacc    300
ctggcacata attgcattcc gctgctgtat ggcgaactgg tgttcgatga aaccttaggc    360
atgagtattc tgagcggcga taccagtgcc ttcattctgg cagaaaaata tcaggcagaa    420
cgtgttctgt tcgcaagcga tattgatggc atcttcaata aagatccgca taaaaataag    480
gacgcaaaac tgattcaggt taccaccctg aaagaactgc tggaaaataa aaatgtgagt    540
ctgagctaa                                                            549

<210> 432
<211> 241
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 432
Met Ser Glu Arg Leu Lys Asn Ser Lys Arg Ile Val Ile Lys Ala Gly
1               5                   10                  15
Thr Ser Ile Leu Thr Gly Lys Asp Gly Arg Phe Ser Pro Ala His Leu
            20                  25                  30
Ala Arg Leu Cys Asp Gln Ile Leu Ala Leu Val Asn Lys Lys Lys Glu
        35                  40                  45
Val Val Leu Val Ser Ser Gly Ala Ile Gly Leu Gly Met Glu Val Thr
    50                  55                  60
Ser Phe Lys Lys Arg Pro Lys Lys Met Ala Gln Leu Gln Ala Cys Ala
65                  70                  75                  80
Ala Ile Gly Gln Gly Lys Leu Met His Ala Tyr Glu Gln Phe Phe Ser
                85                  90                  95
Lys Arg Gly Ile His Thr Ala Gln Ile Leu Leu Thr Arg Asp Gly Leu
            100                 105                 110
Glu Asp Arg Glu Arg Phe Leu Arg Ala Ser Gly Ala Val Ala Glu Ile
        115                 120                 125
Leu Lys Met Lys Val Leu Pro Ile Val Asn Glu Asn Asp Thr Val Ser
    130                 135                 140
Thr Glu Glu Ile Ala Phe Gly Asp Asn Asp Arg Leu Ser Val His Val
145                 150                 155                 160
Ser His Leu Val Asp Ala Asp Leu Leu Ile Leu Leu Ser Asp Val Asp
                165                 170                 175
Gly Phe Tyr Leu Asn Asp Gly Ser Arg Ile Arg Leu Val Ser Ser Ile
            180                 185                 190
Arg Glu Ile Arg Glu Glu Leu Val Lys His Val Lys Asp Ser Arg Lys
        195                 200                 205
Glu Lys Thr Val Gly Gly Met Ser Ala Lys Leu Lys Ala Ala Thr Thr
    210                 215                 220
Ala Met Asn Leu Gly Ile Pro Met Leu Ile Val Asn Gly His Glu Pro
225                 230                 235                 240
Gly

<210> 433

<400> 433
000

<210> 434
<211> 726
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 434
atgagcgaac gcctgaaaaa tagcaaacgt attgttatta aggccggcac cagcattctg     60
accggcaaag atggtcgctt cagtccggcc catctggccc gtctgtgtga tcagattctg    120
gcactggtta ataaaaaaaa agaggtggtt ctggtgagta gtggcgcaat tggtctgggc    180
atggaagtga ccagcttcaa aaaacgtccg aaaaaaatgg cacagctgca agcctgcgcc    240
gccattggtc agggcaaact gatgcatgca tacgaacagt tcttcagtaa acgtggtatt    300
cataccgcac agattctgct gacccgtgat ggtctggaag atcgtgaacg cttcctgcgt    360
gcaagcggtg ccgtggcaga aattctgaaa atgaaagttc tgccgattgt gaatgaaaat    420
gataccgtta gcaccgaaga aattgccttc ggcgataatg atcgcctgag cgtgcatgtg    480
agtcatctgg ttgatgccga tctgctgatt ctgctgagtg atgtggatgg cttctatctg    540
aatgatggca gtcgtattcg cctggttagc agcattcgcg aaattcgtga agaactggtt    600
aaacatgtta aagatagccg caaagaaaaa accgtgggtg gtatgagtgc aaaactgaaa    660
gccgcaacca ccgcaatgaa tctgggcatt ccgatgctga ttgttaatgg tcatgaaccg    720
ggctaa                                                               726

<210> 435
<211> 289
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 435
Met Lys Thr Leu Leu Trp Leu Asn Arg Ala Met Lys Glu Leu Val Leu
1               5                   10                  15
Leu Lys Leu Gly Gly Ser Leu Ile Thr Asp Lys Thr Gln Pro Phe Thr
            20                  25                  30
Pro Arg Leu Asp Val Met Asp Asp Leu Ala Leu Gln Ile Lys Thr Ala
        35                  40                  45
Leu Gln Ile Arg Val Asp Leu Gln Leu Val Leu Gly His Gly Ser Gly
    50                  55                  60
Ser Phe Gly His Val Pro Ala Ser Glu Tyr Arg Thr Arg Asp Gly Leu
65                  70                  75                  80
Pro Pro Arg Ala Thr Pro Leu Ala His Arg Glu Arg Asp Ala Thr Glu
                85                  90                  95
Glu Asn Tyr Trp Arg Gly Phe Ala Glu Val Trp Tyr Gln Ala Ser Ala
            100                 105                 110
Leu Asn Arg Phe Val Met Glu Ala Leu His Arg Ala Asp Val Pro Ser
        115                 120                 125
Ile Ala Leu Ser Pro Ala Ala Ser Val Ile Ala Ser Asn Gly Gln Val
    130                 135                 140
Ser Val Trp Glu Thr Thr Pro Leu Arg Met Ala Leu Ser Ala Gly Ile
145                 150                 155                 160
Val Pro Val Val Tyr Gly Asp Val Val Cys Asp Glu Val Arg Gly Gly
                165                 170                 175
Thr Ile Leu Ser Thr Glu Asp Leu Phe Ser His Leu Thr Arg Ala Leu
            180                 185                 190
Asn Pro Asp Arg Ile Leu Leu Ala Gly Leu Glu Ala Ala Val Trp Glu
        195                 200                 205
Asp Phe Pro Thr Arg Thr Lys Lys Ile Glu Arg Ile Thr Pro Ala Ser
    210                 215                 220
Phe Arg Glu Val Ser Ser Gly Val Gly Lys Ala Ala Gly Ala Asp Val
225                 230                 235                 240
Thr Gly Gly Met Glu Ser Lys Val Arg Gln Met Leu Glu Leu Val Gln
                245                 250                 255
Lys Val Pro Gly Leu Thr Ile Gln Val Phe Ser Gly Glu Glu Pro Gly
            260                 265                 270
Asn Leu Val Arg Ala Leu Gly Gly Glu Thr Leu Gly Thr Leu Ile Thr
        275                 280                 285
Ala

<210> 436

<400> 436
000

<210> 437
<211> 876
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 437
atgaagaccc tgctgtggct gaatcgtgca atgaaagaac tggtgctgct gaaactgggt     60
ggcagcctga ttaccgataa aacacaaccg ttcaccccgc gtctggatgt gatggatgat    120
ctggcactgc aaattaaaac cgcactgcaa atccgtgtgg atctgcaact ggttctgggc    180
catggcagtg gcagcttcgg tcatgtgccg gcaagcgaat atcgtacccg tgatggtctg    240
ccgccgcgcg caacccctct ggcacatcgt gaacgcgatg caaccgaaga aaattattgg    300
cgtggcttcg ccgaagtgtg gtatcaggca agcgcactga atcgcttcgt gatggaagca    360
ctgcatcgcg ccgatgttcc gagcattgcc ctgagtccgg ccgcaagcgt tattgcaagc    420
aatggtcagg tgagcgtgtg ggaaaccacc ccgctgcgta tggcactgag cgcaggtatt    480
gttccggttg tgtatggcga tgttgtgtgt gatgaagttc gcggtggtac cattctgagc    540
accgaagact tattcagcca tctgacccgt gcactgaatc cggatcgcat tctgctggcc    600
ggtctggaag cagcagtgtg ggaagacttc ccgacccgta ccaaaaaaat tgaacgtatt    660
accccggcca gcttccgcga agtgagcagt ggcgttggca aagccgcagg tgcagatgtt    720
accggtggta tggaaagtaa agtgcgccag atgctggaac tggtgcagaa agtgccgggc    780
ctgaccattc aggtgttcag tggcgaagaa ccgggtaatc tggtgcgcgc actgggcggc    840
gaaaccttag gcaccctgat taccgcctaa ggatcc                              876

<210> 438
<211> 295
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 438
Met Glu Asp Ile Ala Asp Leu Phe Lys Ser Gly Arg Glu Val Leu Ile
1               5                   10                  15
Val Ser Ser Gly Ala Val Gly Leu Gly Val Lys Glu Leu Ala Leu Asp
            20                  25                  30
Lys Arg Pro Ala Asp Leu Pro Thr Thr Gln Ala Cys Ala Ala Val Gly
        35                  40                  45
Gln Gly Ile Leu Gln Ser Met Tyr Ala Asp Ala Phe Lys Arg Leu Gly
    50                  55                  60
Val Lys Thr Ala Gln Val Leu Leu Cys Glu Glu Asp Phe Thr Asn Arg
65                  70                  75                  80
Lys Lys Tyr Leu Asn Leu Arg Ser Thr Ile Ala Arg Leu Leu Glu Leu
                85                  90                  95
Gly Val Ile Pro Ile Ile Asn Glu Asn Asp Thr Val Ser Thr Ser Glu
            100                 105                 110
Ile Glu Ser Ser Ala Thr Ala Ser Gly Arg Lys Val Asn Phe Gly Asp
        115                 120                 125
Asn Asp Lys Leu Ser Ala Leu Val Ala Ser Lys Val Asp Ala Asp Met
    130                 135                 140
Leu Leu Ile Leu Thr Asp Val Asn Gly Leu Tyr Ser Asp Asp Pro Asn
145                 150                 155                 160
Thr Cys Pro Asp Ala Glu Leu Ile Asp Thr Val Ala Asp Leu Ala Pro
                165                 170                 175
Tyr Gln Ile Pro Lys Thr Glu Lys Lys Val Asn Gly Lys Lys Gly Ala
            180                 185                 190
His Gln Gly Gly Arg Gly Gly Ile Arg Ser Lys Leu Glu Ala Ala Ala
        195                 200                 205
Val Val Thr Gln Ser Gly Leu Pro Cys Val Ile Ala Gly Gly Arg Asn
    210                 215                 220
His Lys Val Ile Glu Arg Leu Phe Asn Gly Glu Ser Leu Gly Thr Ile
225                 230                 235                 240
Phe Leu Pro Gly Ala Ala Met Ala Gly Lys Ser Arg Trp Ile Ala Phe
                245                 250                 255
Ala Thr Thr Ile Asn Gly Ser Val Thr Val Asn Gln Gly Ala Arg Asp
            260                 265                 270
Ala Leu Val Lys Lys Lys Ala Ser Leu Leu Pro Ala Gly Ile Val Lys
        275                 280                 285
Val Asp Gly Ser Phe Ala Arg
    290                 295

<210> 439

<400> 439
000

<210> 440
<211> 894
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 440
atggaagata tcgccgatct gttcaaaagt ggccgtgaag ttctgattgt gagtagtggc     60
gcagttggtc tgggcgttaa agaactggcc ctggataaac gtccggcaga cttaccgacc    120
acccaggcct gcgcagcagt gggtcagggt attctgcaaa gcatgtatgc cgatgcattc    180
aaacgtctgg gtgtgaaaac cgcccaggtt ctgctgtgtg aagaagactt caccaatcgt    240
aaaaaatatc tgaatctgcg cagcaccatt gcacgtctgc tggaactggg tgttattccg    300
attattaatg aaaatgacac cgtgagcacc agcgaaattg aaagtagcgc aaccgcaagt    360
ggccgcaaag ttaacttcgg cgataatgat aaactgagtg cactggtggc aagtaaagtt    420
gatgcagata tgctgctgat tctgaccgat gttaatggcc tgtatagtga tgatccgaat    480
acctgtccgg atgccgaact gattgatacc gttgccgatc tggccccgta tcagattccg    540
aaaaccgaaa aaaaagtgaa tggtaaaaag ggcgcccatc agggcggccg cggtggtatt    600
cgcagcaaac tggaagcagc agcagttgtg acccagagcg gcctgccgtg cgtgattgcc    660
ggtggccgta atcataaagt gattgaacgt ctgttcaatg gtgaaagcct gggcaccatc    720
ttcctgccgg gcgcagcaat ggccggcaaa agccgttgga ttgcattcgc caccaccatt    780
aatggtagtg ttaccgtgaa tcagggcgca cgcgatgccc tggttaaaaa aaaagcaagc    840
ctgctgccgg ccggcattgt gaaagttgat ggcagcttcg cccgctaagg atcc          894

<210> 441
<211> 227
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 441
Met Glu Asp Lys Ile Val Ile Ser Leu Gly Gly Ser Leu Ile Val Pro
1               5                   10                  15
Glu Glu Ile Asp Val Glu Phe Leu Lys Ser Phe Lys Glu Leu Ile Ile
            20                  25                  30
Gly Glu Ile Ala Lys Gly Lys Lys Phe Ile Leu Ile Thr Gly Gly Gly
        35                  40                  45
Arg Val Cys Arg Lys Tyr Gln Asn Val Ala Lys Asp Ile Ser Asn Pro
    50                  55                  60
Asn His Glu Asp Leu Asp Trp Ile Gly Ile Ala Ser Leu Lys Leu Asn
65                  70                  75                  80
Ala Glu Leu Leu Arg Val Ile Phe Lys Asp His Ala Tyr Asn Arg Val
                85                  90                  95
Val Glu Asn Leu Ser Leu Pro Phe Pro Phe Glu Asp Ser Ile Val Ile
            100                 105                 110
Gly Ser Ala Tyr Glu Pro Gly His Ser Thr Asp Tyr Asp Ala Val Leu
        115                 120                 125
Gly Ala Gln Ser Ile Gly Ala Lys Thr Ile Ile Asn Leu Ser Asn Thr
    130                 135                 140
Asp Tyr Val Tyr Asp Ser Asp Pro Lys Thr Asn Pro Asp Ala Lys Lys
145                 150                 155                 160
Ile Glu Thr Ile Thr Trp Asp Glu Tyr Gln Lys Ile Ile Pro Ala Glu
                165                 170                 175
Trp His Pro Gly Leu Ser Thr Pro Phe Asp Pro Thr Ala Ser Val Leu
            180                 185                 190
Ala Lys Gln Glu Gly Ile Thr Val Ile Ser Met Asn Gly Lys Pro Ile
        195                 200                 205
Ser Asn Leu Ala Asn Cys Leu Asn Gly Glu Asn Phe Ile Gly Thr Thr
    210                 215                 220
Ile Lys Pro
225

<210> 442
<211> 681
<212> DNA
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 442
atggaagaca aaatagtaat ttccctaggg ggttcattaa tagtgccaga agagattgat     60
gttgagtttc taaagtcttt taaagaattg attattgggg aaattgctaa aggcaagaaa    120
tttattttaa taactggagg tggtagagtt tgcagaaagt atcagaatgt tgctaaggat    180
atttctaatc caaaccatga agatctggat tggataggta ttgcatcttt aaaattaaac    240
gcagaacttt tgcgagttat ttttaaagac catgcttata atagagttgt tgagaattta    300
tcgctacctt ttccctttga agattcaata gtgattggat ctgcttatga accaggacac    360
agtactgatt atgatgccgt acttggggct caaagtattg gagcgaaaac tataattaat    420
ttatctaata ctgattacgt atatgattct gaccctaaga ctaatcctga tgctaaaaaa    480
atagaaacta taacctggga cgaatatcaa aagattattc ccgcagagtg gcaccccgga    540
cttagcactc catttgatcc cactgcgtct gttttagcca aacaggaagg cattacagtg    600
attagtatga acggtaaacc aatttccaat ttagctaatt gtttaaatgg agaaaatttc    660
atcggcacaa ccattaaacc a                                              681

<210> 443
<211> 690
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 443
atggaagata agatcgttat cagtctgggt ggtagtctga ttgtgccgga agaaattgat     60
gtggagttcc tgaaatcatt caaagaactg attatcggtg aaattgccaa aggcaaaaaa    120
ttcattctga ttaccggcgg tggtcgcgtg tgtcgtaaat atcagaatgt tgcaaaagat    180
atcagcaatc cgaatcatga agacttagat tggattggca ttgcaagcct gaaactgaat    240
gccgaactgc tgcgcgtgat cttcaaagat catgcctata atcgtgtggt tgaaaatctg    300
agcctgccgt tcccgttcga agatagtatt gtgattggca gtgcatacga accgggccat    360
agtaccgatt atgatgccgt tctgggcgca cagagtattg gtgccaaaac cattattaat    420
ctgagtaata ccgactatgt gtatgatagc gatccgaaaa ccaatccgga tgccaaaaaa    480
attgaaacca ttacctggga tgaatatcag aaaattatcc cggccgaatg gcatccgggc    540
ctgagcaccc cgttcgatcc gaccgccagt gtgctggcaa aacaggaagg cattaccgtg    600
attagcatga atggtaaacc gattagtaat ctggcaaatt gcctgaatgg tgaaaacttc    660
attggtacca ccattaaacc gtaaggatcc                                     690

<210> 444
<211> 368
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 444
Met Thr Asp Ser Pro Arg Lys Arg Ile Val Val Lys Val Gly Ser Ala
1               5                   10                  15
Leu Ile Ala Pro His Lys Gln Gly Cys Ser Ser His Tyr Leu Leu Gly
            20                  25                  30
Ile Ala Gln Phe Ile Thr Tyr Cys Arg Val Gln Gly Ile Gln Val Val
        35                  40                  45
Leu Val Ser Ser Gly Ser Val Ala Ala Gly Trp His His Phe Glu Gly
    50                  55                  60
Gln Ala Gln Pro Ser Val Thr Val Lys Lys Ala Met Ala Ala Ala Gly
65                  70                  75                  80
Gln Ala Asp Met Met Ala Thr Trp Asn Lys Leu Phe Asp Phe Pro Thr
                85                  90                  95
Ala Gln Leu Leu Leu Thr His Gly Asp Leu Arg Asn Arg Glu Arg Tyr
            100                 105                 110
Ile Ser Ile Arg Asp Thr Ile Phe Ser Leu Leu Glu His Gly Leu Met
        115                 120                 125
Pro Ile Ile Asn Glu Asn Asp Ala Val Thr Ala Asp Lys Leu Lys Val
    130                 135                 140
Gly Asp Asn Asp Asn Leu Ser Ala Met Val Ala Ala Ala Ala Asp Ala
145                 150                 155                 160
Asp Thr Leu Val Ile Cys Ser Asp Val Asp Gly Leu Tyr Asp Gln Asn
                165                 170                 175
Pro His Glu His Pro Asn Ala Lys Leu Ile Lys Gln Val Thr Glu Ile
            180                 185                 190
Asn Ala Asp Ile Tyr Ala Met Ala Gly Gly Ala Ser Ser Asp Val Gly
        195                 200                 205
Thr Gly Gly Met Arg Thr Lys Ile Gln Ala Ala Glu Lys Ala Ile Ser
    210                 215                 220
His Gly Ile Glu Thr Phe Ile Ile Asn Gly Phe Asn Ala Asp Ser Phe
225                 230                 235                 240
Ser Gln Leu Leu Lys Gly Gln Asn Pro Gly Thr Leu Phe Thr Pro Tyr
                245                 250                 255
Glu Lys Pro Met Gln Glu His Leu His Trp Met Thr His Thr Ser Gln
            260                 265                 270
Ala Gln Gly Glu Val Ile Val Glu Asp Asp Phe Asp Leu Ala Leu Asp
        275                 280                 285
Gln His Ser Glu Gln Leu Thr Ser Asp Asp Val Val Glu Val Lys Gly
    290                 295                 300
Asp Phe Ser Val Gly Asp Thr Ile Leu Val Arg Lys Gly Asp Gly Thr
305                 310                 315                 320
Lys Leu Ala Lys Ala Lys Ser Asn Tyr Ser Ser Cys Leu Leu Ser Phe
                325                 330                 335
Ile Thr Glu Gln Asp Asp Gln Ala Phe Ala Ser Glu Phe Gln Gln Lys
            340                 345                 350
Thr Gly Pro Ile Ile Ser Asp Lys Asn Ile Ala Ile Leu Lys Ser Ile
        355                 360                 365

<210> 445
<211> 1104
<212> DNA
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 445
atgacagact ctccccgtaa acgtattgtc gtcaaagtgg ggagcgcctt aatcgcgccc     60
cacaagcaag gttgcagtag ccattatctc ttgggtatcg cgcagtttat tacttactgc    120
cgtgtccaag gtatccaagt ggtactggtt tcatccggct cggttgccgc tggttggcat    180
cattttgagg gccaagctca gcccagtgtc acagtgaaaa aggccatggc ggccgcgggg    240
caggcggata tgatggcgac gtggaataag ctatttgatt ttcccaccgc ccaactgctg    300
ctgacccatg gcgacttacg taatcgcgag cgttatatca gtattcgaga caccattttt    360
agcctgctcg aacacggttt aatgccgatc atcaatgaga atgatgccgt taccgccgac    420
aaacttaagg ttggcgataa cgataatctc tcggccatgg tggcggctgc ggccgatgcc    480
gacaccttag tgatttgctc ggatgtggat ggactctatg atcaaaatcc ccacgaacat    540
cccaatgcca agttgataaa gcaagtcact gaaatcaatg ccgatatcta tgcgatggcg    600
ggaggcgcca gcagcgatgt tggtacagga ggcatgcgca ctaagatcca agccgccgaa    660
aaagccatct ctcacggcat tgagaccttt attatcaatg gctttaatgc cgactccttt    720
agccaactgc taaaggggca aaatccgggc accctcttta ccccctacga aaaaccgatg    780
caggagcatt tgcattggat gacccacacc tcgcaggcgc agggcgaagt gatcgtcgag    840
gatgattttg acctcgcact cgatcagcac agcgagcaat taaccagcga tgatgtggtt    900
gaagtcaaag gggatttctc agtgggcgat accattctgg tgcgtaaagg cgatggcact    960
aagttggcga aagccaaatc taactacagc agttgcctac tgagttttat taccgagcag   1020
gatgatcagg cgtttgccag tgaattccag caaaaaaccg gccccatcat ttccgataag   1080
aatatcgcca ttcttaaatc catt                                          1104

<210> 446
<211> 1113
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 446
atgaccgata gtccgcgtaa acgtattgtg gtgaaagttg gtagtgccct gattgccccg     60
cataaacagg gctgtagtag tcattatctg ctgggcattg cccagttcat tacctattgt    120
cgtgttcagg gtattcaggt ggttctggtg agtagtggta gcgttgccgc cggttggcat    180
cacttcgaag gccaggccca gccgagcgtt accgttaaaa aagccatggc agccgcaggt    240
caggccgata tgatggccac ctggaataaa ctgttcgact tcccgaccgc ccagctgctg    300
ctgacccatg gtgatctgcg taatcgtgaa cgctatatta gtattcgtga taccatcttc    360
agtctgctgg aacatggtct gatgccgatt attaatgaaa atgatgcagt gaccgcagat    420
aaactgaaag tgggcgataa tgataatctg agtgcaatgg tggccgccgc agccgatgca    480
gataccctgg tgatctgtag cgatgtggat ggcctgtatg atcagaatcc gcatgaacat    540
ccgaatgcca aactgattaa acaggtgacc gaaattaatg cagatatcta tgcaatggcc    600
ggtggcgcca gtagtgatgt gggcaccggt ggcatgcgta ccaaaattca ggccgcagaa    660
aaagccatta gtcatggtat tgaaaccttc attatcaatg gcttcaatgc agatagcttc    720
agccagctgc tgaaaggtca gaatccgggt accctgttca ccccgtatga aaaaccgatg    780
caggaacatc tgcattggat gacccatacc agtcaggccc agggtgaagt gattgttgaa    840
gatgacttcg atctggccct ggatcagcat agtgaacagc tgaccagcga tgatgttgtt    900
gaagttaaag gcgacttcag cgttggcgat accattctgg tgcgcaaagg tgatggcacc    960
aaactggcca aagccaaaag caattatagt agctgcctgc tgagcttcat taccgaacag   1020
gatgatcagg cattcgccag tgagttccag cagaaaaccg gcccgattat tagcgataaa   1080
aatattgcca ttctgaagag catttaagga tcc                                1113

<210> 447
<211> 265
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 447
Met Ile Pro Leu Thr Leu Ile Lys Leu Gly Gly Ser Ile Val Thr Asp
1               5                   10                  15
Lys Ala Lys His Glu His Tyr Arg Gly Gln His Val Arg Glu Val Ala
            20                  25                  30
Lys Leu Leu Ser Thr Tyr Phe Thr Gln Arg Asn Glu Ala Cys Leu Ile
        35                  40                  45
Gly His Gly Gln Gly Ser Phe Gly His Pro Ala Val Lys Lys Asn Gln
    50                  55                  60
Gln His Phe Ser Asp Arg Ser His Phe Ala Pro Gln Ala Met Ala Glu
65                  70                  75                  80
Met Leu Arg Val Val Thr His Leu His Glu Arg Ile Leu Asp Asp Leu
                85                  90                  95
Val Arg Glu Arg Val Pro Ala Ile Ser Phe Arg Phe Ser Gln Gln Tyr
            100                 105                 110
Val Val Asp Gly Ala Ala Glu Ala Arg Val Asp Leu Thr Leu Leu Glu
        115                 120                 125
Ala Leu Leu Asp Leu Arg Met Val Pro Val Thr Thr Gly Asp Ile Leu
    130                 135                 140
Val Asp Thr Glu Val Gly Asn Arg Val Leu Ser Thr Glu Lys Ile Phe
145                 150                 155                 160
Met Ala Leu Ile Arg Ala Leu Gln His Ser Asp Lys Tyr Arg Val Glu
                165                 170                 175
Arg Val Ala Tyr Val Thr Gln Val Ala Gly Val Leu Asp Lys Ala Gly
            180                 185                 190
Lys Val Ile Glu Arg Ile Gly Ala Asp Glu Glu Val Asp Gln Ser Trp
        195                 200                 205
Phe Phe Ala Gln Ala Asp Gln Ala Asp Val Thr Gly Ala Met Lys His
    210                 215                 220
Lys Val Glu Ala Ala Gln Ala Val Ala Gln Leu Gly Ile Pro Val Ala
225                 230                 235                 240
Ile Leu Ser Ala Asn Asp Pro Lys Asn Leu Asp Arg Tyr Leu Arg Asn
                245                 250                 255
Gln Ala Trp Ile Gly Thr Arg Ile Ala
            260                 265

<210> 448
<211> 795
<212> DNA
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 448
atgattcctc ttaccttgat caagcttggt ggctcaatag tgaccgacaa agcgaagcac     60
gagcactatc gcggtcagca tgttcgagag gtagccaaac tattgtcaac atattttacc    120
cagcgaaatg aggcatgttt gattgggcat ggccagggta gttttggtca cccagcggtg    180
aaaaaaaacc agcagcattt ttctgatcga tctcacttcg ctccgcaagc gatggcagag    240
atgttgcgag tggtcaccca cttgcacgaa cgtattttgg atgacttagt ccgtgaacga    300
gtgcccgcaa ttagtttccg atttagccag cagtacgtgg tggatggcgc agctgaggct    360
cgggtcgact tgaccctcct cgaagcattg ctcgatttgc gcatggtgcc cgtaacgacg    420
ggtgatattt tggttgatac cgaggttggt aatcgggtgc tttcgaccga aaaaattttt    480
atggcgttga ttcgagcatt gcagcattca gacaagtacc gagtcgagcg agtagcgtat    540
gtgacacaag tagcaggagt gcttgacaaa gcaggcaagg ttatcgaacg gattggcgct    600
gatgaggaag tggatcaatc atggttcttt gctcaagctg accaggccga tgtgaccgga    660
gcgatgaaac ataaggttga ggcggcgcaa gcagtggctc aattaggtat tcccgtagca    720
attttgtctg ccaatgaccc caaaaatctt gatcgatatt tacgcaatca agcttggatc    780
ggtacacgaa tagcg                                                     795

<210> 449
<211> 804
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 449
atgatcccgc tgaccctgat taaactgggc ggcagtattg tgaccgataa agcaaaacat     60
gaacattatc gcggccagca tgtgcgtgaa gtggcaaaac tgctgagcac ctacttcacc    120
cagcgtaatg aagcatgtct gattggccat ggccagggca gcttcggtca tccggccgtg    180
aaaaaaaatc agcagcactt cagtgatcgc agtcacttcg ccccgcaggc aatggccgaa    240
atgctgcgcg ttgttaccca tctgcatgaa cgtattctgg atgatctggt tcgcgaacgc    300
gtgccggcaa ttagcttccg cttcagtcag cagtatgtgg ttgatggtgc cgccgaagcc    360
cgcgtggatc tgaccctgct ggaagcactg ctggatctgc gtatggtgcc ggttaccacc    420
ggtgatattc tggtggatac cgaagttggc aatcgcgttc tgagtaccga aaaaatcttc    480
atggcactga ttcgcgccct gcaacatagt gataaatatc gtgtggaacg cgttgcctat    540
gtgacccagg tggccggtgt tctggataaa gccggtaaag ttattgaacg tattggcgca    600
gatgaagaag tggatcagag ttggttcttc gcacaggccg atcaggcaga tgtgaccggc    660
gcaatgaaac ataaagtgga agccgcacag gcagtggcac agctgggtat tccggtggcc    720
attctgagtg ccaatgatcc gaaaaatctg gatcgttatc tgcgcaatca ggcctggatt    780
ggtacccgta ttgcataagg atcc                                           804

<210> 450
<211> 276
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 450
Met Gly Ala Gly Gly His Gln Asn Asn Ala Asp Arg Thr Glu Lys Tyr
1               5                   10                  15
Ala Ala Thr Thr Ala Asn Lys Ala Lys Val Thr Thr Pro Ala Arg Ile
            20                  25                  30
Ser Ile Thr Arg Leu Tyr Pro Val Thr Phe Leu Gln Leu Cys Val Lys
        35                  40                  45
Thr Cys Thr Met Thr Lys Glu Leu Val Leu Leu Lys Leu Gly Gly Ser
    50                  55                  60
Leu Ile Thr Asp Lys Asp Ile Pro Tyr Thr Pro Arg Leu Asp Lys Leu
65                  70                  75                  80
Lys Glu Leu Ala Leu Glu Ile Lys Thr Val Leu Asp Ser Asn Pro Glu
                85                  90                  95
Leu Leu Leu Ile Leu Gly His Gly Ser Gly Ser Phe Gly His Val Ala
            100                 105                 110
Ala Lys Lys His Gly Thr Arg Asp Gly Val Gln Thr Pro Glu Gln Trp
        115                 120                 125
Lys Gly Phe Ala Glu Val Arg Phe Gln Ala Ala Glu Leu Asn Pro Phe
    130                 135                 140
Val Met Glu Ser Leu Phe Asn Ala Gly Val Pro Ala Ile Ser Phe Pro
145                 150                 155                 160
Pro Ser Ser Ser Met Val Ser Asp Asp Arg Lys Val Ile His His Asn
                165                 170                 175
Ile Leu Ala Ile Arg Lys Ala Leu Asn Val His Leu Leu Pro Val Val
            180                 185                 190
Tyr Gly Asp Val Ala Phe Asp Glu Lys Arg Gly Gly Thr Ile Leu Ser
        195                 200                 205
Thr Glu Asp Val Phe Thr Phe Leu Val Asp Gln Phe Ser Pro Ser Arg
    210                 215                 220
Ile Leu Leu Ala Gly Ile Glu Ala Gly Val Trp Ala Asp Phe Pro Ala
225                 230                 235                 240
Arg Thr Lys Leu Val Lys Gln Ile Gln Leu Ser Asp Tyr Glu Lys Met
                245                 250                 255
Arg Thr Ser Ile Gly Gly Ser Ala Ser Thr Asp Val Thr Gly Gly Met
            260                 265                 270
Lys Ala Lys Val
        275

<210> 451

<400> 451
000

<210> 452
<211> 837
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 452
atgggcgctg gtggtcatca gaataatgcc gatcgtaccg aaaaatatgc agccaccacc     60
gccaataaag ccaaagtgac caccccggcc cgcattagta ttacccgtct gtatccggtt    120
accttcctgc aactgtgcgt taaaacctgt accatgacca aagaactggt gctgctgaaa    180
ctgggtggca gcctgattac cgataaagat attccgtata ccccgcgcct ggataaactg    240
aaagaactgg ccctggaaat taaaaccgtt ctggatagca atccggaact gctgctgatt    300
ctgggtcatg gcagtggcag cttcggtcat gtggcagcca aaaaacatgg tacccgcgat    360
ggtgttcaga ccccggaaca gtggaaaggc ttcgccgaag tgcgcttcca ggcagcagaa    420
ctgaatccgt tcgttatgga aagtctgttc aatgccggtg ttccggcaat tagcttcccg    480
ccgagcagca gcatggttag tgatgatcgc aaagttattc atcataatat cctggcaatc    540
cgtaaagcac tgaatgtgca tctgctgccg gtggtgtatg gcgatgttgc cttcgatgaa    600
aaacgtggcg gtaccattct gagcaccgaa gatgtgttca ccttcctggt tgatcagttc    660
agcccgagcc gtattctgct ggcaggcatt gaagccggcg tgtgggccga cttcccggca    720
agaaccaaac tggttaaaca gattcagctg agcgattatg aaaaaatgcg taccagcatt    780
ggcggcagcg caagtaccga tgttaccggc ggtatgaaag caaaagtgta aggatcc       837

<210> 453
<211> 205
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 453
Met Ala Ala Lys Gln Tyr Gln Thr Ile Ala Gly Val His Thr Arg Glu
1               5                   10                  15
Gln Trp Arg Gly Phe Ala Glu Val Ala Thr Val Ala Ser Glu Leu Asn
            20                  25                  30
Tyr Leu Val Ala Arg Glu Phe Ser Ala Ala Gly Val Pro Val Trp Arg
        35                  40                  45
Leu Gln Pro Ser Ala Ser Ala Ile Ser Arg Asp Gly Val Leu Val Ser
    50                  55                  60
Leu Ala Leu Glu Ser Pro Arg Gln Gly Leu Glu Asn Gly Leu Val Pro
65                  70                  75                  80
Leu Val Tyr Gly Asn Val Ala Leu Asp Glu Val Arg Gly Gly Thr Ile
                85                  90                  95
Ile Ser Thr Glu Thr Ile Phe Phe Tyr Met Ala Lys His Leu Pro Val
            100                 105                 110
Asn Arg Ile Leu Leu Leu Gly Glu Val Glu Gly Val Met Asp Ser Ala
        115                 120                 125
Gly Gln Val Ile Pro Ser Ile Thr Pro Ala Lys Phe Ala Ala Ile Gln
    130                 135                 140
Gln Tyr Leu Gly Gly Ser Ala Gly Thr Asp Val Thr Gly Gly Met Val
145                 150                 155                 160
Thr Lys Val Gln Asp Met Leu Thr Leu Ala Lys Gln Val Ser Gly Leu
                165                 170                 175
Thr Ile Arg Ile Met Asp Gly Arg Gln Pro Gly Leu Leu Phe Glu Thr
            180                 185                 190
Leu Leu Gly Lys Val Glu Pro Gly Thr Leu Ile Ser Gly
        195                 200                 205

<210> 454

<400> 454
000

<210> 455
<211> 624
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 455
atggcagcaa aacagtatca gaccattgcc ggtgtgcata cccgtgaaca gtggcgtggc     60
ttcgcagaag tggccaccgt tgcaagtgaa ctgaattatc tggttgcacg cgagttcagt    120
gccgcaggtg tgccggtgtg gcgcctgcaa ccgagtgcca gcgcaattag ccgtgatggc    180
gtgctggtta gtctggcact ggaaagtccg cgtcagggcc tggaaaatgg tctggtgccg    240
ctggtgtatg gtaatgttgc cctggatgaa gttcgtggcg gcaccattat tagcaccgaa    300
accatcttct tctatatggc caaacatctg ccggttaatc gcattctgct gctgggcgaa    360
gttgaaggcg tgatggatag tgcaggccag gttattccga gtattacccc ggccaaattc    420
gccgcaattc agcagtatct gggcggtagt gccggcaccg atgttaccgg cggcatggtt    480
accaaagtgc aggatatgct gaccctggca aaacaggtta gcggtctgac cattcgtatt    540
atggatggcc gtcagccggg tctgctgttc gaaaccttac tgggcaaagt tgaaccgggt    600
accctgatta gtggctaagg atcc                                           624

<210> 456
<211> 208
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 456
Met Thr Ile Ala Lys Gln Cys Ala Glu Ile Ala Gln Met Gly Val Gln
1               5                   10                  15
Leu Ala Ile Val Val Gly Gly Gly Asn Phe Ile Arg Gly Ala Thr Phe
            20                  25                  30
Ala Glu Asp Gly His Ile Pro Arg Ala Thr Ala Asp Tyr Met Gly Met
        35                  40                  45
Leu Ala Thr Val Ile Asn Ala Val Ala Leu Gln Glu Thr Met Glu Lys
    50                  55                  60
Phe Gly Gln Pro Thr Arg Val Leu Ser Ala Ile Ser Val Tyr Ser Val
65                  70                  75                  80
Cys Glu Pro Phe Ile Arg Arg Arg Ala Val Arg His Leu Glu Lys Gly
                85                  90                  95
Arg Ala Ile Ile Leu Ala Ala Gly Thr Gly Asn Pro Phe Phe Thr Thr
            100                 105                 110
Asp Thr Cys Ala Ala Leu Arg Ala Thr Glu Ile Ala Ala Asp Val Leu
        115                 120                 125
Leu Lys Ala Thr Lys Val Asp Gly Ile Tyr Asp Lys Asp Pro Lys Lys
    130                 135                 140
His Pro Asp Thr Lys Leu Phe Asn Ser Ile Ser Tyr Asp Gln Val His
145                 150                 155                 160
Asn Asp Lys Leu Arg Val Met Asp Leu Thr Ala Ile Thr Leu Cys Met
                165                 170                 175
Glu Arg Lys Leu Pro Leu Val Val Phe Asn Met Lys Lys Pro Gly Asn
            180                 185                 190
Ile Ala Arg Val Val Leu Gly Glu Asn Val Gly Thr Lys Ile Gln Pro
        195                 200                 205

<210> 457

<400> 457
000

<210> 458
<211> 633
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 458
atgaccattg caaaacagtg tgcagaaatt gcacagatgg gcgttcagct ggcaattgtt     60
gttggcggtg gtaacttcat tcgtggcgcc accttcgccg aagatggtca tattccgcgt    120
gcaaccgcag attatatggg tatgctggca accgtgatta atgccgtggc cctgcaagaa    180
accatggaaa aattcggcca gccgacccgc gttctgagcg ccattagtgt gtatagcgtg    240
tgcgaaccgt tcattcgccg tcgcgcagtg cgtcatctgg aaaaaggtcg tgccattatt    300
ctggccgccg gtaccggcaa tccgttcttc accaccgata cctgcgcagc actgcgcgcc    360
accgaaattg ccgccgatgt tctgctgaaa gccaccaaag tggatggcat ctatgataaa    420
gatccgaaaa aacatccgga taccaaactg ttcaatagca ttagctatga tcaggttcat    480
aatgataagc tgcgtgtgat ggatctgacc gcaattaccc tgtgtatgga acgtaaactg    540
ccgctggttg tgttcaatat gaaaaaaccg ggtaatattg cacgcgttgt gctgggcgaa    600
aatgttggta ccaaaattca gccgtaagga tcc                                 633

<210> 459
<211> 218
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 459
Met Ala Asp Ile Ser Ile Gly Pro Met Ser Glu Lys Ile Asp Lys Leu
1               5                   10                  15
Ala Ser Ser Phe Gln Val Lys Ser Lys Thr Val Leu Lys Lys Ser Glu
            20                  25                  30
Leu Trp Val Val Lys Ala Gly Ser Gln Met Val Ile Asp Gly Gly Pro
        35                  40                  45
Met Leu Ile Arg Ser Trp Met Ser Gln Val Ser Glu Leu Ala Arg Gln
    50                  55                  60
Asn His Val Gln Val Ile Trp Val Thr Ser Gly Ala Ile Ala Ser Ala
65                  70                  75                  80
Arg Lys Arg Thr Gly Lys Val Pro Lys Leu Leu Arg Glu Lys Gln Ala
                85                  90                  95
Leu Ser Ala Ile Gly Gln Pro His Leu Ile Asn His Tyr Leu Val Ala
            100                 105                 110
Leu Gln Glu Asn Asp Val Ser Gly Ala Gln Ile Leu Leu Thr Ala Glu
        115                 120                 125
Asp Leu Arg Ser Gln Ala Arg Arg Thr Tyr Leu Gln Gln Thr Leu Lys
    130                 135                 140
Thr Leu Leu Glu Trp Asn Phe Leu Pro Ile Leu Asn Glu Asn Asp Ala
145                 150                 155                 160
Val Ala Thr Glu Glu Ile Gln Phe Gly Asp Asn Asp Arg Leu Ala Ala
                165                 170                 175
Leu Val Ala Ile His Met Lys Ala Lys Arg Leu Val Leu Leu Thr Asp
            180                 185                 190
Val Asp Gly Leu Tyr Asp Arg Asp Pro Lys Thr Asp Ala Ser Ala Lys
        195                 200                 205
Leu Val Ser Glu Leu Ser Gly Ile Pro Ser
    210                 215

<210> 460

<400> 460
000

<210> 461
<211> 663
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 461
atggccgata ttagtattgg tccgatgagc gaaaaaattg ataaactggc aagtagcttc     60
caggtgaaaa gcaaaaccgt tctgaaaaaa agtgaactgt gggttgttaa agcaggcagc    120
cagatggtga ttgatggtgg tccgatgctg attcgtagct ggatgagtca ggtgagcgaa    180
ctggcacgtc agaatcatgt gcaggttatc tgggtgacca gtggcgcaat tgcaagcgca    240
cgcaaacgta ccggcaaagt tccgaaactg ctgcgtgaaa aacaggccct gagtgcaatt    300
ggtcagccgc atctgattaa tcattatctg gtggcactgc aagaaaatga tgttagcggt    360
gcacagattc tgctgaccgc agaagactta cgtagccagg cccgtcgcac ctatctgcaa    420
cagaccctga aaaccttact ggaatggaac ttcctgccga ttctgaatga aaatgatgcc    480
gtggcaaccg aagaaattca gttcggtgat aatgatcgcc tggccgcact ggttgccatt    540
cacatgaaag caaaacgtct ggtgctgctg accgatgtgg atggtctgta tgatcgtgat    600
ccgaaaaccg atgccagcgc caaactggtg agcgaattaa gtggtattcc gagttaagga    660
tcc                                                                  663

<210> 462
<211> 191
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 462
Met Pro Pro Val Ser Leu Pro Ala Pro Gln Ser Tyr Ala Tyr Thr Gly
1               5                   10                  15
Leu Thr Lys Tyr Leu Ser Arg Phe Ser Ile Ser Asn His Tyr Thr Ile
            20                  25                  30
Ala Met Asn Thr Ser Thr Met Thr Thr Lys Asn Val Thr Met Ile Lys
        35                  40                  45
Leu Gly Gly Ser Val Ile Thr Asp Lys Asn Val Lys Asp Val Phe Arg
    50                  55                  60
Arg Glu Val Leu Ile Ser Leu Ile Asn Asp Ile Lys Lys Phe Arg Glu
65                  70                  75                  80
Glu Asn Pro Asp Gln Leu Leu Ile Ile Gly His Gly Gln Gly Ser Phe
                85                  90                  95
Ala His Phe Pro Ala Lys Lys Tyr Arg Thr Met Glu Gly Phe Ile Asn
            100                 105                 110
Glu Tyr Ser Arg Tyr Gly Met Ala Val Thr Gln Phe Thr Val Gly Asn
        115                 120                 125
Leu His Gln Leu Val Leu Glu Glu Met Leu Gly His Glu Leu Pro Val
    130                 135                 140
Val Ser Phe Arg Val Asn Ser Ala Ala Val Ala Lys Lys Ser Lys Met
145                 150                 155                 160
Ser Tyr Leu Ala Gly Glu Val Leu Gly Glu Tyr Leu Asn Gln Gly Leu
                165                 170                 175
Leu Pro Val Thr Cys Gly Asp Val Leu Val Asp Ser Glu Arg Gly
            180                 185                 190

<210> 463
<211> 573
<212> DNA
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 463
atgccgccgg tttcgctccc tgccccgcag tcatatgcat atacgggttt gacaaaatac     60
ttgagtagat ttagcattag caaccattat actatagcga tgaacacttc tactatgacc    120
acaaaaaatg tcacaatgat taaactgggc ggatcggtaa ttacagataa aaatgttaag    180
gatgttttcc gccgggaagt tttaataagt cttattaatg atatcaagaa gttcagggag    240
gaaaatccgg accaactgct gataatcggg catggtcagg gcagttttgc tcattttccg    300
gcaaagaaat atcgcaccat ggaagggttt atcaatgagt atagccgcta tgggatggcg    360
gtgacacaat ttacggtcgg aaatctgcat cagctagttt tggaggagat gctagggcat    420
gaactaccgg tcgtcagctt cagggtcaat agtgcggctg tggcaaaaaa atccaaaatg    480
tcatatttgg caggagaagt tttaggcgaa tatttaaatc aagggctact tccagtaacc    540
tgcggcgatg ttttagttga ttcggaaaga ggc                                 573

<210> 464
<211> 582
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 464
atgccgcctg tgagcctgcc ggcaccgcag agttatgcat ataccggcct gaccaaatat     60
ctgagtcgct tcagtattag caatcattat accattgcaa tgaacaccag taccatgacc    120
accaaaaatg ttaccatgat taaactgggc ggtagcgtta ttaccgataa aaatgtgaaa    180
gatgtgttcc gtcgcgaagt gctgattagc ctgattaatg atattaaaaa gttccgtgag    240
gagaatccgg atcagctgct gattattggc catggtcagg gcagcttcgc ccacttcccg    300
gccaaaaaat atcgtaccat ggaaggcttc attaatgaat atagtcgcta tggtatggcc    360
gtgacccagt tcaccgtggg taatctgcat cagctggtgc tggaagaaat gctgggccat    420
gaactgccgg ttgtgagctt ccgtgtgaat agcgccgccg ttgcaaaaaa aagtaaaatg    480
agctatctgg caggtgaagt tctgggcgaa tatctgaatc agggtctgct gccggtgacc    540
tgcggcgatg ttctggttga tagtgaacgc ggctaaggat cc                       582

<210> 465
<211> 193
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 465
Met Asn Arg Ile Val Val Gly Glu Met Leu Lys Gln Asn Ile Pro Ala
1               5                   10                  15
Val Thr Gln Gln Ser Ala Ser Leu Phe Ser Leu Ala Asn Arg Asp Ala
            20                  25                  30
Lys Pro Asp Ile Ser Leu Lys Ile Val Gln Ser Leu Leu Gly Gly Gly
        35                  40                  45
Phe Val Pro Val Val Tyr Gly Asp Val Leu Phe Thr Gln Asp Gly Asn
    50                  55                  60
Phe Thr Ile Tyr Ser Thr Glu Lys Val Leu Asn Asn Leu Ala Leu Ala
65                  70                  75                  80
Leu Arg Glu Ser Gly Glu Thr Ile Gly Lys Val Ile His Cys Gly Glu
                85                  90                  95
Thr Asp Gly Phe Leu Lys Asp Gly Gln Val Val Ser Ile Ile Thr Pro
            100                 105                 110
Glu Thr His Gln Glu Met Leu Ala Ser Met Asp Lys Ala Lys Gly Phe
        115                 120                 125
Asp Val Thr Gly Gly Met Glu His Lys Val Glu Met Ala Leu Asn Leu
    130                 135                 140
Ala His Asp Gly Ile Asp Ser Phe Ile Val Gly Asn Asn His Gly Gly
145                 150                 155                 160
Asn Leu Tyr Arg Thr Ile Val Gly Lys Glu Tyr Ile Gly Thr Arg Ile
                165                 170                 175
His Trp Leu His Lys Cys Leu Thr Ile Met Arg Cys Met Cys Asp Arg
            180                 185                 190
Ile

<210> 466

<400> 466
000

<210> 467
<211> 588
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 467
atgaaccgta ttgttgttgg cgaaatgctg aaacagaata ttccggccgt tacccagcag     60
agcgcaagtc tgttcagtct ggccaatcgt gatgcaaaac cggatattag cctgaaaatt    120
gtgcagagtc tgctgggtgg cggcttcgtg ccggtggtgt atggtgatgt gctgttcacc    180
caggatggca acttcaccat ctatagtacc gaaaaagttc tgaataacct ggccctggca    240
ctgcgtgaaa gcggcgaaac cattggcaaa gttattcatt gtggtgaaac cgatggcttc    300
ctgaaagatg gtcaggtggt tagcattatt accccggaaa cacatcagga aatgctggca    360
agtatggata aagccaaagg cttcgatgtg accggtggta tggaacataa agttgaaatg    420
gccctgaatc tggcacatga tggtattgat agcttcattg tgggtaataa tcatggcggc    480
aatctgtatc gcaccattgt gggtaaagaa tatattggta cccgtattca ttggctgcat    540
aaatgtctga ccattatgcg ttgcatgtgc gatcgtattt aaggatcc                 588

<210> 468
<211> 289
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 468
Met Ala Leu Ile Val Met Lys Phe Gly Gly Thr Ser Val Ala Asp Val
1               5                   10                  15
Glu Lys Ile Glu Asn Ala Ala Asp Lys Val Val Lys Glu Val Glu Arg
            20                  25                  30
Gly His Lys Val Ala Val Val Val Ser Ala Met Ser Gly Val Pro Asn
        35                  40                  45
Gln Leu Val Gly Tyr Cys Ser Glu Ile Ser Ser Leu His Asp Val Arg
    50                  55                  60
Glu Tyr Asp Thr Val Val Ala Ser Gly Glu Gln Val Thr Ala Gly Leu
65                  70                  75                  80
Met Ala Met Ala Leu Gln Lys Arg Gly Val Thr Ala Arg Ser Trp Leu
                85                  90                  95
Gly Trp Gln Ile Pro Ile Lys Thr Asn Ser Ile His Gly Lys Ala Arg
            100                 105                 110
Ile Glu Glu Ile Glu Thr Lys Glu Leu His Lys Arg Leu Asp Ala Gly
        115                 120                 125
Glu Val Leu Val Val Pro Gly Phe Gln Gly Val Thr Lys Asn Lys Arg
    130                 135                 140
Ile Thr Thr Leu Gly Arg Gly Gly Ser Asp Thr Ser Ala Val Ala Leu
145                 150                 155                 160
Ala Ala Ala Leu Lys Ala Asp Arg Cys Asp Ile Tyr Thr Asp Val Lys
                165                 170                 175
Gly Val Tyr Thr Ala Asp Pro Arg Ile Val Pro Lys Ala Lys Met Ile
            180                 185                 190
Pro Lys Ile Ser Tyr Glu Glu Met Leu Glu Leu Ala Ser Leu Gly Ser
        195                 200                 205
Lys Val Leu Gln Thr Arg Ser Val Glu Met Gly Ala Lys Ser Gly Val
    210                 215                 220
Pro Ile Gln Val Leu Ser Thr Phe Glu Pro Tyr Leu Gly Ser Asp Leu
225                 230                 235                 240
Lys Gly Thr Leu Val Thr Lys Glu Glu Asn Ile Val Glu Gln Glu Ile
                245                 250                 255
Val Ser Gly Ile Ala His Asn Lys Asp Glu Ala Lys Val Thr Val Val
            260                 265                 270
Gly Val Glu Asp Lys Pro Gly Ile Ala Ala Ser Leu Phe Glu Pro Leu
        275                 280                 285
Ala

<210> 469

<400> 469
000

<210> 470
<211> 876
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 470
atggccctga ttgttatgaa attcggtggt accagcgtgg cagatgttga aaaaattgaa     60
aatgcagccg ataaagttgt taaagaagtt gaacgcggtc ataaagttgc agtggtggtg    120
agcgcaatga gtggcgttcc gaatcagctg gttggctatt gcagcgaaat tagcagtctg    180
catgatgttc gtgaatatga taccgttgtt gccagtggcg aacaggtgac cgcaggtctg    240
atggcaatgg ccctgcaaaa acgcggtgtg accgcacgca gttggctggg ttggcagatt    300
ccgattaaaa ccaatagcat tcatggtaaa gcacgcattg aagaaattga aaccaaagaa    360
ctgcataaac gtctggatgc cggcgaagtg ctggttgttc cgggcttcca gggtgtgacc    420
aaaaataaac gtattaccac cctgggtcgt ggtggcagtg ataccagcgc agttgcactg    480
gcagcagcac tgaaagcaga tcgttgtgat atctataccg atgttaaagg tgtgtatacc    540
gcagatccgc gtattgtgcc gaaagcaaaa atgattccga aaattagcta cgaagaaatg    600
ctggaactgg ccagcctggg tagcaaagtt ctgcaaacca gaagcgttga aatgggcgca    660
aaaagcggcg tgccgattca ggtgctgagt accttcgaac cgtatctggg tagcgatctg    720
aaaggtaccc tggttaccaa agaagaaaat attgttgaac aggaaatcgt tagcggtatt    780
gcccataata aagatgaagc caaagtgacc gttgttggcg ttgaagataa accgggtatt    840
gcagccagtc tgttcgaacc gctggcataa ggatcc                              876

<210> 471
<211> 222
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 471
Met Ser Ser Arg Gly Leu Gly Asp Val Tyr Lys Arg Gln Leu Lys Leu
1               5                   10                  15
Gly Ala Met Pro Ile Ile Asn Glu Asn Asp Ser Val Thr Thr Asn Glu
            20                  25                  30
Ile Arg Phe Gly Asp Asn Asp Ser Leu Ser Ala Trp Val Ala His Leu
        35                  40                  45
Met Gly Ala Asp Ala Leu Ile Leu Leu Thr Asp Val Asp Gly Leu Phe
    50                  55                  60
Asp Ser Asp Pro Arg Phe Asn Lys His Ala Lys Val Ile Lys Asp Val
65                  70                  75                  80
His Asn Ile Ala Asp Val Lys His Leu Ala Gly His Ala Gly Thr Gln
                85                  90                  95
Arg Gly Thr Gly Gly Met Val Thr Lys Leu Arg Ala Ala Glu Leu Ala
            100                 105                 110
Thr Val Ala Gly Thr Glu Thr Leu Ile Ile Gly Gly Gly Gly Pro Gly
        115                 120                 125
Leu Glu Ala Leu Ala Lys Gly Glu Ile Arg Gly Thr Arg Phe Tyr Ala
    130                 135                 140
Lys Thr Ser Pro Ser Ala Arg Lys Ser Trp Leu Ala Gln Leu Pro Leu
145                 150                 155                 160
Arg Gly Ser Ile Glu Ile Asp Ala Gly Ala Ala Lys Ala Leu Ser Arg
                165                 170                 175
Gly Asn Ser Leu Leu Pro Lys Gly Ile Thr Val Ile Asp Gly His Phe
            180                 185                 190
Asp Phe Gly Asp Ala Val Ala Val Thr His Asn Gly Ala Cys Val Ala
        195                 200                 205
Arg Gly Leu Ser Asn Tyr Pro Ser Ala Ala Leu Lys Lys Ile
    210                 215                 220

<210> 472

<400> 472
000

<210> 473
<211> 675
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 473
atgagtagtc gcggcctggg cgatgtgtat aaacgccagc tgaaactggg cgccatgccg     60
attattaatg aaaatgatag cgttaccacc aatgaaattc gcttcggtga taatgatagc    120
ctgagcgcat gggttgccca tctgatgggt gccgatgcac tgattctgct gaccgatgtt    180
gatggtctgt tcgatagcga tccgcgcttc aataaacatg ccaaagttat taaagacgtg    240
cataatattg ccgatgttaa acatctggca ggtcatgccg gcacccagcg tggcaccggt    300
ggtatggtta ccaaactgcg tgccgcagaa ctggcaaccg tggccggtac cgaaacctta    360
attattggcg gcggcggccc gggtctggaa gcattagcaa aaggcgaaat tcgcggcacc    420
cgcttctatg caaaaaccag cccgagcgcc cgtaaaagct ggctggcaca gctgccgctg    480
cgtggcagca ttgaaattga tgccggcgca gccaaagcac tgagtcgtgg taatagcctg    540
ctgccgaaag gcattaccgt gattgatggc cacttcgact tcggtgatgc agttgcagtg    600
acccataatg gtgcatgcgt tgcacgcggc ctgagcaatt atccgagcgc cgccctgaaa    660
aaaatttaag gatcc                                                     675

<210> 474
<211> 331
<212> PRT
<213> Unknown

<220>
<223> Obtained from metagenome sequencing

<400> 474
Met Ser Leu Pro Asn Ala Gly His Ala Arg Arg Ile Val Val Lys Leu
1               5                   10                  15
Gly Thr Gly Val Leu Thr Ser Gly Ile Gly Gln Leu Asp Thr Asp Arg
            20                  25                  30
Ile Gly Ser Ile Ala Arg Gly Ile Ala Ser Leu Arg Gln Arg Gly Thr
        35                  40                  45
Glu Val Ile Val Val Ser Ser Gly Ala Val Gly Leu Gly Met Gly Ala
    50                  55                  60
Leu Lys Leu Gln Lys Lys Pro Lys Asp Val Ser Lys Lys Gln Ala Cys
65                  70                  75                  80
Ala Ala Ile Gly Gln Ser Arg Leu Met Gln Thr Trp Gln Asn Ala Phe
                85                  90                  95
Ser Pro Leu Asn Leu Thr Val Ala Gln Val Leu Leu Thr His Glu Asp
            100                 105                 110
Leu Arg Ser Arg Thr Arg Tyr Leu Gly Val Lys Glu Cys Leu Gln Gln
        115                 120                 125
Leu Ile Ala Tyr Gly Thr Ile Pro Ile Ile Asn Glu Asn Asp Thr Val
    130                 135                 140
Ser Ala Ala Glu Ile Lys Phe Gly Asp Asn Asp Thr Leu Ser Ala Met
145                 150                 155                 160
Val Ala Ser Leu Thr Glu Ala Thr His Leu Ala Ile Leu Ser Thr Ala
                165                 170                 175
Pro Gly Leu Ile Asp Met Lys Gly Thr Gly Gln Ile Ile Pro Val Val
            180                 185                 190
Glu Arg Ile Thr Pro Glu Ile Glu Ala Met Ala Gly Gly Thr Thr Ser
        195                 200                 205
Glu Thr Ala Thr Gly Gly Met Ile Ser Lys Ile Ser Ala Ala Arg Leu
    210                 215                 220
Ala Thr Gln Ala Gly Cys Gly Val Phe Ile Ala Ser Gly Ala Glu Pro
225                 230                 235                 240
Asp Ile Leu Asn Lys Leu Leu Ser Gly Thr Gly Pro Gly Thr Phe Phe
                245                 250                 255
Val Pro Ser Gly Leu Pro Leu Glu Ala Lys Lys Arg Trp Leu Ala Tyr
            260                 265                 270
Phe Gln Arg Pro Ser Gly Thr Leu Leu Val Asn Thr Cys Ala Val Pro
        275                 280                 285
Val Leu Arg Asp Gln Gly Arg Ser Leu Leu Ala Val Gly Val Thr Gly
    290                 295                 300
Ala Lys Gly Gln Phe Gln Ser Gly Asp Ile Val Asn Ile Ala Ala Pro
305                 310                 315                 320
Asp Gly Thr Ile Phe Ala Arg Gly Lys Thr Ala
                325                 330

<210> 475

<400> 475
000

<210> 476
<211> 1002
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 476
atgagtctgc cgaatgccgg ccatgcacgc cgtattgtgg tgaaactggg caccggtgtt     60
ctgaccagcg gtattggtca gctggatacc gatcgcattg gcagcattgc ccgcggtatt    120
gcaagtctgc gtcagcgcgg caccgaagtt attgtggtta gcagtggtgc cgtgggtctg    180
ggtatgggcg cactgaaact gcaaaaaaaa ccgaaagatg ttagtaaaaa gcaggcatgc    240
gcagccattg gccagagccg tctgatgcag acctggcaga atgcattcag cccgctgaat    300
ctgaccgttg cccaggttct gctgacccat gaagacttac gcagccgcac ccgctatctg    360
ggtgttaaag aatgcctgca acagctgatt gcctatggca ccattccgat tattaatgaa    420
aatgataccg tgagtgcagc cgaaattaaa ttcggcgata atgataccct gagtgccatg    480
gttgccagcc tgaccgaagc cacccatctg gccattctga gcaccgcccc gggcctgatt    540
gatatgaaag gtaccggtca gattattccg gtggtggaac gtattacccc ggaaattgaa    600
gccatggccg gtggcaccac cagcgaaacc gcaaccggcg gcatgattag caaaattagc    660
gccgcacgtc tggcaaccca ggccggttgc ggcgtgttca ttgccagtgg cgccgaaccg    720
gatattctga ataaactgct gagcggcacc ggtccgggca ccttcttcgt gccgagtggc    780
ctgccgctgg aagccaaaaa acgttggctg gcatacttcc agcgtccgag tggcaccctg    840
ctggtgaata cctgtgcagt tccggttctg cgtgatcagg gtcgcagcct gctggcagtg    900
ggtgttaccg gcgccaaagg ccagttccag agcggtgata ttgtgaatat tgcagccccg    960
gatggtacca tcttcgcccg tggcaaaacc gcctaaggat cc                      1002

<210> 477
<211> 257
<212> PRT
<213> Methanococcus vannielii

<400> 477
Met Phe Ala Ile Leu Lys Leu Gly Gly Ser Ile Leu Cys Asp Lys Asn
1               5                   10                  15
Val Pro Tyr Ser Ile Asn Trp Glu Asn Leu Gln Asn Ile Gly Ile Glu
            20                  25                  30
Ile Lys Glu Ala Leu Glu Tyr Tyr Arg Lys Glu Glu Ile Asn Leu Lys
        35                  40                  45
Leu Ile Ile Val His Gly Gly Gly Ser Phe Gly His Pro Val Ala Lys
    50                  55                  60
Lys Tyr Leu Lys Asn Gly Lys Phe Val Asp Met Gly Lys Gly Tyr Trp
65                  70                  75                  80
Glu Ile Gln Lys Ala Met Arg Lys Phe Asn Asn Ile Val Ile Asp Glu
                85                  90                  95
Leu Gln Asn Phe Glu Ile Pro Val Val Ser Ile Gln Pro Ser Ser Phe
            100                 105                 110
Ile Thr Phe Asp Lys Asp Leu Asn Leu Arg Phe Asp Thr Asn Ala Ile
        115                 120                 125
Glu Lys Met Leu Glu Lys Asp Leu Ile Pro Val Ile His Gly Asp Ile
    130                 135                 140
Val Ile Asp Glu Arg Glu Asn Asn Phe Lys Ile Phe Ser Gly Asp His
145                 150                 155                 160
Ala Leu Pro His Leu Ser Lys Lys Leu Asn Pro Asp Leu Ser Leu His
                165                 170                 175
Ala Ser Asp Val Asp Gly Val Trp Asp Thr Lys Phe Lys Val Ile Glu
            180                 185                 190
Lys Ile Asp Ser Ser Asn Ile Asn Lys Val Leu Glu Ser Leu Lys Pro
        195                 200                 205
Ser Asn Lys Glu Asp Val Thr Gly Gly Met His Leu Lys Val Met Glu
    210                 215                 220
Cys Tyr Asn Leu Gly Val Lys Thr Ile Ile Phe Asn Gly Ser Lys Lys
225                 230                 235                 240
Arg Asn Ile Tyr Asn Ala Leu Leu Lys Asn Val Lys Gly Thr Ser Ile
                245                 250                 255
Asn

<210> 478
<211> 774
<212> DNA
<213> Methanococcus vannielii

<400> 478
atgtttgcaa ttttaaaact tggcggaagt atcctttgcg ataaaaatgt tccatattca     60
attaattggg aaaatttaca gaatattgga atcgaaataa aagaagccct tgaatattac    120
agaaaagaag aaataaacct aaaattaatt attgttcacg gaggaggctc ttttggccat    180
cctgttgcta aaaaatattt aaaaaatgga aaatttgtag atatgggcaa aggatactgg    240
gaaattcaaa aagcaatgag aaaattcaac aatatcgtaa ttgatgaact acaaaatttt    300
gaaattccgg tagtttcaat acagccctcc tcatttatca cttttgacaa agatttaaac    360
ttacgttttg acacaaatgc tattgaaaag atgcttgaaa aagatttaat tcccgtaatt    420
cacggggata ttgtaattga tgaaagggaa aataatttca agatattttc tggagaccat    480
gcactccccc acctttcaaa aaaactaaat cctgatctga gtcttcatgc atccgacgtt    540
gatggtgtat gggatactaa atttaaggta attgaaaaaa tagattcatc aaatattaat    600
aaagtgctag aatcgttaaa accttcaaac aaagaagatg taaccggagg aatgcactta    660
aaagtaatgg aatgttataa tttaggcgtt aaaaccataa tttttaatgg aagtaaaaaa    720
agaaatatat ataatgccct tttaaaaaat gtaaaaggaa cttcaataaa ttaa          774

<210> 479
<211> 780
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 479
atgttcgcca ttctgaaact gggcggcagt attctgtgcg ataaaaatgt gccgtatagc     60
attaattggg aaaatctgca aaacattggc attgaaatta aggaagcact ggaatattac    120
cgcaaagaag aaattaatct gaagctgatt atcgtgcatg gcggcggtag cttcggccat    180
ccggtggcaa aaaaatatct gaaaaatggt aagttcgtgg atatgggcaa aggctattgg    240
gaaattcaga aagcaatgcg caaattcaat aatattgtga ttgacgaact gcaaaacttc    300
gaaattccgg ttgtgagtat tcagccgagt agcttcatta ccttcgataa agacttaaat    360
ctgcgcttcg ataccaatgc cattgaaaaa atgctggaaa aagacttaat cccggtgatt    420
catggcgata ttgtgattga tgaacgtgaa aataacttca aaatcttcag tggcgatcat    480
gcactgccgc atctgagtaa aaaactgaat ccggatctga gcctgcatgc aagcgatgtg    540
gatggtgtgt gggataccaa attcaaagtg attgaaaaaa tcgacagcag taatattaac    600
aaggtgctgg aaagcctgaa accgagtaat aaagaagatg tgaccggtgg tatgcatctg    660
aaagttatgg aatgttataa cctgggcgtg aaaaccatta tcttcaatgg cagcaaaaaa    720
cgcaatatct ataatgccct gctgaaaaat gtgaaaggta ccagcattaa ttaaaagctt    780

<210> 480
<211> 257
<212> PRT
<213> Methanococcoides burtonii

<400> 480
Met Asn Ser Asn Asn Gly Ile Thr Ile Leu Lys Ile Gly Gly Ser Val
1               5                   10                  15
Ile Thr Asp Lys Arg Ser Glu Asp Gly Leu Ala Trp Glu Glu Glu Ile
            20                  25                  30
Val Arg Ile Ala Arg Glu Ile Ser Gly Phe Glu Gly Lys Leu Ile Ile
        35                  40                  45
Val His Gly Ala Gly Ser Tyr Gly His Pro Gln Ala Lys Arg Tyr Ala
    50                  55                  60
Leu Thr Glu Gly Phe His Ala Glu Gly Ala Val Val Thr His Asn Ala
65                  70                  75                  80
Val Lys Ala Leu Asn Arg Ile Val Val Gly Ile Leu Asn Asp Glu Gly
                85                  90                  95
Val Asn Ala Ile Ser Val His Pro Met Cys Cys Thr Val Ala Lys Asn
            100                 105                 110
Gly Arg Ile Ser Asp Met Tyr Leu Gly Ser Ile Arg Leu Met Leu Glu
        115                 120                 125
Lys Gly Leu Val Pro Val Leu His Gly Asp Val Val Met Asp Glu Val
    130                 135                 140
Lys Gly Val Ser Ile Ile Ser Gly Asp Gln Val Ile Pro Tyr Leu Ala
145                 150                 155                 160
Thr Gln Leu Lys Ala Ser Arg Ile Gly Val Gly Ser Ala Ala Asp Gly
                165                 170                 175
Val Phe Asp Asp Lys Gly Val Thr Ile Pro Val Ile Thr Ser Met Asn
            180                 185                 190
Phe Asp Glu Ile Lys Ala Tyr Ile Gly Gly Ser Ala Gly Thr Asp Val
        195                 200                 205
Thr Gly Gly Met Leu Gly Lys Val Leu Glu Met Leu Glu Leu Gly Lys
    210                 215                 220
Thr Ser Ser Ile Thr Ser Tyr Ile Phe Asn Ala Thr Val Val Gly Ser
225                 230                 235                 240
Val Ser Ser Phe Leu Asn Gly Glu Asn Ile Gly Thr Ala Ile Lys Asp
                245                 250                 255
Ser

<210> 481
<211> 774
<212> DNA
<213> Methanococcoides burtonii

<400> 481
gtgaactcaa ataacgggat aactatattg aagataggcg gcagcgtaat taccgataag     60
agatcggaag acgggcttgc ctgggaagaa gagatagtcc ggatagctcg tgaaatatcc    120
ggttttgaag gtaaacttat cattgtgcat ggtgcaggtt cctacggtca ccctcaggca    180
aaaagatatg ctcttacgga aggcttccat gcagaagggg ctgttgtcac tcacaatgca    240
gtaaaggcat tgaaccgtat cgtagttgga attctcaatg atgaaggtgt gaatgcgata    300
tcagtacatc caatgtgttg cacggtcgca aagaacggtc gcatctcgga catgtatctt    360
ggaagtatca ggttgatgct tgagaaaggg cttgttccgg tattacatgg tgatgttgta    420
atggatgagg taaaaggcgt atccatcatc tccggtgatc aggtgatacc ctatcttgca    480
acacagttga aggcttccag aattggagtt gggagtgcag cagatggtgt gttcgatgat    540
aagggcgtaa ccattccagt gatcacttct atgaatttcg atgagataaa ggcatatatc    600
ggtggctcgg caggaactga tgtgaccggt ggaatgctcg ggaaagtgct tgaaatgctg    660
gaacttggca aaacatcaag tataacctcc tatatattca atgcaactgt agtgggaagc    720
gtgtcgagtt ttctaaatgg tgagaacatc ggtaccgcaa taaaagattc ataa          774

<210> 482
<211> 780
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 482
atgaacagta acaacggtat taccattctg aaaattggtg gtagcgttat taccgataaa     60
cgtagcgaag atggtctggc atgggaagaa gaaattgtgc gtattgcacg cgaaattagt    120
ggcttcgaag gcaaactgat tattgtgcat ggtgccggta gttatggtca tccgcaggcc    180
aaacgttatg ccctgaccga aggcttccat gcagaaggtg ccgtggttac ccataatgca    240
gttaaagcac tgaatcgcat tgttgttggc attctgaatg atgaaggtgt taatgcaatt    300
agcgtgcatc cgatgtgttg caccgttgcc aaaaatggtc gtattagcga tatgtatctg    360
ggtagcattc gtctgatgct ggaaaaaggc ctggtgccgg ttctgcatgg tgatgtggtg    420
atggatgaag ttaaaggcgt tagcattatt agcggtgatc aggtgattcc gtatctggcc    480
acccagctga aagcaagtcg tattggtgtg ggcagtgcag cagatggtgt gttcgatgat    540
aaaggcgtta ccattccggt tattaccagc atgaacttcg atgaaattaa agcatatatc    600
ggtggtagcg caggcaccga tgtgaccggt ggtatgctgg gtaaagtgct ggaaatgctg    660
gaactgggca aaaccagcag cattaccagc tatatcttca atgccaccgt tgtgggcagc    720
gtgagtagct tcctgaatgg cgaaaatatt ggtaccgcaa ttaaagatag ttaaaagctt    780

<210> 483
<211> 258
<212> PRT
<213> Methanolobus tindarius

<400> 483
Met Asp Asn Asn Asn Ile Thr Ile Leu Lys Ile Gly Gly Ser Val Ile
1               5                   10                  15
Thr Asp Lys Ser Ala Asp Asp Gly Thr Ala Arg Leu Ser Glu Ile Glu
            20                  25                  30
Arg Ile Ala Ala Glu Ile Ser Gly Phe Glu Gly Lys Leu Ile Ile Val
        35                  40                  45
His Gly Ala Gly Ser Phe Gly His Pro Gln Val Lys Arg Phe Gly Leu
    50                  55                  60
Thr Gly Lys Phe Asp His Glu Gly Ser Ile Ile Thr His Met Ser Val
65                  70                  75                  80
Arg Lys Leu Asn Thr Met Val Val Glu Thr Leu Asn Ser Ala Gly Ile
                85                  90                  95
Asn Ala Leu Pro Val His Pro Met Ala Cys Ala Ile Ser Ser Asn Ser
            100                 105                 110
Arg Ile Lys Ser Met Phe Arg Glu Gln Ile Glu Glu Met Leu Ala Asn
        115                 120                 125
Gly Phe Val Pro Val Leu His Gly Asp Met Val Met Asp Thr Asp Leu
    130                 135                 140
Gly Thr Ser Val Leu Ser Gly Asp Gln Ile Val Pro Tyr Leu Ala Ile
145                 150                 155                 160
Gln Met Lys Ala Ser Arg Ile Gly Ile Gly Ser Ala Glu Glu Gly Val
                165                 170                 175
Leu Asp Asp Lys Gly Gly Val Ile Pro Leu Ile Asn Asn Glu Asn Phe
            180                 185                 190
Asp Glu Ile Lys Ala Tyr Leu Ser Gly Ser Ala Asn Thr Asp Val Thr
        195                 200                 205
Gly Gly Met Leu Gly Lys Val Leu Glu Leu Leu Glu Leu Ser Glu Gln
    210                 215                 220
Ser Asn Ser Thr Ser Tyr Ile Phe Asn Ala Gly Asn Thr Gly Asn Ile
225                 230                 235                 240
Ser Asp Phe Leu Ser Gly Lys Asn Ile Gly Thr Ala Ile Gly Ala Gly
                245                 250                 255
Thr Ile

<210> 484
<211> 777
<212> DNA
<213> Methanolobus tindarius

<400> 484
atggataaca ataatattac tatcctgaag ataggcggta gtgtcatcac agataaaagt     60
gctgatgatg gtactgcaag attaagtgaa atagaaagaa tagcagctga gatctcaggt    120
tttgaaggaa aacttatcat tgtacatggt gcaggatcat tcggacatcc gcaggtcaaa    180
cgtttcggac ttaccggtaa atttgaccac gaaggttcca taattacaca catgtcagta    240
aggaaactca acacaatggt tgttgaaacc ttaaactctg caggaattaa cgcattgcca    300
gtacatccta tggcatgtgc aatatcaagc aacagccgca taaagagtat gttccgtgaa    360
caaatagagg aaatgcttgc taatggattt gtacccgttc ttcatggcga catggtaatg    420
gacaccgacc ttgggacttc ggtcctgtcc ggtgatcaga ttgttccgta tcttgcaata    480
cagatgaaag catcaagaat cggaattgga agtgcggaag aaggtgtgct tgatgacaaa    540
ggaggagtaa tacccctaat caacaacgaa aattttgatg aaattaaggc atatctcagt    600
ggttctgcta acaccgatgt taccggcgga atgctgggaa aagtacttga gcttttagaa    660
ctaagtgagc agtcaaacag cacttcttat atatttaatg caggcaatac aggaaacata    720
tcagatttcc tcagtggtaa gaatattggt accgcaatag gagccgggac tatataa       777

<210> 485
<211> 783
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 485
atggataaca acaacatcac cattctgaaa attggcggca gtgttattac cgataaaagc     60
gcagatgatg gtaccgcacg cctgagtgaa attgaacgca ttgcagcaga aattagtggc    120
ttcgaaggta aactgattat tgtgcatggc gcaggcagct tcggccatcc gcaggtgaaa    180
cgcttcggcc tgaccggcaa attcgatcat gaaggtagta ttattaccca catgagcgtt    240
cgtaaactga ataccatggt ggtggaaacc ttaaatagtg caggtattaa tgccctgccg    300
gttcatccga tggcctgtgc aattagtagt aatagtcgta ttaagagcat gttccgcgaa    360
cagattgaag aaatgctggc aaatggcttc gttccggttc tgcatggtga tatggtgatg    420
gataccgatc tgggtaccag cgttctgagt ggtgatcaga ttgtgccgta tctggccatt    480
cagatgaaag caagtcgcat tggcattggt agcgcagaag aaggcgtgct ggatgataaa    540
ggcggcgtga ttccgctgat taataatgaa aacttcgatg aaatcaaggc ctatctgagc    600
ggcagtgcca ataccgatgt taccggtggc atgctgggca aagttctgga actgctggaa    660
ctgagtgaac agagtaatag caccagctat atcttcaatg caggtaatac cggtaatatt    720
agcgacttcc tgagcggtaa aaatattggt accgcaattg gcgccggcac catttaaaag    780
ctt                                                                  783

<210> 486
<211> 332
<212> PRT
<213> Arabidopsis thaliana

<400> 486
Met Glu Leu Asn Ile Ser Glu Ser Arg Ser Arg Ser Ile Arg Cys Ile
1               5                   10                  15
Val Lys Leu Gly Gly Ala Ala Ile Thr Cys Lys Asn Glu Leu Glu Lys
            20                  25                  30
Ile His Asp Glu Asn Leu Glu Val Val Ala Cys Gln Leu Arg Gln Ala
        35                  40                  45
Met Leu Glu Gly Ser Ala Pro Ser Lys Val Ile Gly Met Asp Trp Ser
    50                  55                  60
Lys Arg Pro Gly Ser Ser Glu Ile Ser Cys Asp Val Asp Asp Ile Gly
65                  70                  75                  80
Asp Gln Lys Ser Ser Glu Phe Ser Lys Phe Val Val Val His Gly Ala
                85                  90                  95
Gly Ser Phe Gly His Phe Gln Ala Ser Arg Ser Gly Val His Lys Gly
            100                 105                 110
Gly Leu Glu Lys Pro Ile Val Lys Ala Gly Phe Val Ala Thr Arg Ile
        115                 120                 125
Ser Val Thr Asn Leu Asn Leu Glu Ile Val Arg Ala Leu Ala Arg Glu
    130                 135                 140
Gly Ile Pro Thr Ile Gly Met Ser Pro Phe Ser Cys Gly Trp Ser Thr
145                 150                 155                 160
Ser Lys Arg Asp Val Ala Ser Ala Asp Leu Ala Thr Val Ala Lys Thr
                165                 170                 175
Ile Asp Ser Gly Phe Val Pro Val Leu His Gly Asp Ala Val Leu Asp
            180                 185                 190
Asn Ile Leu Gly Cys Thr Ile Leu Ser Gly Asp Val Ile Ile Arg His
        195                 200                 205
Leu Ala Asp His Leu Lys Pro Glu Tyr Val Val Phe Leu Thr Asp Val
    210                 215                 220
Leu Gly Val Tyr Asp Arg Pro Pro Ser Pro Ser Glu Pro Asp Ala Val
225                 230                 235                 240
Leu Leu Lys Glu Ile Ala Val Gly Glu Asp Gly Ser Trp Lys Val Val
                245                 250                 255
Asn Pro Leu Leu Glu His Thr Asp Lys Lys Val Asp Tyr Ser Val Ala
            260                 265                 270
Ala His Asp Thr Thr Gly Gly Met Glu Thr Lys Ile Ser Glu Ala Ala
        275                 280                 285
Met Ile Ala Lys Leu Gly Val Asp Val Tyr Ile Val Lys Ala Ala Thr
    290                 295                 300
Thr His Ser Gln Arg Ala Leu Asn Gly Asp Leu Arg Asp Ser Val Pro
305                 310                 315                 320
Glu Asp Trp Leu Gly Thr Ile Ile Arg Phe Ser Lys
                325                 330

<210> 487
<211> 2520
<212> DNA
<213> Arabidopsis thaliana

<400> 487
aaaaaaagag agaatctttc cggtgaatgg gaaatggcaa gaagaaggat acgagtacga     60
tgaagttgag ttagagcggg agagagagac ataccgttga ccactaaaga cgagagcttt    120
agagagagac gcaacaatgg agctgaatat ttccgagagt cgaagcagat caattcgttg    180
cattgtgaaa cttggtttgt cttttgatcc ctattaaaat acttgctgga tcgattctta    240
gatcgcattt tagttgttct gtcttatggg tttgattctg atttgactat tgaataccga    300
agatgaggaa agactttaaa gatctgtctt ttattcattt cccatcaatt ttacctgtct    360
agactttgta acagcagaat gtctaaatga tttgatagga ggtgcggcaa ttacttgcaa    420
aaacgagctg gagaagattc acgatgaaaa tctggaggtc gtggcgtgtc agttacgtca    480
agctatgttg gagggttcag ctccaagcaa ggttattggc atggattgga gcaagagacc    540
tggaagctct gagatttctt gtgatgtgga tgacataggg gatcaaaagt cttctgagtt    600
tagtaaattt gttgtggttc atggcgctgg ttagttggag attgattctt atctgcttaa    660
atcttataat gaaggatcaa aatacatttt tgatttagct cttacagatc attgacctgc    720
ttgtgttagt tgagctgtta tttagttttt gggtttctta tgtgttgttc tgatttttgt    780
cattaggttc ctttgggcac tttcaggcca gtagatctgg ggttcacaaa ggaggacttg    840
agaaacctat tgtcaaagct ggtttcgttg ctactcgtat atctgtgaga ctgttagcta    900
aatctgatgc tcttacgtaa gatagtcttg agctgttctg atagtctaca acttcttttt    960
gcaggtgaca aatcttaatc ttgaaattgt acgagcacta gcccgaggta tctcttaagc   1020
ttattctctt ttcctactct ggctaagagg aaaagatgaa ttttgaagct aaacattatg   1080
atctcttgca gagggcattc ctacgatagg catgtctcca ttttcatgtg gttggtcaac   1140
ctccaaaaga gatgtaagag ctaactgaac ttcttaacat atttcctgat ctgaagcatg   1200
tgggtttaga tttcaatgag actgcttttt gcattaggtg gcttctgcag atctagcaac   1260
cgtagctaaa accatagact caggatttgt ccctgtaagt tttgatccca ccagtgaaat   1320
taggtagatg tttaccaaag tctcttgact gtaatttgga cagtaatttt tgtcatggcc   1380
aggttctcca tggagatgca gtgctggaca atatactggt aatatgctct ccagagatac   1440
caaatagcta tctagattct tgtccgttta gaatttgaac agttgaaaat cttgtcctga   1500
acagggctgc accatattga gtggtgatgt tatcatccgt catcttgcag atcatttgaa   1560
gccagaatat gttgtctttc tcgtatccta ttcatagtat aaaccggaaa tttaatttgt   1620
gttctctctc cctctttcct atatattctt tgttatctga gtaactcttt acaaagacag   1680
atgtactagg tgtctacgat cgaccacctt caccttcaga gcccgacgct gtgctcttga   1740
aagagatcgg ttagtttcaa actttactgt ttgttttttt tatatctttt ccagacttgg   1800
cctgatacta acagttcatg cagctgttgg agaagatgga agctggaagg ttgtgaatcc   1860
actgttggag cacacagaca agaaaggtaa acgaaagaac caatgttgta aaactatgca   1920
atacaataag tctgagggat ttgttaaaag ttcaaacctc tctgattttt ggaatcaaac   1980
agttgactac tctgttgcgg cgcacgatac aaccggtgga atggaaacga agatatcaga   2040
agctgctatg attgcaaaac ttggagtcga tgtctacatt gtgaaggtag attatagaca   2100
acttttgcct ttaaccaact aaatatctta gataaagatg gagcttttaa ccgccatcat   2160
gctctttttt tcaatctcct caaacaggct gcgacaactc attcacagag agcactaaac   2220
ggtgatttga gagatagtgt tcctgaagat tggcttggta ctatcatcag attctcaaag   2280
tagaataatc tcctgacaaa tacactaatt ccagttcctc aacaaaggaa agcattgacc   2340
ttttcttgct attactgttt taccagactt tgtgcaagga atacaaaatc gtgtaatgtc   2400
aaaaccagtt aaaatgtcga atactatcgc agcctgtgta actctatgtt tccacaagta   2460
ttcaatcaca tatcttacac tgttattgtt aaataaaatg tatatgtcac tcttgaatcg   2520

<210> 488
<211> 1009
<212> DNA
<213> Artificial sequence

<220>
<223> Codon optimised sequence

<400> 488
atggaactga atatcagcga aagccgtagt cgtagcattc gttgtattgt taaactgggt     60
ggtgcagcaa ttacctgtaa aaatgaactg gaaaaaatcc atgacgaaaa tctggaagtt    120
gttgcatgtc agctgcgcca ggccatgctg gaaggtagcg ccccgagtaa agttattggt    180
atggattgga gtaaacgccc gggcagtagt gaaattagtt gtgatgttga tgatatcggc    240
gatcagaaaa gcagtgaatt ttcaaaattc gtggtggttc atggcgcagg tagctttggc    300
cattttcagg caagtcgtag tggtgttcat aaaggtggcc tggaaaaacc gattgttaaa    360
gccggctttg ttgccacccg cattagcgtt accaatctga atctggaaat tgttcgtgcc    420
ctggcacgtg aaggcattcc gaccattggc atgagcccgt ttagctgtgg ctggagtacc    480
agcaaacgtg atgttgccag cgccgatctg gcaaccgtgg caaaaaccat tgatagtggt    540
tttgtgccgg tgctgcatgg cgatgccgtt ctggataata ttctgggctg taccattctg    600
agcggcgatg ttattattcg tcatctggcc gatcatctga aaccggaata tgttgttttt    660
ctgaccgatg ttctgggtgt ttatgatcgc ccgccgagtc cgagcgaacc ggatgctgtt    720
ctgctgaaag aaattgccgt gggtgaagat ggcagttgga aagttgttaa tccgctgctg    780
gaacataccg ataaaaaagt ggattatagt gtggccgcac atgataccac cggtggtatg    840
gaaaccaaaa ttagtgaagc cgccatgatt gccaaactgg gcgttgatgt gtatattgtt    900
aaagcagcca ccacccatag ccagcgtgca ctgaatggcg atctgcgtga tagtgtgccg    960
gaagattggc tgggtaccat tattcgcttt agcaaatgac tgaaagctt               1009

<210> 489

<400> 489
000

<210> 490

<400> 490
000

<210> 491
<211> 5275
<212> DNA
<213> Artificial sequence

<220>
<223> Plasmid

<400> 491
ggggaattgt gagcggataa caattcccct gtagaaataa ttttgtttaa ctttaataag     60
gagatatacc atgctgacca ttctgaaact gggtggtagc attctgagtg ataaaaatgt    120
gccgtatagt attaagtggg ataatctgga acgcattgca atggaaatta aaaatgcact    180
ggattattac aagaaccaga ataaagaaat caagctgatt ctggtgcatg gtggtggtgc    240
cttcggccat ccggttgcaa aaaaatatct gaaaattgaa gacggtaaga agatcttcat    300
taatatggaa aagggcttct gggaaattca gcgtgccatg cgtcgcttca ataatattat    360
tattgacacc ctgcaaagct atgatattcc ggccgtgagt attcagccga gtagcttcgt    420
ggtgttcggc gataaactga tcttcgatac cagtgccatt aaagaaatgc tgaaacgtaa    480
tctggtgccg gtgattcatg gcgatattgt gattgatgat aaaaatggtt accgtattat    540
cagtggtgat gatattgttc cgtatctggc caatgaactg aaagccgatc tgattctgta    600
tgcaaccgat gttgatggcg tgctgattga taataaaccg attaaacgca ttgacaaaaa    660
caatatctat aagatcctga actacctgag tggtagcaat agtattgatg tgaccggtgg    720
tatgaaatat aaaattgata tgatccgcaa gaacaagtgt cgcggcttcg tgttcaatgg    780
caataaagca aataatatct acaaggccct gctgggcgaa gttgaaggca ccgaaattga    840
cttcagtgaa tgagcggccg cataatgctt aagtcgaaca gaaagtaatc gtattgtaca    900
cggccgcata atcgaaatta atacgactca ctatagggga attgtgagcg gataacaatt    960
ccccatctta gtatattagt taagtataag aaggagatat acatatgagc aaagttatca   1020
ccgatgtgtt ctataccgcc ttcaaaaccg ccctgccgct gaccagtagt ccgctggttc   1080
agtgtattac caatgaaatt accgttgaaa gcatggccaa tgccctgctg tatattgatg   1140
ccaaaccggt tatggcagat gatcagcgcg agttcccgga gttcttcgcc cagagcgatg   1200
cactgctgct gaatctgggc catattagtg aagtgcgcca gcagaatctg ctggcagcag   1260
gtaaattcgc ccaggcaacc aatcagccga ccgtgattga tctggtgggt gttagtgcaa   1320
cccagctgcg ttatgatctg ggccatcagc tgctggcaaa tcatccgaat gtggtgaaag   1380
gtaatattag tgaaatgcgc cgcttcgccg atctgaaaag taccggtcgc ggtgtggatg   1440
gtagccagct ggatcagagt gcaaccgcac tgggcgaact ggccgccagc ctgcaacagc   1500
tgacccaggc attcccgacc accaccttcc tggcaaccgg taaaattgat ctggttgtga   1560
gcgcaaaagg cacctggtat ctgaaaaatg gtgttccgca gctggatcgc ttcaccggca   1620
ccggtgatat tgtgggtgcc ctgattgccg cactgctggg taccggcctg gataatgatg   1680
cagccgttgt ggtggcagtg agttacttca attgctgtgg tgaagttgca gccgcacaga   1740
atcgcaccgg cggtctggca gccttccgcg aaggtaccct gaatcagctg agtctgctgg   1800
cagccaccgc cgattggctg caaatggtta aaggtgaagc actgtgactc gagtctggta   1860
aagaaaccgc tgctgcgaaa tttgaacgcc agcacatgga ctcgtctact agcgcagctt   1920
aattaaccta ggctgctgcc accgctgagc aataactagc ataacccctt ggggcctcta   1980
aacgggtctt gaggggtttt ttgctgaaac ctcaggcatt tgagaagcac acggtcacac   2040
tgcttccggt agtcaataaa ccggtaaacc agcaatagac ataagcggct atttaacgac   2100
cctgccctga accgacgacc gggtcatcgt ggccggatct tgcggcccct cggcttgaac   2160
gaattgttag acattatttg ccgactacct tggtgatctc gcctttcacg tagtggacaa   2220
attcttccaa ctgatctgcg cgcgaggcca agcgatcttc ttcttgtcca agataagcct   2280
gtctagcttc aagtatgacg ggctgatact gggccggcag gcgctccatt gcccagtcgg   2340
cagcgacatc cttcggcgcg attttgccgg ttactgcgct gtaccaaatg cgggacaacg   2400
taagcactac atttcgctca tcgccagccc agtcgggcgg cgagttccat agcgttaagg   2460
tttcatttag cgcctcaaat agatcctgtt caggaaccgg atcaaagagt tcctccgccg   2520
ctggacctac caaggcaacg ctatgttctc ttgcttttgt cagcaagata gccagatcaa   2580
tgtcgatcgt ggctggctcg aagatacctg caagaatgtc attgcgctgc cattctccaa   2640
attgcagttc gcgcttagct ggataacgcc acggaatgat gtcgtcgtgc acaacaatgg   2700
tgacttctac agcgcggaga atctcgctct ctccagggga agccgaagtt tccaaaaggt   2760
cgttgatcaa agctcgccgc gttgtttcat caagccttac ggtcaccgta accagcaaat   2820
caatatcact gtgtggcttc aggccgccat ccactgcgga gccgtacaaa tgtacggcca   2880
gcaacgtcgg ttcgagatgg cgctcgatga cgccaactac ctctgatagt tgagtcgata   2940
cttcggcgat caccgcttcc ctcatactct tcctttttca atattattga agcatttatc   3000
agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag   3060
ctagctcact cggtcgctac gctccgggcg tgagactgcg gcgggcgctg cggacacata   3120
caaagttacc cacagattcc gtggataagc aggggactaa catgtgaggc aaaacagcag   3180
ggccgcgccg gtggcgtttt tccataggct ccgccctcct gccagagttc acataaacag   3240
acgcttttcc ggtgcatctg tgggagccgt gaggctcaac catgaatctg acagtacggg   3300
cgaaacccga caggacttaa agatccccac cgtttccggc gggtcgctcc ctcttgcgct   3360
ctcctgttcc gaccctgccg tttaccggat acctgttccg cctttctccc ttacgggaag   3420
tgtggcgctt tctcatagct cacacactgg tatctcggct cggtgtaggt cgttcgctcc   3480
aagctgggct gtaagcaaga actccccgtt cagcccgact gctgcgcctt atccggtaac   3540
tgttcacttg agtccaaccc ggaaaagcac ggtaaaacgc cactggcagc agccattggt   3600
aactgggagt tcgcagagga tttgtttagc taaacacgcg gttgctcttg aagtgtgcgc   3660
caaagtccgg ctacactgga aggacagatt tggttgctgt gctctgcgaa agccagttac   3720
cacggttaag cagttcccca actgacttaa ccttcgatca aaccacctcc ccaggtggtt   3780
ttttcgttta cagggcaaaa gattacgcgc agaaaaaaag gatctcaaga agatcctttg   3840
atcttttcta ctgaaccgct ctagatttca gtgcaattta tctcttcaaa tgtagcacct   3900
gaagtcagcc ccatacgata taagttgtaa ttctcatgtt agtcatgccc cgcgcccacc   3960
ggaaggagct gactgggttg aaggctctca agggcatcgg tcgagatccc ggtgcctaat   4020
gagtgagcta acttacatta attgcgttgc gctcactgcc cgctttccag tcgggaaacc   4080
tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg   4140
ggcgccaggg tggtttttct tttcaccagt gagacgggca acagctgatt gcccttcacc   4200
gcctggccct gagagagttg cagcaagcgg tccacgctgg tttgccccag caggcgaaaa   4260
tcctgtttga tggtggttaa cggcgggata taacatgagc tgtcttcggt atcgtcgtat   4320
cccactaccg agatgtccgc accaacgcgc agcccggact cggtaatggc gcgcattgcg   4380
cccagcgcca tctgatcgtt ggcaaccagc atcgcagtgg gaacgatgcc ctcattcagc   4440
atttgcatgg tttgttgaaa accggacatg gcactccagt cgccttcccg ttccgctatc   4500
ggctgaattt gattgcgagt gagatattta tgccagccag ccagacgcag acgcgccgag   4560
acagaactta atgggcccgc taacagcgcg atttgctggt gacccaatgc gaccagatgc   4620
tccacgccca gtcgcgtacc gtcttcatgg gagaaaataa tactgttgat gggtgtctgg   4680
tcagagacat caagaaataa cgccggaaca ttagtgcagg cagcttccac agcaatggca   4740
tcctggtcat ccagcggata gttaatgatc agcccactga cgcgttgcgc gagaagattg   4800
tgcaccgccg ctttacaggc ttcgacgccg cttcgttcta ccatcgacac caccacgctg   4860
gcacccagtt gatcggcgcg agatttaatc gccgcgacaa tttgcgacgg cgcgtgcagg   4920
gccagactgg aggtggcaac gccaatcagc aacgactgtt tgcccgccag ttgttgtgcc   4980
acgcggttgg gaatgtaatt cagctccgcc atcgccgctt ccactttttc ccgcgttttc   5040
gcagaaacgt ggctggcctg gttcaccacg cgggaaacgg tctgataaga gacaccggca   5100
tactctgcga catcgtataa cgttactggt ttcacattca ccaccctgaa ttgactctct   5160
tccgggcgct atcatgccat accgcgaaag gttttgcgcc attcgatggt gtccgggatc   5220
tcgacgctct cccttatgcg actcctgcat taggaaatta atacgactca ctata        5275
