                         SEQUENCE LISTING

<110>  GeneSys Ltd
       Clark, Duncan R
       Nicholas, Morant
 
<120>  Enzyme

<130>  P1444PC00

<150>  GB0804721.9
<151>  2008-03-14

<150>  US 61/069429
<151>  2008-03-14

<160>  39    

<170>  PatentIn version 3.5

<210>  1
<211>  773
<212>  PRT
<213>  Palaeococcus helgesonii


<220>
<221>  MISC_FEATURE
<222>  (622)..(622)
<223>  Xaa is Leu, Pro, Gln or Arg

<400>  1

Met Ile Leu Asp Thr Asp Tyr Ile Thr Glu Asn Gly Lys Pro Val Ile 
1               5                   10                  15      


Arg Ile Phe Lys Lys Glu Asn Gly Glu Phe Lys Ile Glu Tyr Asp Arg 
            20                  25                  30          


Asn Phe Glu Pro Tyr Ile Tyr Ala Leu Leu Glu Asn Glu Glu Glu Ile 
        35                  40                  45              


Glu Asp Ile Lys Arg Ile Thr Ala Glu Arg His Gly Lys Lys Val Arg 
    50                  55                  60                  


Ile Val Arg Ala Glu Lys Val Lys Lys Lys Phe Leu Gly Glu Pro Ile 
65                  70                  75                  80  


Glu Val Trp Lys Leu Val Phe Glu His Pro Gln Asp Val Pro Asp Ile 
                85                  90                  95      


Ile Arg Lys His Pro Ala Val Val Asp Ile Tyr Glu Tyr Asp Ile Pro 
            100                 105                 110         


Phe Ala Lys Arg Tyr Leu Ile Asp Arg Gly Leu Val Pro Met Glu Gly 
        115                 120                 125             


Asp Glu Glu Leu Lys Met Leu Ala Phe Asp Ile Glu Thr Phe Tyr His 
    130                 135                 140                 


Glu Gly Asp Glu Phe Gly Glu Gly Glu Ile Leu Met Ile Ser Tyr Ala 
145                 150                 155                 160 


Asp Glu Gly Gly Ala Arg Val Ile Thr Trp Lys Arg Ile Asp Leu Pro 
                165                 170                 175     


Tyr Val Glu Thr Val Ser Thr Glu Arg Glu Ala Ile Lys Arg Phe Leu 
            180                 185                 190         


His Val Leu Lys Glu Lys Asp Pro Asp Val Leu Ile Thr Tyr Asn Gly 
        195                 200                 205             


Asp Asn Phe Asp Phe Ala Tyr Ile Lys Lys Arg Cys Glu Lys Leu Gly 
    210                 215                 220                 


Leu Lys Phe Thr Ile Gly Arg Asp Gly Ser Glu Pro Lys Ile Gln Arg 
225                 230                 235                 240 


Met Gly Asp Arg Phe Ala Val Glu Val Lys Gly Ile Lys Gly Arg Ile 
                245                 250                 255     


His Leu Asp Leu Tyr Pro Val Val Arg His Thr Ile Arg Leu Pro Thr 
            260                 265                 270         


Tyr Thr Leu Glu Ala Val Tyr Glu Ala Val Phe Gly Lys Arg Lys Glu 
        275                 280                 285             


Lys Val Tyr Ala Glu Glu Ile Ala Thr Ala Trp Lys Ser Glu Glu Gly 
    290                 295                 300                 


Leu Lys Arg Val Ala Gln Tyr Ser Met Glu Asp Ala Lys Ala Thr Tyr 
305                 310                 315                 320 


Glu Leu Gly Arg Glu Phe Phe Pro Met Glu Val Glu Leu Ala Lys Leu 
                325                 330                 335     


Ile Gly Gln Ser Val Trp Asp Val Ser Arg Ser Ser Thr Gly Asn Leu 
            340                 345                 350         


Val Glu Trp Tyr Leu Leu Arg Glu Ala Tyr Glu Arg Asn Glu Leu Ala 
        355                 360                 365             


Pro Asn Lys Pro Gly Asp Ala Glu Tyr Arg Lys Arg Met Arg Ser Ser 
    370                 375                 380                 


Tyr Leu Gly Gly Tyr Val Lys Glu Pro Glu Lys Gly Leu Trp Glu Ser 
385                 390                 395                 400 


Ile Ala Tyr Leu Asp Phe Arg Ser Leu Tyr Pro Ser Ile Ile Val Thr 
                405                 410                 415     


His Asn Val Ser Pro Asp Thr Leu Glu Arg Glu Cys Lys Asn Tyr Tyr 
            420                 425                 430         


Val Ala Pro Val Val Gly Tyr Arg Phe Cys Ser Asp Phe Lys Gly Phe 
        435                 440                 445             


Ile Pro Ser Ile Leu Glu Glu Leu Ile Glu Thr Arg Gln Lys Val Lys 
    450                 455                 460                 


Arg Lys Met Lys Ala Thr Ile Asp Pro Val Glu Arg Lys Met Leu Asp 
465                 470                 475                 480 


Tyr Arg Gln Arg Ala Leu Lys Ile Leu Ala Asn Ser Tyr Tyr Gly Tyr 
                485                 490                 495     


Thr Gly Tyr Pro Lys Ala Arg Trp Tyr Ser Lys Glu Cys Ala Glu Ser 
            500                 505                 510         


Val Thr Ala Trp Gly Arg His Tyr Ile Glu Thr Thr Ile Asn Glu Ala 
        515                 520                 525             


Glu Gly Phe Gly Phe Lys Val Leu Tyr Ala Asp Thr Asp Gly Phe Phe 
    530                 535                 540                 


Ala Thr Ile Pro Gly Glu Lys Pro Glu Val Ile Lys Lys Lys Ala Leu 
545                 550                 555                 560 


Glu Phe Leu Lys His Ile Asn Lys Lys Leu Pro Gly Met Leu Glu Leu 
                565                 570                 575     


Glu Tyr Glu Gly Phe Tyr Thr Arg Gly Phe Phe Val Thr Lys Lys Lys 
            580                 585                 590         


Tyr Ala Leu Ile Asp Glu Glu Gly His Ile Thr Thr Arg Gly Leu Glu 
        595                 600                 605             


Val Val Arg Arg Asp Trp Ser Glu Ile Ala Lys Glu Thr Xaa Ala Lys 
    610                 615                 620                 


Val Leu Glu Val Ile Leu Arg Glu Gly Ser Ile Glu Lys Ala Ala Gly 
625                 630                 635                 640 


Ile Val Lys Lys Val Val Glu Asp Leu Ala Asn Tyr Arg Val Pro Val 
                645                 650                 655     


Glu Lys Leu Val Ile His Glu Gln Ile Thr Arg Glu Leu Lys Asp Tyr 
            660                 665                 670         


Lys Ala Thr Gly Pro His Val Ala Ile Ala Lys Arg Leu Gln Ala Arg 
        675                 680                 685             


Gly Ile Lys Val Lys Pro Gly Thr Ile Ile Ser Tyr Val Val Leu Lys 
    690                 695                 700                 


Gly Ser Lys Lys Ile Ser Asp Arg Val Ile Leu Phe Asp Glu Tyr Asp 
705                 710                 715                 720 


Pro Gly Arg His Lys Tyr Asp Pro Asp Tyr Tyr Ile His Asn Gln Val 
                725                 730                 735     


Leu Pro Ala Val Leu Arg Ile Leu Glu Ala Phe Gly Tyr Lys Glu Lys 
            740                 745                 750         


Asp Leu Glu Tyr Gln Arg Met Arg Gln Met Gly Leu Gly Ala Trp Leu 
        755                 760                 765             


Gly Thr Gly Lys Gly 
    770             


<210>  2
<211>  2573
<212>  DNA
<213>  Palaeococcus helgesonii


<220>
<221>  misc_feature
<222>  (1865)..(1865)
<223>  n is a, c, g, or t

<400>  2
atgatacttg atacagatta tataacggag aatggaaaac ccgttatcag gatttttaag       60

aaggaaaacg gcgagtttaa aatagaatac gacaggaatt ttgagcccta catttacgcg      120

cttctggaga atgaggagga aatagaggac attaaaagga taaccgccga gaggcacgga      180

aaaaaagtga gaatcgtgcg ggctgagaag gttaagaaaa agttcctggg agagcccata      240

gaggtgtgga agcttgtttt tgagcatcca caggacgtcc cggacattat aaggaagcat      300

cctgccgttg tggacatcta cgagtacgat atacccttcg caaagcgcta cctcatagac      360

agagggcttg ttccgatgga gggcgacgag gagctcaaaa tgctggcttt tgatattgag      420

acgttctacc atgagggaga tgaattcgga gagggcgaaa ttttgatgat aagctacgcc      480

gatgagggcg gcgcgagggt gattacgtgg aagagaattg acctccccta tgtggaaacg      540

gtatccacag agagggaagc cataaagcgc ttcctccatg ttctgaagga aaaagatccg      600

gacgtgctca tcacgtacaa cggcgacaac ttcgattttg cttacataaa aaagcgctgt      660

gaaaagctcg ggttgaagtt cacaatcggg agggacggaa gcgaaccaaa aattcagagg      720

atgggggatc gcttcgccgt cgaggtcaag ggcatcaagg gcagaataca ccttgatctc      780

tatcccgtcg tgaggcacac aataaggctc cccacctata cgcttgaggc ggtctatgaa      840

gccgttttcg gaaagcgaaa ggagaaggtc tatgcagaag agatagcgac ggcatggaag      900

agtgaggagg ggcttaagag ggtcgcgcag tattcaatgg aggatgcaaa agccacatat      960

gagctcggaa gggagttctt cccgatggag gtggaactgg caaagctcat agggcagagc     1020

gtttgggacg tatcgaggtc aagcacgggc aacctggtgg agtggtacct cctgagagag     1080

gcatatgaga ggaacgagct cgcaccgaat aagccggggg atgcggaata caggaaaaga     1140

atgcgctctt cctatctcgg gggctacgtc aaggagcccg agaaaggatt atgggagagc     1200

atagcttatt tagattttcg cagcttgtac ccctccataa tcgtcaccca caacgtttct     1260

cccgatacgc ttgaaagaga atgcaaaaac tattatgtgg ctccagttgt tggctaccgc     1320

ttctgcagtg actttaaggg attcatccca agcatcctgg aggagctcat agaaaccagg     1380

cagaaggtta agaggaagat gaaggccacg attgaccccg tggagaggaa gatgctcgac     1440

tacaggcaga gggcattgaa gattctggcg aatagctatt acggttatac gggctatcca     1500

aaagcgcgct ggtattcgaa ggagtgtgcc gagagcgtca cggcatgggg gaggcactac     1560

atagagacca ctatcaatga ggcagaggga ttcgggttta aagtgctcta tgcggacact     1620

gatggctttt ttgcaacaat acccggtgaa aaaccggagg tcataaaaaa gaaggccttg     1680

gaattcctga aacacataaa taaaaagctc cccggaatgc tcgagcttga gtatgagggc     1740

ttctacacga ggggattctt cgtcaccaaa aagaagtacg ctctcattga tgaggagggg     1800

cacataacca cgaggggcct tgaggttgtg aggagggact ggagtgagat agcaaaggaa     1860

acccnagcta aagtgctgga ggtcatctta agggagggta gcattgaaaa ggcagcgggg     1920

atcgtgaaga aagttgttga ggatctggca aattaccgcg ttcccgtaga aaagctggtc     1980

attcacgagc agattacccg ggaattaaag gattataagg cgacgggacc ccacgtggcg     2040

atagcaaagc gccttcaggc aaggggcatc aaggtgaagc ccggcaccat aataagctat     2100

gttgttttga aggggagcaa gaagataagc gacagggtaa tcctgttcga tgagtacgac     2160

cccggcaggc ataagtatga cccagattac tacatccaca atcaggttct ccccgcggtt     2220

cttagaatac tcgaagcctt cggatacaag gagaaagatc tggagtacca gaggatgaga     2280

cagatgggac ttggggcgtg gcttggaacg gggaaggggt gagaggaaat atgccggtaa     2340

aagcctcatg gaattacttc ttccatcctt tcgtagattc cggcttttct caaaacctca     2400

cggcatgggg gaggcactac atagagacca ctatcaatga ggcagaggga ttcgggttta     2460

aagtgctcta tgcggacact gatggctttt ttgcaacaat acccggtgaa aaaccggagg     2520

tcataaaaaa gaaggccttg gaattcctga aacacataaa taaaaagctc ccc            2573


<210>  3
<211>  29
<212>  DNA
<213>  Artificial

<220>
<223>  Primer sequence


<220>
<221>  misc_feature
<222>  (18)..(18)
<223>  n is a, c, g, or t

<400>  3
cgcgggagaa cctggttntc datrtarta                                         29


<210>  4
<211>  9
<212>  PRT
<213>  Unknown

<220>
<223>  Protein fragment

<400>  4

Tyr Tyr Ile Glu Asn Gln Val Leu Pro 
1               5                   


<210>  5
<211>  29
<212>  DNA
<213>  Artificial

<220>
<223>  Primer sequence


<220>
<221>  misc_feature
<222>  (21)..(21)
<223>  n is a, c, g, or t

<400>  5
tactacggat aggccaargc nagrtggta                                         29


<210>  6
<211>  9
<212>  PRT
<213>  Unknown

<220>
<223>  Protein fragment


<220>
<221>  MISC_FEATURE
<222>  (4)..(4)
<223>  Xaa is STOP

<400>  6

Tyr Tyr Gly Xaa Ala Asn Ala Arg Trp 
1               5                   


<210>  7
<211>  18
<212>  DNA
<213>  Artificial

<220>
<223>  Primer sequence

<400>  7
tgtaaaacga cggccagt                                                     18


<210>  8
<211>  24
<212>  DNA
<213>  Artificial

<220>
<223>  Primer sequence

<400>  8
agcggataac aatttcacac agga                                              24


<210>  9
<211>  17
<212>  DNA
<213>  Artificial

<220>
<223>  Primer sequence

<400>  9
catccacagg acgtccc                                                      17


<210>  10
<211>  6
<212>  PRT
<213>  Unknown

<220>
<223>  Protein fragment

<400>  10

His Pro Gln Asp Val Pro 
1               5       


<210>  11
<211>  20
<212>  DNA
<213>  Artificial

<220>
<223>  Primer sequence

<400>  11
taaacccgaa tccctctgcc                                                   20


<210>  12
<211>  20
<212>  DNA
<213>  Artificial

<220>
<223>  Primer sequence

<400>  12
ttgtgtgcct cacgacggga                                                   20


<210>  13
<211>  22
<212>  DNA
<213>  Artificial

<220>
<223>  Primer sequence

<400>  13
gcaaggggca tcaaggtgaa gc                                                22


<210>  14
<211>  22
<212>  DNA
<213>  Artificial

<220>
<223>  Primer sequence

<400>  14
tgttttgaag gggagcaaga ag                                                22


<210>  15
<211>  22
<212>  DNA
<213>  Artificial

<220>
<223>  Primer sequence

<400>  15
gcttttctac gggaacgcgg ta                                                22


<210>  16
<211>  20
<212>  DNA
<213>  Artificial

<220>
<223>  Primer sequence

<400>  16
gtgacgctct cggcacactc                                                   20


<210>  17
<211>  20
<212>  DNA
<213>  Artificial

<220>
<223>  Primer sequence

<400>  17
ccacaacggc aggatgcttc                                                   20


<210>  18
<211>  20
<212>  DNA
<213>  Artificial

<220>
<223>  Primer sequence

<400>  18
tagatgtcca caacggcagg                                                   20


<210>  19
<211>  20
<212>  DNA
<213>  Artificial

<220>
<223>  Primer sequence

<400>  19
cagagggctt gttccgatgg                                                   20


<210>  20
<211>  40
<212>  DNA
<213>  Artificial

<220>
<223>  Primer sequence

<400>  20
aagcttccat gggtattctt gatacagatt atataacgga                             40


<210>  21
<211>  39
<212>  DNA
<213>  Artificial

<220>
<223>  Primer sequence

<400>  21
ggatccgtcg acttacccct tccccgttcc aagccacgc                              39


<210>  22
<211>  55
<212>  DNA
<213>  Artificial

<220>
<223>  Primer sequence

<400>  22
ttcccctcta gaaataattt tgtttaactt taagaaggag atatacatat gcacc            55


<210>  23
<211>  55
<212>  DNA
<213>  Artificial

<220>
<223>  Primer sequence

<400>  23
gaattcggat ccgctagcca tggtatggtg atggtgatgg tgcatatgta tatct            55


<210>  24
<211>  24
<212>  DNA
<213>  Artificial

<220>
<223>  Primer sequence

<400>  24
aaattaatac gactcactat aggg                                              24


<210>  25
<211>  19
<212>  DNA
<213>  Artificial

<220>
<223>  Primer sequence

<400>  25
gctagttatt gctcagcgg                                                    19


<210>  26
<211>  20
<212>  DNA
<213>  Artificial

<220>
<223>  Primer sequence

<400>  26
cctgctctgc cgcttcacgc                                                   20


<210>  27
<211>  24
<212>  DNA
<213>  Artificial

<220>
<223>  Primer sequence

<400>  27
ccatgattca gtgtgcccgt ctgg                                              24


<210>  28
<211>  23
<212>  PRT
<213>  Artificial

<220>
<223>  Consensus sequence


<220>
<221>  MISC_FEATURE
<222>  (2)..(2)
<223>  Xaa is Lys or Arg

<220>
<221>  MISC_FEATURE
<222>  (3)..(3)
<223>  Xaa is Glu or Asp

<220>
<221>  MISC_FEATURE
<222>  (4)..(4)
<223>  Xaa is Met or Ala

<220>
<221>  MISC_FEATURE
<222>  (7)..(7)
<223>  Xaa is Arg or Ala

<220>
<221>  MISC_FEATURE
<222>  (10)..(10)
<223>  Xaa is Lys, Arg or His

<220>
<221>  MISC_FEATURE
<222>  (11)..(11)
<223>  Xaa is Val or Ile

<220>
<221>  MISC_FEATURE
<222>  (12)..(12)
<223>  Xaa is Val, Ile or Leu

<220>
<221>  MISC_FEATURE
<222>  (13)..(13)
<223>  Xaa is Lys or Arg

<220>
<221>  MISC_FEATURE
<222>  (19)..(19)
<223>  Xaa is Val or Ile

<220>
<221>  MISC_FEATURE
<222>  (20)..(20)
<223>  Xaa is Leu or Ile

<220>
<221>  MISC_FEATURE
<222>  (21)..(21)
<223>  Xaa is Val or Ile

<400>  28

Glu Xaa Xaa Xaa Ile Lys Xaa Phe Leu Xaa Xaa Xaa Xaa Glu Lys Asp 
1               5                   10                  15      


Pro Asp Xaa Xaa Xaa Thr Tyr 
            20              


<210>  29
<211>  36
<212>  PRT
<213>  Artificial

<220>
<223>  Consensus sequence


<220>
<221>  MISC_FEATURE
<222>  (2)..(2)
<223>  Xaa is Tyr or Phe

<220>
<221>  MISC_FEATURE
<222>  (8)..(8)
<223>  Xaa is Lys or Arg

<220>
<221>  MISC_FEATURE
<222>  (12)..(12)
<223>  Xaa is Glu or Asp

<220>
<221>  MISC_FEATURE
<222>  (13)..(13)
<223>  Xaa is Asn, Gly or Ser

<220>
<221>  MISC_FEATURE
<222>  (14)..(14)
<223>  Xaa is Leu or Ile

<220>
<221>  MISC_FEATURE
<222>  (15)..(15)
<223>  Xaa is Val or Ala

<220>
<221>  MISC_FEATURE
<222>  (16)..(16)
<223>  Xaa is Tyr or Ser

<220>
<221>  MISC_FEATURE
<222>  (19)..(19)
<223>  Xaa is Tyr or Phe

<220>
<221>  MISC_FEATURE
<222>  (20)..(20)
<223>  Xaa is Lys or Arg

<220>
<221>  MISC_FEATURE
<222>  (21)..(21)
<223>  Xaa is Ser or Ala

<220>
<221>  MISC_FEATURE
<222>  (28)..(28)
<223>  Xaa is Val or Ile

<400>  29

Gly Xaa Val Lys Glu Pro Glu Xaa Gly Leu Trp Xaa Xaa Xaa Xaa Xaa 
1               5                   10                  15      


Leu Asp Xaa Xaa Xaa Leu Tyr Pro Ser Ile Ile Xaa Thr His Asn Val 
            20                  25                  30          


Ser Pro Asp Thr 
        35      


<210>  30
<211>  22
<212>  PRT
<213>  Artificial

<220>
<223>  Consensus sequence


<220>
<221>  MISC_FEATURE
<222>  (6)..(6)
<223>  Xaa is Leu or Ile

<220>
<221>  MISC_FEATURE
<222>  (8)..(8)
<223>  Xaa is Gly, Lys or Glu

<220>
<221>  MISC_FEATURE
<222>  (9)..(9)
<223>  Xaa is Asn, Asp, His or Glu

<220>
<221>  MISC_FEATURE
<222>  (11)..(11)
<223>  Xaa is Leu or Ile

<220>
<221>  MISC_FEATURE
<222>  (12)..(12)
<223>  Xaa is Glu or Asp

<220>
<221>  MISC_FEATURE
<222>  (13)..(13)
<223>  Xaa is Glu or Thr

<220>
<221>  MISC_FEATURE
<222>  (16)..(16)
<223>  Xaa is Lys or Glu

<220>
<221>  MISC_FEATURE
<222>  (17)..(17)
<223>  Xaa is Val or Ile

<220>
<221>  MISC_FEATURE
<222>  (19)..(19)
<223>  Xaa is Arg, Lys or Thr

<400>  30

Gly Phe Ile Pro Ser Xaa Leu Xaa Xaa Leu Xaa Xaa Xaa Arg Gln Xaa 
1               5                   10                  15      


Xaa Lys Xaa Lys Met Lys 
            20          


<210>  31
<211>  22
<212>  PRT
<213>  Artificial

<220>
<223>  Consensus sequence


<220>
<221>  MISC_FEATURE
<222>  (5)..(5)
<223>  Xaa is Lys or Arg

<220>
<221>  MISC_FEATURE
<222>  (7)..(7)
<223>  Xaa is Leu or Ile

<220>
<221>  MISC_FEATURE
<222>  (9)..(9)
<223>  Xaa is Leu or Ile

<220>
<221>  MISC_FEATURE
<222>  (14)..(14)
<223>  Xaa is Tyr or Phe

<220>
<221>  MISC_FEATURE
<222>  (18)..(18)
<223>  Xaa is Tyr or Thr

<220>
<221>  MISC_FEATURE
<222>  (21)..(21)
<223>  Xaa is Ala or Pro

<220>
<221>  MISC_FEATURE
<222>  (22)..(22)
<223>  Xaa is Lys or Arg

<400>  31

Asp Tyr Arg Gln Xaa Ala Xaa Lys Xaa Leu Ala Asn Ser Xaa Tyr Gly 
1               5                   10                  15      


Tyr Xaa Gly Tyr Xaa Xaa 
            20          


<210>  32
<211>  7
<212>  PRT
<213>  Artificial

<220>
<223>  Consensus sequence


<220>
<221>  MISC_FEATURE
<222>  (5)..(5)
<223>  Xaa is Phe or Leu

<220>
<221>  MISC_FEATURE
<222>  (6)..(6)
<223>  Xaa is Tyr, Phe or His

<400>  32

Asp Thr Asp Gly Xaa Xaa Ala 
1               5           


<210>  33
<211>  23
<212>  PRT
<213>  Artificial

<220>
<223>  Consensus sequence


<220>
<221>  MISC_FEATURE
<222>  (5)..(5)
<223>  Xaa is Gly or His

<220>
<221>  MISC_FEATURE
<222>  (6)..(6)
<223>  Xaa is Val or Ile

<220>
<221>  MISC_FEATURE
<222>  (7)..(7)
<223>  Xaa is Val, Thr or Ile

<220>
<221>  MISC_FEATURE
<222>  (13)..(13)
<223>  Xaa is Val or Ile

<220>
<221>  MISC_FEATURE
<222>  (20)..(20)
<223>  Xaa is Glu or Asp

<400>  33

Asp Glu Glu Gly Xaa Xaa Xaa Thr Arg Gly Leu Glu Xaa Val Arg Arg 
1               5                   10                  15      


Asp Trp Ser Xaa Ile Ala Lys 
            20              


<210>  34
<211>  15
<212>  PRT
<213>  Artificial

<220>
<223>  Consensus sequence


<220>
<221>  MISC_FEATURE
<222>  (7)..(7)
<223>  Xaa is Val or Ile

<400>  34

Leu Tyr Pro Ser Ile Ile Xaa Thr His Asn Val Ser Pro Asp Thr 
1               5                   10                  15  


<210>  35
<211>  16
<212>  PRT
<213>  Artificial

<220>
<223>  Consensus sequence


<220>
<221>  MISC_FEATURE
<222>  (6)..(6)
<223>  Xaa is Val or Ile

<220>
<221>  MISC_FEATURE
<222>  (13)..(13)
<223>  Xaa is Glu or Asp

<400>  35

Thr Arg Gly Leu Glu Xaa Val Arg Arg Asp Trp Ser Xaa Ile Ala Lys 
1               5                   10                  15      


<210>  36
<211>  2322
<212>  DNA
<213>  Palaeococcus helgesonii

<400>  36
atgatacttg atacagatta tataacggag aatggaaaac ccgttatcag gatttttaag       60

aaggaaaacg gcgagtttaa aatagaatac gacaggaatt ttgagcccta catttacgcg      120

cttctggaga atgaggagga aatagaggac attaaaagga taaccgccga gaggcacgga      180

aaaaaagtga gaatcgtgcg ggctgagaag gttaagaaaa agttcctggg agagcccata      240

gaggtgtgga agcttgtttt tgagcatcca caggacgtcc cggacattat aaggaagcat      300

cctgccgttg tggacatcta cgagtacgat atacccttcg caaagcgcta cctcatagac      360

agagggcttg ttccgatgga gggcgacgag gagctcaaaa tgctggcttt tgatattgag      420

acgttctacc atgagggaga tgaattcgga gagggcgaaa ttttgatgat aagctacgcc      480

gatgagggcg gcgcgagggt gattacgtgg aagagaattg acctccccta tgtggaaacg      540

gtatccacag agagggaagc cataaagcgc ttcctccatg ttctgaagga aaaagatccg      600

gacgtgctca tcacgtacaa cggcgacaac ttcgattttg cttacataaa aaagcgctgt      660

gaaaagctcg ggttgaagtt cacaatcggg agggacggaa gcgaaccaaa aattcagagg      720

atgggggatc gcttcgccgt cgaggtcaag ggcatcaagg gcagaataca ccttgatctc      780

tatcccgtcg tgaggcacac aataaggctc cccacctata cgcttgaggc ggtctatgaa      840

gccgttttcg gaaagcgaaa ggagaaggtc tatgcagaag agatagcgac ggcatggaag      900

agtgaggagg ggcttaagag ggtcgcgcag tattcaatgg aggatgcaaa agccacatat      960

gagctcggaa gggagttctt cccgatggag gtggaactgg caaagctcat agggcagagc     1020

gtttgggacg tatcgaggtc aagcacgggc aacctggtgg agtggtacct cctgagagag     1080

gcatatgaga ggaacgagct cgcaccgaat aagccggggg atgcggaata caggaaaaga     1140

atgcgctctt cctatctcgg gggctacgtc aaggagcccg agaaaggatt atgggagagc     1200

atagcttatt tagattttcg cagcttgtac ccctccataa tcgtcaccca caacgtttct     1260

cccgatacgc ttgaaagaga atgcaaaaac tattatgtgg ctccagttgt tggctaccgc     1320

ttctgcagtg actttaaggg attcatccca agcatcctgg aggagctcat agaaaccagg     1380

cagaaggtta agaggaagat gaaggccacg attgaccccg tggagaggaa gatgctcgac     1440

tacaggcaga gggcattgaa gattctggcg aatagctatt acggttatac gggctatcca     1500

aaagcgcgct ggtattcgaa ggagtgtgcc gagagcgtca cggcatgggg gaggcactac     1560

atagagacca ctatcaatga ggcagaggga ttcgggttta aagtgctcta tgcggacact     1620

gatggctttt ttgcaacaat acccggtgaa aaaccggagg tcataaaaaa gaaggccttg     1680

gaattcctga aacacataaa taaaaagctc cccggaatgc tcgagcttga gtatgagggc     1740

ttctacacga ggggattctt cgtcaccaaa aagaagtacg ctctcattga tgaggagggg     1800

cacataacca cgaggggcct tgaggttgtg aggagggact ggagtgagat agcaaaggaa     1860

acccaagcta aagtgctgga ggtcatctta agggagggta gcattgaaaa ggcagcgggg     1920

atcgtgaaga aagttgttga ggatctggca aattaccgcg ttcccgtaga aaagctggtc     1980

attcacgagc agattacccg ggaattaaag gattataagg cgacgggacc ccacgtggcg     2040

atagcaaagc gccttcaggc aaggggcatc aaggtgaagc ccggcaccat aataagctat     2100

gttgttttga aggggagcaa gaagataagc gacagggtaa tcctgttcga tgagtacgac     2160

cccggcaggc ataagtatga cccagattac tacatccaca atcaggttct ccccgcggtt     2220

cttagaatac tcgaagcctt cggatacaag gagaaagatc tggagtacca gaggatgaga     2280

cagatgggac ttggggcgtg gcttggaacg gggaaggggt ga                        2322


<210>  37
<211>  81
<212>  DNA
<213>  Artificial

<220>
<223>  Plasmid fragment

<400>  37
tctagaaata attttgttta actttaagaa ggagatatac atatgcacca tcaccatcac       60

cataccatgg ctagcggatc c                                                 81


<210>  38
<211>  9
<212>  PRT
<213>  Artificial

<220>
<223>  Protein tag sequence

<400>  38

Met His His His His His His Thr Met 
1               5                   


<210>  39
<211>  773
<212>  PRT
<213>  Palaeococcus helgesonii

<400>  39

Met Ile Leu Asp Thr Asp Tyr Ile Thr Glu Asn Gly Lys Pro Val Ile 
1               5                   10                  15      


Arg Ile Phe Lys Lys Glu Asn Gly Glu Phe Lys Ile Glu Tyr Asp Arg 
            20                  25                  30          


Asn Phe Glu Pro Tyr Ile Tyr Ala Leu Leu Glu Asn Glu Glu Glu Ile 
        35                  40                  45              


Glu Asp Ile Lys Arg Ile Thr Ala Glu Arg His Gly Lys Lys Val Arg 
    50                  55                  60                  


Ile Val Arg Ala Glu Lys Val Lys Lys Lys Phe Leu Gly Glu Pro Ile 
65                  70                  75                  80  


Glu Val Trp Lys Leu Val Phe Glu His Pro Gln Asp Val Pro Asp Ile 
                85                  90                  95      


Ile Arg Lys His Pro Ala Val Val Asp Ile Tyr Glu Tyr Asp Ile Pro 
            100                 105                 110         


Phe Ala Lys Arg Tyr Leu Ile Asp Arg Gly Leu Val Pro Met Glu Gly 
        115                 120                 125             


Asp Glu Glu Leu Lys Met Leu Ala Phe Asp Ile Glu Thr Phe Tyr His 
    130                 135                 140                 


Glu Gly Asp Glu Phe Gly Glu Gly Glu Ile Leu Met Ile Ser Tyr Ala 
145                 150                 155                 160 


Asp Glu Gly Gly Ala Arg Val Ile Thr Trp Lys Arg Ile Asp Leu Pro 
                165                 170                 175     


Tyr Val Glu Thr Val Ser Thr Glu Arg Glu Ala Ile Lys Arg Phe Leu 
            180                 185                 190         


His Val Leu Lys Glu Lys Asp Pro Asp Val Leu Ile Thr Tyr Asn Gly 
        195                 200                 205             


Asp Asn Phe Asp Phe Ala Tyr Ile Lys Lys Arg Cys Glu Lys Leu Gly 
    210                 215                 220                 


Leu Lys Phe Thr Ile Gly Arg Asp Gly Ser Glu Pro Lys Ile Gln Arg 
225                 230                 235                 240 


Met Gly Asp Arg Phe Ala Val Glu Val Lys Gly Ile Lys Gly Arg Ile 
                245                 250                 255     


His Leu Asp Leu Tyr Pro Val Val Arg His Thr Ile Arg Leu Pro Thr 
            260                 265                 270         


Tyr Thr Leu Glu Ala Val Tyr Glu Ala Val Phe Gly Lys Arg Lys Glu 
        275                 280                 285             


Lys Val Tyr Ala Glu Glu Ile Ala Thr Ala Trp Lys Ser Glu Glu Gly 
    290                 295                 300                 


Leu Lys Arg Val Ala Gln Tyr Ser Met Glu Asp Ala Lys Ala Thr Tyr 
305                 310                 315                 320 


Glu Leu Gly Arg Glu Phe Phe Pro Met Glu Val Glu Leu Ala Lys Leu 
                325                 330                 335     


Ile Gly Gln Ser Val Trp Asp Val Ser Arg Ser Ser Thr Gly Asn Leu 
            340                 345                 350         


Val Glu Trp Tyr Leu Leu Arg Glu Ala Tyr Glu Arg Asn Glu Leu Ala 
        355                 360                 365             


Pro Asn Lys Pro Gly Asp Ala Glu Tyr Arg Lys Arg Met Arg Ser Ser 
    370                 375                 380                 


Tyr Leu Gly Gly Tyr Val Lys Glu Pro Glu Lys Gly Leu Trp Glu Ser 
385                 390                 395                 400 


Ile Ala Tyr Leu Asp Phe Arg Ser Leu Tyr Pro Ser Ile Ile Val Thr 
                405                 410                 415     


His Asn Val Ser Pro Asp Thr Leu Glu Arg Glu Cys Lys Asn Tyr Tyr 
            420                 425                 430         


Val Ala Pro Val Val Gly Tyr Arg Phe Cys Ser Asp Phe Lys Gly Phe 
        435                 440                 445             


Ile Pro Ser Ile Leu Glu Glu Leu Ile Glu Thr Arg Gln Lys Val Lys 
    450                 455                 460                 


Arg Lys Met Lys Ala Thr Ile Asp Pro Val Glu Arg Lys Met Leu Asp 
465                 470                 475                 480 


Tyr Arg Gln Arg Ala Leu Lys Ile Leu Ala Asn Ser Tyr Tyr Gly Tyr 
                485                 490                 495     


Thr Gly Tyr Pro Lys Ala Arg Trp Tyr Ser Lys Glu Cys Ala Glu Ser 
            500                 505                 510         


Val Thr Ala Trp Gly Arg His Tyr Ile Glu Thr Thr Ile Asn Glu Ala 
        515                 520                 525             


Glu Gly Phe Gly Phe Lys Val Leu Tyr Ala Asp Thr Asp Gly Phe Phe 
    530                 535                 540                 


Ala Thr Ile Pro Gly Glu Lys Pro Glu Val Ile Lys Lys Lys Ala Leu 
545                 550                 555                 560 


Glu Phe Leu Lys His Ile Asn Lys Lys Leu Pro Gly Met Leu Glu Leu 
                565                 570                 575     


Glu Tyr Glu Gly Phe Tyr Thr Arg Gly Phe Phe Val Thr Lys Lys Lys 
            580                 585                 590         


Tyr Ala Leu Ile Asp Glu Glu Gly His Ile Thr Thr Arg Gly Leu Glu 
        595                 600                 605             


Val Val Arg Arg Asp Trp Ser Glu Ile Ala Lys Glu Thr Gln Ala Lys 
    610                 615                 620                 


Val Leu Glu Val Ile Leu Arg Glu Gly Ser Ile Glu Lys Ala Ala Gly 
625                 630                 635                 640 


Ile Val Lys Lys Val Val Glu Asp Leu Ala Asn Tyr Arg Val Pro Val 
                645                 650                 655     


Glu Lys Leu Val Ile His Glu Gln Ile Thr Arg Glu Leu Lys Asp Tyr 
            660                 665                 670         


Lys Ala Thr Gly Pro His Val Ala Ile Ala Lys Arg Leu Gln Ala Arg 
        675                 680                 685             


Gly Ile Lys Val Lys Pro Gly Thr Ile Ile Ser Tyr Val Val Leu Lys 
    690                 695                 700                 


Gly Ser Lys Lys Ile Ser Asp Arg Val Ile Leu Phe Asp Glu Tyr Asp 
705                 710                 715                 720 


Pro Gly Arg His Lys Tyr Asp Pro Asp Tyr Tyr Ile His Asn Gln Val 
                725                 730                 735     


Leu Pro Ala Val Leu Arg Ile Leu Glu Ala Phe Gly Tyr Lys Glu Lys 
            740                 745                 750         


Asp Leu Glu Tyr Gln Arg Met Arg Gln Met Gly Leu Gly Ala Trp Leu 
        755                 760                 765             


Gly Thr Gly Lys Gly 
    770             


