                         SEQUENCE LISTING

<110>  ERA BIOTECH, S.A.
 
<120>  SPLIT INTEINS AND USES THEREOF

<130>  P7749PC00

<150>  US 61/540101
<151>  2011-09-28

<150>  EP12171848
<151>  2012-06-13

<160>  108   

<170>  PatentIn version 3.5

<210>  1
<211>  678
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  GP-41.1 N-fragment DNA

<400>  1
ccatggccag ttggagccac ccgcagttcg aaaaagcgag caaagaaacc tttacccatt       60

accagccgca gggcaacagt gacccggctc ataccgcaac cgcgcccggc ggattgagtg      120

cgaaagcgcc tgcaatgacc ccgctgatgc tggacacctc cagccgtaag ctggttgcgt      180

gggatggcac caccgacggt gctgccgttg gcattcttgc ggttgctgct gaccagacca      240

gcaccacgct gacgttctac aagtccggca cgttccgtta tgaggatgtg ctctggccgg      300

aggctgccag cgacgagacg aaaaaacgga ccgcgtttgc cggaacggca atcagcatcg      360

ttggatccac ccgtagcggt tattgcctgg acctgaaaac ccaggtgcag accccgcagg      420

gcatgaagga gattagcaac attcaggtgg gcgacctggt tctgagcaac accggctata      480

atgaggtgct gaacgtgttc ccgaagagca aaaagaagag ctacaagatc acgctggagg      540

acggcaagga aatcatttgc agcgaagaac atctgtttcc gacccagacc ggcgaaatga      600

atattagcgg tggcctgaaa gaaggcatgt gcctgtatgt gaaagagggc ggtcaccacc      660

atcatcacca ctaagctt                                                    678


<210>  2
<211>  223
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  GP-41.1 N-fragment Protein

<400>  2

Met Ala Ser Trp Ser His Pro Gln Phe Glu Lys Ala Ser Lys Glu Thr 
1               5                   10                  15      


Phe Thr His Tyr Gln Pro Gln Gly Asn Ser Asp Pro Ala His Thr Ala 
            20                  25                  30          


Thr Ala Pro Gly Gly Leu Ser Ala Lys Ala Pro Ala Met Thr Pro Leu 
        35                  40                  45              


Met Leu Asp Thr Ser Ser Arg Lys Leu Val Ala Trp Asp Gly Thr Thr 
    50                  55                  60                  


Asp Gly Ala Ala Val Gly Ile Leu Ala Val Ala Ala Asp Gln Thr Ser 
65                  70                  75                  80  


Thr Thr Leu Thr Phe Tyr Lys Ser Gly Thr Phe Arg Tyr Glu Asp Val 
                85                  90                  95      


Leu Trp Pro Glu Ala Ala Ser Asp Glu Thr Lys Lys Arg Thr Ala Phe 
            100                 105                 110         


Ala Gly Thr Ala Ile Ser Ile Val Gly Ser Thr Arg Ser Gly Tyr Cys 
        115                 120                 125             


Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu Ile 
    130                 135                 140                 


Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr Asn 
145                 150                 155                 160 


Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys Ile 
                165                 170                 175     


Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu Phe 
            180                 185                 190         


Pro Thr Gln Thr Gly Glu Met Asn Ile Ser Gly Gly Leu Lys Glu Gly 
        195                 200                 205             


Met Cys Leu Tyr Val Lys Glu Gly Gly His His His His His His 
    210                 215                 220             


<210>  3
<211>  88
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  GP 41.1 (InteinN)

<400>  3

Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu 
1               5                   10                  15      


Ile Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr 
            20                  25                  30          


Asn Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys 
        35                  40                  45              


Ile Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu 
    50                  55                  60                  


Phe Pro Thr Gln Thr Gly Glu Met Asn Ile Ser Gly Gly Leu Lys Glu 
65                  70                  75                  80  


Gly Met Cys Leu Tyr Val Lys Glu 
                85              


<210>  4
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  GP 41.1 (ExteinN)

<400>  4

Thr Arg Ser Gly Tyr 
1               5   


<210>  5
<211>  520
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  GP41.1 C-fragment DNA

<400>  5
catatgggca aaaacagcat gatgctgaag aagatcctga agatcgagga gctggacgag       60

cgcgagctga ttgatatcga agtgagcggc aaccacctgt tctacgccaa tgacattctg      120

acgcataata gcagcagcga tgtgggtacc ggatctgata aaattattca tctgactgat      180

gattcttttg atactgatgt acttaaggca gatggtgcaa tcctggttga tttctgggca      240

cactggtgcg gtccgtgcaa aatgatcgct ccgattctgg atgaaatcgc tgacgaatat      300

cagggcaaac tgaccgttgc aaaactgaac atcgatcaca acccgggcac tgcgccgaaa      360

tatggcatcc gtggtatccc gactctgctg ctgttcaaaa acggtgaagt ggcggcaacc      420

aaagtgggtg cactgtctaa aggtcagttg aaagagttcc tcgacgctaa cctggccggc      480

tctgaattca gatctcatca ccatcaccat cactaagctt                            520


<210>  6
<211>  170
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  GP41.1 C-fragment Protein

<400>  6

Met Gly Lys Asn Ser Met Met Leu Lys Lys Ile Leu Lys Ile Glu Glu 
1               5                   10                  15      


Leu Asp Glu Arg Glu Leu Ile Asp Ile Glu Val Ser Gly Asn His Leu 
            20                  25                  30          


Phe Tyr Ala Asn Asp Ile Leu Thr His Asn Ser Ser Ser Asp Val Gly 
        35                  40                  45              


Thr Gly Ser Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr 
    50                  55                  60                  


Asp Val Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala His 
65                  70                  75                  80  


Trp Cys Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala 
                85                  90                  95      


Asp Glu Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp His 
            100                 105                 110         


Asn Pro Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu 
        115                 120                 125             


Leu Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu 
    130                 135                 140                 


Ser Lys Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser 
145                 150                 155                 160 


Glu Phe Arg Ser His His His His His His 
                165                 170 


<210>  7
<211>  37
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  GP 41.1 (InteinC)

<400>  7

Met Met Leu Lys Lys Ile Leu Lys Ile Glu Glu Leu Asp Glu Arg Glu 
1               5                   10                  15      


Leu Ile Asp Ile Glu Val Ser Gly Asn His Leu Phe Tyr Ala Asn Asp 
            20                  25                  30          


Ile Leu Thr His Asn 
        35          


<210>  8
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  GP 41.1 (ExteinC)

<400>  8

Ser Ser Ser Asp Val 
1               5   


<210>  9
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  E. coli enhancer

<400>  9

Met Gly Lys Asn Ser 
1               5   


<210>  10
<211>  681
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  GP 41.8 N-fragment DNA

<400>  10
ccatggccag ttggagccac ccgcagttcg aaaaagcgag caaagaaacc tttacccatt       60

accagccgca gggcaacagt gacccggctc ataccgcaac cgcgcccggc ggattgagtg      120

cgaaagcgcc tgcaatgacc ccgctgatgc tggacacctc cagccgtaag ctggttgcgt      180

gggatggcac caccgacggt gctgccgttg gcattcttgc ggttgctgct gaccagacca      240

gcaccacgct gacgttctac aagtccggca cgttccgtta tgaggatgtg ctctggccgg      300

aggctgccag cgacgagacg aaaaaacgga ccgcgtttgc cggaacggca atcagcatcg      360

ttggatccag ccaactgaat cgttgcctga gcctggatac gatggttgtg accaatggca      420

aagcgattga gattcgtgat gtgaaagtgg gcgattggct ggaaagcgaa tgtggcccgg      480

tgcaggtgac cgaagtgctg ccgattatca agcagccggt gtttgaaatt gtgctgaaga      540

gcggcaaaaa gatccgtgtg agcgcgaatc ataaattccc gaccaaagat ggcctgaaaa      600

ccatcaatag cggtctgaaa gttggcgact tcctgcgtag ccgtgcgaaa ggcggccatc      660

atcaccacca tcactaagct t                                                681


<210>  11
<211>  224
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  GP 41.8 N-fragment PROTEIN

<400>  11

Met Ala Ser Trp Ser His Pro Gln Phe Glu Lys Ala Ser Lys Glu Thr 
1               5                   10                  15      


Phe Thr His Tyr Gln Pro Gln Gly Asn Ser Asp Pro Ala His Thr Ala 
            20                  25                  30          


Thr Ala Pro Gly Gly Leu Ser Ala Lys Ala Pro Ala Met Thr Pro Leu 
        35                  40                  45              


Met Leu Asp Thr Ser Ser Arg Lys Leu Val Ala Trp Asp Gly Thr Thr 
    50                  55                  60                  


Asp Gly Ala Ala Val Gly Ile Leu Ala Val Ala Ala Asp Gln Thr Ser 
65                  70                  75                  80  


Thr Thr Leu Thr Phe Tyr Lys Ser Gly Thr Phe Arg Tyr Glu Asp Val 
                85                  90                  95      


Leu Trp Pro Glu Ala Ala Ser Asp Glu Thr Lys Lys Arg Thr Ala Phe 
            100                 105                 110         


Ala Gly Thr Ala Ile Ser Ile Val Gly Ser Ser Gln Leu Asn Arg Cys 
        115                 120                 125             


Leu Ser Leu Asp Thr Met Val Val Thr Asn Gly Lys Ala Ile Glu Ile 
    130                 135                 140                 


Arg Asp Val Lys Val Gly Asp Trp Leu Glu Ser Glu Cys Gly Pro Val 
145                 150                 155                 160 


Gln Val Thr Glu Val Leu Pro Ile Ile Lys Gln Pro Val Phe Glu Ile 
                165                 170                 175     


Val Leu Lys Ser Gly Lys Lys Ile Arg Val Ser Ala Asn His Lys Phe 
            180                 185                 190         


Pro Thr Lys Asp Gly Leu Lys Thr Ile Asn Ser Gly Leu Lys Val Gly 
        195                 200                 205             


Asp Phe Leu Arg Ser Arg Ala Lys Gly Gly His His His His His His 
    210                 215                 220                 


<210>  12
<211>  89
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  GP41.8 (Intein-N)

<400>  12

Cys Leu Ser Leu Asp Thr Met Val Val Thr Asn Gly Lys Ala Ile Glu 
1               5                   10                  15      


Ile Arg Asp Val Lys Val Gly Asp Trp Leu Glu Ser Glu Cys Gly Pro 
            20                  25                  30          


Val Gln Val Thr Glu Val Leu Pro Ile Ile Lys Gln Pro Val Phe Glu 
        35                  40                  45              


Ile Val Leu Lys Ser Gly Lys Lys Ile Arg Val Ser Ala Asn His Lys 
    50                  55                  60                  


Phe Pro Thr Lys Asp Gly Leu Lys Thr Ile Asn Ser Gly Leu Lys Val 
65                  70                  75                  80  


Gly Asp Phe Leu Arg Ser Arg Ala Lys 
                85                  


<210>  13
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  GP41.8 (Extein-N)

<400>  13

Ser Gln Leu Asn Arg 
1               5   


<210>  14
<211>  529
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  GP41.8 C-term DNA

<400>  14
catatgtgcg agatcttcga gaacgagatc gactgggatg aaatcgcgag cattgagtat       60

gtgggcgttg aggagaccat tgacatcaac gtgacgaacg accgcctgtt cttcgcaaac      120

ggcattctga cccataatag cgcggtggaa gagggtaccg gatctgataa aattattcat      180

ctgactgatg attcttttga tactgatgta cttaaggcag atggtgcaat cctggttgat      240

ttctgggcac actggtgcgg tccgtgcaaa atgatcgctc cgattctgga tgaaatcgct      300

gacgaatatc agggcaaact gaccgttgca aaactgaaca tcgatcacaa cccgggcact      360

gcgccgaaat atggcatccg tggtatcccg actctgctgc tgttcaaaaa cggtgaagtg      420

gcggcaacca aagtgggtgc actgtctaaa ggtcagttga aagagttcct cgacgctaac      480

ctggccggct ctgaattcag atctcatcac catcaccatc actaagctt                  529


<210>  15
<211>  173
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  GP41.8 C-term PROTEIN

<400>  15

Met Cys Glu Ile Phe Glu Asn Glu Ile Asp Trp Asp Glu Ile Ala Ser 
1               5                   10                  15      


Ile Glu Tyr Val Gly Val Glu Glu Thr Ile Asp Ile Asn Val Thr Asn 
            20                  25                  30          


Asp Arg Leu Phe Phe Ala Asn Gly Ile Leu Thr His Asn Ser Ala Val 
        35                  40                  45              


Glu Glu Gly Thr Gly Ser Asp Lys Ile Ile His Leu Thr Asp Asp Ser 
    50                  55                  60                  


Phe Asp Thr Asp Val Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe 
65                  70                  75                  80  


Trp Ala His Trp Cys Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp 
                85                  90                  95      


Glu Ile Ala Asp Glu Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn 
            100                 105                 110         


Ile Asp His Asn Pro Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile 
        115                 120                 125             


Pro Thr Leu Leu Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val 
    130                 135                 140                 


Gly Ala Leu Ser Lys Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu 
145                 150                 155                 160 


Ala Gly Ser Glu Phe Arg Ser His His His His His His 
                165                 170             


<210>  16
<211>  45
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  GP41.8 (InteinC)

<400>  16

Met Cys Glu Ile Phe Glu Asn Glu Ile Asp Trp Asp Glu Ile Ala Ser 
1               5                   10                  15      


Ile Glu Tyr Val Gly Val Glu Glu Thr Ile Asp Ile Asn Val Thr Asn 
            20                  25                  30          


Asp Arg Leu Phe Phe Ala Asn Gly Ile Leu Thr His Asn 
        35                  40                  45  


<210>  17
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  GP41.8 (ExteinC)

<400>  17

Ser Ala Val Glu Glu 
1               5   


<210>  18
<211>  729
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  NrdJ1 N-term DNA

<400>  18
ccatggccag ttggagccac ccgcagttcg aaaaagcgag caaagaaacc tttacccatt       60

accagccgca gggcaacagt gacccggctc ataccgcaac cgcgcccggc ggattgagtg      120

cgaaagcgcc tgcaatgacc ccgctgatgc tggacacctc cagccgtaag ctggttgcgt      180

gggatggcac caccgacggt gctgccgttg gcattcttgc ggttgctgct gaccagacca      240

gcaccacgct gacgttctac aagtccggca cgttccgtta tgaggatgtg ctctggccgg      300

aggctgccag cgacgagacg aaaaaacgga ccgcgtttgc cggaacggca atcagcatcg      360

ttggatccgg caccaatccg tgttgcctgg tgggcagcag cgagatcatc acccgtaact      420

acggcaaaac cacgatcaaa gaggtggttg agatcttcga caacgacaag aatatccagg      480

tgctggcgtt caacacccac acggacaata tcgaatgggc cccaattaaa gcggcgcaac      540

tgacccgtcc aaacgcagag ctggtggaac tggaaattaa caccctgcat ggcgtgaaaa      600

ccatccgttg caccccggat catccagtgt ataccaaaaa tcgtgactat gtgcgcgccg      660

atgagctgac cgatgatgat gaactggtgg tggcgattgg cggccatcac caccatcacc      720

actaagctt                                                              729


<210>  19
<211>  240
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  NrdJ1 N-term PROTEIN

<400>  19

Met Ala Ser Trp Ser His Pro Gln Phe Glu Lys Ala Ser Lys Glu Thr 
1               5                   10                  15      


Phe Thr His Tyr Gln Pro Gln Gly Asn Ser Asp Pro Ala His Thr Ala 
            20                  25                  30          


Thr Ala Pro Gly Gly Leu Ser Ala Lys Ala Pro Ala Met Thr Pro Leu 
        35                  40                  45              


Met Leu Asp Thr Ser Ser Arg Lys Leu Val Ala Trp Asp Gly Thr Thr 
    50                  55                  60                  


Asp Gly Ala Ala Val Gly Ile Leu Ala Val Ala Ala Asp Gln Thr Ser 
65                  70                  75                  80  


Thr Thr Leu Thr Phe Tyr Lys Ser Gly Thr Phe Arg Tyr Glu Asp Val 
                85                  90                  95      


Leu Trp Pro Glu Ala Ala Ser Asp Glu Thr Lys Lys Arg Thr Ala Phe 
            100                 105                 110         


Ala Gly Thr Ala Ile Ser Ile Val Gly Ser Gly Thr Asn Pro Cys Cys 
        115                 120                 125             


Leu Val Gly Ser Ser Glu Ile Ile Thr Arg Asn Tyr Gly Lys Thr Thr 
    130                 135                 140                 


Ile Lys Glu Val Val Glu Ile Phe Asp Asn Asp Lys Asn Ile Gln Val 
145                 150                 155                 160 


Leu Ala Phe Asn Thr His Thr Asp Asn Ile Glu Trp Ala Pro Ile Lys 
                165                 170                 175     


Ala Ala Gln Leu Thr Arg Pro Asn Ala Glu Leu Val Glu Leu Glu Ile 
            180                 185                 190         


Asn Thr Leu His Gly Val Lys Thr Ile Arg Cys Thr Pro Asp His Pro 
        195                 200                 205             


Val Tyr Thr Lys Asn Arg Asp Tyr Val Arg Ala Asp Glu Leu Thr Asp 
    210                 215                 220                 


Asp Asp Glu Leu Val Val Ala Ile Gly Gly His His His His His His 
225                 230                 235                 240 


<210>  20
<211>  105
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  NrdJ1 (InteinN)

<400>  20

Cys Leu Val Gly Ser Ser Glu Ile Ile Thr Arg Asn Tyr Gly Lys Thr 
1               5                   10                  15      


Thr Ile Lys Glu Val Val Glu Ile Phe Asp Asn Asp Lys Asn Ile Gln 
            20                  25                  30          


Val Leu Ala Phe Asn Thr His Thr Asp Asn Ile Glu Trp Ala Pro Ile 
        35                  40                  45              


Lys Ala Ala Gln Leu Thr Arg Pro Asn Ala Glu Leu Val Glu Leu Glu 
    50                  55                  60                  


Ile Asn Thr Leu His Gly Val Lys Thr Ile Arg Cys Thr Pro Asp His 
65                  70                  75                  80  


Pro Val Tyr Thr Lys Asn Arg Asp Tyr Val Arg Ala Asp Glu Leu Thr 
                85                  90                  95      


Asp Asp Asp Glu Leu Val Val Ala Ile 
            100                 105 


<210>  21
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  NrdJ1 (ExteinN)

<400>  21

Gly Thr Asn Pro Cys 
1               5   


<210>  22
<211>  514
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  NrdJ1 C-term DNA

<400>  22
catatggaag cgaagaccta catcggtaaa ctgaagagcc gcaagattgt tagcaacgag       60

gacacctacg atatccagac cagcacgcat aatttctttg cgaacgacat cctggtgcac      120

aacagcgaaa ttgtgctggg taccggatct gataaaatta ttcatctgac tgatgattct      180

tttgatactg atgtacttaa ggcagatggt gcaatcctgg ttgatttctg ggcacactgg      240

tgcggtccgt gcaaaatgat cgctccgatt ctggatgaaa tcgctgacga atatcagggc      300

aaactgaccg ttgcaaaact gaacatcgat cacaacccgg gcactgcgcc gaaatatggc      360

atccgtggta tcccgactct gctgctgttc aaaaacggtg aagtggcggc aaccaaagtg      420

ggtgcactgt ctaaaggtca gttgaaagag ttcctcgacg ctaacctggc cggctctgaa      480

ttcagatctc atcaccatca ccatcactaa gctt                                  514


<210>  23
<211>  168
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  NrdJ1 C-term PROTEIN

<400>  23

Met Glu Ala Lys Thr Tyr Ile Gly Lys Leu Lys Ser Arg Lys Ile Val 
1               5                   10                  15      


Ser Asn Glu Asp Thr Tyr Asp Ile Gln Thr Ser Thr His Asn Phe Phe 
            20                  25                  30          


Ala Asn Asp Ile Leu Val His Asn Ser Glu Ile Val Leu Gly Thr Gly 
        35                  40                  45              


Ser Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr Asp Val 
    50                  55                  60                  


Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala His Trp Cys 
65                  70                  75                  80  


Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp Glu 
                85                  90                  95      


Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp His Asn Pro 
            100                 105                 110         


Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu Leu 
        115                 120                 125             


Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser Lys 
    130                 135                 140                 


Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser Glu Phe 
145                 150                 155                 160 


Arg Ser His His His His His His 
                165             


<210>  24
<211>  40
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  NrdJ1 (InteinC)

<400>  24

Met Glu Ala Lys Thr Tyr Ile Gly Lys Leu Lys Ser Arg Lys Ile Val 
1               5                   10                  15      


Ser Asn Glu Asp Thr Tyr Asp Ile Gln Thr Ser Thr His Asn Phe Phe 
            20                  25                  30          


Ala Asn Asp Ile Leu Val His Asn 
        35                  40  


<210>  25
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  NrdJ1 (ExteinC)

<400>  25

Ser Glu Ile Val Leu 
1               5   


<210>  26
<211>  681
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  DNA-E N-term DNA

<400>  26
ccatggccag ttggagccac ccgcagttcg aaaaagcgag caaagaaacc tttacccatt       60

accagccgca gggcaacagt gacccggctc ataccgcaac cgcgcccggc ggattgagtg      120

cgaaagcgcc tgcaatgacc ccgctgatgc tggacacctc cagccgtaag ctggttgcgt      180

gggatggcac caccgacggt gctgccgttg gcattcttgc ggttgctgct gaccagacca      240

gcaccacgct gacgttctac aagtccggca cgttccgtta tgaggatgtg ctctggccgg      300

aggctgccag cgacgagacg aaaaaacgga ccgcgtttgc cggaacggca atcagcatcg      360

ttggatcctg tttaagctat gaaacggaaa tattgacagt agaatatgga ttattaccga      420

ttggtaaaat tgtagaaaag cgcatcgaat gtactgttta tagcgttgat aataatggaa      480

atatttatac acaacctgta gcacaatggc acgatcgcgg agaacaagag gtgtttgagt      540

attgtttgga agatggttca ttgattcggg caacaaaaga ccataagttt atgactgttg      600

atggtcaaat gttgccaatt gatgaaatat ttgaacgtga attggatttg atgcgggttg      660

ataatttgcc gaattaagct t                                                681


<210>  27
<211>  224
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  DNA-E N-term PROTEIN

<400>  27

Met Ala Ser Trp Ser His Pro Gln Phe Glu Lys Ala Ser Lys Glu Thr 
1               5                   10                  15      


Phe Thr His Tyr Gln Pro Gln Gly Asn Ser Asp Pro Ala His Thr Ala 
            20                  25                  30          


Thr Ala Pro Gly Gly Leu Ser Ala Lys Ala Pro Ala Met Thr Pro Leu 
        35                  40                  45              


Met Leu Asp Thr Ser Ser Arg Lys Leu Val Ala Trp Asp Gly Thr Thr 
    50                  55                  60                  


Asp Gly Ala Ala Val Gly Ile Leu Ala Val Ala Ala Asp Gln Thr Ser 
65                  70                  75                  80  


Thr Thr Leu Thr Phe Tyr Lys Ser Gly Thr Phe Arg Tyr Glu Asp Val 
                85                  90                  95      


Leu Trp Pro Glu Ala Ala Ser Asp Glu Thr Lys Lys Arg Thr Ala Phe 
            100                 105                 110         


Ala Gly Thr Ala Ile Ser Ile Val Gly Ser Cys Leu Ser Tyr Glu Thr 
        115                 120                 125             


Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile Val 
    130                 135                 140                 


Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly Asn 
145                 150                 155                 160 


Ile Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg Gly Glu Gln Glu 
                165                 170                 175     


Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr Lys 
            180                 185                 190         


Asp His Lys Phe Met Thr Val Asp Gly Gln Met Leu Pro Ile Asp Glu 
        195                 200                 205             


Ile Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro Asn 
    210                 215                 220                 


<210>  28
<211>  102
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  DNA-E (InteinN)

<400>  28

Cys Leu Ser Tyr Glu Thr Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu 
1               5                   10                  15      


Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser 
            20                  25                  30          


Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro Val Ala Gln Trp His 
        35                  40                  45              


Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser 
    50                  55                  60                  


Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met Thr Val Asp Gly Gln 
65                  70                  75                  80  


Met Leu Pro Ile Asp Glu Ile Phe Glu Arg Glu Leu Asp Leu Met Arg 
                85                  90                  95      


Val Asp Asn Leu Pro Asn 
            100         


<210>  29
<211>  496
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  DNA-E C-term DNA

<400>  29
catatgatca aaatagccac acgtaaatat ttaggcaaac aaaatgtcta tgacattgga       60

gttgagcgcg accataattt tgcactcaaa aatggcttca tagcttctaa ttgtttcaat      120

ggtaccggat ctgataaaat tattcatctg actgatgatt cttttgatac tgatgtactt      180

aaggcagatg gtgcaatcct ggttgatttc tgggcacact ggtgcggtcc gtgcaaaatg      240

atcgctccga ttctggatga aatcgctgac gaatatcagg gcaaactgac cgttgcaaaa      300

ctgaacatcg atcacaaccc gggcactgcg ccgaaatatg gcatccgtgg tatcccgact      360

ctgctgctgt tcaaaaacgg tgaagtggcg gcaaccaaag tgggtgcact gtctaaaggt      420

cagttgaaag agttcctcga cgctaacctg gccggctctg aattcagatc tcatcaccat      480

caccatcact aagctt                                                      496


<210>  30
<211>  162
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  DNA-E C-term PROTEIN

<400>  30

Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr 
1               5                   10                  15      


Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe 
            20                  25                  30          


Ile Ala Ser Asn Cys Phe Asn Gly Thr Gly Ser Asp Lys Ile Ile His 
        35                  40                  45              


Leu Thr Asp Asp Ser Phe Asp Thr Asp Val Leu Lys Ala Asp Gly Ala 
    50                  55                  60                  


Ile Leu Val Asp Phe Trp Ala His Trp Cys Gly Pro Cys Lys Met Ile 
65                  70                  75                  80  


Ala Pro Ile Leu Asp Glu Ile Ala Asp Glu Tyr Gln Gly Lys Leu Thr 
                85                  90                  95      


Val Ala Lys Leu Asn Ile Asp His Asn Pro Gly Thr Ala Pro Lys Tyr 
            100                 105                 110         


Gly Ile Arg Gly Ile Pro Thr Leu Leu Leu Phe Lys Asn Gly Glu Val 
        115                 120                 125             


Ala Ala Thr Lys Val Gly Ala Leu Ser Lys Gly Gln Leu Lys Glu Phe 
    130                 135                 140                 


Leu Asp Ala Asn Leu Ala Gly Ser Glu Phe Arg Ser His His His His 
145                 150                 155                 160 


His His 
        


<210>  31
<211>  36
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  DNA-E (InteinC)

<400>  31

Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr 
1               5                   10                  15      


Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe 
            20                  25                  30          


Ile Ala Ser Asn 
        35      


<210>  32
<211>  717
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  IMPDH N-term DNA

<400>  32
ccatggccag ttggagccac ccgcagttcg aaaaagcgag caaagaaacc tttacccatt       60

accagccgca gggcaacagt gacccggctc ataccgcaac cgcgcccggc ggattgagtg      120

cgaaagcgcc tgcaatgacc ccgctgatgc tggacacctc cagccgtaag ctggttgcgt      180

gggatggcac caccgacggt gctgccgttg gcattcttgc ggttgctgct gaccagacca      240

gcaccacgct gacgttctac aagtccggca cgttccgtta tgaggatgtg ctctggccgg      300

aggctgccag cgacgagacg aaaaaacgga ccgcgtttgc cggaacggca atcagcatcg      360

ttggatccgg cattggcggt ggctgctttg tgccgggcac cctggtgaac acggaaaacg      420

gcctgaagaa aatcgaggaa attaaggtgg gcgacaaggt gttcagccat accggcaaac      480

tgcaggaagt tgtggacacg ctgatctttg accgcgacga agaaatcatc agcattaacg      540

gcatcgactg cacgaaaaac cacgagttct acgtgatcga caaggagaac gcgaaccgtg      600

tgaacgaaga caatatccat ctgttcgcgc gttgggttca cgcggaggag ctggacatga      660

aaaaacatct gctgattgag ctggaaggcg gccatcatca ccaccaccac taagctt         717


<210>  33
<211>  236
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  IMPDH N-term PROTEIN

<400>  33

Met Ala Ser Trp Ser His Pro Gln Phe Glu Lys Ala Ser Lys Glu Thr 
1               5                   10                  15      


Phe Thr His Tyr Gln Pro Gln Gly Asn Ser Asp Pro Ala His Thr Ala 
            20                  25                  30          


Thr Ala Pro Gly Gly Leu Ser Ala Lys Ala Pro Ala Met Thr Pro Leu 
        35                  40                  45              


Met Leu Asp Thr Ser Ser Arg Lys Leu Val Ala Trp Asp Gly Thr Thr 
    50                  55                  60                  


Asp Gly Ala Ala Val Gly Ile Leu Ala Val Ala Ala Asp Gln Thr Ser 
65                  70                  75                  80  


Thr Thr Leu Thr Phe Tyr Lys Ser Gly Thr Phe Arg Tyr Glu Asp Val 
                85                  90                  95      


Leu Trp Pro Glu Ala Ala Ser Asp Glu Thr Lys Lys Arg Thr Ala Phe 
            100                 105                 110         


Ala Gly Thr Ala Ile Ser Ile Val Gly Ser Gly Ile Gly Gly Gly Cys 
        115                 120                 125             


Phe Val Pro Gly Thr Leu Val Asn Thr Glu Asn Gly Leu Lys Lys Ile 
    130                 135                 140                 


Glu Glu Ile Lys Val Gly Asp Lys Val Phe Ser His Thr Gly Lys Leu 
145                 150                 155                 160 


Gln Glu Val Val Asp Thr Leu Ile Phe Asp Arg Asp Glu Glu Ile Ile 
                165                 170                 175     


Ser Ile Asn Gly Ile Asp Cys Thr Lys Asn His Glu Phe Tyr Val Ile 
            180                 185                 190         


Asp Lys Glu Asn Ala Asn Arg Val Asn Glu Asp Asn Ile His Leu Phe 
        195                 200                 205             


Ala Arg Trp Val His Ala Glu Glu Leu Asp Met Lys Lys His Leu Leu 
    210                 215                 220                 


Ile Glu Leu Glu Gly Gly His His His His His His 
225                 230                 235     


<210>  34
<211>  101
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  IMPDH (InteinN)

<400>  34

Cys Phe Val Pro Gly Thr Leu Val Asn Thr Glu Asn Gly Leu Lys Lys 
1               5                   10                  15      


Ile Glu Glu Ile Lys Val Gly Asp Lys Val Phe Ser His Thr Gly Lys 
            20                  25                  30          


Leu Gln Glu Val Val Asp Thr Leu Ile Phe Asp Arg Asp Glu Glu Ile 
        35                  40                  45              


Ile Ser Ile Asn Gly Ile Asp Cys Thr Lys Asn His Glu Phe Tyr Val 
    50                  55                  60                  


Ile Asp Lys Glu Asn Ala Asn Arg Val Asn Glu Asp Asn Ile His Leu 
65                  70                  75                  80  


Phe Ala Arg Trp Val His Ala Glu Glu Leu Asp Met Lys Lys His Leu 
                85                  90                  95      


Leu Ile Glu Leu Glu 
            100     


<210>  35
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  IMPDH (ExteinN)

<400>  35

Gly Ile Gly Gly Gly 
1               5   


<210>  36
<211>  514
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  IMPDH C-term DNA

<400>  36
catatgaagt tcaagctgaa ggagatcacg agcatcgaga ccaagcacta caagggcaag       60

gtgcacgatc tgaccgtgaa tcaggaccac agctataacg tgcgcggcac cgtggtgcat      120

aatagcattt gcagcaccgg taccggatct gataaaatta ttcatctgac tgatgattct      180

tttgatactg atgtacttaa ggcagatggt gcaatcctgg ttgatttctg ggcacactgg      240

tgcggtccgt gcaaaatgat cgctccgatt ctggatgaaa tcgctgacga atatcagggc      300

aaactgaccg ttgcaaaact gaacatcgat cacaacccgg gcactgcgcc gaaatatggc      360

atccgtggta tcccgactct gctgctgttc aaaaacggtg aagtggcggc aaccaaagtg      420

ggtgcactgt ctaaaggtca gttgaaagag ttcctcgacg ctaacctggc cggctctgaa      480

ttcagatctc atcaccatca ccatcactaa gctt                                  514


<210>  37
<211>  168
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  IMPDH C-term PROTEIN

<400>  37

Met Lys Phe Lys Leu Lys Glu Ile Thr Ser Ile Glu Thr Lys His Tyr 
1               5                   10                  15      


Lys Gly Lys Val His Asp Leu Thr Val Asn Gln Asp His Ser Tyr Asn 
            20                  25                  30          


Val Arg Gly Thr Val Val His Asn Ser Ile Cys Ser Thr Gly Thr Gly 
        35                  40                  45              


Ser Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr Asp Val 
    50                  55                  60                  


Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala His Trp Cys 
65                  70                  75                  80  


Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp Glu 
                85                  90                  95      


Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp His Asn Pro 
            100                 105                 110         


Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu Leu 
        115                 120                 125             


Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser Lys 
    130                 135                 140                 


Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser Glu Phe 
145                 150                 155                 160 


Arg Ser His His His His His His 
                165             


<210>  38
<211>  40
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  IMPDH (InteinC)

<400>  38

Met Lys Phe Lys Leu Lys Glu Ile Thr Ser Ile Glu Thr Lys His Tyr 
1               5                   10                  15      


Lys Gly Lys Val His Asp Leu Thr Val Asn Gln Asp His Ser Tyr Asn 
            20                  25                  30          


Val Arg Gly Thr Val Val His Asn 
        35                  40  


<210>  39
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  IMPDH (ExteinC)

<400>  39

Ser Ile Cys Ser Thr 
1               5   


<210>  40
<211>  8
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Strep tag

<400>  40

Trp Ser His Pro Gln Phe Glu Lys 
1               5               


<210>  41
<211>  107
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  gpD

<400>  41

Lys Glu Thr Phe Thr His Tyr Gln Pro Gln Gly Asn Ser Asp Pro Ala 
1               5                   10                  15      


His Thr Ala Thr Ala Pro Gly Gly Leu Ser Ala Lys Ala Pro Ala Met 
            20                  25                  30          


Thr Pro Leu Met Leu Asp Thr Ser Ser Arg Lys Leu Val Ala Trp Asp 
        35                  40                  45              


Gly Thr Thr Asp Gly Ala Ala Val Gly Ile Leu Ala Val Ala Ala Asp 
    50                  55                  60                  


Gln Thr Ser Thr Thr Leu Thr Phe Tyr Lys Ser Gly Thr Phe Arg Tyr 
65                  70                  75                  80  


Glu Asp Val Leu Trp Pro Glu Ala Ala Ser Asp Glu Thr Lys Lys Arg 
                85                  90                  95      


Thr Ala Phe Ala Gly Thr Ala Ile Ser Ile Val 
            100                 105         


<210>  42
<211>  6
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  H6

<400>  42

His His His His His His 
1               5       


<210>  43
<211>  111
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Trx

<400>  43

Gly Ser Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr Asp 
1               5                   10                  15      


Val Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala His Trp 
            20                  25                  30          


Cys Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp 
        35                  40                  45              


Glu Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp His Asn 
    50                  55                  60                  


Pro Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu 
65                  70                  75                  80  


Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser 
                85                  90                  95      


Lys Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser 
            100                 105                 110     


<210>  44
<211>  13
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Sequence ChsXcplhXTXXG comprised in the N1 box


<220>
<221>  VARIANT
<222>  (2)..(2)
<223>  /note = "Xaa is a hydrophobic amino acid"

<220>
<221>  VARIANT
<222>  (3)..(3)
<223>  /note = "Xaa is a small amino acid"

<220>
<221>  VARIANT
<222>  (4)..(4)
<223>  /note = "Xaa is any amino acid"

<220>
<221>  VARIANT
<222>  (5)..(5)
<223>  /note = "Xaa is a charged amino acid"

<220>
<221>  VARIANT
<222>  (6)..(6)
<223>  /note = "Xaa is a polar amino acid"

<220>
<221>  VARIANT
<222>  (7)..(7)
<223>  /note = "Xaa is a large amino acid"

<220>
<221>  VARIANT
<222>  (8)..(8)
<223>  /note = "Xaa is a hydrophobic amino acid"

<220>
<221>  VARIANT
<222>  (9)..(9)
<223>  /note = "Xaa is any amino acid"

<220>
<221>  VARIANT
<222>  (11)..(12)
<223>  /note = "Xaa is any amino acid"

<400>  44

Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Xaa Xaa Gly 
1               5                   10              


<210>  45
<211>  13
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Sequence comprised in the intein N-terminal domain


<220>
<221>  VARIANT
<222>  (1)..(1)
<223>  /replace = "Cys"

<220>
<221>  VARIANT
<222>  (2)..(2)
<223>  /replace = "Leu"
       /replace = "Phe"
       /replace = "Val"

<220>
<221>  VARIANT
<222>  (3)..(3)
<223>  /replace = "Ser"
       /replace = "Thr"
       /replace = "Val"
       /replace = "Ala"

<220>
<221>  VARIANT
<222>  (4)..(4)
<223>  /replace = "Leu"
       /replace = "Pro"
       /replace = "Gly"
       /replace = "Tyr"

<220>
<221>  VARIANT
<222>  (5)..(5)
<223>  /replace = "Asp"
       /replace = "Glu"
       /replace = "Lys"
       /replace = "Gly"

<220>
<221>  VARIANT
<222>  (6)..(6)
<223>  /replace = "Thr"
       /replace = "Ala"

<220>
<221>  VARIANT
<222>  (7)..(7)
<223>  /replace = "Glu"
       /replace = "Gln"
       /replace = "Leu"

<220>
<221>  VARIANT
<222>  (7)..(7)
<223>  /replace = "Met"
       /replace = "Lys"
       /replace = "Thr"

<220>
<221>  VARIANT
<222>  (8)..(8)
<223>  /replace = "Ile"
       /replace = "Val"

<220>
<221>  VARIANT
<222>  (9)..(9)
<223>  /replace = "Leu"
       /replace = "Gln"
       /replace = "Val"
       /replace = "Asn"

<220>
<221>  VARIANT
<222>  (9)..(9)
<223>  /replace = "Lys"
       /replace = "Asp"
       /replace = "Thr"

<220>
<221>  VARIANT
<222>  (10)..(10)
<223>  /replace = "Thr"
       /replace = "Ile"
       /replace = "Val"

<220>
<221>  VARIANT
<222>  (11)..(11)
<223>  /replace = "Val"
       /replace = "Pro"
       /replace = "Gln"
       /replace = "Asn"

<220>
<221>  VARIANT
<222>  (11)..(11)
<223>  /replace = "Glu"
       /replace = "Lys"
       /replace = "Leu"

<220>
<221>  VARIANT
<222>  (12)..(12)
<223>  /replace = "Glu"
       /replace = "Gln"
       /replace = "Gly"
       /replace = "Asn"

<220>
<221>  VARIANT
<222>  (12)..(12)
<223>  /replace = "Tyr"
       /replace = "Ile"
       /replace = "Glu"

<220>
<221>  VARIANT
<222>  (13)..(13)
<223>  /replace = "Tyr"
       /replace = "Gly"
       /replace = "Lys"
       /replace = "Pro"

<220>
<221>  VARIANT
<222>  (13)..(13)
<223>  /replace = "Asp"

<400>  45

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1               5                   10              


<210>  46
<211>  13
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Sequence comprised in the intein N-terminal domain


<220>
<221>  VARIANT
<222>  (1)..(1)
<223>  /replace = "Cys"

<220>
<221>  VARIANT
<222>  (2)..(2)
<223>  /replace = "Leu"
       /replace = "Phe"
       /replace = "Val"

<220>
<221>  VARIANT
<222>  (3)..(3)
<223>  /replace = "Ser"
       /replace = "Thr"
       /replace = "Val"
       /replace = "Ala"

<220>
<221>  VARIANT
<222>  (4)..(4)
<223>  /replace = "Leu"
       /replace = "Pro"
       /replace = "Gly"

<220>
<221>  VARIANT
<222>  (5)..(5)
<223>  /replace = "Asp"
       /replace = "Lys"
       /replace = "Gly"

<220>
<221>  VARIANT
<222>  (6)..(6)
<223>  /replace = "Thr"
       /replace = "Ala"

<220>
<221>  VARIANT
<222>  (7)..(7)
<223>  /replace = "Gln"
       /replace = "Leu"
       /replace = "Met"
       /replace = "Lys"

<220>
<221>  VARIANT
<222>  (7)..(7)
<223>  /replace = "Thr"

<220>
<221>  VARIANT
<222>  (8)..(8)
<223>  /replace = "Ile"
       /replace = "Val"

<220>
<221>  VARIANT
<222>  (9)..(9)
<223>  /replace = "Gln"
       /replace = "Val"
       /replace = "Asn"
       /replace = "Lys"

<220>
<221>  VARIANT
<222>  (9)..(9)
<223>  /replace = "Asp"
       /replace = "Thr"

<220>
<221>  VARIANT
<222>  (10)..(10)
<223>  /replace = "Thr"
       /replace = "Ile"
       /replace = "Val"

<220>
<221>  VARIANT
<222>  (11)..(11)
<223>  /replace = "Pro"
       /replace = "Gln"
       /replace = "Asn"
       /replace = "Glu"

<220>
<221>  VARIANT
<222>  (11)..(11)
<223>  /replace = "Lys"
       /replace = "Leu"

<220>
<221>  VARIANT
<222>  (12)..(12)
<223>  /replace = "Glu"
       /replace = "Gln"
       /replace = "Gly"
       /replace = "Asn"

<220>
<221>  VARIANT
<222>  (12)..(12)
<223>  /replace = "Tyr"
       /replace = "Ile"
       /replace = "Glu"

<220>
<221>  VARIANT
<222>  (13)..(13)
<223>  /replace = "Gly"
       /replace = "Lys"
       /replace = "Pro"
       /replace = "Asp"

<400>  46

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1               5                   10              


<210>  47
<211>  15
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Sequence GXXhXhTXaHXhhTX comprised in the N3 box


<220>
<221>  VARIANT
<222>  (2)..(3)
<223>  /note = "Xaa is any amino acid"

<220>
<221>  VARIANT
<222>  (4)..(4)
<223>  /note = "Xaa is a hydrophobic amino acid"

<220>
<221>  VARIANT
<222>  (5)..(5)
<223>  /note = "Xaa is any amino acid"

<220>
<221>  VARIANT
<222>  (6)..(6)
<223>  /note = "Xaa is a hydrophobic amino acid"

<220>
<221>  VARIANT
<222>  (8)..(8)
<223>  /note = "Xaa is any amino acid"

<220>
<221>  VARIANT
<222>  (9)..(9)
<223>  /note = "Xaa is an acidic amino acid"

<220>
<221>  VARIANT
<222>  (11)..(11)
<223>  /note = "Xaa is any amino acid"

<220>
<221>  VARIANT
<222>  (12)..(13)
<223>  /note = "Xaa is a hydrophobic amino acid"

<220>
<221>  VARIANT
<222>  (15)..(15)
<223>  /note = "Xaa is any amino acid"

<400>  47

Gly Xaa Xaa Xaa Xaa Xaa Thr Xaa Xaa His Xaa Xaa Xaa Thr Xaa 
1               5                   10                  15  


<210>  48
<211>  15
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Sequence comprised in the intein N-terminal domain


<220>
<221>  VARIANT
<222>  (1)..(1)
<223>  /replace = "Gly"
       /replace = "Ala"

<220>
<221>  VARIANT
<222>  (2)..(2)
<223>  /replace = "Ser"
       /replace = "Lys"
       /replace = "Gln"
       /replace = "Asn"

<220>
<221>  VARIANT
<222>  (2)..(2)
<223>  /replace = "Phe"

<220>
<221>  VARIANT
<222>  (3)..(3)
<223>  /replace = "Leu"
       /replace = "Glu"
       /replace = "Lys"
       /replace = "Arg"

<220>
<221>  VARIANT
<222>  (4)..(4)
<223>  /replace = "Ile"
       /replace = "Leu"
       /replace = "Val"

<220>
<221>  VARIANT
<222>  (5)..(5)
<223>  /replace = "Arg"
       /replace = "Ile"
       /replace = "Val"
       /replace = "Asn"

<220>
<221>  VARIANT
<222>  (6)..(6)
<223>  /replace = "Ala"
       /replace = "Cys"
       /replace = "Val"
       /replace = "Glu"

<220>
<221>  VARIANT
<222>  (7)..(7)
<223>  /replace = "Thr"
       /replace = "Ser"
       /replace = "Asp"

<220>
<221>  VARIANT
<222>  (8)..(8)
<223>  /replace = "Lys"
       /replace = "Glu"
       /replace = "Ala"
       /replace = "Pro"

<220>
<221>  VARIANT
<222>  (8)..(8)
<223>  /replace = "Asn"

<220>
<221>  VARIANT
<222>  (9)..(9)
<223>  /replace = "Asp"
       /replace = "Glu"
       /replace = "Asn"
       /replace = "Ile"

<220>
<221>  VARIANT
<222>  (10)..(10)
<223>  /replace = "His"

<220>
<221>  VARIANT
<222>  (11)..(11)
<223>  /replace = "Lys"
       /replace = "Leu"
       /replace = "Gln"
       /replace = "Met"

<220>
<221>  VARIANT
<222>  (12)..(12)
<223>  /replace = "Phe"
       /replace = "Val"
       /replace = "Ile"

<220>
<221>  VARIANT
<222>  (13)..(13)
<223>  /replace = "Met"
       /replace = "Pro"
       /replace = "Phe"
       /replace = "Tyr"

<220>
<221>  VARIANT
<222>  (13)..(13)
<223>  /replace = "Ala"

<220>
<221>  VARIANT
<222>  (14)..(14)
<223>  /replace = "Thr"

<220>
<221>  VARIANT
<222>  (15)..(15)
<223>  /replace = "Val"
       /replace = "Gln"
       /replace = "Lys"
       /replace = "Leu"

<400>  48

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1               5                   10                  15  


<210>  49
<211>  15
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Sequence comprised in the intein N-terminal domain


<220>
<221>  VARIANT
<222>  (1)..(1)
<223>  /replace = "Gly"
       /replace = "Ala"

<220>
<221>  VARIANT
<222>  (2)..(2)
<223>  /replace = "Lys"
       /replace = "Gln"
       /replace = "Asn"
       /replace = "Phe"

<220>
<221>  VARIANT
<222>  (3)..(3)
<223>  /replace = "Glu"
       /replace = "Lys"
       /replace = "Arg"

<220>
<221>  VARIANT
<222>  (4)..(4)
<223>  /replace = "Ile"
       /replace = "Leu"
       /replace = "Val"

<220>
<221>  VARIANT
<222>  (5)..(5)
<223>  /replace = "Arg"
       /replace = "Ile"
       /replace = "Val"
       /replace = "Asn"

<220>
<221>  VARIANT
<222>  (6)..(6)
<223>  /replace = "Cys"
       /replace = "Val"
       /replace = "Glu"

<220>
<221>  VARIANT
<222>  (7)..(7)
<223>  /replace = "Thr"
       /replace = "Ser"
       /replace = "Asp"

<220>
<221>  VARIANT
<222>  (8)..(8)
<223>  /replace = "Glu"
       /replace = "Ala"
       /replace = "Pro"
       /replace = "Asn"

<220>
<221>  VARIANT
<222>  (9)..(9)
<223>  /replace = "Asp"
       /replace = "Glu"
       /replace = "Asn"
       /replace = "Ile"

<220>
<221>  VARIANT
<222>  (10)..(10)
<223>  /replace = "His"

<220>
<221>  VARIANT
<222>  (11)..(11)
<223>  /replace = "Lys"
       /replace = "Leu"
       /replace = "Gln"
       /replace = "Met"

<220>
<221>  VARIANT
<222>  (12)..(12)
<223>  /replace = "Phe"
       /replace = "Val"
       /replace = "Ile"

<220>
<221>  VARIANT
<222>  (13)..(13)
<223>  /replace = "Pro"
       /replace = "Phe"
       /replace = "Tyr"
       /replace = "Ala"

<220>
<221>  VARIANT
<222>  (14)..(14)
<223>  /replace = "Thr"

<220>
<221>  VARIANT
<222>  (15)..(15)
<223>  /replace = "Gln"
       /replace = "Lys"
       /replace = "Leu"

<400>  49

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1               5                   10                  15  


<210>  50
<211>  14
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Sequence XhhDIpVXXpHXFX comprised in the C1 box


<220>
<221>  VARIANT
<222>  (1)..(1)
<223>  /note = "Xaa is any amino acid"

<220>
<221>  VARIANT
<222>  (2)..(3)
<223>  /note = "Xaa is a hydrophobic amino acid"

<220>
<221>  VARIANT
<222>  (6)..(6)
<223>  /note = "Xaa is a polar amino acid"

<220>
<221>  VARIANT
<222>  (8)..(9)
<223>  /note = "Xaa is any amino acid"

<220>
<221>  VARIANT
<222>  (10)..(10)
<223>  /note = "Xaa is a polar amino acid"

<220>
<221>  VARIANT
<222>  (12)..(12)
<223>  /note = "Xaa is any amino acid"

<220>
<221>  VARIANT
<222>  (14)..(14)
<223>  /note = "Xaa is any amino acid"

<400>  50

Xaa Xaa Xaa Asp Ile Xaa Val Xaa Xaa Xaa His Xaa Phe Xaa 
1               5                   10                  


<210>  51
<211>  14
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Sequence comprised in the intein C-terminal domain


<220>
<221>  VARIANT
<222>  (1)..(1)
<223>  /replace = "Asn"
       /replace = "Glu"
       /replace = "Leu"
       /replace = "Lys"

<220>
<221>  VARIANT
<222>  (1)..(1)
<223>  /replace = "Gln"
       /replace = "Asp"
       /replace = "Pro"
       /replace = "Arg"

<220>
<221>  VARIANT
<222>  (2)..(2)
<223>  /replace = "Val"
       /replace = "Leu"
       /replace = "Thr"

<220>
<221>  VARIANT
<222>  (3)..(3)
<223>  /replace = "Tyr"
       /replace = "Ile"
       /replace = "Val"
       /replace = "His"

<220>
<221>  VARIANT
<222>  (3)..(3)
<223>  /replace = "Phe"

<220>
<221>  VARIANT
<222>  (4)..(4)
<223>  /replace = "Asp"

<220>
<221>  VARIANT
<222>  (5)..(5)
<223>  /replace = "Ile"
       /replace = "Leu"

<220>
<221>  VARIANT
<222>  (6)..(6)
<223>  /replace = "Gly"
       /replace = "Glu"
       /replace = "Thr"
       /replace = "Gln"

<220>
<221>  VARIANT
<222>  (6)..(6)
<223>  /replace = "Lys"

<220>
<221>  VARIANT
<222>  (7)..(7)
<223>  /replace = "Val"
       /replace = "Thr"

<220>
<221>  VARIANT
<222>  (8)..(8)
<223>  /replace = "Glu"
       /replace = "Ser"
       /replace = "Thr"
       /replace = "Asp"

<220>
<221>  VARIANT
<222>  (8)..(8)
<223>  /replace = "Asn"
       /replace = "Lys"

<220>
<221>  VARIANT
<222>  (9)..(9)
<223>  /replace = "Arg"
       /replace = "Gly"
       /replace = "Asp"
       /replace = "Asn"

<220>
<221>  VARIANT
<222>  (9)..(9)
<223>  /replace = "Gln"
       /replace = "Ser"
       /replace = "Lys"

<220>
<221>  VARIANT
<222>  (10)..(10)
<223>  /replace = "Asp"
       /replace = "Glu"
       /replace = "Asn"
       /replace = "Thr"

<220>
<221>  VARIANT
<222>  (10)..(10)
<223>  /replace = "Lys"

<220>
<221>  VARIANT
<222>  (11)..(11)
<223>  /replace = "His"
       /replace = "Arg"
       /replace = "Ser"
       /replace = "Ile"

<220>
<221>  VARIANT
<222>  (11)..(11)
<223>  /replace = "Asn"

<220>
<221>  VARIANT
<222>  (12)..(12)
<223>  /replace = "Asn"
       /replace = "Leu"
       /replace = "Ser"
       /replace = "Ile"

<220>
<221>  VARIANT
<222>  (12)..(12)
<223>  /replace = "Asn"

<220>
<221>  VARIANT
<222>  (13)..(13)
<223>  /replace = "Phe"
       /replace = "Tyr"
       /replace = "Leu"
       /replace = "Ile"

<220>
<221>  VARIANT
<222>  (14)..(14)
<223>  /replace = "Ala"
       /replace = "Tyr"
       /replace = "Phe"
       /replace = "Asn"

<220>
<221>  VARIANT
<222>  (14)..(14)
<223>  /replace = "Cys"
       /replace = "Ser"

<400>  51

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1               5                   10                  


<210>  52
<211>  14
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Sequence comprised in the intein C-terminal domain


<220>
<221>  VARIANT
<222>  (1)..(1)
<223>  /replace = "Glu"
       /replace = "Leu"
       /replace = "Lys"
       /replace = "Gln"

<220>
<221>  VARIANT
<222>  (1)..(1)
<223>  /replace = "Asp"
       /replace = "Pro"
       /replace = "Arg"

<220>
<221>  VARIANT
<222>  (2)..(2)
<223>  /replace = "Val"
       /replace = "Leu"
       /replace = "Thr"

<220>
<221>  VARIANT
<222>  (3)..(3)
<223>  /replace = "Tyr"
       /replace = "Ile"
       /replace = "Val"
       /replace = "His"

<220>
<221>  VARIANT
<222>  (3)..(3)
<223>  /replace = "Phe"

<220>
<221>  VARIANT
<222>  (4)..(4)
<223>  /replace = "Asp"

<220>
<221>  VARIANT
<222>  (5)..(5)
<223>  /replace = "Ile"
       /replace = "Leu"

<220>
<221>  VARIANT
<222>  (6)..(6)
<223>  /replace = "Gly"
       /replace = "Glu"
       /replace = "Thr"
       /replace = "Gln"

<220>
<221>  VARIANT
<222>  (6)..(6)
<223>  /replace = "Lys"

<220>
<221>  VARIANT
<222>  (7)..(7)
<223>  /replace = "Val"
       /replace = "Thr"

<220>
<221>  VARIANT
<222>  (8)..(8)
<223>  /replace = "Glu"
       /replace = "Ser"
       /replace = "Thr"
       /replace = "Asp"

<220>
<221>  VARIANT
<222>  (8)..(8)
<223>  /replace = "Asn"
       /replace = "Lys"

<220>
<221>  VARIANT
<222>  (9)..(9)
<223>  /replace = "Gly"
       /replace = "Asp"
       /replace = "Asn"
       /replace = "Gln"

<220>
<221>  VARIANT
<222>  (9)..(9)
<223>  /replace = "Ser"
       /replace = "Lys"

<220>
<221>  VARIANT
<222>  (10)..(10)
<223>  /replace = "Asp"
       /replace = "Glu"
       /replace = "Asn"
       /replace = "Thr"

<220>
<221>  VARIANT
<222>  (10)..(10)
<223>  /replace = "Lys"

<220>
<221>  VARIANT
<222>  (11)..(11)
<223>  /replace = "His"
       /replace = "Arg"
       /replace = "Ser"
       /replace = "Ile"

<220>
<221>  VARIANT
<222>  (11)..(11)
<223>  /replace = "Asn"

<220>
<221>  VARIANT
<222>  (12)..(12)
<223>  /replace = "Asn"
       /replace = "Leu"
       /replace = "Ser"
       /replace = "Ile"

<220>
<221>  VARIANT
<222>  (12)..(12)
<223>  /replace = "Asn"

<220>
<221>  VARIANT
<222>  (13)..(13)
<223>  /replace = "Phe"
       /replace = "Tyr"
       /replace = "Leu"
       /replace = "Ile"

<220>
<221>  VARIANT
<222>  (14)..(14)
<223>  /replace = "Ala"
       /replace = "Tyr"
       /replace = "Phe"
       /replace = "Asn"

<220>
<221>  VARIANT
<222>  (14)..(14)
<223>  /replace = "Cys"
       /replace = "Ser"

<400>  52

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1               5                   10                  


<210>  53
<211>  9
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Sequence hNXIhXHNn comprised in the C2 box


<220>
<221>  VARIANT
<222>  (1)..(1)
<223>  /note = "Xaa is a hydrophobic amino acid"

<220>
<221>  VARIANT
<222>  (3)..(3)
<223>  /note = "Xaa is any amino acid"

<220>
<221>  VARIANT
<222>  (5)..(5)
<223>  /note = "Xaa is a hydrophobic amino acid"

<220>
<221>  VARIANT
<222>  (6)..(6)
<223>  /note = "Xaa is any amino acid"

<220>
<221>  VARIANT
<222>  (9)..(9)
<223>  /note = "Xaa is a nucleophilic amino acid"

<400>  53

Xaa Asn Xaa Ile Xaa Xaa His Asn Xaa 
1               5                   


<210>  54
<211>  9
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Sequence comprised in the intein C-terminal domain


<220>
<221>  VARIANT
<222>  (1)..(1)
<223>  /replace = "Leu"
       /replace = "Ala"
       /replace = "Val"
       /replace = "Ile"

<220>
<221>  VARIANT
<222>  (1)..(1)
<223>  /replace = "Cys"

<220>
<221>  VARIANT
<222>  (2)..(2)
<223>  /replace = "Asn"
       /replace = "Arg"

<220>
<221>  VARIANT
<222>  (3)..(3)
<223>  /replace = "Gly"
       /replace = "Asp"
       /replace = "Ala"
       /replace = "Asn"

<220>
<221>  VARIANT
<222>  (4)..(4)
<223>  /replace = "Ile"
       /replace = "Phe"
       /replace = "Thr"

<220>
<221>  VARIANT
<222>  (5)..(5)
<223>  /replace = "Leu"
       /replace = "Ile"
       /replace = "Val"

<220>
<221>  VARIANT
<222>  (6)..(6)
<223>  /replace = "Val"
       /replace = "Ile"
       /replace = "Thr"
       /replace = "Ala"

<220>
<221>  VARIANT
<222>  (7)..(7)
<223>  /replace = "His"
       /replace = "Ser"

<220>
<221>  VARIANT
<222>  (8)..(8)
<223>  /replace = "Asn"

<220>
<221>  VARIANT
<222>  (9)..(9)
<223>  /replace = "Ser"
       /replace = "Thr"
       /replace = "Cys"

<400>  54

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1               5                   


<210>  55
<211>  9
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Sequence comprised in the intein C-terminal domain


<220>
<221>  VARIANT
<222>  (1)..(1)
<223>  /replace = "Ala"
       /replace = "Val"
       /replace = "Ile"
       /replace = "Cys"

<220>
<221>  MISC_FEATURE
<222>  (1)..(8)
<223>  /note = "intein sequence"

<220>
<221>  VARIANT
<222>  (2)..(2)
<223>  /replace = "Asn"
       /replace = "Arg"

<220>
<221>  VARIANT
<222>  (3)..(3)
<223>  /replace = "Gly"
       /replace = "Asp"
       /replace = "Ala"
       /replace = "Asn"

<220>
<221>  VARIANT
<222>  (4)..(4)
<223>  /replace = "Ile"
       /replace = "Phe"
       /replace = "Thr"

<220>
<221>  VARIANT
<222>  (5)..(5)
<223>  /replace = "Leu"
       /replace = "Val"

<220>
<221>  VARIANT
<222>  (6)..(6)
<223>  /replace = "Val"
       /replace = "Ile"
       /replace = "Thr"

<220>
<221>  VARIANT
<222>  (7)..(7)
<223>  /replace = "His"

<220>
<221>  VARIANT
<222>  (8)..(8)
<223>  /replace = "Asn"

<220>
<221>  VARIANT
<222>  (9)..(9)
<223>  /replace = "Ser"
       /replace = "Thr"
       /replace = "Cys"

<220>
<221>  MISC_FEATURE
<222>  (9)..(9)
<223>  /note = "first amino acid of the extein"

<400>  55

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1               5                   


<210>  56
<211>  223
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  GP-41.1 C1A N-fragment Protein

<400>  56

Met Ala Ser Trp Ser His Pro Gln Phe Glu Lys Ala Ser Lys Glu Thr 
1               5                   10                  15      


Phe Thr His Tyr Gln Pro Gln Gly Asn Ser Asp Pro Ala His Thr Ala 
            20                  25                  30          


Thr Ala Pro Gly Gly Leu Ser Ala Lys Ala Pro Ala Met Thr Pro Leu 
        35                  40                  45              


Met Leu Asp Thr Ser Ser Arg Lys Leu Val Ala Trp Asp Gly Thr Thr 
    50                  55                  60                  


Asp Gly Ala Ala Val Gly Ile Leu Ala Val Ala Ala Asp Gln Thr Ser 
65                  70                  75                  80  


Thr Thr Leu Thr Phe Tyr Lys Ser Gly Thr Phe Arg Tyr Glu Asp Val 
                85                  90                  95      


Leu Trp Pro Glu Ala Ala Ser Asp Glu Thr Lys Lys Arg Thr Ala Phe 
            100                 105                 110         


Ala Gly Thr Ala Ile Ser Ile Val Gly Ser Thr Arg Ser Gly Tyr Ala 
        115                 120                 125             


Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu Ile 
    130                 135                 140                 


Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr Asn 
145                 150                 155                 160 


Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys Ile 
                165                 170                 175     


Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu Phe 
            180                 185                 190         


Pro Thr Gln Thr Gly Glu Met Asn Ile Ser Gly Gly Leu Lys Glu Gly 
        195                 200                 205             


Met Cys Leu Tyr Val Lys Glu Gly Gly His His His His His His 
    210                 215                 220             


<210>  57
<211>  224
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  GP 41.8 C1A N-fragment PROTEIN

<400>  57

Met Ala Ser Trp Ser His Pro Gln Phe Glu Lys Ala Ser Lys Glu Thr 
1               5                   10                  15      


Phe Thr His Tyr Gln Pro Gln Gly Asn Ser Asp Pro Ala His Thr Ala 
            20                  25                  30          


Thr Ala Pro Gly Gly Leu Ser Ala Lys Ala Pro Ala Met Thr Pro Leu 
        35                  40                  45              


Met Leu Asp Thr Ser Ser Arg Lys Leu Val Ala Trp Asp Gly Thr Thr 
    50                  55                  60                  


Asp Gly Ala Ala Val Gly Ile Leu Ala Val Ala Ala Asp Gln Thr Ser 
65                  70                  75                  80  


Thr Thr Leu Thr Phe Tyr Lys Ser Gly Thr Phe Arg Tyr Glu Asp Val 
                85                  90                  95      


Leu Trp Pro Glu Ala Ala Ser Asp Glu Thr Lys Lys Arg Thr Ala Phe 
            100                 105                 110         


Ala Gly Thr Ala Ile Ser Ile Val Gly Ser Ser Gln Leu Asn Arg Ala 
        115                 120                 125             


Leu Ser Leu Asp Thr Met Val Val Thr Asn Gly Lys Ala Ile Glu Ile 
    130                 135                 140                 


Arg Asp Val Lys Val Gly Asp Trp Leu Glu Ser Glu Cys Gly Pro Val 
145                 150                 155                 160 


Gln Val Thr Glu Val Leu Pro Ile Ile Lys Gln Pro Val Phe Glu Ile 
                165                 170                 175     


Val Leu Lys Ser Gly Lys Lys Ile Arg Val Ser Ala Asn His Lys Phe 
            180                 185                 190         


Pro Thr Lys Asp Gly Leu Lys Thr Ile Asn Ser Gly Leu Lys Val Gly 
        195                 200                 205             


Asp Phe Leu Arg Ser Arg Ala Lys Gly Gly His His His His His His 
    210                 215                 220                 


<210>  58
<211>  240
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  NrdJ1 C1A N-term PROTEIN

<400>  58

Met Ala Ser Trp Ser His Pro Gln Phe Glu Lys Ala Ser Lys Glu Thr 
1               5                   10                  15      


Phe Thr His Tyr Gln Pro Gln Gly Asn Ser Asp Pro Ala His Thr Ala 
            20                  25                  30          


Thr Ala Pro Gly Gly Leu Ser Ala Lys Ala Pro Ala Met Thr Pro Leu 
        35                  40                  45              


Met Leu Asp Thr Ser Ser Arg Lys Leu Val Ala Trp Asp Gly Thr Thr 
    50                  55                  60                  


Asp Gly Ala Ala Val Gly Ile Leu Ala Val Ala Ala Asp Gln Thr Ser 
65                  70                  75                  80  


Thr Thr Leu Thr Phe Tyr Lys Ser Gly Thr Phe Arg Tyr Glu Asp Val 
                85                  90                  95      


Leu Trp Pro Glu Ala Ala Ser Asp Glu Thr Lys Lys Arg Thr Ala Phe 
            100                 105                 110         


Ala Gly Thr Ala Ile Ser Ile Val Gly Ser Gly Thr Asn Pro Cys Ala 
        115                 120                 125             


Leu Val Gly Ser Ser Glu Ile Ile Thr Arg Asn Tyr Gly Lys Thr Thr 
    130                 135                 140                 


Ile Lys Glu Val Val Glu Ile Phe Asp Asn Asp Lys Asn Ile Gln Val 
145                 150                 155                 160 


Leu Ala Phe Asn Thr His Thr Asp Asn Ile Glu Trp Ala Pro Ile Lys 
                165                 170                 175     


Ala Ala Gln Leu Thr Arg Pro Asn Ala Glu Leu Val Glu Leu Glu Ile 
            180                 185                 190         


Asn Thr Leu His Gly Val Lys Thr Ile Arg Cys Thr Pro Asp His Pro 
        195                 200                 205             


Val Tyr Thr Lys Asn Arg Asp Tyr Val Arg Ala Asp Glu Leu Thr Asp 
    210                 215                 220                 


Asp Asp Glu Leu Val Val Ala Ile Gly Gly His His His His His His 
225                 230                 235                 240 


<210>  59
<211>  236
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  IMPDH C1A N-term PROTEIN

<400>  59

Met Ala Ser Trp Ser His Pro Gln Phe Glu Lys Ala Ser Lys Glu Thr 
1               5                   10                  15      


Phe Thr His Tyr Gln Pro Gln Gly Asn Ser Asp Pro Ala His Thr Ala 
            20                  25                  30          


Thr Ala Pro Gly Gly Leu Ser Ala Lys Ala Pro Ala Met Thr Pro Leu 
        35                  40                  45              


Met Leu Asp Thr Ser Ser Arg Lys Leu Val Ala Trp Asp Gly Thr Thr 
    50                  55                  60                  


Asp Gly Ala Ala Val Gly Ile Leu Ala Val Ala Ala Asp Gln Thr Ser 
65                  70                  75                  80  


Thr Thr Leu Thr Phe Tyr Lys Ser Gly Thr Phe Arg Tyr Glu Asp Val 
                85                  90                  95      


Leu Trp Pro Glu Ala Ala Ser Asp Glu Thr Lys Lys Arg Thr Ala Phe 
            100                 105                 110         


Ala Gly Thr Ala Ile Ser Ile Val Gly Ser Gly Ile Gly Gly Gly Ala 
        115                 120                 125             


Phe Val Pro Gly Thr Leu Val Asn Thr Glu Asn Gly Leu Lys Lys Ile 
    130                 135                 140                 


Glu Glu Ile Lys Val Gly Asp Lys Val Phe Ser His Thr Gly Lys Leu 
145                 150                 155                 160 


Gln Glu Val Val Asp Thr Leu Ile Phe Asp Arg Asp Glu Glu Ile Ile 
                165                 170                 175     


Ser Ile Asn Gly Ile Asp Cys Thr Lys Asn His Glu Phe Tyr Val Ile 
            180                 185                 190         


Asp Lys Glu Asn Ala Asn Arg Val Asn Glu Asp Asn Ile His Leu Phe 
        195                 200                 205             


Ala Arg Trp Val His Ala Glu Glu Leu Asp Met Lys Lys His Leu Leu 
    210                 215                 220                 


Ile Glu Leu Glu Gly Gly His His His His His His 
225                 230                 235     


<210>  60
<211>  165
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  GP41.1 deltaext C-fragment Protein

<400>  60

Met Gly Lys Asn Ser Met Met Leu Lys Lys Ile Leu Lys Ile Glu Glu 
1               5                   10                  15      


Leu Asp Glu Arg Glu Leu Ile Asp Ile Glu Val Ser Gly Asn His Leu 
            20                  25                  30          


Phe Tyr Ala Asn Asp Ile Leu Thr His Asn Gly Thr Gly Ser Asp Lys 
        35                  40                  45              


Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr Asp Val Leu Lys Ala 
    50                  55                  60                  


Asp Gly Ala Ile Leu Val Asp Phe Trp Ala His Trp Cys Gly Pro Cys 
65                  70                  75                  80  


Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp Glu Tyr Gln Gly 
                85                  90                  95      


Lys Leu Thr Val Ala Lys Leu Asn Ile Asp His Asn Pro Gly Thr Ala 
            100                 105                 110         


Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu Leu Phe Lys Asn 
        115                 120                 125             


Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser Lys Gly Gln Leu 
    130                 135                 140                 


Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser Glu Phe Arg Ser His 
145                 150                 155                 160 


His His His His His 
                165 


<210>  61
<211>  168
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  GP41.8 deltaext C-term PROTEIN

<400>  61

Met Cys Glu Ile Phe Glu Asn Glu Ile Asp Trp Asp Glu Ile Ala Ser 
1               5                   10                  15      


Ile Glu Tyr Val Gly Val Glu Glu Thr Ile Asp Ile Asn Val Thr Asn 
            20                  25                  30          


Asp Arg Leu Phe Phe Ala Asn Gly Ile Leu Thr His Asn Gly Thr Gly 
        35                  40                  45              


Ser Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr Asp Val 
    50                  55                  60                  


Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala His Trp Cys 
65                  70                  75                  80  


Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp Glu 
                85                  90                  95      


Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp His Asn Pro 
            100                 105                 110         


Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu Leu 
        115                 120                 125             


Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser Lys 
    130                 135                 140                 


Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser Glu Phe 
145                 150                 155                 160 


Arg Ser His His His His His His 
                165             


<210>  62
<211>  163
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  NrdJ1 deltaext C-term PROTEIN

<400>  62

Met Glu Ala Lys Thr Tyr Ile Gly Lys Leu Lys Ser Arg Lys Ile Val 
1               5                   10                  15      


Ser Asn Glu Asp Thr Tyr Asp Ile Gln Thr Ser Thr His Asn Phe Phe 
            20                  25                  30          


Ala Asn Asp Ile Leu Val His Asn Gly Thr Gly Ser Asp Lys Ile Ile 
        35                  40                  45              


His Leu Thr Asp Asp Ser Phe Asp Thr Asp Val Leu Lys Ala Asp Gly 
    50                  55                  60                  


Ala Ile Leu Val Asp Phe Trp Ala His Trp Cys Gly Pro Cys Lys Met 
65                  70                  75                  80  


Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp Glu Tyr Gln Gly Lys Leu 
                85                  90                  95      


Thr Val Ala Lys Leu Asn Ile Asp His Asn Pro Gly Thr Ala Pro Lys 
            100                 105                 110         


Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu Leu Phe Lys Asn Gly Glu 
        115                 120                 125             


Val Ala Ala Thr Lys Val Gly Ala Leu Ser Lys Gly Gln Leu Lys Glu 
    130                 135                 140                 


Phe Leu Asp Ala Asn Leu Ala Gly Ser Glu Phe Arg Ser His His His 
145                 150                 155                 160 


His His His 
            


<210>  63
<211>  163
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  IMPDH deltaext C-term PROTEIN

<400>  63

Met Lys Phe Lys Leu Lys Glu Ile Thr Ser Ile Glu Thr Lys His Tyr 
1               5                   10                  15      


Lys Gly Lys Val His Asp Leu Thr Val Asn Gln Asp His Ser Tyr Asn 
            20                  25                  30          


Val Arg Gly Thr Val Val His Asn Gly Thr Gly Ser Asp Lys Ile Ile 
        35                  40                  45              


His Leu Thr Asp Asp Ser Phe Asp Thr Asp Val Leu Lys Ala Asp Gly 
    50                  55                  60                  


Ala Ile Leu Val Asp Phe Trp Ala His Trp Cys Gly Pro Cys Lys Met 
65                  70                  75                  80  


Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp Glu Tyr Gln Gly Lys Leu 
                85                  90                  95      


Thr Val Ala Lys Leu Asn Ile Asp His Asn Pro Gly Thr Ala Pro Lys 
            100                 105                 110         


Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu Leu Phe Lys Asn Gly Glu 
        115                 120                 125             


Val Ala Ala Thr Lys Val Gly Ala Leu Ser Lys Gly Gln Leu Lys Glu 
    130                 135                 140                 


Phe Leu Asp Ala Asn Leu Ala Gly Ser Glu Phe Arg Ser His His His 
145                 150                 155                 160 


His His His 
            


<210>  64
<211>  106
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  N-terminal region of the NrdA2 intein

<400>  64

Cys Leu Thr Gly Asp Ala Lys Ile Asp Val Leu Ile Asp Asn Ile Pro 
1               5                   10                  15      


Ile Ser Gln Ile Ser Leu Glu Glu Val Val Asn Leu Phe Asn Glu Gly 
            20                  25                  30          


Lys Glu Ile Tyr Val Leu Ser Tyr Asn Ile Asp Thr Lys Glu Val Glu 
        35                  40                  45              


Tyr Lys Glu Ile Ser Asp Ala Gly Leu Ile Ser Glu Ser Ala Glu Val 
    50                  55                  60                  


Leu Glu Ile Ile Asp Glu Glu Thr Gly Gln Lys Ile Val Cys Thr Pro 
65                  70                  75                  80  


Asp His Lys Val Tyr Thr Leu Asn Arg Gly Tyr Val Ser Ala Lys Asp 
                85                  90                  95      


Leu Lys Glu Asp Asp Glu Leu Val Phe Ser 
            100                 105     


<210>  65
<211>  34
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  C-terminal region of the NrdA2 intein

<400>  65

Met Gly Leu Lys Ile Ile Lys Arg Glu Ser Lys Glu Pro Val Phe Asp 
1               5                   10                  15      


Ile Thr Val Lys Asp Asn Ser Asn Phe Phe Ala Asn Asn Ile Leu Val 
            20                  25                  30          


His Asn 
        


<210>  66
<211>  166
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  G1C(S)

<400>  66

Met Gly Lys Asn Ser Met Met Leu Lys Lys Ile Leu Lys Ile Glu Glu 
1               5                   10                  15      


Leu Asp Glu Arg Glu Leu Ile Asp Ile Glu Val Ser Gly Asn His Leu 
            20                  25                  30          


Phe Tyr Ala Asn Asp Ile Leu Thr His Asn Ser Gly Thr Gly Ser Asp 
        35                  40                  45              


Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr Asp Val Leu Lys 
    50                  55                  60                  


Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala His Trp Cys Gly Pro 
65                  70                  75                  80  


Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp Glu Tyr Gln 
                85                  90                  95      


Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp His Asn Pro Gly Thr 
            100                 105                 110         


Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu Leu Phe Lys 
        115                 120                 125             


Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser Lys Gly Gln 
    130                 135                 140                 


Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser Glu Phe Arg Ser 
145                 150                 155                 160 


His His His His His His 
                165     


<210>  67
<211>  218
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  G1N(deltaext)

<400>  67

Met Ala Ser Trp Ser His Pro Gln Phe Glu Lys Ala Ser Lys Glu Thr 
1               5                   10                  15      


Phe Thr His Tyr Gln Pro Gln Gly Asn Ser Asp Pro Ala His Thr Ala 
            20                  25                  30          


Thr Ala Pro Gly Gly Leu Ser Ala Lys Ala Pro Ala Met Thr Pro Leu 
        35                  40                  45              


Met Leu Asp Thr Ser Ser Arg Lys Leu Val Ala Trp Asp Gly Thr Thr 
    50                  55                  60                  


Asp Gly Ala Ala Val Gly Ile Leu Ala Val Ala Ala Asp Gln Thr Ser 
65                  70                  75                  80  


Thr Thr Leu Thr Phe Tyr Lys Ser Gly Thr Phe Arg Tyr Glu Asp Val 
                85                  90                  95      


Leu Trp Pro Glu Ala Ala Ser Asp Glu Thr Lys Lys Arg Thr Ala Phe 
            100                 105                 110         


Ala Gly Thr Ala Ile Ser Ile Val Gly Ser Cys Leu Asp Leu Lys Thr 
        115                 120                 125             


Gln Val Gln Thr Pro Gln Gly Met Lys Glu Ile Ser Asn Ile Gln Val 
    130                 135                 140                 


Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr Asn Glu Val Leu Asn Val 
145                 150                 155                 160 


Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys Ile Thr Leu Glu Asp Gly 
                165                 170                 175     


Lys Glu Ile Ile Cys Ser Glu Glu His Leu Phe Pro Thr Gln Thr Gly 
            180                 185                 190         


Glu Met Asn Ile Ser Gly Gly Leu Lys Glu Gly Met Cys Leu Tyr Val 
        195                 200                 205             


Lys Glu Gly Gly His His His His His His 
    210                 215             


<210>  68
<211>  170
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  GP41.1 N to A C-fragment protein

<400>  68

Met Gly Lys Asn Ser Met Met Leu Lys Lys Ile Leu Lys Ile Glu Glu 
1               5                   10                  15      


Leu Asp Glu Arg Glu Leu Ile Asp Ile Glu Val Ser Gly Asn His Leu 
            20                  25                  30          


Phe Tyr Ala Asn Asp Ile Leu Thr His Ala Ser Ser Ser Asp Val Gly 
        35                  40                  45              


Thr Gly Ser Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr 
    50                  55                  60                  


Asp Val Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala His 
65                  70                  75                  80  


Trp Cys Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala 
                85                  90                  95      


Asp Glu Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp His 
            100                 105                 110         


Asn Pro Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu 
        115                 120                 125             


Leu Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu 
    130                 135                 140                 


Ser Lys Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser 
145                 150                 155                 160 


Glu Phe Arg Ser His His His His His His 
                165                 170 


<210>  69
<211>  173
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  GP41-8 N to A C-terminal fragment

<400>  69

Met Cys Glu Ile Phe Glu Asn Glu Ile Asp Trp Asp Glu Ile Ala Ser 
1               5                   10                  15      


Ile Glu Tyr Val Gly Val Glu Glu Thr Ile Asp Ile Asn Val Thr Asn 
            20                  25                  30          


Asp Arg Leu Phe Phe Ala Asn Gly Ile Leu Thr His Ala Ser Ala Val 
        35                  40                  45              


Glu Glu Gly Thr Gly Ser Asp Lys Ile Ile His Leu Thr Asp Asp Ser 
    50                  55                  60                  


Phe Asp Thr Asp Val Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe 
65                  70                  75                  80  


Trp Ala His Trp Cys Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp 
                85                  90                  95      


Glu Ile Ala Asp Glu Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn 
            100                 105                 110         


Ile Asp His Asn Pro Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile 
        115                 120                 125             


Pro Thr Leu Leu Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val 
    130                 135                 140                 


Gly Ala Leu Ser Lys Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu 
145                 150                 155                 160 


Ala Gly Ser Glu Phe Arg Ser His His His His His His 
                165                 170             


<210>  70
<211>  168
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  NrdJ1 N to A C-terminal fragment

<400>  70

Met Glu Ala Lys Thr Tyr Ile Gly Lys Leu Lys Ser Arg Lys Ile Val 
1               5                   10                  15      


Ser Asn Glu Asp Thr Tyr Asp Ile Gln Thr Ser Thr His Asn Phe Phe 
            20                  25                  30          


Ala Asn Asp Ile Leu Val His Ala Ser Glu Ile Val Leu Gly Thr Gly 
        35                  40                  45              


Ser Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr Asp Val 
    50                  55                  60                  


Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala His Trp Cys 
65                  70                  75                  80  


Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp Glu 
                85                  90                  95      


Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp His Asn Pro 
            100                 105                 110         


Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu Leu 
        115                 120                 125             


Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser Lys 
    130                 135                 140                 


Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser Glu Phe 
145                 150                 155                 160 


Arg Ser His His His His His His 
                165             


<210>  71
<211>  168
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  IMPDH1 N to A C-terminal fragment

<400>  71

Met Lys Phe Lys Leu Lys Glu Ile Thr Ser Ile Glu Thr Lys His Tyr 
1               5                   10                  15      


Lys Gly Lys Val His Asp Leu Thr Val Asn Gln Asp His Ser Tyr Asn 
            20                  25                  30          


Val Arg Gly Thr Val Val His Ala Ser Ile Cys Ser Thr Gly Thr Gly 
        35                  40                  45              


Ser Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr Asp Val 
    50                  55                  60                  


Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala His Trp Cys 
65                  70                  75                  80  


Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp Glu 
                85                  90                  95      


Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp His Asn Pro 
            100                 105                 110         


Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu Leu 
        115                 120                 125             


Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser Lys 
    130                 135                 140                 


Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser Glu Phe 
145                 150                 155                 160 


Arg Ser His His His His His His 
                165             


<210>  72
<211>  170
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  GP41.1 N/S to A C-terminal fragment

<400>  72

Met Gly Lys Asn Ser Met Met Leu Lys Lys Ile Leu Lys Ile Glu Glu 
1               5                   10                  15      


Leu Asp Glu Arg Glu Leu Ile Asp Ile Glu Val Ser Gly Asn His Leu 
            20                  25                  30          


Phe Tyr Ala Asn Asp Ile Leu Thr His Ala Ala Ser Ser Asp Val Gly 
        35                  40                  45              


Thr Gly Ser Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr 
    50                  55                  60                  


Asp Val Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala His 
65                  70                  75                  80  


Trp Cys Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala 
                85                  90                  95      


Asp Glu Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp His 
            100                 105                 110         


Asn Pro Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu 
        115                 120                 125             


Leu Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu 
    130                 135                 140                 


Ser Lys Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser 
145                 150                 155                 160 


Glu Phe Arg Ser His His His His His His 
                165                 170 


<210>  73
<211>  173
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  GP41.8 N/S to A C-terminal fragment

<400>  73

Met Cys Glu Ile Phe Glu Asn Glu Ile Asp Trp Asp Glu Ile Ala Ser 
1               5                   10                  15      


Ile Glu Tyr Val Gly Val Glu Glu Thr Ile Asp Ile Asn Val Thr Asn 
            20                  25                  30          


Asp Arg Leu Phe Phe Ala Asn Gly Ile Leu Thr His Ala Ala Ala Val 
        35                  40                  45              


Glu Glu Gly Thr Gly Ser Asp Lys Ile Ile His Leu Thr Asp Asp Ser 
    50                  55                  60                  


Phe Asp Thr Asp Val Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe 
65                  70                  75                  80  


Trp Ala His Trp Cys Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp 
                85                  90                  95      


Glu Ile Ala Asp Glu Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn 
            100                 105                 110         


Ile Asp His Asn Pro Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile 
        115                 120                 125             


Pro Thr Leu Leu Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val 
    130                 135                 140                 


Gly Ala Leu Ser Lys Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu 
145                 150                 155                 160 


Ala Gly Ser Glu Phe Arg Ser His His His His His His 
                165                 170             


<210>  74
<211>  168
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  NrdJ1 N/S to A C-terminal fragment

<400>  74

Met Glu Ala Lys Thr Tyr Ile Gly Lys Leu Lys Ser Arg Lys Ile Val 
1               5                   10                  15      


Ser Asn Glu Asp Thr Tyr Asp Ile Gln Thr Ser Thr His Asn Phe Phe 
            20                  25                  30          


Ala Asn Asp Ile Leu Val His Ala Ala Glu Ile Val Leu Gly Thr Gly 
        35                  40                  45              


Ser Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr Asp Val 
    50                  55                  60                  


Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala His Trp Cys 
65                  70                  75                  80  


Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp Glu 
                85                  90                  95      


Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp His Asn Pro 
            100                 105                 110         


Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu Leu 
        115                 120                 125             


Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser Lys 
    130                 135                 140                 


Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser Glu Phe 
145                 150                 155                 160 


Arg Ser His His His His His His 
                165             


<210>  75
<211>  168
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  IMPDH1 N/S to A C-terminal fragment

<400>  75

Met Lys Phe Lys Leu Lys Glu Ile Thr Ser Ile Glu Thr Lys His Tyr 
1               5                   10                  15      


Lys Gly Lys Val His Asp Leu Thr Val Asn Gln Asp His Ser Tyr Asn 
            20                  25                  30          


Val Arg Gly Thr Val Val His Ala Ala Ile Cys Ser Thr Gly Thr Gly 
        35                  40                  45              


Ser Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr Asp Val 
    50                  55                  60                  


Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala His Trp Cys 
65                  70                  75                  80  


Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp Glu 
                85                  90                  95      


Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp His Asn Pro 
            100                 105                 110         


Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu Leu 
        115                 120                 125             


Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser Lys 
    130                 135                 140                 


Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser Glu Phe 
145                 150                 155                 160 


Arg Ser His His His His His His 
                165             


<210>  76
<211>  109
<212>  PRT
<213>  bacteriophage lambda

<400>  76

Lys Glu Thr Phe Thr His Tyr Gln Pro Gln Gly Asn Ser Asp Pro Ala 
1               5                   10                  15      


His Thr Ala Thr Ala Pro Gly Gly Leu Ser Ala Lys Ala Pro Ala Met 
            20                  25                  30          


Thr Pro Leu Met Leu Asp Thr Ser Ser Arg Lys Leu Val Ala Trp Asp 
        35                  40                  45              


Gly Thr Thr Asp Gly Ala Ala Val Gly Ile Leu Ala Val Ala Ala Asp 
    50                  55                  60                  


Gln Thr Ser Thr Thr Leu Thr Phe Tyr Lys Ser Gly Thr Phe Arg Tyr 
65                  70                  75                  80  


Glu Asp Val Leu Trp Pro Glu Ala Ala Ser Asp Glu Thr Lys Lys Arg 
                85                  90                  95      


Thr Ala Phe Ala Gly Thr Ala Ile Ser Ile Val Gly Ser 
            100                 105                 


<210>  77
<211>  111
<212>  PRT
<213>  Escherichia coli

<400>  77

Gly Ser Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr Asp 
1               5                   10                  15      


Val Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala His Trp 
            20                  25                  30          


Cys Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp 
        35                  40                  45              


Glu Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp His Asn 
    50                  55                  60                  


Pro Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu 
65                  70                  75                  80  


Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser 
                85                  90                  95      


Lys Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser 
            100                 105                 110     


<210>  78
<211>  9
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  C1-Box of the C-terminal region of the GP41-1 intein

<400>  78

Ala Asn Asp Ile Leu Thr His Asn Ser 
1               5                   


<210>  79
<211>  88
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  gp41-1 N-intein

<400>  79

Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu 
1               5                   10                  15      


Ile Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr 
            20                  25                  30          


Asn Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys 
        35                  40                  45              


Ile Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu 
    50                  55                  60                  


Phe Pro Thr Gln Thr Gly Glu Met Asn Ile Ser Gly Gly Leu Lys Glu 
65                  70                  75                  80  


Gly Met Cys Leu Tyr Val Lys Glu 
                85              


<210>  80
<211>  27
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  gp41-2 N-intein

<400>  80

Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Gln Gln Gly Leu Lys Asp 
1               5                   10                  15      


Ile Ser Asn Ile Gln Val Gly Asp Leu Val Leu 
            20                  25          


<210>  81
<211>  46
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  gp41-3 N-intein

<400>  81

Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu 
1               5                   10                  15      


Ile Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr 
            20                  25                  30          


Asn Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser 
        35                  40                  45      


<210>  82
<211>  88
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  gp41-4 N-intein

<400>  82

Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu 
1               5                   10                  15      


Ile Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr 
            20                  25                  30          


Asn Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys 
        35                  40                  45              


Ile Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu 
    50                  55                  60                  


Phe Pro Thr Gln Thr Gly Glu Met Asn Ile Ser Gly Gly Leu Lys Glu 
65                  70                  75                  80  


Gly Met Cys Leu Tyr Val Lys Glu 
                85              


<210>  83
<211>  88
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  gp41-5 N-intein

<400>  83

Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu 
1               5                   10                  15      


Ile Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr 
            20                  25                  30          


Asn Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys 
        35                  40                  45              


Ile Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu 
    50                  55                  60                  


Phe Pro Thr Gln Thr Gly Glu Met Asn Ile Ser Gly Gly Leu Lys Glu 
65                  70                  75                  80  


Gly Met Cys Leu Tyr Val Lys Glu 
                85              


<210>  84
<211>  43
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  gp41-6 N-intein

<400>  84

Ser Tyr Lys Ile Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu 
1               5                   10                  15      


Glu His Leu Phe Pro Thr Gln Asn Gly Glu Val Asn Ile Lys Gly Gly 
            20                  25                  30          


Leu Lys Glu Gly Met Cys Leu Tyr Val Lys Glu 
        35                  40              


<210>  85
<211>  88
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  gp41-7 N-intein

<400>  85

Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu 
1               5                   10                  15      


Leu Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr 
            20                  25                  30          


Asn Gln Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys 
        35                  40                  45              


Ile Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu 
    50                  55                  60                  


Phe Pro Thr Gln Asn Gly Glu Val Asn Ile Lys Gly Gly Leu Lys Glu 
65                  70                  75                  80  


Gly Met Cys Leu Tyr Val Lys Glu 
                85              


<210>  86
<211>  89
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  gp41-8 N-intein

<400>  86

Cys Leu Ser Leu Asp Thr Met Val Val Thr Asn Gly Lys Ala Ile Glu 
1               5                   10                  15      


Ile Arg Asp Val Lys Val Gly Asp Trp Leu Glu Ser Glu Cys Gly Pro 
            20                  25                  30          


Val Gln Val Thr Glu Val Leu Pro Ile Ile Lys Gln Pro Val Phe Glu 
        35                  40                  45              


Ile Val Leu Lys Ser Gly Lys Lys Ile Arg Val Ser Ala Asn His Lys 
    50                  55                  60                  


Phe Pro Thr Lys Asp Gly Leu Lys Thr Ile Asn Ser Gly Leu Lys Val 
65                  70                  75                  80  


Gly Asp Phe Leu Arg Ser Arg Ala Lys 
                85                  


<210>  87
<211>  101
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  IMPDH-1 N-intein

<400>  87

Cys Phe Val Pro Gly Thr Leu Val Asn Thr Glu Asn Gly Leu Lys Lys 
1               5                   10                  15      


Ile Glu Glu Ile Lys Val Gly Asp Lys Val Phe Ser His Thr Gly Lys 
            20                  25                  30          


Leu Gln Glu Val Val Asp Thr Leu Ile Phe Asp Arg Asp Glu Glu Ile 
        35                  40                  45              


Ile Ser Ile Asn Gly Ile Asp Cys Thr Lys Asn His Glu Phe Tyr Val 
    50                  55                  60                  


Ile Asp Lys Glu Asn Ala Asn Arg Val Asn Glu Asp Asn Ile His Leu 
65                  70                  75                  80  


Phe Ala Arg Trp Val His Ala Glu Glu Leu Asp Met Lys Lys His Leu 
                85                  90                  95      


Leu Ile Glu Leu Glu 
            100     


<210>  88
<211>  133
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  NrdA-1 N-intein

<400>  88

Cys Val Ala Gly Asp Thr Lys Ile Lys Ile Lys Tyr Pro Glu Ser Val 
1               5                   10                  15      


Gly Asp Gln Tyr Gly Thr Trp Tyr Trp Asn Val Leu Glu Lys Glu Ile 
            20                  25                  30          


Gln Ile Glu Asp Leu Glu Asp Tyr Ile Ile Met Arg Glu Cys Glu Ile 
        35                  40                  45              


Tyr Asp Ser Asn Ala Pro Gln Ile Glu Val Leu Ser Tyr Asn Ile Glu 
    50                  55                  60                  


Thr Gly Glu Gln Glu Trp Lys Pro Ile Thr Ala Phe Ala Gln Thr Ser 
65                  70                  75                  80  


Pro Lys Ala Lys Val Met Lys Ile Thr Asp Glu Glu Ser Gly Lys Ser 
                85                  90                  95      


Ile Val Val Thr Pro Glu His Gln Val Phe Thr Lys Asn Arg Gly Tyr 
            100                 105                 110         


Val Met Ala Lys Asp Leu Ile Glu Thr Asp Glu Pro Ile Ile Val Asn 
        115                 120                 125             


Lys Asp Met Asn Phe 
    130             


<210>  89
<211>  106
<212>  PRT
<213>  Artificial

<220>
<223>  NrdA-2 N-intein

<400>  89

Cys Leu Thr Gly Asp Ala Lys Ile Asp Val Leu Ile Asp Asn Ile Pro 
1               5                   10                  15      


Ile Ser Gln Ile Ser Leu Glu Glu Val Val Asn Leu Phe Asn Glu Gly 
            20                  25                  30          


Lys Glu Ile Tyr Val Leu Ser Tyr Asn Ile Asp Thr Lys Glu Val Glu 
        35                  40                  45              


Tyr Lys Glu Ile Ser Asp Ala Gly Leu Ile Ser Glu Ser Ala Glu Val 
    50                  55                  60                  


Leu Glu Ile Ile Asp Glu Glu Thr Gly Gln Lys Ile Val Cys Thr Pro 
65                  70                  75                  80  


Asp His Lys Val Tyr Thr Leu Asn Arg Gly Tyr Val Ser Ala Lys Asp 
                85                  90                  95      


Leu Lys Glu Asp Asp Glu Leu Val Phe Ser 
            100                 105     


<210>  90
<211>  105
<212>  PRT
<213>  Artificial

<220>
<223>  NrdA-4 N-intein

<400>  90

Cys Leu Ala Gly Asp Thr Thr Val Thr Val Leu Glu Gly Asp Ile Val 
1               5                   10                  15      


Phe Glu Met Thr Leu Glu Asn Leu Val Ser Leu Tyr Lys Asn Val Phe 
            20                  25                  30          


Ser Val Ser Val Leu Ser Phe Asn Pro Glu Thr Gln Lys Gln Glu Phe 
        35                  40                  45              


Lys Pro Val Thr Asn Ala Ala Leu Met Asn Pro Glu Ser Lys Val Leu 
    50                  55                  60                  


Lys Ile Thr Asp Ser Asp Thr Gly Lys Ser Ile Val Cys Thr Pro Asp 
65                  70                  75                  80  


His Lys Val Phe Thr Lys Asn Arg Gly Tyr Val Ile Ala Ser Glu Leu 
                85                  90                  95      


Asn Ala Glu Asp Ile Leu Glu Ile Lys 
            100                 105 


<210>  91
<211>  65
<212>  PRT
<213>  Artificial

<220>
<223>  NrdA-5 N-intein

<400>  91

His Thr Glu Thr Val Arg Arg Val Gly Thr Ile Thr Ala Phe Ala Gln 
1               5                   10                  15      


Thr Ser Pro Lys Ser Lys Val Met Lys Ile Thr Asp Glu Glu Ser Gly 
            20                  25                  30          


Asn Ser Ile Val Val Thr Pro Glu His Lys Val Phe Thr Lys Asn Arg 
        35                  40                  45              


Gly Tyr Val Met Ala Lys Asn Leu Val Glu Thr Asp Glu Leu Val Ile 
    50                  55                  60                  


Asn 
65  


<210>  92
<211>  49
<212>  PRT
<213>  Artificial

<220>
<223>  NrdA-6 N-intein

<400>  92

Tyr Val Cys Ser Arg Asp Asp Thr Thr Gly Phe Lys Leu Ile Cys Thr 
1               5                   10                  15      


Pro Asp His Met Ile Tyr Thr Lys Asn Arg Gly Tyr Ile Met Ala Lys 
            20                  25                  30          


Tyr Leu Lys Glu Asp Asp Glu Leu Leu Ile Asn Glu Ile His Leu Pro 
        35                  40                  45              


Thr 
    


<210>  93
<211>  105
<212>  PRT
<213>  Artificial

<220>
<223>  NrdJ-1 N-intein

<400>  93

Cys Leu Val Gly Ser Ser Glu Ile Ile Thr Arg Asn Tyr Gly Lys Thr 
1               5                   10                  15      


Thr Ile Lys Glu Val Val Glu Ile Phe Asp Asn Asp Lys Asn Ile Gln 
            20                  25                  30          


Val Leu Ala Phe Asn Thr His Thr Asp Asn Ile Glu Trp Ala Pro Ile 
        35                  40                  45              


Lys Ala Ala Gln Leu Thr Arg Pro Asn Ala Glu Leu Val Glu Leu Glu 
    50                  55                  60                  


Ile Asp Thr Leu His Gly Val Lys Thr Ile Arg Cys Thr Pro Asp His 
65                  70                  75                  80  


Pro Val Tyr Thr Lys Asn Arg Gly Tyr Val Arg Ala Asp Glu Leu Thr 
                85                  90                  95      


Asp Asp Asp Glu Leu Val Val Ala Ile 
            100                 105 


<210>  94
<211>  105
<212>  PRT
<213>  Artificial

<220>
<223>  NrdJ-2 N-intein

<400>  94

Cys Leu Val Gly Ser Ser Glu Ile Ile Thr Arg Asn Tyr Gly Lys Thr 
1               5                   10                  15      


Thr Ile Lys Glu Val Val Glu Ile Phe Asp Asn Asp Lys Asn Ile Gln 
            20                  25                  30          


Val Leu Ala Phe Asn Thr His Thr Asp Asn Ile Glu Trp Ala Pro Ile 
        35                  40                  45              


Lys Ala Ala Gln Leu Thr Arg Pro Asn Ala Glu Leu Val Glu Leu Glu 
    50                  55                  60                  


Ile Asn Thr Leu His Gly Val Lys Thr Ile Arg Cys Thr Pro Asp His 
65                  70                  75                  80  


Pro Val Tyr Thr Lys Asn Arg Asp Tyr Val Arg Ala Asp Glu Leu Thr 
                85                  90                  95      


Asp Asp Asp Glu Leu Val Val Ala Ile 
            100                 105 


<210>  95
<211>  38
<212>  PRT
<213>  Artificial

<220>
<223>  gp41-1 C-intein

<400>  95

Met Met Leu Lys Lys Ile Leu Lys Ile Glu Glu Leu Asp Glu Arg Glu 
1               5                   10                  15      


Leu Ile Asp Ile Glu Val Ser Gly Asn His Leu Phe Tyr Ala Asn Asp 
            20                  25                  30          


Ile Leu Thr His Asn Ser 
        35              


<210>  96
<211>  38
<212>  PRT
<213>  Artificial

<220>
<223>  gp41-2 C-intein

<400>  96

Met Met Leu Lys Lys Ile Leu Lys Ile Glu Glu Leu Asp Glu Arg Glu 
1               5                   10                  15      


Leu Ile Asp Ile Glu Val Ser Gly Asn His Leu Phe Tyr Ala Asn Ala 
            20                  25                  30          


Ile Leu Thr His Asn Ser 
        35              


<210>  97
<211>  26
<212>  PRT
<213>  Artificial

<220>
<223>  gp41-7 C-intein

<400>  97

Met Met Leu Lys Lys Ile Leu Lys Ile Glu Glu Leu Asp Glu Arg Glu 
1               5                   10                  15      


Leu Ile Asp Ile Glu Val Ser Gly Asn His 
            20                  25      


<210>  98
<211>  46
<212>  PRT
<213>  Artificial

<220>
<223>  gp41-8 C-intein

<400>  98

Met Cys Glu Ile Phe Glu Asn Glu Ile Asp Trp Asp Glu Ile Ala Ser 
1               5                   10                  15      


Ile Glu Tyr Val Gly Val Glu Glu Thr Ile Asp Ile Asn Val Thr Asn 
            20                  25                  30          


Asp Arg Leu Phe Phe Ala Asn Gly Ile Leu Thr His Asn Ser 
        35                  40                  45      


<210>  99
<211>  47
<212>  PRT
<213>  Artificial

<220>
<223>  gp41-9 C-intein

<400>  99

Met Ile Met Lys Asn Arg Glu Arg Phe Ile Thr Glu Lys Ile Leu Asn 
1               5                   10                  15      


Ile Glu Glu Ile Asp Asp Asp Leu Thr Val Asp Ile Gly Met Asp Asn 
            20                  25                  30          


Glu Asp His Tyr Phe Val Ala Asn Asp Ile Leu Thr His Asn Thr 
        35                  40                  45          


<210>  100
<211>  41
<212>  PRT
<213>  Artificial

<220>
<223>  IMPDH-1 C-intein

<400>  100

Met Lys Phe Lys Leu Lys Glu Ile Thr Ser Ile Glu Thr Lys His Tyr 
1               5                   10                  15      


Lys Gly Lys Val His Asp Leu Thr Val Asn Gln Asp His Ser Tyr Asn 
            20                  25                  30          


Val Arg Gly Thr Val Val His Asn Ser 
        35                  40      


<210>  101
<211>  43
<212>  PRT
<213>  Artificial

<220>
<223>  IMPDH-2 C-intein

<400>  101

Met Lys Phe Thr Leu Glu Pro Ile Thr Lys Ile Asp Ser Tyr Glu Val 
1               5                   10                  15      


Thr Ala Glu Pro Val Tyr Asp Ile Glu Val Glu Asn Asp His Ser Phe 
            20                  25                  30          


Cys Val Glu Asn Gly Phe Val Val His Asn Ser 
        35                  40              


<210>  102
<211>  41
<212>  PRT
<213>  Artificial

<220>
<223>  IMPDH-3 C-intein

<400>  102

Met Lys Phe Lys Leu Val Glu Ile Thr Ser Lys Glu Thr Phe Asn Tyr 
1               5                   10                  15      


Ser Gly Gln Val His Asp Leu Thr Val Glu Asp Asp His Ser Tyr Ser 
            20                  25                  30          


Ile Asn Asn Ile Val Val His Asn Ser 
        35                  40      


<210>  103
<211>  35
<212>  PRT
<213>  Artificial

<220>
<223>  NrdA-2 C-intein

<400>  103

Met Gly Leu Lys Ile Ile Lys Arg Glu Ser Lys Glu Pro Val Phe Asp 
1               5                   10                  15      


Ile Thr Val Lys Asp Asn Ser Asn Phe Phe Ala Asn Asn Ile Leu Val 
            20                  25                  30          


His Asn Cys 
        35  


<210>  104
<211>  34
<212>  PRT
<213>  Artificial

<220>
<223>  NrdA-3 C-intein

<400>  104

Met Leu Lys Ile Glu Tyr Leu Glu Glu Glu Ile Pro Val Tyr Asp Ile 
1               5                   10                  15      


Thr Val Glu Glu Thr His Asn Phe Phe Ala Asn Asp Ile Leu Ile His 
            20                  25                  30          


Asn Cys 
        


<210>  105
<211>  28
<212>  PRT
<213>  Artificial

<220>
<223>  NrdA-5 C-intein

<400>  105

Met Leu Lys Ile Glu Tyr Leu Glu Glu Glu Ile Pro Val Tyr Asp Ile 
1               5                   10                  15      


Thr Val Glu Gly Thr His Asn Leu Ala Tyr Ser Leu 
            20                  25              


<210>  106
<211>  33
<212>  PRT
<213>  Artificial

<220>
<223>  NrdA-6 C-intein

<400>  106

Met Gly Ile Lys Ile Arg Lys Leu Glu Gln Asn Arg Val Tyr Asp Ile 
1               5                   10                  15      


Lys Val Glu Lys Ile Ile Ile Phe Cys Asn Asn Ile Leu Val His Asn 
            20                  25                  30          


Cys 
    


<210>  107
<211>  34
<212>  PRT
<213>  Artificial

<220>
<223>  NrdA-7 C-intein

<400>  107

Met Leu Lys Ile Glu Tyr Leu Glu Glu Glu Ile Pro Val Tyr Asp Ile 
1               5                   10                  15      


Thr Val Glu Lys Thr Asn Asn Phe Phe Ala Asn Asp Ile Leu Val His 
            20                  25                  30          


Asn Cys 
        


<210>  108
<211>  41
<212>  PRT
<213>  Artificial

<220>
<223>  NrdJ-1 C-intein

<400>  108

Met Glu Ala Lys Thr Tyr Ile Gly Lys Leu Lys Ser Arg Lys Ile Val 
1               5                   10                  15      


Ser Asn Glu Asp Thr Tyr Asp Ile Gln Thr Ser Thr His Asn Phe Phe 
            20                  25                  30          


Ala Asn Asp Ile Leu Val His Asn Ser 
        35                  40      


