
                                SEQUENCE LISTING

<110> Su, Wei Wen
      Zhang, Bei
      

<120> AUTO-PROCESSING DOMAINS FOR EXPRESSION
  OF POLYPEPTIDES

<130> UOH.046A

<150> 61/564,808          
<151> 2011-11-29  

<150> 61/563,508          
<151> 2011-11-23  

<160> 54

<170> FastSEQ for Windows Version 4.0

<210> 1
<211> 159
<212> PRT
<213> Artificial Sequence

<220> 
<223> Ssp DnaE intein

<400> 1
Cys Leu Ser Phe Gly Thr Glu Ile Leu Thr Val Glu Tyr Gly Pro Leu
 1               5                  10                  15      
Pro Ile Gly Lys Ile Val Ser Glu Glu Ile Asn Cys Ser Val Tyr Ser
            20                  25                  30          
Val Asp Pro Glu Gly Arg Val Tyr Thr Gln Ala Ile Ala Gln Trp His
        35                  40                  45              
Asp Arg Gly Glu Gln Glu Val Leu Glu Tyr Glu Leu Glu Asp Gly Ser
    50                  55                  60                  
Val Ile Arg Ala Thr Ser Asp His Arg Phe Leu Thr Thr Asp Tyr Gln
65                  70                  75                  80  
Leu Leu Ala Ile Glu Glu Ile Phe Ala Arg Gln Leu Asp Leu Leu Thr
                85                  90                  95      
Leu Glu Asn Ile Lys Gln Thr Glu Glu Ala Leu Asp Asn His Arg Leu
            100                 105                 110         
Pro Phe Pro Leu Leu Asp Ala Gly Thr Ile Lys Met Val Lys Val Ile
        115                 120                 125             
Gly Arg Arg Ser Leu Gly Val Gln Arg Ile Phe Asp Ile Gly Leu Pro
    130                 135                 140                 
Gln Asp His Asn Phe Leu Leu Ala Asn Gly Ala Ile Ala Ala Asn
145                 150                 155                 


<210> 2
<211> 76
<212> PRT
<213> Artificial Sequence

<220> 
<223> UB polypeptide

<400> 2
Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile Thr Leu Glu
 1               5                  10                  15      
Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys Ala Lys Ile Gln Asp
            20                  25                  30          
Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe Ala Gly Arg
        35                  40                  45              
Gln Leu Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn Ile Gln Lys Glu
    50                  55                  60                  
Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly
65                  70                  75      


<210> 3
<211> 103
<212> PRT
<213> Artificial Sequence

<220> 
<223> SUMO polypeptide

<400> 3
Gly Ser Met Ser Asp Gln Glu Ala Lys Pro Ser Thr Glu Asp Leu Gly
 1               5                  10                  15      
Asp Lys Lys Glu Gly Glu Tyr Ile Lys Leu Lys Val Ile Gly Gln Asp
            20                  25                  30          
Ser Ser Glu Ile His Phe Lys Val Lys Met Thr Thr His Leu Lys Lys
        35                  40                  45              
Leu Lys Glu Ser Tyr Cys Gln Arg Gln Gly Val Pro Met Asn Ser Leu
    50                  55                  60                  
Arg Phe Leu Phe Glu Gly Gln Arg Ile Ala Asp Asn His Thr Pro Lys
65                  70                  75                  80  
Glu Leu Gly Met Glu Glu Glu Asp Val Ile Glu Val Tyr Gln Glu Gln
                85                  90                  95      
Thr Gly Gly His Ser Thr Val
            100             


<210> 4
<211> 63
<212> PRT
<213> Artificial Sequence

<220> 
<223> FMDV 2A sequence

<400> 4
Gly Ser Gly Ser Arg Val Thr Glu Leu Leu Tyr Arg Met Lys Arg Ala
 1               5                  10                  15      
Glu Thr Tyr Cys Pro Arg Pro Leu Leu Ala Ile His Pro Thr Glu Ala
            20                  25                  30          
Arg His Lys Gln Lys Ile Val Ala Pro Val Lys Gln Leu Leu Asn Phe
        35                  40                  45              
Asp Leu Leu Lys Leu Ala Gly Asp Val Glu Ser Asn Pro Gly Pro
    50                  55                  60              


<210> 5
<211> 25
<212> PRT
<213> Artificial Sequence

<220> 
<223> (Strongylocentrotus purpuratus) 2A sequence

<400> 5
Asp Gly Phe Cys Ile Leu Tyr Leu Leu Leu Ile Leu Leu Met Arg Ser
 1               5                  10                  15      
Gly Asp Val Glu Thr Asn Pro Gly Pro
            20                  25  


<210> 6
<211> 2367
<212> DNA
<213> Artificial Sequence

<220> 
<223> Intein 190 (pE1775-mGFP172-DnaE
      Intein-2A-mCherry-streptag)

<400> 6
ggtaccgtcg accaaggaga tataacaatg aagactaatc tttttctctt tctcatcttt 60
tcacttctcc tatcattatc ctcggccgaa ttcagtaaag gagaagaact tttcactgga 120
gttgtcccaa ttcttgttga attagatggt gatgttaatg ggcacaaatt ttctgtcagt 180
ggagagggtg aaggtgatgc aacatacgga aaacttaccc ttaaatttat ttgcactact 240
ggaaaactac ctgttccttg gccaacactt gtcactactt tcacttatgg tgttcaatgc 300
ttttcaagat acccagatca tatgaagcgg cacgacttct tcaagagcgc catgcctgag 360
ggatacgtgc aggagaggac catcttcttc aaggacgacg ggaactacaa gacacgtgct 420
gaagtcaagt ttgagggaga caccctcgtc aacaggatcg agcttaaggg aatcgatttc 480
aaggaggacg gaaacatcct cggccacaag ttggaataca actacaactc ccacaacgta 540
tacatcatgg ccgacaagca aaagaacggc atcaaagcca acttcaagac ccgccacaac 600
atcgaacacc atcaccatca ccatgacggc ggcgtgcaac tcgctgatca ttatcaacaa 660
aatactccaa ttggcgatgg ccctgtcctt ttaccagaca accattacct gtccacacaa 720
tctgcccttt cgaaagatcc caacgaaaag agagaccaca tggtccttct tgagtttgta 780
acagctgctg ggattacaca tggcatggat gaactataca aactcgaggg aggatctaag 840
tttgcaaatg attgtttgtc cttcggaact gagatactta cagttgaata tggaccactt 900
cctattggaa agattgtgag tgaagagatc aactgcagtg tttattccgt ggatccagag 960
ggtagagttt acactcaagc aattgctcag tggcatgata ggggagaaca ggaggttctt 1020
gaatatgagt tggaagatgg ttctgtgata agagctacat cagatcacag gtttcttact 1080
acagattacc aacttttggc aatcgaagag attttcgcta gacagctcga tcttctcact 1140
ttggaaaata ttaagcaaac agaagaggca cttgataacc ataggcttcc atttcctctt 1200
ttggatgctg gaactattaa gatggttaaa gtgataggaa gaaggtcatt gggtgttcaa 1260
agaatatttg atatcggact tcctcaggat cacaatttct tactcgcaaa cggtgctatt 1320
gctgcagctt gtttcaatgg ttctggttct agagttactg agcttttgta taggatgaag 1380
agggcagaaa catactgccc aagaccttta ctcgcaatcc atccaacaga ggctaggcac 1440
aagcaaaaaa ttgttgctcc tgtgaaacag cttttgaact ttgatcttct caagcttgcg 1500
ggagacgtcg agtccaaccc tgggccccag gtgctgaaca ccatggtgaa caaacacttc 1560
ttgtcccttt cggtcctcat cgtcctcctt ggcctctcct ccaacttgac agccggcatg 1620
ctgagcaagg gcgaggagga taacatggcc atcatcaagg agttcatgcg cttcaaggtg 1680
cacatggagg gctccgtgaa cggccacgag ttcgagatcg agggcgaggg cgagggccgc 1740
ccctacgagg gcacccagac cgccaagctg aaggtgacca agggtggccc cctgcccttc 1800
gcctgggaca tcctgtcccc tcagttcatg tacggctcca aggcctacgt gaagcacccc 1860
gccgacatcc ccgactactt gaagctgtcc ttccccgagg gcttcaagtg ggagcgcgtg 1920
atgaacttcg aggacggcgg cgtggtgacc gtgacccagg actcctccct gcaggacggc 1980
gagttcatct acaaggtgaa gctgcgcggc accaacttcc cctccgacgg ccccgtaatg 2040
cagaagaaga ccatgggctg ggaggcctcc tccgagcgga tgtaccccga ggacggcgcc 2100
ctgaagggcg agatcaagca gaggctgaag ctgaaggacg gcggccacta cgacgctgag 2160
gtcaagacca cctacaaggc caagaagccc gtgcagctgc ccggcgccta caacgtcaac 2220
atcaagttgg acatcacctc ccacaacgag gactacacca tcgtggaaca gtacgaacgc 2280
gccgagggcc gccactccac cggcggcatg gacgagctgt acaagggttc tggatggtca 2340
catcctcagt ttgaaaaatg agagctc                                     2367

<210> 7
<211> 777
<212> PRT
<213> Artificial Sequence

<220> 
<223> Intein 190 (pE1775-mGFP172-DnaE
      Intein-2A-mCherry-streptag)

<400> 7
Met Lys Thr Asn Leu Phe Leu Phe Leu Ile Phe Ser Leu Leu Leu Ser
 1               5                  10                  15      
Leu Ser Ser Ala Glu Phe Ser Lys Gly Glu Glu Leu Phe Thr Gly Val
            20                  25                  30          
Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe
        35                  40                  45              
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr
    50                  55                  60                  
Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr
65                  70                  75                  80  
Leu Val Thr Thr Phe Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro
                85                  90                  95      
Asp His Met Lys Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly
            100                 105                 110         
Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys
        115                 120                 125             
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile
    130                 135                 140                 
Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His
145                 150                 155                 160 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp
                165                 170                 175     
Lys Gln Lys Asn Gly Ile Lys Ala Asn Phe Lys Thr Arg His Asn Ile
            180                 185                 190         
Glu His His His His His His Asp Gly Gly Val Gln Leu Ala Asp His
        195                 200                 205             
Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp
    210                 215                 220                 
Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu
225                 230                 235                 240 
Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile
                245                 250                 255     
Thr His Gly Met Asp Glu Leu Tyr Lys Leu Glu Gly Gly Ser Lys Phe
            260                 265                 270         
Ala Asn Asp Cys Leu Ser Phe Gly Thr Glu Ile Leu Thr Val Glu Tyr
        275                 280                 285             
Gly Pro Leu Pro Ile Gly Lys Ile Val Ser Glu Glu Ile Asn Cys Ser
    290                 295                 300                 
Val Tyr Ser Val Asp Pro Glu Gly Arg Val Tyr Thr Gln Ala Ile Ala
305                 310                 315                 320 
Gln Trp His Asp Arg Gly Glu Gln Glu Val Leu Glu Tyr Glu Leu Glu
                325                 330                 335     
Asp Gly Ser Val Ile Arg Ala Thr Ser Asp His Arg Phe Leu Thr Thr
            340                 345                 350         
Asp Tyr Gln Leu Leu Ala Ile Glu Glu Ile Phe Ala Arg Gln Leu Asp
        355                 360                 365             
Leu Leu Thr Leu Glu Asn Ile Lys Gln Thr Glu Glu Ala Leu Asp Asn
    370                 375                 380                 
His Arg Leu Pro Phe Pro Leu Leu Asp Ala Gly Thr Ile Lys Met Val
385                 390                 395                 400 
Lys Val Ile Gly Arg Arg Ser Leu Gly Val Gln Arg Ile Phe Asp Ile
                405                 410                 415     
Gly Leu Pro Gln Asp His Asn Phe Leu Leu Ala Asn Gly Ala Ile Ala
            420                 425                 430         
Ala Ala Cys Phe Asn Gly Ser Gly Ser Arg Val Thr Glu Leu Leu Tyr
        435                 440                 445             
Arg Met Lys Arg Ala Glu Thr Tyr Cys Pro Arg Pro Leu Leu Ala Ile
    450                 455                 460                 
His Pro Thr Glu Ala Arg His Lys Gln Lys Ile Val Ala Pro Val Lys
465                 470                 475                 480 
Gln Leu Leu Asn Phe Asp Leu Leu Lys Leu Ala Gly Asp Val Glu Ser
                485                 490                 495     
Asn Pro Gly Pro Gln Val Leu Asn Thr Met Val Asn Lys His Phe Leu
            500                 505                 510         
Ser Leu Ser Val Leu Ile Val Leu Leu Gly Leu Ser Ser Asn Leu Thr
        515                 520                 525             
Ala Gly Met Leu Ser Lys Gly Glu Glu Asp Asn Met Ala Ile Ile Lys
    530                 535                 540                 
Glu Phe Met Arg Phe Lys Val His Met Glu Gly Ser Val Asn Gly His
545                 550                 555                 560 
Glu Phe Glu Ile Glu Gly Glu Gly Glu Gly Arg Pro Tyr Glu Gly Thr
                565                 570                 575     
Gln Thr Ala Lys Leu Lys Val Thr Lys Gly Gly Pro Leu Pro Phe Ala
            580                 585                 590         
Trp Asp Ile Leu Ser Pro Gln Phe Met Tyr Gly Ser Lys Ala Tyr Val
        595                 600                 605             
Lys His Pro Ala Asp Ile Pro Asp Tyr Leu Lys Leu Ser Phe Pro Glu
    610                 615                 620                 
Gly Phe Lys Trp Glu Arg Val Met Asn Phe Glu Asp Gly Gly Val Val
625                 630                 635                 640 
Thr Val Thr Gln Asp Ser Ser Leu Gln Asp Gly Glu Phe Ile Tyr Lys
                645                 650                 655     
Val Lys Leu Arg Gly Thr Asn Phe Pro Ser Asp Gly Pro Val Met Gln
            660                 665                 670         
Lys Lys Thr Met Gly Trp Glu Ala Ser Ser Glu Arg Met Tyr Pro Glu
        675                 680                 685             
Asp Gly Ala Leu Lys Gly Glu Ile Lys Gln Arg Leu Lys Leu Lys Asp
    690                 695                 700                 
Gly Gly His Tyr Asp Ala Glu Val Lys Thr Thr Tyr Lys Ala Lys Lys
705                 710                 715                 720 
Pro Val Gln Leu Pro Gly Ala Tyr Asn Val Asn Ile Lys Leu Asp Ile
                725                 730                 735     
Thr Ser His Asn Glu Asp Tyr Thr Ile Val Glu Gln Tyr Glu Arg Ala
            740                 745                 750         
Glu Gly Arg His Ser Thr Gly Gly Met Asp Glu Leu Tyr Lys Gly Ser
        755                 760                 765             
Gly Trp Ser His Pro Gln Phe Glu Lys
    770                 775         


<210> 8
<211> 2181
<212> DNA
<213> Artificial Sequence

<220> 
<223> Intein-26: pE1775-KpnI-SalI-mGFP172-DnaE
      intein-SC-2A-mCherry-Streptag

<400> 8
ggtaccgtcg accaaggaga tataacaatg agtaaaggag aagaactttt cactggagtt 60
gtcccaattc ttgttgaatt agatggtgat gttaatgggc acaaattttc tgtcagtgga 120
gagggtgaag gtgatgcaac atacggaaaa cttaccctta aatttatttg cactactgga 180
aaactacctg ttccttggcc aacacttgtc actactttca cttatggtgt tcaatgcttt 240
tcaagatacc cagatcatat gaagcggcac gacttcttca agagcgccat gcctgaggga 300
tacgtgcagg agaggaccat cttcttcaag gacgacggga actacaagac acgtgctgaa 360
gtcaagtttg agggagacac cctcgtcaac aggatcgagc ttaagggaat cgatttcaag 420
gaggacggaa acatcctcgg ccacaagttg gaatacaact acaactccca caacgtatac 480
atcatggccg acaagcaaaa gaacggcatc aaagccaact tcaagacccg ccacaacatc 540
gaacaccatc accatcacca tgacggcggc gtgcaactcg ctgatcatta tcaacaaaat 600
actccaattg gcgatggccc tgtcctttta ccagacaacc attacctgtc cacacaatct 660
gccctttcga aagatcccaa cgaaaagaga gaccacatgg tccttcttga gtttgtaaca 720
gctgctggga ttacacatgg catggatgaa ctatacaaat gtttgtcctt cggaactgag 780
atacttacag ttgaatatgg accacttcct attggaaaga ttgtgagtga agagatcaac 840
tgcagtgttt attccgtgga tccagagggt agagtttaca ctcaagcaat tgctcagtgg 900
catgataggg gagaacagga ggttcttgaa tatgagttgg aagatggttc tgtgataaga 960
gctacatcag atcacaggtt tcttactaca gattaccaac ttttggcaat cgaagagatt 1020
ttcgctagac agctcgatct tctcactttg gaaaatatta agcaaacaga agaggcactt 1080
gataaccata ggcttccatt tcctcttttg gatgctggaa ctattaagat ggttaaagtg 1140
ataggaagaa ggtcattggg tgttcaaaga atatttgata tcggacttcc tcaggatcac 1200
aatttcttac tcgcaaacgg tgctattgct gcagcttgtt cttgtggttc tggttctaga 1260
gttactgagc ttttgtatag gatgaagagg gcagaaacat actgcccaag acctttactc 1320
gcaatccatc caacagaggc taggcacaag caaaaaattg ttgctcctgt gaaacagctt 1380
ttgaactttg atcttctcaa gcttgcggga gacgtcgagt ccaaccctgg gcccgtgagc 1440
aagggcgagg aggataacat ggccatcatc aaggagttca tgcgcttcaa ggtgcacatg 1500
gagggctccg tgaacggcca cgagttcgag atcgagggcg agggcgaggg ccgcccctac 1560
gagggcaccc agaccgccaa gctgaaggtg accaagggtg gccccctgcc cttcgcctgg 1620
gacatcctgt cccctcagtt catgtacggc tccaaggcct acgtgaagca ccccgccgac 1680
atccccgact acttgaagct gtccttcccc gagggcttca agtgggagcg cgtgatgaac 1740
ttcgaggacg gcggcgtggt gaccgtgacc caggactcct ccctgcagga cggcgagttc 1800
atctacaagg tgaagctgcg cggcaccaac ttcccctccg acggccccgt aatgcagaag 1860
aagaccatgg gctgggaggc ctcctccgag cggatgtacc ccgaggacgg cgccctgaag 1920
ggcgagatca agcagaggct gaagctgaag gacggcggcc actacgacgc tgaggtcaag 1980
accacctaca aggccaagaa gcccgtgcag ctgcccggcg cctacaacgt caacatcaag 2040
ttggacatca cctcccacaa cgaggactac accatcgtgg aacagtacga acgcgccgag 2100
ggccgccact ccaccggcgg catggacgag ctgtacaagg gttctggatg gtcacatcct 2160
cagtttgaaa aatgagagct c                                           2181

<210> 9
<211> 715
<212> PRT
<213> Artificial Sequence

<220> 
<223> Intein-26: pE1775-KpnI-SalI-mGFP172-DnaE
      intein-SC-2A-mCherry-Streptag

<400> 9
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
 1               5                  10                  15      
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu
            20                  25                  30          
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys
        35                  40                  45              
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe
    50                  55                  60                  
Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg
65                  70                  75                  80  
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
                85                  90                  95      
Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val
            100                 105                 110         
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
        115                 120                 125             
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
    130                 135                 140                 
Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
145                 150                 155                 160 
Ile Lys Ala Asn Phe Lys Thr Arg His Asn Ile Glu His His His His
                165                 170                 175     
His His Asp Gly Gly Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr
            180                 185                 190         
Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser
        195                 200                 205             
Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met
    210                 215                 220                 
Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr His Gly Met Asp
225                 230                 235                 240 
Glu Leu Tyr Lys Cys Leu Ser Phe Gly Thr Glu Ile Leu Thr Val Glu
                245                 250                 255     
Tyr Gly Pro Leu Pro Ile Gly Lys Ile Val Ser Glu Glu Ile Asn Cys
            260                 265                 270         
Ser Val Tyr Ser Val Asp Pro Glu Gly Arg Val Tyr Thr Gln Ala Ile
        275                 280                 285             
Ala Gln Trp His Asp Arg Gly Glu Gln Glu Val Leu Glu Tyr Glu Leu
    290                 295                 300                 
Glu Asp Gly Ser Val Ile Arg Ala Thr Ser Asp His Arg Phe Leu Thr
305                 310                 315                 320 
Thr Asp Tyr Gln Leu Leu Ala Ile Glu Glu Ile Phe Ala Arg Gln Leu
                325                 330                 335     
Asp Leu Leu Thr Leu Glu Asn Ile Lys Gln Thr Glu Glu Ala Leu Asp
            340                 345                 350         
Asn His Arg Leu Pro Phe Pro Leu Leu Asp Ala Gly Thr Ile Lys Met
        355                 360                 365             
Val Lys Val Ile Gly Arg Arg Ser Leu Gly Val Gln Arg Ile Phe Asp
    370                 375                 380                 
Ile Gly Leu Pro Gln Asp His Asn Phe Leu Leu Ala Asn Gly Ala Ile
385                 390                 395                 400 
Ala Ala Ala Cys Ser Cys Gly Ser Gly Ser Arg Val Thr Glu Leu Leu
                405                 410                 415     
Tyr Arg Met Lys Arg Ala Glu Thr Tyr Cys Pro Arg Pro Leu Leu Ala
            420                 425                 430         
Ile His Pro Thr Glu Ala Arg His Lys Gln Lys Ile Val Ala Pro Val
        435                 440                 445             
Lys Gln Leu Leu Asn Phe Asp Leu Leu Lys Leu Ala Gly Asp Val Glu
    450                 455                 460                 
Ser Asn Pro Gly Pro Val Ser Lys Gly Glu Glu Asp Asn Met Ala Ile
465                 470                 475                 480 
Ile Lys Glu Phe Met Arg Phe Lys Val His Met Glu Gly Ser Val Asn
                485                 490                 495     
Gly His Glu Phe Glu Ile Glu Gly Glu Gly Glu Gly Arg Pro Tyr Glu
            500                 505                 510         
Gly Thr Gln Thr Ala Lys Leu Lys Val Thr Lys Gly Gly Pro Leu Pro
        515                 520                 525             
Phe Ala Trp Asp Ile Leu Ser Pro Gln Phe Met Tyr Gly Ser Lys Ala
    530                 535                 540                 
Tyr Val Lys His Pro Ala Asp Ile Pro Asp Tyr Leu Lys Leu Ser Phe
545                 550                 555                 560 
Pro Glu Gly Phe Lys Trp Glu Arg Val Met Asn Phe Glu Asp Gly Gly
                565                 570                 575     
Val Val Thr Val Thr Gln Asp Ser Ser Leu Gln Asp Gly Glu Phe Ile
            580                 585                 590         
Tyr Lys Val Lys Leu Arg Gly Thr Asn Phe Pro Ser Asp Gly Pro Val
        595                 600                 605             
Met Gln Lys Lys Thr Met Gly Trp Glu Ala Ser Ser Glu Arg Met Tyr
    610                 615                 620                 
Pro Glu Asp Gly Ala Leu Lys Gly Glu Ile Lys Gln Arg Leu Lys Leu
625                 630                 635                 640 
Lys Asp Gly Gly His Tyr Asp Ala Glu Val Lys Thr Thr Tyr Lys Ala
                645                 650                 655     
Lys Lys Pro Val Gln Leu Pro Gly Ala Tyr Asn Val Asn Ile Lys Leu
            660                 665                 670         
Asp Ile Thr Ser His Asn Glu Asp Tyr Thr Ile Val Glu Gln Tyr Glu
        675                 680                 685             
Arg Ala Glu Gly Arg His Ser Thr Gly Gly Met Asp Glu Leu Tyr Lys
    690                 695                 700                 
Gly Ser Gly Trp Ser His Pro Gln Phe Glu Lys
705                 710                 715 


<210> 10
<211> 2229
<212> DNA
<213> Artificial Sequence

<220> 
<223> mGFP172-DnaE intein w/o
      N-extein-UBQ11-mCherry-streptag

<400> 10
ggtaccgtcg accaaggaga tataacaatg agtaaaggag aagaactttt cactggagtt 60
gtcccaattc ttgttgaatt agatggtgat gttaatgggc acaaattttc tgtcagtgga 120
gagggtgaag gtgatgcaac atacggaaaa cttaccctta aatttatttg cactactgga 180
aaactacctg ttccttggcc aacacttgtc actactttca cttatggtgt tcaatgcttt 240
tcaagatacc cagatcatat gaagcggcac gacttcttca agagcgccat gcctgaggga 300
tacgtgcagg agaggaccat cttcttcaag gacgacggga actacaagac acgtgctgaa 360
gtcaagtttg agggagacac cctcgtcaac aggatcgagc ttaagggaat cgatttcaag 420
gaggacggaa acatcctcgg ccacaagttg gaatacaact acaactccca caacgtatac 480
atcatggccg acaagcaaaa gaacggcatc aaagccaact tcaagacccg ccacaacatc 540
gaacaccatc accatcacca tgacggcggc gtgcaactcg ctgatcatta tcaacaaaat 600
actccaattg gcgatggccc tgtcctttta ccagacaacc attacctgtc cacacaatct 660
gccctttcga aagatcccaa cgaaaagaga gaccacatgg tccttcttga gtttgtaaca 720
gctgctggga ttacacatgg catggatgaa ctatacaaat gtttgtcctt cggaactgag 780
atacttacag ttgaatatgg accacttcct attggaaaga ttgtgagtga agagatcaac 840
tgcagtgttt attccgtgga tccagagggt agagtttaca ctcaagcaat tgctcagtgg 900
catgataggg gagaacagga ggttcttgaa tatgagttgg aagatggttc tgtgataaga 960
gctacatcag atcacaggtt tcttactaca gattaccaac ttttggcaat cgaagagatt 1020
ttcgctagac agctcgatct tctcactttg gaaaatatta agcaaacaga agaggcactt 1080
gataaccata ggcttccatt tcctcttttg gatgctggaa ctattaagat ggttaaagtg 1140
ataggaagaa ggtcattggg tgttcaaaga atatttgata tcggacttcc tcaggatcac 1200
aatttcttac tcgcaaacgg tgctattgct gcagcttgtt cttgtggttc tggtatgcag 1260
atcttcgtaa agactttgac cggaaagacc atcactcttg aagttgaaag ctccgacacc 1320
attgataacg tgaaggctaa gatccaggac aaggaaggca ttcctccgga ccagcagcgt 1380
ctcatcttcg ctggaaggca gcttgaggat ggacgtactt tggccgacta caacatccag 1440
aaggagtcca ctcttcactt ggtcctccgt ctccgcggcg gtgtgagcaa gggcgaggag 1500
gataacatgg ccatcatcaa ggagttcatg cgcttcaagg tgcacatgga gggctccgtg 1560
aacggccacg agttcgagat cgagggcgag ggcgagggcc gcccctacga gggcacccag 1620
accgccaagc tgaaggtgac caagggtggc cccctgccct tcgcctggga catcctgtcc 1680
cctcagttca tgtacggctc caaggcctac gtgaagcacc ccgccgacat ccccgactac 1740
ttgaagctgt ccttccccga gggcttcaag tgggagcgcg tgatgaactt cgaggacggc 1800
ggcgtggtga ccgtgaccca ggactcctcc ctgcaggacg gcgagttcat ctacaaggtg 1860
aagctgcgcg gcaccaactt cccctccgac ggccccgtaa tgcagaagaa gaccatgggc 1920
tgggaggcct cctccgagcg gatgtacccc gaggacggcg ccctgaaggg cgagatcaag 1980
cagaggctga agctgaagga cggcggccac tacgacgctg aggtcaagac cacctacaag 2040
gccaagaagc ccgtgcagct gcccggcgcc tacaacgtca acatcaagtt ggacatcacc 2100
tcccacaacg aggactacac catcgtggaa cagtacgaac gcgccgaggg ccgccactcc 2160
accggcggca tggacgagct gtacaagggt tctggatggt cacatcctca gtttgaaaaa 2220
tgagagctc                                                         2229

<210> 11
<211> 731
<212> PRT
<213> Artificial Sequence

<220> 
<223> mGFP172-DnaE intein w/o
      N-extein-UBQ11-mCherry-streptag

<400> 11
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
 1               5                  10                  15      
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu
            20                  25                  30          
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys
        35                  40                  45              
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe
    50                  55                  60                  
Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg
65                  70                  75                  80  
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
                85                  90                  95      
Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val
            100                 105                 110         
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
        115                 120                 125             
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
    130                 135                 140                 
Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
145                 150                 155                 160 
Ile Lys Ala Asn Phe Lys Thr Arg His Asn Ile Glu His His His His
                165                 170                 175     
His His Asp Gly Gly Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr
            180                 185                 190         
Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser
        195                 200                 205             
Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met
    210                 215                 220                 
Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr His Gly Met Asp
225                 230                 235                 240 
Glu Leu Tyr Lys Cys Leu Ser Phe Gly Thr Glu Ile Leu Thr Val Glu
                245                 250                 255     
Tyr Gly Pro Leu Pro Ile Gly Lys Ile Val Ser Glu Glu Ile Asn Cys
            260                 265                 270         
Ser Val Tyr Ser Val Asp Pro Glu Gly Arg Val Tyr Thr Gln Ala Ile
        275                 280                 285             
Ala Gln Trp His Asp Arg Gly Glu Gln Glu Val Leu Glu Tyr Glu Leu
    290                 295                 300                 
Glu Asp Gly Ser Val Ile Arg Ala Thr Ser Asp His Arg Phe Leu Thr
305                 310                 315                 320 
Thr Asp Tyr Gln Leu Leu Ala Ile Glu Glu Ile Phe Ala Arg Gln Leu
                325                 330                 335     
Asp Leu Leu Thr Leu Glu Asn Ile Lys Gln Thr Glu Glu Ala Leu Asp
            340                 345                 350         
Asn His Arg Leu Pro Phe Pro Leu Leu Asp Ala Gly Thr Ile Lys Met
        355                 360                 365             
Val Lys Val Ile Gly Arg Arg Ser Leu Gly Val Gln Arg Ile Phe Asp
    370                 375                 380                 
Ile Gly Leu Pro Gln Asp His Asn Phe Leu Leu Ala Asn Gly Ala Ile
385                 390                 395                 400 
Ala Ala Ala Cys Ser Cys Gly Ser Gly Met Gln Ile Phe Val Lys Thr
                405                 410                 415     
Leu Thr Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr Ile
            420                 425                 430         
Asp Asn Val Lys Ala Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp
        435                 440                 445             
Gln Gln Arg Leu Ile Phe Ala Gly Arg Gln Leu Glu Asp Gly Arg Thr
    450                 455                 460                 
Leu Ala Asp Tyr Asn Ile Gln Lys Glu Ser Thr Leu His Leu Val Leu
465                 470                 475                 480 
Arg Leu Arg Gly Gly Val Ser Lys Gly Glu Glu Asp Asn Met Ala Ile
                485                 490                 495     
Ile Lys Glu Phe Met Arg Phe Lys Val His Met Glu Gly Ser Val Asn
            500                 505                 510         
Gly His Glu Phe Glu Ile Glu Gly Glu Gly Glu Gly Arg Pro Tyr Glu
        515                 520                 525             
Gly Thr Gln Thr Ala Lys Leu Lys Val Thr Lys Gly Gly Pro Leu Pro
    530                 535                 540                 
Phe Ala Trp Asp Ile Leu Ser Pro Gln Phe Met Tyr Gly Ser Lys Ala
545                 550                 555                 560 
Tyr Val Lys His Pro Ala Asp Ile Pro Asp Tyr Leu Lys Leu Ser Phe
                565                 570                 575     
Pro Glu Gly Phe Lys Trp Glu Arg Val Met Asn Phe Glu Asp Gly Gly
            580                 585                 590         
Val Val Thr Val Thr Gln Asp Ser Ser Leu Gln Asp Gly Glu Phe Ile
        595                 600                 605             
Tyr Lys Val Lys Leu Arg Gly Thr Asn Phe Pro Ser Asp Gly Pro Val
    610                 615                 620                 
Met Gln Lys Lys Thr Met Gly Trp Glu Ala Ser Ser Glu Arg Met Tyr
625                 630                 635                 640 
Pro Glu Asp Gly Ala Leu Lys Gly Glu Ile Lys Gln Arg Leu Lys Leu
                645                 650                 655     
Lys Asp Gly Gly His Tyr Asp Ala Glu Val Lys Thr Thr Tyr Lys Ala
            660                 665                 670         
Lys Lys Pro Val Gln Leu Pro Gly Ala Tyr Asn Val Asn Ile Lys Leu
        675                 680                 685             
Asp Ile Thr Ser His Asn Glu Asp Tyr Thr Ile Val Glu Gln Tyr Glu
    690                 695                 700                 
Arg Ala Glu Gly Arg His Ser Thr Gly Gly Met Asp Glu Leu Tyr Lys
705                 710                 715                 720 
Gly Ser Gly Trp Ser His Pro Gln Phe Glu Lys
                725                 730     


<210> 12
<211> 2082
<212> DNA
<213> Artificial Sequence

<220> 
<223> pE1775-Int261 (KpnISalI-GFP172-DnaE Intein w/o
      N-exteins-Sea urchin 2A-mCherry-streptag)

<400> 12
ggtaccgtcg accaaggaga tataacaatg agtaaaggag aagaactttt cactggagtt 60
gtcccaattc ttgttgaatt agatggtgat gttaatgggc acaaattttc tgtcagtgga 120
gagggtgaag gtgatgcaac atacggaaaa cttaccctta aatttatttg cactactgga 180
aaactacctg ttccttggcc aacacttgtc actactttca cttatggtgt tcaatgcttt 240
tcaagatacc cagatcatat gaagcggcac gacttcttca agagcgccat gcctgaggga 300
tacgtgcagg agaggaccat cttcttcaag gacgacggga actacaagac acgtgctgaa 360
gtcaagtttg agggagacac cctcgtcaac aggatcgagc ttaagggaat cgatttcaag 420
gaggacggaa acatcctcgg ccacaagttg gaatacaact acaactccca caacgtatac 480
atcatggccg acaagcaaaa gaacggcatc aaagccaact tcaagacccg ccacaacatc 540
gaacaccatc accatcacca tgacggcggc gtgcaactcg ctgatcatta tcaacaaaat 600
actccaattg gcgatggccc tgtcctttta ccagacaacc attacctgtc cacacaatct 660
gccctttcga aagatcccaa cgaaaagaga gaccacatgg tccttcttga gtttgtaaca 720
gctgctggga ttacacatgg catggatgaa ctatacaaat gtttgtcctt cggaactgag 780
atacttacag ttgaatatgg accacttcct attggaaaga ttgtgagtga agagatcaac 840
tgcagtgttt attccgtgga tccagagggt agagtttaca ctcaagcaat tgctcagtgg 900
catgataggg gagaacagga ggttcttgaa tatgagttgg aagatggttc tgtgataaga 960
gctacatcag atcacaggtt tcttactaca gattaccaac ttttggcaat cgaagagatt 1020
ttcgctagac agctcgatct tctcactttg gaaaatatta agcaaacaga agaggcactt 1080
gataaccata ggcttccatt tcctcttttg gatgctggaa ctattaagat ggttaaagtg 1140
ataggaagaa ggtcattggg tgttcaaaga atatttgata tcggacttcc tcaggatcac 1200
aatttcttac tcgcaaacgg tgctattgct gcagcttgtt cttgtggttc tggttctaga 1260
gatggattct gcattctcta tctgctcctg atcctcttga tgagatctgg tgacgttgaa 1320
accaatccag ggcccgtgag caagggcgag gaggataaca tggccatcat caaggagttc 1380
atgcgcttca aggtgcacat ggagggctcc gtgaacggcc acgagttcga gatcgagggc 1440
gagggcgagg gccgccccta cgagggcacc cagaccgcca agctgaaggt gaccaagggt 1500
ggccccctgc ccttcgcctg ggacatcctg tcccctcagt tcatgtacgg ctccaaggcc 1560
tacgtgaagc accccgccga catccccgac tacttgaagc tgtccttccc cgagggcttc 1620
aagtgggagc gcgtgatgaa cttcgaggac ggcggcgtgg tgaccgtgac ccaggactcc 1680
tccctgcagg acggcgagtt catctacaag gtgaagctgc gcggcaccaa cttcccctcc 1740
gacggccccg taatgcagaa gaagaccatg ggctgggagg cctcctccga gcggatgtac 1800
cccgaggacg gcgccctgaa gggcgagatc aagcagaggc tgaagctgaa ggacggcggc 1860
cactacgacg ctgaggtcaa gaccacctac aaggccaaga agcccgtgca gctgcccggc 1920
gcctacaacg tcaacatcaa gttggacatc acctcccaca acgaggacta caccatcgtg 1980
gaacagtacg aacgcgccga gggccgccac tccaccggcg gcatggacga gctgtacaag 2040
ggttctggat ggtcacatcc tcagtttgaa aaatgagagc tc                    2082

<210> 13
<211> 682
<212> PRT
<213> Artificial Sequence

<220> 
<223> pE1775-Int261 (KpnISalI-GFP172-DnaE Intein w/o
      N-exteins-Sea urchin 2A-mCherry-streptag)

<400> 13
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
 1               5                  10                  15      
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu
            20                  25                  30          
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys
        35                  40                  45              
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe
    50                  55                  60                  
Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg
65                  70                  75                  80  
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
                85                  90                  95      
Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val
            100                 105                 110         
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
        115                 120                 125             
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
    130                 135                 140                 
Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
145                 150                 155                 160 
Ile Lys Ala Asn Phe Lys Thr Arg His Asn Ile Glu His His His His
                165                 170                 175     
His His Asp Gly Gly Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr
            180                 185                 190         
Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser
        195                 200                 205             
Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met
    210                 215                 220                 
Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr His Gly Met Asp
225                 230                 235                 240 
Glu Leu Tyr Lys Cys Leu Ser Phe Gly Thr Glu Ile Leu Thr Val Glu
                245                 250                 255     
Tyr Gly Pro Leu Pro Ile Gly Lys Ile Val Ser Glu Glu Ile Asn Cys
            260                 265                 270         
Ser Val Tyr Ser Val Asp Pro Glu Gly Arg Val Tyr Thr Gln Ala Ile
        275                 280                 285             
Ala Gln Trp His Asp Arg Gly Glu Gln Glu Val Leu Glu Tyr Glu Leu
    290                 295                 300                 
Glu Asp Gly Ser Val Ile Arg Ala Thr Ser Asp His Arg Phe Leu Thr
305                 310                 315                 320 
Thr Asp Tyr Gln Leu Leu Ala Ile Glu Glu Ile Phe Ala Arg Gln Leu
                325                 330                 335     
Asp Leu Leu Thr Leu Glu Asn Ile Lys Gln Thr Glu Glu Ala Leu Asp
            340                 345                 350         
Asn His Arg Leu Pro Phe Pro Leu Leu Asp Ala Gly Thr Ile Lys Met
        355                 360                 365             
Val Lys Val Ile Gly Arg Arg Ser Leu Gly Val Gln Arg Ile Phe Asp
    370                 375                 380                 
Ile Gly Leu Pro Gln Asp His Asn Phe Leu Leu Ala Asn Gly Ala Ile
385                 390                 395                 400 
Ala Ala Ala Cys Ser Cys Gly Ser Gly Ser Arg Asp Gly Phe Cys Ile
                405                 410                 415     
Leu Tyr Leu Leu Leu Ile Leu Leu Met Arg Ser Gly Asp Val Glu Thr
            420                 425                 430         
Asn Pro Gly Pro Val Ser Lys Gly Glu Glu Asp Asn Met Ala Ile Ile
        435                 440                 445             
Lys Glu Phe Met Arg Phe Lys Val His Met Glu Gly Ser Val Asn Gly
    450                 455                 460                 
His Glu Phe Glu Ile Glu Gly Glu Gly Glu Gly Arg Pro Tyr Glu Gly
465                 470                 475                 480 
Thr Gln Thr Ala Lys Leu Lys Val Thr Lys Gly Gly Pro Leu Pro Phe
                485                 490                 495     
Ala Trp Asp Ile Leu Ser Pro Gln Phe Met Tyr Gly Ser Lys Ala Tyr
            500                 505                 510         
Val Lys His Pro Ala Asp Ile Pro Asp Tyr Leu Lys Leu Ser Phe Pro
        515                 520                 525             
Glu Gly Phe Lys Trp Glu Arg Val Met Asn Phe Glu Asp Gly Gly Val
    530                 535                 540                 
Val Thr Val Thr Gln Asp Ser Ser Leu Gln Asp Gly Glu Phe Ile Tyr
545                 550                 555                 560 
Lys Val Lys Leu Arg Gly Thr Asn Phe Pro Ser Asp Gly Pro Val Met
                565                 570                 575     
Gln Lys Lys Thr Met Gly Trp Glu Ala Ser Ser Glu Arg Met Tyr Pro
            580                 585                 590         
Glu Asp Gly Ala Leu Lys Gly Glu Ile Lys Gln Arg Leu Lys Leu Lys
        595                 600                 605             
Asp Gly Gly His Tyr Asp Ala Glu Val Lys Thr Thr Tyr Lys Ala Lys
    610                 615                 620                 
Lys Pro Val Gln Leu Pro Gly Ala Tyr Asn Val Asn Ile Lys Leu Asp
625                 630                 635                 640 
Ile Thr Ser His Asn Glu Asp Tyr Thr Ile Val Glu Gln Tyr Glu Arg
                645                 650                 655     
Ala Glu Gly Arg His Ser Thr Gly Gly Met Asp Glu Leu Tyr Lys Gly
            660                 665                 670         
Ser Gly Trp Ser His Pro Gln Phe Glu Lys
        675                 680         


<210> 14
<211> 2520
<212> DNA
<213> Artificial Sequence

<220> 
<223> Intein-Intein:  DI-2 (GFP-172(His)6-Ssp DnaE
      intein - Ssp DnaB intein-mCherry-Strep Tag

<400> 14
ggtaccgtcg accaaggaga tataacaatg agtaaaggag aagaactttt cactggagtt 60
gtcccaattc ttgttgaatt agatggtgat gttaatgggc acaaattttc tgtcagtgga 120
gagggtgaag gtgatgcaac atacggaaaa cttaccctta aatttatttg cactactgga 180
aaactacctg ttccttggcc aacacttgtc actactttca cttatggtgt tcaatgcttt 240
tcaagatacc cagatcatat gaagcggcac gacttcttca agagcgccat gcctgaggga 300
tacgtgcagg agaggaccat cttcttcaag gacgacggga actacaagac acgtgctgaa 360
gtcaagtttg agggagacac cctcgtcaac aggatcgagc ttaagggaat cgatttcaag 420
gaggacggaa acatcctcgg ccacaagttg gaatacaact acaactccca caacgtatac 480
atcatggccg acaagcaaaa gaacggcatc aaagccaact tcaagacccg ccacaacatc 540
gaacaccatc accatcacca tgacggcggc gtgcaactcg ctgatcatta tcaacaaaat 600
actccaattg gcgatggccc tgtcctttta ccagacaacc attacctgtc cacacaatct 660
gccctttcga aagatcccaa cgaaaagaga gaccacatgg tccttcttga gtttgtaaca 720
gctgctggga ttacacatgg catggatgaa ctatacaaac tcgagggagg atctaagttt 780
gcaaatgatt gtttgtcctt cggaactgag atacttacag ttgaatatgg accacttcct 840
attggaaaga ttgtgagtga agagatcaac tgcagtgttt attccgtgga tccagagggt 900
agagtttaca ctcaagcaat tgctcagtgg catgataggg gagaacagga ggttcttgaa 960
tatgagttgg aagatggttc tgtgataaga gctacatcag atcacaggtt tcttactaca 1020
gattaccaac ttttggcaat cgaagagatt ttcgctagac agctcgatct tctcactttg 1080
gaaaatatta agcaaacaga agaggcactt gataaccata ggcttccatt tcctcttttg 1140
gatgctggaa ctattaagat ggttaaagtg ataggaagaa ggtcattggg tgttcaaaga 1200
atatttgata tcggacttcc tcaggatcac aatttcttac tcgcaaacgg tgctattgct 1260
gcagcttgtt tcaatggttc tggttctaga gagtctggag ctatctctgg cgatagtctg 1320
atcagcctgg ctagcacagg aaaaagagtt tctattaaag atttgttaga tgaaaaagat 1380
tttgaaatat gggcaattaa tgaacagacg atgaagctag aatcagctaa agttagtcgt 1440
gtattttgta ctggcaaaaa gctagtttat attctaaaaa ctcgactagg tagaactatc 1500
aaggcaacag caaatcatag atttttaact attgatggtt ggaaaagatt agatgagcta 1560
tctttaaaag agcatattgc tctaccccgt aaactagaaa gctcctcttt acaattgtca 1620
ccagaaatag aaaagttgtc tcagagtgat atttactggg actccatcgt ttctattacg 1680
gagactggag tcgaagaggt ttttgatttg actgtgccag gaccacataa ctttgtcgcg 1740
aatgacatca ttgtacacaa cagccgcggg cccgtgagca agggcgagga ggataacatg 1800
gccatcatca aggagttcat gcgcttcaag gtgcacatgg agggctccgt gaacggccac 1860
gagttcgaga tcgagggcga gggcgagggc cgcccctacg agggcaccca gaccgccaag 1920
ctgaaggtga ccaagggtgg ccccctgccc ttcgcctggg acatcctgtc ccctcagttc 1980
atgtacggct ccaaggccta cgtgaagcac cccgccgaca tccccgacta cttgaagctg 2040
tccttccccg agggcttcaa gtgggagcgc gtgatgaact tcgaggacgg cggcgtggtg 2100
accgtgaccc aggactcctc cctgcaggac ggcgagttca tctacaaggt gaagctgcgc 2160
ggcaccaact tcccctccga cggccccgta atgcagaaga agaccatggg ctgggaggcc 2220
tcctccgagc ggatgtaccc cgaggacggc gccctgaagg gcgagatcaa gcagaggctg 2280
aagctgaagg acggcggcca ctacgacgct gaggtcaaga ccacctacaa ggccaagaag 2340
cccgtgcagc tgcccggcgc ctacaacgtc aacatcaagt tggacatcac ctcccacaac 2400
gaggactaca ccatcgtgga acagtacgaa cgcgccgagg gccgccactc caccggcggc 2460
atggacgagc tgtacaaggg ttctggatgg tcacatcctc agtttgaaaa atgagagctc 2520


<210> 15
<211> 828
<212> PRT
<213> Artificial Sequence

<220> 
<223> Intein-Intein:  DI-2 (GFP-172(His)6-Ssp DnaE
      intein - Ssp DnaB intein-mCherry-Strep Tag

<400> 15
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
 1               5                  10                  15      
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu
            20                  25                  30          
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys
        35                  40                  45              
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe
    50                  55                  60                  
Ser Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg
65                  70                  75                  80  
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
                85                  90                  95      
Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val
            100                 105                 110         
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
        115                 120                 125             
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
    130                 135                 140                 
Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
145                 150                 155                 160 
Ile Lys Ala Asn Phe Lys Thr Arg His Asn Ile Glu His His His His
                165                 170                 175     
His His Asp Gly Gly Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr
            180                 185                 190         
Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser
        195                 200                 205             
Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met
    210                 215                 220                 
Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr His Gly Met Asp
225                 230                 235                 240 
Glu Leu Tyr Lys Leu Glu Gly Gly Ser Lys Phe Ala Asn Asp Cys Leu
                245                 250                 255     
Ser Phe Gly Thr Glu Ile Leu Thr Val Glu Tyr Gly Pro Leu Pro Ile
            260                 265                 270         
Gly Lys Ile Val Ser Glu Glu Ile Asn Cys Ser Val Tyr Ser Val Asp
        275                 280                 285             
Pro Glu Gly Arg Val Tyr Thr Gln Ala Ile Ala Gln Trp His Asp Arg
    290                 295                 300                 
Gly Glu Gln Glu Val Leu Glu Tyr Glu Leu Glu Asp Gly Ser Val Ile
305                 310                 315                 320 
Arg Ala Thr Ser Asp His Arg Phe Leu Thr Thr Asp Tyr Gln Leu Leu
                325                 330                 335     
Ala Ile Glu Glu Ile Phe Ala Arg Gln Leu Asp Leu Leu Thr Leu Glu
            340                 345                 350         
Asn Ile Lys Gln Thr Glu Glu Ala Leu Asp Asn His Arg Leu Pro Phe
        355                 360                 365             
Pro Leu Leu Asp Ala Gly Thr Ile Lys Met Val Lys Val Ile Gly Arg
    370                 375                 380                 
Arg Ser Leu Gly Val Gln Arg Ile Phe Asp Ile Gly Leu Pro Gln Asp
385                 390                 395                 400 
His Asn Phe Leu Leu Ala Asn Gly Ala Ile Ala Ala Ala Cys Phe Asn
                405                 410                 415     
Gly Ser Gly Ser Arg Glu Ser Gly Ala Ile Ser Gly Asp Ser Leu Ile
            420                 425                 430         
Ser Leu Ala Ser Thr Gly Lys Arg Val Ser Ile Lys Asp Leu Leu Asp
        435                 440                 445             
Glu Lys Asp Phe Glu Ile Trp Ala Ile Asn Glu Gln Thr Met Lys Leu
    450                 455                 460                 
Glu Ser Ala Lys Val Ser Arg Val Phe Cys Thr Gly Lys Lys Leu Val
465                 470                 475                 480 
Tyr Ile Leu Lys Thr Arg Leu Gly Arg Thr Ile Lys Ala Thr Ala Asn
                485                 490                 495     
His Arg Phe Leu Thr Ile Asp Gly Trp Lys Arg Leu Asp Glu Leu Ser
            500                 505                 510         
Leu Lys Glu His Ile Ala Leu Pro Arg Lys Leu Glu Ser Ser Ser Leu
        515                 520                 525             
Gln Leu Ser Pro Glu Ile Glu Lys Leu Ser Gln Ser Asp Ile Tyr Trp
    530                 535                 540                 
Asp Ser Ile Val Ser Ile Thr Glu Thr Gly Val Glu Glu Val Phe Asp
545                 550                 555                 560 
Leu Thr Val Pro Gly Pro His Asn Phe Val Ala Asn Asp Ile Ile Val
                565                 570                 575     
His Asn Ser Arg Gly Pro Val Ser Lys Gly Glu Glu Asp Asn Met Ala
            580                 585                 590         
Ile Ile Lys Glu Phe Met Arg Phe Lys Val His Met Glu Gly Ser Val
        595                 600                 605             
Asn Gly His Glu Phe Glu Ile Glu Gly Glu Gly Glu Gly Arg Pro Tyr
    610                 615                 620                 
Glu Gly Thr Gln Thr Ala Lys Leu Lys Val Thr Lys Gly Gly Pro Leu
625                 630                 635                 640 
Pro Phe Ala Trp Asp Ile Leu Ser Pro Gln Phe Met Tyr Gly Ser Lys
                645                 650                 655     
Ala Tyr Val Lys His Pro Ala Asp Ile Pro Asp Tyr Leu Lys Leu Ser
            660                 665                 670         
Phe Pro Glu Gly Phe Lys Trp Glu Arg Val Met Asn Phe Glu Asp Gly
        675                 680                 685             
Gly Val Val Thr Val Thr Gln Asp Ser Ser Leu Gln Asp Gly Glu Phe
    690                 695                 700                 
Ile Tyr Lys Val Lys Leu Arg Gly Thr Asn Phe Pro Ser Asp Gly Pro
705                 710                 715                 720 
Val Met Gln Lys Lys Thr Met Gly Trp Glu Ala Ser Ser Glu Arg Met
                725                 730                 735     
Tyr Pro Glu Asp Gly Ala Leu Lys Gly Glu Ile Lys Gln Arg Leu Lys
            740                 745                 750         
Leu Lys Asp Gly Gly His Tyr Asp Ala Glu Val Lys Thr Thr Tyr Lys
        755                 760                 765             
Ala Lys Lys Pro Val Gln Leu Pro Gly Ala Tyr Asn Val Asn Ile Lys
    770                 775                 780                 
Leu Asp Ile Thr Ser His Asn Glu Asp Tyr Thr Ile Val Glu Gln Tyr
785                 790                 795                 800 
Glu Arg Ala Glu Gly Arg His Ser Thr Gly Gly Met Asp Glu Leu Tyr
                805                 810                 815     
Lys Gly Ser Gly Trp Ser His Pro Gln Phe Glu Lys
            820                 825             


<210> 16
<211> 2487
<212> DNA
<213> Artificial Sequence

<220> 
<223> DI-2C GFP-172(His)6-DnaE intein -DnaB
      intein-mCherry-Strep Tag)

<400> 16
ggtaccgtcg accaaggaga tataacaatg agtaaaggag aagaactttt cactggagtt 60
gtcccaattc ttgttgaatt agatggtgat gttaatgggc acaaattttc tgtcagtgga 120
gagggtgaag gtgatgcaac atacggaaaa cttaccctta aatttatttg cactactgga 180
aaactacctg ttccttggcc aacacttgtc actactttca cttatggtgt tcaatgcttt 240
tcaagatacc cagatcatat gaagcggcac gacttcttca agagcgccat gcctgaggga 300
tacgtgcagg agaggaccat cttcttcaag gacgacggga actacaagac acgtgctgaa 360
gtcaagtttg agggagacac cctcgtcaac aggatcgagc ttaagggaat cgatttcaag 420
gaggacggaa acatcctcgg ccacaagttg gaatacaact acaactccca caacgtatac 480
atcatggccg acaagcaaaa gaacggcatc aaagccaact tcaagacccg ccacaacatc 540
gaacaccatc accatcacca tgacggcggc gtgcaactcg ctgatcatta tcaacaaaat 600
actccaattg gcgatggccc tgtcctttta ccagacaacc attacctgtc cacacaatct 660
gccctttcga aagatcccaa cgaaaagaga gaccacatgg tccttcttga gtttgtaaca 720
gctgctggga ttacacatgg catggatgaa ctatacaaac tcgagtatgc attgtccttc 780
ggaactgaga tacttacagt tgaatatgga ccacttccta ttggaaagat tgtgagtgaa 840
gagatcaact gcagtgttta ttccgtggat ccagagggta gagtttacac tcaagcaatt 900
gctcagtggc atgatagggg agaacaggag gttcttgaat atgagttgga agatggttct 960
gtgataagag ctacatcaga tcacaggttt cttactacag attaccaact tttggcaatc 1020
gaagagattt tcgctagaca gctcgatctt ctcactttgg aaaatattaa gcaaacagaa 1080
gaggcacttg ataaccatag gcttccattt cctcttttgg atgctggaac tattaagatg 1140
gttaaagtga taggaagaag gtcattgggt gttcaaagaa tatttgatat cggacttcct 1200
caggatcaca atttcttact cgcaaacggt gctattgctg cagctggtgg ttctagagag 1260
tctggagcta tctctggcga tagtctgatc agcctggcta gcacaggaaa aagagtttct 1320
attaaagatt tgttagatga aaaagatttt gaaatatggg caattaatga acagacgatg 1380
aagctagaat cagctaaagt tagtcgtgta ttttgtactg gcaaaaagct agtttatatt 1440
ctaaaaactc gactaggtag aactatcaag gcaacagcaa atcatagatt tttaactatt 1500
gatggttgga aaagattaga tgagctatct ttaaaagagc atattgctct accccgtaaa 1560
ctagaaagct cctctttaca attgtcacca gaaatagaaa agttgtctca gagtgatatt 1620
tactgggact ccatcgtttc tattacggag actggagtcg aagaggtttt tgatttgact 1680
gtgccaggac cacataactt tgtcgcgaat gacatcattg tacacaacag ccgcgggccc 1740
gtgagcaagg gcgaggagga taacatggcc atcatcaagg agttcatgcg cttcaaggtg 1800
cacatggagg gctccgtgaa cggccacgag ttcgagatcg agggcgaggg cgagggccgc 1860
ccctacgagg gcacccagac cgccaagctg aaggtgacca agggtggccc cctgcccttc 1920
gcctgggaca tcctgtcccc tcagttcatg tacggctcca aggcctacgt gaagcacccc 1980
gccgacatcc ccgactactt gaagctgtcc ttccccgagg gcttcaagtg ggagcgcgtg 2040
atgaacttcg aggacggcgg cgtggtgacc gtgacccagg actcctccct gcaggacggc 2100
gagttcatct acaaggtgaa gctgcgcggc accaacttcc cctccgacgg ccccgtaatg 2160
cagaagaaga ccatgggctg ggaggcctcc tccgagcgga tgtaccccga ggacggcgcc 2220
ctgaagggcg agatcaagca gaggctgaag ctgaaggacg gcggccacta cgacgctgag 2280
gtcaagacca cctacaaggc caagaagccc gtgcagctgc ccggcgccta caacgtcaac 2340
atcaagttgg acatcacctc ccacaacgag gactacacca tcgtggaaca gtacgaacgc 2400
gccgagggcc gccactccac cggcggcatg gacgagctgt acaagggttc tggatggtca 2460
catcctcagt ttgaaaaatg agagctc                                     2487

<210> 17
<211> 817
<212> PRT
<213> Artificial Sequence

<220> 
<223> DI-2C GFP-172(His)6-DnaE intein -DnaB
      intein-mCherry-Strep Tag)

<400> 17
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
 1               5                  10                  15      
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu
            20                  25                  30          
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys
        35                  40                  45              
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe
    50                  55                  60                  
Ser Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg
65                  70                  75                  80  
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
                85                  90                  95      
Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val
            100                 105                 110         
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
        115                 120                 125             
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
    130                 135                 140                 
Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
145                 150                 155                 160 
Ile Lys Ala Asn Phe Lys Thr Arg His Asn Ile Glu His His His His
                165                 170                 175     
His His Asp Gly Gly Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr
            180                 185                 190         
Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser
        195                 200                 205             
Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met
    210                 215                 220                 
Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr His Gly Met Asp
225                 230                 235                 240 
Glu Leu Tyr Lys Leu Glu Tyr Ala Leu Ser Phe Gly Thr Glu Ile Leu
                245                 250                 255     
Thr Val Glu Tyr Gly Pro Leu Pro Ile Gly Lys Ile Val Ser Glu Glu
            260                 265                 270         
Ile Asn Cys Ser Val Tyr Ser Val Asp Pro Glu Gly Arg Val Tyr Thr
        275                 280                 285             
Gln Ala Ile Ala Gln Trp His Asp Arg Gly Glu Gln Glu Val Leu Glu
    290                 295                 300                 
Tyr Glu Leu Glu Asp Gly Ser Val Ile Arg Ala Thr Ser Asp His Arg
305                 310                 315                 320 
Phe Leu Thr Thr Asp Tyr Gln Leu Leu Ala Ile Glu Glu Ile Phe Ala
                325                 330                 335     
Arg Gln Leu Asp Leu Leu Thr Leu Glu Asn Ile Lys Gln Thr Glu Glu
            340                 345                 350         
Ala Leu Asp Asn His Arg Leu Pro Phe Pro Leu Leu Asp Ala Gly Thr
        355                 360                 365             
Ile Lys Met Val Lys Val Ile Gly Arg Arg Ser Leu Gly Val Gln Arg
    370                 375                 380                 
Ile Phe Asp Ile Gly Leu Pro Gln Asp His Asn Phe Leu Leu Ala Asn
385                 390                 395                 400 
Gly Ala Ile Ala Ala Ala Gly Gly Ser Arg Glu Ser Gly Ala Ile Ser
                405                 410                 415     
Gly Asp Ser Leu Ile Ser Leu Ala Ser Thr Gly Lys Arg Val Ser Ile
            420                 425                 430         
Lys Asp Leu Leu Asp Glu Lys Asp Phe Glu Ile Trp Ala Ile Asn Glu
        435                 440                 445             
Gln Thr Met Lys Leu Glu Ser Ala Lys Val Ser Arg Val Phe Cys Thr
    450                 455                 460                 
Gly Lys Lys Leu Val Tyr Ile Leu Lys Thr Arg Leu Gly Arg Thr Ile
465                 470                 475                 480 
Lys Ala Thr Ala Asn His Arg Phe Leu Thr Ile Asp Gly Trp Lys Arg
                485                 490                 495     
Leu Asp Glu Leu Ser Leu Lys Glu His Ile Ala Leu Pro Arg Lys Leu
            500                 505                 510         
Glu Ser Ser Ser Leu Gln Leu Ser Pro Glu Ile Glu Lys Leu Ser Gln
        515                 520                 525             
Ser Asp Ile Tyr Trp Asp Ser Ile Val Ser Ile Thr Glu Thr Gly Val
    530                 535                 540                 
Glu Glu Val Phe Asp Leu Thr Val Pro Gly Pro His Asn Phe Val Ala
545                 550                 555                 560 
Asn Asp Ile Ile Val His Asn Ser Arg Gly Pro Val Ser Lys Gly Glu
                565                 570                 575     
Glu Asp Asn Met Ala Ile Ile Lys Glu Phe Met Arg Phe Lys Val His
            580                 585                 590         
Met Glu Gly Ser Val Asn Gly His Glu Phe Glu Ile Glu Gly Glu Gly
        595                 600                 605             
Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr Ala Lys Leu Lys Val Thr
    610                 615                 620                 
Lys Gly Gly Pro Leu Pro Phe Ala Trp Asp Ile Leu Ser Pro Gln Phe
625                 630                 635                 640 
Met Tyr Gly Ser Lys Ala Tyr Val Lys His Pro Ala Asp Ile Pro Asp
                645                 650                 655     
Tyr Leu Lys Leu Ser Phe Pro Glu Gly Phe Lys Trp Glu Arg Val Met
            660                 665                 670         
Asn Phe Glu Asp Gly Gly Val Val Thr Val Thr Gln Asp Ser Ser Leu
        675                 680                 685             
Gln Asp Gly Glu Phe Ile Tyr Lys Val Lys Leu Arg Gly Thr Asn Phe
    690                 695                 700                 
Pro Ser Asp Gly Pro Val Met Gln Lys Lys Thr Met Gly Trp Glu Ala
705                 710                 715                 720 
Ser Ser Glu Arg Met Tyr Pro Glu Asp Gly Ala Leu Lys Gly Glu Ile
                725                 730                 735     
Lys Gln Arg Leu Lys Leu Lys Asp Gly Gly His Tyr Asp Ala Glu Val
            740                 745                 750         
Lys Thr Thr Tyr Lys Ala Lys Lys Pro Val Gln Leu Pro Gly Ala Tyr
        755                 760                 765             
Asn Val Asn Ile Lys Leu Asp Ile Thr Ser His Asn Glu Asp Tyr Thr
    770                 775                 780                 
Ile Val Glu Gln Tyr Glu Arg Ala Glu Gly Arg His Ser Thr Gly Gly
785                 790                 795                 800 
Met Asp Glu Leu Tyr Lys Gly Ser Gly Trp Ser His Pro Gln Phe Glu
                805                 810                 815     
Lys
    


<210> 18
<211> 2268
<212> DNA
<213> Artificial Sequence

<220> 
<223> Intein 70: (pE1775-SP-mGFP172-DnaE Intein-2A
      ver.3-mCherry-streptag)

<400> 18
gtcgaccaag gagatataac aatgaagact aatctttttc tctttctcat cttttcactt 60
ctcctatcat tatcctcggc cgaattcagt aaaggagaag aacttttcac tggagttgtc 120
ccaattcttg ttgaattaga tggtgatgtt aatgggcaca aattttctgt cagtggagag 180
ggtgaaggtg atgcaacata cggaaaactt acccttaaat ttatttgcac tactggaaaa 240
ctacctgttc cttggccaac acttgtcact actttcactt atggtgttca atgcttttca 300
agatacccag atcatatgaa gcggcacgac ttcttcaaga gcgccatgcc tgagggatac 360
gtgcaggaga ggaccatctt cttcaaggac gacgggaact acaagacacg tgctgaagtc 420
aagtttgagg gagacaccct cgtcaacagg atcgagctta agggaatcga tttcaaggag 480
gacggaaaca tcctcggcca caagttggaa tacaactaca actcccacaa cgtatacatc 540
atggccgaca agcaaaagaa cggcatcaaa gccaacttca agacccgcca caacatcgaa 600
caccatcacc atcaccatga cggcggcgtg caactcgctg atcattatca acaaaatact 660
ccaattggcg atggccctgt ccttttacca gacaaccatt acctgtccac acaatctgcc 720
ctttcgaaag atcccaacga aaagagagac cacatggtcc ttcttgagtt tgtaacagct 780
gctgggatta cacatggcat ggatgaacta tacaaactcg agggaggatc taagtttgca 840
aatgattgtt tgtccttcgg aactgagata cttacagttg aatatggacc acttcctatt 900
ggaaagattg tgagtgaaga gatcaactgc agtgtttatt ccgtggatcc agagggtaga 960
gtttacactc aagcaattgc tcagtggcat gataggggag aacaggaggt tcttgaatat 1020
gagttggaag atggttctgt gataagagct acatcagatc acaggtttct tactacagat 1080
taccaacttt tggcaatcga agagattttc gctagacagc tcgatcttct cactttggaa 1140
aatattaagc aaacagaaga ggcacttgat aaccataggc ttccatttcc tcttttggat 1200
gctggaacta ttaagatggt taaagtgata ggaagaaggt cattgggtgt tcaaagaata 1260
tttgatatcg gacttcctca ggatcacaat ttcttactcg caaacggtgc tattgctgca 1320
gcttgtttca atggttctgg ttctagagtt actgagcttt tgtataggat gaagagggca 1380
gaaacatact gcccaagacc tttactcgca atccatccaa cagaggctag gcacaagcaa 1440
aaaattgttg ctcctgtgaa acagcttttg aactttgatc ttctcaagct tgcgggagac 1500
gtcgagtcca accctgggcc cgtgagcaag ggcgaggagg ataacatggc catcatcaag 1560
gagttcatgc gcttcaaggt gcacatggag ggctccgtga acggccacga gttcgagatc 1620
gagggcgagg gcgagggccg cccctacgag ggcacccaga ccgccaagct gaaggtgacc 1680
aagggtggcc ccctgccctt cgcctgggac atcctgtccc ctcagttcat gtacggctcc 1740
aaggcctacg tgaagcaccc cgccgacatc cccgactact tgaagctgtc cttccccgag 1800
ggcttcaagt gggagcgcgt gatgaacttc gaggacggcg gcgtggtgac cgtgaccaag 1860
accatgggct gggaggcctc ctccgagcgg atgtaccccg aggacggcgc cctgaagggc 1920
gagatcaagc agaggctgaa gctgaagcag gactcctccc tgcaggacgg cgagttcatc 1980
tacaaggtga agctgcgcgg caccaacttc ccctccgacg gccccgtaat gcagaaggac 2040
ggcggccact acgacgctga ggtcaagacc acctacaagg ccaagaagcc cgtgcagctg 2100
cccggcgcct acaacgtcaa catcaagttg gacatcacct cccacaacga ggactacacc 2160
atcgtggaac agtacgaacg cgccgagggc cgccactcca ccggcggcat ggacgagctg 2220
tacaagggtt ctggatggtc acatcctcag tttgaaaaat gagagctc              2268

<210> 19
<211> 746
<212> PRT
<213> Artificial Sequence

<220> 
<223> Intein 70: (pE1775-SP-mGFP172-DnaE Intein-2A
      ver.3-mCherry-streptag)

<400> 19
Met Lys Thr Asn Leu Phe Leu Phe Leu Ile Phe Ser Leu Leu Leu Ser
 1               5                  10                  15      
Leu Ser Ser Ala Glu Phe Ser Lys Gly Glu Glu Leu Phe Thr Gly Val
            20                  25                  30          
Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe
        35                  40                  45              
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr
    50                  55                  60                  
Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr
65                  70                  75                  80  
Leu Val Thr Thr Phe Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro
                85                  90                  95      
Asp His Met Lys Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly
            100                 105                 110         
Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys
        115                 120                 125             
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile
    130                 135                 140                 
Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His
145                 150                 155                 160 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp
                165                 170                 175     
Lys Gln Lys Asn Gly Ile Lys Ala Asn Phe Lys Thr Arg His Asn Ile
            180                 185                 190         
Glu His His His His His His Asp Gly Gly Val Gln Leu Ala Asp His
        195                 200                 205             
Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp
    210                 215                 220                 
Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu
225                 230                 235                 240 
Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile
                245                 250                 255     
Thr His Gly Met Asp Glu Leu Tyr Lys Leu Glu Gly Gly Ser Lys Phe
            260                 265                 270         
Ala Asn Asp Cys Leu Ser Phe Gly Thr Glu Ile Leu Thr Val Glu Tyr
        275                 280                 285             
Gly Pro Leu Pro Ile Gly Lys Ile Val Ser Glu Glu Ile Asn Cys Ser
    290                 295                 300                 
Val Tyr Ser Val Asp Pro Glu Gly Arg Val Tyr Thr Gln Ala Ile Ala
305                 310                 315                 320 
Gln Trp His Asp Arg Gly Glu Gln Glu Val Leu Glu Tyr Glu Leu Glu
                325                 330                 335     
Asp Gly Ser Val Ile Arg Ala Thr Ser Asp His Arg Phe Leu Thr Thr
            340                 345                 350         
Asp Tyr Gln Leu Leu Ala Ile Glu Glu Ile Phe Ala Arg Gln Leu Asp
        355                 360                 365             
Leu Leu Thr Leu Glu Asn Ile Lys Gln Thr Glu Glu Ala Leu Asp Asn
    370                 375                 380                 
His Arg Leu Pro Phe Pro Leu Leu Asp Ala Gly Thr Ile Lys Met Val
385                 390                 395                 400 
Lys Val Ile Gly Arg Arg Ser Leu Gly Val Gln Arg Ile Phe Asp Ile
                405                 410                 415     
Gly Leu Pro Gln Asp His Asn Phe Leu Leu Ala Asn Gly Ala Ile Ala
            420                 425                 430         
Ala Ala Cys Phe Asn Gly Ser Gly Ser Arg Val Thr Glu Leu Leu Tyr
        435                 440                 445             
Arg Met Lys Arg Ala Glu Thr Tyr Cys Pro Arg Pro Leu Leu Ala Ile
    450                 455                 460                 
His Pro Thr Glu Ala Arg His Lys Gln Lys Ile Val Ala Pro Val Lys
465                 470                 475                 480 
Gln Leu Leu Asn Phe Asp Leu Leu Lys Leu Ala Gly Asp Val Glu Ser
                485                 490                 495     
Asn Pro Gly Pro Val Ser Lys Gly Glu Glu Asp Asn Met Ala Ile Ile
            500                 505                 510         
Lys Glu Phe Met Arg Phe Lys Val His Met Glu Gly Ser Val Asn Gly
        515                 520                 525             
His Glu Phe Glu Ile Glu Gly Glu Gly Glu Gly Arg Pro Tyr Glu Gly
    530                 535                 540                 
Thr Gln Thr Ala Lys Leu Lys Val Thr Lys Gly Gly Pro Leu Pro Phe
545                 550                 555                 560 
Ala Trp Asp Ile Leu Ser Pro Gln Phe Met Tyr Gly Ser Lys Ala Tyr
                565                 570                 575     
Val Lys His Pro Ala Asp Ile Pro Asp Tyr Leu Lys Leu Ser Phe Pro
            580                 585                 590         
Glu Gly Phe Lys Trp Glu Arg Val Met Asn Phe Glu Asp Gly Gly Val
        595                 600                 605             
Val Thr Val Thr Lys Thr Met Gly Trp Glu Ala Ser Ser Glu Arg Met
    610                 615                 620                 
Tyr Pro Glu Asp Gly Ala Leu Lys Gly Glu Ile Lys Gln Arg Leu Lys
625                 630                 635                 640 
Leu Lys Gln Asp Ser Ser Leu Gln Asp Gly Glu Phe Ile Tyr Lys Val
                645                 650                 655     
Lys Leu Arg Gly Thr Asn Phe Pro Ser Asp Gly Pro Val Met Gln Lys
            660                 665                 670         
Asp Gly Gly His Tyr Asp Ala Glu Val Lys Thr Thr Tyr Lys Ala Lys
        675                 680                 685             
Lys Pro Val Gln Leu Pro Gly Ala Tyr Asn Val Asn Ile Lys Leu Asp
    690                 695                 700                 
Ile Thr Ser His Asn Glu Asp Tyr Thr Ile Val Glu Gln Tyr Glu Arg
705                 710                 715                 720 
Ala Glu Gly Arg His Ser Thr Gly Gly Met Asp Glu Leu Tyr Lys Gly
                725                 730                 735     
Ser Gly Trp Ser His Pro Gln Phe Glu Lys
            740                 745     


<210> 20
<211> 2319
<212> DNA
<213> Artificial Sequence

<220> 
<223> Intein-2A-UBQ-1 (GFP172-DnaE intein w/o
      overhang-sea urchin 2A-Arabidopsis
      UBQ11-mCherry-streptag)

<400> 20
ggtaccgtcg accaaggaga tataacaatg agtaaaggag aagaactttt cactggagtt 60
gtcccaattc ttgttgaatt agatggtgat gttaatgggc acaaattttc tgtcagtgga 120
gagggtgaag gtgatgcaac atacggaaaa cttaccctta aatttatttg cactactgga 180
aaactacctg ttccttggcc aacacttgtc actactttca cttatggtgt tcaatgcttt 240
tcaagatacc cagatcatat gaagcggcac gacttcttca agagcgccat gcctgaggga 300
tacgtgcagg agaggaccat cttcttcaag gacgacggga actacaagac acgtgctgaa 360
gtcaagtttg agggagacac cctcgtcaac aggatcgagc ttaagggaat cgatttcaag 420
gaggacggaa acatcctcgg ccacaagttg gaatacaact acaactccca caacgtatac 480
atcatggccg acaagcaaaa gaacggcatc aaagccaact tcaagacccg ccacaacatc 540
gaacaccatc accatcacca tgacggcggc gtgcaactcg ctgatcatta tcaacaaaat 600
actccaattg gcgatggccc tgtcctttta ccagacaacc attacctgtc cacacaatct 660
gccctttcga aagatcccaa cgaaaagaga gaccacatgg tccttcttga gtttgtaaca 720
gctgctggga ttacacatgg catggatgaa ctatacaaat gtttgtcctt cggaactgag 780
atacttacag ttgaatatgg accacttcct attggaaaga ttgtgagtga agagatcaac 840
tgcagtgttt attccgtgga tccagagggt agagtttaca ctcaagcaat tgctcagtgg 900
catgataggg gagaacagga ggttcttgaa tatgagttgg aagatggttc tgtgataaga 960
gctacatcag atcacaggtt tcttactaca gattaccaac ttttggcaat cgaagagatt 1020
ttcgctagac agctcgatct tctcactttg gaaaatatta agcaaacaga agaggcactt 1080
gataaccata ggcttccatt tcctcttttg gatgctggaa ctattaagat ggttaaagtg 1140
ataggaagaa ggtcattggg tgttcaaaga atatttgata tcggacttcc tcaggatcac 1200
aatttcttac tcgcaaacgg tgctattgct gcagcttgtt cttgtggttc tggttctaga 1260
ggatctggcg atggattctg cattctctat ctgctcctga tcctcttgat gaggtctggt 1320
gacgttgaaa ccaaccctgg gcccatgcag atcttcgtaa agactttgac cggaaagacc 1380
atcactcttg aagttgaaag ctccgacacc attgataacg tgaaggctaa gatccaggac 1440
aaggaaggca ttcctccgga ccagcagcgt ctcatcttcg ctggaaggca gcttgaggat 1500
ggacgtactt tggccgacta caacatccag aaggagtcca ctcttcactt ggtcctccgt 1560
ctccgcggcg gtgtgagcaa gggcgaggag gataacatgg ccatcatcaa ggagttcatg 1620
cgcttcaagg tgcacatgga gggctccgtg aacggccacg agttcgagat cgagggcgag 1680
ggcgagggcc gcccctacga gggcacccag accgccaagc tgaaggtgac caagggtggc 1740
cccctgccct tcgcctggga catcctgtcc cctcagttca tgtacggctc caaggcctac 1800
gtgaagcacc ccgccgacat ccccgactac ttgaagctgt ccttccccga gggcttcaag 1860
tgggagcgcg tgatgaactt cgaggacggc ggcgtggtga ccgtgaccca ggactcctcc 1920
ctgcaggacg gcgagttcat ctacaaggtg aagctgcgcg gcaccaactt cccctccgac 1980
ggccccgtaa tgcagaagaa gaccatgggc tgggaggcct cctccgagcg gatgtacccc 2040
gaggacggcg ccctgaaggg cgagatcaag cagaggctga agctgaagga cggcggccac 2100
tacgacgctg aggtcaagac cacctacaag gccaagaagc ccgtgcagct gcccggcgcc 2160
tacaacgtca acatcaagtt ggacatcacc tcccacaacg aggactacac catcgtggaa 2220
cagtacgaac gcgccgaggg ccgccactcc accggcggca tggacgagct gtacaagggt 2280
tctggatggt cacatcctca gtttgaaaaa tgagagctc                        2319

<210> 21
<211> 761
<212> PRT
<213> Artificial Sequence

<220> 
<223> Intein-2A-UBQ-1 (GFP172-DnaE intein w/o
      overhang-sea urchin 2A-Arabidopsis
      UBQ11-mCherry-streptag)

<400> 21
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
 1               5                  10                  15      
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu
            20                  25                  30          
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys
        35                  40                  45              
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe
    50                  55                  60                  
Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg
65                  70                  75                  80  
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
                85                  90                  95      
Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val
            100                 105                 110         
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
        115                 120                 125             
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
    130                 135                 140                 
Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
145                 150                 155                 160 
Ile Lys Ala Asn Phe Lys Thr Arg His Asn Ile Glu His His His His
                165                 170                 175     
His His Asp Gly Gly Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr
            180                 185                 190         
Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser
        195                 200                 205             
Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met
    210                 215                 220                 
Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr His Gly Met Asp
225                 230                 235                 240 
Glu Leu Tyr Lys Cys Leu Ser Phe Gly Thr Glu Ile Leu Thr Val Glu
                245                 250                 255     
Tyr Gly Pro Leu Pro Ile Gly Lys Ile Val Ser Glu Glu Ile Asn Cys
            260                 265                 270         
Ser Val Tyr Ser Val Asp Pro Glu Gly Arg Val Tyr Thr Gln Ala Ile
        275                 280                 285             
Ala Gln Trp His Asp Arg Gly Glu Gln Glu Val Leu Glu Tyr Glu Leu
    290                 295                 300                 
Glu Asp Gly Ser Val Ile Arg Ala Thr Ser Asp His Arg Phe Leu Thr
305                 310                 315                 320 
Thr Asp Tyr Gln Leu Leu Ala Ile Glu Glu Ile Phe Ala Arg Gln Leu
                325                 330                 335     
Asp Leu Leu Thr Leu Glu Asn Ile Lys Gln Thr Glu Glu Ala Leu Asp
            340                 345                 350         
Asn His Arg Leu Pro Phe Pro Leu Leu Asp Ala Gly Thr Ile Lys Met
        355                 360                 365             
Val Lys Val Ile Gly Arg Arg Ser Leu Gly Val Gln Arg Ile Phe Asp
    370                 375                 380                 
Ile Gly Leu Pro Gln Asp His Asn Phe Leu Leu Ala Asn Gly Ala Ile
385                 390                 395                 400 
Ala Ala Ala Cys Ser Cys Gly Ser Gly Ser Arg Gly Ser Gly Asp Gly
                405                 410                 415     
Phe Cys Ile Leu Tyr Leu Leu Leu Ile Leu Leu Met Arg Ser Gly Asp
            420                 425                 430         
Val Glu Thr Asn Pro Gly Pro Met Gln Ile Phe Val Lys Thr Leu Thr
        435                 440                 445             
Gly Lys Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr Ile Asp Asn
    450                 455                 460                 
Val Lys Ala Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln
465                 470                 475                 480 
Arg Leu Ile Phe Ala Gly Arg Gln Leu Glu Asp Gly Arg Thr Leu Ala
                485                 490                 495     
Asp Tyr Asn Ile Gln Lys Glu Ser Thr Leu His Leu Val Leu Arg Leu
            500                 505                 510         
Arg Gly Gly Val Ser Lys Gly Glu Glu Asp Asn Met Ala Ile Ile Lys
        515                 520                 525             
Glu Phe Met Arg Phe Lys Val His Met Glu Gly Ser Val Asn Gly His
    530                 535                 540                 
Glu Phe Glu Ile Glu Gly Glu Gly Glu Gly Arg Pro Tyr Glu Gly Thr
545                 550                 555                 560 
Gln Thr Ala Lys Leu Lys Val Thr Lys Gly Gly Pro Leu Pro Phe Ala
                565                 570                 575     
Trp Asp Ile Leu Ser Pro Gln Phe Met Tyr Gly Ser Lys Ala Tyr Val
            580                 585                 590         
Lys His Pro Ala Asp Ile Pro Asp Tyr Leu Lys Leu Ser Phe Pro Glu
        595                 600                 605             
Gly Phe Lys Trp Glu Arg Val Met Asn Phe Glu Asp Gly Gly Val Val
    610                 615                 620                 
Thr Val Thr Gln Asp Ser Ser Leu Gln Asp Gly Glu Phe Ile Tyr Lys
625                 630                 635                 640 
Val Lys Leu Arg Gly Thr Asn Phe Pro Ser Asp Gly Pro Val Met Gln
                645                 650                 655     
Lys Lys Thr Met Gly Trp Glu Ala Ser Ser Glu Arg Met Tyr Pro Glu
            660                 665                 670         
Asp Gly Ala Leu Lys Gly Glu Ile Lys Gln Arg Leu Lys Leu Lys Asp
        675                 680                 685             
Gly Gly His Tyr Asp Ala Glu Val Lys Thr Thr Tyr Lys Ala Lys Lys
    690                 695                 700                 
Pro Val Gln Leu Pro Gly Ala Tyr Asn Val Asn Ile Lys Leu Asp Ile
705                 710                 715                 720 
Thr Ser His Asn Glu Asp Tyr Thr Ile Val Glu Gln Tyr Glu Arg Ala
                725                 730                 735     
Glu Gly Arg His Ser Thr Gly Gly Met Asp Glu Leu Tyr Lys Gly Ser
            740                 745                 750         
Gly Trp Ser His Pro Gln Phe Glu Lys
        755                 760     


<210> 22
<211> 4
<212> PRT
<213> Artificial Sequence

<220> 
<223> Furin target site

<220> 
<221> VARIANT        
<222> (2)...(3)
<223> Xaa = any amino acid

<400> 22
Arg Xaa Xaa Arg
 1              


<210> 23
<211> 4
<212> PRT
<213> Artificial Sequence

<220> 
<223> Furin target site

<220> 
<221> VARIANT        
<222> 2
<223> Xaa = any amino acid residue

<220> 
<221> VARIANT        
<222> 3
<223> Xaa = Lys or Arg

<400> 23
Arg Xaa Xaa Arg
 1              


<210> 24
<211> 97
<212> PRT
<213> Homo sapiens

<400> 24
Glu Ala Lys Pro Ser Thr Glu Asp Leu Gly Asp Lys Lys Glu Gly Glu
 1               5                  10                  15      
Tyr Ile Lys Leu Lys Val Ile Gly Gln Asp Ser Ser Glu Ile His Phe
            20                  25                  30          
Lys Val Lys Met Thr Thr His Leu Lys Lys Leu Lys Glu Ser Tyr Cys
        35                  40                  45              
Gln Arg Gln Gly Val Pro Met Asn Ser Leu Arg Phe Leu Phe Glu Gly
    50                  55                  60                  
Gln Arg Ile Ala Asp Asn His Thr Pro Lys Glu Leu Gly Met Glu Glu
65                  70                  75                  80  
Glu Asp Val Ile Glu Val Tyr Gln Glu Gln Thr Gly Gly His Ser Thr
                85                  90                  95      
Val
    


<210> 25
<211> 303
<212> DNA
<213> Homo sapiens

<400> 25
atgtctgacc aggaggcaaa accttcaact gaggacttgg gggataagaa ggaaggtgaa 60
tatattaaac tcaaagtcat tggacaggat agcagtgaga ttcacttcaa agtgaaaatg 120
acaacacatc tcaagaaact caaagaatca tactgtcaaa gacagggtgt tccaatgaat 180
tcactcaggt ttctctttga gggtcagaga attgctgata atcatactcc aaaagaactg 240
ggaatggagg aagaagatgt gattgaagtt tatcaggaac aaacgggggg tcattcaaca 300
gtt                                                               303

<210> 26
<211> 2304
<212> DNA
<213> Artificial Sequence

<220> 
<223> Ssp DnaE intein-2A-SUMO

<400> 26
ggtaccgtcg accaaggaga tataacaatg agtaaaggag aagaactttt cactggagtt 60
gtcccaattc ttgttgaatt agatggtgat gttaatgggc acaaattttc tgtcagtgga 120
gagggtgaag gtgatgcaac atacggaaaa cttaccctta aatttatttg cactactgga 180
aaactacctg ttccttggcc aacacttgtc actactttca cttatggtgt tcaatgcttt 240
tcaagatacc cagatcatat gaagcggcac gacttcttca agagcgccat gcctgaggga 300
tacgtgcagg agaggaccat cttcttcaag gacgacggga actacaagac acgtgctgaa 360
gtcaagtttg agggagacac cctcgtcaac aggatcgagc ttaagggaat cgatttcaag 420
gaggacggaa acatcctcgg ccacaagttg gaatacaact acaactccca caacgtatac 480
atcatggccg acaagcaaaa gaacggcatc aaagccaact tcaagacccg ccacaacatc 540
gaacaccatc accatcacca tgacggcggc gtgcaactcg ctgatcatta tcaacaaaat 600
actccaattg gcgatggccc tgtcctttta ccagacaacc attacctgtc cacacaatct 660
gccctttcga aagatcccaa cgaaaagaga gaccacatgg tccttcttga gtttgtaaca 720
gctgctggga ttacacatgg catggatgaa ctatacaaat gtttgtcctt cggaactgag 780
atacttacag ttgaatatgg accacttcct attggaaaga ttgtgagtga agagatcaac 840
tgcagtgttt attccgtgga tccagagggt agagtttaca ctcaagcaat tgctcagtgg 900
catgataggg gagaacagga ggttcttgaa tatgagttgg aagatggttc tgtgataaga 960
gctacatcag atcacaggtt tcttactaca gattaccaac ttttggcaat cgaagagatt 1020
ttcgctagac agctcgatct tctcactttg gaaaatatta agcaaacaga agaggcactt 1080
gataaccata ggcttccatt tcctcttttg gatgctggaa ctattaagat ggttaaagtg 1140
ataggaagaa ggtcattggg tgttcaaaga atatttgata tcggacttcc tcaggatcac 1200
aatttcttac tcgcaaacgg tgctattgct gcagcttgtt cttgtggttc tggtatgtct 1260
gaccaggagg caaaaccttc aactgaggac ttgggggata agaaggaagg tgaatatatt 1320
aaactcaaag tcattggaca ggatagcagt gagattcact tcaaagtgaa aatgacaaca 1380
catctcaaga aactcaaaga atcatactgt caaagacagg gtgttccaat gaattcactc 1440
aggtttctct ttgagggtca gagaattgct gataatcata ctccaaaaga actgggaatg 1500
gaggaagaag atgtgattga agtttatcag gaacaaacgg ggggtcattc aacagttgtg 1560
agcaagggcg aggaggataa catggccatc atcaaggagt tcatgcgctt caaggtgcac 1620
atggagggct ccgtgaacgg ccacgagttc gagatcgagg gcgagggcga gggccgcccc 1680
tacgagggca cccagaccgc caagctgaag gtgaccaagg gtggccccct gcccttcgcc 1740
tgggacatcc tgtcccctca gttcatgtac ggctccaagg cctacgtgaa gcaccccgcc 1800
gacatccccg actacttgaa gctgtccttc cccgagggct tcaagtggga gcgcgtgatg 1860
aacttcgagg acggcggcgt ggtgaccgtg acccaggact cctccctgca ggacggcgag 1920
ttcatctaca aggtgaagct gcgcggcacc aacttcccct ccgacggccc cgtaatgcag 1980
aagaagacca tgggctggga ggcctcctcc gagcggatgt accccgagga cggcgccctg 2040
aagggcgaga tcaagcagag gctgaagctg aaggacggcg gccactacga cgctgaggtc 2100
aagaccacct acaaggccaa gaagcccgtg cagctgcccg gcgcctacaa cgtcaacatc 2160
aagttggaca tcacctccca caacgaggac tacaccatcg tggaacagta cgaacgcgcc 2220
gagggccgcc actccaccgg cggcatggac gagctgtaca agggttctgg atggtcacat 2280
cctcagtttg aaaaatgaga gctc                                        2304

<210> 27
<211> 756
<212> PRT
<213> Artificial Sequence

<220> 
<223> Ssp DnaE intein-2A-SUMO

<400> 27
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
 1               5                  10                  15      
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu
            20                  25                  30          
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys
        35                  40                  45              
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe
    50                  55                  60                  
Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg
65                  70                  75                  80  
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
                85                  90                  95      
Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val
            100                 105                 110         
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
        115                 120                 125             
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
    130                 135                 140                 
Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
145                 150                 155                 160 
Ile Lys Ala Asn Phe Lys Thr Arg His Asn Ile Glu His His His His
                165                 170                 175     
His His Asp Gly Gly Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr
            180                 185                 190         
Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser
        195                 200                 205             
Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met
    210                 215                 220                 
Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr His Gly Met Asp
225                 230                 235                 240 
Glu Leu Tyr Lys Cys Leu Ser Phe Gly Thr Glu Ile Leu Thr Val Glu
                245                 250                 255     
Tyr Gly Pro Leu Pro Ile Gly Lys Ile Val Ser Glu Glu Ile Asn Cys
            260                 265                 270         
Ser Val Tyr Ser Val Asp Pro Glu Gly Arg Val Tyr Thr Gln Ala Ile
        275                 280                 285             
Ala Gln Trp His Asp Arg Gly Glu Gln Glu Val Leu Glu Tyr Glu Leu
    290                 295                 300                 
Glu Asp Gly Ser Val Ile Arg Ala Thr Ser Asp His Arg Phe Leu Thr
305                 310                 315                 320 
Thr Asp Tyr Gln Leu Leu Ala Ile Glu Glu Ile Phe Ala Arg Gln Leu
                325                 330                 335     
Asp Leu Leu Thr Leu Glu Asn Ile Lys Gln Thr Glu Glu Ala Leu Asp
            340                 345                 350         
Asn His Arg Leu Pro Phe Pro Leu Leu Asp Ala Gly Thr Ile Lys Met
        355                 360                 365             
Val Lys Val Ile Gly Arg Arg Ser Leu Gly Val Gln Arg Ile Phe Asp
    370                 375                 380                 
Ile Gly Leu Pro Gln Asp His Asn Phe Leu Leu Ala Asn Gly Ala Ile
385                 390                 395                 400 
Ala Ala Ala Cys Ser Cys Gly Ser Gly Met Ser Asp Gln Glu Ala Lys
                405                 410                 415     
Pro Ser Thr Glu Asp Leu Gly Asp Lys Lys Glu Gly Glu Tyr Ile Lys
            420                 425                 430         
Leu Lys Val Ile Gly Gln Asp Ser Ser Glu Ile His Phe Lys Val Lys
        435                 440                 445             
Met Thr Thr His Leu Lys Lys Leu Lys Glu Ser Tyr Cys Gln Arg Gln
    450                 455                 460                 
Gly Val Pro Met Asn Ser Leu Arg Phe Leu Phe Glu Gly Gln Arg Ile
465                 470                 475                 480 
Ala Asp Asn His Thr Pro Lys Glu Leu Gly Met Glu Glu Glu Asp Val
                485                 490                 495     
Ile Glu Val Tyr Gln Glu Gln Thr Gly Gly His Ser Thr Val Val Ser
            500                 505                 510         
Lys Gly Glu Glu Asp Asn Met Ala Ile Ile Lys Glu Phe Met Arg Phe
        515                 520                 525             
Lys Val His Met Glu Gly Ser Val Asn Gly His Glu Phe Glu Ile Glu
    530                 535                 540                 
Gly Glu Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr Ala Lys Leu
545                 550                 555                 560 
Lys Val Thr Lys Gly Gly Pro Leu Pro Phe Ala Trp Asp Ile Leu Ser
                565                 570                 575     
Pro Gln Phe Met Tyr Gly Ser Lys Ala Tyr Val Lys His Pro Ala Asp
            580                 585                 590         
Ile Pro Asp Tyr Leu Lys Leu Ser Phe Pro Glu Gly Phe Lys Trp Glu
        595                 600                 605             
Arg Val Met Asn Phe Glu Asp Gly Gly Val Val Thr Val Thr Gln Asp
    610                 615                 620                 
Ser Ser Leu Gln Asp Gly Glu Phe Ile Tyr Lys Val Lys Leu Arg Gly
625                 630                 635                 640 
Thr Asn Phe Pro Ser Asp Gly Pro Val Met Gln Lys Lys Thr Met Gly
                645                 650                 655     
Trp Glu Ala Ser Ser Glu Arg Met Tyr Pro Glu Asp Gly Ala Leu Lys
            660                 665                 670         
Gly Glu Ile Lys Gln Arg Leu Lys Leu Lys Asp Gly Gly His Tyr Asp
        675                 680                 685             
Ala Glu Val Lys Thr Thr Tyr Lys Ala Lys Lys Pro Val Gln Leu Pro
    690                 695                 700                 
Gly Ala Tyr Asn Val Asn Ile Lys Leu Asp Ile Thr Ser His Asn Glu
705                 710                 715                 720 
Asp Tyr Thr Ile Val Glu Gln Tyr Glu Arg Ala Glu Gly Arg His Ser
                725                 730                 735     
Thr Gly Gly Met Asp Glu Leu Tyr Lys Gly Ser Gly Trp Ser His Pro
            740                 745                 750         
Gln Phe Glu Lys
        755     


<210> 28
<211> 159
<212> PRT
<213> Artificial Sequence

<220> 
<223> Ssp modified DnaE intein

<400> 28
Cys Leu Ser Phe Gly Thr Glu Ile Leu Thr Val Glu Tyr Gly Pro Leu
 1               5                  10                  15      
Pro Ile Gly Lys Ile Val Ser Glu Glu Ile Asn Cys Ser Val Tyr Ser
            20                  25                  30          
Val Asp Pro Glu Gly Arg Val Tyr Thr Gln Ala Ile Ala Gln Trp His
        35                  40                  45              
Asp Arg Gly Glu Gln Glu Val Leu Glu Tyr Glu Leu Glu Asp Gly Ser
    50                  55                  60                  
Val Ile Arg Ala Thr Ser Asp His Arg Phe Leu Thr Thr Asp Tyr Gln
65                  70                  75                  80  
Leu Leu Ala Ile Glu Glu Ile Phe Ala Arg Gln Leu Asp Leu Leu Thr
                85                  90                  95      
Leu Glu Asn Ile Lys Gln Thr Glu Glu Ala Leu Asp Asn His Arg Leu
            100                 105                 110         
Pro Phe Pro Leu Leu Asp Ala Gly Thr Ile Lys Met Val Lys Val Ile
        115                 120                 125             
Gly Arg Arg Ser Leu Gly Val Gln Arg Ile Phe Asp Ile Gly Leu Pro
    130                 135                 140                 
Gln Asp His Asn Phe Leu Leu Ala Asn Gly Ala Ile Ala Ala Ala
145                 150                 155                 


<210> 29
<211> 159
<212> PRT
<213> Artificial Sequence

<220> 
<223> Ssp modified DnaE intein

<400> 29
Ala Leu Ser Phe Gly Thr Glu Ile Leu Thr Val Glu Tyr Gly Pro Leu
 1               5                  10                  15      
Pro Ile Gly Lys Ile Val Ser Glu Glu Ile Asn Cys Ser Val Tyr Ser
            20                  25                  30          
Val Asp Pro Glu Gly Arg Val Tyr Thr Gln Ala Ile Ala Gln Trp His
        35                  40                  45              
Asp Arg Gly Glu Gln Glu Val Leu Glu Tyr Glu Leu Glu Asp Gly Ser
    50                  55                  60                  
Val Ile Arg Ala Thr Ser Asp His Arg Phe Leu Thr Thr Asp Tyr Gln
65                  70                  75                  80  
Leu Leu Ala Ile Glu Glu Ile Phe Ala Arg Gln Leu Asp Leu Leu Thr
                85                  90                  95      
Leu Glu Asn Ile Lys Gln Thr Glu Glu Ala Leu Asp Asn His Arg Leu
            100                 105                 110         
Pro Phe Pro Leu Leu Asp Ala Gly Thr Ile Lys Met Val Lys Val Ile
        115                 120                 125             
Gly Arg Arg Ser Leu Gly Val Gln Arg Ile Phe Asp Ile Gly Leu Pro
    130                 135                 140                 
Gln Asp His Asn Phe Leu Leu Ala Asn Gly Ala Ile Ala Ala Ala
145                 150                 155                 


<210> 30
<211> 12
<212> PRT
<213> Artificial Sequence

<220> 
<223> Mutant GFP

<400> 30
Asp His Met Val Leu His Glu Ser Val Asn Ala Ala
 1               5                  10          


<210> 31
<211> 7
<212> PRT
<213> Artificial Sequence

<220> 
<223> Mutant GFP

<400> 31
Asp His Met Val Leu His Glu
 1               5          


<210> 32
<211> 5
<212> PRT
<213> Artificial Sequence

<220> 
<223> Mutant GFP

<400> 32
Ser Val Asn Ala Ala
 1               5  


<210> 33
<211> 20
<212> PRT
<213> Amphimedon queenslandica

<400> 33
Leu Leu Cys Phe Met Leu Leu Leu Leu Leu Ser Gly Asp Val Glu Leu
 1               5                  10                  15      
Asn Pro Gly Pro
            20  


<210> 34
<211> 20
<212> PRT
<213> Amphimedon queenslandica

<400> 34
His His Phe Met Phe Leu Leu Leu Leu Leu Ala Gly Asp Ile Glu Leu
 1               5                  10                  15      
Asn Pro Gly Pro
            20  


<210> 35
<211> 20
<212> PRT
<213> Saccoglossus kowalevskii

<400> 35
Trp Phe Leu Val Leu Leu Ser Phe Ile Leu Ser Gly Asp Ile Glu Val
 1               5                  10                  15      
Asn Pro Gly Pro
            20  


<210> 36
<211> 20
<212> PRT
<213> Branchiostoma floridae

<400> 36
Lys Asn Cys Ala Met Tyr Met Leu Leu Leu Ser Gly Asp Val Glu Thr
 1               5                  10                  15      
Asn Pro Gly Pro
            20  


<210> 37
<211> 20
<212> PRT
<213> Branchiostoma floridae

<400> 37
Met Val Ile Ser Gln Leu Met Leu Lys Leu Ala Gly Asp Val Glu Glu
 1               5                  10                  15      
Asn Pro Gly Pro
            20  


<210> 38
<211> 8
<212> PRT
<213> Artificial Sequence

<220> 
<223> 2A consensus sequence

<220> 
<221> VARIANT        
<222> 2, 4
<223> Xaa = any amino acid residue

<400> 38
Asp Xaa Glu Xaa Asn Pro Gly Pro
 1               5              


<210> 39
<211> 795
<212> DNA
<213> Artificial Sequence

<220> 
<223> Processing domain

<400> 39
ctcgagggag gatctaagtt tgcaaatgat tgtttgtcct tcggaactga gatacttaca 60
gttgaatatg gaccacttcc tattggaaag attgtgagtg aagagatcaa ctgcagtgtt 120
tattccgtgg atccagaggg tagagtttac actcaagcaa ttgctcagtg gcatgatagg 180
ggagaacagg aggttcttga atatgagttg gaagatggtt ctgtgataag agctacatca 240
gatcacaggt ttcttactac agattaccaa cttttggcaa tcgaagagat tttcgctaga 300
cagctcgatc ttctcacttt ggaaaatatt aagcaaacag aagaggcact tgataaccat 360
aggcttccat ttcctctttt ggatgctgga actattaaga tggttaaagt gataggaaga 420
aggtcattgg gtgttcaaag aatatttgat atcggacttc ctcaggatca caatttctta 480
ctcgcaaacg gtgctattgc tgcagcttgt ttcaatggtt ctggttctag agttactgag 540
cttttgtata ggatgaagag ggcagaaaca tactgcccaa gacctttact cgcaatccat 600
ccaacagagg ctaggcacaa gcaaaaaatt gttgctcctg tgaaacagct tttgaacttt 660
gatcttctca agcttgcggg agacgtcgag tccaaccctg ggccccaggt gctgaacacc 720
atggtgaaca aacacttctt gtccctttcg gtcctcatcg tcctccttgg cctctcctcc 780
aacttgacag ccggc                                                  795

<210> 40
<211> 265
<212> PRT
<213> Artificial Sequence

<220> 
<223> Processing domain

<400> 40
Leu Glu Gly Gly Ser Lys Phe Ala Asn Asp Cys Leu Ser Phe Gly Thr
 1               5                  10                  15      
Glu Ile Leu Thr Val Glu Tyr Gly Pro Leu Pro Ile Gly Lys Ile Val
            20                  25                  30          
Ser Glu Glu Ile Asn Cys Ser Val Tyr Ser Val Asp Pro Glu Gly Arg
        35                  40                  45              
Val Tyr Thr Gln Ala Ile Ala Gln Trp His Asp Arg Gly Glu Gln Glu
    50                  55                  60                  
Val Leu Glu Tyr Glu Leu Glu Asp Gly Ser Val Ile Arg Ala Thr Ser
65                  70                  75                  80  
Asp His Arg Phe Leu Thr Thr Asp Tyr Gln Leu Leu Ala Ile Glu Glu
                85                  90                  95      
Ile Phe Ala Arg Gln Leu Asp Leu Leu Thr Leu Glu Asn Ile Lys Gln
            100                 105                 110         
Thr Glu Glu Ala Leu Asp Asn His Arg Leu Pro Phe Pro Leu Leu Asp
        115                 120                 125             
Ala Gly Thr Ile Lys Met Val Lys Val Ile Gly Arg Arg Ser Leu Gly
    130                 135                 140                 
Val Gln Arg Ile Phe Asp Ile Gly Leu Pro Gln Asp His Asn Phe Leu
145                 150                 155                 160 
Leu Ala Asn Gly Ala Ile Ala Ala Ala Cys Phe Asn Gly Ser Gly Ser
                165                 170                 175     
Arg Val Thr Glu Leu Leu Tyr Arg Met Lys Arg Ala Glu Thr Tyr Cys
            180                 185                 190         
Pro Arg Pro Leu Leu Ala Ile His Pro Thr Glu Ala Arg His Lys Gln
        195                 200                 205             
Lys Ile Val Ala Pro Val Lys Gln Leu Leu Asn Phe Asp Leu Leu Lys
    210                 215                 220                 
Leu Ala Gly Asp Val Glu Ser Asn Pro Gly Pro Gln Val Leu Asn Thr
225                 230                 235                 240 
Met Val Asn Lys His Phe Leu Ser Leu Ser Val Leu Ile Val Leu Leu
                245                 250                 255     
Gly Leu Ser Ser Asn Leu Thr Ala Gly
            260                 265 


<210> 41
<211> 705
<212> DNA
<213> Artificial Sequence

<220> 
<223> Processing domain

<220> 
<221> CDS            
<222> (1)...(705)

<400> 41
ctc gag gga gga tct aag ttt gca aat gat tgt ttg tcc ttc gga act   48
Leu Glu Gly Gly Ser Lys Phe Ala Asn Asp Cys Leu Ser Phe Gly Thr
 1               5                   10                  15
 
gag ata ctt aca gtt gaa tat gga cca ctt cct att gga aag att gtg   96
Glu Ile Leu Thr Val Glu Tyr Gly Pro Leu Pro Ile Gly Lys Ile Val
             20                  25                  30
 
agt gaa gag atc aac tgc agt gtt tat tcc gtg gat cca gag ggt aga   144
Ser Glu Glu Ile Asn Cys Ser Val Tyr Ser Val Asp Pro Glu Gly Arg
         35                  40                  45
 
gtt tac act caa gca att gct cag tgg cat gat agg gga gaa cag gag   192
Val Tyr Thr Gln Ala Ile Ala Gln Trp His Asp Arg Gly Glu Gln Glu
     50                  55                  60
 
gtt ctt gaa tat gag ttg gaa gat ggt tct gtg ata aga gct aca tca   240
Val Leu Glu Tyr Glu Leu Glu Asp Gly Ser Val Ile Arg Ala Thr Ser
 65                  70                  75                  80
 
gat cac agg ttt ctt act aca gat tac caa ctt ttg gca atc gaa gag   288
Asp His Arg Phe Leu Thr Thr Asp Tyr Gln Leu Leu Ala Ile Glu Glu
                 85                  90                  95
 
att ttc gct aga cag ctc gat ctt ctc act ttg gaa aat att aag caa   336
Ile Phe Ala Arg Gln Leu Asp Leu Leu Thr Leu Glu Asn Ile Lys Gln
            100                 105                 110
 
aca gaa gag gca ctt gat aac cat agg ctt cca ttt cct ctt ttg gat   384
Thr Glu Glu Ala Leu Asp Asn His Arg Leu Pro Phe Pro Leu Leu Asp
        115                 120                 125
 
gct gga act att aag atg gtt aaa gtg ata gga aga agg tca ttg ggt   432
Ala Gly Thr Ile Lys Met Val Lys Val Ile Gly Arg Arg Ser Leu Gly
    130                 135                 140
 
gtt caa aga ata ttt gat atc gga ctt cct cag gat cac aat ttc tta   480
Val Gln Arg Ile Phe Asp Ile Gly Leu Pro Gln Asp His Asn Phe Leu
145                 150                 155                 160
 
ctc gca aac ggt gct att gct gca gct tgt tct tgt ggt tct ggt tct   528
Leu Ala Asn Gly Ala Ile Ala Ala Ala Cys Ser Cys Gly Ser Gly Ser
                165                 170                 175
 
aga gtt act gag ctt ttg tat agg atg aag agg gca gaa aca tac tgc   576
Arg Val Thr Glu Leu Leu Tyr Arg Met Lys Arg Ala Glu Thr Tyr Cys
            180                 185                 190
 
cca aga cct tta ctc gca atc cat cca aca gag gct agg cac aag caa   624
Pro Arg Pro Leu Leu Ala Ile His Pro Thr Glu Ala Arg His Lys Gln
        195                 200                 205
 
aaa att gtt gct cct gtg aaa cag ctt ttg aac ttt gat ctt ctc aag   672
Lys Ile Val Ala Pro Val Lys Gln Leu Leu Asn Phe Asp Leu Leu Lys
    210                 215                 220
 
ctt gcg gga gac gtc gag tcc aac cct ggg ccc                       705
Leu Ala Gly Asp Val Glu Ser Asn Pro Gly Pro
225                 230                 235
 

<210> 42
<211> 235
<212> PRT
<213> Artificial Sequence

<220> 
<223> Processing domain

<400> 42
Leu Glu Gly Gly Ser Lys Phe Ala Asn Asp Cys Leu Ser Phe Gly Thr
 1               5                  10                  15      
Glu Ile Leu Thr Val Glu Tyr Gly Pro Leu Pro Ile Gly Lys Ile Val
            20                  25                  30          
Ser Glu Glu Ile Asn Cys Ser Val Tyr Ser Val Asp Pro Glu Gly Arg
        35                  40                  45              
Val Tyr Thr Gln Ala Ile Ala Gln Trp His Asp Arg Gly Glu Gln Glu
    50                  55                  60                  
Val Leu Glu Tyr Glu Leu Glu Asp Gly Ser Val Ile Arg Ala Thr Ser
65                  70                  75                  80  
Asp His Arg Phe Leu Thr Thr Asp Tyr Gln Leu Leu Ala Ile Glu Glu
                85                  90                  95      
Ile Phe Ala Arg Gln Leu Asp Leu Leu Thr Leu Glu Asn Ile Lys Gln
            100                 105                 110         
Thr Glu Glu Ala Leu Asp Asn His Arg Leu Pro Phe Pro Leu Leu Asp
        115                 120                 125             
Ala Gly Thr Ile Lys Met Val Lys Val Ile Gly Arg Arg Ser Leu Gly
    130                 135                 140                 
Val Gln Arg Ile Phe Asp Ile Gly Leu Pro Gln Asp His Asn Phe Leu
145                 150                 155                 160 
Leu Ala Asn Gly Ala Ile Ala Ala Ala Cys Ser Cys Gly Ser Gly Ser
                165                 170                 175     
Arg Val Thr Glu Leu Leu Tyr Arg Met Lys Arg Ala Glu Thr Tyr Cys
            180                 185                 190         
Pro Arg Pro Leu Leu Ala Ile His Pro Thr Glu Ala Arg His Lys Gln
        195                 200                 205             
Lys Ile Val Ala Pro Val Lys Gln Leu Leu Asn Phe Asp Leu Leu Lys
    210                 215                 220                 
Leu Ala Gly Asp Val Glu Ser Asn Pro Gly Pro
225                 230                 235 


<210> 43
<211> 723
<212> DNA
<213> Artificial Sequence

<220> 
<223> Processing domain

<400> 43
tgtttgtcct tcggaactga gatacttaca gttgaatatg gaccacttcc tattggaaag 60
attgtgagtg aagagatcaa ctgcagtgtt tattccgtgg atccagaggg tagagtttac 120
actcaagcaa ttgctcagtg gcatgatagg ggagaacagg aggttcttga atatgagttg 180
gaagatggtt ctgtgataag agctacatca gatcacaggt ttcttactac agattaccaa 240
cttttggcaa tcgaagagat tttcgctaga cagctcgatc ttctcacttt ggaaaatatt 300
aagcaaacag aagaggcact tgataaccat aggcttccat ttcctctttt ggatgctgga 360
actattaaga tggttaaagt gataggaaga aggtcattgg gtgttcaaag aatatttgat 420
atcggacttc ctcaggatca caatttctta ctcgcaaacg gtgctattgc tgcagcttgt 480
tcttgtggtt ctggtatgca gatcttcgta aagactttga ccggaaagac catcactctt 540
gaagttgaaa gctccgacac cattgataac gtgaaggcta agatccagga caaggaaggc 600
attcctccgg accagcagcg tctcatcttc gctggaaggc agcttgagga tggacgtact 660
ttggccgact acaacatcca gaaggagtcc actcttcact tggtcctccg tctccgcggc 720
ggt                                                               723

<210> 44
<211> 241
<212> PRT
<213> Artificial Sequence

<220> 
<223> Processing domain

<400> 44
Cys Leu Ser Phe Gly Thr Glu Ile Leu Thr Val Glu Tyr Gly Pro Leu
 1               5                  10                  15      
Pro Ile Gly Lys Ile Val Ser Glu Glu Ile Asn Cys Ser Val Tyr Ser
            20                  25                  30          
Val Asp Pro Glu Gly Arg Val Tyr Thr Gln Ala Ile Ala Gln Trp His
        35                  40                  45              
Asp Arg Gly Glu Gln Glu Val Leu Glu Tyr Glu Leu Glu Asp Gly Ser
    50                  55                  60                  
Val Ile Arg Ala Thr Ser Asp His Arg Phe Leu Thr Thr Asp Tyr Gln
65                  70                  75                  80  
Leu Leu Ala Ile Glu Glu Ile Phe Ala Arg Gln Leu Asp Leu Leu Thr
                85                  90                  95      
Leu Glu Asn Ile Lys Gln Thr Glu Glu Ala Leu Asp Asn His Arg Leu
            100                 105                 110         
Pro Phe Pro Leu Leu Asp Ala Gly Thr Ile Lys Met Val Lys Val Ile
        115                 120                 125             
Gly Arg Arg Ser Leu Gly Val Gln Arg Ile Phe Asp Ile Gly Leu Pro
    130                 135                 140                 
Gln Asp His Asn Phe Leu Leu Ala Asn Gly Ala Ile Ala Ala Ala Cys
145                 150                 155                 160 
Ser Cys Gly Ser Gly Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys
                165                 170                 175     
Thr Ile Thr Leu Glu Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys
            180                 185                 190         
Ala Lys Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Leu
        195                 200                 205             
Ile Phe Ala Gly Arg Gln Leu Glu Asp Gly Arg Thr Leu Ala Asp Tyr
    210                 215                 220                 
Asn Ile Gln Lys Glu Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly
225                 230                 235                 240 
Gly
    


<210> 45
<211> 576
<212> DNA
<213> Artificial Sequence

<220> 
<223> Processing domain

<220> 
<221> CDS            
<222> (1)...(576)

<400> 45
tgt ttg tcc ttc gga act gag ata ctt aca gtt gaa tat gga cca ctt   48
Cys Leu Ser Phe Gly Thr Glu Ile Leu Thr Val Glu Tyr Gly Pro Leu
 1               5                   10                  15
 
cct att gga aag att gtg agt gaa gag atc aac tgc agt gtt tat tcc   96
Pro Ile Gly Lys Ile Val Ser Glu Glu Ile Asn Cys Ser Val Tyr Ser
             20                  25                  30
 
gtg gat cca gag ggt aga gtt tac act caa gca att gct cag tgg cat   144
Val Asp Pro Glu Gly Arg Val Tyr Thr Gln Ala Ile Ala Gln Trp His
         35                  40                  45
 
gat agg gga gaa cag gag gtt ctt gaa tat gag ttg gaa gat ggt tct   192
Asp Arg Gly Glu Gln Glu Val Leu Glu Tyr Glu Leu Glu Asp Gly Ser
     50                  55                  60
 
gtg ata aga gct aca tca gat cac agg ttt ctt act aca gat tac caa   240
Val Ile Arg Ala Thr Ser Asp His Arg Phe Leu Thr Thr Asp Tyr Gln
 65                  70                  75                  80
 
ctt ttg gca atc gaa gag att ttc gct aga cag ctc gat ctt ctc act   288
Leu Leu Ala Ile Glu Glu Ile Phe Ala Arg Gln Leu Asp Leu Leu Thr
                 85                  90                  95
 
ttg gaa aat att aag caa aca gaa gag gca ctt gat aac cat agg ctt   336
Leu Glu Asn Ile Lys Gln Thr Glu Glu Ala Leu Asp Asn His Arg Leu
            100                 105                 110
 
cca ttt cct ctt ttg gat gct gga act att aag atg gtt aaa gtg ata   384
Pro Phe Pro Leu Leu Asp Ala Gly Thr Ile Lys Met Val Lys Val Ile
        115                 120                 125
 
gga aga agg tca ttg ggt gtt caa aga ata ttt gat atc gga ctt cct   432
Gly Arg Arg Ser Leu Gly Val Gln Arg Ile Phe Asp Ile Gly Leu Pro
    130                 135                 140
 
cag gat cac aat ttc tta ctc gca aac ggt gct att gct gca gct tgt   480
Gln Asp His Asn Phe Leu Leu Ala Asn Gly Ala Ile Ala Ala Ala Cys
145                 150                 155                 160
 
tct tgt ggt tct ggt tct aga gat gga ttc tgc att ctc tat ctg ctc   528
Ser Cys Gly Ser Gly Ser Arg Asp Gly Phe Cys Ile Leu Tyr Leu Leu
                165                 170                 175
 
ctg atc ctc ttg atg aga tct ggt gac gtt gaa acc aat cca ggg ccc   576
Leu Ile Leu Leu Met Arg Ser Gly Asp Val Glu Thr Asn Pro Gly Pro
            180                 185                 190
 


<210> 46
<211> 192
<212> PRT
<213> Artificial Sequence

<220> 
<223> Processing domain

<400> 46
Cys Leu Ser Phe Gly Thr Glu Ile Leu Thr Val Glu Tyr Gly Pro Leu
 1               5                  10                  15      
Pro Ile Gly Lys Ile Val Ser Glu Glu Ile Asn Cys Ser Val Tyr Ser
            20                  25                  30          
Val Asp Pro Glu Gly Arg Val Tyr Thr Gln Ala Ile Ala Gln Trp His
        35                  40                  45              
Asp Arg Gly Glu Gln Glu Val Leu Glu Tyr Glu Leu Glu Asp Gly Ser
    50                  55                  60                  
Val Ile Arg Ala Thr Ser Asp His Arg Phe Leu Thr Thr Asp Tyr Gln
65                  70                  75                  80  
Leu Leu Ala Ile Glu Glu Ile Phe Ala Arg Gln Leu Asp Leu Leu Thr
                85                  90                  95      
Leu Glu Asn Ile Lys Gln Thr Glu Glu Ala Leu Asp Asn His Arg Leu
            100                 105                 110         
Pro Phe Pro Leu Leu Asp Ala Gly Thr Ile Lys Met Val Lys Val Ile
        115                 120                 125             
Gly Arg Arg Ser Leu Gly Val Gln Arg Ile Phe Asp Ile Gly Leu Pro
    130                 135                 140                 
Gln Asp His Asn Phe Leu Leu Ala Asn Gly Ala Ile Ala Ala Ala Cys
145                 150                 155                 160 
Ser Cys Gly Ser Gly Ser Arg Asp Gly Phe Cys Ile Leu Tyr Leu Leu
                165                 170                 175     
Leu Ile Leu Leu Met Arg Ser Gly Asp Val Glu Thr Asn Pro Gly Pro
            180                 185                 190         


<210> 47
<211> 1014
<212> DNA
<213> Artificial Sequence

<220> 
<223> Processing domain

<220> 
<221> CDS            
<222> (1)...(1014)

<400> 47
ctc gag gga gga tct aag ttt gca aat gat tgt ttg tcc ttc gga act   48
Leu Glu Gly Gly Ser Lys Phe Ala Asn Asp Cys Leu Ser Phe Gly Thr
 1               5                   10                  15
 
gag ata ctt aca gtt gaa tat gga cca ctt cct att gga aag att gtg   96
Glu Ile Leu Thr Val Glu Tyr Gly Pro Leu Pro Ile Gly Lys Ile Val
             20                  25                  30
 
agt gaa gag atc aac tgc agt gtt tat tcc gtg gat cca gag ggt aga   144
Ser Glu Glu Ile Asn Cys Ser Val Tyr Ser Val Asp Pro Glu Gly Arg
         35                  40                  45
 
gtt tac act caa gca att gct cag tgg cat gat agg gga gaa cag gag   192
Val Tyr Thr Gln Ala Ile Ala Gln Trp His Asp Arg Gly Glu Gln Glu
     50                  55                  60
 
gtt ctt gaa tat gag ttg gaa gat ggt tct gtg ata aga gct aca tca   240
Val Leu Glu Tyr Glu Leu Glu Asp Gly Ser Val Ile Arg Ala Thr Ser
 65                  70                  75                  80
 
gat cac agg ttt ctt act aca gat tac caa ctt ttg gca atc gaa gag   288
Asp His Arg Phe Leu Thr Thr Asp Tyr Gln Leu Leu Ala Ile Glu Glu
                 85                  90                  95
 
att ttc gct aga cag ctc gat ctt ctc act ttg gaa aat att aag caa   336
Ile Phe Ala Arg Gln Leu Asp Leu Leu Thr Leu Glu Asn Ile Lys Gln
            100                 105                 110
 
aca gaa gag gca ctt gat aac cat agg ctt cca ttt cct ctt ttg gat   384
Thr Glu Glu Ala Leu Asp Asn His Arg Leu Pro Phe Pro Leu Leu Asp
        115                 120                 125
 
gct gga act att aag atg gtt aaa gtg ata gga aga agg tca ttg ggt   432
Ala Gly Thr Ile Lys Met Val Lys Val Ile Gly Arg Arg Ser Leu Gly
    130                 135                 140
 
gtt caa aga ata ttt gat atc gga ctt cct cag gat cac aat ttc tta   480
Val Gln Arg Ile Phe Asp Ile Gly Leu Pro Gln Asp His Asn Phe Leu
145                 150                 155                 160
 
ctc gca aac ggt gct att gct gca gct tgt ttc aat ggt tct ggt tct   528
Leu Ala Asn Gly Ala Ile Ala Ala Ala Cys Phe Asn Gly Ser Gly Ser
                165                 170                 175
 
aga gag tct gga gct atc tct ggc gat agt ctg atc agc ctg gct agc   576
Arg Glu Ser Gly Ala Ile Ser Gly Asp Ser Leu Ile Ser Leu Ala Ser
            180                 185                 190
 
aca gga aaa aga gtt tct att aaa gat ttg tta gat gaa aaa gat ttt   624
Thr Gly Lys Arg Val Ser Ile Lys Asp Leu Leu Asp Glu Lys Asp Phe
        195                 200                 205
 
gaa ata tgg gca att aat gaa cag acg atg aag cta gaa tca gct aaa   672
Glu Ile Trp Ala Ile Asn Glu Gln Thr Met Lys Leu Glu Ser Ala Lys
    210                 215                 220
 
gtt agt cgt gta ttt tgt act ggc aaa aag cta gtt tat att cta aaa   720
Val Ser Arg Val Phe Cys Thr Gly Lys Lys Leu Val Tyr Ile Leu Lys
225                 230                 235                 240
 
act cga cta ggt aga act atc aag gca aca gca aat cat aga ttt tta   768
Thr Arg Leu Gly Arg Thr Ile Lys Ala Thr Ala Asn His Arg Phe Leu
                245                 250                 255
 
act att gat ggt tgg aaa aga tta gat gag cta tct tta aaa gag cat   816
Thr Ile Asp Gly Trp Lys Arg Leu Asp Glu Leu Ser Leu Lys Glu His
            260                 265                 270
 
att gct cta ccc cgt aaa cta gaa agc tcc tct tta caa ttg tca cca   864
Ile Ala Leu Pro Arg Lys Leu Glu Ser Ser Ser Leu Gln Leu Ser Pro
        275                 280                 285
 
gaa ata gaa aag ttg tct cag agt gat att tac tgg gac tcc atc gtt   912
Glu Ile Glu Lys Leu Ser Gln Ser Asp Ile Tyr Trp Asp Ser Ile Val
    290                 295                 300
 
tct att acg gag act gga gtc gaa gag gtt ttt gat ttg act gtg cca   960
Ser Ile Thr Glu Thr Gly Val Glu Glu Val Phe Asp Leu Thr Val Pro
305                 310                 315                 320
 
gga cca cat aac ttt gtc gcg aat gac atc att gta cac aac agc cgc   1008
Gly Pro His Asn Phe Val Ala Asn Asp Ile Ile Val His Asn Ser Arg
                325                 330                 335
 
ggg ccc                                                           1014
Gly Pro

 

<210> 48
<211> 338
<212> PRT
<213> Artificial Sequence

<220> 
<223> Processing domain

<400> 48
Leu Glu Gly Gly Ser Lys Phe Ala Asn Asp Cys Leu Ser Phe Gly Thr
 1               5                  10                  15      
Glu Ile Leu Thr Val Glu Tyr Gly Pro Leu Pro Ile Gly Lys Ile Val
            20                  25                  30          
Ser Glu Glu Ile Asn Cys Ser Val Tyr Ser Val Asp Pro Glu Gly Arg
        35                  40                  45              
Val Tyr Thr Gln Ala Ile Ala Gln Trp His Asp Arg Gly Glu Gln Glu
    50                  55                  60                  
Val Leu Glu Tyr Glu Leu Glu Asp Gly Ser Val Ile Arg Ala Thr Ser
65                  70                  75                  80  
Asp His Arg Phe Leu Thr Thr Asp Tyr Gln Leu Leu Ala Ile Glu Glu
                85                  90                  95      
Ile Phe Ala Arg Gln Leu Asp Leu Leu Thr Leu Glu Asn Ile Lys Gln
            100                 105                 110         
Thr Glu Glu Ala Leu Asp Asn His Arg Leu Pro Phe Pro Leu Leu Asp
        115                 120                 125             
Ala Gly Thr Ile Lys Met Val Lys Val Ile Gly Arg Arg Ser Leu Gly
    130                 135                 140                 
Val Gln Arg Ile Phe Asp Ile Gly Leu Pro Gln Asp His Asn Phe Leu
145                 150                 155                 160 
Leu Ala Asn Gly Ala Ile Ala Ala Ala Cys Phe Asn Gly Ser Gly Ser
                165                 170                 175     
Arg Glu Ser Gly Ala Ile Ser Gly Asp Ser Leu Ile Ser Leu Ala Ser
            180                 185                 190         
Thr Gly Lys Arg Val Ser Ile Lys Asp Leu Leu Asp Glu Lys Asp Phe
        195                 200                 205             
Glu Ile Trp Ala Ile Asn Glu Gln Thr Met Lys Leu Glu Ser Ala Lys
    210                 215                 220                 
Val Ser Arg Val Phe Cys Thr Gly Lys Lys Leu Val Tyr Ile Leu Lys
225                 230                 235                 240 
Thr Arg Leu Gly Arg Thr Ile Lys Ala Thr Ala Asn His Arg Phe Leu
                245                 250                 255     
Thr Ile Asp Gly Trp Lys Arg Leu Asp Glu Leu Ser Leu Lys Glu His
            260                 265                 270         
Ile Ala Leu Pro Arg Lys Leu Glu Ser Ser Ser Leu Gln Leu Ser Pro
        275                 280                 285             
Glu Ile Glu Lys Leu Ser Gln Ser Asp Ile Tyr Trp Asp Ser Ile Val
    290                 295                 300                 
Ser Ile Thr Glu Thr Gly Val Glu Glu Val Phe Asp Leu Thr Val Pro
305                 310                 315                 320 
Gly Pro His Asn Phe Val Ala Asn Asp Ile Ile Val His Asn Ser Arg
                325                 330                 335     
Gly Pro
        


<210> 49
<211> 981
<212> DNA
<213> Artificial Sequence

<220> 
<223> Processing domain

<220> 
<221> CDS            
<222> (1)...(981)

<400> 49
ctc gag tat gca ttg tcc ttc gga act gag ata ctt aca gtt gaa tat   48
Leu Glu Tyr Ala Leu Ser Phe Gly Thr Glu Ile Leu Thr Val Glu Tyr
 1               5                   10                  15
 
gga cca ctt cct att gga aag att gtg agt gaa gag atc aac tgc agt   96
Gly Pro Leu Pro Ile Gly Lys Ile Val Ser Glu Glu Ile Asn Cys Ser
             20                  25                  30
 
gtt tat tcc gtg gat cca gag ggt aga gtt tac act caa gca att gct   144
Val Tyr Ser Val Asp Pro Glu Gly Arg Val Tyr Thr Gln Ala Ile Ala
         35                  40                  45
 
cag tgg cat gat agg gga gaa cag gag gtt ctt gaa tat gag ttg gaa   192
Gln Trp His Asp Arg Gly Glu Gln Glu Val Leu Glu Tyr Glu Leu Glu
     50                  55                  60
 
gat ggt tct gtg ata aga gct aca tca gat cac agg ttt ctt act aca   240
Asp Gly Ser Val Ile Arg Ala Thr Ser Asp His Arg Phe Leu Thr Thr
 65                  70                  75                  80
 
gat tac caa ctt ttg gca atc gaa gag att ttc gct aga cag ctc gat   288
Asp Tyr Gln Leu Leu Ala Ile Glu Glu Ile Phe Ala Arg Gln Leu Asp
                 85                  90                  95
 
ctt ctc act ttg gaa aat att aag caa aca gaa gag gca ctt gat aac   336
Leu Leu Thr Leu Glu Asn Ile Lys Gln Thr Glu Glu Ala Leu Asp Asn
            100                 105                 110
 
cat agg ctt cca ttt cct ctt ttg gat gct gga act att aag atg gtt   384
His Arg Leu Pro Phe Pro Leu Leu Asp Ala Gly Thr Ile Lys Met Val
        115                 120                 125
 
aaa gtg ata gga aga agg tca ttg ggt gtt caa aga ata ttt gat atc   432
Lys Val Ile Gly Arg Arg Ser Leu Gly Val Gln Arg Ile Phe Asp Ile
    130                 135                 140
 
gga ctt cct cag gat cac aat ttc tta ctc gca aac ggt gct att gct   480
Gly Leu Pro Gln Asp His Asn Phe Leu Leu Ala Asn Gly Ala Ile Ala
145                 150                 155                 160
 
gca gct ggt ggt tct aga gag tct gga gct atc tct ggc gat agt ctg   528
Ala Ala Gly Gly Ser Arg Glu Ser Gly Ala Ile Ser Gly Asp Ser Leu
                165                 170                 175
 
atc agc ctg gct agc aca gga aaa aga gtt tct att aaa gat ttg tta   576
Ile Ser Leu Ala Ser Thr Gly Lys Arg Val Ser Ile Lys Asp Leu Leu
            180                 185                 190
 
gat gaa aaa gat ttt gaa ata tgg gca att aat gaa cag acg atg aag   624
Asp Glu Lys Asp Phe Glu Ile Trp Ala Ile Asn Glu Gln Thr Met Lys
        195                 200                 205
 
cta gaa tca gct aaa gtt agt cgt gta ttt tgt act ggc aaa aag cta   672
Leu Glu Ser Ala Lys Val Ser Arg Val Phe Cys Thr Gly Lys Lys Leu
    210                 215                 220
 
gtt tat att cta aaa act cga cta ggt aga act atc aag gca aca gca   720
Val Tyr Ile Leu Lys Thr Arg Leu Gly Arg Thr Ile Lys Ala Thr Ala
225                 230                 235                 240
 
aat cat aga ttt tta act att gat ggt tgg aaa aga tta gat gag cta   768
Asn His Arg Phe Leu Thr Ile Asp Gly Trp Lys Arg Leu Asp Glu Leu
                245                 250                 255
 
tct tta aaa gag cat att gct cta ccc cgt aaa cta gaa agc tcc tct   816
Ser Leu Lys Glu His Ile Ala Leu Pro Arg Lys Leu Glu Ser Ser Ser
            260                 265                 270
 
tta caa ttg tca cca gaa ata gaa aag ttg tct cag agt gat att tac   864
Leu Gln Leu Ser Pro Glu Ile Glu Lys Leu Ser Gln Ser Asp Ile Tyr
        275                 280                 285
 
tgg gac tcc atc gtt tct att acg gag act gga gtc gaa gag gtt ttt   912
Trp Asp Ser Ile Val Ser Ile Thr Glu Thr Gly Val Glu Glu Val Phe
    290                 295                 300
 
gat ttg act gtg cca gga cca cat aac ttt gtc gcg aat gac atc att   960
Asp Leu Thr Val Pro Gly Pro His Asn Phe Val Ala Asn Asp Ile Ile
305                 310                 315                 320
 
gta cac aac agc cgc ggg ccc                                       981
Val His Asn Ser Arg Gly Pro
                325
 

<210> 50
<211> 327
<212> PRT
<213> Artificial Sequence

<220> 
<223> Processing domain

<400> 50
Leu Glu Tyr Ala Leu Ser Phe Gly Thr Glu Ile Leu Thr Val Glu Tyr
 1               5                  10                  15      
Gly Pro Leu Pro Ile Gly Lys Ile Val Ser Glu Glu Ile Asn Cys Ser
            20                  25                  30          
Val Tyr Ser Val Asp Pro Glu Gly Arg Val Tyr Thr Gln Ala Ile Ala
        35                  40                  45              
Gln Trp His Asp Arg Gly Glu Gln Glu Val Leu Glu Tyr Glu Leu Glu
    50                  55                  60                  
Asp Gly Ser Val Ile Arg Ala Thr Ser Asp His Arg Phe Leu Thr Thr
65                  70                  75                  80  
Asp Tyr Gln Leu Leu Ala Ile Glu Glu Ile Phe Ala Arg Gln Leu Asp
                85                  90                  95      
Leu Leu Thr Leu Glu Asn Ile Lys Gln Thr Glu Glu Ala Leu Asp Asn
            100                 105                 110         
His Arg Leu Pro Phe Pro Leu Leu Asp Ala Gly Thr Ile Lys Met Val
        115                 120                 125             
Lys Val Ile Gly Arg Arg Ser Leu Gly Val Gln Arg Ile Phe Asp Ile
    130                 135                 140                 
Gly Leu Pro Gln Asp His Asn Phe Leu Leu Ala Asn Gly Ala Ile Ala
145                 150                 155                 160 
Ala Ala Gly Gly Ser Arg Glu Ser Gly Ala Ile Ser Gly Asp Ser Leu
                165                 170                 175     
Ile Ser Leu Ala Ser Thr Gly Lys Arg Val Ser Ile Lys Asp Leu Leu
            180                 185                 190         
Asp Glu Lys Asp Phe Glu Ile Trp Ala Ile Asn Glu Gln Thr Met Lys
        195                 200                 205             
Leu Glu Ser Ala Lys Val Ser Arg Val Phe Cys Thr Gly Lys Lys Leu
    210                 215                 220                 
Val Tyr Ile Leu Lys Thr Arg Leu Gly Arg Thr Ile Lys Ala Thr Ala
225                 230                 235                 240 
Asn His Arg Phe Leu Thr Ile Asp Gly Trp Lys Arg Leu Asp Glu Leu
                245                 250                 255     
Ser Leu Lys Glu His Ile Ala Leu Pro Arg Lys Leu Glu Ser Ser Ser
            260                 265                 270         
Leu Gln Leu Ser Pro Glu Ile Glu Lys Leu Ser Gln Ser Asp Ile Tyr
        275                 280                 285             
Trp Asp Ser Ile Val Ser Ile Thr Glu Thr Gly Val Glu Glu Val Phe
    290                 295                 300                 
Asp Leu Thr Val Pro Gly Pro His Asn Phe Val Ala Asn Asp Ile Ile
305                 310                 315                 320 
Val His Asn Ser Arg Gly Pro
                325         


<210> 51
<211> 705
<212> DNA
<213> Artificial Sequence

<220> 
<223> Processing domain

<220> 
<221> CDS            
<222> (1)...(705)

<400> 51
ctc gag gga gga tct aag ttt gca aat gat tgt ttg tcc ttc gga act   48
Leu Glu Gly Gly Ser Lys Phe Ala Asn Asp Cys Leu Ser Phe Gly Thr
 1               5                   10                  15
 
gag ata ctt aca gtt gaa tat gga cca ctt cct att gga aag att gtg   96
Glu Ile Leu Thr Val Glu Tyr Gly Pro Leu Pro Ile Gly Lys Ile Val
             20                  25                  30
 
agt gaa gag atc aac tgc agt gtt tat tcc gtg gat cca gag ggt aga   144
Ser Glu Glu Ile Asn Cys Ser Val Tyr Ser Val Asp Pro Glu Gly Arg
         35                  40                  45
 
gtt tac act caa gca att gct cag tgg cat gat agg gga gaa cag gag   192
Val Tyr Thr Gln Ala Ile Ala Gln Trp His Asp Arg Gly Glu Gln Glu
     50                  55                  60
 
gtt ctt gaa tat gag ttg gaa gat ggt tct gtg ata aga gct aca tca   240
Val Leu Glu Tyr Glu Leu Glu Asp Gly Ser Val Ile Arg Ala Thr Ser
 65                  70                  75                  80
 
gat cac agg ttt ctt act aca gat tac caa ctt ttg gca atc gaa gag   288
Asp His Arg Phe Leu Thr Thr Asp Tyr Gln Leu Leu Ala Ile Glu Glu
                 85                  90                  95
 
att ttc gct aga cag ctc gat ctt ctc act ttg gaa aat att aag caa   336
Ile Phe Ala Arg Gln Leu Asp Leu Leu Thr Leu Glu Asn Ile Lys Gln
            100                 105                 110
 
aca gaa gag gca ctt gat aac cat agg ctt cca ttt cct ctt ttg gat   384
Thr Glu Glu Ala Leu Asp Asn His Arg Leu Pro Phe Pro Leu Leu Asp
        115                 120                 125
 
gct gga act att aag atg gtt aaa gtg ata gga aga agg tca ttg ggt   432
Ala Gly Thr Ile Lys Met Val Lys Val Ile Gly Arg Arg Ser Leu Gly
    130                 135                 140
 
gtt caa aga ata ttt gat atc gga ctt cct cag gat cac aat ttc tta   480
Val Gln Arg Ile Phe Asp Ile Gly Leu Pro Gln Asp His Asn Phe Leu
145                 150                 155                 160
 
ctc gca aac ggt gct att gct gca gct tgt ttc aat ggt tct ggt tct   528
Leu Ala Asn Gly Ala Ile Ala Ala Ala Cys Phe Asn Gly Ser Gly Ser
                165                 170                 175
 
aga gtt act gag ctt ttg tat agg atg aag agg gca gaa aca tac tgc   576
Arg Val Thr Glu Leu Leu Tyr Arg Met Lys Arg Ala Glu Thr Tyr Cys
            180                 185                 190
 
cca aga cct tta ctc gca atc cat cca aca gag gct agg cac aag caa   624
Pro Arg Pro Leu Leu Ala Ile His Pro Thr Glu Ala Arg His Lys Gln
        195                 200                 205
 
aaa att gtt gct cct gtg aaa cag ctt ttg aac ttt gat ctt ctc aag   672
Lys Ile Val Ala Pro Val Lys Gln Leu Leu Asn Phe Asp Leu Leu Lys
    210                 215                 220
 
ctt gcg gga gac gtc gag tcc aac cct ggg ccc                       705
Leu Ala Gly Asp Val Glu Ser Asn Pro Gly Pro
225                 230                 235
 

<210> 52
<211> 235
<212> PRT
<213> Artificial Sequence

<220> 
<223> Processing domain

<400> 52
Leu Glu Gly Gly Ser Lys Phe Ala Asn Asp Cys Leu Ser Phe Gly Thr
 1               5                  10                  15      
Glu Ile Leu Thr Val Glu Tyr Gly Pro Leu Pro Ile Gly Lys Ile Val
            20                  25                  30          
Ser Glu Glu Ile Asn Cys Ser Val Tyr Ser Val Asp Pro Glu Gly Arg
        35                  40                  45              
Val Tyr Thr Gln Ala Ile Ala Gln Trp His Asp Arg Gly Glu Gln Glu
    50                  55                  60                  
Val Leu Glu Tyr Glu Leu Glu Asp Gly Ser Val Ile Arg Ala Thr Ser
65                  70                  75                  80  
Asp His Arg Phe Leu Thr Thr Asp Tyr Gln Leu Leu Ala Ile Glu Glu
                85                  90                  95      
Ile Phe Ala Arg Gln Leu Asp Leu Leu Thr Leu Glu Asn Ile Lys Gln
            100                 105                 110         
Thr Glu Glu Ala Leu Asp Asn His Arg Leu Pro Phe Pro Leu Leu Asp
        115                 120                 125             
Ala Gly Thr Ile Lys Met Val Lys Val Ile Gly Arg Arg Ser Leu Gly
    130                 135                 140                 
Val Gln Arg Ile Phe Asp Ile Gly Leu Pro Gln Asp His Asn Phe Leu
145                 150                 155                 160 
Leu Ala Asn Gly Ala Ile Ala Ala Ala Cys Phe Asn Gly Ser Gly Ser
                165                 170                 175     
Arg Val Thr Glu Leu Leu Tyr Arg Met Lys Arg Ala Glu Thr Tyr Cys
            180                 185                 190         
Pro Arg Pro Leu Leu Ala Ile His Pro Thr Glu Ala Arg His Lys Gln
        195                 200                 205             
Lys Ile Val Ala Pro Val Lys Gln Leu Leu Asn Phe Asp Leu Leu Lys
    210                 215                 220                 
Leu Ala Gly Asp Val Glu Ser Asn Pro Gly Pro
225                 230                 235 


<210> 53
<211> 813
<212> DNA
<213> Artificial Sequence

<220> 
<223> Processing domain

<220> 
<221> CDS            
<222> (1)...(813)

<400> 53
tgt ttg tcc ttc gga act gag ata ctt aca gtt gaa tat gga cca ctt   48
Cys Leu Ser Phe Gly Thr Glu Ile Leu Thr Val Glu Tyr Gly Pro Leu
 1               5                   10                  15
 
cct att gga aag att gtg agt gaa gag atc aac tgc agt gtt tat tcc   96
Pro Ile Gly Lys Ile Val Ser Glu Glu Ile Asn Cys Ser Val Tyr Ser
             20                  25                  30
 
gtg gat cca gag ggt aga gtt tac act caa gca att gct cag tgg cat   144
Val Asp Pro Glu Gly Arg Val Tyr Thr Gln Ala Ile Ala Gln Trp His
         35                  40                  45
 
gat agg gga gaa cag gag gtt ctt gaa tat gag ttg gaa gat ggt tct   192
Asp Arg Gly Glu Gln Glu Val Leu Glu Tyr Glu Leu Glu Asp Gly Ser
     50                  55                  60
 
gtg ata aga gct aca tca gat cac agg ttt ctt act aca gat tac caa   240
Val Ile Arg Ala Thr Ser Asp His Arg Phe Leu Thr Thr Asp Tyr Gln
 65                  70                  75                  80
 
ctt ttg gca atc gaa gag att ttc gct aga cag ctc gat ctt ctc act   288
Leu Leu Ala Ile Glu Glu Ile Phe Ala Arg Gln Leu Asp Leu Leu Thr
                 85                  90                  95
 
ttg gaa aat att aag caa aca gaa gag gca ctt gat aac cat agg ctt   336
Leu Glu Asn Ile Lys Gln Thr Glu Glu Ala Leu Asp Asn His Arg Leu
            100                 105                 110
 
cca ttt cct ctt ttg gat gct gga act att aag atg gtt aaa gtg ata   384
Pro Phe Pro Leu Leu Asp Ala Gly Thr Ile Lys Met Val Lys Val Ile
        115                 120                 125
 
gga aga agg tca ttg ggt gtt caa aga ata ttt gat atc gga ctt cct   432
Gly Arg Arg Ser Leu Gly Val Gln Arg Ile Phe Asp Ile Gly Leu Pro
    130                 135                 140
 
cag gat cac aat ttc tta ctc gca aac ggt gct att gct gca gct tgt   480
Gln Asp His Asn Phe Leu Leu Ala Asn Gly Ala Ile Ala Ala Ala Cys
145                 150                 155                 160
 
tct tgt ggt tct ggt tct aga gga tct ggc gat gga ttc tgc att ctc   528
Ser Cys Gly Ser Gly Ser Arg Gly Ser Gly Asp Gly Phe Cys Ile Leu
                165                 170                 175
 
tat ctg ctc ctg atc ctc ttg atg agg tct ggt gac gtt gaa acc aac   576
Tyr Leu Leu Leu Ile Leu Leu Met Arg Ser Gly Asp Val Glu Thr Asn
            180                 185                 190
 
cct ggg ccc atg cag atc ttc gta aag act ttg acc gga aag acc atc   624
Pro Gly Pro Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile
        195                 200                 205
 
act ctt gaa gtt gaa agc tcc gac acc att gat aac gtg aag gct aag   672
Thr Leu Glu Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys Ala Lys
    210                 215                 220
 
atc cag gac aag gaa ggc att cct ccg gac cag cag cgt ctc atc ttc   720
Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe
225                 230                 235                 240
 
gct gga agg cag ctt gag gat gga cgt act ttg gcc gac tac aac atc   768
Ala Gly Arg Gln Leu Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn Ile
                245                 250                 255
 
cag aag gag tcc act ctt cac ttg gtc ctc cgt ctc cgc ggc ggt       813
Gln Lys Glu Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly
            260                 265                 270
 


<210> 54
<211> 271
<212> PRT
<213> Artificial Sequence

<220> 
<223> Processing domain

<400> 54
Cys Leu Ser Phe Gly Thr Glu Ile Leu Thr Val Glu Tyr Gly Pro Leu
 1               5                  10                  15      
Pro Ile Gly Lys Ile Val Ser Glu Glu Ile Asn Cys Ser Val Tyr Ser
            20                  25                  30          
Val Asp Pro Glu Gly Arg Val Tyr Thr Gln Ala Ile Ala Gln Trp His
        35                  40                  45              
Asp Arg Gly Glu Gln Glu Val Leu Glu Tyr Glu Leu Glu Asp Gly Ser
    50                  55                  60                  
Val Ile Arg Ala Thr Ser Asp His Arg Phe Leu Thr Thr Asp Tyr Gln
65                  70                  75                  80  
Leu Leu Ala Ile Glu Glu Ile Phe Ala Arg Gln Leu Asp Leu Leu Thr
                85                  90                  95      
Leu Glu Asn Ile Lys Gln Thr Glu Glu Ala Leu Asp Asn His Arg Leu
            100                 105                 110         
Pro Phe Pro Leu Leu Asp Ala Gly Thr Ile Lys Met Val Lys Val Ile
        115                 120                 125             
Gly Arg Arg Ser Leu Gly Val Gln Arg Ile Phe Asp Ile Gly Leu Pro
    130                 135                 140                 
Gln Asp His Asn Phe Leu Leu Ala Asn Gly Ala Ile Ala Ala Ala Cys
145                 150                 155                 160 
Ser Cys Gly Ser Gly Ser Arg Gly Ser Gly Asp Gly Phe Cys Ile Leu
                165                 170                 175     
Tyr Leu Leu Leu Ile Leu Leu Met Arg Ser Gly Asp Val Glu Thr Asn
            180                 185                 190         
Pro Gly Pro Met Gln Ile Phe Val Lys Thr Leu Thr Gly Lys Thr Ile
        195                 200                 205             
Thr Leu Glu Val Glu Ser Ser Asp Thr Ile Asp Asn Val Lys Ala Lys
    210                 215                 220                 
Ile Gln Asp Lys Glu Gly Ile Pro Pro Asp Gln Gln Arg Leu Ile Phe
225                 230                 235                 240 
Ala Gly Arg Gln Leu Glu Asp Gly Arg Thr Leu Ala Asp Tyr Asn Ile
                245                 250                 255     
Gln Lys Glu Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly
            260                 265                 270     

