               SEQUENCE LISTING

<110> The University of Southampton

<120> Mammalian Inteins

<130> CURBD/P75368PC

<160> 22

<170> BiSSAP 1.3.6

<210> 1
<211> 37
<212> PRT
<213> Synechocystis PCC6803


<400> 1
Met Val Lys Val Ile Gly Arg Arg Ser Leu Gly Val Gln Arg Ile Phe 
1               5                   10                  15      
Asp Ile Gly Leu Pro Gln Asp His Asn Phe Leu Leu Ala Asn Gly Ala 
            20                  25                  30          
Ile Ala Ala Asn Cys 
        35          

<210> 2
<211> 126
<212> PRT
<213> Synechocystis PCC6803


<400> 2
Ala Glu Tyr Cys Leu Ser Phe Gly Thr Glu Ile Leu Thr Val Glu Tyr 
1               5                   10                  15      
Gly Pro Leu Pro Ile Gly Lys Ile Val Ser Glu Glu Ile Asn Cys Ser 
            20                  25                  30          
Val Tyr Ser Val Asp Pro Glu Gly Arg Val Tyr Thr Gln Ala Ile Ala 
        35                  40                  45              
Gln Trp His Asp Arg Gly Glu Gln Glu Val Leu Glu Tyr Glu Leu Glu 
    50                  55                  60                  
Asp Gly Ser Val Ile Arg Ala Thr Ser Asp His Arg Phe Leu Thr Thr 
65                  70                  75                  80  
Asp Tyr Gln Leu Leu Ala Ile Glu Glu Ile Phe Ala Arg Gln Leu Asp 
                85                  90                  95      
Leu Leu Thr Leu Glu Asn Ile Lys Gln Thr Glu Glu Ala Leu Asp Asn 
            100                 105                 110         
His Arg Leu Pro Phe Pro Leu Leu Asp Ala Gly Thr Ile Lys 
        115                 120                 125     

<210> 3
<211> 36
<212> PRT
<213> Nostoc sp. PCC73102


<400> 3
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr 
1               5                   10                  15      
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe 
            20                  25                  30          
Ile Ala Ser Asn 
        35      

<210> 4
<211> 102
<212> PRT
<213> Nostoc sp. PCC73102


<400> 4
Cys Leu Ser Tyr Glu Thr Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu 
1               5                   10                  15      
Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser 
            20                  25                  30          
Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro Val Ala Gln Trp His 
        35                  40                  45              
Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser 
    50                  55                  60                  
Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met Thr Val Asp Gly Gln 
65                  70                  75                  80  
Met Leu Pro Ile Asp Glu Ile Phe Glu Arg Glu Leu Asp Leu Met Arg 
                85                  90                  95      
Val Asp Asn Leu Pro Asn 
            100         

<210> 5
<211> 35
<212> PRT
<213> Artificial Sequence


<220> 
<223> Amino acid sequence for Cfa DnaE C-terminus intein domain

<400> 5
Val Lys Ile Ile Ser Arg Lys Ser Leu Gly Thr Gln Asn Val Tyr Asp 
1               5                   10                  15      
Ile Gly Val Glu Lys Asp His Asn Phe Leu Leu Lys Asn Gly Leu Val 
            20                  25                  30          
Ala Ser Asn 
        35  

<210> 6
<211> 100
<212> PRT
<213> Artificial Sequence


<220> 
<223> Amino acid sequence for Cfa DnaE N-terminus intein domain

<400> 6
Cys Leu Ser Tyr Asp Thr Glu Ile Leu Thr Val Glu Tyr Gly Phe Leu 
1               5                   10                  15      
Pro Ile Gly Lys Ile Val Glu Glu Arg Ile Glu Cys Thr Val Tyr Thr 
            20                  25                  30          
Val Asp Lys Asn Gly Phe Val Tyr Thr Gln Pro Ile Ala Gln Trp His 
        35                  40                  45              
Asn Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser 
    50                  55                  60                  
Ile Arg Ala Thr Lys Asp His Lys Phe Met Thr Thr Asp Gly Gln Met 
65                  70                  75                  80  
Leu Pro Ile Asp Glu Ile Phe Glu Arg Gly Leu Asp Leu Lys Gln Val 
                85                  90                  95      
Asp Gly Leu Pro 
            100 

<210> 7
<211> 38
<212> PRT
<213> Artificial Sequence


<220> 
<223> Amino acid sequence for gp41-1 C-terminus intein domain

<400> 7
Met Met Leu Lys Lys Ile Leu Lys Ile Glu Glu Leu Asp Glu Arg Glu 
1               5                   10                  15      
Leu Ile Asp Ile Glu Val Ser Gly Asn His Leu Phe Tyr Ala Asn Asp 
            20                  25                  30          
Ile Leu Thr His Asn Ser 
        35              

<210> 8
<211> 88
<212> PRT
<213> Artificial Sequence


<220> 
<223> gp41-1 N-terminus intein domain

<400> 8
Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu 
1               5                   10                  15      
Ile Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr 
            20                  25                  30          
Asn Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys 
        35                  40                  45              
Ile Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu 
    50                  55                  60                  
Phe Pro Thr Gln Thr Gly Glu Met Asn Ile Ser Gly Gly Leu Lys Glu 
65                  70                  75                  80  
Gly Met Cys Leu Tyr Val Lys Glu 
                85              

<210> 9
<211> 826
<212> PRT
<213> Homo sapiens


<400> 9
Met Glu Gly Ala Gly Gly Ala Asn Asp Lys Lys Lys Ile Ser Ser Glu 
1               5                   10                  15      
Arg Arg Lys Glu Lys Ser Arg Asp Ala Ala Arg Ser Arg Arg Ser Lys 
            20                  25                  30          
Glu Ser Glu Val Phe Tyr Glu Leu Ala His Gln Leu Pro Leu Pro His 
        35                  40                  45              
Asn Val Ser Ser His Leu Asp Lys Ala Ser Val Met Arg Leu Thr Ile 
    50                  55                  60                  
Ser Tyr Leu Arg Val Arg Lys Leu Leu Asp Ala Gly Asp Leu Asp Ile 
65                  70                  75                  80  
Glu Asp Asp Met Lys Ala Gln Met Asn Cys Phe Tyr Leu Lys Ala Leu 
                85                  90                  95      
Asp Gly Phe Val Met Val Leu Thr Asp Asp Gly Asp Met Ile Tyr Ile 
            100                 105                 110         
Ser Asp Asn Val Asn Lys Tyr Met Gly Leu Thr Gln Phe Glu Leu Thr 
        115                 120                 125             
Gly His Ser Val Phe Asp Phe Thr His Pro Cys Asp His Glu Glu Met 
    130                 135                 140                 
Arg Glu Met Leu Thr His Arg Asn Gly Leu Val Lys Lys Gly Lys Glu 
145                 150                 155                 160 
Gln Asn Thr Gln Arg Ser Phe Phe Leu Arg Met Lys Cys Thr Leu Thr 
                165                 170                 175     
Ser Arg Gly Arg Thr Met Asn Ile Lys Ser Ala Thr Trp Lys Val Leu 
            180                 185                 190         
His Cys Thr Gly His Ile His Val Tyr Asp Thr Asn Ser Asn Gln Pro 
        195                 200                 205             
Gln Cys Gly Tyr Lys Lys Pro Pro Met Thr Cys Leu Val Leu Ile Cys 
    210                 215                 220                 
Glu Pro Ile Pro His Pro Ser Asn Ile Glu Ile Pro Leu Asp Ser Lys 
225                 230                 235                 240 
Thr Phe Leu Ser Arg His Ser Leu Asp Met Lys Phe Ser Tyr Cys Asp 
                245                 250                 255     
Glu Arg Ile Thr Glu Leu Met Gly Tyr Glu Pro Glu Glu Leu Leu Gly 
            260                 265                 270         
Arg Ser Ile Tyr Glu Tyr Tyr His Ala Leu Asp Ser Asp His Leu Thr 
        275                 280                 285             
Lys Thr His His Asp Met Phe Thr Lys Gly Gln Val Thr Thr Gly Gln 
    290                 295                 300                 
Tyr Arg Met Leu Ala Lys Arg Gly Gly Tyr Val Trp Val Glu Thr Gln 
305                 310                 315                 320 
Ala Thr Val Ile Tyr Asn Thr Lys Asn Ser Gln Pro Gln Cys Ile Val 
                325                 330                 335     
Cys Val Asn Tyr Val Val Ser Gly Ile Ile Gln His Asp Leu Ile Phe 
            340                 345                 350         
Ser Leu Gln Gln Thr Glu Cys Val Leu Lys Pro Val Glu Ser Ser Asp 
        355                 360                 365             
Met Lys Met Thr Gln Leu Phe Thr Lys Val Glu Ser Glu Asp Thr Ser 
    370                 375                 380                 
Ser Leu Phe Asp Lys Leu Lys Lys Glu Pro Asp Ala Leu Thr Leu Leu 
385                 390                 395                 400 
Ala Pro Ala Ala Gly Asp Thr Ile Ile Ser Leu Asp Phe Gly Ser Asn 
                405                 410                 415     
Asp Thr Glu Thr Asp Asp Gln Gln Leu Glu Glu Val Pro Leu Tyr Asn 
            420                 425                 430         
Asp Val Met Leu Pro Ser Pro Asn Glu Lys Leu Gln Asn Ile Asn Leu 
        435                 440                 445             
Ala Met Ser Pro Leu Pro Thr Ala Glu Thr Pro Lys Pro Leu Arg Ser 
    450                 455                 460                 
Ser Ala Asp Pro Ala Leu Asn Gln Glu Val Ala Leu Lys Leu Glu Pro 
465                 470                 475                 480 
Asn Pro Glu Ser Leu Glu Leu Ser Phe Thr Met Pro Gln Ile Gln Asp 
                485                 490                 495     
Gln Thr Pro Ser Pro Ser Asp Gly Ser Thr Arg Gln Ser Ser Pro Glu 
            500                 505                 510         
Pro Asn Ser Pro Ser Glu Tyr Cys Phe Tyr Val Asp Ser Asp Met Val 
        515                 520                 525             
Asn Glu Phe Lys Leu Glu Leu Val Glu Lys Leu Phe Ala Glu Asp Thr 
    530                 535                 540                 
Glu Ala Lys Asn Pro Phe Ser Thr Gln Asp Thr Asp Leu Asp Leu Glu 
545                 550                 555                 560 
Met Leu Ala Pro Tyr Ile Pro Met Asp Asp Asp Phe Gln Leu Arg Ser 
                565                 570                 575     
Phe Asp Gln Leu Ser Pro Leu Glu Ser Ser Ser Ala Ser Pro Glu Ser 
            580                 585                 590         
Ala Ser Pro Gln Ser Thr Val Thr Val Phe Gln Gln Thr Gln Ile Gln 
        595                 600                 605             
Glu Pro Thr Ala Asn Ala Thr Thr Thr Thr Ala Thr Thr Asp Glu Leu 
    610                 615                 620                 
Lys Thr Val Thr Lys Asp Arg Met Glu Asp Ile Lys Ile Leu Ile Ala 
625                 630                 635                 640 
Ser Pro Ser Pro Thr His Ile His Lys Glu Thr Thr Ser Ala Thr Ser 
                645                 650                 655     
Ser Pro Tyr Arg Asp Thr Gln Ser Arg Thr Ala Ser Pro Asn Arg Ala 
            660                 665                 670         
Gly Lys Gly Val Ile Glu Gln Thr Glu Lys Ser His Pro Arg Ser Pro 
        675                 680                 685             
Asn Val Leu Ser Val Ala Leu Ser Gln Arg Thr Thr Val Pro Glu Glu 
    690                 695                 700                 
Glu Leu Asn Pro Lys Ile Leu Ala Leu Gln Asn Ala Gln Arg Lys Arg 
705                 710                 715                 720 
Lys Met Glu His Asp Gly Ser Leu Phe Gln Ala Val Gly Ile Gly Thr 
                725                 730                 735     
Leu Leu Gln Gln Pro Asp Asp His Ala Ala Thr Thr Ser Leu Ser Trp 
            740                 745                 750         
Lys Arg Val Lys Gly Cys Lys Ser Ser Glu Gln Asn Gly Met Glu Gln 
        755                 760                 765             
Lys Thr Ile Ile Leu Ile Pro Ser Asp Leu Ala Cys Arg Leu Leu Gly 
    770                 775                 780                 
Gln Ser Met Asp Glu Ser Gly Leu Pro Gln Leu Thr Ser Tyr Asp Cys 
785                 790                 795                 800 
Glu Val Asn Ala Pro Ile Gln Gly Ser Arg Asn Leu Leu Gln Gly Glu 
                805                 810                 815     
Glu Leu Leu Arg Ala Leu Asp Gln Val Asn 
            820                 825     

<210> 10
<211> 56
<212> PRT
<213> Homo sapiens


<400> 10
Asn Pro Phe Ser Thr Gln Asp Thr Asp Leu Asp Leu Glu Met Leu Ala 
1               5                   10                  15      
Pro Tyr Ile Pro Met Asp Asp Asp Phe Gln Leu Arg Ser Phe Asp Gln 
            20                  25                  30          
Leu Ser Pro Leu Glu Ser Ser Ser Ala Ser Pro Glu Ser Ala Ser Pro 
        35                  40                  45              
Gln Ser Thr Val Thr Val Phe Gln 
    50                  55      

<210> 11
<211> 11
<212> PRT
<213> Artificial Sequence


<220> 
<223> SsrA tag

<400> 11
Ala Ala Asn Asp Glu Asn Tyr Ala Leu Ala Ala 
1               5                   10      

<210> 12
<211> 150
<212> PRT
<213> Artificial Sequence


<220> 
<223> Wherein XXXXXX is the extein and cyclic peptide to be produced;
      the first X is C, S, or T, and subsequent Xs are shown as ~ in
      the description, which denotes an amino acid of the cyclic
      peptide sequence.

<400> 12
His His His His His His Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu 
1               5                   10                  15      
Gly Lys Gln Asn Val Tyr Asp Ile Gly Val Glu Arg Tyr His Asn Phe 
            20                  25                  30          
Ala Leu Lys Asn Gly Phe Ile Ala Ser Asn Xaa Xaa Xaa Xaa Xaa Xaa 
        35                  40                  45              
Cys Leu Ser Tyr Asp Thr Glu Ile Leu Thr Val Glu Tyr Gly Ile Leu 
    50                  55                  60                  
Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser 
65                  70                  75                  80  
Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro Val Ala Gln Trp His 
                85                  90                  95      
Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Cys 
            100                 105                 110         
Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met Thr Val Asp Gly Gln 
        115                 120                 125             
Met Met Pro Ile Asp Glu Ile Phe Glu Arg Glu Leu Asp Leu Met Arg 
    130                 135                 140                 
Val Asp Asn Leu Pro Asn 
145                 150 

<210> 13
<211> 6
<212> PRT
<213> Artificial Sequence


<220> 
<223> Hexahistidine (6xHis) tag

<400> 13
His His His His His His 
1               5       

<210> 14
<211> 36
<212> PRT
<213> Artificial Sequence


<220> 
<223> Constituent comprising the C-terminus intein domain

<400> 14
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr 
1               5                   10                  15      
Asp Ile Gly Val Glu Arg Tyr His Asn Phe Ala Leu Lys Asn Gly Phe 
            20                  25                  30          
Ile Ala Ser Asn 
        35      

<210> 15
<211> 102
<212> PRT
<213> Artificial Sequence


<220> 
<223> Constituent comprising the N-terminus intein domain

<400> 15
Cys Leu Ser Tyr Asp Thr Glu Ile Leu Thr Val Glu Tyr Gly Ile Leu 
1               5                   10                  15      
Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser 
            20                  25                  30          
Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro Val Ala Gln Trp His 
        35                  40                  45              
Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Cys 
    50                  55                  60                  
Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met Thr Val Asp Gly Gln 
65                  70                  75                  80  
Met Met Pro Ile Asp Glu Ile Phe Glu Arg Glu Leu Asp Leu Met Arg 
                85                  90                  95      
Val Asp Asn Leu Pro Asn 
            100         

<210> 16
<211> 206
<212> PRT
<213> Artificial Sequence


<220> 
<223> Wherein XXXXXX is the extein and cyclic peptide to be produced;
      the first X is C, S, or T, and subsequent Xs are shown as ~ in
      the description, which denotes an amino acid of the cyclic
      peptide sequence.

<400> 16
His His His His His His Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu 
1               5                   10                  15      
Gly Lys Gln Asn Val Tyr Asp Ile Gly Val Glu Arg Tyr His Asn Phe 
            20                  25                  30          
Ala Leu Lys Asn Gly Phe Ile Ala Ser Asn Xaa Xaa Xaa Xaa Xaa Xaa 
        35                  40                  45              
Cys Leu Ser Tyr Asp Thr Glu Ile Leu Thr Val Glu Tyr Gly Ile Leu 
    50                  55                  60                  
Pro Ile Gly Lys Ile Val Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser 
65                  70                  75                  80  
Val Asp Asn Asn Gly Asn Ile Tyr Thr Gln Pro Val Ala Gln Trp His 
                85                  90                  95      
Asp Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Cys 
            100                 105                 110         
Leu Ile Arg Ala Thr Lys Asp His Lys Phe Met Thr Val Asp Gly Gln 
        115                 120                 125             
Met Met Pro Ile Asp Glu Ile Phe Glu Arg Glu Leu Asp Leu Met Arg 
    130                 135                 140                 
Val Asp Asn Leu Pro Asn Asn Pro Phe Ser Thr Gln Asp Thr Asp Leu 
145                 150                 155                 160 
Asp Leu Glu Met Leu Ala Pro Tyr Ile Pro Met Asp Asp Asp Phe Gln 
                165                 170                 175     
Leu Arg Ser Phe Asp Gln Leu Ser Pro Leu Glu Ser Ser Ser Ala Ser 
            180                 185                 190         
Pro Glu Ser Ala Ser Pro Gln Ser Thr Val Thr Val Phe Gln 
        195                 200                 205     

<210> 17
<211> 56
<212> PRT
<213> Artificial Sequence


<220> 
<223> Comprising amino acids 548-603 of the full length ODD domain of
      HIF-1?

<400> 17
Asn Pro Phe Ser Thr Gln Asp Thr Asp Leu Asp Leu Glu Met Leu Ala 
1               5                   10                  15      
Pro Tyr Ile Pro Met Asp Asp Asp Phe Gln Leu Arg Ser Phe Asp Gln 
            20                  25                  30          
Leu Ser Pro Leu Glu Ser Ser Ser Ala Ser Pro Glu Ser Ala Ser Pro 
        35                  40                  45              
Gln Ser Thr Val Thr Val Phe Gln 
    50                  55      

<210> 18
<211> 46
<212> PRT
<213> Artificial Sequence


<220> 
<223> C-terminus intein domain (+N-terminus 6xHis tag)

<400> 18
Met Gly His His His His His His Gly Ser Gly Val Lys Ile Ile Ser 
1               5                   10                  15      
Arg Lys Ser Leu Gly Thr Gln Asn Val Tyr Asp Ile Gly Val Gly Glu 
            20                  25                  30          
Pro His Asn Phe Leu Leu Lys Asn Gly Leu Val Ala Ser Asn 
        35                  40                  45      

<210> 19
<211> 101
<212> PRT
<213> Artificial Sequence


<220> 
<223> N-terminus intein domain

<400> 19
Cys Leu Ser Tyr Asp Thr Glu Ile Leu Thr Val Glu Tyr Gly Phe Leu 
1               5                   10                  15      
Pro Ile Gly Lys Ile Val Glu Glu Arg Ile Glu Cys Thr Val Tyr Thr 
            20                  25                  30          
Val Asp Lys Asn Gly Phe Val Tyr Thr Gln Pro Ile Ala Gln Trp His 
        35                  40                  45              
Asn Arg Gly Glu Gln Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser 
    50                  55                  60                  
Ile Ile Arg Ala Thr Lys Asp His Lys Phe Met Thr Thr Asp Gly Gln 
65                  70                  75                  80  
Met Leu Pro Ile Asp Glu Ile Phe Glu Arg Gly Leu Asp Leu Lys Gln 
                85                  90                  95      
Val Asp Gly Leu Pro 
            100     

<210> 20
<211> 280
<212> PRT
<213> Artificial Sequence


<220> 
<223> Wildtype Cfa splice junctions (CFN & AEY) incorporated

<400> 20
Cys Phe Asn Trp Ser His Pro Gln Phe Glu Lys Gly Gly Gly Ser Gly 
1               5                   10                  15      
Gly Gly Ser Gly Gly Ser Ala Trp Ser His Pro Gln Phe Glu Lys Gly 
            20                  25                  30          
Gly Ser Gly Gly Glu Phe Met Val Ser Lys Gly Glu Glu Leu Phe Thr 
        35                  40                  45              
Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His 
    50                  55                  60                  
Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys 
65                  70                  75                  80  
Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp 
                85                  90                  95      
Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg 
            100                 105                 110         
Tyr Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro 
        115                 120                 125             
Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn 
    130                 135                 140                 
Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn 
145                 150                 155                 160 
Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu 
                165                 170                 175     
Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met 
            180                 185                 190         
Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His 
        195                 200                 205             
Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn 
    210                 215                 220                 
Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu 
225                 230                 235                 240 
Ser Thr Gln Ser Lys Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His 
                245                 250                 255     
Met Val Leu Lys Glu Arg Val Thr Ala Ala Gly Ile Thr Leu Gly Met 
            260                 265                 270         
Asp Glu Leu Tyr Lys Ala Glu Tyr 
        275                 280 

<210> 21
<211> 56
<212> PRT
<213> Artificial Sequence


<220> 
<223> oxygen dependent degradation (ODD) domain from HIF-1?, comprising
      the key residue P564 for hydroxylation, incorporated C-terminus
      to N-terminus intein domain

<400> 21
Asn Pro Phe Ser Thr Gln Asp Thr Asp Leu Asp Leu Glu Met Leu Ala 
1               5                   10                  15      
Pro Tyr Ile Pro Met Asp Asp Asp Phe Gln Leu Arg Ser Phe Asp Gln 
            20                  25                  30          
Leu Ser Pro Leu Glu Ser Ser Ser Ala Ser Pro Glu Ser Ala Ser Pro 
        35                  40                  45              
Gln Ser Thr Val Thr Val Phe Gln 
    50                  55      

<210> 22
<211> 246
<212> PRT
<213> Artificial Sequence


<220> 
<223> fluorescent protein mCherry fused C-terminal to ODD domain,
      followed by a FLAG tag

<400> 22
Met Val Ser Lys Gly Glu Glu Asp Asn Met Ala Ile Ile Lys Glu Phe 
1               5                   10                  15      
Met Arg Phe Lys Val His Met Glu Gly Ser Val Asn Gly His Glu Phe 
            20                  25                  30          
Glu Ile Glu Gly Glu Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr 
        35                  40                  45              
Ala Lys Leu Lys Val Thr Lys Gly Gly Pro Leu Pro Phe Ala Trp Asp 
    50                  55                  60                  
Ile Leu Ser Pro Gln Phe Met Tyr Gly Ser Lys Ala Tyr Val Lys His 
65                  70                  75                  80  
Pro Ala Asp Ile Pro Asp Tyr Leu Lys Leu Ser Phe Pro Glu Gly Phe 
                85                  90                  95      
Lys Trp Glu Arg Val Met Asn Phe Glu Asp Gly Gly Val Val Thr Val 
            100                 105                 110         
Thr Gln Asp Ser Ser Leu Gln Asp Gly Glu Phe Ile Tyr Lys Val Lys 
        115                 120                 125             
Leu Arg Gly Thr Asn Phe Pro Ser Asp Gly Pro Val Met Gln Lys Lys 
    130                 135                 140                 
Thr Met Gly Trp Glu Ala Ser Ser Glu Arg Met Tyr Pro Glu Asp Gly 
145                 150                 155                 160 
Ala Leu Lys Gly Glu Ile Lys Gln Arg Leu Lys Leu Lys Asp Gly Gly 
                165                 170                 175     
His Tyr Asp Ala Glu Val Lys Thr Thr Tyr Lys Ala Lys Lys Pro Val 
            180                 185                 190         
Gln Leu Pro Gly Ala Tyr Asn Val Asn Ile Lys Leu Asp Ile Thr Ser 
        195                 200                 205             
His Asn Glu Asp Tyr Thr Ile Val Glu Gln Tyr Glu Arg Ala Glu Gly 
    210                 215                 220                 
Arg His Ser Thr Gly Gly Met Asp Glu Leu Tyr Lys Thr Gly Asp Tyr 
225                 230                 235                 240 
Lys Asp Asp Asp Asp Lys 
                245     

