                         SEQUENCE LISTING

<110>  Helmholtz Zentrum Muenchen GmbH
       Klinikum rechts der Isar der Technischen Universitaet Muenchen
 
<120>  METHOD FOR DETECTING A SPECIFIC SPLICE EVENT OF A GENE OF 
       INTEREST

<130>  HEL16551PCT

<150>  LU101118
<151>  2019-02-06

<160>  65    

<170>  PatentIn version 3.5

<210>  1
<211>  105
<212>  PRT
<213>  Artificial

<220>
<223>  N-terminal splicing region of NrdJ-1

<400>  1

Cys Leu Val Gly Ser Ser Glu Ile Ile Thr Arg Asn Tyr Gly Lys Thr 
1               5                   10                  15      


Thr Ile Lys Glu Val Val Glu Ile Phe Asp Asn Asp Lys Asn Ile Gln 
            20                  25                  30          


Val Leu Ala Phe Asn Thr His Thr Asp Asn Ile Glu Trp Ala Pro Ile 
        35                  40                  45              


Lys Ala Ala Gln Leu Thr Arg Pro Asn Ala Glu Leu Val Glu Leu Glu 
    50                  55                  60                  


Ile Asp Thr Leu His Gly Val Lys Thr Ile Arg Cys Thr Pro Asp His 
65                  70                  75                  80  


Pro Val Tyr Thr Lys Asn Arg Gly Tyr Val Arg Ala Asp Glu Leu Thr 
                85                  90                  95      


Asp Asp Asp Glu Leu Val Val Ala Ile 
            100                 105 


<210>  2
<211>  88
<212>  PRT
<213>  Artificial

<220>
<223>  N-terminal splicing region of gp41-1

<400>  2

Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu 
1               5                   10                  15      


Ile Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr 
            20                  25                  30          


Asn Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys 
        35                  40                  45              


Ile Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu 
    50                  55                  60                  


Phe Pro Thr Gln Thr Gly Glu Met Asn Ile Ser Gly Gly Leu Lys Glu 
65                  70                  75                  80  


Gly Met Cys Leu Tyr Val Lys Glu 
                85              


<210>  3
<211>  40
<212>  PRT
<213>  Artificial

<220>
<223>  C-terminal splicing region of NrdJ-1

<400>  3

Met Glu Ala Lys Thr Tyr Ile Gly Lys Leu Lys Ser Arg Lys Ile Val 
1               5                   10                  15      


Ser Asn Glu Asp Thr Tyr Asp Ile Gln Thr Ser Thr His Asn Phe Phe 
            20                  25                  30          


Ala Asn Asp Ile Leu Val His Asn 
        35                  40  


<210>  4
<211>  37
<212>  PRT
<213>  Artificial

<220>
<223>  C-terminal splicing region of gp41-1

<400>  4

Met Met Leu Lys Lys Ile Leu Lys Ile Glu Glu Leu Asp Glu Arg Glu 
1               5                   10                  15      


Leu Ile Asp Ile Glu Val Ser Gly Asn His Leu Phe Tyr Ala Asn Asp 
            20                  25                  30          


Ile Leu Thr His Asn 
        35          


<210>  5
<211>  808
<212>  PRT
<213>  Artificial

<220>
<223>  split intein - heterologous polynucleotide construct

<400>  5

Cys Leu Val Gly Ser Ser Glu Ile Ile Thr Arg Asn Tyr Gly Lys Thr 
1               5                   10                  15      


Thr Ile Lys Glu Val Val Glu Ile Phe Asp Asn Asp Lys Asn Ile Gln 
            20                  25                  30          


Val Leu Ala Phe Asn Thr His Thr Asp Asn Ile Glu Trp Ala Pro Ile 
        35                  40                  45              


Lys Ala Ala Gln Leu Thr Arg Pro Asn Ala Glu Leu Val Glu Leu Glu 
    50                  55                  60                  


Ile Asp Thr Leu His Gly Val Lys Thr Ile Arg Cys Thr Pro Asp His 
65                  70                  75                  80  


Pro Val Tyr Thr Lys Asn Arg Gly Tyr Val Arg Ala Asp Glu Leu Thr 
                85                  90                  95      


Asp Asp Asp Glu Leu Val Val Ala Ile Gly Gly Gly Gly Pro Glu Asp 
            100                 105                 110         


Glu Leu Ala Ala Asn Glu Glu Glu Leu Gln Gln Asn Glu Gln Lys Leu 
        115                 120                 125             


Ala Gln Ile Lys Gln Lys Leu Gln Ala Ile Lys Tyr Gly Gly Ser Gly 
    130                 135                 140                 


Gly Gly Gly Ser Gly Thr Gly Met Glu Asp Ala Lys Asn Ile Lys Lys 
145                 150                 155                 160 


Gly Pro Ala Pro Arg Tyr Pro Leu Glu Asp Gly Thr Ala Gly Glu Gln 
                165                 170                 175     


Leu His Lys Ala Met Lys Arg Tyr Ala Gln Val Pro Gly Thr Ile Ala 
            180                 185                 190         


Phe Thr Asp Ala His Ile Glu Val Asn Ile Thr Tyr Ala Glu Tyr Phe 
        195                 200                 205             


Glu Met Ser Val Arg Leu Ala Glu Ala Met Lys Arg Tyr Gly Leu Asn 
    210                 215                 220                 


Thr Asn His Arg Ile Val Val Cys Ser Glu Asn Ser Leu Gln Phe Phe 
225                 230                 235                 240 


Met Pro Val Leu Gly Ala Leu Phe Ile Gly Val Ala Val Ala Pro Ala 
                245                 250                 255     


Asn Asp Ile Tyr Asn Glu Arg Glu Leu Leu Asn Ser Met Asn Ile Ser 
            260                 265                 270         


Gln Pro Thr Val Val Phe Val Ser Lys Lys Gly Leu Gln Lys Ile Leu 
        275                 280                 285             


Asn Val Gln Lys Lys Leu Pro Ile Ile Gln Lys Ile Ile Ile Met Asp 
    290                 295                 300                 


Ser Lys Thr Asp Tyr Gln Gly Phe Gln Ser Met Tyr Thr Phe Val Thr 
305                 310                 315                 320 


Ser His Leu Pro Pro Gly Phe Asn Glu Tyr Asp Phe Lys Pro Glu Ser 
                325                 330                 335     


Phe Asp Arg Asp Lys Thr Ile Ala Leu Ile Met Asn Ser Ser Gly Ser 
            340                 345                 350         


Thr Gly Leu Pro Lys Gly Val Ala Leu Pro His Arg Thr Ala Cys Val 
        355                 360                 365             


Arg Phe Ser His Ala Arg Asp Pro Ile Phe Gly Asn Gln Ile Lys Pro 
    370                 375                 380                 


Asp Thr Ala Ile Leu Ser Val Val Pro Phe His His Gly Phe Gly Met 
385                 390                 395                 400 


Phe Thr Thr Leu Gly Tyr Leu Ile Cys Gly Phe Arg Val Val Leu Met 
                405                 410                 415     


Tyr Arg Phe Glu Glu Glu Leu Phe Leu Arg Ser Leu Gln Asp Tyr Lys 
            420                 425                 430         


Ile Gln Ser Ala Leu Leu Val Pro Thr Leu Phe Ser Phe Phe Ala Lys 
        435                 440                 445             


Ser Thr Leu Ile Asp Lys Tyr Asp Leu Ser Asn Leu His Glu Ile Ala 
    450                 455                 460                 


Ser Gly Gly Ala Pro Leu Ser Lys Glu Val Gly Glu Ala Val Ala Lys 
465                 470                 475                 480 


Arg Phe His Leu Pro Gly Ile Arg Gln Gly Tyr Gly Leu Thr Glu Thr 
                485                 490                 495     


Thr Ser Ala Ile Leu Ile Thr Pro Glu Gly Asp Asp Lys Pro Gly Ala 
            500                 505                 510         


Val Gly Lys Val Val Pro Phe Phe Glu Ala Lys Val Val Asp Leu Asp 
        515                 520                 525             


Thr Gly Lys Thr Leu Gly Val Asn Gln Arg Gly Glu Leu Cys Val Arg 
    530                 535                 540                 


Gly Pro Met Ile Met Ser Gly Tyr Val Asn Asn Pro Glu Ala Thr Asn 
545                 550                 555                 560 


Ala Leu Ile Asp Lys Asp Gly Trp Leu His Ser Gly Asp Ile Ala Tyr 
                565                 570                 575     


Trp Asp Glu Asp Glu His Phe Phe Ile Val Asp Arg Leu Lys Ser Leu 
            580                 585                 590         


Ile Lys Tyr Lys Gly Tyr Gln Val Ala Pro Ala Glu Leu Glu Ser Ile 
        595                 600                 605             


Leu Leu Gln His Pro Asn Ile Arg Asp Ala Gly Val Ala Gly Leu Pro 
    610                 615                 620                 


Asp Asp Asp Ala Gly Glu Leu Pro Ala Ala Val Val Val Leu Glu His 
625                 630                 635                 640 


Gly Lys Thr Met Thr Glu Lys Glu Ile Val Asp Tyr Val Ala Ser Gln 
                645                 650                 655     


Val Thr Thr Ala Lys Lys Leu Arg Gly Gly Val Val Phe Val Asp Glu 
            660                 665                 670         


Val Pro Lys Gly Leu Thr Gly Lys Leu Asp Ala Arg Lys Ile Arg Glu 
        675                 680                 685             


Ile Leu Ile Lys Ala Lys Lys Gly Gly Lys Ile Ala Val Gly Gly Ser 
    690                 695                 700                 


Gly Gly Asp Tyr Lys Asp Asp Asp Asp Lys Gly Ser Pro Gly Ile Thr 
705                 710                 715                 720 


Ser Tyr Ser Thr His Tyr Thr Lys Leu Ser Gly Gly Ser Pro Glu Asp 
                725                 730                 735     


Glu Ile Gln Gln Leu Glu Glu Glu Ile Ala Gln Leu Glu Gln Lys Asn 
            740                 745                 750         


Ala Ala Leu Lys Glu Lys Asn Gln Ala Leu Lys Tyr Gly Gly Gly Gly 
        755                 760                 765             


Met Glu Ala Lys Thr Tyr Ile Gly Lys Leu Lys Ser Arg Lys Ile Val 
    770                 775                 780                 


Ser Asn Glu Asp Thr Tyr Asp Ile Gln Thr Ser Thr His Asn Phe Phe 
785                 790                 795                 800 


Ala Asn Asp Ile Leu Val His Asn 
                805             


<210>  6
<211>  755
<212>  PRT
<213>  Artificial

<220>
<223>  split intein - heterologous polynucleotide construct

<400>  6

Cys Leu Val Gly Ser Ser Glu Ile Ile Thr Arg Asn Tyr Gly Lys Thr 
1               5                   10                  15      


Thr Ile Lys Glu Val Val Glu Ile Phe Asp Asn Asp Lys Asn Ile Gln 
            20                  25                  30          


Val Leu Ala Phe Asn Thr His Thr Asp Asn Ile Glu Trp Ala Pro Ile 
        35                  40                  45              


Lys Ala Ala Gln Leu Thr Arg Pro Asn Ala Glu Leu Val Glu Leu Glu 
    50                  55                  60                  


Ile Asp Thr Leu His Gly Val Lys Thr Ile Arg Cys Thr Pro Asp His 
65                  70                  75                  80  


Pro Val Tyr Thr Lys Asn Arg Gly Tyr Val Arg Ala Asp Glu Leu Thr 
                85                  90                  95      


Asp Asp Asp Glu Leu Val Val Ala Ile Gly Gly Gly Gly Pro Glu Asp 
            100                 105                 110         


Glu Leu Ala Ala Asn Glu Glu Glu Leu Gln Gln Asn Glu Gln Lys Leu 
        115                 120                 125             


Ala Gln Ile Lys Gln Lys Leu Gln Ala Ile Lys Tyr Gly Gly Ser Gly 
    130                 135                 140                 


Gly Gly Gly Ser Gly Thr Gly Met Glu Asp Ala Lys Asn Ile Lys Lys 
145                 150                 155                 160 


Gly Pro Ala Pro Arg Tyr Pro Leu Glu Asp Gly Thr Ala Gly Glu Gln 
                165                 170                 175     


Leu His Lys Ala Met Lys Arg Tyr Ala Gln Val Pro Gly Thr Ile Ala 
            180                 185                 190         


Phe Thr Asp Ala His Ile Glu Val Asn Ile Thr Tyr Ala Glu Tyr Phe 
        195                 200                 205             


Glu Met Ser Val Arg Leu Ala Glu Ala Met Lys Arg Tyr Gly Leu Asn 
    210                 215                 220                 


Thr Asn His Arg Ile Val Val Cys Ser Glu Asn Ser Leu Gln Phe Phe 
225                 230                 235                 240 


Met Pro Val Leu Gly Ala Leu Phe Ile Gly Val Ala Val Ala Pro Ala 
                245                 250                 255     


Asn Asp Ile Tyr Asn Glu Arg Glu Leu Leu Asn Ser Met Asn Ile Ser 
            260                 265                 270         


Gln Pro Thr Val Val Phe Val Ser Lys Lys Gly Leu Gln Lys Ile Leu 
        275                 280                 285             


Asn Val Gln Lys Lys Leu Pro Ile Ile Gln Lys Ile Ile Ile Met Asp 
    290                 295                 300                 


Ser Lys Thr Asp Tyr Gln Gly Phe Gln Ser Met Tyr Thr Phe Val Thr 
305                 310                 315                 320 


Ser His Leu Pro Pro Gly Phe Asn Glu Tyr Asp Phe Lys Pro Glu Ser 
                325                 330                 335     


Phe Asp Arg Asp Lys Thr Ile Ala Leu Ile Met Asn Ser Ser Gly Ser 
            340                 345                 350         


Thr Gly Leu Pro Lys Gly Val Ala Leu Pro His Arg Thr Ala Cys Val 
        355                 360                 365             


Arg Phe Ser His Ala Arg Asp Pro Ile Phe Gly Asn Gln Ile Lys Pro 
    370                 375                 380                 


Asp Thr Ala Ile Leu Ser Val Val Pro Phe His His Gly Phe Gly Met 
385                 390                 395                 400 


Phe Thr Thr Leu Gly Tyr Leu Ile Cys Gly Phe Arg Val Val Leu Met 
                405                 410                 415     


Tyr Arg Phe Glu Glu Glu Leu Phe Leu Arg Ser Leu Gln Asp Tyr Lys 
            420                 425                 430         


Ile Gln Ser Ala Leu Leu Val Pro Thr Leu Phe Ser Phe Phe Ala Lys 
        435                 440                 445             


Ser Thr Leu Ile Asp Lys Tyr Asp Leu Ser Asn Leu His Glu Ile Ala 
    450                 455                 460                 


Ser Gly Gly Ala Pro Leu Ser Lys Glu Val Gly Glu Ala Val Ala Lys 
465                 470                 475                 480 


Arg Phe His Leu Pro Gly Ile Arg Gln Gly Tyr Gly Leu Thr Glu Thr 
                485                 490                 495     


Thr Ser Ala Ile Leu Ile Thr Pro Glu Gly Asp Asp Lys Pro Gly Ala 
            500                 505                 510         


Val Gly Lys Val Val Pro Phe Phe Glu Ala Lys Val Val Asp Leu Asp 
        515                 520                 525             


Thr Gly Lys Thr Leu Gly Val Asn Gln Arg Gly Glu Leu Cys Val Arg 
    530                 535                 540                 


Gly Pro Met Ile Met Ser Gly Tyr Val Asn Asn Pro Glu Ala Thr Asn 
545                 550                 555                 560 


Ala Leu Ile Asp Lys Asp Gly Trp Leu His Ser Gly Asp Ile Ala Tyr 
                565                 570                 575     


Trp Asp Glu Asp Glu His Phe Phe Ile Val Asp Arg Leu Lys Ser Leu 
            580                 585                 590         


Ile Lys Tyr Lys Gly Tyr Gln Val Ala Pro Ala Glu Leu Glu Ser Ile 
        595                 600                 605             


Leu Leu Gln His Pro Asn Ile Arg Asp Ala Gly Val Ala Gly Leu Pro 
    610                 615                 620                 


Asp Asp Asp Ala Gly Glu Leu Pro Ala Ala Val Val Val Leu Glu His 
625                 630                 635                 640 


Gly Lys Thr Met Thr Glu Lys Glu Ile Val Asp Tyr Val Ala Ser Gln 
                645                 650                 655     


Val Thr Thr Ala Lys Lys Leu Arg Gly Gly Val Val Phe Val Asp Glu 
            660                 665                 670         


Val Pro Lys Gly Leu Thr Gly Lys Leu Asp Ala Arg Lys Ile Arg Glu 
        675                 680                 685             


Ile Leu Ile Lys Ala Lys Lys Gly Gly Lys Ile Ala Val Gly Gly Ser 
    690                 695                 700                 


Gly Gly Asp Tyr Lys Asp Asp Asp Asp Lys Gly Ser Pro Gly Ile Thr 
705                 710                 715                 720 


Ser Tyr Ser Thr His Tyr Thr Lys Leu Ser Gly Gln Val Ser Ile Leu 
                725                 730                 735     


Phe Thr Ala Gln Leu Asn Glu Thr Asp Arg Asn Trp Ser Cys Arg Asn 
            740                 745                 750         


Arg Val Gly 
        755 


<210>  7
<211>  414
<212>  PRT
<213>  Artificial

<220>
<223>  split intein - heterologous polynucleotide construct

<400>  7

Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu 
1               5                   10                  15      


Ile Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr 
            20                  25                  30          


Asn Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys 
        35                  40                  45              


Ile Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu 
    50                  55                  60                  


Phe Pro Thr Gln Thr Gly Glu Met Asn Ile Ser Gly Gly Leu Lys Glu 
65                  70                  75                  80  


Gly Met Cys Leu Tyr Val Lys Glu Gly Gly Gly Gly Pro Glu Asp Lys 
                85                  90                  95      


Leu Gln Ala Ile Lys Tyr Glu Leu Ala Gln Asn Glu Glu Glu Leu Ala 
            100                 105                 110         


Gln Ile Glu Glu Lys Leu Ala Ala Asn Lys Glu Gly Gly Ser Gly Gly 
        115                 120                 125             


Gly Gly Ser Gly Thr Gly Phe Ala Asn Glu Leu Gly Pro Arg Leu Met 
    130                 135                 140                 


Gly Lys Gly Ser Gly Gly Gly Gly Ser Gly Val Phe Thr Leu Glu Asp 
145                 150                 155                 160 


Phe Val Gly Asp Trp Arg Gln Thr Ala Gly Tyr Asn Leu Asp Gln Val 
                165                 170                 175     


Leu Glu Gln Gly Gly Val Ser Ser Leu Phe Gln Asn Leu Gly Val Ser 
            180                 185                 190         


Val Thr Pro Ile Gln Arg Ile Val Leu Ser Gly Glu Asn Gly Leu Lys 
        195                 200                 205             


Ile Asp Ile His Val Ile Ile Pro Tyr Glu Gly Leu Ser Gly Asp Gln 
    210                 215                 220                 


Met Gly Gln Ile Glu Lys Ile Phe Lys Val Val Tyr Pro Val Asp Asp 
225                 230                 235                 240 


His His Phe Lys Val Ile Leu His Tyr Gly Thr Leu Val Ile Asp Gly 
                245                 250                 255     


Val Thr Pro Asn Met Ile Asp Tyr Phe Gly Arg Pro Tyr Glu Gly Ile 
            260                 265                 270         


Ala Val Phe Asp Gly Lys Lys Ile Thr Val Thr Gly Thr Leu Trp Asn 
        275                 280                 285             


Gly Asn Lys Ile Ile Asp Glu Arg Leu Ile Asn Pro Asp Gly Ser Leu 
    290                 295                 300                 


Leu Phe Arg Val Thr Ile Asn Gly Val Thr Gly Trp Arg Leu Cys Glu 
305                 310                 315                 320 


Arg Ile Leu Ala Gly Ser Gly Gly Ser Ser Tyr Thr Ser Asn Arg Ile 
                325                 330                 335     


Gly Thr Ser Gly Gly Ser Pro Glu Asp Glu Asn Ala Ala Leu Glu Glu 
            340                 345                 350         


Lys Ile Ala Gln Leu Lys Gln Lys Asn Ala Ala Leu Lys Glu Glu Ile 
        355                 360                 365             


Gln Ala Leu Glu Tyr Gly Gly Gly Gly Met Met Leu Lys Lys Ile Leu 
    370                 375                 380                 


Lys Ile Glu Glu Leu Asp Glu Arg Glu Leu Ile Asp Ile Glu Val Ser 
385                 390                 395                 400 


Gly Asn His Leu Phe Tyr Ala Asn Asp Ile Leu Thr His Asn 
                405                 410                 


<210>  8
<211>  380
<212>  PRT
<213>  Artificial

<220>
<223>  split intein - heterologous polynucleotide construct

<400>  8

Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu 
1               5                   10                  15      


Ile Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr 
            20                  25                  30          


Asn Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys 
        35                  40                  45              


Ile Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu 
    50                  55                  60                  


Phe Pro Thr Gln Thr Gly Glu Met Asn Ile Ser Gly Gly Leu Lys Glu 
65                  70                  75                  80  


Gly Met Cys Leu Tyr Val Lys Glu Gly Gly Gly Gly Pro Glu Asp Lys 
                85                  90                  95      


Leu Gln Ala Ile Lys Tyr Glu Leu Ala Gln Asn Glu Glu Glu Leu Ala 
            100                 105                 110         


Gln Ile Glu Glu Lys Leu Ala Ala Asn Lys Glu Gly Gly Ser Gly Gly 
        115                 120                 125             


Gly Gly Ser Gly Thr Gly Phe Ala Asn Glu Leu Gly Pro Arg Leu Met 
    130                 135                 140                 


Gly Lys Gly Ser Gly Gly Gly Gly Ser Gly Val Phe Thr Leu Glu Asp 
145                 150                 155                 160 


Phe Val Gly Asp Trp Arg Gln Thr Ala Gly Tyr Asn Leu Asp Gln Val 
                165                 170                 175     


Leu Glu Gln Gly Gly Val Ser Ser Leu Phe Gln Asn Leu Gly Val Ser 
            180                 185                 190         


Val Thr Pro Ile Gln Arg Ile Val Leu Ser Gly Glu Asn Gly Leu Lys 
        195                 200                 205             


Ile Asp Ile His Val Ile Ile Pro Tyr Glu Gly Leu Ser Gly Asp Gln 
    210                 215                 220                 


Met Gly Gln Ile Glu Lys Ile Phe Lys Val Val Tyr Pro Val Asp Asp 
225                 230                 235                 240 


His His Phe Lys Val Ile Leu His Tyr Gly Thr Leu Val Ile Asp Gly 
                245                 250                 255     


Val Thr Pro Asn Met Ile Asp Tyr Phe Gly Arg Pro Tyr Glu Gly Ile 
            260                 265                 270         


Ala Val Phe Asp Gly Lys Lys Ile Thr Val Thr Gly Thr Leu Trp Asn 
        275                 280                 285             


Gly Asn Lys Ile Ile Asp Glu Arg Leu Ile Asn Pro Asp Gly Ser Leu 
    290                 295                 300                 


Leu Phe Arg Val Thr Ile Asn Gly Val Thr Gly Trp Arg Leu Cys Glu 
305                 310                 315                 320 


Arg Ile Leu Ala Gly Ser Gly Gly Ser Ser Tyr Thr Ser Asn Arg Ile 
                325                 330                 335     


Gly Thr Ser Asn Trp Leu Arg Cys Pro Ser Val Gly Arg Ala His Ile 
            340                 345                 350         


Ala His Ser Pro Arg Glu Val Gly Gly Arg Gly Arg Gln Leu Asn Arg 
        355                 360                 365             


Cys Leu Glu Lys Val Ala Arg Gly Lys Leu Gly Lys 
    370                 375                 380 


<210>  9
<211>  363
<212>  PRT
<213>  Artificial

<220>
<223>  split intein - heterologous polynucleotide construct

<400>  9

Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu 
1               5                   10                  15      


Ile Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr 
            20                  25                  30          


Asn Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys 
        35                  40                  45              


Ile Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu 
    50                  55                  60                  


Phe Pro Thr Gln Thr Gly Glu Met Asn Ile Ser Gly Gly Leu Lys Glu 
65                  70                  75                  80  


Gly Met Cys Leu Tyr Val Lys Glu Gly Gly Gly Gly Pro Glu Asp Lys 
                85                  90                  95      


Leu Gln Ala Ile Lys Tyr Glu Leu Ala Gln Asn Glu Glu Glu Leu Ala 
            100                 105                 110         


Gln Ile Glu Glu Lys Leu Ala Ala Asn Lys Glu Gly Gly Ser Gly Gly 
        115                 120                 125             


Gly Gly Ser Gly Thr Gly Phe Ala Asn Glu Leu Gly Pro Arg Leu Met 
    130                 135                 140                 


Gly Lys Gly Ser Gly Gly Gly Gly Ser Gly Val Phe Thr Leu Glu Asp 
145                 150                 155                 160 


Phe Val Gly Asp Trp Arg Gln Thr Ala Gly Tyr Asn Leu Asp Gln Val 
                165                 170                 175     


Leu Glu Gln Gly Gly Val Ser Ser Leu Phe Gln Asn Leu Gly Val Ser 
            180                 185                 190         


Val Thr Pro Ile Gln Arg Ile Val Leu Ser Gly Glu Asn Gly Leu Lys 
        195                 200                 205             


Ile Asp Ile His Val Ile Ile Pro Tyr Glu Gly Leu Ser Gly Asp Gln 
    210                 215                 220                 


Met Gly Gln Ile Glu Lys Ile Phe Lys Val Val Tyr Pro Val Asp Asp 
225                 230                 235                 240 


His His Phe Lys Val Ile Leu His Tyr Gly Thr Leu Val Ile Asp Gly 
                245                 250                 255     


Val Thr Pro Asn Met Ile Asp Tyr Phe Gly Arg Pro Tyr Glu Gly Ile 
            260                 265                 270         


Ala Val Phe Asp Gly Lys Lys Ile Thr Val Thr Gly Thr Leu Trp Asn 
        275                 280                 285             


Gly Asn Lys Ile Ile Asp Glu Arg Leu Ile Asn Pro Asp Gly Ser Leu 
    290                 295                 300                 


Leu Phe Arg Val Thr Ile Asn Gly Val Thr Gly Trp Arg Leu Cys Glu 
305                 310                 315                 320 


Arg Ile Leu Ala Gly Ser Gly Gly Ser Ser Tyr Thr Ser Asn Arg Ile 
                325                 330                 335     


Gly Thr Ser Gln Val Ser Ile Leu Phe Thr Ala Gln Leu Asn Glu Thr 
            340                 345                 350         


Asp Arg Asn Trp Ser Cys Arg Asn Arg Val Gly 
        355                 360             


<210>  10
<211>  331
<212>  PRT
<213>  Artificial

<220>
<223>  split intein - heterologous polynucleotide construct

<400>  10

Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu 
1               5                   10                  15      


Ile Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr 
            20                  25                  30          


Asn Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys 
        35                  40                  45              


Ile Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu 
    50                  55                  60                  


Phe Pro Thr Gln Thr Gly Glu Met Asn Ile Ser Gly Gly Leu Lys Glu 
65                  70                  75                  80  


Gly Met Cys Leu Tyr Val Lys Glu Gly Gly Gly Gly Pro Glu Asp Lys 
                85                  90                  95      


Leu Gln Ala Ile Lys Tyr Glu Leu Ala Gln Asn Glu Glu Glu Leu Ala 
            100                 105                 110         


Gln Ile Glu Glu Lys Leu Ala Ala Asn Lys Glu Gly Gly Ser Gly Gly 
        115                 120                 125             


Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Lys Pro 
    130                 135                 140                 


Leu Ser Gln Glu Glu Ser Thr Leu Ile Glu Arg Ala Thr Ala Thr Ile 
145                 150                 155                 160 


Asn Ser Ile Pro Ile Ser Glu Asp Tyr Ser Val Ala Ser Ala Ala Leu 
                165                 170                 175     


Ser Ser Asp Gly Arg Ile Phe Thr Gly Val Asn Val Tyr His Phe Thr 
            180                 185                 190         


Gly Gly Pro Cys Ala Glu Leu Val Val Leu Gly Thr Ala Ala Ala Ala 
        195                 200                 205             


Ala Ala Gly Asn Leu Thr Cys Ile Val Ala Ile Gly Asn Glu Asn Arg 
    210                 215                 220                 


Gly Ile Leu Ser Pro Cys Gly Arg Cys Arg Gln Val Leu Leu Asp Leu 
225                 230                 235                 240 


His Pro Gly Ile Lys Ala Ile Val Lys Asp Ser Asp Gly Gln Pro Thr 
                245                 250                 255     


Ala Val Gly Ile Arg Glu Leu Leu Pro Ser Gly Tyr Val Trp Glu Gly 
            260                 265                 270         


Gly Gly Gly Gly Ser Gly Thr Gly Phe Ala Asn Glu Leu Gly Pro Arg 
        275                 280                 285             


Leu Met Gly Lys Gly Ser Gly Gly Ser Ser Tyr Thr Ser Asn Arg Ile 
    290                 295                 300                 


Gly Thr Ser Gln Val Ser Ile Leu Phe Thr Ala Gln Leu Asn Glu Thr 
305                 310                 315                 320 


Asp Arg Asn Trp Ser Cys Arg Asn Arg Val Gly 
                325                 330     


<210>  11
<211>  652
<212>  PRT
<213>  Artificial

<220>
<223>  split intein - heterologous polynucleotide construct

<400>  11

Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu 
1               5                   10                  15      


Ile Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr 
            20                  25                  30          


Asn Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys 
        35                  40                  45              


Ile Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu 
    50                  55                  60                  


Phe Pro Thr Gln Thr Gly Glu Met Asn Ile Ser Gly Gly Leu Lys Glu 
65                  70                  75                  80  


Gly Met Cys Leu Tyr Val Lys Glu Gly Gly Gly Gly Pro Glu Asp Lys 
                85                  90                  95      


Leu Gln Ala Ile Lys Tyr Glu Leu Ala Gln Asn Glu Glu Glu Leu Ala 
            100                 105                 110         


Gln Ile Glu Glu Lys Leu Ala Ala Asn Lys Glu Gly Gly Ser Gly Gly 
        115                 120                 125             


Gly Gly Ser Gly Thr Gly Phe Ala Asn Glu Leu Gly Pro Arg Leu Met 
    130                 135                 140                 


Gly Lys Gly Ser Gly Gly Gly Gly Ser Gly Pro Pro Arg Lys Arg Cys 
145                 150                 155                 160 


Cys Cys Ala Arg Arg Gly Thr Gln Leu Met Leu Val Gly Leu Leu Ser 
                165                 170                 175     


Thr Ala Met Trp Ala Gly Leu Leu Ala Leu Leu Leu Leu Trp His Trp 
            180                 185                 190         


Glu Thr Glu Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Glu Ile Gly 
        195                 200                 205             


Thr Gly Phe Pro Phe Asp Pro His Tyr Val Glu Val Leu Gly Glu Arg 
    210                 215                 220                 


Met His Tyr Val Asp Val Gly Pro Arg Asp Gly Thr Pro Val Leu Phe 
225                 230                 235                 240 


Leu His Gly Asn Pro Thr Ser Ser Tyr Val Trp Arg Asn Ile Ile Pro 
                245                 250                 255     


His Val Ala Pro Thr His Arg Val Ile Ala Pro Asp Leu Ile Gly Met 
            260                 265                 270         


Gly Lys Ser Asp Lys Pro Asp Leu Gly Tyr Phe Phe Asp Asp His Val 
        275                 280                 285             


Arg Phe Met Asp Ala Phe Ile Glu Ala Leu Gly Leu Glu Glu Val Val 
    290                 295                 300                 


Leu Val Ile His Asp Trp Gly Ser Ala Leu Gly Phe His Trp Ala Lys 
305                 310                 315                 320 


Arg Asn Pro Glu Arg Val Lys Gly Ile Ala Phe Met Glu Phe Ile Arg 
                325                 330                 335     


Pro Ile Pro Thr Trp Asp Glu Trp Pro Glu Phe Ala Arg Glu Thr Phe 
            340                 345                 350         


Gln Ala Phe Arg Thr Thr Asp Val Gly Arg Lys Leu Ile Ile Asp Gln 
        355                 360                 365             


Asn Val Phe Ile Glu Gly Thr Leu Pro Met Gly Val Val Arg Pro Leu 
    370                 375                 380                 


Thr Glu Val Glu Met Asp His Tyr Arg Glu Pro Phe Leu Asn Pro Val 
385                 390                 395                 400 


Asp Arg Glu Pro Leu Trp Arg Phe Pro Asn Glu Leu Pro Ile Ala Gly 
                405                 410                 415     


Glu Pro Ala Asn Ile Val Ala Leu Val Glu Glu Tyr Met Asp Trp Leu 
            420                 425                 430         


His Gln Ser Pro Val Pro Lys Leu Leu Phe Trp Gly Thr Pro Gly Val 
        435                 440                 445             


Leu Ile Pro Pro Ala Glu Ala Ala Arg Leu Ala Lys Ser Leu Pro Asn 
    450                 455                 460                 


Ala Lys Ala Val Asp Ile Gly Pro Gly Leu Asn Leu Leu Gln Glu Asp 
465                 470                 475                 480 


Asn Pro Asp Leu Ile Gly Ser Glu Ile Ala Arg Trp Leu Ser Thr Leu 
                485                 490                 495     


Glu Ile Ser Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala His 
            500                 505                 510         


His Phe Ser Glu Pro Glu Ile Thr Leu Ile Ile Phe Gly Val Met Ala 
        515                 520                 525             


Leu Val Ile Gly Thr Ile Leu Leu Ile Ser Tyr Gly Ile Arg Arg Leu 
    530                 535                 540                 


Ile Lys Lys Ser Pro Ser Gly Gly Gly Gly Ser Thr Gly Ser Gly Gly 
545                 550                 555                 560 


Ser Gly Phe Cys Tyr Glu Asn Glu Val Gly Ser Gly Arg Ser Arg Phe 
                565                 570                 575     


Val Lys Lys Asp Gly His Cys Asn Val Gln Phe Ile Asn Val Gly Ser 
            580                 585                 590         


Gly Lys Ser Arg Ile Thr Ser Glu Gly Glu Tyr Ile Pro Leu Asp Gln 
        595                 600                 605             


Ile Asp Ile Asn Val Gly Ser Gly Gly Ser Ser Tyr Thr Ser Asn Arg 
    610                 615                 620                 


Ile Gly Thr Ser Gln Val Ser Ile Leu Phe Thr Ala Gln Leu Asn Glu 
625                 630                 635                 640 


Thr Asp Arg Asn Trp Ser Cys Arg Asn Arg Val Gly 
                645                 650         


<210>  12
<211>  933
<212>  PRT
<213>  Artificial

<220>
<223>  split intein - heterologous polynucleotide construct

<400>  12

Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu 
1               5                   10                  15      


Ile Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr 
            20                  25                  30          


Asn Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys 
        35                  40                  45              


Ile Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu 
    50                  55                  60                  


Phe Pro Thr Gln Thr Gly Glu Met Asn Ile Ser Gly Gly Leu Lys Glu 
65                  70                  75                  80  


Gly Met Cys Leu Tyr Val Lys Glu Gly Gly Gly Gly Pro Glu Asp Lys 
                85                  90                  95      


Leu Gln Ala Ile Lys Tyr Glu Leu Ala Gln Asn Glu Glu Glu Leu Ala 
            100                 105                 110         


Gln Ile Glu Glu Lys Leu Ala Ala Asn Lys Glu Gly Gly Ser Gly Gly 
        115                 120                 125             


Gly Gly Ser Gly Thr Gly Phe Ala Asn Glu Leu Gly Pro Arg Leu Met 
    130                 135                 140                 


Gly Lys Gly Ser Gly Gly Gly Gly Ser Gly Pro Pro Arg Lys Arg Cys 
145                 150                 155                 160 


Cys Cys Ala Arg Arg Gly Thr Gln Leu Met Leu Val Gly Leu Leu Ser 
                165                 170                 175     


Thr Ala Met Trp Ala Gly Leu Leu Ala Leu Leu Leu Leu Trp His Trp 
            180                 185                 190         


Glu Thr Glu Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 
        195                 200                 205             


Gly Ser Gly Gly Gly Gly Ser Arg Lys Arg Thr Gln Pro Thr Phe Gly 
    210                 215                 220                 


Phe Thr Val Asn Trp Lys Phe Ser Glu Ser Thr Thr Val Phe Thr Gly 
225                 230                 235                 240 


Gln Cys Phe Ile Asp Arg Asn Gly Lys Glu Val Leu Lys Thr Met Trp 
                245                 250                 255     


Leu Leu Arg Ser Ser Val Asn Asp Ile Gly Asp Asp Trp Lys Ala Thr 
            260                 265                 270         


Arg Val Gly Ile Asn Ile Phe Thr Arg Leu Arg Thr Gln Lys Glu Gly 
        275                 280                 285             


Gly Ser Gly Gly Ser Ala Arg Lys Cys Ser Leu Thr Gly Lys Trp Thr 
    290                 295                 300                 


Asn Asp Leu Gly Ser Asn Met Thr Ile Gly Ala Val Asn Ser Arg Gly 
305                 310                 315                 320 


Glu Phe Thr Gly Thr Tyr Ile Thr Ala Val Thr Ala Thr Ser Asn Glu 
                325                 330                 335     


Ile Lys Glu Ser Pro Leu His Gly Thr Gln Asn Thr Ile Asn Lys Ser 
            340                 345                 350         


Gly Gly Ser Thr Thr Val Phe Thr Gly Gln Cys Phe Ile Asp Arg Asn 
        355                 360                 365             


Gly Lys Glu Val Leu Lys Thr Met Trp Leu Leu Arg Ser Ser Val Asn 
    370                 375                 380                 


Asp Ile Gly Asp Asp Trp Lys Ala Thr Arg Val Gly Ile Asn Ile Phe 
385                 390                 395                 400 


Thr Arg Leu Arg Thr Gln Lys Glu Gly Gly Ser Gly Gly Ser Ala Arg 
                405                 410                 415     


Lys Cys Ser Leu Thr Gly Lys Trp Thr Asn Asp Leu Gly Ser Asn Met 
            420                 425                 430         


Thr Ile Gly Ala Val Asn Ser Arg Gly Glu Phe Thr Gly Thr Tyr Ile 
        435                 440                 445             


Thr Ala Val Thr Ala Thr Ser Asn Glu Ile Lys Glu Ser Pro Leu His 
    450                 455                 460                 


Gly Thr Gln Asn Thr Ile Asn Lys Arg Thr Gln Pro Thr Phe Gly Phe 
465                 470                 475                 480 


Thr Val Asn Trp Lys Phe Ser Glu Gly Gly Ser Gly Ser Gly Ser Gly 
                485                 490                 495     


Ser Gly Ser Gly Arg Thr Gln Pro Thr Phe Gly Phe Thr Val Asn Trp 
            500                 505                 510         


Lys Phe Ser Glu Ser Thr Thr Val Phe Thr Gly Gln Cys Phe Ile Asp 
        515                 520                 525             


Arg Asn Gly Lys Glu Val Leu Lys Thr Met Trp Leu Leu Arg Ser Ser 
    530                 535                 540                 


Val Asn Asp Ile Gly Asp Asp Trp Lys Ala Thr Arg Val Gly Ile Asn 
545                 550                 555                 560 


Ile Phe Thr Arg Leu Arg Thr Gln Lys Glu Gly Gly Ser Gly Gly Ser 
                565                 570                 575     


Ala Arg Lys Cys Ser Leu Thr Gly Lys Trp Thr Asn Asp Leu Gly Ser 
            580                 585                 590         


Asn Met Thr Ile Gly Ala Val Asn Ser Arg Gly Glu Phe Thr Gly Thr 
        595                 600                 605             


Tyr Ile Thr Ala Val Thr Ala Thr Ser Asn Glu Ile Lys Glu Ser Pro 
    610                 615                 620                 


Leu His Gly Thr Gln Asn Thr Ile Asn Lys Ser Gly Gly Ser Thr Thr 
625                 630                 635                 640 


Val Phe Thr Gly Gln Cys Phe Ile Asp Arg Asn Gly Lys Glu Val Leu 
                645                 650                 655     


Lys Thr Met Trp Leu Leu Arg Ser Ser Val Asn Asp Ile Gly Asp Asp 
            660                 665                 670         


Trp Lys Ala Thr Arg Val Gly Ile Asn Ile Phe Thr Arg Leu Arg Thr 
        675                 680                 685             


Gln Lys Glu Gly Gly Ser Gly Gly Ser Ala Arg Lys Cys Ser Leu Thr 
    690                 695                 700                 


Gly Lys Trp Thr Asn Asp Leu Gly Ser Asn Met Thr Ile Gly Ala Val 
705                 710                 715                 720 


Asn Ser Arg Gly Glu Phe Thr Gly Thr Tyr Ile Thr Ala Val Thr Ala 
                725                 730                 735     


Thr Ser Asn Glu Ile Lys Glu Ser Pro Leu His Gly Thr Gln Asn Thr 
            740                 745                 750         


Ile Asn Lys Arg Thr Gln Pro Thr Phe Gly Phe Thr Val Asn Trp Lys 
        755                 760                 765             


Phe Ser Glu Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 
    770                 775                 780                 


Gly Ser Gly Gly Gly Gly Ser Ala His His Phe Ser Glu Pro Glu Ile 
785                 790                 795                 800 


Thr Leu Ile Ile Phe Gly Val Met Ala Leu Val Ile Gly Thr Ile Leu 
                805                 810                 815     


Leu Ile Ser Tyr Gly Ile Arg Arg Leu Ile Lys Lys Ser Pro Ser Gly 
            820                 825                 830         


Gly Gly Gly Ser Thr Gly Ser Gly Gly Ser Gly Phe Cys Tyr Glu Asn 
        835                 840                 845             


Glu Val Gly Ser Gly Arg Ser Arg Phe Val Lys Lys Asp Gly His Cys 
    850                 855                 860                 


Asn Val Gln Phe Ile Asn Val Gly Ser Gly Lys Ser Arg Ile Thr Ser 
865                 870                 875                 880 


Glu Gly Glu Tyr Ile Pro Leu Asp Gln Ile Asp Ile Asn Val Gly Ser 
                885                 890                 895     


Gly Gly Ser Ser Tyr Thr Ser Asn Arg Ile Gly Thr Ser Gln Val Ser 
            900                 905                 910         


Ile Leu Phe Thr Ala Gln Leu Asn Glu Thr Asp Arg Asn Trp Ser Cys 
        915                 920                 925             


Arg Asn Arg Val Gly 
    930             


<210>  13
<211>  415
<212>  PRT
<213>  Artificial

<220>
<223>  split intein - heterologous polynucleotide construct

<400>  13

Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu 
1               5                   10                  15      


Ile Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr 
            20                  25                  30          


Asn Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys 
        35                  40                  45              


Ile Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu 
    50                  55                  60                  


Phe Pro Thr Gln Thr Gly Glu Met Asn Ile Ser Gly Gly Leu Lys Glu 
65                  70                  75                  80  


Gly Met Cys Leu Tyr Val Lys Glu Gly Gly Gly Gly Pro Glu Asp Lys 
                85                  90                  95      


Leu Gln Ala Ile Lys Tyr Glu Leu Ala Gln Asn Glu Glu Glu Leu Ala 
            100                 105                 110         


Gln Ile Glu Glu Lys Leu Ala Ala Asn Lys Glu Gly Gly Ser Val Ser 
        115                 120                 125             


Lys Gly Glu Glu Asp Asn Met Ala Ser Leu Pro Ala Thr His Glu Leu 
    130                 135                 140                 


His Ile Phe Gly Ser Ile Asn Gly Val Asp Phe Asp Met Val Gly Gln 
145                 150                 155                 160 


Gly Thr Gly Asn Pro Asn Asp Gly Tyr Glu Glu Leu Asn Leu Lys Ser 
                165                 170                 175     


Thr Lys Gly Asp Leu Gln Phe Ser Pro Trp Ile Leu Val Pro His Ile 
            180                 185                 190         


Gly Tyr Gly Phe His Gln Tyr Leu Pro Tyr Pro Asp Gly Met Ser Pro 
        195                 200                 205             


Phe Gln Ala Ala Met Val Asp Gly Ser Gly Tyr Gln Val His Arg Thr 
    210                 215                 220                 


Met Gln Phe Glu Asp Gly Ala Ser Leu Thr Val Asn Tyr Arg Tyr Thr 
225                 230                 235                 240 


Tyr Glu Gly Ser His Ile Lys Gly Glu Ala Gln Val Lys Gly Thr Gly 
                245                 250                 255     


Phe Pro Ala Asp Gly Pro Val Met Thr Asn Ser Leu Thr Ala Ala Asp 
            260                 265                 270         


Trp Cys Arg Ser Lys Lys Thr Tyr Pro Asn Asp Lys Thr Ile Ile Ser 
        275                 280                 285             


Thr Phe Lys Trp Ser Tyr Thr Thr Gly Asn Gly Lys Arg Tyr Arg Ser 
    290                 295                 300                 


Thr Ala Arg Thr Thr Tyr Thr Phe Ala Lys Pro Met Ala Ala Asn Tyr 
305                 310                 315                 320 


Leu Lys Asn Gln Pro Met Tyr Val Phe Arg Lys Thr Glu Leu Lys His 
                325                 330                 335     


Ser Lys Thr Glu Leu Asn Phe Lys Glu Trp Gln Lys Ala Phe Thr Asp 
            340                 345                 350         


Val Met Gly Met Asp Glu Leu Tyr Lys Gly Thr Gly Phe Ala Asn Glu 
        355                 360                 365             


Leu Gly Pro Arg Leu Met Gly Lys Gly Ser Gly Gly Ser Ser Tyr Thr 
    370                 375                 380                 


Ser Asn Arg Ile Gly Thr Ser Gln Val Ser Ile Leu Phe Thr Ala Gln 
385                 390                 395                 400 


Leu Asn Glu Thr Asp Arg Asn Trp Ser Cys Arg Asn Arg Val Gly 
                405                 410                 415 


<210>  14
<211>  391
<212>  PRT
<213>  Artificial

<220>
<223>  split intein - heterologous polynucleotide construct

<400>  14

Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu 
1               5                   10                  15      


Ile Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr 
            20                  25                  30          


Asn Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys 
        35                  40                  45              


Ile Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu 
    50                  55                  60                  


Phe Pro Thr Gln Thr Gly Glu Met Asn Ile Ser Gly Gly Leu Lys Glu 
65                  70                  75                  80  


Gly Met Cys Leu Tyr Val Lys Glu Gly Gly Gly Gly Ser Gly Gly Gly 
                85                  90                  95      


Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Leu Val Pro 
            100                 105                 110         


Glu Leu Asn Glu Lys Asp Asp Asp Gln Val Gln Lys Ala Leu Ala Ser 
        115                 120                 125             


Arg Glu Asn Thr Gln Leu Met Asn Arg Asp Asn Ile Glu Ile Thr Val 
    130                 135                 140                 


Arg Asp Phe Lys Thr Leu Ala Pro Arg Arg Trp Leu Asn Ser Gly Ile 
145                 150                 155                 160 


Ile Ser Phe Phe Met Lys Tyr Ile Glu Lys Ser Thr Pro Asn Thr Val 
                165                 170                 175     


Ala Phe Asn Ser Phe Phe Tyr Thr Asn Leu Ser Glu Arg Gly Tyr Gln 
            180                 185                 190         


Gly Val Arg Arg Trp Met Lys Arg Lys Lys Thr Gln Ile Asp Lys Leu 
        195                 200                 205             


Asp Lys Ile Phe Thr Pro Ile Asn Leu Asn Gln Ser His Trp Ala Leu 
    210                 215                 220                 


Gly Ile Ile Asp Leu Lys Lys Lys Thr Ile Gly Tyr Val Asp Ser Leu 
225                 230                 235                 240 


Ser Asn Gly Pro Asn Ala Met Ser Phe Ala Ile Leu Thr Asp Leu Gln 
                245                 250                 255     


Lys Tyr Val Met Glu Glu Ser Lys His Thr Ile Gly Glu Asp Phe Asp 
            260                 265                 270         


Leu Ile His Leu Asp Cys Pro Gln Gln Pro Asn Gly Tyr Asp Cys Gly 
        275                 280                 285             


Ile Tyr Val Cys Met Asn Thr Leu Tyr Gly Ser Ala Asp Ala Pro Leu 
    290                 295                 300                 


Asp Phe Asp Tyr Lys Asp Ala Ile Arg Met Arg Arg Phe Ile Ala His 
305                 310                 315                 320 


Leu Ile Leu Thr Asp Ala Leu Lys Gly Gly Gly Gly Ser Gly Thr Gly 
                325                 330                 335     


Phe Ala Asn Glu Leu Gly Pro Arg Leu Met Gly Lys Gly Ser Gly Gly 
            340                 345                 350         


Gly Gly Met Met Leu Lys Lys Ile Leu Lys Ile Glu Glu Leu Asp Glu 
        355                 360                 365             


Arg Glu Leu Ile Asp Ile Glu Val Ser Gly Asn His Leu Phe Tyr Ala 
    370                 375                 380                 


Asn Asp Ile Leu Thr His Asn 
385                 390     


<210>  15
<211>  436
<212>  PRT
<213>  Artificial

<220>
<223>  split intein - heterologous polynucleotide construct

<400>  15

Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu 
1               5                   10                  15      


Ile Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr 
            20                  25                  30          


Asn Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys 
        35                  40                  45              


Ile Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu 
    50                  55                  60                  


Phe Pro Thr Gln Thr Gly Glu Met Asn Ile Ser Gly Gly Leu Lys Glu 
65                  70                  75                  80  


Gly Met Cys Leu Tyr Val Lys Glu Gly Gly Gly Gly Pro Glu Asp Lys 
                85                  90                  95      


Leu Gln Ala Ile Lys Tyr Glu Leu Ala Gln Asn Glu Glu Glu Leu Ala 
            100                 105                 110         


Gln Ile Glu Glu Lys Leu Ala Ala Asn Lys Glu Gly Gly Ser Gly Gly 
        115                 120                 125             


Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Glu Ser 
    130                 135                 140                 


Leu Phe Lys Gly Pro Arg Asp Tyr Asn Pro Ile Ser Ser Thr Ile Cys 
145                 150                 155                 160 


His Leu Thr Asn Glu Ser Asp Gly His Thr Thr Ser Leu Tyr Gly Ile 
                165                 170                 175     


Gly Phe Gly Pro Phe Ile Ile Thr Asn Lys His Leu Phe Arg Arg Asn 
            180                 185                 190         


Asn Gly Thr Leu Val Val Gln Ser Leu His Gly Val Phe Lys Val Lys 
        195                 200                 205             


Asn Thr Thr Thr Leu Gln Gln His Leu Ile Asp Gly Arg Asp Met Ile 
    210                 215                 220                 


Ile Ile Arg Met Pro Lys Asp Phe Pro Pro Phe Pro Gln Lys Leu Lys 
225                 230                 235                 240 


Phe Arg Glu Pro Gln Arg Glu Glu Arg Ile Cys Leu Val Thr Thr Asn 
                245                 250                 255     


Phe Gln Thr Lys Ser Met Ser Ser Met Val Ser Asp Thr Ser Cys Thr 
            260                 265                 270         


Phe Pro Ser Gly Asp Gly Ile Phe Trp Lys His Trp Ile Gln Thr Lys 
        275                 280                 285             


Asp Gly Gln Cys Gly Ser Pro Leu Val Ser Thr Arg Asp Gly Phe Ile 
    290                 295                 300                 


Val Gly Ile His Ser Ala Ser Asn Phe Thr Asn Thr Asn Asn Tyr Phe 
305                 310                 315                 320 


Thr Ser Val Pro Lys Asn Phe Met Glu Leu Leu Thr Asn Gln Glu Ala 
                325                 330                 335     


Gln Gln Trp Val Ser Gly Trp Arg Leu Asn Ala Asp Ser Val Leu Trp 
            340                 345                 350         


Gly Gly His Lys Val Phe Met Val Lys Pro Glu Glu Pro Phe Gln Pro 
        355                 360                 365             


Val Lys Glu Ala Thr Gln Leu Met Asn Gly Gly Gly Gly Ser Gly Thr 
    370                 375                 380                 


Gly Phe Ala Asn Glu Leu Gly Pro Arg Leu Met Gly Lys Gly Ser Gly 
385                 390                 395                 400 


Gly Ser Ser Tyr Thr Ser Asn Arg Ile Gly Thr Ser Gln Val Ser Ile 
                405                 410                 415     


Leu Phe Thr Ala Gln Leu Asn Glu Thr Asp Arg Asn Trp Ser Cys Arg 
            420                 425                 430         


Asn Arg Val Gly 
        435     


<210>  16
<211>  139
<212>  PRT
<213>  Artificial

<220>
<223>  split intein - heterologous polynucleotide construct

<400>  16

Leu Gln Arg Gly Ala Glu Arg Val Pro Gln Glu Gln Glu Glu Val Leu 
1               5                   10                  15      


Gln Asp His Pro Gly Arg Trp Gln Arg Asp His Leu Leu Arg Gly Thr 
            20                  25                  30          


Pro Val Pro Asn Pro Asp Arg Arg Asp Glu His Leu Trp Arg Pro Glu 
        35                  40                  45              


Arg Gly His Val Pro Val Arg Glu Arg Arg Arg Arg Arg Ile Arg Val 
    50                  55                  60                  


Gln Gly Arg Arg Gly Gln His Gly Gln Pro Ala Cys His Pro Arg Ala 
65                  70                  75                  80  


Ala His Leu Arg Gln His Gln Arg Arg Gly Leu Arg His Gly Gly Thr 
                85                  90                  95      


Gly His Arg Gln Pro Gln Arg Arg Ile Arg Gly Thr Glu Pro Glu Val 
            100                 105                 110         


His Gln Gly Gly Pro Pro Val Gln Pro Leu Asp Ser Gly Ala Pro His 
        115                 120                 125             


Arg Leu Arg Leu Pro Pro Val Pro Ala Leu Pro 
    130                 135                 


<210>  17
<211>  984
<212>  PRT
<213>  Artificial

<220>
<223>  split intein - heterologous polynucleotide construct

<400>  17

Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu 
1               5                   10                  15      


Ile Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr 
            20                  25                  30          


Asn Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys 
        35                  40                  45              


Ile Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu 
    50                  55                  60                  


Phe Pro Thr Gln Thr Gly Glu Met Asn Ile Ser Gly Gly Leu Lys Glu 
65                  70                  75                  80  


Gly Met Cys Leu Tyr Val Lys Glu Gly Gly Gly Gly Pro Glu Asp Lys 
                85                  90                  95      


Leu Gln Ala Ile Lys Tyr Glu Leu Ala Gln Asn Glu Glu Glu Leu Ala 
            100                 105                 110         


Gln Ile Glu Glu Lys Leu Ala Ala Asn Lys Glu Gly Gly Ser Gly Gly 
        115                 120                 125             


Gly Gly Ser Gly Thr Gly Phe Ala Asn Glu Leu Gly Pro Arg Leu Met 
    130                 135                 140                 


Gly Lys Gly Ser Gly Gly Gly Gly Ser Gly Pro Pro Arg Lys Arg Cys 
145                 150                 155                 160 


Cys Cys Ala Arg Arg Gly Thr Gln Leu Met Leu Val Gly Leu Leu Ser 
                165                 170                 175     


Thr Ala Met Trp Ala Gly Leu Leu Ala Leu Leu Leu Leu Trp His Trp 
            180                 185                 190         


Glu Thr Glu Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 
        195                 200                 205             


Gly Ser Gly Gly Gly Gly Ser Arg Lys Arg Thr Gln Pro Thr Phe Gly 
    210                 215                 220                 


Phe Thr Val Asn Trp Lys Phe Ser Glu Ser Thr Thr Val Phe Thr Gly 
225                 230                 235                 240 


Gln Cys Phe Ile Asp Arg Asn Gly Lys Glu Val Leu Lys Thr Met Trp 
                245                 250                 255     


Leu Leu Arg Ser Ser Val Asn Asp Ile Gly Asp Asp Trp Lys Ala Thr 
            260                 265                 270         


Arg Val Gly Ile Asn Ile Phe Thr Arg Leu Arg Thr Gln Lys Glu Gly 
        275                 280                 285             


Gly Ser Gly Gly Ser Ala Arg Lys Cys Ser Leu Thr Gly Lys Trp Thr 
    290                 295                 300                 


Asn Asp Leu Gly Ser Asn Met Thr Ile Gly Ala Val Asn Ser Arg Gly 
305                 310                 315                 320 


Glu Phe Thr Gly Thr Tyr Ile Thr Ala Val Thr Ala Thr Ser Asn Glu 
                325                 330                 335     


Ile Lys Glu Ser Pro Leu His Gly Thr Gln Asn Thr Ile Asn Lys Ser 
            340                 345                 350         


Gly Gly Ser Thr Thr Val Phe Thr Gly Gln Cys Phe Ile Asp Arg Asn 
        355                 360                 365             


Gly Lys Glu Val Leu Lys Thr Met Trp Leu Leu Arg Ser Ser Val Asn 
    370                 375                 380                 


Asp Ile Gly Asp Asp Trp Lys Ala Thr Arg Val Gly Ile Asn Ile Phe 
385                 390                 395                 400 


Thr Arg Leu Arg Thr Gln Lys Glu Gly Gly Ser Gly Gly Ser Ala Arg 
                405                 410                 415     


Lys Cys Ser Leu Thr Gly Lys Trp Thr Asn Asp Leu Gly Ser Asn Met 
            420                 425                 430         


Thr Ile Gly Ala Val Asn Ser Arg Gly Glu Phe Thr Gly Thr Tyr Ile 
        435                 440                 445             


Thr Ala Val Thr Ala Thr Ser Asn Glu Ile Lys Glu Ser Pro Leu His 
    450                 455                 460                 


Gly Thr Gln Asn Thr Ile Asn Lys Arg Thr Gln Pro Thr Phe Gly Phe 
465                 470                 475                 480 


Thr Val Asn Trp Lys Phe Ser Glu Gly Gly Ser Gly Ser Gly Ser Gly 
                485                 490                 495     


Ser Gly Ser Gly Arg Thr Gln Pro Thr Phe Gly Phe Thr Val Asn Trp 
            500                 505                 510         


Lys Phe Ser Glu Ser Thr Thr Val Phe Thr Gly Gln Cys Phe Ile Asp 
        515                 520                 525             


Arg Asn Gly Lys Glu Val Leu Lys Thr Met Trp Leu Leu Arg Ser Ser 
    530                 535                 540                 


Val Asn Asp Ile Gly Asp Asp Trp Lys Ala Thr Arg Val Gly Ile Asn 
545                 550                 555                 560 


Ile Phe Thr Arg Leu Arg Thr Gln Lys Glu Gly Gly Ser Gly Gly Ser 
                565                 570                 575     


Ala Arg Lys Cys Ser Leu Thr Gly Lys Trp Thr Asn Asp Leu Gly Ser 
            580                 585                 590         


Asn Met Thr Ile Gly Ala Val Asn Ser Arg Gly Glu Phe Thr Gly Thr 
        595                 600                 605             


Tyr Ile Thr Ala Val Thr Ala Thr Ser Asn Glu Ile Lys Glu Ser Pro 
    610                 615                 620                 


Leu His Gly Thr Gln Asn Thr Ile Asn Lys Ser Gly Gly Ser Thr Thr 
625                 630                 635                 640 


Val Phe Thr Gly Gln Cys Phe Ile Asp Arg Asn Gly Lys Glu Val Leu 
                645                 650                 655     


Lys Thr Met Trp Leu Leu Arg Ser Ser Val Asn Asp Ile Gly Asp Asp 
            660                 665                 670         


Trp Lys Ala Thr Arg Val Gly Ile Asn Ile Phe Thr Arg Leu Arg Thr 
        675                 680                 685             


Gln Lys Glu Gly Gly Ser Gly Gly Ser Ala Arg Lys Cys Ser Leu Thr 
    690                 695                 700                 


Gly Lys Trp Thr Asn Asp Leu Gly Ser Asn Met Thr Ile Gly Ala Val 
705                 710                 715                 720 


Asn Ser Arg Gly Glu Phe Thr Gly Thr Tyr Ile Thr Ala Val Thr Ala 
                725                 730                 735     


Thr Ser Asn Glu Ile Lys Glu Ser Pro Leu His Gly Thr Gln Asn Thr 
            740                 745                 750         


Ile Asn Lys Arg Thr Gln Pro Thr Phe Gly Phe Thr Val Asn Trp Lys 
        755                 760                 765             


Phe Ser Glu Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 
    770                 775                 780                 


Gly Ser Gly Gly Gly Gly Ser Ala His His Phe Ser Glu Pro Glu Ile 
785                 790                 795                 800 


Thr Leu Ile Ile Phe Gly Val Met Ala Leu Val Ile Gly Thr Ile Leu 
                805                 810                 815     


Leu Ile Ser Tyr Gly Ile Arg Arg Leu Ile Lys Lys Ser Pro Ser Gly 
            820                 825                 830         


Gly Gly Gly Ser Thr Gly Ser Gly Gly Ser Gly Phe Cys Tyr Glu Asn 
        835                 840                 845             


Glu Val Gly Ser Gly Arg Ser Arg Phe Val Lys Lys Asp Gly His Cys 
    850                 855                 860                 


Asn Val Gln Phe Ile Asn Val Gly Ser Gly Lys Ser Arg Ile Thr Ser 
865                 870                 875                 880 


Glu Gly Glu Tyr Ile Pro Leu Asp Gln Ile Asp Ile Asn Val Gly Ser 
                885                 890                 895     


Gly Gly Ser Ser Tyr Thr Ser Asn Arg Ile Gly Thr Ser Gly Gly Ser 
            900                 905                 910         


Pro Glu Asp Glu Asn Ala Ala Leu Glu Glu Lys Ile Ala Gln Leu Lys 
        915                 920                 925             


Gln Lys Asn Ala Ala Leu Lys Glu Glu Ile Gln Ala Leu Glu Tyr Gly 
    930                 935                 940                 


Gly Gly Gly Met Met Leu Lys Lys Ile Leu Lys Ile Glu Glu Leu Asp 
945                 950                 955                 960 


Glu Arg Glu Leu Ile Asp Ile Glu Val Ser Gly Asn His Leu Phe Tyr 
                965                 970                 975     


Ala Asn Asp Ile Leu Thr His Asn 
            980                 


<210>  18
<211>  703
<212>  PRT
<213>  Artificial

<220>
<223>  split intein - heterologous polynucleotide construct

<400>  18

Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu 
1               5                   10                  15      


Ile Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr 
            20                  25                  30          


Asn Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys 
        35                  40                  45              


Ile Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu 
    50                  55                  60                  


Phe Pro Thr Gln Thr Gly Glu Met Asn Ile Ser Gly Gly Leu Lys Glu 
65                  70                  75                  80  


Gly Met Cys Leu Tyr Val Lys Glu Gly Gly Gly Gly Pro Glu Asp Lys 
                85                  90                  95      


Leu Gln Ala Ile Lys Tyr Glu Leu Ala Gln Asn Glu Glu Glu Leu Ala 
            100                 105                 110         


Gln Ile Glu Glu Lys Leu Ala Ala Asn Lys Glu Gly Gly Ser Gly Gly 
        115                 120                 125             


Gly Gly Ser Gly Thr Gly Phe Ala Asn Glu Leu Gly Pro Arg Leu Met 
    130                 135                 140                 


Gly Lys Gly Ser Gly Gly Gly Gly Ser Gly Pro Pro Arg Lys Arg Cys 
145                 150                 155                 160 


Cys Cys Ala Arg Arg Gly Thr Gln Leu Met Leu Val Gly Leu Leu Ser 
                165                 170                 175     


Thr Ala Met Trp Ala Gly Leu Leu Ala Leu Leu Leu Leu Trp His Trp 
            180                 185                 190         


Glu Thr Glu Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Glu Ile Gly 
        195                 200                 205             


Thr Gly Phe Pro Phe Asp Pro His Tyr Val Glu Val Leu Gly Glu Arg 
    210                 215                 220                 


Met His Tyr Val Asp Val Gly Pro Arg Asp Gly Thr Pro Val Leu Phe 
225                 230                 235                 240 


Leu His Gly Asn Pro Thr Ser Ser Tyr Val Trp Arg Asn Ile Ile Pro 
                245                 250                 255     


His Val Ala Pro Thr His Arg Val Ile Ala Pro Asp Leu Ile Gly Met 
            260                 265                 270         


Gly Lys Ser Asp Lys Pro Asp Leu Gly Tyr Phe Phe Asp Asp His Val 
        275                 280                 285             


Arg Phe Met Asp Ala Phe Ile Glu Ala Leu Gly Leu Glu Glu Val Val 
    290                 295                 300                 


Leu Val Ile His Asp Trp Gly Ser Ala Leu Gly Phe His Trp Ala Lys 
305                 310                 315                 320 


Arg Asn Pro Glu Arg Val Lys Gly Ile Ala Phe Met Glu Phe Ile Arg 
                325                 330                 335     


Pro Ile Pro Thr Trp Asp Glu Trp Pro Glu Phe Ala Arg Glu Thr Phe 
            340                 345                 350         


Gln Ala Phe Arg Thr Thr Asp Val Gly Arg Lys Leu Ile Ile Asp Gln 
        355                 360                 365             


Asn Val Phe Ile Glu Gly Thr Leu Pro Met Gly Val Val Arg Pro Leu 
    370                 375                 380                 


Thr Glu Val Glu Met Asp His Tyr Arg Glu Pro Phe Leu Asn Pro Val 
385                 390                 395                 400 


Asp Arg Glu Pro Leu Trp Arg Phe Pro Asn Glu Leu Pro Ile Ala Gly 
                405                 410                 415     


Glu Pro Ala Asn Ile Val Ala Leu Val Glu Glu Tyr Met Asp Trp Leu 
            420                 425                 430         


His Gln Ser Pro Val Pro Lys Leu Leu Phe Trp Gly Thr Pro Gly Val 
        435                 440                 445             


Leu Ile Pro Pro Ala Glu Ala Ala Arg Leu Ala Lys Ser Leu Pro Asn 
    450                 455                 460                 


Ala Lys Ala Val Asp Ile Gly Pro Gly Leu Asn Leu Leu Gln Glu Asp 
465                 470                 475                 480 


Asn Pro Asp Leu Ile Gly Ser Glu Ile Ala Arg Trp Leu Ser Thr Leu 
                485                 490                 495     


Glu Ile Ser Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala His 
            500                 505                 510         


His Phe Ser Glu Pro Glu Ile Thr Leu Ile Ile Phe Gly Val Met Ala 
        515                 520                 525             


Leu Val Ile Gly Thr Ile Leu Leu Ile Ser Tyr Gly Ile Arg Arg Leu 
    530                 535                 540                 


Ile Lys Lys Ser Pro Ser Gly Gly Gly Gly Ser Thr Gly Ser Gly Gly 
545                 550                 555                 560 


Ser Gly Phe Cys Tyr Glu Asn Glu Val Gly Ser Gly Arg Ser Arg Phe 
                565                 570                 575     


Val Lys Lys Asp Gly His Cys Asn Val Gln Phe Ile Asn Val Gly Ser 
            580                 585                 590         


Gly Lys Ser Arg Ile Thr Ser Glu Gly Glu Tyr Ile Pro Leu Asp Gln 
        595                 600                 605             


Ile Asp Ile Asn Val Gly Ser Gly Gly Ser Ser Tyr Thr Ser Asn Arg 
    610                 615                 620                 


Ile Gly Thr Ser Gly Gly Ser Pro Glu Asp Glu Asn Ala Ala Leu Glu 
625                 630                 635                 640 


Glu Lys Ile Ala Gln Leu Lys Gln Lys Asn Ala Ala Leu Lys Glu Glu 
                645                 650                 655     


Ile Gln Ala Leu Glu Tyr Gly Gly Gly Gly Met Met Leu Lys Lys Ile 
            660                 665                 670         


Leu Lys Ile Glu Glu Leu Asp Glu Arg Glu Leu Ile Asp Ile Glu Val 
        675                 680                 685             


Ser Gly Asn His Leu Phe Tyr Ala Asn Asp Ile Leu Thr His Asn 
    690                 695                 700             


<210>  19
<211>  584
<212>  PRT
<213>  Artificial

<220>
<223>  split intein - heterologous polynucleotide construct

<400>  19

Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu 
1               5                   10                  15      


Ile Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr 
            20                  25                  30          


Asn Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys 
        35                  40                  45              


Ile Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu 
    50                  55                  60                  


Phe Pro Thr Gln Thr Gly Glu Met Asn Ile Ser Gly Gly Leu Lys Glu 
65                  70                  75                  80  


Gly Met Cys Leu Tyr Val Lys Glu Gly Gly Gly Gly Pro Glu Asp Lys 
                85                  90                  95      


Leu Gln Ala Ile Lys Tyr Glu Leu Ala Gln Asn Glu Glu Glu Leu Ala 
            100                 105                 110         


Gln Ile Glu Glu Lys Leu Ala Ala Asn Lys Glu Gly Gly Ser Gly Gly 
        115                 120                 125             


Gly Gly Ser Gly Thr Gly Phe Ala Asn Glu Leu Gly Pro Arg Leu Met 
    130                 135                 140                 


Gly Lys Gly Ser Gly Gly Gly Gly Ser Gly Pro Pro Arg Lys Arg Cys 
145                 150                 155                 160 


Cys Cys Ala Arg Arg Gly Thr Gln Leu Met Leu Val Gly Leu Leu Ser 
                165                 170                 175     


Thr Ala Met Trp Ala Gly Leu Leu Ala Leu Leu Leu Leu Trp His Trp 
            180                 185                 190         


Glu Thr Glu Gly Gly Gly Gly Ser Gly Thr Gly Ser Gly Val Phe Thr 
        195                 200                 205             


Leu Glu Asp Phe Val Gly Asp Trp Arg Gln Thr Ala Gly Tyr Asn Leu 
    210                 215                 220                 


Asp Gln Val Leu Glu Gln Gly Gly Val Ser Ser Leu Phe Gln Asn Leu 
225                 230                 235                 240 


Gly Val Ser Val Thr Pro Ile Gln Arg Ile Val Leu Ser Gly Glu Asn 
                245                 250                 255     


Gly Leu Lys Ile Asp Ile His Val Ile Ile Pro Tyr Glu Gly Leu Ser 
            260                 265                 270         


Gly Asp Gln Met Gly Gln Ile Glu Lys Ile Phe Lys Val Val Tyr Pro 
        275                 280                 285             


Val Asp Asp His His Phe Lys Val Ile Leu His Tyr Gly Thr Leu Val 
    290                 295                 300                 


Ile Asp Gly Val Thr Pro Asn Met Ile Asp Tyr Phe Gly Arg Pro Tyr 
305                 310                 315                 320 


Glu Gly Ile Ala Val Phe Asp Gly Lys Lys Ile Thr Val Thr Gly Thr 
                325                 330                 335     


Leu Trp Asn Gly Asn Lys Ile Ile Asp Glu Arg Leu Ile Asn Pro Asp 
            340                 345                 350         


Gly Ser Leu Leu Phe Arg Val Thr Ile Asn Gly Val Thr Gly Trp Arg 
        355                 360                 365             


Leu Cys Glu Arg Ile Leu Ala Gly Thr Asp Tyr Lys Asp Asp Asp Asp 
    370                 375                 380                 


Lys Gly Gly Gly Gly Gly Ser Ala His His Phe Ser Glu Pro Glu Ile 
385                 390                 395                 400 


Thr Leu Ile Ile Phe Gly Val Met Ala Leu Val Ile Gly Thr Ile Leu 
                405                 410                 415     


Leu Ile Ser Tyr Gly Ile Arg Arg Leu Ile Lys Lys Ser Pro Ser Gly 
            420                 425                 430         


Gly Gly Gly Ser Thr Gly Ser Gly Gly Ser Gly Phe Cys Tyr Glu Asn 
        435                 440                 445             


Glu Val Gly Ser Gly Arg Ser Arg Phe Val Lys Lys Asp Gly His Cys 
    450                 455                 460                 


Asn Val Gln Phe Ile Asn Val Gly Ser Gly Lys Ser Arg Ile Thr Ser 
465                 470                 475                 480 


Glu Gly Glu Tyr Ile Pro Leu Asp Gln Ile Asp Ile Asn Val Gly Ser 
                485                 490                 495     


Gly Gly Ser Ser Tyr Thr Ser Asn Arg Ile Gly Thr Ser Gly Gly Ser 
            500                 505                 510         


Pro Glu Asp Glu Asn Ala Ala Leu Glu Glu Lys Ile Ala Gln Leu Lys 
        515                 520                 525             


Gln Lys Asn Ala Ala Leu Lys Glu Glu Ile Gln Ala Leu Glu Tyr Gly 
    530                 535                 540                 


Gly Gly Gly Met Met Leu Lys Lys Ile Leu Lys Ile Glu Glu Leu Asp 
545                 550                 555                 560 


Glu Arg Glu Leu Ile Asp Ile Glu Val Ser Gly Asn His Leu Phe Tyr 
                565                 570                 575     


Ala Asn Asp Ile Leu Thr His Asn 
            580                 


<210>  20
<211>  604
<212>  PRT
<213>  Artificial

<220>
<223>  split intein - heterologous polynucleotide construct

<400>  20

Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu 
1               5                   10                  15      


Ile Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr 
            20                  25                  30          


Asn Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys 
        35                  40                  45              


Ile Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu 
    50                  55                  60                  


Phe Pro Thr Gln Thr Gly Glu Met Asn Ile Ser Gly Gly Leu Lys Glu 
65                  70                  75                  80  


Gly Met Cys Leu Tyr Val Lys Glu Gly Gly Gly Gly Pro Glu Asp Lys 
                85                  90                  95      


Leu Gln Ala Ile Lys Tyr Glu Leu Ala Gln Asn Glu Glu Glu Leu Ala 
            100                 105                 110         


Gln Ile Glu Glu Lys Leu Ala Ala Asn Lys Glu Gly Gly Ser Gly Gly 
        115                 120                 125             


Gly Gly Ser Gly Thr Gly Phe Ala Asn Glu Leu Gly Pro Arg Leu Met 
    130                 135                 140                 


Gly Lys Gly Ser Gly Gly Gly Gly Ser Gly Pro Pro Arg Lys Arg Cys 
145                 150                 155                 160 


Cys Cys Ala Arg Arg Gly Thr Gln Leu Met Leu Val Gly Leu Leu Ser 
                165                 170                 175     


Thr Ala Met Trp Ala Gly Leu Leu Ala Leu Leu Leu Leu Trp His Trp 
            180                 185                 190         


Glu Thr Glu Gly Gly Gly Gly Ser Arg Arg Arg Arg Arg Lys Arg Ser 
        195                 200                 205             


Ala Arg Gly Thr Gly Ser Gly Val Phe Thr Leu Glu Asp Phe Val Gly 
    210                 215                 220                 


Asp Trp Arg Gln Thr Ala Gly Tyr Asn Leu Asp Gln Val Leu Glu Gln 
225                 230                 235                 240 


Gly Gly Val Ser Ser Leu Phe Gln Asn Leu Gly Val Ser Val Thr Pro 
                245                 250                 255     


Ile Gln Arg Ile Val Leu Ser Gly Glu Asn Gly Leu Lys Ile Asp Ile 
            260                 265                 270         


His Val Ile Ile Pro Tyr Glu Gly Leu Ser Gly Asp Gln Met Gly Gln 
        275                 280                 285             


Ile Glu Lys Ile Phe Lys Val Val Tyr Pro Val Asp Asp His His Phe 
    290                 295                 300                 


Lys Val Ile Leu His Tyr Gly Thr Leu Val Ile Asp Gly Val Thr Pro 
305                 310                 315                 320 


Asn Met Ile Asp Tyr Phe Gly Arg Pro Tyr Glu Gly Ile Ala Val Phe 
                325                 330                 335     


Asp Gly Lys Lys Ile Thr Val Thr Gly Thr Leu Trp Asn Gly Asn Lys 
            340                 345                 350         


Ile Ile Asp Glu Arg Leu Ile Asn Pro Asp Gly Ser Leu Leu Phe Arg 
        355                 360                 365             


Val Thr Ile Asn Gly Val Thr Gly Trp Arg Leu Cys Glu Arg Ile Leu 
    370                 375                 380                 


Ala Gly Thr Asp Tyr Lys Asp Asp Asp Asp Lys Gly Arg Arg Arg Arg 
385                 390                 395                 400 


Arg Lys Arg Ser Ala Arg Gly Gly Gly Gly Ser Ala His His Phe Ser 
                405                 410                 415     


Glu Pro Glu Ile Thr Leu Ile Ile Phe Gly Val Met Ala Leu Val Ile 
            420                 425                 430         


Gly Thr Ile Leu Leu Ile Ser Tyr Gly Ile Arg Arg Leu Ile Lys Lys 
        435                 440                 445             


Ser Pro Ser Gly Gly Gly Gly Ser Thr Gly Ser Gly Gly Ser Gly Phe 
    450                 455                 460                 


Cys Tyr Glu Asn Glu Val Gly Ser Gly Arg Ser Arg Phe Val Lys Lys 
465                 470                 475                 480 


Asp Gly His Cys Asn Val Gln Phe Ile Asn Val Gly Ser Gly Lys Ser 
                485                 490                 495     


Arg Ile Thr Ser Glu Gly Glu Tyr Ile Pro Leu Asp Gln Ile Asp Ile 
            500                 505                 510         


Asn Val Gly Ser Gly Gly Ser Ser Tyr Thr Ser Asn Arg Ile Gly Thr 
        515                 520                 525             


Ser Gly Gly Ser Pro Glu Asp Glu Asn Ala Ala Leu Glu Glu Lys Ile 
    530                 535                 540                 


Ala Gln Leu Lys Gln Lys Asn Ala Ala Leu Lys Glu Glu Ile Gln Ala 
545                 550                 555                 560 


Leu Glu Tyr Gly Gly Gly Gly Met Met Leu Lys Lys Ile Leu Lys Ile 
                565                 570                 575     


Glu Glu Leu Asp Glu Arg Glu Leu Ile Asp Ile Glu Val Ser Gly Asn 
            580                 585                 590         


His Leu Phe Tyr Ala Asn Asp Ile Leu Thr His Asn 
        595                 600                 


<210>  21
<211>  352
<212>  PRT
<213>  Artificial

<220>
<223>  split intein - heterologous polynucleotide construct

<400>  21

Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu 
1               5                   10                  15      


Ile Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr 
            20                  25                  30          


Asn Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys 
        35                  40                  45              


Ile Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu 
    50                  55                  60                  


Phe Pro Thr Gln Thr Gly Glu Met Asn Ile Ser Gly Gly Leu Lys Glu 
65                  70                  75                  80  


Gly Met Cys Leu Tyr Val Lys Glu Gly Gly Gly Gly Gly Gly Ser Gly 
                85                  90                  95      


Gly Gly Gly Ser Gly Thr Gly Phe Ala Asn Glu Leu Gly Pro Arg Leu 
            100                 105                 110         


Met Gly Lys Gly Ser Gly Gly Gly Gly Ser Gly Val Phe Thr Leu Glu 
        115                 120                 125             


Asp Phe Val Gly Asp Trp Arg Gln Thr Ala Gly Tyr Asn Leu Asp Gln 
    130                 135                 140                 


Val Leu Glu Gln Gly Gly Val Ser Ser Leu Phe Gln Asn Leu Gly Val 
145                 150                 155                 160 


Ser Val Thr Pro Ile Gln Arg Ile Val Leu Ser Gly Glu Asn Gly Leu 
                165                 170                 175     


Lys Ile Asp Ile His Val Ile Ile Pro Tyr Glu Gly Leu Ser Gly Asp 
            180                 185                 190         


Gln Met Gly Gln Ile Glu Lys Ile Phe Lys Val Val Tyr Pro Val Asp 
        195                 200                 205             


Asp His His Phe Lys Val Ile Leu His Tyr Gly Thr Leu Val Ile Asp 
    210                 215                 220                 


Gly Val Thr Pro Asn Met Ile Asp Tyr Phe Gly Arg Pro Tyr Glu Gly 
225                 230                 235                 240 


Ile Ala Val Phe Asp Gly Lys Lys Ile Thr Val Thr Gly Thr Leu Trp 
                245                 250                 255     


Asn Gly Asn Lys Ile Ile Asp Glu Arg Leu Ile Asn Pro Asp Gly Ser 
            260                 265                 270         


Leu Leu Phe Arg Val Thr Ile Asn Gly Val Thr Gly Trp Arg Leu Cys 
        275                 280                 285             


Glu Arg Ile Leu Ala Gly Ser Gly Gly Ser Ser Tyr Thr Ser Asn Arg 
    290                 295                 300                 


Ile Gly Thr Ser Gly Gly Ser Gly Gly Gly Gly Met Met Leu Lys Lys 
305                 310                 315                 320 


Ile Leu Lys Ile Glu Glu Leu Asp Glu Arg Glu Leu Ile Asp Ile Glu 
                325                 330                 335     


Val Ser Gly Asn His Leu Phe Tyr Ala Asn Asp Ile Leu Thr His Asn 
            340                 345                 350         


<210>  22
<211>  414
<212>  PRT
<213>  Artificial

<220>
<223>  split intein - heterologous polynucleotide construct

<400>  22

Cys Leu Asp Leu Lys Thr Gln Val Gln Thr Pro Gln Gly Met Lys Glu 
1               5                   10                  15      


Ile Ser Asn Ile Gln Val Gly Asp Leu Val Leu Ser Asn Thr Gly Tyr 
            20                  25                  30          


Asn Glu Val Leu Asn Val Phe Pro Lys Ser Lys Lys Lys Ser Tyr Lys 
        35                  40                  45              


Ile Thr Leu Glu Asp Gly Lys Glu Ile Ile Cys Ser Glu Glu His Leu 
    50                  55                  60                  


Phe Pro Thr Gln Thr Gly Glu Met Asn Ile Ser Gly Gly Leu Lys Glu 
65                  70                  75                  80  


Gly Met Cys Leu Tyr Val Lys Glu Gly Gly Gly Gly Pro Glu Asp Lys 
                85                  90                  95      


Leu Gln Ala Ile Lys Tyr Glu Leu Ala Gln Asn Glu Glu Glu Leu Ala 
            100                 105                 110         


Gln Ile Glu Glu Lys Leu Ala Ala Asn Lys Glu Gly Gly Ser Gly Gly 
        115                 120                 125             


Gly Gly Ser Gly Thr Gly Phe Ala Asn Glu Leu Gly Pro Arg Leu Met 
    130                 135                 140                 


Gly Lys Gly Ser Gly Gly Gly Gly Ser Gly Val Phe Thr Leu Glu Asp 
145                 150                 155                 160 


Phe Val Gly Asp Trp Arg Gln Thr Ala Gly Tyr Asn Leu Asp Gln Val 
                165                 170                 175     


Leu Glu Gln Gly Gly Val Ser Ser Leu Phe Gln Asn Leu Gly Val Ser 
            180                 185                 190         


Val Thr Pro Ile Gln Arg Ile Val Leu Ser Gly Glu Asn Gly Leu Lys 
        195                 200                 205             


Ile Asp Ile His Val Ile Ile Pro Tyr Glu Gly Leu Ser Gly Asp Gln 
    210                 215                 220                 


Met Gly Gln Ile Glu Lys Ile Phe Lys Val Val Tyr Pro Val Asp Asp 
225                 230                 235                 240 


His His Phe Lys Val Ile Leu His Tyr Gly Thr Leu Val Ile Asp Gly 
                245                 250                 255     


Val Thr Pro Asn Met Ile Asp Tyr Phe Gly Arg Pro Tyr Glu Gly Ile 
            260                 265                 270         


Ala Val Phe Asp Gly Lys Lys Ile Thr Val Thr Gly Thr Leu Trp Asn 
        275                 280                 285             


Gly Asn Lys Ile Ile Asp Glu Arg Leu Ile Asn Pro Asp Gly Ser Leu 
    290                 295                 300                 


Leu Phe Arg Val Thr Ile Asn Gly Val Thr Gly Trp Arg Leu Cys Glu 
305                 310                 315                 320 


Arg Ile Leu Ala Gly Ser Gly Gly Ser Ser Tyr Thr Ser Asn Arg Ile 
                325                 330                 335     


Gly Thr Ser Gly Gly Ser Pro Glu Asp Glu Asn Ala Ala Leu Glu Glu 
            340                 345                 350         


Lys Ile Ala Gln Leu Lys Gln Lys Asn Ala Ala Leu Lys Glu Glu Ile 
        355                 360                 365             


Gln Ala Leu Glu Tyr Gly Gly Gly Gly Met Met Leu Lys Lys Ile Leu 
    370                 375                 380                 


Lys Ile Glu Glu Leu Asp Glu Arg Glu Leu Ile Asp Ile Glu Val Ser 
385                 390                 395                 400 


Gly Asn His Leu Phe Tyr Ala Asn Asp Ile Leu Thr His Asn 
                405                 410                 


<210>  23
<211>  8642
<212>  DNA
<213>  Artificial

<220>
<223>  Expression vector for Cas9

<400>  23
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaatagtaa cgccaatagg       60

gactttccat tgacgtcaat gggtggagta tttacggtaa actgcccact tggcagtaca      120

tcaagtgtat catatgccaa gtacgccccc tattgacgtc aatgacggta aatggcccgc      180

ctggcattgt gcccagtaca tgaccttatg ggactttcct acttggcagt acatctacgt      240

attagtcatc gctattacca tggtcgaggt gagccccacg ttctgcttca ctctccccat      300

ctcccccccc tccccacccc caattttgta tttatttatt ttttaattat tttgtgcagc      360

gatgggggcg gggggggggg gggggcgcgc gccaggcggg gcggggcggg gcgaggggcg      420

gggcggggcg aggcggagag gtgcggcggc agccaatcag agcggcgcgc tccgaaagtt      480

tccttttatg gcgaggcggc ggcggcggcg gccctataaa aagcgaagcg cgcggcgggc      540

gggagtcgct gcgacgctgc cttcgccccg tgccccgctc cgccgccgcc tcgcgccgcc      600

cgccccggct ctgactgacc gcgttactcc cacaggtgag cgggcgggac ggcccttctc      660

ctccgggctg taattagctg agcaagaggt aagggtttaa gggatggttg gttggtgggg      720

tattaatgtt taattacctg gagcacctgc ctgaaatcac tttttttcag gttggatcct      780

taattaataa tacgactcac tataggggcc gccaccatgg acaagaagta cagcatcggc      840

ctggacatcg gcaccaactc tgtgggctgg gccgtgatca ccgacgagta caaggtgccc      900

agcaagaaat tcaaggtgct gggcaacacc gaccggcaca gcatcaagaa gaacctgatc      960

ggagccctgc tgttcgacag cggcgaaaca gccgaggcca cccggctgaa gagaaccgcc     1020

agaagaagat acaccagacg gaagaaccgg atctgctatc tgcaagagat cttcagcaac     1080

gagatggcca aggtggacga cagcttcttc cacagactgg aagagtcctt cctggtggaa     1140

gaggataaga agcacgagcg gcaccccatc ttcggcaaca tcgtggacga ggtggcctac     1200

cacgagaagt accccaccat ctaccacctg agaaagaaac tggtggacag caccgacaag     1260

gccgacctgc ggctgatcta tctggccctg gcccacatga tcaagttccg gggccacttc     1320

ctgatcgagg gcgacctgaa ccccgacaac agcgacgtgg acaagctgtt catccagctg     1380

gtgcagacct acaaccagct gttcgaggaa aaccccatca acgccagcgg cgtggacgcc     1440

aaggccatcc tgtctgccag actgagcaag agcagacggc tggaaaatct gatcgcccag     1500

ctgcccggcg agaagaagaa tggcctgttc ggcaacctga ttgccctgag cctgggcctg     1560

acccccaact tcaagagcaa cttcgacctg gccgaggatg ccaaactgca gctgagcaag     1620

gacacctacg acgacgacct ggacaacctg ctggcccaga tcggcgacca gtacgccgac     1680

ctgtttctgg ccgccaagaa cctgtccgac gccatcctgc tgagcgacat cctgagagtg     1740

aacaccgaga tcaccaaggc ccccctgagc gcctctatga tcaagagata cgacgagcac     1800

caccaggacc tgaccctgct gaaagctctc gtgcggcagc agctgcctga gaagtacaaa     1860

gagattttct tcgaccagag caagaacggc tacgccggct acattgacgg cggagccagc     1920

caggaagagt tctacaagtt catcaagccc atcctggaaa agatggacgg caccgaggaa     1980

ctgctcgtga agctgaacag agaggacctg ctgcggaagc agcggacctt cgacaacggc     2040

agcatccccc accagatcca cctgggagag ctgcacgcca ttctgcggcg gcaggaagat     2100

ttttacccat tcctgaagga caaccgggaa aagatcgaga agatcctgac cttccgcatc     2160

ccctactacg tgggccctct ggccagggga aacagcagat tcgcctggat gaccagaaag     2220

agcgaggaaa ccatcacccc ctggaacttc gaggaagtgg tggacaaggg cgcttccgcc     2280

cagagcttca tcgagcggat gaccaacttc gataagaacc tgcccaacga gaaggtgctg     2340

cccaagcaca gcctgctgta cgagtacttc accgtgtata acgagctgac caaagtgaaa     2400

tacgtgaccg agggaatgag aaagcccgcc ttcctgagcg gcgagcagaa aaaggccatc     2460

gtggacctgc tgttcaagac caaccggaaa gtgaccgtga agcagctgaa agaggactac     2520

ttcaagaaaa tcgagtgctt cgactccgtg gaaatctccg gcgtggaaga tcggttcaac     2580

gcctccctgg gcacatacca cgatctgctg aaaattatca aggacaagga cttcctggac     2640

aatgaggaaa acgaggacat tctggaagat atcgtgctga ccctgacact gtttgaggac     2700

agagagatga tcgaggaacg gctgaaaacc tatgcccacc tgttcgacga caaagtgatg     2760

aagcagctga agcggcggag atacaccggc tggggcaggc tgagccggaa gctgatcaac     2820

ggcatccggg acaagcagtc cggcaagaca atcctggatt tcctgaagtc cgacggcttc     2880

gccaacagaa acttcatgca gctgatccac gacgacagcc tgacctttaa agaggacatc     2940

cagaaagccc aggtgtccgg ccagggcgat agcctgcacg agcacattgc caatctggcc     3000

ggcagccccg ccattaagaa gggcatcctg cagacagtga aggtggtgga cgagctcgtg     3060

aaagtgatgg gccggcacaa gcccgagaac atcgtgatcg aaatggccag agagaaccag     3120

accacccaga agggacagaa gaacagccgc gagagaatga agcggatcga agagggcatc     3180

aaagagctgg gcagccagat cctgaaagaa caccccgtgg aaaacaccca gctgcagaac     3240

gagaagctgt acctgtacta cctgcagaat gggcgggata tgtacgtgga ccaggaactg     3300

gacatcaacc ggctgtccga ctacgatgtg gaccatatcg tgcctcagag ctttctgaag     3360

gacgactcca tcgacaacaa ggtgctgacc agaagcgaca agaaccgggg caagagcgac     3420

aacgtgccct ccgaagaggt cgtgaagaag atgaagaact actggcggca gctgctgaac     3480

gccaagctga ttacccagag aaagttcgac aatctgacca aggccgagag aggcggcctg     3540

agcgaactgg ataaggccgg cttcatcaag agacagctgg tggaaacccg gcagatcaca     3600

aagcacgtgg cacagatcct ggactcccgg atgaacacta agtacgacga gaatgacaag     3660

ctgatccggg aagtgaaagt gatcaccctg aagtccaagc tggtgtccga tttccggaag     3720

gatttccagt tttacaaagt gcgcgagatc aacaactacc accacgccca cgacgcctac     3780

ctgaacgccg tcgtgggaac cgccctgatc aaaaagtacc ctaagctgga aagcgagttc     3840

gtgtacggcg actacaaggt gtacgacgtg cggaagatga tcgccaagag cgagcaggaa     3900

atcggcaagg ctaccgccaa gtacttcttc tacagcaaca tcatgaactt tttcaagacc     3960

gagattaccc tggccaacgg cgagatccgg aagcggcctc tgatcgagac aaacggcgaa     4020

accggggaga tcgtgtggga taagggccgg gattttgcca ccgtgcggaa agtgctgagc     4080

atgccccaag tgaatatcgt gaaaaagacc gaggtgcaga caggcggctt cagcaaagag     4140

tctatcctgc ccaagaggaa cagcgataag ctgatcgcca gaaagaagga ctgggaccct     4200

aagaagtacg gcggcttcga cagccccacc gtggcctatt ctgtgctggt ggtggccaaa     4260

gtggaaaagg gcaagtccaa gaaactgaag agtgtgaaag agctgctggg gatcaccatc     4320

atggaaagaa gcagcttcga gaagaatccc atcgactttc tggaagccaa gggctacaaa     4380

gaagtgaaaa aggacctgat catcaagctg cctaagtact ccctgttcga gctggaaaac     4440

ggccggaaga gaatgctggc ctctgccggc gaactgcaga agggaaacga actggccctg     4500

ccctccaaat atgtgaactt cctgtacctg gccagccact atgagaagct gaagggctcc     4560

cccgaggata atgagcagaa acagctgttt gtggaacagc acaagcacta cctggacgag     4620

atcatcgagc agatcagcga gttctccaag agagtgatcc tggccgacgc taatctggac     4680

aaagtgctgt ccgcctacaa caagcaccgg gataagccca tcagagagca ggccgagaat     4740

atcatccacc tgtttaccct gaccaatctg ggagcccctg ccgccttcaa gtactttgac     4800

accaccatcg accggaagag gtacaccagc accaaagagg tgctggacgc caccctgatc     4860

caccagagca tcaccggcct gtacgagaca cggatcgacc tgtctcagct gggaggcgac     4920

gcctatccct atgacgtgcc cgattatgcc agcctgggca gcggctcccc caagaaaaaa     4980

cgcaaggtgg aagatcctaa gaaaaagcgg aaagtggact gaacgcgtaa atgattgcag     5040

atccactagt tctagagctc gctgatcagc ctcgactgtg ccttctagtt gccagccatc     5100

tgttgtttgc ccctcccccg tgccttcctt gaccctggaa ggtgccactc ccactgtcct     5160

ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg     5220

gggtggggtg gggcaggaca gcaaggggga ggattgggaa gagaatagca ggcatgctgg     5280

ggatgcggtg ggctctatgg cttctgaggc ggaaagaacc agctgggggc ggccgcagga     5340

acccctagtg atggagttgg ccactccctc tctgcgcgct cgctcgctca ctgaggccgg     5400

gcgaccaaag gtcgcccgac gcccgggctt tgcccgggcg gcctcagtga gcgagcgagc     5460

gcgcagctgc ctgcaggggc gcctgatgcg gtattttctc cttacgcatc tgtgcggtat     5520

ttcacaccgc atacgtcaaa gcaaccatag tacgcgccct gtagcggcgc attaagcgcg     5580

gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg ccagcgccct agcgcccgct     5640

cctttcgctt tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta     5700

aatcgggggc tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa     5760

cttgatttgg gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct     5820

ttgacgttgg agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc     5880

aaccctatct cgggctattc ttttgattta taagggattt tgccgatttc ggcctattgg     5940

ttaaaaaatg agctgattta acaaaaattt aacgcgaatt ttaacaaaat attaacgttt     6000

acaattttat ggtgcactct cagtacaatc tgctctgatg ccgcatagtt aagccagccc     6060

cgacacccgc caacacccgc tgacgcgccc tgacgggctt gtctgctccc ggcatccgct     6120

tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc accgtcatca     6180

ccgaaacgcg cgagacgaaa gggcctcgtg atacgcctat ttttataggt taatgtcatg     6240

ataataatgg tttcttagac gtcaggtggc acttttcggg gaaatgtgcg cggaacccct     6300

atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca ataaccctga     6360

taaatgcttc aataatattg aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc     6420

cttattccct tttttgcggc attttgcctt cctgtttttg ctcacccaga aacgctggtg     6480

aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg gttacatcga actggatctc     6540

aacagcggta agatccttga gagttttcgc cccgaagaac gttttccaat gatgagcact     6600

tttaaagttc tgctatgtgg cgcggtatta tcccgtattg acgccgggca agagcaactc     6660

ggtcgccgca tacactattc tcagaatgac ttggttgagt actcaccagt cacagaaaag     6720

catcttacgg atggcatgac agtaagagaa ttatgcagtg ctgccataac catgagtgat     6780

aacactgcgg ccaacttact tctgacaacg atcggaggac cgaaggagct aaccgctttt     6840

ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa     6900

gccataccaa acgacgagcg tgacaccacg atgcctgtag caatggcaac aacgttgcgc     6960

aaactattaa ctggcgaact acttactcta gcttcccggc aacaattaat agactggatg     7020

gaggcggata aagttgcagg accacttctg cgctcggccc ttccggctgg ctggtttatt     7080

gctgataaat ctggagccgg tgagcgtggg tctcgcggta tcattgcagc actggggcca     7140

gatggtaagc cctcccgtat cgtagttatc tacacgacgg ggagtcaggc aactatggat     7200

gaacgaaata gacagatcgc tgagataggt gcctcactga ttaagcattg gtaactgtca     7260

gaccaagttt actcatatat actttagatt gatttaaaac ttcattttta atttaaaagg     7320

atctaggtga agatcctttt tgataatctc atgaccaaaa tcccttaacg tgagttttcg     7380

ttccactgag cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga tccttttttt     7440

ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg     7500

ccggatcaag agctaccaac tctttttccg aaggtaactg gcttcagcag agcgcagata     7560

ccaaatactg tccttctagt gtagccgtag ttaggccacc acttcaagaa ctctgtagca     7620

ccgcctacat acctcgctct gctaatcctg ttaccagtgg ctgctgccag tggcgataag     7680

tcgtgtctta ccgggttgga ctcaagacga tagttaccgg ataaggcgca gcggtcgggc     7740

tgaacggggg gttcgtgcac acagcccagc ttggagcgaa cgacctacac cgaactgaga     7800

tacctacagc gtgagctatg agaaagcgcc acgcttcccg aagggagaaa ggcggacagg     7860

tatccggtaa gcggcagggt cggaacagga gagcgcacga gggagcttcc agggggaaac     7920

gcctggtatc tttatagtcc tgtcgggttt cgccacctct gacttgagcg tcgatttttg     7980

tgatgctcgt caggggggcg gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg     8040

ttcctggcct tttgctggcc ttttgctcac atgtcctgca ggcagctgcg cgctcgctcg     8100

ctcactgagg ccgcccgggc aaagcccggg cgtcgggcga cctttggtcg cccggcctca     8160

gtgagcgagc gagcgcgcag agagggagtg gccaactcca tcactagggg ttcctgcggc     8220

cgcaaggtcg ggcaggaaga gggcctattt cccatgattc cttcatattt gcatatacga     8280

tacaaggctg ttagagagat aattggaatt aatttgactg taaacacaaa gatattagta     8340

caaaatacgt gacgtagaaa gtaataattt cttgggtagt ttgcagtttt aaaattatgt     8400

tttaaaatgg actatcatat gcttaccgta acttgaaagt atttcgattt cttggcttta     8460

tatatcttgt ggaaaggacg aaacaccggg tcttcgagaa gacctgttta agagctatgc     8520

tggaaacagc atagcaagtt taaataaggc tagtccgtta tcaacttgaa aaagtggcac     8580

cgagtcggtg ctttttttga attcgtttaa acggtacccg ttacataact tacggtaaat     8640

gg                                                                    8642


<210>  24
<211>  8927
<212>  DNA
<213>  Artificial

<220>
<223>  Expression vector for Cas9

<400>  24
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaatagtaa cgccaatagg       60

gactttccat tgacgtcaat gggtggagta tttacggtaa actgcccact tggcagtaca      120

tcaagtgtat catatgccaa gtacgccccc tattgacgtc aatgacggta aatggcccgc      180

ctggcattgt gcccagtaca tgaccttatg ggactttcct acttggcagt acatctacgt      240

attagtcatc gctattacca tggtcgaggt gagccccacg ttctgcttca ctctccccat      300

ctcccccccc tccccacccc caattttgta tttatttatt ttttaattat tttgtgcagc      360

gatgggggcg gggggggggg gggggcgcgc gccaggcggg gcggggcggg gcgaggggcg      420

gggcggggcg aggcggagag gtgcggcggc agccaatcag agcggcgcgc tccgaaagtt      480

tccttttatg gcgaggcggc ggcggcggcg gccctataaa aagcgaagcg cgcggcgggc      540

gggagtcgct gcgacgctgc cttcgccccg tgccccgctc cgccgccgcc tcgcgccgcc      600

cgccccggct ctgactgacc gcgttactcc cacaggtgag cgggcgggac ggcccttctc      660

ctccgggctg taattagctg agcaagaggt aagggtttaa gggatggttg gttggtgggg      720

tattaatgtt taattacctg gagcacctgc ctgaaatcac tttttttcag gttggatcct      780

taattaataa tacgactcac tataggggcc gccaccatgg acaagaagta cagcatcggc      840

ctggacatcg gcaccaactc tgtgggctgg gccgtgatca ccgacgagta caaggtgccc      900

agcaagaaat tcaaggtgct gggcaacacc gaccggcaca gcatcaagaa gaacctgatc      960

ggagccctgc tgttcgacag cggcgaaaca gccgaggcca cccggctgaa gagaaccgcc     1020

agaagaagat acaccagacg gaagaaccgg atctgctatc tgcaagagat cttcagcaac     1080

gagatggcca aggtggacga cagcttcttc cacagactgg aagagtcctt cctggtggaa     1140

gaggataaga agcacgagcg gcaccccatc ttcggcaaca tcgtggacga ggtggcctac     1200

cacgagaagt accccaccat ctaccacctg agaaagaaac tggtggacag caccgacaag     1260

gccgacctgc ggctgatcta tctggccctg gcccacatga tcaagttccg gggccacttc     1320

ctgatcgagg gcgacctgaa ccccgacaac agcgacgtgg acaagctgtt catccagctg     1380

gtgcagacct acaaccagct gttcgaggaa aaccccatca acgccagcgg cgtggacgcc     1440

aaggccatcc tgtctgccag actgagcaag agcagacggc tggaaaatct gatcgcccag     1500

ctgcccggcg agaagaagaa tggcctgttc ggcaacctga ttgccctgag cctgggcctg     1560

acccccaact tcaagagcaa cttcgacctg gccgaggatg ccaaactgca gctgagcaag     1620

gacacctacg acgacgacct ggacaacctg ctggcccaga tcggcgacca gtacgccgac     1680

ctgtttctgg ccgccaagaa cctgtccgac gccatcctgc tgagcgacat cctgagagtg     1740

aacaccgaga tcaccaaggc ccccctgagc gcctctatga tcaagagata cgacgagcac     1800

caccaggacc tgaccctgct gaaagctctc gtgcggcagc agctgcctga gaagtacaaa     1860

gagattttct tcgaccagag caagaacggc tacgccggct acattgacgg cggagccagc     1920

caggaagagt tctacaagtt catcaagccc atcctggaaa agatggacgg caccgaggaa     1980

ctgctcgtga agctgaacag agaggacctg ctgcggaagc agcggacctt cgacaacggc     2040

agcatccccc accagatcca cctgggagag ctgcacgcca ttctgcggcg gcaggaagat     2100

ttttacccat tcctgaagga caaccgggaa aagatcgaga agatcctgac cttccgcatc     2160

ccctactacg tgggccctct ggccagggga aacagcagat tcgcctggat gaccagaaag     2220

agcgaggaaa ccatcacccc ctggaacttc gaggaagtgg tggacaaggg cgcttccgcc     2280

cagagcttca tcgagcggat gaccaacttc gataagaacc tgcccaacga gaaggtgctg     2340

cccaagcaca gcctgctgta cgagtacttc accgtgtata acgagctgac caaagtgaaa     2400

tacgtgaccg agggaatgag aaagcccgcc ttcctgagcg gcgagcagaa aaaggccatc     2460

gtggacctgc tgttcaagac caaccggaaa gtgaccgtga agcagctgaa agaggactac     2520

ttcaagaaaa tcgagtgctt cgactccgtg gaaatctccg gcgtggaaga tcggttcaac     2580

gcctccctgg gcacatacca cgatctgctg aaaattatca aggacaagga cttcctggac     2640

aatgaggaaa acgaggacat tctggaagat atcgtgctga ccctgacact gtttgaggac     2700

agagagatga tcgaggaacg gctgaaaacc tatgcccacc tgttcgacga caaagtgatg     2760

aagcagctga agcggcggag atacaccggc tggggcaggc tgagccggaa gctgatcaac     2820

ggcatccggg acaagcagtc cggcaagaca atcctggatt tcctgaagtc cgacggcttc     2880

gccaacagaa acttcatgca gctgatccac gacgacagcc tgacctttaa agaggacatc     2940

cagaaagccc aggtgtccgg ccagggcgat agcctgcacg agcacattgc caatctggcc     3000

ggcagccccg ccattaagaa gggcatcctg cagacagtga aggtggtgga cgagctcgtg     3060

aaagtgatgg gccggcacaa gcccgagaac atcgtgatcg aaatggccag agagaaccag     3120

accacccaga agggacagaa gaacagccgc gagagaatga agcggatcga agagggcatc     3180

aaagagctgg gcagccagat cctgaaagaa caccccgtgg aaaacaccca gctgcagaac     3240

gagaagctgt acctgtacta cctgcagaat gggcgggata tgtacgtgga ccaggaactg     3300

gacatcaacc ggctgtccga ctacgatgtg gaccatatcg tgcctcagag ctttctgaag     3360

gacgactcca tcgacaacaa ggtgctgacc agaagcgaca agaaccgggg caagagcgac     3420

aacgtgccct ccgaagaggt cgtgaagaag atgaagaact actggcggca gctgctgaac     3480

gccaagctga ttacccagag aaagttcgac aatctgacca aggccgagag aggcggcctg     3540

agcgaactgg ataaggccgg cttcatcaag agacagctgg tggaaacccg gcagatcaca     3600

aagcacgtgg cacagatcct ggactcccgg atgaacacta agtacgacga gaatgacaag     3660

ctgatccggg aagtgaaagt gatcaccctg aagtccaagc tggtgtccga tttccggaag     3720

gatttccagt tttacaaagt gcgcgagatc aacaactacc accacgccca cgacgcctac     3780

ctgaacgccg tcgtgggaac cgccctgatc aaaaagtacc ctaagctgga aagcgagttc     3840

gtgtacggcg actacaaggt gtacgacgtg cggaagatga tcgccaagag cgagcaggaa     3900

atcggcaagg ctaccgccaa gtacttcttc tacagcaaca tcatgaactt tttcaagacc     3960

gagattaccc tggccaacgg cgagatccgg aagcggcctc tgatcgagac aaacggcgaa     4020

accggggaga tcgtgtggga taagggccgg gattttgcca ccgtgcggaa agtgctgagc     4080

atgccccaag tgaatatcgt gaaaaagacc gaggtgcaga caggcggctt cagcaaagag     4140

tctatcctgc ccaagaggaa cagcgataag ctgatcgcca gaaagaagga ctgggaccct     4200

aagaagtacg gcggcttcga cagccccacc gtggcctatt ctgtgctggt ggtggccaaa     4260

gtggaaaagg gcaagtccaa gaaactgaag agtgtgaaag agctgctggg gatcaccatc     4320

atggaaagaa gcagcttcga gaagaatccc atcgactttc tggaagccaa gggctacaaa     4380

gaagtgaaaa aggacctgat catcaagctg cctaagtact ccctgttcga gctggaaaac     4440

ggccggaaga gaatgctggc ctctgccggc gaactgcaga agggaaacga actggccctg     4500

ccctccaaat atgtgaactt cctgtacctg gccagccact atgagaagct gaagggctcc     4560

cccgaggata atgagcagaa acagctgttt gtggaacagc acaagcacta cctggacgag     4620

atcatcgagc agatcagcga gttctccaag agagtgatcc tggccgacgc taatctggac     4680

aaagtgctgt ccgcctacaa caagcaccgg gataagccca tcagagagca ggccgagaat     4740

atcatccacc tgtttaccct gaccaatctg ggagcccctg ccgccttcaa gtactttgac     4800

accaccatcg accggaagag gtacaccagc accaaagagg tgctggacgc caccctgatc     4860

caccagagca tcaccggcct gtacgagaca cggatcgacc tgtctcagct gggaggcgac     4920

gcctatccct atgacgtgcc cgattatgcc agcctgggca gcggctcccc caagaaaaaa     4980

cgcaaggtgg aagatcctaa gaaaaagcgg aaagtggacg gcagcggcgc caccaacttt     5040

agcttgctga aacaggctgg cgacgttgaa gagaatcccg ggcctttgat tttcgtgaaa     5100

acccttaccg ggaaaaccat caccctcgag gttgaaccct cggatacgat agaaaatgta     5160

aaggccaaga tccaggataa ggaaggaatt cctcctgatc agcagagact ggcctttgct     5220

ggcaaatcgc tggaagatgg acgtactttg tctgactaca atattctaaa ggactctaaa     5280

cttcatcctc tgttgagact tcgttgaacg cgtaaatgat tgcagatcca ctagttctag     5340

agctcgctga tcagcctcga ctgtgccttc tagttgccag ccatctgttg tttgcccctc     5400

ccccgtgcct tccttgaccc tggaaggtgc cactcccact gtcctttcct aataaaatga     5460

ggaaattgca tcgcattgtc tgagtaggtg tcattctatt ctggggggtg gggtggggca     5520

ggacagcaag ggggaggatt gggaagagaa tagcaggcat gctggggatg cggtgggctc     5580

tatggcttct gaggcggaaa gaaccagctg ggggcggccg caggaacccc tagtgatgga     5640

gttggccact ccctctctgc gcgctcgctc gctcactgag gccgggcgac caaaggtcgc     5700

ccgacgcccg ggctttgccc gggcggcctc agtgagcgag cgagcgcgca gctgcctgca     5760

ggggcgcctg atgcggtatt ttctccttac gcatctgtgc ggtatttcac accgcatacg     5820

tcaaagcaac catagtacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt     5880

acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc     5940

ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg ggggctccct     6000

ttagggttcc gatttagtgc tttacggcac ctcgacccca aaaaacttga tttgggtgat     6060

ggttcacgta gtgggccatc gccctgatag acggtttttc gccctttgac gttggagtcc     6120

acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc tatctcgggc     6180

tattcttttg atttataagg gattttgccg atttcggcct attggttaaa aaatgagctg     6240

atttaacaaa aatttaacgc gaattttaac aaaatattaa cgtttacaat tttatggtgc     6300

actctcagta caatctgctc tgatgccgca tagttaagcc agccccgaca cccgccaaca     6360

cccgctgacg cgccctgacg ggcttgtctg ctcccggcat ccgcttacag acaagctgtg     6420

accgtctccg ggagctgcat gtgtcagagg ttttcaccgt catcaccgaa acgcgcgaga     6480

cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct     6540

tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc     6600

taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa     6660

tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt     6720

gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct     6780

gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc     6840

cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta     6900

tgtggcgcgg tattatcccg tattgacgcc gggcaagagc aactcggtcg ccgcatacac     6960

tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc     7020

atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac     7080

ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg     7140

gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac     7200

gagcgtgaca ccacgatgcc tgtagcaatg gcaacaacgt tgcgcaaact attaactggc     7260

gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt     7320

gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga     7380

gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc     7440

cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag     7500

atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca     7560

tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc     7620

ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca     7680

gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc     7740

tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta     7800

ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt     7860

ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc     7920

gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg     7980

ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg     8040

tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag     8100

ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc     8160

agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat     8220

agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg     8280

gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc     8340

tggccttttg ctcacatgtc ctgcaggcag ctgcgcgctc gctcgctcac tgaggccgcc     8400

cgggcaaagc ccgggcgtcg ggcgaccttt ggtcgcccgg cctcagtgag cgagcgagcg     8460

cgcagagagg gagtggccaa ctccatcact aggggttcct gcggccgcaa ggtcgggcag     8520

gaagagggcc tatttcccat gattccttca tatttgcata tacgatacaa ggctgttaga     8580

gagataattg gaattaattt gactgtaaac acaaagatat tagtacaaaa tacgtgacgt     8640

agaaagtaat aatttcttgg gtagtttgca gttttaaaat tatgttttaa aatggactat     8700

catatgctta ccgtaacttg aaagtatttc gatttcttgg ctttatatat cttgtggaaa     8760

ggacgaaaca ccgggtcttc gagaagacct gtttaagagc tatgctggaa acagcatagc     8820

aagtttaaat aaggctagtc cgttatcaac ttgaaaaagt ggcaccgagt cggtgctttt     8880

tttgaattcg tttaaacggt acccgttaca taacttacgg taaatgg                   8927


<210>  25
<211>  7723
<212>  DNA
<213>  Artificial

<220>
<223>  Expression vector for Cas9

<400>  25
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaatagtaa cgccaatagg       60

gactttccat tgacgtcaat gggtggagta tttacggtaa actgcccact tggcagtaca      120

tcaagtgtat catatgccaa gtacgccccc tattgacgtc aatgacggta aatggcccgc      180

ctggcattgt gcccagtaca tgaccttatg ggactttcct acttggcagt acatctacgt      240

attagtcatc gctattacca tggtcgaggt gagccccacg ttctgcttca ctctccccat      300

ctcccccccc tccccacccc caattttgta tttatttatt ttttaattat tttgtgcagc      360

gatgggggcg gggggggggg gggggcgcgc gccaggcggg gcggggcggg gcgaggggcg      420

gggcggggcg aggcggagag gtgcggcggc agccaatcag agcggcgcgc tccgaaagtt      480

tccttttatg gcgaggcggc ggcggcggcg gccctataaa aagcgaagcg cgcggcgggc      540

gggagtcgct gcgacgctgc cttcgccccg tgccccgctc cgccgccgcc tcgcgccgcc      600

cgccccggct ctgactgacc gcgttactcc cacaggtgag cgggcgggac ggcccttctc      660

ctccgggctg taattagctg agcaagaggt aagggtttaa gggatggttg gttggtgggg      720

tattaatgtt taattacctg gagcacctgc ctgaaatcac tttttttcag gttggatcct      780

taattaataa tacgactcac tataggggcc gccaccatga agaggaacta catcctcggc      840

ctggacatcg gcatcacatc tgtcggctac ggcatcatcg actacgagac aagggacgtg      900

atcgacgctg gcgtgcggct gttcaaagag gccaacgtcg agaacaacga gggcagaaga      960

tccaagagag gcgccagaag gctgaagaga agaaggcggc acagaatcca gagagtgaag     1020

aagctgctgt tcgactacaa cctgctgacc gaccacagcg agctgagcgg catcaatcct     1080

tacgaggcca gagtgaaggg cctgagccag aagctgagcg aggaagagtt ctctgccgct     1140

ctgctgcacc tggctaaaag acggggagtg cacaacgtga acgaggtgga agaggacacc     1200

ggcaacgagc tgtccaccaa agagcagatc agcagaaaca gcaaggccct ggaagagaaa     1260

tacgtggccg agctgcaact ggaaaggctg aaaaaggacg gcgaagtgcg gggcagcatc     1320

aacagattca agaccagcga ctacgtgaaa gaggctaagc agctcctgaa ggtgcagaag     1380

gcttaccacc agctggacca gagcttcatc gacacctaca tcgacctgct ggaaaccaga     1440

aggacctact acgaaggacc tggcgagggc agcccttttg gctggaagga catcaaagaa     1500

tggtacgaga tgctgatggg ccactgcaca tacttccccg aggaactgag aagcgtgaag     1560

tacgcctaca acgccgacct gtacaacgcc ctgaacgacc tgaacaacct cgtgatcacc     1620

agggacgaga acgagaagct ggaatattac gagaagttcc agatcatcga gaacgtgttc     1680

aagcagaaga agaagcccac actgaagcag atcgccaaag agatcctcgt caacgaggaa     1740

gatattaagg gctacagagt gaccagcacc ggcaagcccg agttcaccaa tctgaaggtg     1800

taccacgaca tcaaggacat taccgctcgg aaagaaatca tcgaaaacgc tgagctgctg     1860

gaccaaatcg ccaagatcct gaccatctac cagagcagcg aggacattca agaagaactg     1920

accaacctga actccgagct gacccaagag gaaatcgagc agattagcaa cctgaaggga     1980

tacaccggca cacacaacct gagcctgaag gccatcaacc tgatcctgga cgagctgtgg     2040

cacaccaacg acaaccagat cgctatcttc aacaggctga agctggtgcc taagaaggtg     2100

gacctgtcac agcagaaaga gattcctaca acactggtgg acgacttcat cctgtctcca     2160

gtggtcaagc gcagcttcat ccagagcatc aaagtgatca acgccatcat caagaagtac     2220

ggcctgccta acgacatcat catcgagctg gctagagaga agaactccaa ggacgcccag     2280

aaaatgatca acgagatgca gaagagaaac cggcagacca acgagaggat cgaggaaatc     2340

atcagaacca ccggcaaaga gaacgccaag tacctgatcg agaagatcaa gctgcacgac     2400

atgcaagagg gcaagtgcct gtacagcctg gaagctatcc ctcttgagga cctgctgaac     2460

aatcccttca actatgaggt ggaccacatc atccccagaa gcgtgtcctt cgacaacagc     2520

ttcaacaaca aggtgctcgt gaagcaagaa gagaactcca agaagggcaa cagaacccca     2580

ttccagtacc tgagcagcag cgacagcaag atcagctacg agactttcaa gaagcacatc     2640

ctgaacctcg ccaaaggcaa gggccgcatc agcaagacca agaaagagta tctgctggaa     2700

gaacgggaca tcaacaggtt ctccgtgcag aaagacttca tcaaccggaa cctggtggac     2760

accagatacg ccacaagggg cctgatgaat ctgctgagaa gctacttccg cgtgaacaat     2820

ctggacgtga aagtcaagtc catcaacggc ggcttcacca gctttctgag aagaaagtgg     2880

aagtttaaga aagagcggaa caaggggtac aagcaccacg ccgaggacgc cctgatcatt     2940

gccaacgccg atttcatctt caaagagtgg aagaaactgg acaaggcaaa gaaagtgatg     3000

gaaaaccaga tgttcgagga aaagcaggcc gagagcatgc ccgagatcga gacagagcaa     3060

gagtacaaag aaatcttcat cacgccccac cagatcaagc acattaagga cttcaaggac     3120

tacaagtaca gccaccgcgt ggacaagaag cctaacagag agctgattaa cgacaccctg     3180

tactccacca gaaaggacga caagggaaac accctgatcg tcaacaacct gaatggcctg     3240

tacgacaagg acaacgacaa gctcaagaag ctgatcaaca agagccccga aaagctgctg     3300

atgtaccacc acgatcctca gacctaccag aaactgaagc tcatcatgga acagtacggc     3360

gacgagaaga accctctgta caagtactac gaggaaaccg ggaactacct gaccaagtac     3420

tccaaaaagg ataacggccc cgtgatcaag aagattaagt attacggcaa caagctgaac     3480

gcccacctgg acatcaccga cgactaccct aactccagaa acaaggtcgt gaagctgtcc     3540

ctgaagcctt acagattcga cgtgtacctg gacaacggcg tgtacaagtt cgtgaccgtg     3600

aagaacctgg atgtgatcaa aaaagaaaac tactatgaag tgaacagcaa gtgctatgag     3660

gaagccaaaa agctgaagaa gatcagcaac caggctgagt ttatcgcctc cttctacaac     3720

aacgatctga tcaagatcaa cggggagctg tatagagtga tcggagtgaa caacgacctg     3780

ctcaacagga tcgaagtgaa tatgatcgac atcacctacc gcgagtacct ggaaaacatg     3840

aacgacaaga ggccacctcg gatcattaag acaatcgcca gcaagacgca gagcattaag     3900

aagtacagca cagacatcct gggcaacctg tacgaagtga agtctaagaa gcacccgcag     3960

attatcaaga aaggcggatc cacaccgcct aagaaaaaga gaaaggtcga ggacggcgag     4020

ggcccagctg ccaaaagagt gaaactggat tccggagccg ctcctgccgc caagaagaaa     4080

aagctggatt acaaggacga cgatgacaag tgaacgcgta aatgattgca gatccactag     4140

ttctagagct cgctgatcag cctcgactgt gccttctagt tgccagccat ctgttgtttg     4200

cccctccccc gtgccttcct tgaccctgga aggtgccact cccactgtcc tttcctaata     4260

aaatgaggaa attgcatcgc attgtctgag taggtgtcat tctattctgg ggggtggggt     4320

ggggcaggac agcaaggggg aggattggga agagaatagc aggcatgctg gggatgcggt     4380

gggctctatg gcttctgagg cggaaagaac cagctggggg cggccgcagg aacccctagt     4440

gatggagttg gccactccct ctctgcgcgc tcgctcgctc actgaggccg ggcgaccaaa     4500

ggtcgcccga cgcccgggct ttgcccgggc ggcctcagtg agcgagcgag cgcgcagctg     4560

cctgcagggg cgcctgatgc ggtattttct ccttacgcat ctgtgcggta tttcacaccg     4620

catacgtcaa agcaaccata gtacgcgccc tgtagcggcg cattaagcgc ggcgggtgtg     4680

gtggttacgc gcagcgtgac cgctacactt gccagcgccc tagcgcccgc tcctttcgct     4740

ttcttccctt cctttctcgc cacgttcgcc ggctttcccc gtcaagctct aaatcggggg     4800

ctccctttag ggttccgatt tagtgcttta cggcacctcg accccaaaaa acttgatttg     4860

ggtgatggtt cacgtagtgg gccatcgccc tgatagacgg tttttcgccc tttgacgttg     4920

gagtccacgt tctttaatag tggactcttg ttccaaactg gaacaacact caaccctatc     4980

tcgggctatt cttttgattt ataagggatt ttgccgattt cggcctattg gttaaaaaat     5040

gagctgattt aacaaaaatt taacgcgaat tttaacaaaa tattaacgtt tacaatttta     5100

tggtgcactc tcagtacaat ctgctctgat gccgcatagt taagccagcc ccgacacccg     5160

ccaacacccg ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc ttacagacaa     5220

gctgtgaccg tctccgggag ctgcatgtgt cagaggtttt caccgtcatc accgaaacgc     5280

gcgagacgaa agggcctcgt gatacgccta tttttatagg ttaatgtcat gataataatg     5340

gtttcttaga cgtcaggtgg cacttttcgg ggaaatgtgc gcggaacccc tatttgttta     5400

tttttctaaa tacattcaaa tatgtatccg ctcatgagac aataaccctg ataaatgctt     5460

caataatatt gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc ccttattccc     5520

ttttttgcgg cattttgcct tcctgttttt gctcacccag aaacgctggt gaaagtaaaa     5580

gatgctgaag atcagttggg tgcacgagtg ggttacatcg aactggatct caacagcggt     5640

aagatccttg agagttttcg ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt     5700

ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc aagagcaact cggtcgccgc     5760

atacactatt ctcagaatga cttggttgag tactcaccag tcacagaaaa gcatcttacg     5820

gatggcatga cagtaagaga attatgcagt gctgccataa ccatgagtga taacactgcg     5880

gccaacttac ttctgacaac gatcggagga ccgaaggagc taaccgcttt tttgcacaac     5940

atgggggatc atgtaactcg ccttgatcgt tgggaaccgg agctgaatga agccatacca     6000

aacgacgagc gtgacaccac gatgcctgta gcaatggcaa caacgttgcg caaactatta     6060

actggcgaac tacttactct agcttcccgg caacaattaa tagactggat ggaggcggat     6120

aaagttgcag gaccacttct gcgctcggcc cttccggctg gctggtttat tgctgataaa     6180

tctggagccg gtgagcgtgg gtctcgcggt atcattgcag cactggggcc agatggtaag     6240

ccctcccgta tcgtagttat ctacacgacg gggagtcagg caactatgga tgaacgaaat     6300

agacagatcg ctgagatagg tgcctcactg attaagcatt ggtaactgtc agaccaagtt     6360

tactcatata tactttagat tgatttaaaa cttcattttt aatttaaaag gatctaggtg     6420

aagatccttt ttgataatct catgaccaaa atcccttaac gtgagttttc gttccactga     6480

gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt tctgcgcgta     6540

atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa     6600

gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat accaaatact     6660

gtccttctag tgtagccgta gttaggccac cacttcaaga actctgtagc accgcctaca     6720

tacctcgctc tgctaatcct gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt     6780

accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg ctgaacgggg     6840

ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag atacctacag     6900

cgtgagctat gagaaagcgc cacgcttccc gaagggagaa aggcggacag gtatccggta     6960

agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa cgcctggtat     7020

ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt gtgatgctcg     7080

tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc     7140

ttttgctggc cttttgctca catgtcctgc aggcagctgc gcgctcgctc gctcactgag     7200

gccgcccggg caaagcccgg gcgtcgggcg acctttggtc gcccggcctc agtgagcgag     7260

cgagcgcgca gagagggagt ggccaactcc atcactaggg gttcctgcgg ccgcaaggtc     7320

gggcaggaag agggcctatt tcccatgatt ccttcatatt tgcatatacg atacaaggct     7380

gttagagaga taattggaat taatttgact gtaaacacaa agatattagt acaaaatacg     7440

tgacgtagaa agtaataatt tcttgggtag tttgcagttt taaaattatg ttttaaaatg     7500

gactatcata tgcttaccgt aacttgaaag tatttcgatt tcttggcttt atatatcttg     7560

tggaaaggac gaaacaccgg gtcttcgaga agacctgtta tagtactctg gaaacagaat     7620

ctactataac aaggcaaaat gccgtgttta tctcgtcaac ttgttggcga gatttttttg     7680

aattcgttta aacggtaccc gttacataac ttacggtaaa tgg                       7723


<210>  26
<211>  8008
<212>  DNA
<213>  Artificial

<220>
<223>  Expression vector for Cas9

<400>  26
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaatagtaa cgccaatagg       60

gactttccat tgacgtcaat gggtggagta tttacggtaa actgcccact tggcagtaca      120

tcaagtgtat catatgccaa gtacgccccc tattgacgtc aatgacggta aatggcccgc      180

ctggcattgt gcccagtaca tgaccttatg ggactttcct acttggcagt acatctacgt      240

attagtcatc gctattacca tggtcgaggt gagccccacg ttctgcttca ctctccccat      300

ctcccccccc tccccacccc caattttgta tttatttatt ttttaattat tttgtgcagc      360

gatgggggcg gggggggggg gggggcgcgc gccaggcggg gcggggcggg gcgaggggcg      420

gggcggggcg aggcggagag gtgcggcggc agccaatcag agcggcgcgc tccgaaagtt      480

tccttttatg gcgaggcggc ggcggcggcg gccctataaa aagcgaagcg cgcggcgggc      540

gggagtcgct gcgacgctgc cttcgccccg tgccccgctc cgccgccgcc tcgcgccgcc      600

cgccccggct ctgactgacc gcgttactcc cacaggtgag cgggcgggac ggcccttctc      660

ctccgggctg taattagctg agcaagaggt aagggtttaa gggatggttg gttggtgggg      720

tattaatgtt taattacctg gagcacctgc ctgaaatcac tttttttcag gttggatcct      780

taattaataa tacgactcac tataggggcc gccaccatga agaggaacta catcctcggc      840

ctggacatcg gcatcacatc tgtcggctac ggcatcatcg actacgagac aagggacgtg      900

atcgacgctg gcgtgcggct gttcaaagag gccaacgtcg agaacaacga gggcagaaga      960

tccaagagag gcgccagaag gctgaagaga agaaggcggc acagaatcca gagagtgaag     1020

aagctgctgt tcgactacaa cctgctgacc gaccacagcg agctgagcgg catcaatcct     1080

tacgaggcca gagtgaaggg cctgagccag aagctgagcg aggaagagtt ctctgccgct     1140

ctgctgcacc tggctaaaag acggggagtg cacaacgtga acgaggtgga agaggacacc     1200

ggcaacgagc tgtccaccaa agagcagatc agcagaaaca gcaaggccct ggaagagaaa     1260

tacgtggccg agctgcaact ggaaaggctg aaaaaggacg gcgaagtgcg gggcagcatc     1320

aacagattca agaccagcga ctacgtgaaa gaggctaagc agctcctgaa ggtgcagaag     1380

gcttaccacc agctggacca gagcttcatc gacacctaca tcgacctgct ggaaaccaga     1440

aggacctact acgaaggacc tggcgagggc agcccttttg gctggaagga catcaaagaa     1500

tggtacgaga tgctgatggg ccactgcaca tacttccccg aggaactgag aagcgtgaag     1560

tacgcctaca acgccgacct gtacaacgcc ctgaacgacc tgaacaacct cgtgatcacc     1620

agggacgaga acgagaagct ggaatattac gagaagttcc agatcatcga gaacgtgttc     1680

aagcagaaga agaagcccac actgaagcag atcgccaaag agatcctcgt caacgaggaa     1740

gatattaagg gctacagagt gaccagcacc ggcaagcccg agttcaccaa tctgaaggtg     1800

taccacgaca tcaaggacat taccgctcgg aaagaaatca tcgaaaacgc tgagctgctg     1860

gaccaaatcg ccaagatcct gaccatctac cagagcagcg aggacattca agaagaactg     1920

accaacctga actccgagct gacccaagag gaaatcgagc agattagcaa cctgaaggga     1980

tacaccggca cacacaacct gagcctgaag gccatcaacc tgatcctgga cgagctgtgg     2040

cacaccaacg acaaccagat cgctatcttc aacaggctga agctggtgcc taagaaggtg     2100

gacctgtcac agcagaaaga gattcctaca acactggtgg acgacttcat cctgtctcca     2160

gtggtcaagc gcagcttcat ccagagcatc aaagtgatca acgccatcat caagaagtac     2220

ggcctgccta acgacatcat catcgagctg gctagagaga agaactccaa ggacgcccag     2280

aaaatgatca acgagatgca gaagagaaac cggcagacca acgagaggat cgaggaaatc     2340

atcagaacca ccggcaaaga gaacgccaag tacctgatcg agaagatcaa gctgcacgac     2400

atgcaagagg gcaagtgcct gtacagcctg gaagctatcc ctcttgagga cctgctgaac     2460

aatcccttca actatgaggt ggaccacatc atccccagaa gcgtgtcctt cgacaacagc     2520

ttcaacaaca aggtgctcgt gaagcaagaa gagaactcca agaagggcaa cagaacccca     2580

ttccagtacc tgagcagcag cgacagcaag atcagctacg agactttcaa gaagcacatc     2640

ctgaacctcg ccaaaggcaa gggccgcatc agcaagacca agaaagagta tctgctggaa     2700

gaacgggaca tcaacaggtt ctccgtgcag aaagacttca tcaaccggaa cctggtggac     2760

accagatacg ccacaagggg cctgatgaat ctgctgagaa gctacttccg cgtgaacaat     2820

ctggacgtga aagtcaagtc catcaacggc ggcttcacca gctttctgag aagaaagtgg     2880

aagtttaaga aagagcggaa caaggggtac aagcaccacg ccgaggacgc cctgatcatt     2940

gccaacgccg atttcatctt caaagagtgg aagaaactgg acaaggcaaa gaaagtgatg     3000

gaaaaccaga tgttcgagga aaagcaggcc gagagcatgc ccgagatcga gacagagcaa     3060

gagtacaaag aaatcttcat cacgccccac cagatcaagc acattaagga cttcaaggac     3120

tacaagtaca gccaccgcgt ggacaagaag cctaacagag agctgattaa cgacaccctg     3180

tactccacca gaaaggacga caagggaaac accctgatcg tcaacaacct gaatggcctg     3240

tacgacaagg acaacgacaa gctcaagaag ctgatcaaca agagccccga aaagctgctg     3300

atgtaccacc acgatcctca gacctaccag aaactgaagc tcatcatgga acagtacggc     3360

gacgagaaga accctctgta caagtactac gaggaaaccg ggaactacct gaccaagtac     3420

tccaaaaagg ataacggccc cgtgatcaag aagattaagt attacggcaa caagctgaac     3480

gcccacctgg acatcaccga cgactaccct aactccagaa acaaggtcgt gaagctgtcc     3540

ctgaagcctt acagattcga cgtgtacctg gacaacggcg tgtacaagtt cgtgaccgtg     3600

aagaacctgg atgtgatcaa aaaagaaaac tactatgaag tgaacagcaa gtgctatgag     3660

gaagccaaaa agctgaagaa gatcagcaac caggctgagt ttatcgcctc cttctacaac     3720

aacgatctga tcaagatcaa cggggagctg tatagagtga tcggagtgaa caacgacctg     3780

ctcaacagga tcgaagtgaa tatgatcgac atcacctacc gcgagtacct ggaaaacatg     3840

aacgacaaga ggccacctcg gatcattaag acaatcgcca gcaagacgca gagcattaag     3900

aagtacagca cagacatcct gggcaacctg tacgaagtga agtctaagaa gcacccgcag     3960

attatcaaga aaggcggatc cacaccgcct aagaaaaaga gaaaggtcga ggacggcgag     4020

ggcccagctg ccaaaagagt gaaactggat tccggagccg ctcctgccgc caagaagaaa     4080

aagctggatt acaaggacga cgatgacaag ggcagcggcg ccaccaactt tagcttgctg     4140

aaacaggctg gcgacgttga agagaatccc gggcctttga ttttcgtgaa aacccttacc     4200

gggaaaacca tcaccctcga ggttgaaccc tcggatacga tagaaaatgt aaaggccaag     4260

atccaggata aggaaggaat tcctcctgat cagcagagac tggcctttgc tggcaaatcg     4320

ctggaagatg gacgtacttt gtctgactac aatattctaa aggactctaa acttcatcct     4380

ctgttgagac ttcgttgaac gcgtaaatga ttgcagatcc actagttcta gagctcgctg     4440

atcagcctcg actgtgcctt ctagttgcca gccatctgtt gtttgcccct cccccgtgcc     4500

ttccttgacc ctggaaggtg ccactcccac tgtcctttcc taataaaatg aggaaattgc     4560

atcgcattgt ctgagtaggt gtcattctat tctggggggt ggggtggggc aggacagcaa     4620

gggggaggat tgggaagaga atagcaggca tgctggggat gcggtgggct ctatggcttc     4680

tgaggcggaa agaaccagct gggggcggcc gcaggaaccc ctagtgatgg agttggccac     4740

tccctctctg cgcgctcgct cgctcactga ggccgggcga ccaaaggtcg cccgacgccc     4800

gggctttgcc cgggcggcct cagtgagcga gcgagcgcgc agctgcctgc aggggcgcct     4860

gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatac gtcaaagcaa     4920

ccatagtacg cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc     4980

gtgaccgcta cacttgccag cgccctagcg cccgctcctt tcgctttctt cccttccttt     5040

ctcgccacgt tcgccggctt tccccgtcaa gctctaaatc gggggctccc tttagggttc     5100

cgatttagtg ctttacggca cctcgacccc aaaaaacttg atttgggtga tggttcacgt     5160

agtgggccat cgccctgata gacggttttt cgccctttga cgttggagtc cacgttcttt     5220

aatagtggac tcttgttcca aactggaaca acactcaacc ctatctcggg ctattctttt     5280

gatttataag ggattttgcc gatttcggcc tattggttaa aaaatgagct gatttaacaa     5340

aaatttaacg cgaattttaa caaaatatta acgtttacaa ttttatggtg cactctcagt     5400

acaatctgct ctgatgccgc atagttaagc cagccccgac acccgccaac acccgctgac     5460

gcgccctgac gggcttgtct gctcccggca tccgcttaca gacaagctgt gaccgtctcc     5520

gggagctgca tgtgtcagag gttttcaccg tcatcaccga aacgcgcgag acgaaagggc     5580

ctcgtgatac gcctattttt ataggttaat gtcatgataa taatggtttc ttagacgtca     5640

ggtggcactt ttcggggaaa tgtgcgcgga acccctattt gtttattttt ctaaatacat     5700

tcaaatatgt atccgctcat gagacaataa ccctgataaa tgcttcaata atattgaaaa     5760

aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt     5820

tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag     5880

ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt     5940

tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg     6000

gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca ctattctcag     6060

aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta     6120

agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg     6180

acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta     6240

actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac     6300

accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt     6360

actctagctt cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca     6420

cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag     6480

cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta     6540

gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag     6600

ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt     6660

tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat     6720

aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta     6780

gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa     6840

acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt     6900

tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag     6960

ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta     7020

atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca     7080

agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag     7140

cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa     7200

agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga     7260

acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc     7320

gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc     7380

ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt     7440

gctcacatgt cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcaaag     7500

cccgggcgtc gggcgacctt tggtcgcccg gcctcagtga gcgagcgagc gcgcagagag     7560

ggagtggcca actccatcac taggggttcc tgcggccgca aggtcgggca ggaagagggc     7620

ctatttccca tgattccttc atatttgcat atacgataca aggctgttag agagataatt     7680

ggaattaatt tgactgtaaa cacaaagata ttagtacaaa atacgtgacg tagaaagtaa     7740

taatttcttg ggtagtttgc agttttaaaa ttatgtttta aaatggacta tcatatgctt     7800

accgtaactt gaaagtattt cgatttcttg gctttatata tcttgtggaa aggacgaaac     7860

accgggtctt cgagaagacc tgttatagta ctctggaaac agaatctact ataacaaggc     7920

aaaatgccgt gtttatctcg tcaacttgtt ggcgagattt ttttgaattc gtttaaacgg     7980

tacccgttac ataacttacg gtaaatgg                                        8008


<210>  27
<211>  6169
<212>  DNA
<213>  Artificial

<220>
<223>  Expression vector for Flp recombinase

<400>  27
ggcgcgccgg attcgacatt gattattgac tagttattaa tagtaatcaa ttacggggtc       60

attagttcat agcccatata tggagttccg cgttacataa cttacggtaa atggcccgcc      120

tggctgaccg cccaacgacc cccgcccatt gacgtcaata atgacgtatg ttcccatagt      180

aacgccaata gggactttcc attgacgtca atgggtggag tatttacggt aaactgccca      240

cttggcagta catcaagtgt atcatatgcc aagtacgccc cctattgacg tcaatgacgg      300

taaatggccc gcctggcatt atgcccagta catgacctta tgggactttc ctacttggca      360

gtacatctac gtattagtca tcgctattac catggtcgag gtgagcccca cgttctgctt      420

cactctcccc atctcccccc cctccccacc cccaattttg tatttattta ttttttaatt      480

attttgtgca gcgatggggg cggggggggg gggggggcgc gcgccaggcg gggcggggcg      540

gggcgagggg cggggcgggg cgaggcggag aggtgcggcg gcagccaatc agagcggcgc      600

gctccgaaag tttcctttta tggcgaggcg gcggcggcgg cggccctata aaaagcgaag      660

cgcgcggcgg gcgggagtcg ctgcgtcgcg ccttcgcccc gtgccccgct ccgccgccgc      720

ctcgcgccgc ccgccccggc tctgactgac cgcgttactc ccacaggtga gcgggcggga      780

cggcccttct cctccgggct gtaattagcg cttggtttaa tgacggctcg tttcttttct      840

gtggctgcgt gaaagcctta aagggctccg ggagggccct ttgtgcgggg gggagcggct      900

cggggggtgc gtgcgtgtgt gtgtgcgtgg ggagcgccgc gtgcggcccg cgctgcccgg      960

cggctgtgag cgctgcgggc gcggcgcggg gctttgtgcg ctccgcgtgt gcgcgagggg     1020

agcgcggccg ggggcggtgc cccgcggtgc gggggggctg cgaggggaac aaaggctgcg     1080

tgcggggtgt gtgcgtgggg gggtgagcag ggggtgtggg cgcggcggtc gggctgtaac     1140

ccccccctgc acccccctcc ccgagttgct gagcacggcc cggcttcggg tgcggggctc     1200

cgtgcggggc gtggcgcggg gctcgccgtg ccgggcgggg ggtggcggca ggtgggggtg     1260

ccgggcgggg cggggccgcc tcgggccggg gagggctcgg gggaggggcg cggcggcccc     1320

ggagcgccgg cggctgtcga ggcgcggcga gccgcagcca ttgcctttta tggtaatcgt     1380

gcgagagggc gcagggactt cctttgtccc aaatctggcg gagccgaaat ctgggaggcg     1440

ccgccgcacc ccctctagcg ggcgcgggcg aagcggtgcg gcgccggcag gaaggaaatg     1500

ggcggggagg gccttcgtgc gtcgccgcgc cgccgtcccc ttctccatct ccagcctcgg     1560

ggctgccgca gggggacggc tgccttcggg ggggacgggg cagggcgggg ttcggcttct     1620

ggcgtgtgac cggcggctct agagcctctg ctaaccatgt tcatgccttc ttctttttcc     1680

tacagatcct taattaataa tacgactcac tataggggcc gccaccatga gccagttcga     1740

catcctgtgc aagacccctc caaaggtgct cgtgcggcag ttcgtggaaa gattcgagag     1800

gcctagcggc gagaagatcg cctcttgtgc tgccgagctg acctacctgt gctggatgat     1860

cacccacaac ggcaccgcca tcaagagggc caccttcatg agctacaaca ccatcatcag     1920

caacagcctg agcttcgaca tcgtgaacaa gagcctccag ttcaagtaca agacccagaa     1980

ggctaccatc ctggaagcca gcctgaagaa gctgatcccc gcctgggagt tcacaatcat     2040

cccttacaac ggccagaagc accagagcga catcacagac atcgtgtcca gcctccagct     2100

ccagttcgag tctagcgagg aagccgacaa gggcaacagc cacagcaaga agatgctgaa     2160

ggccctgctg agcgagggcg agtctatctg ggagatcaca gagaagatcc tgaacagctt     2220

cgagtacacc agccggttca ccaagacaaa gaccctgtac cagttcctgt tcctggctac     2280

cttcatcaac tgcggcagat tctccgacat caagaacgtg gaccccaaga gcttcaagct     2340

ggtgcagaac aagtacctgg gcgtgatcat tcagtgcctc gtgaccgaga ctaagaccag     2400

cgtgtccaga cacatctact ttttcagcgc cagaggcaga atcgaccctc tggtgtacct     2460

ggacgagttc ctgagaaaca gcgagcccgt gctgaagaga gtgaacagaa ccggcaacag     2520

cagctccaac aagcaagagt accagctgct gaaggacaac ctcgtgcggt cctacaacaa     2580

ggctctgaag aagaacgccc cgtatcctat cttcgccatt aagaacggcc ctaagagcca     2640

catcggcaga cacctgatga ccagctttct gagcatgaag ggcctgacag agctgaccaa     2700

cgtcgtcggc aattggagcg ataagagagc ctctgccgtc gccagaacca cctacacaca     2760

ccagatcaca gctatccccg accactactt cgccctggtg tctaggtact acgcctacga     2820

tcccatcagc aaagagatga tcgccctgaa ggacgagaca aaccccatcg aggaatggca     2880

gcacatcgag cagctgaagg gatctgccga gggcagcatc agataccctg cttggaacgg     2940

catcatctcc caagaggtgc tggactacct gagcagctac atcaacagaa gaatcggcgg     3000

cagcggcgga tcccctgctg ctaaaagagt gaagctggac tccggatgaa cgcgtaaatg     3060

attgcagatc cactagttct agagctcgct gatcagcctc gactgtgcct tctagttgcc     3120

agccatctgt tgtttgcccc tcccccgtgc cttccttgac cctggaaggt gccactccca     3180

ctgtcctttc ctaataaaat gaggaaattg catcgcattg tctgagtagg tgtcattcta     3240

ttctgggggg tggggtgggg caggacagca agggggagga ttgggaagac aatagcaggc     3300

atgctgggga tgcggtgggc tctatggctt ctgaggcgga aagaaccagc tggggctcga     3360

gatccactag ttctagcctc gaggctagag cggccgccac tggccgtcgt tttacaacgt     3420

cgtgactggg aaaaccctgg cgttacccaa cttaatcgcc ttgcagcaca tccccctttc     3480

gccagctggc gtaatagcga agaggcccgc accgatcgcc cttcccaaca gttgcgcagc     3540

ctgaatggcg aatgggacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt     3600

acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc     3660

ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg ggggctccct     3720

ttagggttcc gatttagtgc tttacggcac ctcgacccca aaaaacttga ttagggtgat     3780

ggttcacgta gtgggccatc gccctgatag acggtttttc gccctttgac gttggagtcc     3840

acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc tatctcggtc     3900

tattcttttg atttataagg gattttgccg atttcggcct attggttaaa aaatgagctg     3960

atttaacaaa aatttaacgc gaattttaac aaaatattaa cgcttacaat ttaggtggca     4020

cttttcgggg aaatgtgcgc ggaaccccta tttgtttatt tttctaaata cattcaaata     4080

tgtatccgct catgagacaa taaccctgat aaatgcttca ataatattga aaaaggaaga     4140

gtatgagtat tcaacatttc cgtgtcgccc ttattccctt ttttgcggca ttttgccttc     4200

ctgtttttgc tcacccagaa acgctggtga aagtaaaaga tgctgaagat cagttgggtg     4260

cacgagtggg ttacatcgaa ctggatctca acagcggtaa gatccttgag agttttcgcc     4320

ccgaagaacg ttttccaatg atgagcactt ttaaagttct gctatgtggc gcggtattat     4380

cccgtattga cgccgggcaa gagcaactcg gtcgccgcat acactattct cagaatgact     4440

tggttgagta ctcaccagtc acagaaaagc atcttacgga tggcatgaca gtaagagaat     4500

tatgcagtgc tgccataacc atgagtgata acactgcggc caacttactt ctgacaacga     4560

tcggaggacc gaaggagcta accgcttttt tgcacaacat gggggatcat gtaactcgcc     4620

ttgatcgttg ggaaccggag ctgaatgaag ccataccaaa cgacgagcgt gacaccacga     4680

tgcctgtagc aatggcaaca acgttgcgca aactattaac tggcgaacta cttactctag     4740

cttcccggca acaattaata gactggatgg aggcggataa agttgcagga ccacttctgc     4800

gctcggccct tccggctggc tggtttattg ctgataaatc tggagccggt gagcgtgggt     4860

ctcgcggtat cattgcagca ctggggccag atggtaagcc ctcccgtatc gtagttatct     4920

acacgacggg gagtcaggca actatggatg aacgaaatag acagatcgct gagataggtg     4980

cctcactgat taagcattgg taactgtcag accaagttta ctcatatata ctttagattg     5040

atttaaaact tcatttttaa tttaaaagga tctaggtgaa gatccttttt gataatctca     5100

tgaccaaaat cccttaacgt gagttttcgt tccactgagc gtcagacccc gtagaaaaga     5160

tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa     5220

aaccaccgct accagcggtg gtttgtttgc cggatcaaga gctaccaact ctttttccga     5280

aggtaactgg cttcagcaga gcgcagatac caaatactgt ccttctagtg tagccgtagt     5340

taggccacca cttcaagaac tctgtagcac cgcctacata cctcgctctg ctaatcctgt     5400

taccagtggc tgctgccagt ggcgataagt cgtgtcttac cgggttggac tcaagacgat     5460

agttaccgga taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct     5520

tggagcgaac gacctacacc gaactgagat acctacagcg tgagctatga gaaagcgcca     5580

cgcttcccga agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag     5640

agcgcacgag ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc     5700

gccacctctg acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga     5760

aaaacgccag caacgcggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca     5820

tgttctttcc tgcgttatcc cctgattctg tggataaccg tattaccgcc tttgagtgag     5880

ctgataccgc tcgccgcagc cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg     5940

aagagcgccc aatacgcaaa ccgcctctcc ccgcgcgttg gccgattcat taatgcagct     6000

ggcacgacag gtttcccgac tggaaagcgg gcagtgagcg caacgcaatt aatgtgagtt     6060

agctcactca ttaggcaccc caggctttac actttatgct tccggctcgt atgttgtgtg     6120

gaattgtgag cggataacaa tttcacacag gaaacagcta tgaccatga                 6169


<210>  28
<211>  5917
<212>  DNA
<213>  Artificial

<220>
<223>  Expression vector for Cre recombinase

<400>  28
ggcgcgccgg attcgacatt gattattgac tagttattaa tagtaatcaa ttacggggtc       60

attagttcat agcccatata tggagttccg cgttacataa cttacggtaa atggcccgcc      120

tggctgaccg cccaacgacc cccgcccatt gacgtcaata atgacgtatg ttcccatagt      180

aacgccaata gggactttcc attgacgtca atgggtggag tatttacggt aaactgccca      240

cttggcagta catcaagtgt atcatatgcc aagtacgccc cctattgacg tcaatgacgg      300

taaatggccc gcctggcatt atgcccagta catgacctta tgggactttc ctacttggca      360

gtacatctac gtattagtca tcgctattac catggtcgag gtgagcccca cgttctgctt      420

cactctcccc atctcccccc cctccccacc cccaattttg tatttattta ttttttaatt      480

attttgtgca gcgatggggg cggggggggg gggggggcgc gcgccaggcg gggcggggcg      540

gggcgagggg cggggcgggg cgaggcggag aggtgcggcg gcagccaatc agagcggcgc      600

gctccgaaag tttcctttta tggcgaggcg gcggcggcgg cggccctata aaaagcgaag      660

cgcgcggcgg gcgggagtcg ctgcgtcgcg ccttcgcccc gtgccccgct ccgccgccgc      720

ctcgcgccgc ccgccccggc tctgactgac cgcgttactc ccacaggtga gcgggcggga      780

cggcccttct cctccgggct gtaattagcg cttggtttaa tgacggctcg tttcttttct      840

gtggctgcgt gaaagcctta aagggctccg ggagggccct ttgtgcgggg gggagcggct      900

cggggggtgc gtgcgtgtgt gtgtgcgtgg ggagcgccgc gtgcggcccg cgctgcccgg      960

cggctgtgag cgctgcgggc gcggcgcggg gctttgtgcg ctccgcgtgt gcgcgagggg     1020

agcgcggccg ggggcggtgc cccgcggtgc gggggggctg cgaggggaac aaaggctgcg     1080

tgcggggtgt gtgcgtgggg gggtgagcag ggggtgtggg cgcggcggtc gggctgtaac     1140

ccccccctgc acccccctcc ccgagttgct gagcacggcc cggcttcggg tgcggggctc     1200

cgtgcggggc gtggcgcggg gctcgccgtg ccgggcgggg ggtggcggca ggtgggggtg     1260

ccgggcgggg cggggccgcc tcgggccggg gagggctcgg gggaggggcg cggcggcccc     1320

ggagcgccgg cggctgtcga ggcgcggcga gccgcagcca ttgcctttta tggtaatcgt     1380

gcgagagggc gcagggactt cctttgtccc aaatctggcg gagccgaaat ctgggaggcg     1440

ccgccgcacc ccctctagcg ggcgcgggcg aagcggtgcg gcgccggcag gaaggaaatg     1500

ggcggggagg gccttcgtgc gtcgccgcgc cgccgtcccc ttctccatct ccagcctcgg     1560

ggctgccgca gggggacggc tgccttcggg ggggacgggg cagggcgggg ttcggcttct     1620

ggcgtgtgac cggcggctct agagcctctg ctaaccatgt tcatgccttc ttctttttcc     1680

tacagatcct taattaataa tacgactcac tataggggcc gccaccatga gcaacctgct     1740

gaccgtgcac cagaacctgc ctgctctgcc tgtggacgcc acatctgatg aagtgcggaa     1800

gaacctgatg gacatgttca gagacagaca ggccttcagc gagcacacct ggaagatgct     1860

gctgagcgtg tgtagaagct gggccgcttg gtgcaagctg aacaacagaa agtggttccc     1920

cgccgagcct gaggacgtgc gagattacct gctgtacctg caagctagag gcctggccgt     1980

gaaaaccatc cagcagcacc tgggccagct gaacatgctg cacagaagaa gcggcctgcc     2040

tagacctagc gacagcaacg ctgtgtccct ggtcatgaga aggattcgga aagaaaacgt     2100

ggacgctggc gagagagcta agcaggctct ggccttcgag agaaccgact tcgatcaagt     2160

gcgcagcctg atggaaaaca gcgacagatg ccaggatatt cggaacctgg ccttcctggg     2220

aatcgcctac aacaccctgc tgagaatcgc cgagatcgcc agaatcagag tgaaggacat     2280

cagcagaacc gacggcggca gaatgctgat ccacatcggc agaacaaaga ccctggtgtc     2340

cacagctggc gtcgagaagg ctctgagtct gggcgtgaca aagctggtgg aaagatggat     2400

cagcgtgtcc ggcgtggccg acgatcctaa caactacctg ttctgtcgcg tgcgcaagaa     2460

cggcgtggca gctccttctg ctacaagcca gctgagcaca agagccctgg aaggcatctt     2520

cgaggccaca cacagactga tctacggcgc caaggatgac agcggccaga gataccttgc     2580

ttggagcggc cacagtgcta gagtgggcgc tgctagagac atggctagag caggcgtgtc     2640

aatccccgag atcatgcaag ctggcggctg gaccaacgtg aacatcgtga tgaactacat     2700

ccgcaacctg gacagcgaga caggcgctat ggttcgactg cttgaagatg gcgacggtgg     2760

atccggtcct gccgctaaga gagtgaagct ggactgaacg cgtaaatgat tgcagatcca     2820

ctagttctag agctcgctga tcagcctcga ctgtgccttc tagttgccag ccatctgttg     2880

tttgcccctc ccccgtgcct tccttgaccc tggaaggtgc cactcccact gtcctttcct     2940

aataaaatga ggaaattgca tcgcattgtc tgagtaggtg tcattctatt ctggggggtg     3000

gggtggggca ggacagcaag ggggaggatt gggaagacaa tagcaggcat gctggggatg     3060

cggtgggctc tatggcttct gaggcggaaa gaaccagctg gggctcgaga tccactagtt     3120

ctagcctcga ggctagagcg gccgccactg gccgtcgttt tacaacgtcg tgactgggaa     3180

aaccctggcg ttacccaact taatcgcctt gcagcacatc cccctttcgc cagctggcgt     3240

aatagcgaag aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa     3300

tgggacgcgc cctgtagcgg cgcattaagc gcggcgggtg tggtggttac gcgcagcgtg     3360

accgctacac ttgccagcgc cctagcgccc gctcctttcg ctttcttccc ttcctttctc     3420

gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt agggttccga     3480

tttagtgctt tacggcacct cgaccccaaa aaacttgatt agggtgatgg ttcacgtagt     3540

gggccatcgc cctgatagac ggtttttcgc cctttgacgt tggagtccac gttctttaat     3600

agtggactct tgttccaaac tggaacaaca ctcaacccta tctcggtcta ttcttttgat     3660

ttataaggga ttttgccgat ttcggcctat tggttaaaaa atgagctgat ttaacaaaaa     3720

tttaacgcga attttaacaa aatattaacg cttacaattt aggtggcact tttcggggaa     3780

atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca     3840

tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc     3900

aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc     3960

acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt     4020

acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt     4080

ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg     4140

ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact     4200

caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg     4260

ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga     4320

aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg     4380

aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa     4440

tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac     4500

aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc     4560

cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca     4620

ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga     4680

gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta     4740

agcattggta actgtcagac caagtttact catatatact ttagattgat ttaaaacttc     4800

atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc     4860

cttaacgtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt     4920

cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac     4980

cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct     5040

tcagcagagc gcagatacca aatactgtcc ttctagtgta gccgtagtta ggccaccact     5100

tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg     5160

ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata     5220

aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga     5280

cctacaccga actgagatac ctacagcgtg agctatgaga aagcgccacg cttcccgaag     5340

ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg     5400

agcttccagg gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac     5460

ttgagcgtcg atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca     5520

acgcggcctt tttacggttc ctggcctttt gctggccttt tgctcacatg ttctttcctg     5580

cgttatcccc tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc     5640

gccgcagccg aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa     5700

tacgcaaacc gcctctcccc gcgcgttggc cgattcatta atgcagctgg cacgacaggt     5760

ttcccgactg gaaagcgggc agtgagcgca acgcaattaa tgtgagttag ctcactcatt     5820

aggcacccca ggctttacac tttatgcttc cggctcgtat gttgtgtgga attgtgagcg     5880

gataacaatt tcacacagga aacagctatg accatga                              5917


<210>  29
<211>  32
<212>  PRT
<213>  Artificial

<220>
<223>  synthetic coiled-coil domain

<400>  29

Pro Glu Asp Glu Leu Ala Ala Asn Glu Glu Glu Leu Gln Gln Asn Glu 
1               5                   10                  15      


Gln Lys Leu Ala Gln Ile Lys Gln Lys Leu Gln Ala Ile Lys Tyr Gly 
            20                  25                  30          


<210>  30
<211>  28
<212>  PRT
<213>  Artificial

<220>
<223>  synthetic coiled-coil domain

<400>  30

Glu Ile Gln Gln Leu Glu Glu Glu Ile Ala Gln Leu Glu Gln Lys Asn 
1               5                   10                  15      


Ala Ala Leu Lys Glu Lys Asn Gln Ala Leu Lys Tyr 
            20                  25              


<210>  31
<211>  28
<212>  PRT
<213>  Artificial

<220>
<223>  synthetic coiled-coil domain

<400>  31

Lys Leu Gln Ala Ile Lys Tyr Glu Leu Ala Gln Asn Glu Glu Glu Leu 
1               5                   10                  15      


Ala Gln Ile Glu Glu Lys Leu Ala Ala Asn Lys Glu 
            20                  25              


<210>  32
<211>  28
<212>  PRT
<213>  Artificial

<220>
<223>  synthetic coiled-coil domain

<400>  32

Glu Asn Ala Ala Leu Glu Glu Lys Ile Ala Gln Leu Lys Gln Lys Asn 
1               5                   10                  15      


Ala Ala Leu Lys Glu Glu Ile Gln Ala Leu Glu Tyr 
            20                  25              


<210>  33
<211>  550
<212>  PRT
<213>  Artificial

<220>
<223>  Firefly luciferase-x5

<400>  33

Met Glu Asp Ala Lys Asn Ile Lys Lys Gly Pro Ala Pro Arg Tyr Pro 
1               5                   10                  15      


Leu Glu Asp Gly Thr Ala Gly Glu Gln Leu His Lys Ala Met Lys Arg 
            20                  25                  30          


Tyr Ala Gln Val Pro Gly Thr Ile Ala Phe Thr Asp Ala His Ile Glu 
        35                  40                  45              


Val Asn Ile Thr Tyr Ala Glu Tyr Phe Glu Met Ser Val Arg Leu Ala 
    50                  55                  60                  


Glu Ala Met Lys Arg Tyr Gly Leu Asn Thr Asn His Arg Ile Val Val 
65                  70                  75                  80  


Cys Ser Glu Asn Ser Leu Gln Phe Phe Met Pro Val Leu Gly Ala Leu 
                85                  90                  95      


Phe Ile Gly Val Ala Val Ala Pro Ala Asn Asp Ile Tyr Asn Glu Arg 
            100                 105                 110         


Glu Leu Leu Asn Ser Met Asn Ile Ser Gln Pro Thr Val Val Phe Val 
        115                 120                 125             


Ser Lys Lys Gly Leu Gln Lys Ile Leu Asn Val Gln Lys Lys Leu Pro 
    130                 135                 140                 


Ile Ile Gln Lys Ile Ile Ile Met Asp Ser Lys Thr Asp Tyr Gln Gly 
145                 150                 155                 160 


Phe Gln Ser Met Tyr Thr Phe Val Thr Ser His Leu Pro Pro Gly Phe 
                165                 170                 175     


Asn Glu Tyr Asp Phe Lys Pro Glu Ser Phe Asp Arg Asp Lys Thr Ile 
            180                 185                 190         


Ala Leu Ile Met Asn Ser Ser Gly Ser Thr Gly Leu Pro Lys Gly Val 
        195                 200                 205             


Ala Leu Pro His Arg Thr Ala Cys Val Arg Phe Ser His Ala Arg Asp 
    210                 215                 220                 


Pro Ile Phe Gly Asn Gln Ile Lys Pro Asp Thr Ala Ile Leu Ser Val 
225                 230                 235                 240 


Val Pro Phe His His Gly Phe Gly Met Phe Thr Thr Leu Gly Tyr Leu 
                245                 250                 255     


Ile Cys Gly Phe Arg Val Val Leu Met Tyr Arg Phe Glu Glu Glu Leu 
            260                 265                 270         


Phe Leu Arg Ser Leu Gln Asp Tyr Lys Ile Gln Ser Ala Leu Leu Val 
        275                 280                 285             


Pro Thr Leu Phe Ser Phe Phe Ala Lys Ser Thr Leu Ile Asp Lys Tyr 
    290                 295                 300                 


Asp Leu Ser Asn Leu His Glu Ile Ala Ser Gly Gly Ala Pro Leu Ser 
305                 310                 315                 320 


Lys Glu Val Gly Glu Ala Val Ala Lys Arg Phe His Leu Pro Gly Ile 
                325                 330                 335     


Arg Gln Gly Tyr Gly Leu Thr Glu Thr Thr Ser Ala Ile Leu Ile Thr 
            340                 345                 350         


Pro Glu Gly Asp Asp Lys Pro Gly Ala Val Gly Lys Val Val Pro Phe 
        355                 360                 365             


Phe Glu Ala Lys Val Val Asp Leu Asp Thr Gly Lys Thr Leu Gly Val 
    370                 375                 380                 


Asn Gln Arg Gly Glu Leu Cys Val Arg Gly Pro Met Ile Met Ser Gly 
385                 390                 395                 400 


Tyr Val Asn Asn Pro Glu Ala Thr Asn Ala Leu Ile Asp Lys Asp Gly 
                405                 410                 415     


Trp Leu His Ser Gly Asp Ile Ala Tyr Trp Asp Glu Asp Glu His Phe 
            420                 425                 430         


Phe Ile Val Asp Arg Leu Lys Ser Leu Ile Lys Tyr Lys Gly Tyr Gln 
        435                 440                 445             


Val Ala Pro Ala Glu Leu Glu Ser Ile Leu Leu Gln His Pro Asn Ile 
    450                 455                 460                 


Arg Asp Ala Gly Val Ala Gly Leu Pro Asp Asp Asp Ala Gly Glu Leu 
465                 470                 475                 480 


Pro Ala Ala Val Val Val Leu Glu His Gly Lys Thr Met Thr Glu Lys 
                485                 490                 495     


Glu Ile Val Asp Tyr Val Ala Ser Gln Val Thr Thr Ala Lys Lys Leu 
            500                 505                 510         


Arg Gly Gly Val Val Phe Val Asp Glu Val Pro Lys Gly Leu Thr Gly 
        515                 520                 525             


Lys Leu Asp Ala Arg Lys Ile Arg Glu Ile Leu Ile Lys Ala Lys Lys 
    530                 535                 540                 


Gly Gly Lys Ile Ala Val 
545                 550 


<210>  34
<211>  170
<212>  PRT
<213>  Artificial

<220>
<223>  NanoLuc

<400>  34

Val Phe Thr Leu Glu Asp Phe Val Gly Asp Trp Arg Gln Thr Ala Gly 
1               5                   10                  15      


Tyr Asn Leu Asp Gln Val Leu Glu Gln Gly Gly Val Ser Ser Leu Phe 
            20                  25                  30          


Gln Asn Leu Gly Val Ser Val Thr Pro Ile Gln Arg Ile Val Leu Ser 
        35                  40                  45              


Gly Glu Asn Gly Leu Lys Ile Asp Ile His Val Ile Ile Pro Tyr Glu 
    50                  55                  60                  


Gly Leu Ser Gly Asp Gln Met Gly Gln Ile Glu Lys Ile Phe Lys Val 
65                  70                  75                  80  


Val Tyr Pro Val Asp Asp His His Phe Lys Val Ile Leu His Tyr Gly 
                85                  90                  95      


Thr Leu Val Ile Asp Gly Val Thr Pro Asn Met Ile Asp Tyr Phe Gly 
            100                 105                 110         


Arg Pro Tyr Glu Gly Ile Ala Val Phe Asp Gly Lys Lys Ile Thr Val 
        115                 120                 125             


Thr Gly Thr Leu Trp Asn Gly Asn Lys Ile Ile Asp Glu Arg Leu Ile 
    130                 135                 140                 


Asn Pro Asp Gly Ser Leu Leu Phe Arg Val Thr Ile Asn Gly Val Thr 
145                 150                 155                 160 


Gly Trp Arg Leu Cys Glu Arg Ile Leu Ala 
                165                 170 


<210>  35
<211>  131
<212>  PRT
<213>  Artificial

<220>
<223>  blasticidin-S-deaminase

<400>  35

Ala Lys Pro Leu Ser Gln Glu Glu Ser Thr Leu Ile Glu Arg Ala Thr 
1               5                   10                  15      


Ala Thr Ile Asn Ser Ile Pro Ile Ser Glu Asp Tyr Ser Val Ala Ser 
            20                  25                  30          


Ala Ala Leu Ser Ser Asp Gly Arg Ile Phe Thr Gly Val Asn Val Tyr 
        35                  40                  45              


His Phe Thr Gly Gly Pro Cys Ala Glu Leu Val Val Leu Gly Thr Ala 
    50                  55                  60                  


Ala Ala Ala Ala Ala Gly Asn Leu Thr Cys Ile Val Ala Ile Gly Asn 
65                  70                  75                  80  


Glu Asn Arg Gly Ile Leu Ser Pro Cys Gly Arg Cys Arg Gln Val Leu 
                85                  90                  95      


Leu Asp Leu His Pro Gly Ile Lys Ala Ile Val Lys Asp Ser Asp Gly 
            100                 105                 110         


Gln Pro Thr Ala Val Gly Ile Arg Glu Leu Leu Pro Ser Gly Tyr Val 
        115                 120                 125             


Trp Glu Gly 
    130     


<210>  36
<211>  295
<212>  PRT
<213>  Artificial

<220>
<223>  HaloTag

<400>  36

Glu Ile Gly Thr Gly Phe Pro Phe Asp Pro His Tyr Val Glu Val Leu 
1               5                   10                  15      


Gly Glu Arg Met His Tyr Val Asp Val Gly Pro Arg Asp Gly Thr Pro 
            20                  25                  30          


Val Leu Phe Leu His Gly Asn Pro Thr Ser Ser Tyr Val Trp Arg Asn 
        35                  40                  45              


Ile Ile Pro His Val Ala Pro Thr His Arg Val Ile Ala Pro Asp Leu 
    50                  55                  60                  


Ile Gly Met Gly Lys Ser Asp Lys Pro Asp Leu Gly Tyr Phe Phe Asp 
65                  70                  75                  80  


Asp His Val Arg Phe Met Asp Ala Phe Ile Glu Ala Leu Gly Leu Glu 
                85                  90                  95      


Glu Val Val Leu Val Ile His Asp Trp Gly Ser Ala Leu Gly Phe His 
            100                 105                 110         


Trp Ala Lys Arg Asn Pro Glu Arg Val Lys Gly Ile Ala Phe Met Glu 
        115                 120                 125             


Phe Ile Arg Pro Ile Pro Thr Trp Asp Glu Trp Pro Glu Phe Ala Arg 
    130                 135                 140                 


Glu Thr Phe Gln Ala Phe Arg Thr Thr Asp Val Gly Arg Lys Leu Ile 
145                 150                 155                 160 


Ile Asp Gln Asn Val Phe Ile Glu Gly Thr Leu Pro Met Gly Val Val 
                165                 170                 175     


Arg Pro Leu Thr Glu Val Glu Met Asp His Tyr Arg Glu Pro Phe Leu 
            180                 185                 190         


Asn Pro Val Asp Arg Glu Pro Leu Trp Arg Phe Pro Asn Glu Leu Pro 
        195                 200                 205             


Ile Ala Gly Glu Pro Ala Asn Ile Val Ala Leu Val Glu Glu Tyr Met 
    210                 215                 220                 


Asp Trp Leu His Gln Ser Pro Val Pro Lys Leu Leu Phe Trp Gly Thr 
225                 230                 235                 240 


Pro Gly Val Leu Ile Pro Pro Ala Glu Ala Ala Arg Leu Ala Lys Ser 
                245                 250                 255     


Leu Pro Asn Ala Lys Ala Val Asp Ile Gly Pro Gly Leu Asn Leu Leu 
            260                 265                 270         


Gln Glu Asp Asn Pro Asp Leu Ile Gly Ser Glu Ile Ala Arg Trp Leu 
        275                 280                 285             


Ser Thr Leu Glu Ile Ser Gly 
    290                 295 


<210>  37
<211>  556
<212>  PRT
<213>  Artificial

<220>
<223>  single chain Avidin

<400>  37

Arg Lys Arg Thr Gln Pro Thr Phe Gly Phe Thr Val Asn Trp Lys Phe 
1               5                   10                  15      


Ser Glu Ser Thr Thr Val Phe Thr Gly Gln Cys Phe Ile Asp Arg Asn 
            20                  25                  30          


Gly Lys Glu Val Leu Lys Thr Met Trp Leu Leu Arg Ser Ser Val Asn 
        35                  40                  45              


Asp Ile Gly Asp Asp Trp Lys Ala Thr Arg Val Gly Ile Asn Ile Phe 
    50                  55                  60                  


Thr Arg Leu Arg Thr Gln Lys Glu Gly Gly Ser Gly Gly Ser Ala Arg 
65                  70                  75                  80  


Lys Cys Ser Leu Thr Gly Lys Trp Thr Asn Asp Leu Gly Ser Asn Met 
                85                  90                  95      


Thr Ile Gly Ala Val Asn Ser Arg Gly Glu Phe Thr Gly Thr Tyr Ile 
            100                 105                 110         


Thr Ala Val Thr Ala Thr Ser Asn Glu Ile Lys Glu Ser Pro Leu His 
        115                 120                 125             


Gly Thr Gln Asn Thr Ile Asn Lys Ser Gly Gly Ser Thr Thr Val Phe 
    130                 135                 140                 


Thr Gly Gln Cys Phe Ile Asp Arg Asn Gly Lys Glu Val Leu Lys Thr 
145                 150                 155                 160 


Met Trp Leu Leu Arg Ser Ser Val Asn Asp Ile Gly Asp Asp Trp Lys 
                165                 170                 175     


Ala Thr Arg Val Gly Ile Asn Ile Phe Thr Arg Leu Arg Thr Gln Lys 
            180                 185                 190         


Glu Gly Gly Ser Gly Gly Ser Ala Arg Lys Cys Ser Leu Thr Gly Lys 
        195                 200                 205             


Trp Thr Asn Asp Leu Gly Ser Asn Met Thr Ile Gly Ala Val Asn Ser 
    210                 215                 220                 


Arg Gly Glu Phe Thr Gly Thr Tyr Ile Thr Ala Val Thr Ala Thr Ser 
225                 230                 235                 240 


Asn Glu Ile Lys Glu Ser Pro Leu His Gly Thr Gln Asn Thr Ile Asn 
                245                 250                 255     


Lys Arg Thr Gln Pro Thr Phe Gly Phe Thr Val Asn Trp Lys Phe Ser 
            260                 265                 270         


Glu Gly Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Gly Arg Thr Gln 
        275                 280                 285             


Pro Thr Phe Gly Phe Thr Val Asn Trp Lys Phe Ser Glu Ser Thr Thr 
    290                 295                 300                 


Val Phe Thr Gly Gln Cys Phe Ile Asp Arg Asn Gly Lys Glu Val Leu 
305                 310                 315                 320 


Lys Thr Met Trp Leu Leu Arg Ser Ser Val Asn Asp Ile Gly Asp Asp 
                325                 330                 335     


Trp Lys Ala Thr Arg Val Gly Ile Asn Ile Phe Thr Arg Leu Arg Thr 
            340                 345                 350         


Gln Lys Glu Gly Gly Ser Gly Gly Ser Ala Arg Lys Cys Ser Leu Thr 
        355                 360                 365             


Gly Lys Trp Thr Asn Asp Leu Gly Ser Asn Met Thr Ile Gly Ala Val 
    370                 375                 380                 


Asn Ser Arg Gly Glu Phe Thr Gly Thr Tyr Ile Thr Ala Val Thr Ala 
385                 390                 395                 400 


Thr Ser Asn Glu Ile Lys Glu Ser Pro Leu His Gly Thr Gln Asn Thr 
                405                 410                 415     


Ile Asn Lys Ser Gly Gly Ser Thr Thr Val Phe Thr Gly Gln Cys Phe 
            420                 425                 430         


Ile Asp Arg Asn Gly Lys Glu Val Leu Lys Thr Met Trp Leu Leu Arg 
        435                 440                 445             


Ser Ser Val Asn Asp Ile Gly Asp Asp Trp Lys Ala Thr Arg Val Gly 
    450                 455                 460                 


Ile Asn Ile Phe Thr Arg Leu Arg Thr Gln Lys Glu Gly Gly Ser Gly 
465                 470                 475                 480 


Gly Ser Ala Arg Lys Cys Ser Leu Thr Gly Lys Trp Thr Asn Asp Leu 
                485                 490                 495     


Gly Ser Asn Met Thr Ile Gly Ala Val Asn Ser Arg Gly Glu Phe Thr 
            500                 505                 510         


Gly Thr Tyr Ile Thr Ala Val Thr Ala Thr Ser Asn Glu Ile Lys Glu 
        515                 520                 525             


Ser Pro Leu His Gly Thr Gln Asn Thr Ile Asn Lys Arg Thr Gln Pro 
    530                 535                 540                 


Thr Phe Gly Phe Thr Val Asn Trp Lys Phe Ser Glu 
545                 550                 555     


<210>  38
<211>  236
<212>  PRT
<213>  Artificial

<220>
<223>  TEV protease X3

<400>  38

Gly Glu Ser Leu Phe Lys Gly Pro Arg Asp Tyr Asn Pro Ile Ser Ser 
1               5                   10                  15      


Thr Ile Cys His Leu Thr Asn Glu Ser Asp Gly His Thr Thr Ser Leu 
            20                  25                  30          


Tyr Gly Ile Gly Phe Gly Pro Phe Ile Ile Thr Asn Lys His Leu Phe 
        35                  40                  45              


Arg Arg Asn Asn Gly Thr Leu Val Val Gln Ser Leu His Gly Val Phe 
    50                  55                  60                  


Lys Val Lys Asn Thr Thr Thr Leu Gln Gln His Leu Ile Asp Gly Arg 
65                  70                  75                  80  


Asp Met Ile Ile Ile Arg Met Pro Lys Asp Phe Pro Pro Phe Pro Gln 
                85                  90                  95      


Lys Leu Lys Phe Arg Glu Pro Gln Arg Glu Glu Arg Ile Cys Leu Val 
            100                 105                 110         


Thr Thr Asn Phe Gln Thr Lys Ser Met Ser Ser Met Val Ser Asp Thr 
        115                 120                 125             


Ser Cys Thr Phe Pro Ser Gly Asp Gly Ile Phe Trp Lys His Trp Ile 
    130                 135                 140                 


Gln Thr Lys Asp Gly Gln Cys Gly Ser Pro Leu Val Ser Thr Arg Asp 
145                 150                 155                 160 


Gly Phe Ile Val Gly Ile His Ser Ala Ser Asn Phe Thr Asn Thr Asn 
                165                 170                 175     


Asn Tyr Phe Thr Ser Val Pro Lys Asn Phe Met Glu Leu Leu Thr Asn 
            180                 185                 190         


Gln Glu Ala Gln Gln Trp Val Ser Gly Trp Arg Leu Asn Ala Asp Ser 
        195                 200                 205             


Val Leu Trp Gly Gly His Lys Val Phe Met Val Lys Pro Glu Glu Pro 
    210                 215                 220                 


Phe Gln Pro Val Lys Glu Ala Thr Gln Leu Met Asn 
225                 230                 235     


<210>  39
<211>  219
<212>  PRT
<213>  Artificial

<220>
<223>  modified Ulp1 from Saccharomyces cerevisiae

<400>  39

Leu Val Pro Glu Leu Asn Glu Lys Asp Asp Asp Gln Val Gln Lys Ala 
1               5                   10                  15      


Leu Ala Ser Arg Glu Asn Thr Gln Leu Met Asn Arg Asp Asn Ile Glu 
            20                  25                  30          


Ile Thr Val Arg Asp Phe Lys Thr Leu Ala Pro Arg Arg Trp Leu Asn 
        35                  40                  45              


Ser Gly Ile Ile Ser Phe Phe Met Lys Tyr Ile Glu Lys Ser Thr Pro 
    50                  55                  60                  


Asn Thr Val Ala Phe Asn Ser Phe Phe Tyr Thr Asn Leu Ser Glu Arg 
65                  70                  75                  80  


Gly Tyr Gln Gly Val Arg Arg Trp Met Lys Arg Lys Lys Thr Gln Ile 
                85                  90                  95      


Asp Lys Leu Asp Lys Ile Phe Thr Pro Ile Asn Leu Asn Gln Ser His 
            100                 105                 110         


Trp Ala Leu Gly Ile Ile Asp Leu Lys Lys Lys Thr Ile Gly Tyr Val 
        115                 120                 125             


Asp Ser Leu Ser Asn Gly Pro Asn Ala Met Ser Phe Ala Ile Leu Thr 
    130                 135                 140                 


Asp Leu Gln Lys Tyr Val Met Glu Glu Ser Lys His Thr Ile Gly Glu 
145                 150                 155                 160 


Asp Phe Asp Leu Ile His Leu Asp Cys Pro Gln Gln Pro Asn Gly Tyr 
                165                 170                 175     


Asp Cys Gly Ile Tyr Val Cys Met Asn Thr Leu Tyr Gly Ser Ala Asp 
            180                 185                 190         


Ala Pro Leu Asp Phe Asp Tyr Lys Asp Ala Ile Arg Met Arg Arg Phe 
        195                 200                 205             


Ile Ala His Leu Ile Leu Thr Asp Ala Leu Lys 
    210                 215                 


<210>  40
<211>  235
<212>  PRT
<213>  Artificial

<220>
<223>  green fluorescent protein derivative

<400>  40

Val Ser Lys Gly Glu Glu Asp Asn Met Ala Ser Leu Pro Ala Thr His 
1               5                   10                  15      


Glu Leu His Ile Phe Gly Ser Ile Asn Gly Val Asp Phe Asp Met Val 
            20                  25                  30          


Gly Gln Gly Thr Gly Asn Pro Asn Asp Gly Tyr Glu Glu Leu Asn Leu 
        35                  40                  45              


Lys Ser Thr Lys Gly Asp Leu Gln Phe Ser Pro Trp Ile Leu Val Pro 
    50                  55                  60                  


His Ile Gly Tyr Gly Phe His Gln Tyr Leu Pro Tyr Pro Asp Gly Met 
65                  70                  75                  80  


Ser Pro Phe Gln Ala Ala Met Val Asp Gly Ser Gly Tyr Gln Val His 
                85                  90                  95      


Arg Thr Met Gln Phe Glu Asp Gly Ala Ser Leu Thr Val Asn Tyr Arg 
            100                 105                 110         


Tyr Thr Tyr Glu Gly Ser His Ile Lys Gly Glu Ala Gln Val Lys Gly 
        115                 120                 125             


Thr Gly Phe Pro Ala Asp Gly Pro Val Met Thr Asn Ser Leu Thr Ala 
    130                 135                 140                 


Ala Asp Trp Cys Arg Ser Lys Lys Thr Tyr Pro Asn Asp Lys Thr Ile 
145                 150                 155                 160 


Ile Ser Thr Phe Lys Trp Ser Tyr Thr Thr Gly Asn Gly Lys Arg Tyr 
                165                 170                 175     


Arg Ser Thr Ala Arg Thr Thr Tyr Thr Phe Ala Lys Pro Met Ala Ala 
            180                 185                 190         


Asn Tyr Leu Lys Asn Gln Pro Met Tyr Val Phe Arg Lys Thr Glu Leu 
        195                 200                 205             


Lys His Ser Lys Thr Glu Leu Asn Phe Lys Glu Trp Gln Lys Ala Phe 
    210                 215                 220                 


Thr Asp Val Met Gly Met Asp Glu Leu Tyr Lys 
225                 230                 235 


<210>  41
<211>  438
<212>  PRT
<213>  Artificial

<220>
<223>  Flp recombinase with C-terminal NLS

<400>  41

Met Ser Gln Phe Asp Ile Leu Cys Lys Thr Pro Pro Lys Val Leu Val 
1               5                   10                  15      


Arg Gln Phe Val Glu Arg Phe Glu Arg Pro Ser Gly Glu Lys Ile Ala 
            20                  25                  30          


Ser Cys Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met Ile Thr His Asn 
        35                  40                  45              


Gly Thr Ala Ile Lys Arg Ala Thr Phe Met Ser Tyr Asn Thr Ile Ile 
    50                  55                  60                  


Ser Asn Ser Leu Ser Phe Asp Ile Val Asn Lys Ser Leu Gln Phe Lys 
65                  70                  75                  80  


Tyr Lys Thr Gln Lys Ala Thr Ile Leu Glu Ala Ser Leu Lys Lys Leu 
                85                  90                  95      


Ile Pro Ala Trp Glu Phe Thr Ile Ile Pro Tyr Asn Gly Gln Lys His 
            100                 105                 110         


Gln Ser Asp Ile Thr Asp Ile Val Ser Ser Leu Gln Leu Gln Phe Glu 
        115                 120                 125             


Ser Ser Glu Glu Ala Asp Lys Gly Asn Ser His Ser Lys Lys Met Leu 
    130                 135                 140                 


Lys Ala Leu Leu Ser Glu Gly Glu Ser Ile Trp Glu Ile Thr Glu Lys 
145                 150                 155                 160 


Ile Leu Asn Ser Phe Glu Tyr Thr Ser Arg Phe Thr Lys Thr Lys Thr 
                165                 170                 175     


Leu Tyr Gln Phe Leu Phe Leu Ala Thr Phe Ile Asn Cys Gly Arg Phe 
            180                 185                 190         


Ser Asp Ile Lys Asn Val Asp Pro Lys Ser Phe Lys Leu Val Gln Asn 
        195                 200                 205             


Lys Tyr Leu Gly Val Ile Ile Gln Cys Leu Val Thr Glu Thr Lys Thr 
    210                 215                 220                 


Ser Val Ser Arg His Ile Tyr Phe Phe Ser Ala Arg Gly Arg Ile Asp 
225                 230                 235                 240 


Pro Leu Val Tyr Leu Asp Glu Phe Leu Arg Asn Ser Glu Pro Val Leu 
                245                 250                 255     


Lys Arg Val Asn Arg Thr Gly Asn Ser Ser Ser Asn Lys Gln Glu Tyr 
            260                 265                 270         


Gln Leu Leu Lys Asp Asn Leu Val Arg Ser Tyr Asn Lys Ala Leu Lys 
        275                 280                 285             


Lys Asn Ala Pro Tyr Pro Ile Phe Ala Ile Lys Asn Gly Pro Lys Ser 
    290                 295                 300                 


His Ile Gly Arg His Leu Met Thr Ser Phe Leu Ser Met Lys Gly Leu 
305                 310                 315                 320 


Thr Glu Leu Thr Asn Val Val Gly Asn Trp Ser Asp Lys Arg Ala Ser 
                325                 330                 335     


Ala Val Ala Arg Thr Thr Tyr Thr His Gln Ile Thr Ala Ile Pro Asp 
            340                 345                 350         


His Tyr Phe Ala Leu Val Ser Arg Tyr Tyr Ala Tyr Asp Pro Ile Ser 
        355                 360                 365             


Lys Glu Met Ile Ala Leu Lys Asp Glu Thr Asn Pro Ile Glu Glu Trp 
    370                 375                 380                 


Gln His Ile Glu Gln Leu Lys Gly Ser Ala Glu Gly Ser Ile Arg Tyr 
385                 390                 395                 400 


Pro Ala Trp Asn Gly Ile Ile Ser Gln Glu Val Leu Asp Tyr Leu Ser 
                405                 410                 415     


Ser Tyr Ile Asn Arg Arg Ile Gly Gly Ser Gly Gly Ser Pro Ala Ala 
            420                 425                 430         


Lys Arg Val Lys Leu Asp 
        435             


<210>  42
<211>  356
<212>  PRT
<213>  Artificial

<220>
<223>  Cre recombinase with C-terminal NLS

<400>  42

Met Ser Asn Leu Leu Thr Val His Gln Asn Leu Pro Ala Leu Pro Val 
1               5                   10                  15      


Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met Asp Met Phe Arg 
            20                  25                  30          


Asp Arg Gln Ala Phe Ser Glu His Thr Trp Lys Met Leu Leu Ser Val 
        35                  40                  45              


Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn Arg Lys Trp Phe 
    50                  55                  60                  


Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu Tyr Leu Gln Ala 
65                  70                  75                  80  


Arg Gly Leu Ala Val Lys Thr Ile Gln Gln His Leu Gly Gln Leu Asn 
                85                  90                  95      


Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser Asp Ser Asn Ala 
            100                 105                 110         


Val Ser Leu Val Met Arg Arg Ile Arg Lys Glu Asn Val Asp Ala Gly 
        115                 120                 125             


Glu Arg Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr Asp Phe Asp Gln 
    130                 135                 140                 


Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gln Asp Ile Arg Asn 
145                 150                 155                 160 


Leu Ala Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu Arg Ile Ala Glu 
                165                 170                 175     


Ile Ala Arg Ile Arg Val Lys Asp Ile Ser Arg Thr Asp Gly Gly Arg 
            180                 185                 190         


Met Leu Ile His Ile Gly Arg Thr Lys Thr Leu Val Ser Thr Ala Gly 
        195                 200                 205             


Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu Val Glu Arg Trp 
    210                 215                 220                 


Ile Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn Tyr Leu Phe Cys 
225                 230                 235                 240 


Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala Thr Ser Gln Leu 
                245                 250                 255     


Ser Thr Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr His Arg Leu Ile 
            260                 265                 270         


Tyr Gly Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu Ala Trp Ser Gly 
        275                 280                 285             


His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala Arg Ala Gly Val 
    290                 295                 300                 


Ser Ile Pro Glu Ile Met Gln Ala Gly Gly Trp Thr Asn Val Asn Ile 
305                 310                 315                 320 


Val Met Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr Gly Ala Met Val 
                325                 330                 335     


Arg Leu Leu Glu Asp Gly Asp Gly Gly Ser Gly Pro Ala Ala Lys Arg 
            340                 345                 350         


Val Lys Leu Asp 
        355     


<210>  43
<211>  199
<212>  PRT
<213>  Artificial

<220>
<223>  Expression product from Puromycin resistance gene

<400>  43

Met Thr Glu Tyr Lys Pro Thr Val Arg Leu Ala Thr Arg Asp Asp Val 
1               5                   10                  15      


Pro Arg Ala Val Arg Thr Leu Ala Ala Ala Phe Ala Asp Tyr Pro Ala 
            20                  25                  30          


Thr Arg His Thr Val Asp Pro Asp Arg His Ile Glu Arg Val Thr Glu 
        35                  40                  45              


Leu Gln Glu Leu Phe Leu Thr Arg Val Gly Leu Asp Ile Gly Lys Val 
    50                  55                  60                  


Trp Val Ala Asp Asp Gly Ala Ala Val Ala Val Trp Thr Thr Pro Glu 
65                  70                  75                  80  


Ser Val Glu Ala Gly Ala Val Phe Ala Glu Ile Gly Pro Arg Met Ala 
                85                  90                  95      


Glu Leu Ser Gly Ser Arg Leu Ala Ala Gln Gln Gln Met Glu Gly Leu 
            100                 105                 110         


Leu Ala Pro His Arg Pro Lys Glu Pro Ala Trp Phe Leu Ala Thr Val 
        115                 120                 125             


Gly Val Ser Pro Asp His Gln Gly Lys Gly Leu Gly Ser Ala Val Val 
    130                 135                 140                 


Leu Pro Gly Val Glu Ala Ala Glu Arg Ala Gly Val Pro Ala Phe Leu 
145                 150                 155                 160 


Glu Thr Ser Ala Pro Arg Asn Leu Pro Phe Tyr Glu Arg Leu Gly Phe 
                165                 170                 175     


Thr Val Thr Ala Asp Val Glu Val Pro Glu Gly Pro Arg Thr Trp Cys 
            180                 185                 190         


Met Thr Arg Lys Pro Gly Ala 
        195                 


<210>  44
<211>  5303
<212>  DNA
<213>  Artificial

<220>
<223>  Vector comprising split intein - heterologous polynucleotide 
       construct

<400>  44
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc       60

attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga      120

gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc      180

caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc      240

ctaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag      300

cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa      360

agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac      420

cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg      480

caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg      540

gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg      600

taaaacgacg gccagtgagc gcgcgtaata cgactcacta tagggcgaat tggagctgaa      660

gactctgtct cgtgggcagc agcgagatca tcaccagaaa ctacggcaag accaccatca      720

aagaggtggt cgagatcttc gacaacgaca agaacatcca ggtgctggcc ttcaacaccc      780

acaccgacaa catcgagtgg gcccctatca aggccgctca gctgacaaga cctaacgccg      840

agctggtgga actggaaatc gacacactgc acggcgtgaa aaccatcaga tgcacccctg      900

atcaccccgt gtacaccaag aacagaggct acgtgcgggc cgacgagctg acagatgatg      960

acgaactggt ggtggctatc ggcggcggag gacctgagga tgaacttgct gccaacgagg     1020

aagaactgca acagaacgaa cagaagctgg cccagattaa gcagaagctc caggccatta     1080

agtacggcgg atccggcgga ggcggatctg gtaccggaat ggaagatgcc aagaacatca     1140

agaagggccc tgctcctaga taccctctgg aagatggaac cgctggcgag cagctgcaca     1200

aggccatgaa gagatacgct caggtgcccg gcacaatcgc cttcacagat gcccacatcg     1260

aagtgaacat cacctacgcc gagtacttcg agatgagcgt gcggctggcc gaagctatga     1320

agcgatacgg cctgaacacc aaccacagaa tcgtcgtgtg cagcgagaac agcctccagt     1380

tcttcatgcc tgtgctgggc gctctgttca tcggagtggc tgtggctcct gccaacgaca     1440

tctacaacga gcgcgagctg ctgaacagca tgaacatcag ccagcctacc gtggtgttcg     1500

tgtccaagaa gggactgcaa aagatcctga acgtgcagaa gaagctgccc atcatccaga     1560

aaatcatcat catggacagc aagaccgact accagggctt ccagagcatg tataccttcg     1620

tgaccagcca tctgccacca ggcttcaacg agtacgactt caagcccgag agcttcgaca     1680

gagacaagac aatcgccctg atcatgaaca gcagcggctc taccggactg cctaaaggcg     1740

ttgccctgcc tcacagaaca gcttgcgtca gattcagcca cgccagagat cccatcttcg     1800

gcaaccagat caagcctgac accgctatcc tgagcgtggt gccttttcac cacggcttcg     1860

gcatgttcac cacactgggc tacctgatct gcggcttcag agtggtgctg atgtatcgct     1920

ttgaggaaga actgttcctg cggagcctcc aggactacaa gatccagtct gctctgctgg     1980

tgcctactct gttcagcttc tttgccaaga gcaccctgat cgataagtac gacctgagca     2040

acctgcacga gatcgcctct ggcggagccc ctctgtctaa agaagtgggc gaagccgtcg     2100

ccaagagatt tcatctgccc ggcatcagac aaggctacgg actgaccgag acaaccagcg     2160

ccatcctgat cacacctgag ggcgacgata agcctggcgc tgtgggaaaa gtggtgccat     2220

tcttcgaggc taaggtggtg gacctggaca ccggcaaaac actgggagtg aatcagaggg     2280

gcgagctgtg tgtcagaggc cctatgatca tgagcggcta cgtgaacaac cccgaggcca     2340

ccaacgctct gatcgacaag gatggctggc tgcacagcgg cgacattgcc tactgggacg     2400

aagatgagca cttcttcatc gtggacagac tgaagtccct gatcaagtac aagggctacc     2460

aggtggcccc tgccgagctg gaatctatcc tgctccagca tcctaacatc cgcgacgctg     2520

gtgttgctgg cctgcctgac gatgatgctg gcgaacttcc tgctgccgtg gtggtgctgg     2580

aacacggcaa gaccatgacc gagaaagaaa tcgtggacta cgtggcctct caagtgacca     2640

ccgccaagaa actgagaggc ggcgtggtgt ttgtggacga ggtgccaaaa ggcctgaccg     2700

gcaagctgga cgccagaaag atcagagaga tcctcatcaa ggccaagaaa ggcggcaaga     2760

tcgctgtcgg aggatccggc ggagactaca aggacgacga tgacaaaggg tcacctggca     2820

taacttcgta tagtacacat tatacgaagt tatctggcgg gtcacccgag gatgagatcc     2880

agcagctgga agaggaaatc gcccagctgg aacagaagaa tgccgctctg aaagagaaga     2940

accaggctct gaagtacgga ggcggaggca tggaagccaa gacctacatc ggcaagctga     3000

agtccagaaa gatcgtgtcc aacgaggaca cctacgacat ccagaccagc acacacaact     3060

ttttcgccaa cgacatcctg gtgcacaact cgtcttcgta cccagctttt gttcccttta     3120

gtgagggtta attgcgcgct tggcgtaatc atggtcatag ctgtttcctg tgtgaaattg     3180

ttatccgctc acaattccac acaacatacg agccggaagc ataaagtgta aagcctgggg     3240

tgcctaatga gtgagctaac tcacattaat tgcgttgcgc tcactgcccg ctttccagtc     3300

gggaaacctg tcgtgccagc tgcattaatg aatcggccaa cgcgcgggga gaggcggttt     3360

gcgtattggg cgctcttccg cttcctcgct cactgactcg ctgcgctcgg tcgttcggct     3420

gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga     3480

taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc     3540

cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg     3600

ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg     3660

aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt     3720

tctcccttcg ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt     3780

gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg     3840

cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact     3900

ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt     3960

cttgaagtgg tggcctaact acggctacac tagaaggaca gtatttggta tctgcgctct     4020

gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac     4080

cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc     4140

tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg     4200

ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta     4260

aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca     4320

atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc     4380

ctgactcccc gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc     4440

tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc     4500

agccggaagg gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat     4560

taattgttgc cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt     4620

tgccattgct acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc     4680

cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag     4740

ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt     4800

tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac     4860

tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg     4920

cccggcgtca atacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat     4980

tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc     5040

gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc     5100

tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa     5160

atgttgaata ctcatactct tcctttttca atattattga agcatttatc agggttattg     5220

tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg     5280

cacatttccc cgaaaagtgc cac                                             5303


<210>  45
<211>  7602
<212>  DNA
<213>  Artificial

<220>
<223>  Vector comprising split intein - heterologous polynucleotide 
       construct

<400>  45
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc       60

attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga      120

gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc      180

caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc      240

ctaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag      300

cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa      360

agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac      420

cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg      480

caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg      540

gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg      600

taaaacgacg gccagtgagc gcgcgtaata cgactcacta tagggcgaat tggagctgaa      660

gactctgtct cgtgggcagc agcgagatca tcaccagaaa ctacggcaag accaccatca      720

aagaggtggt cgagatcttc gacaacgaca agaacatcca ggtgctggcc ttcaacaccc      780

acaccgacaa catcgagtgg gcccctatca aggccgctca gctgacaaga cctaacgccg      840

agctggtgga actggaaatc gacacactgc acggcgtgaa aaccatcaga tgcacccctg      900

atcaccccgt gtacaccaag aacagaggct acgtgcgggc cgacgagctg acagatgatg      960

acgaactggt ggtggctatc ggcggcggag gacctgagga tgaacttgct gccaacgagg     1020

aagaactgca acagaacgaa cagaagctgg cccagattaa gcagaagctc caggccatta     1080

agtacggcgg atccggcgga ggcggatctg gtaccggaat ggaagatgcc aagaacatca     1140

agaagggccc tgctcctaga taccctctgg aagatggaac cgctggcgag cagctgcaca     1200

aggccatgaa gagatacgct caggtgcccg gcacaatcgc cttcacagat gcccacatcg     1260

aagtgaacat cacctacgcc gagtacttcg agatgagcgt gcggctggcc gaagctatga     1320

agcgatacgg cctgaacacc aaccacagaa tcgtcgtgtg cagcgagaac agcctccagt     1380

tcttcatgcc tgtgctgggc gctctgttca tcggagtggc tgtggctcct gccaacgaca     1440

tctacaacga gcgcgagctg ctgaacagca tgaacatcag ccagcctacc gtggtgttcg     1500

tgtccaagaa gggactgcaa aagatcctga acgtgcagaa gaagctgccc atcatccaga     1560

aaatcatcat catggacagc aagaccgact accagggctt ccagagcatg tataccttcg     1620

tgaccagcca tctgccacca ggcttcaacg agtacgactt caagcccgag agcttcgaca     1680

gagacaagac aatcgccctg atcatgaaca gcagcggctc taccggactg cctaaaggcg     1740

ttgccctgcc tcacagaaca gcttgcgtca gattcagcca cgccagagat cccatcttcg     1800

gcaaccagat caagcctgac accgctatcc tgagcgtggt gccttttcac cacggcttcg     1860

gcatgttcac cacactgggc tacctgatct gcggcttcag agtggtgctg atgtatcgct     1920

ttgaggaaga actgttcctg cggagcctcc aggactacaa gatccagtct gctctgctgg     1980

tgcctactct gttcagcttc tttgccaaga gcaccctgat cgataagtac gacctgagca     2040

acctgcacga gatcgcctct ggcggagccc ctctgtctaa agaagtgggc gaagccgtcg     2100

ccaagagatt tcatctgccc ggcatcagac aaggctacgg actgaccgag acaaccagcg     2160

ccatcctgat cacacctgag ggcgacgata agcctggcgc tgtgggaaaa gtggtgccat     2220

tcttcgaggc taaggtggtg gacctggaca ccggcaaaac actgggagtg aatcagaggg     2280

gcgagctgtg tgtcagaggc cctatgatca tgagcggcta cgtgaacaac cccgaggcca     2340

ccaacgctct gatcgacaag gatggctggc tgcacagcgg cgacattgcc tactgggacg     2400

aagatgagca cttcttcatc gtggacagac tgaagtccct gatcaagtac aagggctacc     2460

aggtggcccc tgccgagctg gaatctatcc tgctccagca tcctaacatc cgcgacgctg     2520

gtgttgctgg cctgcctgac gatgatgctg gcgaacttcc tgctgccgtg gtggtgctgg     2580

aacacggcaa gaccatgacc gagaaagaaa tcgtggacta cgtggcctct caagtgacca     2640

ccgccaagaa actgagaggc ggcgtggtgt ttgtggacga ggtgccaaaa ggcctgaccg     2700

gcaagctgga cgccagaaag atcagagaga tcctcatcaa ggccaagaaa ggcggcaaga     2760

tcgctgtcgg aggatccggc ggagactaca aggacgacga tgacaaaggg tcacctggca     2820

taacttcgta tagtacacat tatacgaagt tatccggaca ggtaagtatc ctttttacag     2880

cacaacttaa tgagacagat agaaactggt cttgtagaaa cagagtaggc tagcccccag     2940

ctggttcttt ccgcctcaga agccatagag cccaccgcat ccccagcatg cctgctattc     3000

tcttcccaat cctccccctt gctgtcctgc cccaccccac cccccagaat agaatgacac     3060

ctactcagac aatgcgatgc aatttcctca ttttattagg aaaggacagt gggagtggca     3120

ccttccaggg tcaaggaagg cacgggggag gggcaaacaa cagatggctg gcaactagaa     3180

ggcacagtcg aggctgatca gcgagccgcc ggcgtctaga gaattgatcc cctcaggcgc     3240

caggctttct ggtcatgcac caggttctag ggccctcagg cacttccacg tcggcggtca     3300

cggtgaagcc cagtctctcg tagaagggca ggtttctggg ggcgcttgtt tccaggaagg     3360

cgggcacgcc agccctttca gcagcttcca ccccaggcag caccacagca gatcccagtc     3420

ccttgccctg gtggtcaggt gacacgccca cggtggccag aaaccaggca ggctcttttg     3480

gtctgtgggg ggccagcagg ccttccatct gctgctgggc agccagtcta gagccgctca     3540

gctcggccat tctaggtccg atctcggcga acacagcgcc ggcttccaca gactcagggg     3600

ttgtccacac agccacagcg gcgccatcat cggccaccca cactttgccg atgtccaggc     3660

ccactctggt cagaaacagt tcctgcagct cggtcactct ctcgatgtgc cggtcggggt     3720

ccacggtgtg tcttgtggca gggtaatcgg cgaaggcagc ggccagtgtc cgcacagctc     3780

ttggcacatc gtccctggtg gccagccgca ctgtgggctt gtactcggtc atggtggcgc     3840

gccttttagg ggtagttttc acgacacctg aaatggaaga aaaaaacttt gaaccactgt     3900

ctgaggcttg agaatgaacc aagatccaaa ctcaaaaagg gcaaattcca aggagaatta     3960

catcaagtgc caagctggcc taacttcagt ctccacccac tcagtgtggg gaaactccat     4020

cgcataaaac ccctcccccc aacctaaaga cgacgtactc caaaagctcg agaactaatc     4080

gaggtgcctg gacggcgccc ggtactccgt ggagtcacat gaagcgacgg ctgaggacgg     4140

aaaggccctt ttcctttgtg tgggtgactc acccgcccgc tctcccgagc gccgcgtcct     4200

ccattttgag ctccctgcag cagggccggg aagcggccat ctttccgctc acgcaactgg     4260

tgccgaccgg gccagccttg ccgcccaggg cggggcgata cacggcggcg cgaggccagg     4320

caccagagca ggccggccag cttgagacta cccccgtccg attctcggtg gccgcgctcg     4380

caggccccgc ctcgccgaac atgtgcgctg ggacgcacgg gccccgtcgc cgcccgcggc     4440

cccaaaaacc gaaataccag tgtgcagatc ttggcccgca tttacaagac tatcttgcca     4500

gaaaaaaagc gtcgcagcag gtcatcaaaa attttaaatg gctagagact tatcgaaagc     4560

agcgagacag gcgcgaaggt gccaccagat tcgcacgcgg cggccccagc gcccaggcca     4620

ggcctcaact caagcacgag gcgaaggggc tccttaagcg caaggcctcg aactctccca     4680

cccacttcca acccgaagct cgggatcaag aatcacgtac tgcagccagg ggcgtggaag     4740

taattcaagg cacgcaaggg ccataacccg taaagaggcc aggcccgcgg gaaccacaca     4800

cggcacttac ctgtgttctg gcggcaaacc cgttgcgaaa aagaacgttc acggcgacta     4860

ctgcacttat atacggttct cccccaccct cgggaaaaag gcggagccag tacacgacat     4920

cactttccca gtttaccccg cgccaccttc tctaggcacc ggttcaattg ccgacccctc     4980

cccccaactt ctcggggact gtgggcgatg tgcgctctgc ccactgacgg gcaccggagc     5040

caattcgaat cgcctgcttt tctgcctggt actaacttct ctcccctctc ctcttttctt     5100

tttctgcagg gcggccgcat aacttcgtat agtacacatt atacgaagtt atctggcggg     5160

tcacccgagg atgagatcca gcagctggaa gaggaaatcg cccagctgga acagaagaat     5220

gccgctctga aagagaagaa ccaggctctg aagtacggag gcggaggcat ggaagccaag     5280

acctacatcg gcaagctgaa gtccagaaag atcgtgtcca acgaggacac ctacgacatc     5340

cagaccagca cacacaactt tttcgccaac gacatcctgg tgcacaactc gtcttcgtac     5400

ccagcttttg ttccctttag tgagggttaa ttgcgcgctt ggcgtaatca tggtcatagc     5460

tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca     5520

taaagtgtaa agcctggggt gcctaatgag tgagctaact cacattaatt gcgttgcgct     5580

cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac     5640

gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc     5700

tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt     5760

tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg     5820

ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg     5880

agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat     5940

accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta     6000

ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct     6060

gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc     6120

ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa     6180

gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg     6240

taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag     6300

tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt     6360

gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta     6420

cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc     6480

agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca     6540

cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa     6600

cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat     6660

ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct     6720

taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt     6780

tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat     6840

ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta     6900

atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg     6960

gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt     7020

tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg     7080

cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg     7140

taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc     7200

ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa     7260

ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac     7320

cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt     7380

ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg     7440

gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa     7500

gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata     7560

aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc ac                        7602


<210>  46
<211>  4121
<212>  DNA
<213>  Artificial

<220>
<223>  Vector comprising split intein - heterologous polynucleotide 
       construct

<400>  46
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc       60

attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga      120

gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc      180

caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc      240

ctaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag      300

cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa      360

agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac      420

cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg      480

caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg      540

gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg      600

taaaacgacg gccagtgagc gcgcgtaata cgactcacta tagggcgaat tggagctgaa      660

gactctgcct ggaccttaag acccaggtgc agacccccca gggcatgaag gaaatcagca      720

acatccaagt gggcgacctg gtgctgagca acaccggcta caacgaggtg ctgaacgtgt      780

tccccaagag caagaagaag tcctacaaga tcaccctgga agatggcaaa gagatcatct      840

gctccgagga acacctgttc ccaacccaga ccggcgagat gaacatctct ggcggcctga      900

aagagggcat gtgcctgtac gtgaaagaag gcggcggagg acctgaggat aagctccagg      960

ccattaagta cgagctggcc cagaacgagg aagaactggc tcagatcgaa gagaagctgg     1020

ccgccaacaa agaaggcgga tccggcggag gcggatctgg aaccggtttt gctaatgagc     1080

tgggccccag actgatgggc aaaggcagcg gaggaggcgg aagcggagtc tttacactgg     1140

aagatttcgt cggcgactgg cggcagacag ctggctacaa tctggaccag gtgctggaac     1200

aaggcggcgt gtcctctctg tttcagaacc tgggagtgtc tgtgacccct atccagagaa     1260

tcgtgctgag cggcgagaac ggcctgaaga tcgacatcca cgtgatcatc ccttacgagg     1320

gcctgtccgg cgatcagatg ggacagatcg agaagatctt taaggtggtg taccccgtgg     1380

acgaccacca cttcaaagtg atcctgcact acggcaccct ggtcatcgat ggcgtgaccc     1440

caaacatgat cgactacttc ggcagaccct acgagggaat cgccgtgttc gacggcaaga     1500

aaatcaccgt gaccggcaca ctgtggaacg gcaacaagat catcgacgag agactgatca     1560

accccgacgg cagcctgctg ttcagagtga caatcaacgg cgtgacaggc tggcggctgt     1620

gcgaaagaat ccttgctggt tccggaggaa gttcctatac ttcaaataga ataggaactt     1680

ccggcgggtc acccgaggat gagaatgctg ctctggaaga gaagatcgcc cagctgaagc     1740

agaagaacgc cgctctgaaa gaagagatcc aggctctgga atacggaggc ggaggcatga     1800

tgctgaagaa gatcctgaag atcgaagaac tggacgagcg cgagctgatc gacatcgagg     1860

tgtccggcaa ccacctgttc tacgccaacg atatcctgac ccacaactcg tcttcgtacc     1920

cagcttttgt tccctttagt gagggttaat tgcgcgcttg gcgtaatcat ggtcatagct     1980

gtttcctgtg tgaaattgtt atccgctcac aattccacac aacatacgag ccggaagcat     2040

aaagtgtaaa gcctggggtg cctaatgagt gagctaactc acattaattg cgttgcgctc     2100

actgcccgct ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa tcggccaacg     2160

cgcggggaga ggcggtttgc gtattgggcg ctcttccgct tcctcgctca ctgactcgct     2220

gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt     2280

atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc     2340

caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga     2400

gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata     2460

ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac     2520

cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata gctcacgctg     2580

taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc     2640

cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag     2700

acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt     2760

aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta gaaggacagt     2820

atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg     2880

atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac     2940

gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca     3000

gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac     3060

ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac     3120

ttggtctgac agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtctatt     3180

tcgttcatcc atagttgcct gactccccgt cgtgtagata actacgatac gggagggctt     3240

accatctggc cccagtgctg caatgatacc gcgagaccca cgctcaccgg ctccagattt     3300

atcagcaata aaccagccag ccggaagggc cgagcgcaga agtggtcctg caactttatc     3360

cgcctccatc cagtctatta attgttgccg ggaagctaga gtaagtagtt cgccagttaa     3420

tagtttgcgc aacgttgttg ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg     3480

tatggcttca ttcagctccg gttcccaacg atcaaggcga gttacatgat cccccatgtt     3540

gtgcaaaaaa gcggttagct ccttcggtcc tccgatcgtt gtcagaagta agttggccgc     3600

agtgttatca ctcatggtta tggcagcact gcataattct cttactgtca tgccatccgt     3660

aagatgcttt tctgtgactg gtgagtactc aaccaagtca ttctgagaat agtgtatgcg     3720

gcgaccgagt tgctcttgcc cggcgtcaat acgggataat accgcgccac atagcagaac     3780

tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga aaactctcaa ggatcttacc     3840

gctgttgaga tccagttcga tgtaacccac tcgtgcaccc aactgatctt cagcatcttt     3900

tactttcacc agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg     3960

aataagggcg acacggaaat gttgaatact catactcttc ctttttcaat attattgaag     4020

catttatcag ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa     4080

acaaataggg gttccgcgca catttccccg aaaagtgcca c                         4121


<210>  47
<211>  6276
<212>  DNA
<213>  Artificial

<220>
<223>  Vector comprising split intein - heterologous polynucleotide 
       construct

<400>  47
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc       60

attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga      120

gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc      180

caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc      240

ctaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag      300

cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa      360

agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac      420

cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg      480

caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg      540

gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg      600

taaaacgacg gccagtgagc gcgcgtaata cgactcacta tagggcgaat tggagctgaa      660

gactctgcct ggaccttaag acccaggtgc agacccccca gggcatgaag gaaatcagca      720

acatccaagt gggcgacctg gtgctgagca acaccggcta caacgaggtg ctgaacgtgt      780

tccccaagag caagaagaag tcctacaaga tcaccctgga agatggcaaa gagatcatct      840

gctccgagga acacctgttc ccaacccaga ccggcgagat gaacatctct ggcggcctga      900

aagagggcat gtgcctgtac gtgaaagaag gcggcggagg acctgaggat aagctccagg      960

ccattaagta cgagctggcc cagaacgagg aagaactggc tcagatcgaa gagaagctgg     1020

ccgccaacaa agaaggcgga tccggcggag gcggatctgg aaccggtttt gctaatgagc     1080

tgggccccag actgatgggc aaaggcagcg gaggaggcgg aagcggagtc tttacactgg     1140

aagatttcgt cggcgactgg cggcagacag ctggctacaa tctggaccag gtgctggaac     1200

aaggcggcgt gtcctctctg tttcagaacc tgggagtgtc tgtgacccct atccagagaa     1260

tcgtgctgag cggcgagaac ggcctgaaga tcgacatcca cgtgatcatc ccttacgagg     1320

gcctgtccgg cgatcagatg ggacagatcg agaagatctt taaggtggtg taccccgtgg     1380

acgaccacca cttcaaagtg atcctgcact acggcaccct ggtcatcgat ggcgtgaccc     1440

caaacatgat cgactacttc ggcagaccct acgagggaat cgccgtgttc gacggcaaga     1500

aaatcaccgt gaccggcaca ctgtggaacg gcaacaagat catcgacgag agactgatca     1560

accccgacgg cagcctgctg ttcagagtga caatcaacgg cgtgacaggc tggcggctgt     1620

gcgaaagaat ccttgctggt tccggaggaa gttcctatac ttcaaataga ataggaactt     1680

cgaattggct ccggtgcccg tcagtgggca gagcgcacat cgcccacagt ccccgagaag     1740

ttggggggag gggtcggcaa ttgaaccggt gcctagagaa ggtggcgcgg ggtaaactgg     1800

gaaagtgatg tcgtgtactg gctccgcctt tttcccgagg gtgggggaga accgtatata     1860

agtgcagtag tcgccgtgaa cgttcttttt cgcaacgggt ttgccgccag aacacaggta     1920

agtgccgtgt gtggttcccg cgggcctggc ctctttacgg gttatggccc ttgcgtgcct     1980

tgaattactt ccacgcccct ggctgcagta cgtgattctt gatcccgagc ttcgggttgg     2040

aagtgggtgg gagagttcga ggccttgcgc ttaaggagcc ccttcgcctc gtgcttgagt     2100

tgaggcctgg cctgggcgct ggggccgccg cgtgcgaatc tggtggcacc ttcgcgcctg     2160

tctcgctgct ttcgataagt ctctagccat ttaaaatttt tgatgacctg ctgcgacgct     2220

ttttttctgg caagatagtc ttgtaaatgc gggccaagat ctgcacactg gtatttcggt     2280

ttttggggcc gcgggcggcg acggggcccg tgcgtcccag cgcacatgtt cggcgaggcg     2340

gggcctgcga gcgcggccac cgagaatcgg acgggggtag tctcaagctg gccggcctgc     2400

tctggtgcct ggcctcgcgc cgccgtgtat cgccccgccc tgggcggcaa ggctggcccg     2460

gtcggcacca gttgcgtgag cggaaagatg gccgcttccc ggccctgctg cagggagctc     2520

aaaatggagg acgcggcgct cgggagagcg ggcgggtgag tcacccacac aaaggaaaag     2580

ggcctttccg tcctcagccg tcgcttcatg tgactccacg gagtaccggg cgccgtccag     2640

gcacctcgat tagttctcga gcttttggag tacgtcgtct ttaggttggg gggaggggtt     2700

ttatgcgatg gagtttcccc acactgagtg ggtggagact gaagttaggc cagcttggca     2760

cttgatgtaa ttctccttgg aatttgccct ttttgagttt ggatcttggt tcattctcaa     2820

gcctcagaca gtggttcaaa gtttttttct tccatttcag gtgtcgtgaa aactacccct     2880

aaaaggcgcg ccaccatgac cgagtacaag cccacagtgc ggctggccac cagggacgat     2940

gtgccaagag ctgtgcggac actggccgct gccttcgccg attaccctgc cacaagacac     3000

accgtggacc ccgaccggca catcgagaga gtgaccgagc tgcaggaact gtttctgacc     3060

agagtgggcc tggacatcgg caaagtgtgg gtggccgatg atggcgccgc tgtggctgtg     3120

tggacaaccc ctgagtctgt ggaagccggc gctgtgttcg ccgagatcgg acctagaatg     3180

gccgagctga gcggctctag actggctgcc cagcagcaga tggaaggcct gctggccccc     3240

cacagaccaa aagagcctgc ctggtttctg gccaccgtgg gcgtgtcacc tgaccaccag     3300

ggcaagggac tgggatctgc tgtggtgctg cctggggtgg aagctgctga aagggctggc     3360

gtgcccgcct tcctggaaac aagcgccccc agaaacctgc ccttctacga gagactgggc     3420

ttcaccgtga ccgccgacgt ggaagtgcct gagggcccta gaacctggtg catgaccaga     3480

aagcctggcg cctgagggga tcaattctct agacgccggc ggctcgctga tcagcctcga     3540

ctgtgccttc tagttgccag ccatctgttg tttgcccctc ccccgtgcct tccttgaccc     3600

tggaaggtgc cactcccact gtcctttcct aataaaatga ggaaattgca tcgcattgtc     3660

tgagtaggtg tcattctatt ctggggggtg gggtggggca ggacagcaag ggggaggatt     3720

gggaagagaa tagcaggcat gctggggatg cggtgggctc tatggcttct gaggcggaaa     3780

gaaccagctg ggggcggccg cggaagttcc tatacttcaa atagaatagg aacttccggc     3840

gggtcacccg aggatgagaa tgctgctctg gaagagaaga tcgcccagct gaagcagaag     3900

aacgccgctc tgaaagaaga gatccaggct ctggaatacg gaggcggagg catgatgctg     3960

aagaagatcc tgaagatcga agaactggac gagcgcgagc tgatcgacat cgaggtgtcc     4020

ggcaaccacc tgttctacgc caacgatatc ctgacccaca actcgtcttc gtacccagct     4080

tttgttccct ttagtgaggg ttaattgcgc gcttggcgta atcatggtca tagctgtttc     4140

ctgtgtgaaa ttgttatccg ctcacaattc cacacaacat acgagccgga agcataaagt     4200

gtaaagcctg gggtgcctaa tgagtgagct aactcacatt aattgcgttg cgctcactgc     4260

ccgctttcca gtcgggaaac ctgtcgtgcc agctgcatta atgaatcggc caacgcgcgg     4320

ggagaggcgg tttgcgtatt gggcgctctt ccgcttcctc gctcactgac tcgctgcgct     4380

cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca     4440

cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga     4500

accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc     4560

acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg     4620

cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat     4680

acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt     4740

atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc     4800

agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg     4860

acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg     4920

gtgctacaga gttcttgaag tggtggccta actacggcta cactagaagg acagtatttg     4980

gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg     5040

gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca     5100

gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga     5160

acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga     5220

tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt     5280

ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt     5340

catccatagt tgcctgactc cccgtcgtgt agataactac gatacgggag ggcttaccat     5400

ctggccccag tgctgcaatg ataccgcgag acccacgctc accggctcca gatttatcag     5460

caataaacca gccagccgga agggccgagc gcagaagtgg tcctgcaact ttatccgcct     5520

ccatccagtc tattaattgt tgccgggaag ctagagtaag tagttcgcca gttaatagtt     5580

tgcgcaacgt tgttgccatt gctacaggca tcgtggtgtc acgctcgtcg tttggtatgg     5640

cttcattcag ctccggttcc caacgatcaa ggcgagttac atgatccccc atgttgtgca     5700

aaaaagcggt tagctccttc ggtcctccga tcgttgtcag aagtaagttg gccgcagtgt     5760

tatcactcat ggttatggca gcactgcata attctcttac tgtcatgcca tccgtaagat     5820

gcttttctgt gactggtgag tactcaacca agtcattctg agaatagtgt atgcggcgac     5880

cgagttgctc ttgcccggcg tcaatacggg ataataccgc gccacatagc agaactttaa     5940

aagtgctcat cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc ttaccgctgt     6000

tgagatccag ttcgatgtaa cccactcgtg cacccaactg atcttcagca tcttttactt     6060

tcaccagcgt ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa aagggaataa     6120

gggcgacacg gaaatgttga atactcatac tcttcctttt tcaatattat tgaagcattt     6180

atcagggtta ttgtctcatg agcggataca tatttgaatg tatttagaaa aataaacaaa     6240

taggggttcc gcgcacattt ccccgaaaag tgccac                               6276


<210>  48
<211>  6417
<212>  DNA
<213>  Artificial

<220>
<223>  Vector comprising split intein - heterologous polynucleotide 
       construct

<400>  48
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc       60

attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga      120

gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc      180

caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc      240

ctaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag      300

cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa      360

agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac      420

cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg      480

caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg      540

gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg      600

taaaacgacg gccagtgagc gcgcgtaata cgactcacta tagggcgaat tggagctgaa      660

gactctgcct ggaccttaag acccaggtgc agacccccca gggcatgaag gaaatcagca      720

acatccaagt gggcgacctg gtgctgagca acaccggcta caacgaggtg ctgaacgtgt      780

tccccaagag caagaagaag tcctacaaga tcaccctgga agatggcaaa gagatcatct      840

gctccgagga acacctgttc ccaacccaga ccggcgagat gaacatctct ggcggcctga      900

aagagggcat gtgcctgtac gtgaaagaag gcggcggagg acctgaggat aagctccagg      960

ccattaagta cgagctggcc cagaacgagg aagaactggc tcagatcgaa gagaagctgg     1020

ccgccaacaa agaaggcgga tccggcggag gcggatctgg aaccggtttt gctaatgagc     1080

tgggccccag actgatgggc aaaggcagcg gaggaggcgg aagcggagtc tttacactgg     1140

aagatttcgt cggcgactgg cggcagacag ctggctacaa tctggaccag gtgctggaac     1200

aaggcggcgt gtcctctctg tttcagaacc tgggagtgtc tgtgacccct atccagagaa     1260

tcgtgctgag cggcgagaac ggcctgaaga tcgacatcca cgtgatcatc ccttacgagg     1320

gcctgtccgg cgatcagatg ggacagatcg agaagatctt taaggtggtg taccccgtgg     1380

acgaccacca cttcaaagtg atcctgcact acggcaccct ggtcatcgat ggcgtgaccc     1440

caaacatgat cgactacttc ggcagaccct acgagggaat cgccgtgttc gacggcaaga     1500

aaatcaccgt gaccggcaca ctgtggaacg gcaacaagat catcgacgag agactgatca     1560

accccgacgg cagcctgctg ttcagagtga caatcaacgg cgtgacaggc tggcggctgt     1620

gcgaaagaat ccttgctggt tccggaggaa gttcctatac ttcaaataga ataggaactt     1680

cgcaggtaag tatccttttt acagcacaac ttaatgagac agatagaaac tggtcttgta     1740

gaaacagagt aggctagccc ccagctggtt ctttccgcct cagaagccat agagcccacc     1800

gcatccccag catgcctgct attctcttcc caatcctccc ccttgctgtc ctgccccacc     1860

ccacccccca gaatagaatg acacctactc agacaatgcg atgcaatttc ctcattttat     1920

taggaaagga cagtgggagt ggcaccttcc agggtcaagg aaggcacggg ggaggggcaa     1980

acaacagatg gctggcaact agaaggcaca gtcgaggctg atcagcgagc cgccggcgtc     2040

tagagaattg atcccctcag gcgccaggct ttctggtcat gcaccaggtt ctagggccct     2100

caggcacttc cacgtcggcg gtcacggtga agcccagtct ctcgtagaag ggcaggtttc     2160

tgggggcgct tgtttccagg aaggcgggca cgccagccct ttcagcagct tccaccccag     2220

gcagcaccac agcagatccc agtcccttgc cctggtggtc aggtgacacg cccacggtgg     2280

ccagaaacca ggcaggctct tttggtctgt ggggggccag caggccttcc atctgctgct     2340

gggcagccag tctagagccg ctcagctcgg ccattctagg tccgatctcg gcgaacacag     2400

cgccggcttc cacagactca ggggttgtcc acacagccac agcggcgcca tcatcggcca     2460

cccacacttt gccgatgtcc aggcccactc tggtcagaaa cagttcctgc agctcggtca     2520

ctctctcgat gtgccggtcg gggtccacgg tgtgtcttgt ggcagggtaa tcggcgaagg     2580

cagcggccag tgtccgcaca gctcttggca catcgtccct ggtggccagc cgcactgtgg     2640

gcttgtactc ggtcatggtg gcgcgccttt taggggtagt tttcacgaca cctgaaatgg     2700

aagaaaaaaa ctttgaacca ctgtctgagg cttgagaatg aaccaagatc caaactcaaa     2760

aagggcaaat tccaaggaga attacatcaa gtgccaagct ggcctaactt cagtctccac     2820

ccactcagtg tggggaaact ccatcgcata aaacccctcc ccccaaccta aagacgacgt     2880

actccaaaag ctcgagaact aatcgaggtg cctggacggc gcccggtact ccgtggagtc     2940

acatgaagcg acggctgagg acggaaaggc ccttttcctt tgtgtgggtg actcacccgc     3000

ccgctctccc gagcgccgcg tcctccattt tgagctccct gcagcagggc cgggaagcgg     3060

ccatctttcc gctcacgcaa ctggtgccga ccgggccagc cttgccgccc agggcggggc     3120

gatacacggc ggcgcgaggc caggcaccag agcaggccgg ccagcttgag actacccccg     3180

tccgattctc ggtggccgcg ctcgcaggcc ccgcctcgcc gaacatgtgc gctgggacgc     3240

acgggccccg tcgccgcccg cggccccaaa aaccgaaata ccagtgtgca gatcttggcc     3300

cgcatttaca agactatctt gccagaaaaa aagcgtcgca gcaggtcatc aaaaatttta     3360

aatggctaga gacttatcga aagcagcgag acaggcgcga aggtgccacc agattcgcac     3420

gcggcggccc cagcgcccag gccaggcctc aactcaagca cgaggcgaag gggctcctta     3480

agcgcaaggc ctcgaactct cccacccact tccaacccga agctcgggat caagaatcac     3540

gtactgcagc caggggcgtg gaagtaattc aaggcacgca agggccataa cccgtaaaga     3600

ggccaggccc gcgggaacca cacacggcac ttacctgtgt tctggcggca aacccgttgc     3660

gaaaaagaac gttcacggcg actactgcac ttatatacgg ttctccccca ccctcgggaa     3720

aaaggcggag ccagtacacg acatcacttt cccagtttac cccgcgccac cttctctagg     3780

caccggttca attgccgacc cctcccccca acttctcggg gactgtgggc gatgtgcgct     3840

ctgcccactg acgggcaccg gagccaattc gaatcgcctg cttttctgcc tggtactaac     3900

ttctctcccc tctcctcttt tctttttctg cagggcggcc gcggaagttc ctatacttca     3960

aatagaatag gaacttccgg cgggtcaccc gaggatgaga atgctgctct ggaagagaag     4020

atcgcccagc tgaagcagaa gaacgccgct ctgaaagaag agatccaggc tctggaatac     4080

ggaggcggag gcatgatgct gaagaagatc ctgaagatcg aagaactgga cgagcgcgag     4140

ctgatcgaca tcgaggtgtc cggcaaccac ctgttctacg ccaacgatat cctgacccac     4200

aactcgtctt cgtacccagc ttttgttccc tttagtgagg gttaattgcg cgcttggcgt     4260

aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt ccacacaaca     4320

tacgagccgg aagcataaag tgtaaagcct ggggtgccta atgagtgagc taactcacat     4380

taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt     4440

aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct tccgcttcct     4500

cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa     4560

aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa     4620

aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc     4680

tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga     4740

caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc     4800

cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt     4860

ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct     4920

gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg     4980

agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta     5040

gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct     5100

acactagaag gacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa     5160

gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt     5220

gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta     5280

cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat     5340

caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa     5400

gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct     5460

cagcgatctg tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta     5520

cgatacggga gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct     5580

caccggctcc agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg     5640

gtcctgcaac tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa     5700

gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt     5760

cacgctcgtc gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta     5820

catgatcccc catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca     5880

gaagtaagtt ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta     5940

ctgtcatgcc atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct     6000

gagaatagtg tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg     6060

cgccacatag cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac     6120

tctcaaggat cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact     6180

gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa     6240

atgccgcaaa aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt     6300

ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat     6360

gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccac        6417


<210>  49
<211>  6321
<212>  DNA
<213>  Artificial

<220>
<223>  Vector comprising split intein - heterologous polynucleotide 
       construct

<400>  49
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc       60

attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga      120

gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc      180

caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc      240

ctaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag      300

cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa      360

agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac      420

cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg      480

caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg      540

gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg      600

taaaacgacg gccagtgagc gcgcgtaata cgactcacta tagggcgaat tggagctgaa      660

gactctgcct ggaccttaag acccaggtgc agacccccca gggcatgaag gaaatcagca      720

acatccaagt gggcgacctg gtgctgagca acaccggcta caacgaggtg ctgaacgtgt      780

tccccaagag caagaagaag tcctacaaga tcaccctgga agatggcaaa gagatcatct      840

gctccgagga acacctgttc ccaacccaga ccggcgagat gaacatctct ggcggcctga      900

aagagggcat gtgcctgtac gtgaaagaag gcggcggagg acctgaggat aagctccagg      960

ccattaagta cgagctggcc cagaacgagg aagaactggc tcagatcgaa gagaagctgg     1020

ccgccaacaa agaaggcgga tccggcggag gtggaagcgg aggcggagga tctggtggtg     1080

gtggatctgc taagcccctg agccaagagg aaagcaccct gatcgagaga gccaccgcca     1140

ccatcaacag catccctatc agcgaggact acagcgtggc ctctgctgct ctgtctagcg     1200

acggcagaat cttcacaggc gtgaacgtgt accactttac aggcggccct tgtgccgaac     1260

tggtggtgct tggaacagcc gctgccgctg ctgctggaaa cctgacatgt atcgtggcta     1320

tcggcaacga gaacagaggc atcctgtctc catgcggcag atgcagacag gtcctgctcg     1380

atctgcaccc tggcatcaag gccatcgtga aggactctga cggccagcct acagccgtgg     1440

gaatcagaga actgctgcct agcggctacg tgtgggaagg tggtggcgga ggaagcggca     1500

caggatttgc taatgagctg ggccctagac tgatgggcaa aggctccgga ggaagttcct     1560

atacttcaaa tagaatagga acttcgcagg taagtatcct ttttacagca caacttaatg     1620

agacagatag aaactggtct tgtagaaaca gagtaggcta gcccccagct ggttctttcc     1680

gcctcagaag ccatagagcc caccgcatcc ccagcatgcc tgctattctc ttcccaatcc     1740

tcccccttgc tgtcctgccc caccccaccc cccagaatag aatgacacct actcagacaa     1800

tgcgatgcaa tttcctcatt ttattaggaa aggacagtgg gagtggcacc ttccagggtc     1860

aaggaaggca cgggggaggg gcaaacaaca gatggctggc aactagaagg cacagtcgag     1920

gctgatcagc gagccgccgg cgtctagaga attgatcccc tcaggcgcca ggctttctgg     1980

tcatgcacca ggttctaggg ccctcaggca cttccacgtc ggcggtcacg gtgaagccca     2040

gtctctcgta gaagggcagg tttctggggg cgcttgtttc caggaaggcg ggcacgccag     2100

ccctttcagc agcttccacc ccaggcagca ccacagcaga tcccagtccc ttgccctggt     2160

ggtcaggtga cacgcccacg gtggccagaa accaggcagg ctcttttggt ctgtgggggg     2220

ccagcaggcc ttccatctgc tgctgggcag ccagtctaga gccgctcagc tcggccattc     2280

taggtccgat ctcggcgaac acagcgccgg cttccacaga ctcaggggtt gtccacacag     2340

ccacagcggc gccatcatcg gccacccaca ctttgccgat gtccaggccc actctggtca     2400

gaaacagttc ctgcagctcg gtcactctct cgatgtgccg gtcggggtcc acggtgtgtc     2460

ttgtggcagg gtaatcggcg aaggcagcgg ccagtgtccg cacagctctt ggcacatcgt     2520

ccctggtggc cagccgcact gtgggcttgt actcggtcat ggtggcgcgc cttttagggg     2580

tagttttcac gacacctgaa atggaagaaa aaaactttga accactgtct gaggcttgag     2640

aatgaaccaa gatccaaact caaaaagggc aaattccaag gagaattaca tcaagtgcca     2700

agctggccta acttcagtct ccacccactc agtgtgggga aactccatcg cataaaaccc     2760

ctccccccaa cctaaagacg acgtactcca aaagctcgag aactaatcga ggtgcctgga     2820

cggcgcccgg tactccgtgg agtcacatga agcgacggct gaggacggaa aggccctttt     2880

cctttgtgtg ggtgactcac ccgcccgctc tcccgagcgc cgcgtcctcc attttgagct     2940

ccctgcagca gggccgggaa gcggccatct ttccgctcac gcaactggtg ccgaccgggc     3000

cagccttgcc gcccagggcg gggcgataca cggcggcgcg aggccaggca ccagagcagg     3060

ccggccagct tgagactacc cccgtccgat tctcggtggc cgcgctcgca ggccccgcct     3120

cgccgaacat gtgcgctggg acgcacgggc cccgtcgccg cccgcggccc caaaaaccga     3180

aataccagtg tgcagatctt ggcccgcatt tacaagacta tcttgccaga aaaaaagcgt     3240

cgcagcaggt catcaaaaat tttaaatggc tagagactta tcgaaagcag cgagacaggc     3300

gcgaaggtgc caccagattc gcacgcggcg gccccagcgc ccaggccagg cctcaactca     3360

agcacgaggc gaaggggctc cttaagcgca aggcctcgaa ctctcccacc cacttccaac     3420

ccgaagctcg ggatcaagaa tcacgtactg cagccagggg cgtggaagta attcaaggca     3480

cgcaagggcc ataacccgta aagaggccag gcccgcggga accacacacg gcacttacct     3540

gtgttctggc ggcaaacccg ttgcgaaaaa gaacgttcac ggcgactact gcacttatat     3600

acggttctcc cccaccctcg ggaaaaaggc ggagccagta cacgacatca ctttcccagt     3660

ttaccccgcg ccaccttctc taggcaccgg ttcaattgcc gacccctccc cccaacttct     3720

cggggactgt gggcgatgtg cgctctgccc actgacgggc accggagcca attcgaatcg     3780

cctgcttttc tgcctggtac taacttctct cccctctcct cttttctttt tctgcagggc     3840

ggccgcggaa gttcctatac ttcaaataga ataggaactt ccggcgggtc acccgaggat     3900

gagaatgctg ctctggaaga gaagatcgcc cagctgaagc agaagaacgc cgctctgaaa     3960

gaagagatcc aggctctgga atacggaggc ggaggcatga tgctgaagaa gatcctgaag     4020

atcgaagaac tggacgagcg cgagctgatc gacatcgagg tgtccggcaa ccacctgttc     4080

tacgccaacg atatcctgac ccacaactcg tcttcgtacc cagcttttgt tccctttagt     4140

gagggttaat tgcgcgcttg gcgtaatcat ggtcatagct gtttcctgtg tgaaattgtt     4200

atccgctcac aattccacac aacatacgag ccggaagcat aaagtgtaaa gcctggggtg     4260

cctaatgagt gagctaactc acattaattg cgttgcgctc actgcccgct ttccagtcgg     4320

gaaacctgtc gtgccagctg cattaatgaa tcggccaacg cgcggggaga ggcggtttgc     4380

gtattgggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc     4440

ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata     4500

acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg     4560

cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct     4620

caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa     4680

gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc     4740

tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt     4800

aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg     4860

ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg     4920

cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct     4980

tgaagtggtg gcctaactac ggctacacta gaaggacagt atttggtatc tgcgctctgc     5040

tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg     5100

ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc     5160

aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt     5220

aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa     5280

aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat     5340

gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct     5400

gactccccgt cgtgtagata actacgatac gggagggctt accatctggc cccagtgctg     5460

caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag     5520

ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc cagtctatta     5580

attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg     5640

ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg     5700

gttcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa gcggttagct     5760

ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca ctcatggtta     5820

tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt tctgtgactg     5880

gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc     5940

cggcgtcaat acgggataat accgcgccac atagcagaac tttaaaagtg ctcatcattg     6000

gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgttgaga tccagttcga     6060

tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg     6120

ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat     6180

gttgaatact catactcttc ctttttcaat attattgaag catttatcag ggttattgtc     6240

tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca     6300

catttccccg aaaagtgcca c                                               6321


<210>  50
<211>  7284
<212>  DNA
<213>  Artificial

<220>
<223>  Vector comprising split intein - heterologous polynucleotide 
       construct

<400>  50
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc       60

attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga      120

gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc      180

caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc      240

ctaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag      300

cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa      360

agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac      420

cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg      480

caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg      540

gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg      600

taaaacgacg gccagtgagc gcgcgtaata cgactcacta tagggcgaat tggagctgaa      660

gactctgcct ggaccttaag acccaggtgc agacccccca gggcatgaag gaaatcagca      720

acatccaagt gggcgacctg gtgctgagca acaccggcta caacgaggtg ctgaacgtgt      780

tccccaagag caagaagaag tcctacaaga tcaccctgga agatggcaaa gagatcatct      840

gctccgagga acacctgttc ccaacccaga ccggcgagat gaacatctct ggcggcctga      900

aagagggcat gtgcctgtac gtgaaagaag gcggcggagg acctgaggat aagctccagg      960

ccattaagta cgagctggcc cagaacgagg aagaactggc tcagatcgaa gagaagctgg     1020

ccgccaacaa agaaggcgga tccggcggag gcggatctgg aaccggtttt gctaatgagc     1080

tgggccccag actgatgggc aaaggcagcg gaggaggcgg aagcggacct cctaggaaga     1140

gatgttgttg cgctagaaga ggcacccagc tgatgctcgt gggcctgctg tctacagcta     1200

tgtgggctgg actgctggct ctgctgctgc tttggcattg ggagacggaa ggtggtggtg     1260

gatctggtgg cggaggctct gaaatcggca caggcttccc tttcgaccct cactacgtgg     1320

aagtgctggg cgagagaatg cactatgtgg atgtgggccc tagagatgga acccctgtgc     1380

tgtttctgca cggcaaccct accagctctt acgtgtggcg gaacatcatc cctcacgtgg     1440

cccctacaca cagagtgatc gcccctgatc tgatcggcat gggcaagagc gacaagcctg     1500

acctgggcta cttcttcgac gaccacgtgc ggttcatgga cgccttcatc gaggctctgg     1560

gactcgaaga ggtggtgctg gtcatccacg attggggctc tgctctgggc ttccactggg     1620

ccaagagaaa ccccgaaaga gtgaagggaa tcgccttcat ggagttcatc agacccattc     1680

ctacctggga cgagtggccc gagttcgcca gagagacatt ccaggccttc agaacaaccg     1740

acgtgggcag aaagctgatc atcgaccaga atgtgtttat cgagggcacc ctgcctatgg     1800

gcgtcgtcag acctctgacc gaggtggaaa tggaccacta cagagagcct tttctgaacc     1860

ccgtggatag agaacctctg tggcggttcc ctaacgagct gcctattgct ggcgagcccg     1920

ctaacattgt ggccctggtc gaagagtaca tggactggct gcatcagagc cccgtgccta     1980

agctgctgtt ttggggaact cccggcgtgc tgatccctcc tgctgaagct gctagactgg     2040

ctaagagcct gcctaacgct aaggccgtgg acatcggacc tggcctgaat ctgctgcaag     2100

aggataaccc cgacctgatc ggctctgaga tcgccagatg gctgagcaca ctggaaattt     2160

ctggcggtgg tggcggtagc ggtggcggtg gaagcgctca ccactttagc gagcccgaga     2220

tcaccctgat catcttcggc gtgatggccc tcgtgatcgg caccatcctg ctgatctctt     2280

acggcatcag acggctgatc aagaagtccc cctcaggcgg aggcggctct accggttccg     2340

gaggcagcgg cttctgctac gagaacgaag tcggcagtgg caggtccaga ttcgtgaaga     2400

aggacggcca ctgcaacgtg cagttcatca acgtcggaag cggcaagagc agaatcacct     2460

ctgagggcga gtacatccct ctggaccaga tcgatattaa tgtcggttcc ggaggaagtt     2520

cctatacttc aaatagaata ggaacttcgc aggtaagtat cctttttaca gcacaactta     2580

atgagacaga tagaaactgg tcttgtagaa acagagtagg ctagccccca gctggttctt     2640

tccgcctcag aagccataga gcccaccgca tccccagcat gcctgctatt ctcttcccaa     2700

tcctccccct tgctgtcctg ccccacccca ccccccagaa tagaatgaca cctactcaga     2760

caatgcgatg caatttcctc attttattag gaaaggacag tgggagtggc accttccagg     2820

gtcaaggaag gcacggggga ggggcaaaca acagatggct ggcaactaga aggcacagtc     2880

gaggctgatc agcgagccgc cggcgtctag agaattgatc ccctcaggcg ccaggctttc     2940

tggtcatgca ccaggttcta gggccctcag gcacttccac gtcggcggtc acggtgaagc     3000

ccagtctctc gtagaagggc aggtttctgg gggcgcttgt ttccaggaag gcgggcacgc     3060

cagccctttc agcagcttcc accccaggca gcaccacagc agatcccagt cccttgccct     3120

ggtggtcagg tgacacgccc acggtggcca gaaaccaggc aggctctttt ggtctgtggg     3180

gggccagcag gccttccatc tgctgctggg cagccagtct agagccgctc agctcggcca     3240

ttctaggtcc gatctcggcg aacacagcgc cggcttccac agactcaggg gttgtccaca     3300

cagccacagc ggcgccatca tcggccaccc acactttgcc gatgtccagg cccactctgg     3360

tcagaaacag ttcctgcagc tcggtcactc tctcgatgtg ccggtcgggg tccacggtgt     3420

gtcttgtggc agggtaatcg gcgaaggcag cggccagtgt ccgcacagct cttggcacat     3480

cgtccctggt ggccagccgc actgtgggct tgtactcggt catggtggcg cgccttttag     3540

gggtagtttt cacgacacct gaaatggaag aaaaaaactt tgaaccactg tctgaggctt     3600

gagaatgaac caagatccaa actcaaaaag ggcaaattcc aaggagaatt acatcaagtg     3660

ccaagctggc ctaacttcag tctccaccca ctcagtgtgg ggaaactcca tcgcataaaa     3720

cccctccccc caacctaaag acgacgtact ccaaaagctc gagaactaat cgaggtgcct     3780

ggacggcgcc cggtactccg tggagtcaca tgaagcgacg gctgaggacg gaaaggccct     3840

tttcctttgt gtgggtgact cacccgcccg ctctcccgag cgccgcgtcc tccattttga     3900

gctccctgca gcagggccgg gaagcggcca tctttccgct cacgcaactg gtgccgaccg     3960

ggccagcctt gccgcccagg gcggggcgat acacggcggc gcgaggccag gcaccagagc     4020

aggccggcca gcttgagact acccccgtcc gattctcggt ggccgcgctc gcaggccccg     4080

cctcgccgaa catgtgcgct gggacgcacg ggccccgtcg ccgcccgcgg ccccaaaaac     4140

cgaaatacca gtgtgcagat cttggcccgc atttacaaga ctatcttgcc agaaaaaaag     4200

cgtcgcagca ggtcatcaaa aattttaaat ggctagagac ttatcgaaag cagcgagaca     4260

ggcgcgaagg tgccaccaga ttcgcacgcg gcggccccag cgcccaggcc aggcctcaac     4320

tcaagcacga ggcgaagggg ctccttaagc gcaaggcctc gaactctccc acccacttcc     4380

aacccgaagc tcgggatcaa gaatcacgta ctgcagccag gggcgtggaa gtaattcaag     4440

gcacgcaagg gccataaccc gtaaagaggc caggcccgcg ggaaccacac acggcactta     4500

cctgtgttct ggcggcaaac ccgttgcgaa aaagaacgtt cacggcgact actgcactta     4560

tatacggttc tcccccaccc tcgggaaaaa ggcggagcca gtacacgaca tcactttccc     4620

agtttacccc gcgccacctt ctctaggcac cggttcaatt gccgacccct ccccccaact     4680

tctcggggac tgtgggcgat gtgcgctctg cccactgacg ggcaccggag ccaattcgaa     4740

tcgcctgctt ttctgcctgg tactaacttc tctcccctct cctcttttct ttttctgcag     4800

ggcggccgcg gaagttccta tacttcaaat agaataggaa cttccggcgg gtcacccgag     4860

gatgagaatg ctgctctgga agagaagatc gcccagctga agcagaagaa cgccgctctg     4920

aaagaagaga tccaggctct ggaatacgga ggcggaggca tgatgctgaa gaagatcctg     4980

aagatcgaag aactggacga gcgcgagctg atcgacatcg aggtgtccgg caaccacctg     5040

ttctacgcca acgatatcct gacccacaac tcgtcttcgt acccagcttt tgttcccttt     5100

agtgagggtt aattgcgcgc ttggcgtaat catggtcata gctgtttcct gtgtgaaatt     5160

gttatccgct cacaattcca cacaacatac gagccggaag cataaagtgt aaagcctggg     5220

gtgcctaatg agtgagctaa ctcacattaa ttgcgttgcg ctcactgccc gctttccagt     5280

cgggaaacct gtcgtgccag ctgcattaat gaatcggcca acgcgcgggg agaggcggtt     5340

tgcgtattgg gcgctcttcc gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc     5400

tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca gaatcagggg     5460

ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg     5520

ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac aaaaatcgac     5580

gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg     5640

gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac ctgtccgcct     5700

ttctcccttc gggaagcgtg gcgctttctc atagctcacg ctgtaggtat ctcagttcgg     5760

tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct     5820

gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac ttatcgccac     5880

tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt gctacagagt     5940

tcttgaagtg gtggcctaac tacggctaca ctagaaggac agtatttggt atctgcgctc     6000

tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc aaacaaacca     6060

ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga aaaaaaggat     6120

ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac gaaaactcac     6180

gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc cttttaaatt     6240

aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct gacagttacc     6300

aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca tccatagttg     6360

cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct ggccccagtg     6420

ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca ataaaccagc     6480

cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc atccagtcta     6540

ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg cgcaacgttg     6600

ttgccattgc tacaggcatc gtggtgtcac gctcgtcgtt tggtatggct tcattcagct     6660

ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa aaagcggtta     6720

gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg     6780

ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc ttttctgtga     6840

ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg agttgctctt     6900

gcccggcgtc aatacgggat aataccgcgc cacatagcag aactttaaaa gtgctcatca     6960

ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg agatccagtt     7020

cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc accagcgttt     7080

ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga     7140

aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat cagggttatt     7200

gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata ggggttccgc     7260

gcacatttcc ccgaaaagtg ccac                                            7284


<210>  51
<211>  8127
<212>  DNA
<213>  Artificial

<220>
<223>  Vector comprising split intein - heterologous polynucleotide 
       construct

<400>  51
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc       60

attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga      120

gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc      180

caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc      240

ctaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag      300

cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa      360

agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac      420

cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg      480

caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg      540

gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg      600

taaaacgacg gccagtgagc gcgcgtaata cgactcacta tagggcgaat tggagctgaa      660

gactctgcct ggaccttaag acccaggtgc agacccccca gggcatgaag gaaatcagca      720

acatccaagt gggcgacctg gtgctgagca acaccggcta caacgaggtg ctgaacgtgt      780

tccccaagag caagaagaag tcctacaaga tcaccctgga agatggcaaa gagatcatct      840

gctccgagga acacctgttc ccaacccaga ccggcgagat gaacatctct ggcggcctga      900

aagagggcat gtgcctgtac gtgaaagaag gcggcggagg acctgaggat aagctccagg      960

ccattaagta cgagctggcc cagaacgagg aagaactggc tcagatcgaa gagaagctgg     1020

ccgccaacaa agaaggcgga tccggcggag gcggatctgg aaccggtttt gctaatgagc     1080

tgggccccag actgatgggc aaaggcagcg gaggaggcgg aagcggacct cctaggaaga     1140

gatgttgttg cgctagaaga ggcacccagc tgatgctcgt gggcctgctg tctacagcta     1200

tgtgggctgg actgctggct ctgctgctgc tttggcattg ggagacggaa ggtggtggtg     1260

gatctggtgg cggaggtagc ggtggtggcg gtagcggagg cggtggatct agaaaacgta     1320

cccagcctac cttcggcttc accgtgaact ggaagttcag cgagagcacc accgtgttca     1380

ccggccagtg cttcatcgac agaaacggca aagaggtgct gaaaaccatg tggctgctga     1440

gaagcagcgt gaacgacatc ggcgacgact ggaaggccac cagagtgggc atcaacatct     1500

tcaccagact gaggacccag aaagagggcg gctctggcgg aagcgccaga aagtgtagcc     1560

tgaccggcaa gtggaccaac gacctgggca gcaacatgac catcggcgcc gtgaacagca     1620

gaggcgagtt cacaggcacc tacatcaccg ccgtgaccgc caccagcaac gagatcaaag     1680

agagccccct gcacggcacc cagaacacca tcaacaagag cggcggcagc acaacagtgt     1740

ttacaggaca gtgttttatc gaccggaatg ggaaagaagt gctgaaaaca atgtggctgc     1800

tgcggtcctc cgtgaacgac attggagatg attggaaagc tacacgagtg gggattaaca     1860

tttttacccg gctgcgcaca cagaaagaag ggggcagcgg cggctccgct agaaagtgtt     1920

ctctgactgg aaaatggaca aacgatctgg ggtccaatat gacaatcggg gcagtgaact     1980

ctaggggcga gtttaccgga acatatatta cagccgtgac agctacctct aacgaaatca     2040

aagagtctcc tctgcacggg acacagaata ccattaacaa aagaacccag cccacattcg     2100

ggtttacagt gaattggaaa ttctccgagg gcggcagcgg aagcggatct ggctctggat     2160

ctggcaggac acagcccacc tttggattca ctgtgaattg gaagttttct gagtctacca     2220

cagtgttcac tgggcagtgt ttcattgatc gcaatggaaa agaggtgctg aaaactatgt     2280

ggctgctgcg ctcaagtgtg aatgacatcg gggatgattg gaaggcaact cgcgtgggaa     2340

tcaatatctt tacacggctg agaactcaga aagagggggg aagcggaggc agcgcccgga     2400

aatgctctct gacagggaag tggactaatg atctgggctc taacatgact attggagctg     2460

tgaatagccg gggagagttc accgggactt atatcactgc tgtgactgcc acctcaaatg     2520

agatcaaaga atcccccctg catggaacac agaacactat taacaagtcc ggcggctcca     2580

caaccgtgtt cacagggcag tgctttattg accggaacgg caaagaggtg ctgaaaacaa     2640

tgtggctgct gcgaagctct gtgaatgata ttggggacga ctggaaagca actagagtgg     2700

ggatcaatat tttcactcgc ctgcggaccc agaaagaagg cggaagcgga ggatctgcca     2760

gaaagtgctc actgacaggc aaatggacaa atgacctggg gagtaatatg actattgggg     2820

ccgtgaacag tcgcggcgag tttactggga cttacattac cgcagtgaca gcaacatcca     2880

atgagatcaa agaaagtcct ctgcatggca ctcagaacac aatcaacaaa aggacccagc     2940

caacctttgg ctttaccgtg aattggaagt tctctgaagg cggcggagga tccggcggag     3000

ggggaagtgg cgggggaggc agtgggggcg gaggaagcgc tcaccacttt agcgagcccg     3060

agatcaccct gatcatcttc ggcgtgatgg ccctcgtgat cggcaccatc ctgctgatct     3120

cttacggcat cagacggctg atcaagaagt ccccctcagg cggaggcggc tctaccggtt     3180

ccggaggcag cggcttctgc tacgagaacg aagtcggcag tggcaggtcc agattcgtga     3240

agaaggacgg ccactgcaac gtgcagttca tcaacgtcgg aagcggcaag agcagaatca     3300

cctctgaggg cgagtacatc cctctggacc agatcgatat taatgtcggt tccggaggaa     3360

gttcctatac ttcaaataga ataggaactt cgcaggtaag tatccttttt acagcacaac     3420

ttaatgagac agatagaaac tggtcttgta gaaacagagt aggctagccc ccagctggtt     3480

ctttccgcct cagaagccat agagcccacc gcatccccag catgcctgct attctcttcc     3540

caatcctccc ccttgctgtc ctgccccacc ccacccccca gaatagaatg acacctactc     3600

agacaatgcg atgcaatttc ctcattttat taggaaagga cagtgggagt ggcaccttcc     3660

agggtcaagg aaggcacggg ggaggggcaa acaacagatg gctggcaact agaaggcaca     3720

gtcgaggctg atcagcgagc cgccggcgtc tagagaattg atcccctcag gcgccaggct     3780

ttctggtcat gcaccaggtt ctagggccct caggcacttc cacgtcggcg gtcacggtga     3840

agcccagtct ctcgtagaag ggcaggtttc tgggggcgct tgtttccagg aaggcgggca     3900

cgccagccct ttcagcagct tccaccccag gcagcaccac agcagatccc agtcccttgc     3960

cctggtggtc aggtgacacg cccacggtgg ccagaaacca ggcaggctct tttggtctgt     4020

ggggggccag caggccttcc atctgctgct gggcagccag tctagagccg ctcagctcgg     4080

ccattctagg tccgatctcg gcgaacacag cgccggcttc cacagactca ggggttgtcc     4140

acacagccac agcggcgcca tcatcggcca cccacacttt gccgatgtcc aggcccactc     4200

tggtcagaaa cagttcctgc agctcggtca ctctctcgat gtgccggtcg gggtccacgg     4260

tgtgtcttgt ggcagggtaa tcggcgaagg cagcggccag tgtccgcaca gctcttggca     4320

catcgtccct ggtggccagc cgcactgtgg gcttgtactc ggtcatggtg gcgcgccttt     4380

taggggtagt tttcacgaca cctgaaatgg aagaaaaaaa ctttgaacca ctgtctgagg     4440

cttgagaatg aaccaagatc caaactcaaa aagggcaaat tccaaggaga attacatcaa     4500

gtgccaagct ggcctaactt cagtctccac ccactcagtg tggggaaact ccatcgcata     4560

aaacccctcc ccccaaccta aagacgacgt actccaaaag ctcgagaact aatcgaggtg     4620

cctggacggc gcccggtact ccgtggagtc acatgaagcg acggctgagg acggaaaggc     4680

ccttttcctt tgtgtgggtg actcacccgc ccgctctccc gagcgccgcg tcctccattt     4740

tgagctccct gcagcagggc cgggaagcgg ccatctttcc gctcacgcaa ctggtgccga     4800

ccgggccagc cttgccgccc agggcggggc gatacacggc ggcgcgaggc caggcaccag     4860

agcaggccgg ccagcttgag actacccccg tccgattctc ggtggccgcg ctcgcaggcc     4920

ccgcctcgcc gaacatgtgc gctgggacgc acgggccccg tcgccgcccg cggccccaaa     4980

aaccgaaata ccagtgtgca gatcttggcc cgcatttaca agactatctt gccagaaaaa     5040

aagcgtcgca gcaggtcatc aaaaatttta aatggctaga gacttatcga aagcagcgag     5100

acaggcgcga aggtgccacc agattcgcac gcggcggccc cagcgcccag gccaggcctc     5160

aactcaagca cgaggcgaag gggctcctta agcgcaaggc ctcgaactct cccacccact     5220

tccaacccga agctcgggat caagaatcac gtactgcagc caggggcgtg gaagtaattc     5280

aaggcacgca agggccataa cccgtaaaga ggccaggccc gcgggaacca cacacggcac     5340

ttacctgtgt tctggcggca aacccgttgc gaaaaagaac gttcacggcg actactgcac     5400

ttatatacgg ttctccccca ccctcgggaa aaaggcggag ccagtacacg acatcacttt     5460

cccagtttac cccgcgccac cttctctagg caccggttca attgccgacc cctcccccca     5520

acttctcggg gactgtgggc gatgtgcgct ctgcccactg acgggcaccg gagccaattc     5580

gaatcgcctg cttttctgcc tggtactaac ttctctcccc tctcctcttt tctttttctg     5640

cagggcggcc gcggaagttc ctatacttca aatagaatag gaacttccgg cgggtcaccc     5700

gaggatgaga atgctgctct ggaagagaag atcgcccagc tgaagcagaa gaacgccgct     5760

ctgaaagaag agatccaggc tctggaatac ggaggcggag gcatgatgct gaagaagatc     5820

ctgaagatcg aagaactgga cgagcgcgag ctgatcgaca tcgaggtgtc cggcaaccac     5880

ctgttctacg ccaacgatat cctgacccac aactcgtctt cgtacccagc ttttgttccc     5940

tttagtgagg gttaattgcg cgcttggcgt aatcatggtc atagctgttt cctgtgtgaa     6000

attgttatcc gctcacaatt ccacacaaca tacgagccgg aagcataaag tgtaaagcct     6060

ggggtgccta atgagtgagc taactcacat taattgcgtt gcgctcactg cccgctttcc     6120

agtcgggaaa cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg     6180

gtttgcgtat tgggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc     6240

ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag     6300

gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa     6360

aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc     6420

gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc     6480

ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg     6540

cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg tatctcagtt     6600

cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc     6660

gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc     6720

cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag     6780

agttcttgaa gtggtggcct aactacggct acactagaag gacagtattt ggtatctgcg     6840

ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa     6900

ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag     6960

gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact     7020

cacgttaagg gattttggtc atgagattat caaaaaggat cttcacctag atccttttaa     7080

attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt     7140

accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag     7200

ttgcctgact ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca     7260

gtgctgcaat gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc     7320

agccagccgg aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt     7380

ctattaattg ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg     7440

ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca     7500

gctccggttc ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg     7560

ttagctcctt cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca     7620

tggttatggc agcactgcat aattctctta ctgtcatgcc atccgtaaga tgcttttctg     7680

tgactggtga gtactcaacc aagtcattct gagaatagtg tatgcggcga ccgagttgct     7740

cttgcccggc gtcaatacgg gataataccg cgccacatag cagaacttta aaagtgctca     7800

tcattggaaa acgttcttcg gggcgaaaac tctcaaggat cttaccgctg ttgagatcca     7860

gttcgatgta acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg     7920

tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac     7980

ggaaatgttg aatactcata ctcttccttt ttcaatatta ttgaagcatt tatcagggtt     8040

attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa ataggggttc     8100

cgcgcacatt tccccgaaaa gtgccac                                         8127


<210>  52
<211>  6573
<212>  DNA
<213>  Artificial

<220>
<223>  Vector comprising split intein - heterologous polynucleotide 
       construct

<400>  52
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc       60

attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga      120

gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc      180

caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc      240

ctaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag      300

cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa      360

agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac      420

cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg      480

caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg      540

gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg      600

taaaacgacg gccagtgagc gcgcgtaata cgactcacta tagggcgaat tggagctgaa      660

gactctgcct ggaccttaag acccaggtgc agacccccca gggcatgaag gaaatcagca      720

acatccaagt gggcgacctg gtgctgagca acaccggcta caacgaggtg ctgaacgtgt      780

tccccaagag caagaagaag tcctacaaga tcaccctgga agatggcaaa gagatcatct      840

gctccgagga acacctgttc ccaacccaga ccggcgagat gaacatctct ggcggcctga      900

aagagggcat gtgcctgtac gtgaaagaag gcggcggagg acctgaggat aagctccagg      960

ccattaagta cgagctggcc cagaacgagg aagaactggc tcagatcgaa gagaagctgg     1020

ccgccaacaa agaaggcgga tccgtgtcca agggcgaaga ggacaacatg gccagcctgc     1080

ctgccaccca cgagctgcac atcttcggca gcatcaacgg cgtggacttc gacatggtgg     1140

gacagggcac cggcaacccc aacgacggat acgaggaact gaacctgaag tccaccaagg     1200

gggacctcca gttcagcccc tggattctgg tgccccacat cggctacggc ttccaccagt     1260

acctgcccta ccctgacggc atgagccctt tccaggccgc tatggtggac ggctctggct     1320

accaggtgca cagaaccatg cagttcgagg acggcgccag cctgaccgtg aactacagat     1380

acacctacga gggcagccac atcaagggcg aggcccaagt gaagggcaca ggcttccctg     1440

ctgacggccc cgtgatgacc aactctctga cagccgccga ctggtgcaga agcaagaaaa     1500

cctaccctaa cgacaagacc atcatcagca ccttcaagtg gtcctacacc acaggcaacg     1560

gcaagagata cagaagcacc gccagaacca cctacacctt cgccaagccc atggccgcca     1620

actacctgaa gaaccagcct atgtacgtgt tccgaaagac cgagctgaag cacagcaaga     1680

cagaactgaa cttcaaagag tggcagaaag ccttcaccga cgtgatgggc atggacgagc     1740

tgtacaaggg aaccggtttc gccaacgagc tgggccccag actgatgggc aaaggctccg     1800

gaggaagttc ctatacttca aatagaatag gaacttcgca ggtaagtatc ctttttacag     1860

cacaacttaa tgagacagat agaaactggt cttgtagaaa cagagtaggc tagcccccag     1920

ctggttcttt ccgcctcaga agccatagag cccaccgcat ccccagcatg cctgctattc     1980

tcttcccaat cctccccctt gctgtcctgc cccaccccac cccccagaat agaatgacac     2040

ctactcagac aatgcgatgc aatttcctca ttttattagg aaaggacagt gggagtggca     2100

ccttccaggg tcaaggaagg cacgggggag gggcaaacaa cagatggctg gcaactagaa     2160

ggcacagtcg aggctgatca gcgagccgcc ggcgtctaga gaattgatcc cctcaggcgc     2220

caggctttct ggtcatgcac caggttctag ggccctcagg cacttccacg tcggcggtca     2280

cggtgaagcc cagtctctcg tagaagggca ggtttctggg ggcgcttgtt tccaggaagg     2340

cgggcacgcc agccctttca gcagcttcca ccccaggcag caccacagca gatcccagtc     2400

ccttgccctg gtggtcaggt gacacgccca cggtggccag aaaccaggca ggctcttttg     2460

gtctgtgggg ggccagcagg ccttccatct gctgctgggc agccagtcta gagccgctca     2520

gctcggccat tctaggtccg atctcggcga acacagcgcc ggcttccaca gactcagggg     2580

ttgtccacac agccacagcg gcgccatcat cggccaccca cactttgccg atgtccaggc     2640

ccactctggt cagaaacagt tcctgcagct cggtcactct ctcgatgtgc cggtcggggt     2700

ccacggtgtg tcttgtggca gggtaatcgg cgaaggcagc ggccagtgtc cgcacagctc     2760

ttggcacatc gtccctggtg gccagccgca ctgtgggctt gtactcggtc atggtggcgc     2820

gccttttagg ggtagttttc acgacacctg aaatggaaga aaaaaacttt gaaccactgt     2880

ctgaggcttg agaatgaacc aagatccaaa ctcaaaaagg gcaaattcca aggagaatta     2940

catcaagtgc caagctggcc taacttcagt ctccacccac tcagtgtggg gaaactccat     3000

cgcataaaac ccctcccccc aacctaaaga cgacgtactc caaaagctcg agaactaatc     3060

gaggtgcctg gacggcgccc ggtactccgt ggagtcacat gaagcgacgg ctgaggacgg     3120

aaaggccctt ttcctttgtg tgggtgactc acccgcccgc tctcccgagc gccgcgtcct     3180

ccattttgag ctccctgcag cagggccggg aagcggccat ctttccgctc acgcaactgg     3240

tgccgaccgg gccagccttg ccgcccaggg cggggcgata cacggcggcg cgaggccagg     3300

caccagagca ggccggccag cttgagacta cccccgtccg attctcggtg gccgcgctcg     3360

caggccccgc ctcgccgaac atgtgcgctg ggacgcacgg gccccgtcgc cgcccgcggc     3420

cccaaaaacc gaaataccag tgtgcagatc ttggcccgca tttacaagac tatcttgcca     3480

gaaaaaaagc gtcgcagcag gtcatcaaaa attttaaatg gctagagact tatcgaaagc     3540

agcgagacag gcgcgaaggt gccaccagat tcgcacgcgg cggccccagc gcccaggcca     3600

ggcctcaact caagcacgag gcgaaggggc tccttaagcg caaggcctcg aactctccca     3660

cccacttcca acccgaagct cgggatcaag aatcacgtac tgcagccagg ggcgtggaag     3720

taattcaagg cacgcaaggg ccataacccg taaagaggcc aggcccgcgg gaaccacaca     3780

cggcacttac ctgtgttctg gcggcaaacc cgttgcgaaa aagaacgttc acggcgacta     3840

ctgcacttat atacggttct cccccaccct cgggaaaaag gcggagccag tacacgacat     3900

cactttccca gtttaccccg cgccaccttc tctaggcacc ggttcaattg ccgacccctc     3960

cccccaactt ctcggggact gtgggcgatg tgcgctctgc ccactgacgg gcaccggagc     4020

caattcgaat cgcctgcttt tctgcctggt actaacttct ctcccctctc ctcttttctt     4080

tttctgcagg gcggccgcgg aagttcctat acttcaaata gaataggaac ttccggcggg     4140

tcacccgagg atgagaatgc tgctctggaa gagaagatcg cccagctgaa gcagaagaac     4200

gccgctctga aagaagagat ccaggctctg gaatacggag gcggaggcat gatgctgaag     4260

aagatcctga agatcgaaga actggacgag cgcgagctga tcgacatcga ggtgtccggc     4320

aaccacctgt tctacgccaa cgatatcctg acccacaact cgtcttcgta cccagctttt     4380

gttcccttta gtgagggtta attgcgcgct tggcgtaatc atggtcatag ctgtttcctg     4440

tgtgaaattg ttatccgctc acaattccac acaacatacg agccggaagc ataaagtgta     4500

aagcctgggg tgcctaatga gtgagctaac tcacattaat tgcgttgcgc tcactgcccg     4560

ctttccagtc gggaaacctg tcgtgccagc tgcattaatg aatcggccaa cgcgcgggga     4620

gaggcggttt gcgtattggg cgctcttccg cttcctcgct cactgactcg ctgcgctcgg     4680

tcgttcggct gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag     4740

aatcagggga taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc     4800

gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca     4860

aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt     4920

ttccccctgg aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc     4980

tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc     5040

tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc     5100

ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact     5160

tatcgccact ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg     5220

ctacagagtt cttgaagtgg tggcctaact acggctacac tagaaggaca gtatttggta     5280

tctgcgctct gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca     5340

aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa     5400

aaaaaggatc tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg     5460

aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc     5520

ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg     5580

acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat     5640

ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc ttaccatctg     5700

gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa     5760

taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta tccgcctcca     5820

tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt aatagtttgc     5880

gcaacgttgt tgccattgct acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt     5940

cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa     6000

aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat     6060

cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct     6120

tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga     6180

gttgctcttg cccggcgtca atacgggata ataccgcgcc acatagcaga actttaaaag     6240

tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga     6300

gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca     6360

ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg     6420

cgacacggaa atgttgaata ctcatactct tcctttttca atattattga agcatttatc     6480

agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag     6540

gggttccgcg cacatttccc cgaaaagtgc cac                                  6573


<210>  53
<211>  4052
<212>  DNA
<213>  Artificial

<220>
<223>  Vector comprising split intein - heterologous polynucleotide 
       construct

<400>  53
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc       60

attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga      120

gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc      180

caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc      240

ctaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag      300

cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa      360

agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac      420

cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg      480

caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg      540

gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg      600

taaaacgacg gccagtgagc gcgcgtaata cgactcacta tagggcgaat tggagctgaa      660

gactctgcct ggaccttaag acccaggtgc agacccccca gggcatgaag gaaatcagca      720

acatccaagt gggcgacctg gtgctgagca acaccggcta caacgaggtg ctgaacgtgt      780

tccccaagag caagaagaag tcctacaaga tcaccctgga agatggcaaa gagatcatct      840

gctccgagga acacctgttc ccaacccaga ccggcgagat gaacatctct ggcggcctga      900

aagagggcat gtgcctgtac gtgaaagaag gcggcggagg atccggcgga ggcggaagcg      960

gtggcggtgg aagcggaggt ggcggatctg gacttgtgcc tgagctgaac gagaaggacg     1020

acgaccaggt ccagaaggcc ctggcctcca gagaaaacac ccagctgatg aacagagaca     1080

acatcgagat caccgtgcgg gacttcaaga cactggcccc gagaagatgg ctgaacagcg     1140

gcatcatcag ctttttcatg aagtacatcg agaagtctac ccctaacacc gtggccttca     1200

acagcttctt ctacaccaac ctgagcgaga ggggctacca gggcgttaga cggtggatga     1260

agagaaagaa aacccagatc gacaagctgg acaagatctt cacccctatc aacctgaacc     1320

agagccactg ggccctgggc atcatcgacc tgaagaagaa aacaatcggc tacgtggaca     1380

gcctgagcaa cggccctaac gccatgtctt tcgccatcct gaccgacctc cagaaatacg     1440

tgatggaaga gagcaagcac accatcggcg aggacttcga cctgatccac ctggactgtc     1500

cccagcagcc taacggctac gactgtggca tctacgtgtg catgaacacc ctgtacggca     1560

gcgccgatgc tcccctggac ttcgattaca aggacgccat cagaatgagg cggtttatcg     1620

cccacctgat cctgacagac gccctgaaag gtggtggtgg ttctggcacc ggtttcgcca     1680

acgagctggg ccccagactg atgggcaaag gctccggagg cggaggcatg atgctgaaga     1740

agatcctgaa gatcgaagaa ctggacgagc gcgagctgat cgacatcgag gtgtccggca     1800

accacctgtt ctacgccaac gatatcctga cccacaactc gtcttcgtac ccagcttttg     1860

ttccctttag tgagggttaa ttgcgcgctt ggcgtaatca tggtcatagc tgtttcctgt     1920

gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca taaagtgtaa     1980

agcctggggt gcctaatgag tgagctaact cacattaatt gcgttgcgct cactgcccgc     2040

tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac gcgcggggag     2100

aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt     2160

cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga     2220

atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg     2280

taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa     2340

aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt     2400

tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct     2460

gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct     2520

cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc     2580

cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt     2640

atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc     2700

tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag tatttggtat     2760

ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa     2820

acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa     2880

aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga     2940

aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct     3000

tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga     3060

cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc     3120

catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg     3180

ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat     3240

aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat     3300

ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg     3360

caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc     3420

attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa     3480

agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc     3540

actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt     3600

ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag     3660

ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt     3720

gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag     3780

atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac     3840

cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc     3900

gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca     3960

gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg     4020

ggttccgcgc acatttcccc gaaaagtgcc ac                                   4052


<210>  54
<211>  6636
<212>  DNA
<213>  Artificial

<220>
<223>  Vector comprising split intein - heterologous polynucleotide 
       construct

<400>  54
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc       60

attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga      120

gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc      180

caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc      240

ctaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag      300

cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa      360

agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac      420

cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg      480

caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg      540

gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg      600

taaaacgacg gccagtgagc gcgcgtaata cgactcacta tagggcgaat tggagctgaa      660

gactctgcct ggaccttaag acccaggtgc agacccccca gggcatgaag gaaatcagca      720

acatccaagt gggcgacctg gtgctgagca acaccggcta caacgaggtg ctgaacgtgt      780

tccccaagag caagaagaag tcctacaaga tcaccctgga agatggcaaa gagatcatct      840

gctccgagga acacctgttc ccaacccaga ccggcgagat gaacatctct ggcggcctga      900

aagagggcat gtgcctgtac gtgaaagaag gcggcggagg acctgaggat aagctccagg      960

ccattaagta cgagctggcc cagaacgagg aagaactggc tcagatcgaa gagaagctgg     1020

ccgccaacaa agaaggcgga tccggcggag gcggaagcgg tggcggtgga agcggaggtg     1080

gcggatctgg cgaatctctg ttcaagggcc ccagagacta caaccccatc agcagcacca     1140

tctgccacct gaccaacgag tctgacggcc acaccacaag cctgtacggc atcggcttcg     1200

gccccttcat catcaccaac aagcacctgt tcagacggaa caacggcacc ctggtggtgc     1260

agtctctgca cggcgtgttc aaagtgaaga acaccaccac actccagcag catctgatcg     1320

acggcagaga catgatcatc atcagaatgc ccaaggactt cccgcctttt ccacagaagc     1380

tgaagttcag agagcctcag agagaggaac ggatctgcct cgtgaccacc aacttccaga     1440

ccaagagcat gagcagcatg gtgtccgaca caagctgcac attccctagc ggcgacggca     1500

tcttctggaa gcactggatt cagaccaagg acggccagtg tggaagccct ctggtgtcta     1560

ccagagatgg cttcatcgtg ggcatccaca gcgccagcaa cttcaccaac acaaacaact     1620

acttcaccag cgtgccgaag aacttcatgg aactgctgac caatcaagag gcccagcagt     1680

gggtttcagg ctggcggctg aatgccgatt ctgtgctgtg gggaggccac aaggtgttca     1740

tggtcaagcc cgaggaaccc ttccagcctg tgaaagaggc cacacagctg atgaatggtg     1800

gcggaggttc tggcaccggt ttcgccaacg agctgggccc cagactgatg ggcaaaggct     1860

ccggaggaag ttcctatact tcaaatagaa taggaacttc gcaggtaagt atccttttta     1920

cagcacaact taatgagaca gatagaaact ggtcttgtag aaacagagta ggctagcccc     1980

cagctggttc tttccgcctc agaagccata gagcccaccg catccccagc atgcctgcta     2040

ttctcttccc aatcctcccc cttgctgtcc tgccccaccc caccccccag aatagaatga     2100

cacctactca gacaatgcga tgcaatttcc tcattttatt aggaaaggac agtgggagtg     2160

gcaccttcca gggtcaagga aggcacgggg gaggggcaaa caacagatgg ctggcaacta     2220

gaaggcacag tcgaggctga tcagcgagcc gccggcgtct agagaattga tcccctcagg     2280

cgccaggctt tctggtcatg caccaggttc tagggccctc aggcacttcc acgtcggcgg     2340

tcacggtgaa gcccagtctc tcgtagaagg gcaggtttct gggggcgctt gtttccagga     2400

aggcgggcac gccagccctt tcagcagctt ccaccccagg cagcaccaca gcagatccca     2460

gtcccttgcc ctggtggtca ggtgacacgc ccacggtggc cagaaaccag gcaggctctt     2520

ttggtctgtg gggggccagc aggccttcca tctgctgctg ggcagccagt ctagagccgc     2580

tcagctcggc cattctaggt ccgatctcgg cgaacacagc gccggcttcc acagactcag     2640

gggttgtcca cacagccaca gcggcgccat catcggccac ccacactttg ccgatgtcca     2700

ggcccactct ggtcagaaac agttcctgca gctcggtcac tctctcgatg tgccggtcgg     2760

ggtccacggt gtgtcttgtg gcagggtaat cggcgaaggc agcggccagt gtccgcacag     2820

ctcttggcac atcgtccctg gtggccagcc gcactgtggg cttgtactcg gtcatggtgg     2880

cgcgcctttt aggggtagtt ttcacgacac ctgaaatgga agaaaaaaac tttgaaccac     2940

tgtctgaggc ttgagaatga accaagatcc aaactcaaaa agggcaaatt ccaaggagaa     3000

ttacatcaag tgccaagctg gcctaacttc agtctccacc cactcagtgt ggggaaactc     3060

catcgcataa aacccctccc cccaacctaa agacgacgta ctccaaaagc tcgagaacta     3120

atcgaggtgc ctggacggcg cccggtactc cgtggagtca catgaagcga cggctgagga     3180

cggaaaggcc cttttccttt gtgtgggtga ctcacccgcc cgctctcccg agcgccgcgt     3240

cctccatttt gagctccctg cagcagggcc gggaagcggc catctttccg ctcacgcaac     3300

tggtgccgac cgggccagcc ttgccgccca gggcggggcg atacacggcg gcgcgaggcc     3360

aggcaccaga gcaggccggc cagcttgaga ctacccccgt ccgattctcg gtggccgcgc     3420

tcgcaggccc cgcctcgccg aacatgtgcg ctgggacgca cgggccccgt cgccgcccgc     3480

ggccccaaaa accgaaatac cagtgtgcag atcttggccc gcatttacaa gactatcttg     3540

ccagaaaaaa agcgtcgcag caggtcatca aaaattttaa atggctagag acttatcgaa     3600

agcagcgaga caggcgcgaa ggtgccacca gattcgcacg cggcggcccc agcgcccagg     3660

ccaggcctca actcaagcac gaggcgaagg ggctccttaa gcgcaaggcc tcgaactctc     3720

ccacccactt ccaacccgaa gctcgggatc aagaatcacg tactgcagcc aggggcgtgg     3780

aagtaattca aggcacgcaa gggccataac ccgtaaagag gccaggcccg cgggaaccac     3840

acacggcact tacctgtgtt ctggcggcaa acccgttgcg aaaaagaacg ttcacggcga     3900

ctactgcact tatatacggt tctcccccac cctcgggaaa aaggcggagc cagtacacga     3960

catcactttc ccagtttacc ccgcgccacc ttctctaggc accggttcaa ttgccgaccc     4020

ctccccccaa cttctcgggg actgtgggcg atgtgcgctc tgcccactga cgggcaccgg     4080

agccaattcg aatcgcctgc ttttctgcct ggtactaact tctctcccct ctcctctttt     4140

ctttttctgc agggcggccg cggaagttcc tatacttcaa atagaatagg aacttccggc     4200

gggtcacccg aggatgagaa tgctgctctg gaagagaaga tcgcccagct gaagcagaag     4260

aacgccgctc tgaaagaaga gatccaggct ctggaatacg gaggcggagg catgatgctg     4320

aagaagatcc tgaagatcga agaactggac gagcgcgagc tgatcgacat cgaggtgtcc     4380

ggcaaccacc tgttctacgc caacgatatc ctgacccaca actcgtcttc gtacccagct     4440

tttgttccct ttagtgaggg ttaattgcgc gcttggcgta atcatggtca tagctgtttc     4500

ctgtgtgaaa ttgttatccg ctcacaattc cacacaacat acgagccgga agcataaagt     4560

gtaaagcctg gggtgcctaa tgagtgagct aactcacatt aattgcgttg cgctcactgc     4620

ccgctttcca gtcgggaaac ctgtcgtgcc agctgcatta atgaatcggc caacgcgcgg     4680

ggagaggcgg tttgcgtatt gggcgctctt ccgcttcctc gctcactgac tcgctgcgct     4740

cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca     4800

cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga     4860

accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc     4920

acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg     4980

cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat     5040

acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt     5100

atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc     5160

agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg     5220

acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg     5280

gtgctacaga gttcttgaag tggtggccta actacggcta cactagaagg acagtatttg     5340

gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg     5400

gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca     5460

gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga     5520

acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga     5580

tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt     5640

ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt     5700

catccatagt tgcctgactc cccgtcgtgt agataactac gatacgggag ggcttaccat     5760

ctggccccag tgctgcaatg ataccgcgag acccacgctc accggctcca gatttatcag     5820

caataaacca gccagccgga agggccgagc gcagaagtgg tcctgcaact ttatccgcct     5880

ccatccagtc tattaattgt tgccgggaag ctagagtaag tagttcgcca gttaatagtt     5940

tgcgcaacgt tgttgccatt gctacaggca tcgtggtgtc acgctcgtcg tttggtatgg     6000

cttcattcag ctccggttcc caacgatcaa ggcgagttac atgatccccc atgttgtgca     6060

aaaaagcggt tagctccttc ggtcctccga tcgttgtcag aagtaagttg gccgcagtgt     6120

tatcactcat ggttatggca gcactgcata attctcttac tgtcatgcca tccgtaagat     6180

gcttttctgt gactggtgag tactcaacca agtcattctg agaatagtgt atgcggcgac     6240

cgagttgctc ttgcccggcg tcaatacggg ataataccgc gccacatagc agaactttaa     6300

aagtgctcat cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc ttaccgctgt     6360

tgagatccag ttcgatgtaa cccactcgtg cacccaactg atcttcagca tcttttactt     6420

tcaccagcgt ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa aagggaataa     6480

gggcgacacg gaaatgttga atactcatac tcttcctttt tcaatattat tgaagcattt     6540

atcagggtta ttgtctcatg agcggataca tatttgaatg tatttagaaa aataaacaaa     6600

taggggttcc gcgcacattt ccccgaaaag tgccac                               6636


<210>  55
<211>  4037
<212>  DNA
<213>  Artificial

<220>
<223>  Vector comprising split intein - heterologous polynucleotide 
       construct

<400>  55
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc       60

attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga      120

gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc      180

caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc      240

ctaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag      300

cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa      360

agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac      420

cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg      480

caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg      540

gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg      600

taaaacgacg gccagtgagc gcgcgtaata cgactcacta tagggcgaat tggagctgaa      660

gactctgcct ggaccttaag acccaggtgc agacccccca gggcatgaag gaaatcagca      720

acatccaagt gggcgacctg gtgctgagca acaccggcta caacgaggtg ctgaacgtgt      780

tccccaagag caagaagaag tcctacaaga tcaccctgga agatggcaaa gagatcatct      840

gctccgagga acacctgttc ccaacccaga ccggcgagat gaacatctct ggcggcctga      900

aagagggcat gtgcctgtac gtgaaagaag gcggcggagg atccgtgtcc aagggcgaag      960

aggacaacat ggccagcctg cctgccaccc acgagctgca catcttcggc agcatcaacg     1020

gcgtggactt cgacatggtg ggacagggca ccggcaaccc caacgacgga tacgaggaac     1080

tgaacctgaa gtccaccaag ggggacctcc agttcagccc ctggattctg gtgccccaca     1140

tcggctacgg cttccaccag tacctgccct accctgacgg catgagccct ttccaggccg     1200

ctatggtgga cggctctggc taccaggtgc acagaaccat gcagttcgag gacggcgcca     1260

gcctgaccgt gaactacaga tacacctacg agggcagcca catcaagggc gaggcccaag     1320

tgaagggcac aggcttccct gctgacggcc ccgtgatgac caactctctg acagccgccg     1380

actggtgcag aagcaagaaa acctacccta acgacaagac catcatcagc accttcaagt     1440

ggtcctacac cacaggcaac ggcaagagat acagaagcac cgccagaacc acctacacct     1500

tcgccaagcc catggccgcc aactacctga agaaccagcc tatgtacgtg ttccgaaaga     1560

ccgagctgaa gcacagcaag acagaactga acttcaaaga gtggcagaaa gccttcaccg     1620

acgtgatggg catggacgag ctgtacaagg gaaccggttt cgccaacgag ctgggcccca     1680

gactgatggg caaaggctcc ggaggcggag gcatgatgct gaagaagatc ctgaagatcg     1740

aagaactgga cgagcgcgag ctgatcgaca tcgaggtgtc cggcaaccac ctgttctacg     1800

ccaacgatat cctgacccac aactcgtctt cgtacccagc ttttgttccc tttagtgagg     1860

gttaattgcg cgcttggcgt aatcatggtc atagctgttt cctgtgtgaa attgttatcc     1920

gctcacaatt ccacacaaca tacgagccgg aagcataaag tgtaaagcct ggggtgccta     1980

atgagtgagc taactcacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa     2040

cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat     2100

tgggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg     2160

agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag gggataacgc     2220

aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt     2280

gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc gacgctcaag     2340

tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc ctggaagctc     2400

cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg cctttctccc     2460

ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg tatctcagtt cggtgtaggt     2520

cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc gctgcgcctt     2580

atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc cactggcagc     2640

agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag agttcttgaa     2700

gtggtggcct aactacggct acactagaag gacagtattt ggtatctgcg ctctgctgaa     2760

gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg     2820

tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag gatctcaaga     2880

agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact cacgttaagg     2940

gattttggtc atgagattat caaaaaggat cttcacctag atccttttaa attaaaaatg     3000

aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt accaatgctt     3060

aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag ttgcctgact     3120

ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca gtgctgcaat     3180

gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc agccagccgg     3240

aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt ctattaattg     3300

ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat     3360

tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca gctccggttc     3420

ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg ttagctcctt     3480

cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca tggttatggc     3540

agcactgcat aattctctta ctgtcatgcc atccgtaaga tgcttttctg tgactggtga     3600

gtactcaacc aagtcattct gagaatagtg tatgcggcga ccgagttgct cttgcccggc     3660

gtcaatacgg gataataccg cgccacatag cagaacttta aaagtgctca tcattggaaa     3720

acgttcttcg gggcgaaaac tctcaaggat cttaccgctg ttgagatcca gttcgatgta     3780

acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg tttctgggtg     3840

agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac ggaaatgttg     3900

aatactcata ctcttccttt ttcaatatta ttgaagcatt tatcagggtt attgtctcat     3960

gagcggatac atatttgaat gtatttagaa aaataaacaa ataggggttc cgcgcacatt     4020

tccccgaaaa gtgccac                                                    4037


<210>  56
<211>  8644
<212>  DNA
<213>  Artificial

<220>
<223>  Vector comprising split intein - heterologous polynucleotide 
       construct

<400>  56
ggcgcgccgg attcgacatt gattattgac tagttattaa tagtaatcaa ttacggggtc       60

attagttcat agcccatata tggagttccg cgttacataa cttacggtaa atggcccgcc      120

tggctgaccg cccaacgacc cccgcccatt gacgtcaata atgacgtatg ttcccatagt      180

aacgccaata gggactttcc attgacgtca atgggtggag tatttacggt aaactgccca      240

cttggcagta catcaagtgt atcatatgcc aagtacgccc cctattgacg tcaatgacgg      300

taaatggccc gcctggcatt atgcccagta catgacctta tgggactttc ctacttggca      360

gtacatctac gtattagtca tcgctattac catggtcgag gtgagcccca cgttctgctt      420

cactctcccc atctcccccc cctccccacc cccaattttg tatttattta ttttttaatt      480

attttgtgca gcgatggggg cggggggggg gggggggcgc gcgccaggcg gggcggggcg      540

gggcgagggg cggggcgggg cgaggcggag aggtgcggcg gcagccaatc agagcggcgc      600

gctccgaaag tttcctttta tggcgaggcg gcggcggcgg cggccctata aaaagcgaag      660

cgcgcggcgg gcgggagtcg ctgcgtcgcg ccttcgcccc gtgccccgct ccgccgccgc      720

ctcgcgccgc ccgccccggc tctgactgac cgcgttactc ccacaggtga gcgggcggga      780

cggcccttct cctccgggct gtaattagcg cttggtttaa tgacggctcg tttcttttct      840

gtggctgcgt gaaagcctta aagggctccg ggagggccct ttgtgcgggg gggagcggct      900

cggggggtgc gtgcgtgtgt gtgtgcgtgg ggagcgccgc gtgcggcccg cgctgcccgg      960

cggctgtgag cgctgcgggc gcggcgcggg gctttgtgcg ctccgcgtgt gcgcgagggg     1020

agcgcggccg ggggcggtgc cccgcggtgc gggggggctg cgaggggaac aaaggctgcg     1080

tgcggggtgt gtgcgtgggg gggtgagcag ggggtgtggg cgcggcggtc gggctgtaac     1140

ccccccctgc acccccctcc ccgagttgct gagcacggcc cggcttcggg tgcggggctc     1200

cgtgcggggc gtggcgcggg gctcgccgtg ccgggcgggg ggtggcggca ggtgggggtg     1260

ccgggcgggg cggggccgcc tcgggccggg gagggctcgg gggaggggcg cggcggcccc     1320

ggagcgccgg cggctgtcga ggcgcggcga gccgcagcca ttgcctttta tggtaatcgt     1380

gcgagagggc gcagggactt cctttgtccc aaatctggcg gagccgaaat ctgggaggcg     1440

ccgccgcacc ccctctagcg ggcgcgggcg aagcggtgcg gcgccggcag gaaggaaatg     1500

ggcggggagg gccttcgtgc gtcgccgcgc cgccgtcccc ttctccatct ccagcctcgg     1560

ggctgccgca gggggacggc tgccttcggg ggggacgggg cagggcgggg ttcggcttct     1620

ggcgtgtgac cggcggctct agagcctctg ctaaccatgt tcatgccttc ttctttttcc     1680

tacagatcct taattaataa tacgactcac tataggggcc gccaccatga caccacctaa     1740

gaagaaacgg aaggtcgagg acggcgaggg ccctgctgct aagagagtga aactggactc     1800

cggagtgtcc aagggcgaag aggacaacat ggccagcctg cctgccaccc acgagctgca     1860

catcttcggc agcatcaacg gcgtggactt cgacatggtg ggacagggca ccggcaaccc     1920

caacgacgga tacgaggaac tgaacctgaa gtccaccaag ggggacctcc agttcagccc     1980

ctggattctg gtgccccaca tcggctacgg cttccaccag tacctgccct accctgacgg     2040

catgagccct ttccaggccg ctatggtgga cggctgcctg gaccttaaga cccaggtgca     2100

gaccccccag ggcatgaagg aaatcagcaa catccaagtg ggcgacctgg tgctgagcaa     2160

caccggctac aacgaggtgc tgaacgtgtt ccccaagagc aagaagaagt cctacaagat     2220

caccctggaa gatggcaaag agatcatctg ctccgaggaa cacctgttcc caacccagac     2280

cggcgagatg aacatctctg gcggcctgaa agagggcatg tgcctgtacg tgaaagaagg     2340

cggcggagga cctgaggata agctccaggc cattaagtac gagctggccc agaacgagga     2400

agaactggct cagatcgaag agaagctggc cgccaacaaa gaaggcggat ccggcggagg     2460

cggatctgga accggttttg ctaatgagct gggccccaga ctgatgggca aaggcagcgg     2520

aggaggcgga agcggacctc ctaggaagag atgttgttgc gctagaagag gcacccagct     2580

gatgctcgtg ggcctgctgt ctacagctat gtgggctgga ctgctggctc tgctgctgct     2640

ttggcattgg gagacggaag gtggtggtgg atctggtggc ggaggtagcg gtggtggcgg     2700

tagcggaggc ggtggatcta gaaaacgtac ccagcctacc ttcggcttca ccgtgaactg     2760

gaagttcagc gagagcacca ccgtgttcac cggccagtgc ttcatcgaca gaaacggcaa     2820

agaggtgctg aaaaccatgt ggctgctgag aagcagcgtg aacgacatcg gcgacgactg     2880

gaaggccacc agagtgggca tcaacatctt caccagactg aggacccaga aagagggcgg     2940

ctctggcgga agcgccagaa agtgtagcct gaccggcaag tggaccaacg acctgggcag     3000

caacatgacc atcggcgccg tgaacagcag aggcgagttc acaggcacct acatcaccgc     3060

cgtgaccgcc accagcaacg agatcaaaga gagccccctg cacggcaccc agaacaccat     3120

caacaagagc ggcggcagca caacagtgtt tacaggacag tgttttatcg accggaatgg     3180

gaaagaagtg ctgaaaacaa tgtggctgct gcggtcctcc gtgaacgaca ttggagatga     3240

ttggaaagct acacgagtgg ggattaacat ttttacccgg ctgcgcacac agaaagaagg     3300

gggcagcggc ggctccgcta gaaagtgttc tctgactgga aaatggacaa acgatctggg     3360

gtccaatatg acaatcgggg cagtgaactc taggggcgag tttaccggaa catatattac     3420

agccgtgaca gctacctcta acgaaatcaa agagtctcct ctgcacggga cacagaatac     3480

cattaacaaa agaacccagc ccacattcgg gtttacagtg aattggaaat tctccgaggg     3540

cggcagcgga agcggatctg gctctggatc tggcaggaca cagcccacct ttggattcac     3600

tgtgaattgg aagttttctg agtctaccac agtgttcact gggcagtgtt tcattgatcg     3660

caatggaaaa gaggtgctga aaactatgtg gctgctgcgc tcaagtgtga atgacatcgg     3720

ggatgattgg aaggcaactc gcgtgggaat caatatcttt acacggctga gaactcagaa     3780

agagggggga agcggaggca gcgcccggaa atgctctctg acagggaagt ggactaatga     3840

tctgggctct aacatgacta ttggagctgt gaatagccgg ggagagttca ccgggactta     3900

tatcactgct gtgactgcca cctcaaatga gatcaaagaa tcccccctgc atggaacaca     3960

gaacactatt aacaagtccg gcggctccac aaccgtgttc acagggcagt gctttattga     4020

ccggaacggc aaagaggtgc tgaaaacaat gtggctgctg cgaagctctg tgaatgatat     4080

tggggacgac tggaaagcaa ctagagtggg gatcaatatt ttcactcgcc tgcggaccca     4140

gaaagaaggc ggaagcggag gatctgccag aaagtgctca ctgacaggca aatggacaaa     4200

tgacctgggg agtaatatga ctattggggc cgtgaacagt cgcggcgagt ttactgggac     4260

ttacattacc gcagtgacag caacatccaa tgagatcaaa gaaagtcctc tgcatggcac     4320

tcagaacaca atcaacaaaa ggacccagcc aacctttggc tttaccgtga attggaagtt     4380

ctctgaaggc ggcggaggat ccggcggagg gggaagtggc gggggaggca gtgggggcgg     4440

aggaagcgct caccacttta gcgagcccga gatcaccctg atcatcttcg gcgtgatggc     4500

cctcgtgatc ggcaccatcc tgctgatctc ttacggcatc agacggctga tcaagaagtc     4560

cccctcaggc ggaggcggct ctaccggttc cggaggcagc ggcttctgct acgagaacga     4620

agtcggcagt ggcaggtcca gattcgtgaa gaaggacggc cactgcaacg tgcagttcat     4680

caacgtcgga agcggcaaga gcagaatcac ctctgagggc gagtacatcc ctctggacca     4740

gatcgatatt aatgtcggtt ccggaggaag ttcctatact tcaaatagaa taggaacttc     4800

cggcgggtca cccgaggatg agaatgctgc tctggaagag aagatcgccc agctgaagca     4860

gaagaacgcc gctctgaaag aagagatcca ggctctggaa tacggaggcg gaggcatgat     4920

gctgaagaag atcctgaaga tcgaagaact ggacgagcgc gagctgatcg acatcgaggt     4980

gtccggcaac cacctgttct acgccaacga tatcctgacc cacaactctg gctaccaggt     5040

gcacagaacc atgcagttcg aggacggcgc cagcctgacc gtgaactaca gatacaccta     5100

cgagggcagc cacatcaagg gcgaggccca agtgaagggc acaggcttcc ctgctgacgg     5160

ccccgtgatg accaactctc tgacagccgc cgactggtgc agaagcaaga aaacctaccc     5220

taacgacaag accatcatca gcaccttcaa gtggtcctac accacaggca acggcaagag     5280

atacagaagc accgccagaa ccacctacac cttcgccaag cccatggccg ccaactacct     5340

gaagaaccag cctatgtacg tgttccgaaa gaccgagctg aagcacagca agacagaact     5400

gaacttcaaa gagtggcaga aagccttcac cgacgtgatg ggcatggacg agctgtacaa     5460

gtccggagct gctccagccg ccaagaagaa gaagctcgac tacaaggacg acgacgataa     5520

gtgaacgcgt aaatgattgc agatccacta gttctagagc tcgctgatca gcctcgactg     5580

tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg     5640

aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga     5700

gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg     5760

aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa     5820

ccagctgggg ctcgagatcc actagttcta gcctcgaggc tagagcggcc gccactggcc     5880

gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta cccaacttaa tcgccttgca     5940

gcacatcccc ctttcgccag ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc     6000

caacagttgc gcagcctgaa tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg     6060

gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg ccagcgccct agcgcccgct     6120

cctttcgctt tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta     6180

aatcgggggc tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa     6240

cttgattagg gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct     6300

ttgacgttgg agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc     6360

aaccctatct cggtctattc ttttgattta taagggattt tgccgatttc ggcctattgg     6420

ttaaaaaatg agctgattta acaaaaattt aacgcgaatt ttaacaaaat attaacgctt     6480

acaatttagg tggcactttt cggggaaatg tgcgcggaac ccctatttgt ttatttttct     6540

aaatacattc aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcaataat     6600

attgaaaaag gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg     6660

cggcattttg ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta aaagatgctg     6720

aagatcagtt gggtgcacga gtgggttaca tcgaactgga tctcaacagc ggtaagatcc     6780

ttgagagttt tcgccccgaa gaacgttttc caatgatgag cacttttaaa gttctgctat     6840

gtggcgcggt attatcccgt attgacgccg ggcaagagca actcggtcgc cgcatacact     6900

attctcagaa tgacttggtt gagtactcac cagtcacaga aaagcatctt acggatggca     6960

tgacagtaag agaattatgc agtgctgcca taaccatgag tgataacact gcggccaact     7020

tacttctgac aacgatcgga ggaccgaagg agctaaccgc ttttttgcac aacatggggg     7080

atcatgtaac tcgccttgat cgttgggaac cggagctgaa tgaagccata ccaaacgacg     7140

agcgtgacac cacgatgcct gtagcaatgg caacaacgtt gcgcaaacta ttaactggcg     7200

aactacttac tctagcttcc cggcaacaat taatagactg gatggaggcg gataaagttg     7260

caggaccact tctgcgctcg gcccttccgg ctggctggtt tattgctgat aaatctggag     7320

ccggtgagcg tgggtctcgc ggtatcattg cagcactggg gccagatggt aagccctccc     7380

gtatcgtagt tatctacacg acggggagtc aggcaactat ggatgaacga aatagacaga     7440

tcgctgagat aggtgcctca ctgattaagc attggtaact gtcagaccaa gtttactcat     7500

atatacttta gattgattta aaacttcatt tttaatttaa aaggatctag gtgaagatcc     7560

tttttgataa tctcatgacc aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag     7620

accccgtaga aaagatcaaa ggatcttctt gagatccttt ttttctgcgc gtaatctgct     7680

gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg tttgccggat caagagctac     7740

caactctttt tccgaaggta actggcttca gcagagcgca gataccaaat actgtccttc     7800

tagtgtagcc gtagttaggc caccacttca agaactctgt agcaccgcct acatacctcg     7860

ctctgctaat cctgttacca gtggctgctg ccagtggcga taagtcgtgt cttaccgggt     7920

tggactcaag acgatagtta ccggataagg cgcagcggtc gggctgaacg gggggttcgt     7980

gcacacagcc cagcttggag cgaacgacct acaccgaact gagataccta cagcgtgagc     8040

tatgagaaag cgccacgctt cccgaaggga gaaaggcgga caggtatccg gtaagcggca     8100

gggtcggaac aggagagcgc acgagggagc ttccaggggg aaacgcctgg tatctttata     8160

gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg     8220

ggcggagcct atggaaaaac gccagcaacg cggccttttt acggttcctg gccttttgct     8280

ggccttttgc tcacatgttc tttcctgcgt tatcccctga ttctgtggat aaccgtatta     8340

ccgcctttga gtgagctgat accgctcgcc gcagccgaac gaccgagcgc agcgagtcag     8400

tgagcgagga agcggaagag cgcccaatac gcaaaccgcc tctccccgcg cgttggccga     8460

ttcattaatg cagctggcac gacaggtttc ccgactggaa agcgggcagt gagcgcaacg     8520

caattaatgt gagttagctc actcattagg caccccaggc tttacacttt atgcttccgg     8580

ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca cacaggaaac agctatgacc     8640

atga                                                                  8644


<210>  57
<211>  7801
<212>  DNA
<213>  Artificial

<220>
<223>  Vector comprising split intein - heterologous polynucleotide 
       construct

<400>  57
ggcgcgccgg attcgacatt gattattgac tagttattaa tagtaatcaa ttacggggtc       60

attagttcat agcccatata tggagttccg cgttacataa cttacggtaa atggcccgcc      120

tggctgaccg cccaacgacc cccgcccatt gacgtcaata atgacgtatg ttcccatagt      180

aacgccaata gggactttcc attgacgtca atgggtggag tatttacggt aaactgccca      240

cttggcagta catcaagtgt atcatatgcc aagtacgccc cctattgacg tcaatgacgg      300

taaatggccc gcctggcatt atgcccagta catgacctta tgggactttc ctacttggca      360

gtacatctac gtattagtca tcgctattac catggtcgag gtgagcccca cgttctgctt      420

cactctcccc atctcccccc cctccccacc cccaattttg tatttattta ttttttaatt      480

attttgtgca gcgatggggg cggggggggg gggggggcgc gcgccaggcg gggcggggcg      540

gggcgagggg cggggcgggg cgaggcggag aggtgcggcg gcagccaatc agagcggcgc      600

gctccgaaag tttcctttta tggcgaggcg gcggcggcgg cggccctata aaaagcgaag      660

cgcgcggcgg gcgggagtcg ctgcgtcgcg ccttcgcccc gtgccccgct ccgccgccgc      720

ctcgcgccgc ccgccccggc tctgactgac cgcgttactc ccacaggtga gcgggcggga      780

cggcccttct cctccgggct gtaattagcg cttggtttaa tgacggctcg tttcttttct      840

gtggctgcgt gaaagcctta aagggctccg ggagggccct ttgtgcgggg gggagcggct      900

cggggggtgc gtgcgtgtgt gtgtgcgtgg ggagcgccgc gtgcggcccg cgctgcccgg      960

cggctgtgag cgctgcgggc gcggcgcggg gctttgtgcg ctccgcgtgt gcgcgagggg     1020

agcgcggccg ggggcggtgc cccgcggtgc gggggggctg cgaggggaac aaaggctgcg     1080

tgcggggtgt gtgcgtgggg gggtgagcag ggggtgtggg cgcggcggtc gggctgtaac     1140

ccccccctgc acccccctcc ccgagttgct gagcacggcc cggcttcggg tgcggggctc     1200

cgtgcggggc gtggcgcggg gctcgccgtg ccgggcgggg ggtggcggca ggtgggggtg     1260

ccgggcgggg cggggccgcc tcgggccggg gagggctcgg gggaggggcg cggcggcccc     1320

ggagcgccgg cggctgtcga ggcgcggcga gccgcagcca ttgcctttta tggtaatcgt     1380

gcgagagggc gcagggactt cctttgtccc aaatctggcg gagccgaaat ctgggaggcg     1440

ccgccgcacc ccctctagcg ggcgcgggcg aagcggtgcg gcgccggcag gaaggaaatg     1500

ggcggggagg gccttcgtgc gtcgccgcgc cgccgtcccc ttctccatct ccagcctcgg     1560

ggctgccgca gggggacggc tgccttcggg ggggacgggg cagggcgggg ttcggcttct     1620

ggcgtgtgac cggcggctct agagcctctg ctaaccatgt tcatgccttc ttctttttcc     1680

tacagatcct taattaataa tacgactcac tataggggcc gccaccatga caccacctaa     1740

gaagaaacgg aaggtcgagg acggcgaggg ccctgctgct aagagagtga aactggactc     1800

cggagtgtcc aagggcgaag aggacaacat ggccagcctg cctgccaccc acgagctgca     1860

catcttcggc agcatcaacg gcgtggactt cgacatggtg ggacagggca ccggcaaccc     1920

caacgacgga tacgaggaac tgaacctgaa gtccaccaag ggggacctcc agttcagccc     1980

ctggattctg gtgccccaca tcggctacgg cttccaccag tacctgccct accctgacgg     2040

catgagccct ttccaggccg ctatggtgga cggctgcctg gaccttaaga cccaggtgca     2100

gaccccccag ggcatgaagg aaatcagcaa catccaagtg ggcgacctgg tgctgagcaa     2160

caccggctac aacgaggtgc tgaacgtgtt ccccaagagc aagaagaagt cctacaagat     2220

caccctggaa gatggcaaag agatcatctg ctccgaggaa cacctgttcc caacccagac     2280

cggcgagatg aacatctctg gcggcctgaa agagggcatg tgcctgtacg tgaaagaagg     2340

cggcggagga cctgaggata agctccaggc cattaagtac gagctggccc agaacgagga     2400

agaactggct cagatcgaag agaagctggc cgccaacaaa gaaggcggat ccggcggagg     2460

cggatctgga accggttttg ctaatgagct gggccccaga ctgatgggca aaggcagcgg     2520

aggaggcgga agcggacctc ctaggaagag atgttgttgc gctagaagag gcacccagct     2580

gatgctcgtg ggcctgctgt ctacagctat gtgggctgga ctgctggctc tgctgctgct     2640

ttggcattgg gagacggaag gtggtggtgg atctggtggc ggaggctctg aaatcggcac     2700

aggcttccct ttcgaccctc actacgtgga agtgctgggc gagagaatgc actatgtgga     2760

tgtgggccct agagatggaa cccctgtgct gtttctgcac ggcaacccta ccagctctta     2820

cgtgtggcgg aacatcatcc ctcacgtggc ccctacacac agagtgatcg cccctgatct     2880

gatcggcatg ggcaagagcg acaagcctga cctgggctac ttcttcgacg accacgtgcg     2940

gttcatggac gccttcatcg aggctctggg actcgaagag gtggtgctgg tcatccacga     3000

ttggggctct gctctgggct tccactgggc caagagaaac cccgaaagag tgaagggaat     3060

cgccttcatg gagttcatca gacccattcc tacctgggac gagtggcccg agttcgccag     3120

agagacattc caggccttca gaacaaccga cgtgggcaga aagctgatca tcgaccagaa     3180

tgtgtttatc gagggcaccc tgcctatggg cgtcgtcaga cctctgaccg aggtggaaat     3240

ggaccactac agagagcctt ttctgaaccc cgtggataga gaacctctgt ggcggttccc     3300

taacgagctg cctattgctg gcgagcccgc taacattgtg gccctggtcg aagagtacat     3360

ggactggctg catcagagcc ccgtgcctaa gctgctgttt tggggaactc ccggcgtgct     3420

gatccctcct gctgaagctg ctagactggc taagagcctg cctaacgcta aggccgtgga     3480

catcggacct ggcctgaatc tgctgcaaga ggataacccc gacctgatcg gctctgagat     3540

cgccagatgg ctgagcacac tggaaatttc tggcggtggt ggcggtagcg gtggcggtgg     3600

aagcgctcac cactttagcg agcccgagat caccctgatc atcttcggcg tgatggccct     3660

cgtgatcggc accatcctgc tgatctctta cggcatcaga cggctgatca agaagtcccc     3720

ctcaggcgga ggcggctcta ccggttccgg aggcagcggc ttctgctacg agaacgaagt     3780

cggcagtggc aggtccagat tcgtgaagaa ggacggccac tgcaacgtgc agttcatcaa     3840

cgtcggaagc ggcaagagca gaatcacctc tgagggcgag tacatccctc tggaccagat     3900

cgatattaat gtcggttccg gaggaagttc ctatacttca aatagaatag gaacttccgg     3960

cgggtcaccc gaggatgaga atgctgctct ggaagagaag atcgcccagc tgaagcagaa     4020

gaacgccgct ctgaaagaag agatccaggc tctggaatac ggaggcggag gcatgatgct     4080

gaagaagatc ctgaagatcg aagaactgga cgagcgcgag ctgatcgaca tcgaggtgtc     4140

cggcaaccac ctgttctacg ccaacgatat cctgacccac aactctggct accaggtgca     4200

cagaaccatg cagttcgagg acggcgccag cctgaccgtg aactacagat acacctacga     4260

gggcagccac atcaagggcg aggcccaagt gaagggcaca ggcttccctg ctgacggccc     4320

cgtgatgacc aactctctga cagccgccga ctggtgcaga agcaagaaaa cctaccctaa     4380

cgacaagacc atcatcagca ccttcaagtg gtcctacacc acaggcaacg gcaagagata     4440

cagaagcacc gccagaacca cctacacctt cgccaagccc atggccgcca actacctgaa     4500

gaaccagcct atgtacgtgt tccgaaagac cgagctgaag cacagcaaga cagaactgaa     4560

cttcaaagag tggcagaaag ccttcaccga cgtgatgggc atggacgagc tgtacaagtc     4620

cggagctgct ccagccgcca agaagaagaa gctcgactac aaggacgacg acgataagtg     4680

aacgcgtaaa tgattgcaga tccactagtt ctagagctcg ctgatcagcc tcgactgtgc     4740

cttctagttg ccagccatct gttgtttgcc cctcccccgt gccttccttg accctggaag     4800

gtgccactcc cactgtcctt tcctaataaa atgaggaaat tgcatcgcat tgtctgagta     4860

ggtgtcattc tattctgggg ggtggggtgg ggcaggacag caagggggag gattgggaag     4920

acaatagcag gcatgctggg gatgcggtgg gctctatggc ttctgaggcg gaaagaacca     4980

gctggggctc gagatccact agttctagcc tcgaggctag agcggccgcc actggccgtc     5040

gttttacaac gtcgtgactg ggaaaaccct ggcgttaccc aacttaatcg ccttgcagca     5100

catccccctt tcgccagctg gcgtaatagc gaagaggccc gcaccgatcg cccttcccaa     5160

cagttgcgca gcctgaatgg cgaatgggac gcgccctgta gcggcgcatt aagcgcggcg     5220

ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca gcgccctagc gcccgctcct     5280

ttcgctttct tcccttcctt tctcgccacg ttcgccggct ttccccgtca agctctaaat     5340

cgggggctcc ctttagggtt ccgatttagt gctttacggc acctcgaccc caaaaaactt     5400

gattagggtg atggttcacg tagtgggcca tcgccctgat agacggtttt tcgccctttg     5460

acgttggagt ccacgttctt taatagtgga ctcttgttcc aaactggaac aacactcaac     5520

cctatctcgg tctattcttt tgatttataa gggattttgc cgatttcggc ctattggtta     5580

aaaaatgagc tgatttaaca aaaatttaac gcgaatttta acaaaatatt aacgcttaca     5640

atttaggtgg cacttttcgg ggaaatgtgc gcggaacccc tatttgttta tttttctaaa     5700

tacattcaaa tatgtatccg ctcatgagac aataaccctg ataaatgctt caataatatt     5760

gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc ccttattccc ttttttgcgg     5820

cattttgcct tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag     5880

atcagttggg tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg     5940

agagttttcg ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt ctgctatgtg     6000

gcgcggtatt atcccgtatt gacgccgggc aagagcaact cggtcgccgc atacactatt     6060

ctcagaatga cttggttgag tactcaccag tcacagaaaa gcatcttacg gatggcatga     6120

cagtaagaga attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac     6180

ttctgacaac gatcggagga ccgaaggagc taaccgcttt tttgcacaac atgggggatc     6240

atgtaactcg ccttgatcgt tgggaaccgg agctgaatga agccatacca aacgacgagc     6300

gtgacaccac gatgcctgta gcaatggcaa caacgttgcg caaactatta actggcgaac     6360

tacttactct agcttcccgg caacaattaa tagactggat ggaggcggat aaagttgcag     6420

gaccacttct gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagccg     6480

gtgagcgtgg gtctcgcggt atcattgcag cactggggcc agatggtaag ccctcccgta     6540

tcgtagttat ctacacgacg gggagtcagg caactatgga tgaacgaaat agacagatcg     6600

ctgagatagg tgcctcactg attaagcatt ggtaactgtc agaccaagtt tactcatata     6660

tactttagat tgatttaaaa cttcattttt aatttaaaag gatctaggtg aagatccttt     6720

ttgataatct catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc     6780

ccgtagaaaa gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct     6840

tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa     6900

ctctttttcc gaaggtaact ggcttcagca gagcgcagat accaaatact gtccttctag     6960

tgtagccgta gttaggccac cacttcaaga actctgtagc accgcctaca tacctcgctc     7020

tgctaatcct gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg     7080

actcaagacg atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca     7140

cacagcccag cttggagcga acgacctaca ccgaactgag atacctacag cgtgagctat     7200

gagaaagcgc cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg     7260

tcggaacagg agagcgcacg agggagcttc cagggggaaa cgcctggtat ctttatagtc     7320

ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc     7380

ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc ttttgctggc     7440

cttttgctca catgttcttt cctgcgttat cccctgattc tgtggataac cgtattaccg     7500

cctttgagtg agctgatacc gctcgccgca gccgaacgac cgagcgcagc gagtcagtga     7560

gcgaggaagc ggaagagcgc ccaatacgca aaccgcctct ccccgcgcgt tggccgattc     7620

attaatgcag ctggcacgac aggtttcccg actggaaagc gggcagtgag cgcaacgcaa     7680

ttaatgtgag ttagctcact cattaggcac cccaggcttt acactttatg cttccggctc     7740

gtatgttgtg tggaattgtg agcggataac aatttcacac aggaaacagc tatgaccatg     7800

a                                                                     7801


<210>  58
<211>  7444
<212>  DNA
<213>  Artificial

<220>
<223>  Vector comprising split intein - heterologous polynucleotide 
       construct

<400>  58
ggcgcgccgg attcgacatt gattattgac tagttattaa tagtaatcaa ttacggggtc       60

attagttcat agcccatata tggagttccg cgttacataa cttacggtaa atggcccgcc      120

tggctgaccg cccaacgacc cccgcccatt gacgtcaata atgacgtatg ttcccatagt      180

aacgccaata gggactttcc attgacgtca atgggtggag tatttacggt aaactgccca      240

cttggcagta catcaagtgt atcatatgcc aagtacgccc cctattgacg tcaatgacgg      300

taaatggccc gcctggcatt atgcccagta catgacctta tgggactttc ctacttggca      360

gtacatctac gtattagtca tcgctattac catggtcgag gtgagcccca cgttctgctt      420

cactctcccc atctcccccc cctccccacc cccaattttg tatttattta ttttttaatt      480

attttgtgca gcgatggggg cggggggggg gggggggcgc gcgccaggcg gggcggggcg      540

gggcgagggg cggggcgggg cgaggcggag aggtgcggcg gcagccaatc agagcggcgc      600

gctccgaaag tttcctttta tggcgaggcg gcggcggcgg cggccctata aaaagcgaag      660

cgcgcggcgg gcgggagtcg ctgcgtcgcg ccttcgcccc gtgccccgct ccgccgccgc      720

ctcgcgccgc ccgccccggc tctgactgac cgcgttactc ccacaggtga gcgggcggga      780

cggcccttct cctccgggct gtaattagcg cttggtttaa tgacggctcg tttcttttct      840

gtggctgcgt gaaagcctta aagggctccg ggagggccct ttgtgcgggg gggagcggct      900

cggggggtgc gtgcgtgtgt gtgtgcgtgg ggagcgccgc gtgcggcccg cgctgcccgg      960

cggctgtgag cgctgcgggc gcggcgcggg gctttgtgcg ctccgcgtgt gcgcgagggg     1020

agcgcggccg ggggcggtgc cccgcggtgc gggggggctg cgaggggaac aaaggctgcg     1080

tgcggggtgt gtgcgtgggg gggtgagcag ggggtgtggg cgcggcggtc gggctgtaac     1140

ccccccctgc acccccctcc ccgagttgct gagcacggcc cggcttcggg tgcggggctc     1200

cgtgcggggc gtggcgcggg gctcgccgtg ccgggcgggg ggtggcggca ggtgggggtg     1260

ccgggcgggg cggggccgcc tcgggccggg gagggctcgg gggaggggcg cggcggcccc     1320

ggagcgccgg cggctgtcga ggcgcggcga gccgcagcca ttgcctttta tggtaatcgt     1380

gcgagagggc gcagggactt cctttgtccc aaatctggcg gagccgaaat ctgggaggcg     1440

ccgccgcacc ccctctagcg ggcgcgggcg aagcggtgcg gcgccggcag gaaggaaatg     1500

ggcggggagg gccttcgtgc gtcgccgcgc cgccgtcccc ttctccatct ccagcctcgg     1560

ggctgccgca gggggacggc tgccttcggg ggggacgggg cagggcgggg ttcggcttct     1620

ggcgtgtgac cggcggctct agagcctctg ctaaccatgt tcatgccttc ttctttttcc     1680

tacagatcct taattaataa tacgactcac tataggggcc gccaccatga caccacctaa     1740

gaagaaacgg aaggtcgagg acggcgaggg ccctgctgct aagagagtga aactggactc     1800

cggagtgtcc aagggcgaag aggacaacat ggccagcctg cctgccaccc acgagctgca     1860

catcttcggc agcatcaacg gcgtggactt cgacatggtg ggacagggca ccggcaaccc     1920

caacgacgga tacgaggaac tgaacctgaa gtccaccaag ggggacctcc agttcagccc     1980

ctggattctg gtgccccaca tcggctacgg cttccaccag tacctgccct accctgacgg     2040

catgagccct ttccaggccg ctatggtgga cggctgcctg gaccttaaga cccaggtgca     2100

gaccccccag ggcatgaagg aaatcagcaa catccaagtg ggcgacctgg tgctgagcaa     2160

caccggctac aacgaggtgc tgaacgtgtt ccccaagagc aagaagaagt cctacaagat     2220

caccctggaa gatggcaaag agatcatctg ctccgaggaa cacctgttcc caacccagac     2280

cggcgagatg aacatctctg gcggcctgaa agagggcatg tgcctgtacg tgaaagaagg     2340

cggcggagga cctgaggata agctccaggc cattaagtac gagctggccc agaacgagga     2400

agaactggct cagatcgaag agaagctggc cgccaacaaa gaaggcggat ccggcggagg     2460

cggatctgga accggttttg ctaatgagct gggccccaga ctgatgggca aaggcagcgg     2520

aggaggcgga agcggacctc ctaggaagag atgttgttgc gctagaagag gcacccagct     2580

gatgctcgtg ggcctgctgt ctacagctat gtgggctgga ctgctggctc tgctgctgct     2640

ttggcattgg gagacggaag gtggtggtgg atctggtacc ggaagcggag tctttacact     2700

ggaagatttc gtcggcgact ggcggcagac agctggctac aatctggacc aggtgctgga     2760

acaaggcggc gtgtcctctc tgtttcagaa cctgggagtg tctgtgaccc ctatccagag     2820

aatcgtgctg agcggcgaga acggcctgaa gatcgacatc cacgtgatca tcccttacga     2880

gggcctgtcc ggcgatcaga tgggacagat cgagaagatc tttaaggtgg tgtaccccgt     2940

ggacgaccac cacttcaaag tgatcctgca ctacggcacc ctggtcatcg atggcgtgac     3000

cccaaacatg atcgactact tcggcagacc ctacgaggga atcgccgtgt tcgacggcaa     3060

gaaaatcacc gtgaccggca cactgtggaa cggcaacaag atcatcgacg agagactgat     3120

caaccccgac ggcagcctgc tgttcagagt gacaatcaac ggcgtgacag gctggcggct     3180

gtgcgaaaga atccttgctg gtaccgacta caaggacgac gacgacaaag gaggtggcgg     3240

tggaagcgct caccacttta gcgagcccga gatcaccctg atcatcttcg gcgtgatggc     3300

cctcgtgatc ggcaccatcc tgctgatctc ttacggcatc agacggctga tcaagaagtc     3360

cccctcaggc ggaggcggct ctaccggttc cggaggcagc ggcttctgct acgagaacga     3420

agtcggcagt ggcaggtcca gattcgtgaa gaaggacggc cactgcaacg tgcagttcat     3480

caacgtcgga agcggcaaga gcagaatcac ctctgagggc gagtacatcc ctctggacca     3540

gatcgatatt aatgtcggtt ccggaggaag ttcctatact tcaaatagaa taggaacttc     3600

cggcgggtca cccgaggatg agaatgctgc tctggaagag aagatcgccc agctgaagca     3660

gaagaacgcc gctctgaaag aagagatcca ggctctggaa tacggaggcg gaggcatgat     3720

gctgaagaag atcctgaaga tcgaagaact ggacgagcgc gagctgatcg acatcgaggt     3780

gtccggcaac cacctgttct acgccaacga tatcctgacc cacaactctg gctaccaggt     3840

gcacagaacc atgcagttcg aggacggcgc cagcctgacc gtgaactaca gatacaccta     3900

cgagggcagc cacatcaagg gcgaggccca agtgaagggc acaggcttcc ctgctgacgg     3960

ccccgtgatg accaactctc tgacagccgc cgactggtgc agaagcaaga aaacctaccc     4020

taacgacaag accatcatca gcaccttcaa gtggtcctac accacaggca acggcaagag     4080

atacagaagc accgccagaa ccacctacac cttcgccaag cccatggccg ccaactacct     4140

gaagaaccag cctatgtacg tgttccgaaa gaccgagctg aagcacagca agacagaact     4200

gaacttcaaa gagtggcaga aagccttcac cgacgtgatg ggcatggacg agctgtacaa     4260

gtccggagct gctccagccg ccaagaagaa gaagctcgac tacaaggacg acgacgataa     4320

gtgaacgcgt aaatgattgc agatccacta gttctagagc tcgctgatca gcctcgactg     4380

tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg     4440

aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga     4500

gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg     4560

aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa     4620

ccagctgggg ctcgagatcc actagttcta gcctcgaggc tagagcggcc gccactggcc     4680

gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta cccaacttaa tcgccttgca     4740

gcacatcccc ctttcgccag ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc     4800

caacagttgc gcagcctgaa tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg     4860

gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg ccagcgccct agcgcccgct     4920

cctttcgctt tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta     4980

aatcgggggc tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa     5040

cttgattagg gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct     5100

ttgacgttgg agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc     5160

aaccctatct cggtctattc ttttgattta taagggattt tgccgatttc ggcctattgg     5220

ttaaaaaatg agctgattta acaaaaattt aacgcgaatt ttaacaaaat attaacgctt     5280

acaatttagg tggcactttt cggggaaatg tgcgcggaac ccctatttgt ttatttttct     5340

aaatacattc aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcaataat     5400

attgaaaaag gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg     5460

cggcattttg ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta aaagatgctg     5520

aagatcagtt gggtgcacga gtgggttaca tcgaactgga tctcaacagc ggtaagatcc     5580

ttgagagttt tcgccccgaa gaacgttttc caatgatgag cacttttaaa gttctgctat     5640

gtggcgcggt attatcccgt attgacgccg ggcaagagca actcggtcgc cgcatacact     5700

attctcagaa tgacttggtt gagtactcac cagtcacaga aaagcatctt acggatggca     5760

tgacagtaag agaattatgc agtgctgcca taaccatgag tgataacact gcggccaact     5820

tacttctgac aacgatcgga ggaccgaagg agctaaccgc ttttttgcac aacatggggg     5880

atcatgtaac tcgccttgat cgttgggaac cggagctgaa tgaagccata ccaaacgacg     5940

agcgtgacac cacgatgcct gtagcaatgg caacaacgtt gcgcaaacta ttaactggcg     6000

aactacttac tctagcttcc cggcaacaat taatagactg gatggaggcg gataaagttg     6060

caggaccact tctgcgctcg gcccttccgg ctggctggtt tattgctgat aaatctggag     6120

ccggtgagcg tgggtctcgc ggtatcattg cagcactggg gccagatggt aagccctccc     6180

gtatcgtagt tatctacacg acggggagtc aggcaactat ggatgaacga aatagacaga     6240

tcgctgagat aggtgcctca ctgattaagc attggtaact gtcagaccaa gtttactcat     6300

atatacttta gattgattta aaacttcatt tttaatttaa aaggatctag gtgaagatcc     6360

tttttgataa tctcatgacc aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag     6420

accccgtaga aaagatcaaa ggatcttctt gagatccttt ttttctgcgc gtaatctgct     6480

gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg tttgccggat caagagctac     6540

caactctttt tccgaaggta actggcttca gcagagcgca gataccaaat actgtccttc     6600

tagtgtagcc gtagttaggc caccacttca agaactctgt agcaccgcct acatacctcg     6660

ctctgctaat cctgttacca gtggctgctg ccagtggcga taagtcgtgt cttaccgggt     6720

tggactcaag acgatagtta ccggataagg cgcagcggtc gggctgaacg gggggttcgt     6780

gcacacagcc cagcttggag cgaacgacct acaccgaact gagataccta cagcgtgagc     6840

tatgagaaag cgccacgctt cccgaaggga gaaaggcgga caggtatccg gtaagcggca     6900

gggtcggaac aggagagcgc acgagggagc ttccaggggg aaacgcctgg tatctttata     6960

gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg     7020

ggcggagcct atggaaaaac gccagcaacg cggccttttt acggttcctg gccttttgct     7080

ggccttttgc tcacatgttc tttcctgcgt tatcccctga ttctgtggat aaccgtatta     7140

ccgcctttga gtgagctgat accgctcgcc gcagccgaac gaccgagcgc agcgagtcag     7200

tgagcgagga agcggaagag cgcccaatac gcaaaccgcc tctccccgcg cgttggccga     7260

ttcattaatg cagctggcac gacaggtttc ccgactggaa agcgggcagt gagcgcaacg     7320

caattaatgt gagttagctc actcattagg caccccaggc tttacacttt atgcttccgg     7380

ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca cacaggaaac agctatgacc     7440

atga                                                                  7444


<210>  59
<211>  7504
<212>  DNA
<213>  Artificial

<220>
<223>  Vector comprising split intein - heterologous polynucleotide 
       construct

<400>  59
ggcgcgccgg attcgacatt gattattgac tagttattaa tagtaatcaa ttacggggtc       60

attagttcat agcccatata tggagttccg cgttacataa cttacggtaa atggcccgcc      120

tggctgaccg cccaacgacc cccgcccatt gacgtcaata atgacgtatg ttcccatagt      180

aacgccaata gggactttcc attgacgtca atgggtggag tatttacggt aaactgccca      240

cttggcagta catcaagtgt atcatatgcc aagtacgccc cctattgacg tcaatgacgg      300

taaatggccc gcctggcatt atgcccagta catgacctta tgggactttc ctacttggca      360

gtacatctac gtattagtca tcgctattac catggtcgag gtgagcccca cgttctgctt      420

cactctcccc atctcccccc cctccccacc cccaattttg tatttattta ttttttaatt      480

attttgtgca gcgatggggg cggggggggg gggggggcgc gcgccaggcg gggcggggcg      540

gggcgagggg cggggcgggg cgaggcggag aggtgcggcg gcagccaatc agagcggcgc      600

gctccgaaag tttcctttta tggcgaggcg gcggcggcgg cggccctata aaaagcgaag      660

cgcgcggcgg gcgggagtcg ctgcgtcgcg ccttcgcccc gtgccccgct ccgccgccgc      720

ctcgcgccgc ccgccccggc tctgactgac cgcgttactc ccacaggtga gcgggcggga      780

cggcccttct cctccgggct gtaattagcg cttggtttaa tgacggctcg tttcttttct      840

gtggctgcgt gaaagcctta aagggctccg ggagggccct ttgtgcgggg gggagcggct      900

cggggggtgc gtgcgtgtgt gtgtgcgtgg ggagcgccgc gtgcggcccg cgctgcccgg      960

cggctgtgag cgctgcgggc gcggcgcggg gctttgtgcg ctccgcgtgt gcgcgagggg     1020

agcgcggccg ggggcggtgc cccgcggtgc gggggggctg cgaggggaac aaaggctgcg     1080

tgcggggtgt gtgcgtgggg gggtgagcag ggggtgtggg cgcggcggtc gggctgtaac     1140

ccccccctgc acccccctcc ccgagttgct gagcacggcc cggcttcggg tgcggggctc     1200

cgtgcggggc gtggcgcggg gctcgccgtg ccgggcgggg ggtggcggca ggtgggggtg     1260

ccgggcgggg cggggccgcc tcgggccggg gagggctcgg gggaggggcg cggcggcccc     1320

ggagcgccgg cggctgtcga ggcgcggcga gccgcagcca ttgcctttta tggtaatcgt     1380

gcgagagggc gcagggactt cctttgtccc aaatctggcg gagccgaaat ctgggaggcg     1440

ccgccgcacc ccctctagcg ggcgcgggcg aagcggtgcg gcgccggcag gaaggaaatg     1500

ggcggggagg gccttcgtgc gtcgccgcgc cgccgtcccc ttctccatct ccagcctcgg     1560

ggctgccgca gggggacggc tgccttcggg ggggacgggg cagggcgggg ttcggcttct     1620

ggcgtgtgac cggcggctct agagcctctg ctaaccatgt tcatgccttc ttctttttcc     1680

tacagatcct taattaataa tacgactcac tataggggcc gccaccatga caccacctaa     1740

gaagaaacgg aaggtcgagg acggcgaggg ccctgctgct aagagagtga aactggactc     1800

cggagtgtcc aagggcgaag aggacaacat ggccagcctg cctgccaccc acgagctgca     1860

catcttcggc agcatcaacg gcgtggactt cgacatggtg ggacagggca ccggcaaccc     1920

caacgacgga tacgaggaac tgaacctgaa gtccaccaag ggggacctcc agttcagccc     1980

ctggattctg gtgccccaca tcggctacgg cttccaccag tacctgccct accctgacgg     2040

catgagccct ttccaggccg ctatggtgga cggctgcctg gaccttaaga cccaggtgca     2100

gaccccccag ggcatgaagg aaatcagcaa catccaagtg ggcgacctgg tgctgagcaa     2160

caccggctac aacgaggtgc tgaacgtgtt ccccaagagc aagaagaagt cctacaagat     2220

caccctggaa gatggcaaag agatcatctg ctccgaggaa cacctgttcc caacccagac     2280

cggcgagatg aacatctctg gcggcctgaa agagggcatg tgcctgtacg tgaaagaagg     2340

cggcggagga cctgaggata agctccaggc cattaagtac gagctggccc agaacgagga     2400

agaactggct cagatcgaag agaagctggc cgccaacaaa gaaggcggat ccggcggagg     2460

cggatctgga accggttttg ctaatgagct gggccccaga ctgatgggca aaggcagcgg     2520

aggaggcgga agcggacctc ctaggaagag atgttgttgc gctagaagag gcacccagct     2580

gatgctcgtg ggcctgctgt ctacagctat gtgggctgga ctgctggctc tgctgctgct     2640

ttggcattgg gagacggaag gtggtggtgg atctcgccgc agaagaagaa agagaagcgc     2700

cagaggtacc ggaagcggag tctttacact ggaagatttc gtcggcgact ggcggcagac     2760

agctggctac aatctggacc aggtgctgga acaaggcggc gtgtcctctc tgtttcagaa     2820

cctgggagtg tctgtgaccc ctatccagag aatcgtgctg agcggcgaga acggcctgaa     2880

gatcgacatc cacgtgatca tcccttacga gggcctgtcc ggcgatcaga tgggacagat     2940

cgagaagatc tttaaggtgg tgtaccccgt ggacgaccac cacttcaaag tgatcctgca     3000

ctacggcacc ctggtcatcg atggcgtgac cccaaacatg atcgactact tcggcagacc     3060

ctacgaggga atcgccgtgt tcgacggcaa gaaaatcacc gtgaccggca cactgtggaa     3120

cggcaacaag atcatcgacg agagactgat caaccccgac ggcagcctgc tgttcagagt     3180

gacaatcaac ggcgtgacag gctggcggct gtgcgaaaga atccttgctg gtaccgacta     3240

caaggacgac gacgacaaag gacgcaggcg gagaagaaaa agatccgctc gcggtggcgg     3300

tggaagcgct caccacttta gcgagcccga gatcaccctg atcatcttcg gcgtgatggc     3360

cctcgtgatc ggcaccatcc tgctgatctc ttacggcatc agacggctga tcaagaagtc     3420

cccctcaggc ggaggcggct ctaccggttc cggaggcagc ggcttctgct acgagaacga     3480

agtcggcagt ggcaggtcca gattcgtgaa gaaggacggc cactgcaacg tgcagttcat     3540

caacgtcgga agcggcaaga gcagaatcac ctctgagggc gagtacatcc ctctggacca     3600

gatcgatatt aatgtcggtt ccggaggaag ttcctatact tcaaatagaa taggaacttc     3660

cggcgggtca cccgaggatg agaatgctgc tctggaagag aagatcgccc agctgaagca     3720

gaagaacgcc gctctgaaag aagagatcca ggctctggaa tacggaggcg gaggcatgat     3780

gctgaagaag atcctgaaga tcgaagaact ggacgagcgc gagctgatcg acatcgaggt     3840

gtccggcaac cacctgttct acgccaacga tatcctgacc cacaactctg gctaccaggt     3900

gcacagaacc atgcagttcg aggacggcgc cagcctgacc gtgaactaca gatacaccta     3960

cgagggcagc cacatcaagg gcgaggccca agtgaagggc acaggcttcc ctgctgacgg     4020

ccccgtgatg accaactctc tgacagccgc cgactggtgc agaagcaaga aaacctaccc     4080

taacgacaag accatcatca gcaccttcaa gtggtcctac accacaggca acggcaagag     4140

atacagaagc accgccagaa ccacctacac cttcgccaag cccatggccg ccaactacct     4200

gaagaaccag cctatgtacg tgttccgaaa gaccgagctg aagcacagca agacagaact     4260

gaacttcaaa gagtggcaga aagccttcac cgacgtgatg ggcatggacg agctgtacaa     4320

gtccggagct gctccagccg ccaagaagaa gaagctcgac tacaaggacg acgacgataa     4380

gtgaacgcgt aaatgattgc agatccacta gttctagagc tcgctgatca gcctcgactg     4440

tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg     4500

aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga     4560

gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg     4620

aagacaatag caggcatgct ggggatgcgg tgggctctat ggcttctgag gcggaaagaa     4680

ccagctgggg ctcgagatcc actagttcta gcctcgaggc tagagcggcc gccactggcc     4740

gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta cccaacttaa tcgccttgca     4800

gcacatcccc ctttcgccag ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc     4860

caacagttgc gcagcctgaa tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg     4920

gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg ccagcgccct agcgcccgct     4980

cctttcgctt tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta     5040

aatcgggggc tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa     5100

cttgattagg gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct     5160

ttgacgttgg agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc     5220

aaccctatct cggtctattc ttttgattta taagggattt tgccgatttc ggcctattgg     5280

ttaaaaaatg agctgattta acaaaaattt aacgcgaatt ttaacaaaat attaacgctt     5340

acaatttagg tggcactttt cggggaaatg tgcgcggaac ccctatttgt ttatttttct     5400

aaatacattc aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcaataat     5460

attgaaaaag gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg     5520

cggcattttg ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta aaagatgctg     5580

aagatcagtt gggtgcacga gtgggttaca tcgaactgga tctcaacagc ggtaagatcc     5640

ttgagagttt tcgccccgaa gaacgttttc caatgatgag cacttttaaa gttctgctat     5700

gtggcgcggt attatcccgt attgacgccg ggcaagagca actcggtcgc cgcatacact     5760

attctcagaa tgacttggtt gagtactcac cagtcacaga aaagcatctt acggatggca     5820

tgacagtaag agaattatgc agtgctgcca taaccatgag tgataacact gcggccaact     5880

tacttctgac aacgatcgga ggaccgaagg agctaaccgc ttttttgcac aacatggggg     5940

atcatgtaac tcgccttgat cgttgggaac cggagctgaa tgaagccata ccaaacgacg     6000

agcgtgacac cacgatgcct gtagcaatgg caacaacgtt gcgcaaacta ttaactggcg     6060

aactacttac tctagcttcc cggcaacaat taatagactg gatggaggcg gataaagttg     6120

caggaccact tctgcgctcg gcccttccgg ctggctggtt tattgctgat aaatctggag     6180

ccggtgagcg tgggtctcgc ggtatcattg cagcactggg gccagatggt aagccctccc     6240

gtatcgtagt tatctacacg acggggagtc aggcaactat ggatgaacga aatagacaga     6300

tcgctgagat aggtgcctca ctgattaagc attggtaact gtcagaccaa gtttactcat     6360

atatacttta gattgattta aaacttcatt tttaatttaa aaggatctag gtgaagatcc     6420

tttttgataa tctcatgacc aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag     6480

accccgtaga aaagatcaaa ggatcttctt gagatccttt ttttctgcgc gtaatctgct     6540

gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg tttgccggat caagagctac     6600

caactctttt tccgaaggta actggcttca gcagagcgca gataccaaat actgtccttc     6660

tagtgtagcc gtagttaggc caccacttca agaactctgt agcaccgcct acatacctcg     6720

ctctgctaat cctgttacca gtggctgctg ccagtggcga taagtcgtgt cttaccgggt     6780

tggactcaag acgatagtta ccggataagg cgcagcggtc gggctgaacg gggggttcgt     6840

gcacacagcc cagcttggag cgaacgacct acaccgaact gagataccta cagcgtgagc     6900

tatgagaaag cgccacgctt cccgaaggga gaaaggcgga caggtatccg gtaagcggca     6960

gggtcggaac aggagagcgc acgagggagc ttccaggggg aaacgcctgg tatctttata     7020

gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg     7080

ggcggagcct atggaaaaac gccagcaacg cggccttttt acggttcctg gccttttgct     7140

ggccttttgc tcacatgttc tttcctgcgt tatcccctga ttctgtggat aaccgtatta     7200

ccgcctttga gtgagctgat accgctcgcc gcagccgaac gaccgagcgc agcgagtcag     7260

tgagcgagga agcggaagag cgcccaatac gcaaaccgcc tctccccgcg cgttggccga     7320

ttcattaatg cagctggcac gacaggtttc ccgactggaa agcgggcagt gagcgcaacg     7380

caattaatgt gagttagctc actcattagg caccccaggc tttacacttt atgcttccgg     7440

ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca cacaggaaac agctatgacc     7500

atga                                                                  7504


<210>  60
<211>  6748
<212>  DNA
<213>  Artificial

<220>
<223>  Vector comprising split intein - heterologous polynucleotide 
       construct

<400>  60
ggcgcgccgg attcgacatt gattattgac tagttattaa tagtaatcaa ttacggggtc       60

attagttcat agcccatata tggagttccg cgttacataa cttacggtaa atggcccgcc      120

tggctgaccg cccaacgacc cccgcccatt gacgtcaata atgacgtatg ttcccatagt      180

aacgccaata gggactttcc attgacgtca atgggtggag tatttacggt aaactgccca      240

cttggcagta catcaagtgt atcatatgcc aagtacgccc cctattgacg tcaatgacgg      300

taaatggccc gcctggcatt atgcccagta catgacctta tgggactttc ctacttggca      360

gtacatctac gtattagtca tcgctattac catggtcgag gtgagcccca cgttctgctt      420

cactctcccc atctcccccc cctccccacc cccaattttg tatttattta ttttttaatt      480

attttgtgca gcgatggggg cggggggggg gggggggcgc gcgccaggcg gggcggggcg      540

gggcgagggg cggggcgggg cgaggcggag aggtgcggcg gcagccaatc agagcggcgc      600

gctccgaaag tttcctttta tggcgaggcg gcggcggcgg cggccctata aaaagcgaag      660

cgcgcggcgg gcgggagtcg ctgcgtcgcg ccttcgcccc gtgccccgct ccgccgccgc      720

ctcgcgccgc ccgccccggc tctgactgac cgcgttactc ccacaggtga gcgggcggga      780

cggcccttct cctccgggct gtaattagcg cttggtttaa tgacggctcg tttcttttct      840

gtggctgcgt gaaagcctta aagggctccg ggagggccct ttgtgcgggg gggagcggct      900

cggggggtgc gtgcgtgtgt gtgtgcgtgg ggagcgccgc gtgcggcccg cgctgcccgg      960

cggctgtgag cgctgcgggc gcggcgcggg gctttgtgcg ctccgcgtgt gcgcgagggg     1020

agcgcggccg ggggcggtgc cccgcggtgc gggggggctg cgaggggaac aaaggctgcg     1080

tgcggggtgt gtgcgtgggg gggtgagcag ggggtgtggg cgcggcggtc gggctgtaac     1140

ccccccctgc acccccctcc ccgagttgct gagcacggcc cggcttcggg tgcggggctc     1200

cgtgcggggc gtggcgcggg gctcgccgtg ccgggcgggg ggtggcggca ggtgggggtg     1260

ccgggcgggg cggggccgcc tcgggccggg gagggctcgg gggaggggcg cggcggcccc     1320

ggagcgccgg cggctgtcga ggcgcggcga gccgcagcca ttgcctttta tggtaatcgt     1380

gcgagagggc gcagggactt cctttgtccc aaatctggcg gagccgaaat ctgggaggcg     1440

ccgccgcacc ccctctagcg ggcgcgggcg aagcggtgcg gcgccggcag gaaggaaatg     1500

ggcggggagg gccttcgtgc gtcgccgcgc cgccgtcccc ttctccatct ccagcctcgg     1560

ggctgccgca gggggacggc tgccttcggg ggggacgggg cagggcgggg ttcggcttct     1620

ggcgtgtgac cggcggctct agagcctctg ctaaccatgt tcatgccttc ttctttttcc     1680

tacagatcct taattaataa tacgactcac tataggggcc gccaccatga caccacctaa     1740

gaagaaacgg aaggtcgagg acggcgaggg ccctgctgct aagagagtga aactggactc     1800

cggagtgtcc aagggcgaag aggacaacat ggccagcctg cctgccaccc acgagctgca     1860

catcttcggc agcatcaacg gcgtggactt cgacatggtg ggacagggca ccggcaaccc     1920

caacgacgga tacgaggaac tgaacctgaa gtccaccaag ggggacctcc agttcagccc     1980

ctggattctg gtgccccaca tcggctacgg cttccaccag tacctgccct accctgacgg     2040

catgagccct ttccaggccg ctatggtgga cggctgcctg gaccttaaga cccaggtgca     2100

gaccccccag ggcatgaagg aaatcagcaa catccaagtg ggcgacctgg tgctgagcaa     2160

caccggctac aacgaggtgc tgaacgtgtt ccccaagagc aagaagaagt cctacaagat     2220

caccctggaa gatggcaaag agatcatctg ctccgaggaa cacctgttcc caacccagac     2280

cggcgagatg aacatctctg gcggcctgaa agagggcatg tgcctgtacg tgaaagaagg     2340

cggcggagga ggcggatccg gcggaggcgg atctggaacc ggttttgcta atgagctggg     2400

ccccagactg atgggcaaag gcagcggagg aggcggaagc ggagtcttta cactggaaga     2460

tttcgtcggc gactggcggc agacagctgg ctacaatctg gaccaggtgc tggaacaagg     2520

cggcgtgtcc tctctgtttc agaacctggg agtgtctgtg acccctatcc agagaatcgt     2580

gctgagcggc gagaacggcc tgaagatcga catccacgtg atcatccctt acgagggcct     2640

gtccggcgat cagatgggac agatcgagaa gatctttaag gtggtgtacc ccgtggacga     2700

ccaccacttc aaagtgatcc tgcactacgg caccctggtc atcgatggcg tgaccccaaa     2760

catgatcgac tacttcggca gaccctacga gggaatcgcc gtgttcgacg gcaagaaaat     2820

caccgtgacc ggcacactgt ggaacggcaa caagatcatc gacgagagac tgatcaaccc     2880

cgacggcagc ctgctgttca gagtgacaat caacggcgtg acaggctggc ggctgtgcga     2940

aagaatcctt gctggttccg gaggaagttc ctatacttca aatagaatag gaacttccgg     3000

cgggtcagga ggcggaggca tgatgctgaa gaagatcctg aagatcgaag aactggacga     3060

gcgcgagctg atcgacatcg aggtgtccgg caaccacctg ttctacgcca acgatatcct     3120

gacccacaac tctggctacc aggtgcacag aaccatgcag ttcgaggacg gcgccagcct     3180

gaccgtgaac tacagataca cctacgaggg cagccacatc aagggcgagg cccaagtgaa     3240

gggcacaggc ttccctgctg acggccccgt gatgaccaac tctctgacag ccgccgactg     3300

gtgcagaagc aagaaaacct accctaacga caagaccatc atcagcacct tcaagtggtc     3360

ctacaccaca ggcaacggca agagatacag aagcaccgcc agaaccacct acaccttcgc     3420

caagcccatg gccgccaact acctgaagaa ccagcctatg tacgtgttcc gaaagaccga     3480

gctgaagcac agcaagacag aactgaactt caaagagtgg cagaaagcct tcaccgacgt     3540

gatgggcatg gacgagctgt acaagtccgg agctgctcca gccgccaaga agaagaagct     3600

cgactacaag gacgacgacg ataagtgaac gcgtaaatga ttgcagatcc actagttcta     3660

gagctcgctg atcagcctcg actgtgcctt ctagttgcca gccatctgtt gtttgcccct     3720

cccccgtgcc ttccttgacc ctggaaggtg ccactcccac tgtcctttcc taataaaatg     3780

aggaaattgc atcgcattgt ctgagtaggt gtcattctat tctggggggt ggggtggggc     3840

aggacagcaa gggggaggat tgggaagaca atagcaggca tgctggggat gcggtgggct     3900

ctatggcttc tgaggcggaa agaaccagct ggggctcgag atccactagt tctagcctcg     3960

aggctagagc ggccgccact ggccgtcgtt ttacaacgtc gtgactggga aaaccctggc     4020

gttacccaac ttaatcgcct tgcagcacat ccccctttcg ccagctggcg taatagcgaa     4080

gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga atgggacgcg     4140

ccctgtagcg gcgcattaag cgcggcgggt gtggtggtta cgcgcagcgt gaccgctaca     4200

cttgccagcg ccctagcgcc cgctcctttc gctttcttcc cttcctttct cgccacgttc     4260

gccggctttc cccgtcaagc tctaaatcgg gggctccctt tagggttccg atttagtgct     4320

ttacggcacc tcgaccccaa aaaacttgat tagggtgatg gttcacgtag tgggccatcg     4380

ccctgataga cggtttttcg ccctttgacg ttggagtcca cgttctttaa tagtggactc     4440

ttgttccaaa ctggaacaac actcaaccct atctcggtct attcttttga tttataaggg     4500

attttgccga tttcggccta ttggttaaaa aatgagctga tttaacaaaa atttaacgcg     4560

aattttaaca aaatattaac gcttacaatt taggtggcac ttttcgggga aatgtgcgcg     4620

gaacccctat ttgtttattt ttctaaatac attcaaatat gtatccgctc atgagacaat     4680

aaccctgata aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc     4740

gtgtcgccct tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa     4800

cgctggtgaa agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac     4860

tggatctcaa cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga     4920

tgagcacttt taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag     4980

agcaactcgg tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca     5040

cagaaaagca tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca     5100

tgagtgataa cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa     5160

ccgctttttt gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc     5220

tgaatgaagc cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa     5280

cgttgcgcaa actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag     5340

actggatgga ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct     5400

ggtttattgc tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac     5460

tggggccaga tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa     5520

ctatggatga acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt     5580

aactgtcaga ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat     5640

ttaaaaggat ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg     5700

agttttcgtt ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc     5760

ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg     5820

tttgtttgcc ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag     5880

cgcagatacc aaatactgtc cttctagtgt agccgtagtt aggccaccac ttcaagaact     5940

ctgtagcacc gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg     6000

gcgataagtc gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc     6060

ggtcgggctg aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg     6120

aactgagata cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggagaaagg     6180

cggacaggta tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag     6240

ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc     6300

gatttttgtg atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct     6360

ttttacggtt cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc     6420

ctgattctgt ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc     6480

gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac     6540

cgcctctccc cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg tttcccgact     6600

ggaaagcggg cagtgagcgc aacgcaatta atgtgagtta gctcactcat taggcacccc     6660

aggctttaca ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc ggataacaat     6720

ttcacacagg aaacagctat gaccatga                                        6748


<210>  61
<211>  6934
<212>  DNA
<213>  Artificial

<220>
<223>  Vector comprising split intein - heterologous polynucleotide 
       construct

<400>  61
ggcgcgccgg attcgacatt gattattgac tagttattaa tagtaatcaa ttacggggtc       60

attagttcat agcccatata tggagttccg cgttacataa cttacggtaa atggcccgcc      120

tggctgaccg cccaacgacc cccgcccatt gacgtcaata atgacgtatg ttcccatagt      180

aacgccaata gggactttcc attgacgtca atgggtggag tatttacggt aaactgccca      240

cttggcagta catcaagtgt atcatatgcc aagtacgccc cctattgacg tcaatgacgg      300

taaatggccc gcctggcatt atgcccagta catgacctta tgggactttc ctacttggca      360

gtacatctac gtattagtca tcgctattac catggtcgag gtgagcccca cgttctgctt      420

cactctcccc atctcccccc cctccccacc cccaattttg tatttattta ttttttaatt      480

attttgtgca gcgatggggg cggggggggg gggggggcgc gcgccaggcg gggcggggcg      540

gggcgagggg cggggcgggg cgaggcggag aggtgcggcg gcagccaatc agagcggcgc      600

gctccgaaag tttcctttta tggcgaggcg gcggcggcgg cggccctata aaaagcgaag      660

cgcgcggcgg gcgggagtcg ctgcgtcgcg ccttcgcccc gtgccccgct ccgccgccgc      720

ctcgcgccgc ccgccccggc tctgactgac cgcgttactc ccacaggtga gcgggcggga      780

cggcccttct cctccgggct gtaattagcg cttggtttaa tgacggctcg tttcttttct      840

gtggctgcgt gaaagcctta aagggctccg ggagggccct ttgtgcgggg gggagcggct      900

cggggggtgc gtgcgtgtgt gtgtgcgtgg ggagcgccgc gtgcggcccg cgctgcccgg      960

cggctgtgag cgctgcgggc gcggcgcggg gctttgtgcg ctccgcgtgt gcgcgagggg     1020

agcgcggccg ggggcggtgc cccgcggtgc gggggggctg cgaggggaac aaaggctgcg     1080

tgcggggtgt gtgcgtgggg gggtgagcag ggggtgtggg cgcggcggtc gggctgtaac     1140

ccccccctgc acccccctcc ccgagttgct gagcacggcc cggcttcggg tgcggggctc     1200

cgtgcggggc gtggcgcggg gctcgccgtg ccgggcgggg ggtggcggca ggtgggggtg     1260

ccgggcgggg cggggccgcc tcgggccggg gagggctcgg gggaggggcg cggcggcccc     1320

ggagcgccgg cggctgtcga ggcgcggcga gccgcagcca ttgcctttta tggtaatcgt     1380

gcgagagggc gcagggactt cctttgtccc aaatctggcg gagccgaaat ctgggaggcg     1440

ccgccgcacc ccctctagcg ggcgcgggcg aagcggtgcg gcgccggcag gaaggaaatg     1500

ggcggggagg gccttcgtgc gtcgccgcgc cgccgtcccc ttctccatct ccagcctcgg     1560

ggctgccgca gggggacggc tgccttcggg ggggacgggg cagggcgggg ttcggcttct     1620

ggcgtgtgac cggcggctct agagcctctg ctaaccatgt tcatgccttc ttctttttcc     1680

tacagatcct taattaataa tacgactcac tataggggcc gccaccatga caccacctaa     1740

gaagaaacgg aaggtcgagg acggcgaggg ccctgctgct aagagagtga aactggactc     1800

cggagtgtcc aagggcgaag aggacaacat ggccagcctg cctgccaccc acgagctgca     1860

catcttcggc agcatcaacg gcgtggactt cgacatggtg ggacagggca ccggcaaccc     1920

caacgacgga tacgaggaac tgaacctgaa gtccaccaag ggggacctcc agttcagccc     1980

ctggattctg gtgccccaca tcggctacgg cttccaccag tacctgccct accctgacgg     2040

catgagccct ttccaggccg ctatggtgga cggctgcctg gaccttaaga cccaggtgca     2100

gaccccccag ggcatgaagg aaatcagcaa catccaagtg ggcgacctgg tgctgagcaa     2160

caccggctac aacgaggtgc tgaacgtgtt ccccaagagc aagaagaagt cctacaagat     2220

caccctggaa gatggcaaag agatcatctg ctccgaggaa cacctgttcc caacccagac     2280

cggcgagatg aacatctctg gcggcctgaa agagggcatg tgcctgtacg tgaaagaagg     2340

cggcggagga cctgaggata agctccaggc cattaagtac gagctggccc agaacgagga     2400

agaactggct cagatcgaag agaagctggc cgccaacaaa gaaggcggat ccggcggagg     2460

cggatctgga accggttttg ctaatgagct gggccccaga ctgatgggca aaggcagcgg     2520

aggaggcgga agcggagtct ttacactgga agatttcgtc ggcgactggc ggcagacagc     2580

tggctacaat ctggaccagg tgctggaaca aggcggcgtg tcctctctgt ttcagaacct     2640

gggagtgtct gtgaccccta tccagagaat cgtgctgagc ggcgagaacg gcctgaagat     2700

cgacatccac gtgatcatcc cttacgaggg cctgtccggc gatcagatgg gacagatcga     2760

gaagatcttt aaggtggtgt accccgtgga cgaccaccac ttcaaagtga tcctgcacta     2820

cggcaccctg gtcatcgatg gcgtgacccc aaacatgatc gactacttcg gcagacccta     2880

cgagggaatc gccgtgttcg acggcaagaa aatcaccgtg accggcacac tgtggaacgg     2940

caacaagatc atcgacgaga gactgatcaa ccccgacggc agcctgctgt tcagagtgac     3000

aatcaacggc gtgacaggct ggcggctgtg cgaaagaatc cttgctggtt ccggaggaag     3060

ttcctatact tcaaatagaa taggaacttc cggcgggtca cccgaggatg agaatgctgc     3120

tctggaagag aagatcgccc agctgaagca gaagaacgcc gctctgaaag aagagatcca     3180

ggctctggaa tacggaggcg gaggcatgat gctgaagaag atcctgaaga tcgaagaact     3240

ggacgagcgc gagctgatcg acatcgaggt gtccggcaac cacctgttct acgccaacga     3300

tatcctgacc cacaactctg gctaccaggt gcacagaacc atgcagttcg aggacggcgc     3360

cagcctgacc gtgaactaca gatacaccta cgagggcagc cacatcaagg gcgaggccca     3420

agtgaagggc acaggcttcc ctgctgacgg ccccgtgatg accaactctc tgacagccgc     3480

cgactggtgc agaagcaaga aaacctaccc taacgacaag accatcatca gcaccttcaa     3540

gtggtcctac accacaggca acggcaagag atacagaagc accgccagaa ccacctacac     3600

cttcgccaag cccatggccg ccaactacct gaagaaccag cctatgtacg tgttccgaaa     3660

gaccgagctg aagcacagca agacagaact gaacttcaaa gagtggcaga aagccttcac     3720

cgacgtgatg ggcatggacg agctgtacaa gtccggagct gctccagccg ccaagaagaa     3780

gaagctcgac tacaaggacg acgacgataa gtgaacgcgt aaatgattgc agatccacta     3840

gttctagagc tcgctgatca gcctcgactg tgccttctag ttgccagcca tctgttgttt     3900

gcccctcccc cgtgccttcc ttgaccctgg aaggtgccac tcccactgtc ctttcctaat     3960

aaaatgagga aattgcatcg cattgtctga gtaggtgtca ttctattctg gggggtgggg     4020

tggggcagga cagcaagggg gaggattggg aagacaatag caggcatgct ggggatgcgg     4080

tgggctctat ggcttctgag gcggaaagaa ccagctgggg ctcgagatcc actagttcta     4140

gcctcgaggc tagagcggcc gccactggcc gtcgttttac aacgtcgtga ctgggaaaac     4200

cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat     4260

agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg     4320

gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc     4380

gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc     4440

acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt     4500

agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg     4560

ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt     4620

ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta     4680

taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt     4740

aacgcgaatt ttaacaaaat attaacgctt acaatttagg tggcactttt cggggaaatg     4800

tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga     4860

gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac     4920

atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc     4980

cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca     5040

tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc     5100

caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg     5160

ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac     5220

cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca     5280

taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg     5340

agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac     5400

cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg     5460

caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat     5520

taatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg     5580

ctggctggtt tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg     5640

cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc     5700

aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc     5760

attggtaact gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt     5820

tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt     5880

aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt     5940

gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag     6000

cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta actggcttca     6060

gcagagcgca gataccaaat actgtccttc tagtgtagcc gtagttaggc caccacttca     6120

agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg     6180

ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg     6240

cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct     6300

acaccgaact gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga     6360

gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc     6420

ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg     6480

agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg     6540

cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt     6600

tatcccctga ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc     6660

gcagccgaac gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcccaatac     6720

gcaaaccgcc tctccccgcg cgttggccga ttcattaatg cagctggcac gacaggtttc     6780

ccgactggaa agcgggcagt gagcgcaacg caattaatgt gagttagctc actcattagg     6840

caccccaggc tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt gtgagcggat     6900

aacaatttca cacaggaaac agctatgacc atga                                 6934


<210>  62
<211>  388
<212>  PRT
<213>  Homo sapiens

<400>  62

Met Ala Val Ser Val Thr Pro Ile Arg Asp Thr Lys Trp Leu Thr Leu 
1               5                   10                  15      


Glu Val Cys Arg Glu Phe Gln Arg Gly Thr Cys Ser Arg Pro Asp Thr 
            20                  25                  30          


Glu Cys Lys Phe Ala His Pro Ser Lys Ser Cys Gln Val Glu Asn Gly 
        35                  40                  45              


Arg Val Ile Ala Cys Phe Asp Ser Leu Lys Gly Arg Cys Ser Arg Glu 
    50                  55                  60                  


Asn Cys Lys Tyr Leu His Pro Pro Pro His Leu Lys Thr Gln Leu Glu 
65                  70                  75                  80  


Ile Asn Gly Arg Asn Asn Leu Ile Gln Gln Lys Asn Met Ala Met Leu 
                85                  90                  95      


Ala Gln Gln Met Gln Leu Ala Asn Ala Met Met Pro Gly Ala Pro Leu 
            100                 105                 110         


Gln Pro Val Pro Met Phe Ser Val Ala Pro Ser Leu Ala Thr Asn Ala 
        115                 120                 125             


Ser Ala Ala Ala Phe Asn Pro Tyr Leu Gly Pro Val Ser Pro Ser Leu 
    130                 135                 140                 


Val Pro Ala Glu Ile Leu Pro Thr Ala Pro Met Leu Val Thr Gly Asn 
145                 150                 155                 160 


Pro Gly Val Pro Val Pro Ala Ala Ala Ala Ala Ala Ala Gln Lys Leu 
                165                 170                 175     


Met Arg Thr Asp Arg Leu Glu Val Cys Arg Glu Tyr Gln Arg Gly Asn 
            180                 185                 190         


Cys Asn Arg Gly Glu Asn Asp Cys Arg Phe Ala His Pro Ala Asp Ser 
        195                 200                 205             


Thr Met Ile Asp Thr Asn Asp Asn Thr Val Thr Val Cys Met Asp Tyr 
    210                 215                 220                 


Ile Lys Gly Arg Cys Ser Arg Glu Lys Cys Lys Tyr Phe His Pro Pro 
225                 230                 235                 240 


Ala His Leu Gln Ala Lys Ile Lys Ala Ala Gln Tyr Gln Val Asn Gln 
                245                 250                 255     


Ala Ala Ala Ala Gln Ala Ala Ala Thr Ala Ala Ala Met Thr Gln Ser 
            260                 265                 270         


Ala Val Lys Ser Leu Lys Arg Pro Leu Glu Ala Thr Phe Asp Leu Gly 
        275                 280                 285             


Ile Pro Gln Ala Val Leu Pro Pro Leu Pro Lys Arg Pro Ala Leu Glu 
    290                 295                 300                 


Lys Thr Asn Gly Ala Thr Ala Val Phe Asn Thr Gly Ile Phe Gln Tyr 
305                 310                 315                 320 


Gln Gln Ala Leu Ala Asn Met Gln Leu Gln Gln His Thr Ala Phe Leu 
                325                 330                 335     


Pro Pro Val Pro Met Val His Gly Ala Thr Pro Ala Thr Val Ser Ala 
            340                 345                 350         


Ala Thr Thr Ser Ala Thr Ser Val Pro Phe Ala Ala Thr Ala Thr Ala 
        355                 360                 365             


Asn Gln Ile Pro Ile Ile Ser Ala Glu His Leu Thr Ser His Lys Tyr 
    370                 375                 380                 


Val Thr Gln Met 
385             


<210>  63
<211>  373
<212>  PRT
<213>  Homo sapiens

<400>  63

Met Ala Leu Asn Val Ala Pro Val Arg Asp Thr Lys Trp Leu Thr Leu 
1               5                   10                  15      


Glu Val Cys Arg Gln Phe Gln Arg Gly Thr Cys Ser Arg Ser Asp Glu 
            20                  25                  30          


Glu Cys Lys Phe Ala His Pro Pro Lys Ser Cys Gln Val Glu Asn Gly 
        35                  40                  45              


Arg Val Ile Ala Cys Phe Asp Ser Leu Lys Gly Arg Cys Ser Arg Glu 
    50                  55                  60                  


Asn Cys Lys Tyr Leu His Pro Pro Thr His Leu Lys Thr Gln Leu Glu 
65                  70                  75                  80  


Ile Asn Gly Arg Asn Asn Leu Ile Gln Gln Lys Thr Ala Ala Ala Met 
                85                  90                  95      


Leu Ala Gln Gln Met Gln Phe Met Phe Pro Gly Thr Pro Leu His Pro 
            100                 105                 110         


Val Pro Thr Phe Pro Val Gly Pro Ala Ile Gly Thr Asn Thr Ala Ile 
        115                 120                 125             


Ser Phe Ala Pro Tyr Leu Ala Pro Val Thr Pro Gly Val Gly Leu Val 
    130                 135                 140                 


Pro Thr Glu Ile Leu Pro Thr Thr Pro Val Ile Val Pro Gly Ser Pro 
145                 150                 155                 160 


Pro Val Thr Val Pro Gly Ser Thr Ala Thr Gln Lys Leu Leu Arg Thr 
                165                 170                 175     


Asp Lys Leu Glu Val Cys Arg Glu Phe Gln Arg Gly Asn Cys Ala Arg 
            180                 185                 190         


Gly Glu Thr Asp Cys Arg Phe Ala His Pro Ala Asp Ser Thr Met Ile 
        195                 200                 205             


Asp Thr Ser Asp Asn Thr Val Thr Val Cys Met Asp Tyr Ile Lys Gly 
    210                 215                 220                 


Arg Cys Met Arg Glu Lys Cys Lys Tyr Phe His Pro Pro Ala His Leu 
225                 230                 235                 240 


Gln Ala Lys Ile Lys Ala Ala Gln His Gln Ala Asn Gln Ala Ala Val 
                245                 250                 255     


Ala Ala Gln Ala Ala Ala Ala Ala Ala Thr Val Met Ala Phe Pro Pro 
            260                 265                 270         


Gly Ala Leu His Pro Leu Pro Lys Arg Gln Ala Leu Glu Lys Ser Asn 
        275                 280                 285             


Gly Thr Ser Ala Val Phe Asn Pro Ser Val Leu His Tyr Gln Gln Ala 
    290                 295                 300                 


Leu Thr Ser Ala Gln Leu Gln Gln His Ala Ala Phe Ile Pro Thr Gly 
305                 310                 315                 320 


Ser Val Leu Cys Met Thr Pro Ala Thr Ser Ile Asp Asn Ser Glu Ile 
                325                 330                 335     


Ile Ser Arg Asn Gly Met Glu Cys Gln Glu Ser Ala Leu Arg Ile Thr 
            340                 345                 350         


Lys His Cys Tyr Cys Thr Tyr Tyr Pro Val Ser Ser Ser Ile Glu Leu 
        355                 360                 365             


Pro Gln Thr Ala Cys 
    370             


<210>  64
<211>  782
<212>  PRT
<213>  Homo sapiens

<400>  64

Met Ser Gly Glu Asp Gly Pro Ala Ala Gly Pro Gly Ala Ala Ala Ala 
1               5                   10                  15      


Ala Ala Arg Glu Arg Arg Arg Glu Gln Leu Arg Gln Trp Gly Ala Arg 
            20                  25                  30          


Ala Gly Ala Glu Pro Gly Pro Gly Glu Arg Arg Ala Arg Thr Val Arg 
        35                  40                  45              


Phe Glu Arg Ala Ala Glu Phe Leu Ala Ala Cys Ala Gly Gly Asp Leu 
    50                  55                  60                  


Asp Glu Ala Arg Leu Met Leu Arg Ala Ala Asp Pro Gly Pro Gly Ala 
65                  70                  75                  80  


Glu Leu Asp Pro Ala Ala Pro Pro Pro Ala Arg Ala Val Leu Asp Ser 
                85                  90                  95      


Thr Asn Ala Asp Gly Ile Ser Ala Leu His Gln Ala Cys Ile Asp Glu 
            100                 105                 110         


Asn Leu Glu Val Val Arg Phe Leu Val Glu Gln Gly Ala Thr Val Asn 
        115                 120                 125             


Gln Ala Asp Asn Glu Gly Trp Thr Pro Leu His Val Ala Ala Ser Cys 
    130                 135                 140                 


Gly Tyr Leu Asp Ile Ala Arg Tyr Leu Leu Ser His Gly Ala Asn Ile 
145                 150                 155                 160 


Ala Ala Val Asn Ser Asp Gly Asp Leu Pro Leu Asp Leu Ala Glu Ser 
                165                 170                 175     


Asp Ala Met Glu Gly Leu Leu Lys Ala Glu Ile Ala Arg Arg Gly Val 
            180                 185                 190         


Asp Val Glu Ala Ala Lys Arg Ala Glu Glu Glu Leu Leu Leu His Asp 
        195                 200                 205             


Thr Arg Cys Trp Leu Asn Gly Gly Ala Met Pro Glu Ala Arg His Pro 
    210                 215                 220                 


Arg Thr Gly Ala Ser Ala Leu His Val Ala Ala Ala Lys Gly Tyr Ile 
225                 230                 235                 240 


Glu Val Met Arg Leu Leu Leu Gln Ala Gly Tyr Asp Pro Glu Leu Arg 
                245                 250                 255     


Asp Gly Asp Gly Trp Thr Pro Leu His Ala Ala Ala His Trp Gly Val 
            260                 265                 270         


Glu Asp Ala Cys Arg Leu Leu Ala Glu His Gly Gly Gly Met Asp Ser 
        275                 280                 285             


Leu Thr His Ala Gly Gln Arg Pro Cys Asp Leu Ala Asp Glu Glu Val 
    290                 295                 300                 


Leu Ser Leu Leu Glu Glu Leu Ala Arg Lys Gln Glu Asp Leu Arg Asn 
305                 310                 315                 320 


Gln Lys Glu Ala Ser Gln Ser Arg Gly Gln Glu Pro Gln Ala Pro Ser 
                325                 330                 335     


Ser Ser Lys His Arg Arg Ser Ser Val Cys Arg Leu Ser Ser Arg Glu 
            340                 345                 350         


Lys Ile Ser Leu Gln Asp Leu Ser Lys Glu Arg Arg Pro Gly Gly Ala 
        355                 360                 365             


Gly Gly Pro Pro Ile Gln Asp Glu Asp Glu Gly Glu Glu Gly Pro Thr 
    370                 375                 380                 


Glu Pro Pro Pro Ala Glu Pro Arg Thr Leu Asn Gly Val Ser Ser Pro 
385                 390                 395                 400 


Pro His Pro Ser Pro Lys Ser Pro Val Gln Leu Glu Glu Ala Pro Phe 
                405                 410                 415     


Ser Arg Arg Phe Gly Leu Leu Lys Thr Gly Ser Ser Gly Ala Leu Gly 
            420                 425                 430         


Pro Pro Glu Arg Arg Thr Ala Glu Gly Ala Pro Gly Ala Gly Leu Gln 
        435                 440                 445             


Arg Ser Ala Ser Ser Ser Trp Leu Glu Gly Thr Ser Thr Gln Ala Lys 
    450                 455                 460                 


Glu Leu Arg Leu Ala Arg Ile Thr Pro Thr Pro Ser Pro Lys Leu Pro 
465                 470                 475                 480 


Glu Pro Ser Val Leu Ser Glu Val Thr Lys Pro Pro Pro Cys Leu Glu 
                485                 490                 495     


Asn Ser Ser Pro Pro Ser Arg Ile Pro Glu Pro Glu Ser Pro Ala Lys 
            500                 505                 510         


Pro Asn Val Pro Thr Ala Ser Thr Ala Pro Pro Ala Asp Ser Arg Asp 
        515                 520                 525             


Arg Arg Arg Ser Tyr Gln Met Pro Val Arg Asp Glu Glu Ser Glu Ser 
    530                 535                 540                 


Gln Arg Lys Ala Arg Ser Arg Leu Met Arg Gln Ser Arg Arg Ser Thr 
545                 550                 555                 560 


Gln Gly Val Thr Leu Thr Asp Leu Lys Glu Ala Glu Lys Ala Ala Gly 
                565                 570                 575     


Lys Ala Pro Glu Ser Glu Lys Pro Ala Gln Ser Leu Asp Pro Ser Arg 
            580                 585                 590         


Arg Pro Arg Val Pro Gly Val Glu Asn Ser Asp Ser Pro Ala Gln Arg 
        595                 600                 605             


Ala Glu Ala Pro Asp Gly Gln Gly Pro Gly Pro Gln Ala Ala Arg Glu 
    610                 615                 620                 


His Arg Lys Val Gly Lys Glu Trp Arg Gly Pro Ala Glu Gly Glu Glu 
625                 630                 635                 640 


Ala Glu Pro Ala Asp Arg Ser Gln Glu Ser Ser Thr Leu Glu Gly Gly 
                645                 650                 655     


Pro Ser Ala Arg Arg Gln Arg Trp Gln Arg Asp Leu Asn Pro Glu Pro 
            660                 665                 670         


Glu Pro Glu Ser Glu Glu Pro Asp Gly Gly Phe Arg Thr Leu Tyr Ala 
        675                 680                 685             


Glu Leu Arg Arg Glu Asn Glu Arg Leu Arg Glu Ala Leu Thr Glu Thr 
    690                 695                 700                 


Thr Leu Arg Leu Ala Gln Leu Lys Val Glu Leu Glu Arg Ala Thr Gln 
705                 710                 715                 720 


Arg Gln Glu Arg Phe Ala Glu Arg Pro Ala Leu Leu Glu Leu Glu Arg 
                725                 730                 735     


Phe Glu Arg Arg Ala Leu Glu Arg Lys Ala Ala Glu Leu Glu Glu Glu 
            740                 745                 750         


Leu Lys Ala Leu Ser Asp Leu Arg Ala Asp Asn Gln Arg Leu Lys Asp 
        755                 760                 765             


Glu Asn Ala Ala Leu Ile Arg Val Ile Ser Lys Leu Ser Lys 
    770                 775                 780         


<210>  65
<211>  4104
<212>  DNA
<213>  Artificial sequence

<220>
<223>  mammalian codon-optimized nuclease-deficient S. pyogenes Cas9
       (D10A, H840A)

<400>  65
atggacaaga agtacagcat cggcctggcc atcggcacca actctgtggg ctgggccgtg       60

atcaccgacg agtacaaggt gcccagcaag aaattcaagg tgctgggcaa caccgaccgg      120

cacagcatca agaagaacct gatcggagcc ctgctgttcg acagcggcga aacagccgag      180

gccacccggc tgaagagaac cgccagaaga agatacacca gacggaagaa ccggatctgc      240

tatctgcaag agatcttcag caacgagatg gccaaggtgg acgacagctt cttccacaga      300

ctggaagagt ccttcctggt ggaagaggat aagaagcacg agcggcaccc catcttcggc      360

aacatcgtgg acgaggtggc ctaccacgag aagtacccca ccatctacca cctgagaaag      420

aaactggtgg acagcaccga caaggccgac ctgcggctga tctatctggc cctggcccac      480

atgatcaagt tccggggcca cttcctgatc gagggcgacc tgaaccccga caacagcgac      540

gtggacaagc tgttcatcca gctggtgcag acctacaacc agctgttcga ggaaaacccc      600

atcaacgcca gcggcgtgga cgccaaggcc atcctgtctg ccagactgag caagagcaga      660

cggctggaaa atctgatcgc ccagctgccc ggcgagaaga agaatggcct gttcggcaac      720

ctgattgccc tgagcctggg cctgaccccc aacttcaaga gcaacttcga cctggccgag      780

gatgccaaac tgcagctgag caaggacacc tacgacgacg acctggacaa cctgctggcc      840

cagatcggcg accagtacgc cgacctgttt ctggccgcca agaacctgtc cgacgccatc      900

ctgctgagcg acatcctgag agtgaacacc gagatcacca aggcccccct gagcgcctct      960

atgatcaaga gatacgacga gcaccaccag gacctgaccc tgctgaaagc tctcgtgcgg     1020

cagcagctgc ctgagaagta caaagagatt ttcttcgacc agagcaagaa cggctacgcc     1080

ggctacattg acggcggagc cagccaggaa gagttctaca agttcatcaa gcccatcctg     1140

gaaaagatgg acggcaccga ggaactgctc gtgaagctga acagagagga cctgctgcgg     1200

aagcagcgga ccttcgacaa cggcagcatc ccccaccaga tccacctggg agagctgcac     1260

gccattctgc ggcggcagga agatttttac ccattcctga aggacaaccg ggaaaagatc     1320

gagaagatcc tgaccttccg catcccctac tacgtgggcc ctctggccag gggaaacagc     1380

agattcgcct ggatgaccag aaagagcgag gaaaccatca ccccctggaa cttcgaggaa     1440

gtggtggaca agggcgcttc cgcccagagc ttcatcgagc ggatgaccaa cttcgataag     1500

aacctgccca acgagaaggt gctgcccaag cacagcctgc tgtacgagta cttcaccgtg     1560

tataacgagc tgaccaaagt gaaatacgtg accgagggaa tgagaaagcc cgccttcctg     1620

agcggcgagc agaaaaaggc catcgtggac ctgctgttca agaccaaccg gaaagtgacc     1680

gtgaagcagc tgaaagagga ctacttcaag aaaatcgagt gcttcgactc cgtggaaatc     1740

tccggcgtgg aagatcggtt caacgcctcc ctgggcacat accacgatct gctgaaaatt     1800

atcaaggaca aggacttcct ggacaatgag gaaaacgagg acattctgga agatatcgtg     1860

ctgaccctga cactgtttga ggacagagag atgatcgagg aacggctgaa aacctatgcc     1920

cacctgttcg acgacaaagt gatgaagcag ctgaagcggc ggagatacac cggctggggc     1980

aggctgagcc ggaagctgat caacggcatc cgggacaagc agtccggcaa gacaatcctg     2040

gatttcctga agtccgacgg cttcgccaac agaaacttca tgcagctgat ccacgacgac     2100

agcctgacct ttaaagagga catccagaaa gcccaggtgt ccggccaggg cgatagcctg     2160

cacgagcaca ttgccaatct ggccggcagc cccgccatta agaagggcat cctgcagaca     2220

gtgaaggtgg tggacgagct cgtgaaagtg atgggccggc acaagcccga gaacatcgtg     2280

atcgaaatgg ccagagagaa ccagaccacc cagaagggac agaagaacag ccgcgagaga     2340

atgaagcgga tcgaagaggg catcaaagag ctgggcagcc agatcctgaa agaacacccc     2400

gtggaaaaca cccagctgca gaacgagaag ctgtacctgt actacctgca gaatgggcgg     2460

gatatgtacg tggaccagga actggacatc aaccggctgt ccgactacga tgtggacgcc     2520

atcgtgcctc agagctttct gaaggacgac tccatcgaca acaaggtgct gaccagaagc     2580

gacaagaacc ggggcaagag cgacaacgtg ccctccgaag aggtcgtgaa gaagatgaag     2640

aactactggc ggcagctgct gaacgccaag ctgattaccc agagaaagtt cgacaatctg     2700

accaaggccg agagaggcgg cctgagcgaa ctggataagg ccggcttcat caagagacag     2760

ctggtggaaa cccggcagat cacaaagcac gtggcacaga tcctggactc ccggatgaac     2820

actaagtacg acgagaatga caagctgatc cgggaagtga aagtgatcac cctgaagtcc     2880

aagctggtgt ccgatttccg gaaggatttc cagttttaca aagtgcgcga gatcaacaac     2940

taccaccacg cccacgacgc ctacctgaac gccgtcgtgg gaaccgccct gatcaaaaag     3000

taccctaagc tggaaagcga gttcgtgtac ggcgactaca aggtgtacga cgtgcggaag     3060

atgatcgcca agagcgagca ggaaatcggc aaggctaccg ccaagtactt cttctacagc     3120

aacatcatga actttttcaa gaccgagatt accctggcca acggcgagat ccggaagcgg     3180

cctctgatcg agacaaacgg cgaaaccggg gagatcgtgt gggataaggg ccgggatttt     3240

gccaccgtgc ggaaagtgct gagcatgccc caagtgaata tcgtgaaaaa gaccgaggtg     3300

cagacaggcg gcttcagcaa agagtctatc ctgcccaaga ggaacagcga taagctgatc     3360

gccagaaaga aggactggga ccctaagaag tacggcggct tcgacagccc caccgtggcc     3420

tattctgtgc tggtggtggc caaagtggaa aagggcaagt ccaagaaact gaagagtgtg     3480

aaagagctgc tggggatcac catcatggaa agaagcagct tcgagaagaa tcccatcgac     3540

tttctggaag ccaagggcta caaagaagtg aaaaaggacc tgatcatcaa gctgcctaag     3600

tactccctgt tcgagctgga aaacggccgg aagagaatgc tggcctctgc cggcgaactg     3660

cagaagggaa acgaactggc cctgccctcc aaatatgtga acttcctgta cctggccagc     3720

cactatgaga agctgaaggg ctcccccgag gataatgagc agaaacagct gtttgtggaa     3780

cagcacaagc actacctgga cgagatcatc gagcagatca gcgagttctc caagagagtg     3840

atcctggccg acgctaatct ggacaaagtg ctgtccgcct acaacaagca ccgggataag     3900

cccatcagag agcaggccga gaatatcatc cacctgttta ccctgaccaa tctgggagcc     3960

cctgccgcct tcaagtactt tgacaccacc atcgaccgga agaggtacac cagcaccaaa     4020

gaggtgctgg acgccaccct gatccaccag agcatcaccg gcctgtacga gacacggatc     4080

gacctgtctc agctgggagg cgac                                            4104


