                         SEQUENCE LISTING

<110>  THE RESEARCH FOUNDATION FOR THE STATE UNIVERSITY OF NEW 
       YORK
 
<120>  COMPOSITIONS AND METHODS COMPRISING PERMUTED PROTEIN TAGS FOR 
       FACILITATING OVEREXPRESSION, SOLUBILITY, AND PURIFICATION OF 
       TARGET PROTEINS

<130>  075098.00034

<150>  62/411,295
<151>  2016-10-21

<160>  16    

<170>  PatentIn version 3.5

<210>  1
<211>  277
<212>  PRT
<213>  Thermoanaerobacter tengcongensis

<400>  1

Lys Glu Gly Lys Thr Ile Gly Leu Val Ile Ser Thr Leu Asn Asn Pro 
1               5                   10                  15      


Phe Phe Val Thr Leu Lys Asn Gly Ala Glu Glu Lys Ala Lys Glu Leu 
            20                  25                  30          


Gly Tyr Lys Ile Ile Val Glu Asp Ser Gln Asn Asp Ser Ser Lys Glu 
        35                  40                  45              


Leu Ser Asn Val Glu Asp Leu Ile Gln Gln Lys Val Asp Val Leu Leu 
    50                  55                  60                  


Ile Asn Pro Val Asp Ser Asp Ala Val Val Thr Ala Ile Lys Glu Ala 
65                  70                  75                  80  


Asn Ser Lys Asn Ile Pro Val Ile Thr Ile Asp Arg Ser Ala Asn Gly 
                85                  90                  95      


Gly Asp Val Val Ser His Ile Ala Ser Asp Asn Val Lys Gly Gly Glu 
            100                 105                 110         


Met Ala Ala Glu Phe Ile Ala Lys Ala Leu Lys Gly Lys Gly Asn Val 
        115                 120                 125             


Val Glu Leu Glu Gly Ile Pro Gly Ala Ser Ala Ala Arg Asp Arg Gly 
    130                 135                 140                 


Lys Gly Phe Asp Glu Ala Ile Ala Lys Tyr Pro Asp Ile Lys Ile Val 
145                 150                 155                 160 


Ala Lys Gln Ala Ala Asp Phe Asp Arg Ser Lys Gly Leu Ser Val Met 
                165                 170                 175     


Glu Asn Ile Leu Gln Ala Gln Pro Lys Ile Asp Ala Val Phe Ala Gln 
            180                 185                 190         


Asn Asp Glu Met Ala Leu Gly Ala Ile Lys Ala Ile Glu Ala Ala Asn 
        195                 200                 205             


Arg Gln Gly Ile Ile Val Val Gly Phe Asp Gly Thr Glu Asp Ala Leu 
    210                 215                 220                 


Lys Ala Ile Lys Glu Gly Lys Met Ala Ala Thr Ile Ala Gln Gln Pro 
225                 230                 235                 240 


Ala Leu Met Gly Ser Leu Gly Val Glu Met Ala Asp Lys Tyr Leu Lys 
                245                 250                 255     


Gly Glu Lys Ile Pro Asn Phe Ile Pro Ala Glu Leu Lys Leu Ile Thr 
            260                 265                 270         


Lys Glu Asn Val Gln 
        275         


<210>  2
<211>  278
<212>  PRT
<213>  Thermoanaerobacter tengcongensis

<400>  2

Met Lys Glu Gly Lys Thr Ile Gly Leu Val Ile Ser Thr Leu Asn Asn 
1               5                   10                  15      


Pro Phe Phe Val Thr Leu Lys Asn Gly Ala Glu Glu Lys Ala Lys Glu 
            20                  25                  30          


Leu Gly Tyr Lys Ile Ile Val Glu Asp Ser Gln Asn Asp Ser Ser Lys 
        35                  40                  45              


Glu Leu Ser Asn Val Glu Asp Leu Ile Gln Gln Lys Val Asp Val Leu 
    50                  55                  60                  


Leu Ile Asn Pro Val Asp Ser Asp Ala Val Val Thr Ala Ile Lys Glu 
65                  70                  75                  80  


Ala Asn Ser Lys Asn Ile Pro Val Ile Thr Ile Asp Arg Ser Ala Asn 
                85                  90                  95      


Gly Gly Asp Val Val Ser His Ile Ala Ser Asp Asn Val Lys Gly Gly 
            100                 105                 110         


Glu Met Ala Ala Glu Phe Ile Ala Lys Ala Leu Lys Gly Lys Gly Asn 
        115                 120                 125             


Val Val Glu Leu Glu Gly Ile Pro Gly Ala Ser Ala Ala Arg Asp Arg 
    130                 135                 140                 


Gly Lys Gly Phe Asp Glu Ala Ile Ala Lys Tyr Pro Asp Ile Lys Ile 
145                 150                 155                 160 


Val Ala Lys Gln Ala Ala Asp Phe Asp Arg Ser Lys Gly Leu Ser Val 
                165                 170                 175     


Met Glu Asn Ile Leu Gln Ala Gln Pro Lys Ile Asp Ala Val Phe Ala 
            180                 185                 190         


Gln Asn Asp Glu Met Ala Leu Gly Ala Ile Lys Ala Ile Glu Ala Ala 
        195                 200                 205             


Asn Arg Gln Gly Ile Ile Val Val Gly Phe Asp Gly Thr Glu Asp Ala 
    210                 215                 220                 


Leu Lys Ala Ile Lys Glu Gly Lys Met Ala Ala Thr Ile Ala Gln Gln 
225                 230                 235                 240 


Pro Ala Leu Met Gly Ser Leu Gly Val Glu Met Ala Asp Lys Tyr Leu 
                245                 250                 255     


Lys Gly Glu Lys Ile Pro Asn Phe Ile Pro Ala Glu Leu Lys Leu Ile 
            260                 265                 270         


Thr Lys Glu Asn Val Gln 
        275             


<210>  3
<211>  353
<212>  PRT
<213>  artificial sequence

<220>
<223>  recombinant protein

<400>  3

Met Gly Ser Ser His His His His His His Ser Ser Ser Gly Asp Val 
1               5                   10                  15      


Val Ser His Ile Ala Ser Asp Asn Val Lys Gly Gly Glu Met Ala Ala 
            20                  25                  30          


Glu Phe Ile Ala Lys Ala Leu Lys Gly Lys Gly Asn Val Val Glu Leu 
        35                  40                  45              


Glu Gly Ile Pro Gly Ala Ser Ala Ala Arg Asp Arg Gly Lys Gly Phe 
    50                  55                  60                  


Asp Glu Ala Ile Ala Lys Tyr Pro Asp Ile Lys Ile Val Ala Lys Gln 
65                  70                  75                  80  


Ala Ala Asp Phe Asp Arg Ser Lys Gly Leu Ser Val Met Glu Asn Ile 
                85                  90                  95      


Leu Gln Ala Gln Pro Lys Ile Asp Ala Val Phe Ala Gln Asn Asp Glu 
            100                 105                 110         


Met Ala Leu Gly Ala Ile Lys Ala Ile Glu Ala Ala Asn Arg Gln Gly 
        115                 120                 125             


Ile Ile Val Val Gly Phe Asp Gly Thr Glu Asp Ala Leu Lys Ala Ile 
    130                 135                 140                 


Lys Glu Gly Lys Met Ala Ala Thr Ile Ala Gln Gln Pro Ala Leu Met 
145                 150                 155                 160 


Gly Ser Leu Gly Val Glu Met Ala Asp Lys Tyr Leu Lys Gly Glu Lys 
                165                 170                 175     


Ile Pro Asn Phe Ile Pro Ala Glu Leu Lys Leu Ile Thr Lys Glu Asn 
            180                 185                 190         


Val Gln Gly Gly Ser Ala Ser Gly Gly Thr Ser Gly Gly Ser Ser Ala 
        195                 200                 205             


Ala Gly Leu Glu Val Leu Phe Gln Gly Pro Ala Ala Ala Gly Gly Pro 
    210                 215                 220                 


Ser Gly Thr Met Gly Ser Gly Ser Gly Gly Leu Glu Val Leu Phe Gln 
225                 230                 235                 240 


Gly Pro Gly Ser Ser Gly Gly Thr Ala Ser Gly Gly Lys Glu Gly Lys 
                245                 250                 255     


Thr Ile Gly Leu Val Ile Ser Thr Leu Asn Asn Pro Phe Phe Val Thr 
            260                 265                 270         


Leu Lys Asn Gly Ala Glu Glu Lys Ala Lys Glu Leu Gly Tyr Lys Ile 
        275                 280                 285             


Ile Val Glu Asp Ser Gln Asn Asp Ser Ser Lys Glu Leu Ser Asn Val 
    290                 295                 300                 


Glu Asp Leu Ile Gln Gln Lys Val Asp Val Leu Leu Ile Asn Pro Val 
305                 310                 315                 320 


Asp Ser Asp Ala Val Val Thr Ala Ile Lys Glu Ala Asn Ser Lys Asn 
                325                 330                 335     


Ile Pro Val Ile Thr Ile Asp Arg Ser Ala Asn Gly His His His His 
            340                 345                 350         


His 
    


<210>  4
<211>  1062
<212>  DNA
<213>  artificial sequence

<220>
<223>  recombinant DNA sequence

<400>  4
atgggtagct cacaccatca ccatcaccat tcgagctcgg gcgacgtagt gtcgcatatc       60

gcaagcgata acgtcaaggg aggtgagatg gctgccgagt tcattgcgaa agccttaaag      120

ggcaaaggga acgtcgtgga attagagggc atcccaggag cgtcggcagc tcgtgaccgc      180

ggaaaagggt ttgacgaggc tattgctaag tacccagaca tcaagatcgt ggccaaacaa      240

gccgccgact ttgatcgcag taagggtttg agcgttatgg aaaacatctt acaggcgcaa      300

ccgaaaattg atgcagtgtt cgcgcaaaat gatgagatgg ctttgggtgc aattaaagca      360

attgaggctg cgaaccgcca aggtattatc gtggtcggct tcgacggtac agaggacgcg      420

ttgaaggcga ttaaagaagg gaagatggcg gcgaccattg cacaacagcc ggccttgatg      480

ggctcgcttg gggtcgagat ggctgataaa tatcttaagg gcgaaaaaat ccctaatttt      540

atccccgctg aattgaaatt gatcactaaa gaaaatgtgc aagggggttc tgcctctgga      600

gggacaagtg gtggatcgag tgcggccggt ttggaggttt tattccaggg tccagcggcc      660

gcaggtggcc cgagcgggac catgggctct ggatccggtg gcttagaagt attattccaa      720

ggtccaggtt cgtcaggagg gacggcatct ggagggaaag aaggcaaaac catcggtctg      780

gtcatcagta ccttgaataa ccccttcttt gtaaccttga aaaatggcgc tgaggagaaa      840

gcgaaggaac tgggatacaa gatcatcgta gaggattccc agaatgactc ctcgaaggag      900

ttatccaacg ttgaggattt aattcaacag aaggtcgacg tactgctgat taaccctgta      960

gattcggatg ctgttgtaac cgccattaag gaagcaaata gcaagaacat tccagttatt     1020

actattgatc gctcggccaa cgggcaccac caccaccact aa                        1062


<210>  5
<211>  830
<212>  PRT
<213>  artificial sequence

<220>
<223>  recombinant RBP with MDM2

<400>  5

Met Gly Ser Ser His His His Ser Ser Ser Gly Asp Val Val Ser His 
1               5                   10                  15      


Ile Ala Ser Asp Asn Val Lys Gly Gly Glu Met Ala Ala Glu Phe Ile 
            20                  25                  30          


Ala Lys Ala Leu Lys Gly Lys Gly Asn Val Val Glu Leu Glu Gly Ile 
        35                  40                  45              


Pro Gly Ala Ser Ala Ala Arg Asp Arg Gly Lys Gly Phe Asp Glu Ala 
    50                  55                  60                  


Ile Ala Lys Tyr Pro Asp Ile Lys Ile Val Ala Lys Gln Ala Ala Asp 
65                  70                  75                  80  


Phe Asp Arg Ser Lys Gly Leu Ser Val Met Glu Asn Ile Leu Gln Ala 
                85                  90                  95      


Gln Pro Lys Ile Asp Ala Val Phe Ala Gln Asn Asp Glu Met Ala Leu 
            100                 105                 110         


Gly Ala Ile Lys Ala Ile Glu Ala Ala Asn Arg Gln Gly Ile Ile Val 
        115                 120                 125             


Val Gly Phe Asp Gly Thr Glu Asp Ala Leu Lys Ala Ile Lys Glu Gly 
    130                 135                 140                 


Lys Met Ala Ala Thr Ile Ala Gln Gln Pro Ala Leu Met Gly Ser Leu 
145                 150                 155                 160 


Gly Val Glu Met Ala Asp Lys Tyr Leu Lys Gly Glu Lys Ile Pro Asn 
                165                 170                 175     


Phe Ile Pro Ala Glu Leu Lys Leu Ile Thr Lys Glu Asn Val Gln Gly 
            180                 185                 190         


Gly Ser Ala Ser Gly Gly Thr Ser Gly Gly Ser Ser Ala Ala Gly Leu 
        195                 200                 205             


Glu Val Leu Phe Gln Gly Pro Ala Ala Ala Met Cys Asn Thr Asn Met 
    210                 215                 220                 


Ser Val Pro Thr Asp Gly Ala Val Thr Thr Ser Gln Ile Pro Ala Ser 
225                 230                 235                 240 


Glu Gln Glu Thr Leu Val Arg Pro Lys Pro Leu Leu Leu Lys Leu Leu 
                245                 250                 255     


Lys Ser Val Gly Ala Gln Lys Asp Thr Tyr Thr Met Lys Glu Val Leu 
            260                 265                 270         


Phe Tyr Leu Gly Gln Tyr Ile Met Thr Lys Arg Leu Tyr Asp Glu Lys 
        275                 280                 285             


Gln Gln His Ile Val Tyr Cys Ser Asn Asp Leu Leu Gly Asp Leu Phe 
    290                 295                 300                 


Gly Val Pro Ser Phe Ser Val Lys Glu His Arg Lys Ile Tyr Thr Met 
305                 310                 315                 320 


Ile Tyr Arg Asn Leu Val Val Val Asn Gln Gln Glu Ser Ser Asp Ser 
                325                 330                 335     


Gly Thr Ser Val Ser Glu Asn Arg Cys His Leu Glu Gly Gly Ser Asp 
            340                 345                 350         


Gln Lys Asp Leu Val Gln Glu Leu Gln Glu Glu Lys Pro Ser Ser Ser 
        355                 360                 365             


His Leu Val Ser Arg Pro Ser Thr Ser Ser Arg Arg Arg Ala Ile Ser 
    370                 375                 380                 


Glu Thr Glu Glu Asn Ser Asp Glu Leu Ser Gly Glu Arg Gln Arg Lys 
385                 390                 395                 400 


Arg His Lys Ser Asp Ser Ile Ser Leu Ser Phe Asp Glu Ser Leu Ala 
                405                 410                 415     


Leu Cys Val Ile Arg Glu Ile Cys Cys Glu Arg Ser Ser Ser Ser Glu 
            420                 425                 430         


Ser Thr Gly Thr Pro Ser Asn Pro Asp Leu Asp Ala Gly Val Ser Glu 
        435                 440                 445             


His Ser Gly Asp Trp Leu Asp Gln Asp Ser Val Ser Asp Gln Phe Ser 
    450                 455                 460                 


Val Glu Phe Glu Val Glu Ser Leu Asp Ser Glu Asp Tyr Ser Leu Ser 
465                 470                 475                 480 


Glu Glu Gly Gln Glu Leu Ser Asp Glu Asp Asp Glu Val Tyr Gln Val 
                485                 490                 495     


Thr Val Tyr Gln Ala Gly Glu Ser Asp Thr Asp Ser Phe Glu Glu Asp 
            500                 505                 510         


Pro Glu Ile Ser Leu Ala Asp Tyr Trp Lys Cys Thr Ser Cys Asn Glu 
        515                 520                 525             


Met Asn Pro Pro Leu Pro Ser His Cys Asn Arg Cys Trp Ala Leu Arg 
    530                 535                 540                 


Glu Asn Trp Leu Pro Glu Asp Lys Gly Lys Asp Lys Gly Glu Ile Ser 
545                 550                 555                 560 


Glu Lys Ala Lys Leu Glu Asn Ser Thr Gln Ala Glu Glu Gly Phe Asp 
                565                 570                 575     


Val Pro Asp Cys Lys Lys Thr Ile Val Asn Asp Ser Arg Glu Ser Cys 
            580                 585                 590         


Val Glu Glu Asn Asp Asp Lys Ile Thr Gln Ala Ser Gln Ser Gln Glu 
        595                 600                 605             


Ser Glu Asp Tyr Ser Gln Pro Ser Thr Ser Ser Ser Ile Ile Tyr Ser 
    610                 615                 620                 


Ser Gln Glu Asp Val Lys Glu Phe Glu Arg Glu Glu Thr Gln Asp Lys 
625                 630                 635                 640 


Glu Glu Ser Val Glu Ser Ser Leu Pro Leu Asn Ala Ile Glu Pro Cys 
                645                 650                 655     


Val Ile Cys Gln Gly Arg Pro Lys Asn Gly Cys Ile Val His Gly Lys 
            660                 665                 670         


Thr Gly His Leu Met Ala Cys Phe Thr Cys Ala Lys Lys Leu Lys Lys 
        675                 680                 685             


Arg Asn Lys Pro Cys Pro Val Cys Arg Gln Pro Ile Gln Met Ile Val 
    690                 695                 700                 


Leu Thr Tyr Phe Pro Gly Ser Gly Gly Leu Glu Val Leu Phe Gln Gly 
705                 710                 715                 720 


Pro Gly Ser Ser Gly Gly Thr Ala Ser Gly Gly Lys Glu Gly Lys Thr 
                725                 730                 735     


Ile Gly Leu Val Ile Ser Thr Leu Asn Asn Pro Phe Phe Val Thr Leu 
            740                 745                 750         


Lys Asn Gly Ala Glu Glu Lys Ala Lys Glu Leu Gly Tyr Lys Ile Ile 
        755                 760                 765             


Val Glu Asp Ser Gln Asn Asp Ser Ser Lys Glu Leu Ser Asn Val Glu 
    770                 775                 780                 


Asp Leu Ile Gln Gln Lys Val Asp Val Leu Leu Ile Asn Pro Val Asp 
785                 790                 795                 800 


Ser Asp Ala Val Val Thr Ala Ile Lys Glu Ala Asn Ser Lys Asn Ile 
                805                 810                 815     


Pro Val Ile Thr Ile Asp Arg Ser Ala Asn Gly His His His 
            820                 825                 830 


<210>  6
<211>  344
<212>  PRT
<213>  artificial sequence

<220>
<223>  recombinant polypeptide

<400>  6

Met Gly Ser His His His Lys Gly Asn Val Val Glu Leu Glu Gly Ile 
1               5                   10                  15      


Pro Gly Ala Ser Ala Ala Arg Asp Arg Gly Lys Gly Phe Asp Glu Ala 
            20                  25                  30          


Ile Ala Lys Tyr Pro Asp Ile Lys Ile Val Ala Lys Gln Ala Ala Asp 
        35                  40                  45              


Phe Asp Arg Ser Lys Gly Leu Ser Val Met Glu Asn Ile Leu Gln Ala 
    50                  55                  60                  


Gln Pro Lys Ile Asp Ala Val Phe Ala Gln Asn Asp Glu Met Ala Leu 
65                  70                  75                  80  


Gly Ala Ile Lys Ala Ile Glu Ala Ala Asn Arg Gln Gly Ile Ile Val 
                85                  90                  95      


Val Gly Phe Asp Gly Thr Glu Asp Ala Leu Lys Ala Ile Lys Glu Gly 
            100                 105                 110         


Lys Met Ala Ala Thr Ile Ala Gln Gln Pro Ala Leu Met Gly Ser Leu 
        115                 120                 125             


Gly Val Glu Met Ala Asp Lys Tyr Leu Lys Gly Glu Lys Ile Pro Asn 
    130                 135                 140                 


Phe Ile Pro Ala Glu Leu Lys Leu Ile Thr Lys Glu Asn Val Gln Gly 
145                 150                 155                 160 


Gly Ser Ala Ser Gly Gly Thr Ser Gly Gly Ser Ser Ala Ala Gly Leu 
                165                 170                 175     


Glu Val Leu Phe Gln Gly Pro Ala Ala Ala Gly Gly Pro Ser Gly Thr 
            180                 185                 190         


Met Gly Ser Gly Ser Gly Gly Leu Glu Val Leu Phe Gln Gly Pro Gly 
        195                 200                 205             


Ser Ser Gly Gly Thr Ala Ser Gly Gly Lys Glu Gly Lys Thr Ile Gly 
    210                 215                 220                 


Leu Val Ile Ser Thr Leu Asn Asn Pro Phe Phe Val Thr Leu Lys Asn 
225                 230                 235                 240 


Gly Ala Glu Glu Lys Ala Lys Glu Leu Gly Tyr Lys Ile Ile Val Glu 
                245                 250                 255     


Asp Ser Gln Asn Asp Ser Ser Lys Glu Leu Ser Asn Val Glu Asp Leu 
            260                 265                 270         


Ile Gln Gln Lys Val Asp Val Leu Leu Ile Asn Pro Val Asp Ser Asp 
        275                 280                 285             


Ala Val Val Thr Ala Ile Lys Glu Ala Asn Ser Lys Asn Ile Pro Val 
    290                 295                 300                 


Ile Thr Ile Asp Arg Ser Ala Asn Gly Gly Asp Val Val Ser His Ile 
305                 310                 315                 320 


Ala Ser Asp Asn Val Lys Gly Gly Glu Met Ala Ala Glu Phe Ile Ala 
                325                 330                 335     


Lys Ala Leu Lys Gly His His His 
            340                 


<210>  7
<211>  1035
<212>  DNA
<213>  artificial sequence

<220>
<223>  DNA sequence encoding recombinant polypeptide

<400>  7
atgggtagcc accatcacaa agggaacgtc gtggaattag agggcatccc aggagcgtcg       60

gcagctcgtg accgcggaaa agggtttgac gaggctattg ctaagtaccc agacatcaag      120

atcgtggcca aacaagccgc cgactttgat cgcagtaagg gtttgagcgt tatggaaaac      180

atcttacagg cgcaaccgaa aattgatgca gtgttcgcgc aaaatgatga gatggctttg      240

ggtgcaatta aagcaattga ggctgcgaac cgccaaggta ttatcgtggt cggcttcgac      300

ggtacagagg acgcgttgaa ggcgattaaa gaagggaaga tggcggcgac cattgcacaa      360

cagccggcct tgatgggctc gcttggggtc gagatggctg ataaatatct taagggcgaa      420

aaaatcccta attttatccc cgctgaattg aaattgatca ctaaagaaaa tgtgcaaggg      480

ggttctgcct ctggagggac aagtggtgga tcgagtgcgg ccggtttgga ggttttattc      540

cagggtccag cggccgcagg tggcccgagc gggaccatgg gctctggatc cggtggctta      600

gaagtattat tccaaggtcc aggttcgtca ggagggacgg catctggagg gaaagaaggc      660

aaaaccatcg gtctggtcat cagtaccttg aataacccct tctttgtaac cttgaaaaat      720

ggcgctgagg agaaagcgaa ggaactggga tacaagatca tcgtagagga ttcccagaat      780

gactcctcga aggagttatc caacgttgag gatttaattc aacagaaggt cgacgtactg      840

ctgattaacc ctgtagattc ggatgctgtt gtaaccgcca ttaaggaagc aaatagcaag      900

aacattccag ttattactat tgatcgctcg gccaacgggg gtgatgttgt ttcccatatc      960

gccagcgata atgttaaggg tggcgaaatg gccgcggaat ttatcgcgaa agccctgaaa     1020

ggccaccatc actaa                                                      1035


<210>  8
<211>  575
<212>  PRT
<213>  artificial sequence

<220>
<223>  recombinant polypeptide

<400>  8

Met Gly Ser His His His Lys Gly Asn Val Val Glu Leu Glu Gly Ile 
1               5                   10                  15      


Pro Gly Ala Ser Ala Ala Arg Asp Arg Gly Lys Gly Phe Asp Glu Ala 
            20                  25                  30          


Ile Ala Lys Tyr Pro Asp Ile Lys Ile Val Ala Lys Gln Ala Ala Asp 
        35                  40                  45              


Phe Asp Arg Ser Lys Gly Leu Ser Val Met Glu Asn Ile Leu Gln Ala 
    50                  55                  60                  


Gln Pro Lys Ile Asp Ala Val Phe Ala Gln Asn Asp Glu Met Ala Leu 
65                  70                  75                  80  


Gly Ala Ile Lys Ala Ile Glu Ala Ala Asn Arg Gln Gly Ile Ile Val 
                85                  90                  95      


Val Gly Phe Asp Gly Thr Glu Asp Ala Leu Lys Ala Ile Lys Glu Gly 
            100                 105                 110         


Lys Met Ala Ala Thr Ile Ala Gln Gln Pro Ala Leu Met Gly Ser Leu 
        115                 120                 125             


Gly Val Glu Met Ala Asp Lys Tyr Leu Lys Gly Glu Lys Ile Pro Asn 
    130                 135                 140                 


Phe Ile Pro Ala Glu Leu Lys Leu Ile Thr Lys Glu Asn Val Gln Gly 
145                 150                 155                 160 


Gly Ser Ala Ser Gly Gly Thr Ser Gly Gly Ser Ser Ala Ala Gly Leu 
                165                 170                 175     


Glu Val Leu Phe Gln Gly Pro Ala Ala Ala Gly Met Val Ser Lys Gly 
            180                 185                 190         


Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly 
        195                 200                 205             


Asp Val Asn Gly His Lys Phe Ser Val Arg Gly Glu Gly Glu Gly Asp 
    210                 215                 220                 


Ala Thr Asn Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys 
225                 230                 235                 240 


Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe Gly Tyr Gly Val 
                245                 250                 255     


Ala Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe 
            260                 265                 270         


Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Ser Phe 
        275                 280                 285             


Lys Asp Asp Gly Thr Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly 
    290                 295                 300                 


Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu 
305                 310                 315                 320 


Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Phe Asn Ser His 
                325                 330                 335     


Asn Val Tyr Ile Thr Ala Asp Lys Gln Lys Asn Gly Ile Lys Ala Asn 
            340                 345                 350         


Phe Lys Ile Arg His Asn Val Glu Asp Gly Ser Val Gln Leu Ala Asp 
        355                 360                 365             


His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro 
    370                 375                 380                 


Asp Asn His Tyr Leu Ser His Gln Ser Ala Leu Ser Lys Asp Pro Asn 
385                 390                 395                 400 


Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly 
                405                 410                 415     


Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys Gly Ser Gly Gly Leu Glu 
            420                 425                 430         


Val Leu Phe Gln Gly Pro Gly Ser Ser Gly Gly Thr Ala Ser Gly Gly 
        435                 440                 445             


Lys Glu Gly Lys Thr Ile Gly Leu Val Ile Ser Thr Leu Asn Asn Pro 
    450                 455                 460                 


Phe Phe Val Thr Leu Lys Asn Gly Ala Glu Glu Lys Ala Lys Glu Leu 
465                 470                 475                 480 


Gly Tyr Lys Ile Ile Val Glu Asp Ser Gln Asn Asp Ser Ser Lys Glu 
                485                 490                 495     


Leu Ser Asn Val Glu Asp Leu Ile Gln Gln Lys Val Asp Val Leu Leu 
            500                 505                 510         


Ile Asn Pro Val Asp Ser Asp Ala Val Val Thr Ala Ile Lys Glu Ala 
        515                 520                 525             


Asn Ser Lys Asn Ile Pro Val Ile Thr Ile Asp Arg Ser Ala Asn Gly 
    530                 535                 540                 


Gly Asp Val Val Ser His Ile Ala Ser Asp Asn Val Lys Gly Gly Glu 
545                 550                 555                 560 


Met Ala Ala Glu Phe Ile Ala Lys Ala Leu Lys Gly His His His 
                565                 570                 575 


<210>  9
<211>  837
<212>  DNA
<213>  artificial sequence

<220>
<223>  DNA sequence encoding RBP with CP sites

<400>  9
atgaaagaag gcaaaaccat cggtctggtc atcagtacct tgaataaccc cttctttgta       60

accttgaaaa atggcgctga ggagaaagcg aaggaactgg gatacaagat catcgtagag      120

gattcccaga atgactcctc gaaggagtta tccaacgttg aggatttaat tcaacagaag      180

gtcgacgtac tgctgattaa ccctgtagat tcggatgctg ttgtaaccgc cattaaggaa      240

gcaaatagca agaacattcc agttattact attgatcgct cggccaacgg gggcgacgta      300

gtgtcgcata tcgcaagcga taacgtcaag ggaggtgaga tggctgccga gttcattgcg      360

aaagccttaa agggcaaagg gaacgtcgtg gaattagagg gcatcccagg agcgtcggca      420

gctcgtgacc gcggaaaagg gtttgacgag gctattgcta agtacccaga catcaagatc      480

gtggccaaac aagccgccga ctttgatcgc agtaagggtt tgagcgttat ggaaaacatc      540

ttacaggcgc aaccgaaaat tgatgcagtg ttcgcgcaaa atgatgagat ggctttgggt      600

gcaattaaag caattgaggc tgcgaaccgc caaggtatta tcgtggtcgg cttcgacggt      660

acagaggacg cgttgaaggc gattaaagaa gggaagatgg cggcgaccat tgcacaacag      720

ccggccttga tgggctcgct tggggtcgag atggctgata aatatcttaa gggcgaaaaa      780

atccctaatt ttatccccgc tgaattgaaa ttgatcacta aagaaaatgt gcaataa         837


<210>  10
<211>  380
<212>  PRT
<213>  Pyrococcus furiosus

<400>  10

Met Lys Ile Glu Glu Gly Lys Val Val Ile Trp His Ala Met Gln Pro 
1               5                   10                  15      


Asn Glu Leu Glu Val Phe Gln Ser Leu Ala Glu Glu Tyr Met Ala Leu 
            20                  25                  30          


Pro Glu Val Glu Ile Val Phe Glu Gln Lys Pro Asn Leu Glu Asp Ala 
        35                  40                  45              


Leu Lys Ala Ala Ile Pro Thr Gly Gln Gly Pro Asp Leu Phe Ile Trp 
    50                  55                  60                  


Ala His Asp Trp Ile Gly Lys Phe Ala Glu Ala Gly Leu Leu Glu Pro 
65                  70                  75                  80  


Ile Asp Glu Tyr Val Thr Glu Asp Leu Leu Asn Glu Phe Ala Pro Met 
                85                  90                  95      


Ala Gln Asp Ala Met Gln Tyr Lys Gly His Tyr Tyr Ala Leu Pro Phe 
            100                 105                 110         


Ala Ala Glu Thr Val Ala Ile Ile Tyr Asn Lys Glu Met Val Ser Glu 
        115                 120                 125             


Pro Pro Lys Thr Phe Asp Glu Met Lys Ala Ile Met Glu Lys Tyr Tyr 
    130                 135                 140                 


Asp Pro Ala Asn Glu Lys Tyr Gly Ile Ala Trp Pro Ile Asn Ala Tyr 
145                 150                 155                 160 


Phe Ile Ser Ala Ile Ala Gln Ala Phe Gly Gly Tyr Tyr Phe Asp Asp 
                165                 170                 175     


Lys Thr Glu Gln Pro Gly Leu Asp Lys Pro Glu Thr Ile Glu Gly Phe 
            180                 185                 190         


Lys Phe Phe Phe Thr Glu Ile Trp Pro Tyr Met Ala Pro Thr Gly Asp 
        195                 200                 205             


Tyr Asn Thr Gln Gln Ser Ile Phe Leu Glu Gly Arg Ala Pro Met Met 
    210                 215                 220                 


Val Asn Gly Pro Trp Ser Ile Asn Asp Val Lys Lys Ala Gly Ile Asn 
225                 230                 235                 240 


Phe Gly Val Val Pro Leu Pro Pro Ile Ile Lys Asp Gly Lys Glu Tyr 
                245                 250                 255     


Trp Pro Arg Pro Tyr Gly Gly Val Lys Leu Ile Tyr Phe Ala Ala Gly 
            260                 265                 270         


Ile Lys Asn Lys Asp Ala Ala Trp Lys Phe Ala Lys Trp Leu Thr Thr 
        275                 280                 285             


Ser Glu Glu Ser Ile Lys Thr Leu Ala Leu Glu Leu Gly Tyr Ile Pro 
    290                 295                 300                 


Val Leu Thr Lys Val Leu Asp Asp Pro Glu Ile Lys Asn Asp Pro Val 
305                 310                 315                 320 


Ile Tyr Gly Phe Gly Gln Ala Val Gln His Ala Tyr Leu Met Pro Lys 
                325                 330                 335     


Ser Pro Lys Met Ser Ala Val Trp Gly Gly Val Asp Gly Ala Ile Asn 
            340                 345                 350         


Glu Ile Leu Gln Asp Pro Gln Asn Ala Asp Ile Glu Gly Ile Leu Lys 
        355                 360                 365             


Lys Tyr Gln Gln Glu Ile Leu Asn Asn Met Gln Gly 
    370                 375                 380 


<210>  11
<211>  446
<212>  PRT
<213>  artificial sequence

<220>
<223>  Pful permutated polypeptide

<400>  11

Met Gly Ser Ser His His His Ser Ser Ser Glu Pro Pro Lys Thr Phe 
1               5                   10                  15      


Asp Glu Met Lys Ala Ile Met Glu Lys Tyr Tyr Asp Pro Ala Asn Glu 
            20                  25                  30          


Lys Tyr Gly Ile Ala Trp Pro Ile Asn Ala Tyr Phe Ile Ser Ala Ile 
        35                  40                  45              


Ala Gln Ala Phe Gly Gly Tyr Tyr Phe Asp Asp Lys Thr Glu Gln Pro 
    50                  55                  60                  


Gly Leu Asp Lys Pro Glu Thr Ile Glu Gly Phe Lys Phe Phe Phe Thr 
65                  70                  75                  80  


Glu Ile Trp Pro Tyr Met Ala Pro Thr Gly Asp Tyr Asn Thr Gln Gln 
                85                  90                  95      


Ser Ile Phe Leu Glu Gly Arg Ala Pro Met Met Val Asn Gly Pro Trp 
            100                 105                 110         


Ser Ile Asn Asp Val Lys Lys Ala Gly Ile Asn Phe Gly Val Val Pro 
        115                 120                 125             


Leu Pro Pro Ile Ile Lys Asp Gly Lys Glu Tyr Trp Pro Arg Pro Tyr 
    130                 135                 140                 


Gly Gly Val Lys Leu Ile Tyr Phe Ala Ala Gly Ile Lys Asn Lys Asp 
145                 150                 155                 160 


Ala Ala Trp Lys Phe Ala Lys Trp Leu Thr Thr Ser Glu Glu Ser Ile 
                165                 170                 175     


Lys Thr Leu Ala Leu Glu Leu Gly Tyr Ile Pro Val Leu Thr Lys Val 
            180                 185                 190         


Leu Asp Asp Pro Glu Ile Lys Asn Asp Pro Val Ile Tyr Gly Phe Gly 
        195                 200                 205             


Gln Ala Val Gln His Ala Tyr Leu Met Pro Lys Ser Pro Lys Met Ser 
    210                 215                 220                 


Ala Val Trp Gly Gly Val Asp Gly Ala Ile Asn Glu Ile Leu Gln Asp 
225                 230                 235                 240 


Pro Gln Asn Ala Asp Ile Glu Gly Ile Leu Lys Lys Tyr Gln Gln Glu 
                245                 250                 255     


Ile Leu Asn Asn Met Gln Gly Gly Gly Ser Ala Ser Gly Gly Thr Ser 
            260                 265                 270         


Gly Gly Ser Ser Ala Ala Gly Glu Asn Leu Tyr Phe Gln Gly Ala Ala 
        275                 280                 285             


Ala Gly Gly Pro Ser Gly Thr Met Gly Ser Gly Ser Gly Gly Leu Val 
    290                 295                 300                 


Pro Arg Gly Ser Gly Ser Ser Gly Gly Thr Ala Ser Gly Gly Lys Ile 
305                 310                 315                 320 


Glu Glu Gly Lys Val Val Ile Trp His Ala Met Gln Pro Asn Glu Leu 
                325                 330                 335     


Glu Val Phe Gln Ser Leu Ala Glu Glu Tyr Met Ala Leu Pro Glu Val 
            340                 345                 350         


Glu Ile Val Phe Glu Gln Lys Pro Asn Leu Glu Asp Ala Leu Lys Ala 
        355                 360                 365             


Ala Ile Pro Thr Gly Gln Gly Pro Asp Leu Phe Ile Trp Ala His Asp 
    370                 375                 380                 


Trp Ile Gly Lys Phe Ala Glu Ala Gly Leu Leu Glu Pro Ile Asp Glu 
385                 390                 395                 400 


Tyr Val Thr Glu Asp Leu Leu Asn Glu Phe Ala Pro Met Ala Gln Asp 
                405                 410                 415     


Ala Met Gln Tyr Lys Gly His Tyr Tyr Ala Leu Pro Phe Ala Ala Glu 
            420                 425                 430         


Thr Val Ala Ile Ile Tyr Asn Lys Glu Met Val His His His 
        435                 440                 445     


<210>  12
<211>  1341
<212>  DNA
<213>  artificial sequence

<220>
<223>  recombinant DNA sequence

<400>  12
atgggtagct cacaccatca ttcgagctcc gaaccaccga aaacctttga cgaaatgaaa       60

gcaatcatgg agaaatacta cgatccggcc aacgagaaat acggcattgc atggccgatt      120

aatgcgtact tcattagcgc cattgcgcaa gcgttcggag gttactattt cgatgacaaa      180

acggagcaac ctgggttaga taaacccgaa acgattgaag gttttaaatt cttctttacc      240

gagatttggc cttacatggc tcctactggc gactataata cgcaacagtc gatctttctg      300

gaaggacgtg ctccgatgat ggtgaacggt ccgtggagca ttaacgacgt caagaaagcc      360

ggcattaact ttggggttgt tccgttaccg ccgatcatca aggacggtaa agaatattgg      420

ccacgcccat acggtggtgt caaactgatc tactttgcag cgggcattaa gaacaaagat      480

gccgcgtgga aatttgcgaa atggctgacc acctcggaag aaagcatcaa aacattggca      540

ctggagttgg gctatattcc cgttctcact aaggtacttg atgatccgga aattaagaac      600

gatccggtaa tttatggctt tggtcaggcc gtgcagcatg cctatctgat gcctaaatct      660

cccaaaatgt cagcggtttg gggcggggtt gacggtgcga ttaatgagat cctgcaagat      720

ccgcagaatg ccgacatcga aggcatcttg aagaagtatc agcaggaaat tctgaacaat      780

atgcagggcg ggggttctgc ctctggaggg acaagtggtg gatcgagtgc ggccggtgag      840

aatttatatt ttcagggtgc ggccgcaggt ggcccgagcg ggaccatggg ctctggatcc      900

ggtggcttag taccgcgcgg tagcggttcg tcaggaggga cggcatctgg agggaaaatt      960

gaggagggca aagtggtcat ctggcacgca atgcagccta atgagctcga agtctttcaa     1020

agtctggcgg aagagtatat ggctcttcca gaagtggaaa ttgtgtttga acagaaaccc     1080

aatctggaag atgcgcttaa agccgcaatt ccaaccggtc agggtccgga tctcttcatt     1140

tgggctcatg actggattgg caaatttgcg gaagcaggac tgctggaacc gatcgatgaa     1200

tatgtgaccg aagatttact gaacgaattc gctccgatgg cgcaggatgc catgcaatac     1260

aaagggcact attatgcgct gccgttcgct gcagagacag tggccatcat ctataacaaa     1320

gaaatggtac accatcacta a                                               1341


<210>  13
<211>  7
<212>  PRT
<213>  artificial sequence

<220>
<223>  proteinase recongition site

<400>  13

Glu Asn Leu Tyr Phe Gln Gly 
1               5           


<210>  14
<211>  5
<212>  PRT
<213>  artificial sequence

<220>
<223>  Enterokinase recognition site

<400>  14

Asp Asp Asp Asp Lys 
1               5   


<210>  15
<211>  4
<212>  PRT
<213>  artificial sequence

<220>
<223>  Factor Xa recognition site

<400>  15

Ile Glu Gly Arg 
1               


<210>  16
<211>  6
<212>  PRT
<213>  artificial sequence

<220>
<223>  thrombin cleavage site

<400>  16

Leu Val Pro Arg Gly Ser 
1               5       


