                         SEQUENCE LISTING

<110>  Deutsches Krebsforschungszentrum Stiftung des oeffentlichen
        Rechts
 
<120>  VACCINATION AND ANTIBODY GENERATION PLATFORM

<130>  D31643WO

<150>  EP18202305.1
<151>  2018-10-24

<160>  17    

<170>  PatentIn version 3.5

<210>  1
<211>  492
<212>  PRT
<213>  Trypanosoma brucei

<400>  1

Met Ala Thr Gly Arg Ala Lys Asn Thr Lys Trp Ala Arg Trp Leu Ser 
1               5                   10                  15      


Thr Ala Gly Leu Ile Ile Val Val Thr Leu Pro Ala Thr Thr Met Ala 
            20                  25                  30          


Ala Glu Arg Thr Gly Leu Lys Ala Thr Ala Trp Lys Pro Leu Cys Lys 
        35                  40                  45              


Leu Thr Thr Glu Leu Ser Lys Val Ser Gly Glu Met Leu Asn Glu Gly 
    50                  55                  60                  


Gln Glu Val Ile Ser Asn Ile Gln Lys Ile Lys Ala Ala Glu Tyr Lys 
65                  70                  75                  80  


Val Ser Ile Tyr Leu Ala Lys Asn Pro Glu Thr Gln Ala Leu Gln Gln 
                85                  90                  95      


Leu Thr Leu Leu Arg Gly Tyr Phe Ala Arg Lys Thr Asn Gly Gly Leu 
            100                 105                 110         


Glu Ser Tyr Lys Thr Met Gly Leu Ala Thr Gln Ile Arg Ser Ala Arg 
        115                 120                 125             


Ala Ala Ala Tyr Leu Lys Gly Ser Ile Asp Glu Phe Leu Asn Leu Leu 
    130                 135                 140                 


Glu Ser Leu Lys Gly Gly Ser Glu Asn Lys Cys Leu Val Thr Thr Asn 
145                 150                 155                 160 


Ala Asp Thr Ala Ala Thr Arg Arg Glu Thr Lys Leu Asp Asp Gln Glu 
                165                 170                 175     


Cys Ala Leu Ser Met Pro Glu Thr Lys Pro Glu Ala Ala Thr Arg Thr 
            180                 185                 190         


Glu Leu Thr Gln Thr Gly Tyr Pro Asn Leu Gln His Gly Gly Gly Gly 
        195                 200                 205             


Thr Ala Asn Thr Phe Gln Pro Thr Thr Ser Thr Gly Thr Cys Lys Leu 
    210                 215                 220                 


Leu Ser Gly His Ser Thr Asn Gly Tyr Pro Thr Thr Ser Ala Leu Asp 
225                 230                 235                 240 


Thr Thr Ala Lys Val Leu Ala Gly Tyr Met Thr Ile Pro Asn Thr Gln 
                245                 250                 255     


Val Glu Ala Thr Leu Ala Asn Met Gln Ala Met Gly Asn Gly His Lys 
            260                 265                 270         


Ala Thr Ala Pro Ala Trp His Glu Ala Trp Glu Ala Arg Asn Arg Glu 
        275                 280                 285             


Ala Lys Ala Lys Asp Leu Ala Tyr Thr Asn Glu Thr Gly Asn Leu Asp 
    290                 295                 300                 


Thr Gln Pro Thr Leu Lys Ala Leu Val Lys Thr Leu Leu Leu Pro Lys 
305                 310                 315                 320 


Asp Asn Thr Glu His Asn Ala Glu Ala Thr Lys Leu Glu Ala Leu Phe 
                325                 330                 335     


Gly Gly Leu Ala Ala Asp Lys Thr Lys Thr Tyr Leu Asp Met Val Asp 
            340                 345                 350         


Ala Glu Ile Ile Pro Ala Gly Ile Ala Gly Arg Thr Thr Glu Ala Pro 
        355                 360                 365             


Leu Gly Lys Ile His Asp Thr Val Glu Leu Gly Asp Ile Leu Ser Asn 
    370                 375                 380                 


Tyr Glu Met Ile Ala Ala Gln Asn Val Val Thr Leu Lys Lys Asn Leu 
385                 390                 395                 400 


Asp Ala Val Ser Lys Lys Gln Gln Thr Glu Ser Ala Glu Asn Lys Glu 
                405                 410                 415     


Lys Ile Cys Asn Ala Ala Lys Asp Asn Gln Lys Ala Cys Glu Asn Leu 
            420                 425                 430         


Lys Glu Lys Gly Cys Val Phe Asn Thr Glu Ser Asn Lys Cys Glu Leu 
        435                 440                 445             


Lys Lys Asp Val Lys Glu Lys Leu Glu Lys Glu Ser Lys Glu Thr Glu 
    450                 455                 460                 


Gly Lys Asp Glu Lys Ala Asn Thr Thr Gly Ser Asn Ser Phe Leu Ile 
465                 470                 475                 480 


His Lys Ala Pro Leu Leu Leu Ala Phe Leu Leu Phe 
                485                 490         


<210>  2
<211>  476
<212>  PRT
<213>  Trypanosoma brucei

<400>  2

Met Pro Ser Asn Gln Glu Ala Arg Leu Phe Leu Ala Val Leu Val Leu 
1               5                   10                  15      


Ala Gln Val Leu Pro Ile Leu Val Asp Ser Ala Ala Glu Lys Gly Phe 
            20                  25                  30          


Lys Gln Ala Phe Trp Gln Pro Leu Cys Gln Val Ser Glu Glu Leu Asp 
        35                  40                  45              


Asp Gln Pro Lys Gly Ala Leu Phe Thr Leu Gln Ala Ala Ala Ser Lys 
    50                  55                  60                  


Ile Gln Lys Met Arg Asp Ala Ala Leu Arg Ala Ser Ile Tyr Ala Glu 
65                  70                  75                  80  


Ile Asn His Gly Thr Asn Arg Ala Lys Ala Ala Val Ile Val Ala Asn 
                85                  90                  95      


His Tyr Ala Met Lys Ala Asp Ser Gly Leu Glu Ala Leu Lys Gln Thr 
            100                 105                 110         


Leu Ser Ser Gln Glu Val Thr Ala Thr Ala Thr Ala Ser Tyr Leu Lys 
        115                 120                 125             


Gly Arg Ile Asp Glu Tyr Leu Asn Leu Leu Leu Gln Thr Lys Glu Ser 
    130                 135                 140                 


Gly Thr Ser Gly Cys Met Met Asp Thr Ser Gly Thr Asn Thr Val Thr 
145                 150                 155                 160 


Lys Ala Gly Gly Thr Ile Gly Gly Val Pro Cys Lys Leu Gln Leu Ser 
                165                 170                 175     


Pro Ile Gln Pro Lys Arg Pro Ala Ala Thr Tyr Leu Gly Lys Ala Gly 
            180                 185                 190         


Tyr Val Gly Leu Thr Arg Gln Ala Asp Ala Ala Asn Asn Phe His Asp 
        195                 200                 205             


Asn Asp Ala Glu Cys Arg Leu Ala Ser Gly His Asn Thr Asn Gly Leu 
    210                 215                 220                 


Gly Lys Ser Gly Gln Leu Ser Ala Ala Val Thr Met Ala Ala Gly Tyr 
225                 230                 235                 240 


Val Thr Val Ala Asn Ser Gln Thr Ala Val Thr Val Gln Ala Leu Asp 
                245                 250                 255     


Ala Leu Gln Glu Ala Ser Gly Ala Ala His Gln Pro Trp Ile Asp Ala 
            260                 265                 270         


Trp Lys Ala Lys Lys Ala Leu Thr Gly Ala Glu Thr Ala Glu Phe Arg 
        275                 280                 285             


Asn Glu Thr Ala Gly Ile Ala Gly Lys Thr Gly Val Thr Lys Leu Val 
    290                 295                 300                 


Glu Glu Ala Leu Leu Lys Lys Lys Asp Ser Glu Ala Ser Glu Ile Gln 
305                 310                 315                 320 


Thr Glu Leu Lys Lys Tyr Phe Ser Gly His Glu Asn Glu Gln Trp Thr 
                325                 330                 335     


Ala Ile Glu Lys Leu Ile Ser Glu Gln Pro Val Ala Gln Asn Leu Val 
            340                 345                 350         


Gly Asp Asn Gln Pro Thr Lys Leu Gly Glu Leu Glu Gly Asn Ala Lys 
        355                 360                 365             


Leu Thr Thr Ile Leu Ala Tyr Tyr Arg Met Glu Thr Ala Gly Lys Phe 
    370                 375                 380                 


Glu Val Leu Thr Gln Lys His Lys Pro Ala Glu Ser Gln Gln Gln Ala 
385                 390                 395                 400 


Ala Glu Thr Glu Gly Ser Cys Asn Lys Lys Asp Gln Asn Glu Cys Lys 
                405                 410                 415     


Ser Pro Cys Lys Trp His Asn Asp Ala Glu Asn Lys Lys Cys Thr Leu 
            420                 425                 430         


Asp Lys Glu Glu Ala Lys Lys Val Ala Asp Glu Thr Ala Lys Asp Gly 
        435                 440                 445             


Lys Thr Gly Asn Thr Asn Thr Thr Gly Ser Ser Asn Ser Phe Val Ile 
    450                 455                 460                 


Ser Lys Thr Pro Leu Trp Leu Ala Val Leu Leu Phe 
465                 470                 475     


<210>  3
<211>  509
<212>  PRT
<213>  Trypanosoma brucei

<400>  3

Met Gln Ala Ala Ala Leu Leu Leu Leu Val Leu Arg Ala Ile Thr Ser 
1               5                   10                  15      


Ile Glu Ala Ala Ala Asp Asp Val Asn Pro Asp Asp Asn Lys Glu Asp 
            20                  25                  30          


Phe Ala Val Leu Cys Ala Leu Ala Ala Leu Ala Asn Leu Gln Thr Thr 
        35                  40                  45              


Val Pro Ser Ile Asp Thr Ser Gly Leu Ala Ala Tyr Asp Asn Leu Gln 
    50                  55                  60                  


Gln Leu Asn Leu Ser Leu Ser Ser Lys Glu Trp Lys Ser Leu Phe Asn 
65                  70                  75                  80  


Lys Ala Ala Asp Ser Asn Gly Ser Pro Lys Gln Pro Pro Glu Gly Phe 
                85                  90                  95      


Gln Ser Asp Pro Thr Trp Arg Lys Gln Trp Pro Ile Trp Val Thr Ala 
            100                 105                 110         


Ala Ala Ala Leu Lys Ala Glu Asn Lys Glu Ala Ala Val Leu Ala Arg 
        115                 120                 125             


Ala Gly Leu Thr Asn Ala Pro Glu Glu Leu Arg Asn Arg Ala Arg Leu 
    130                 135                 140                 


Ala Leu Ile Pro Leu Leu Ala Gln Ala Glu Gln Ile Arg Asp Arg Leu 
145                 150                 155                 160 


Ser Glu Ile Gln Lys Gln Asn Glu Asp Thr Thr Pro Thr Ala Ile Ala 
                165                 170                 175     


Lys Ala Leu Asn Lys Ala Val Tyr Gly Gln Asp Lys Glu Thr Gly Ala 
            180                 185                 190         


Val Tyr Asn Ser Ala Asp Cys Phe Ser Gly Asn Val Ala Asp Ser Thr 
        195                 200                 205             


Gln Asn Ser Cys Lys Ala Gly Asn Gln Ala Ser Lys Ala Thr Thr Val 
    210                 215                 220                 


Ala Ala Thr Ile Val Cys Val Cys His Lys Lys Asn Gly Gly Asn Asp 
225                 230                 235                 240 


Ala Ala Asn Ala Cys Gly Arg Leu Ile Asn His Gln Ser Asp Ala Gly 
                245                 250                 255     


Ala Asn Leu Ala Thr Ala Ser Ser Asp Phe Gly Asp Ile Ile Ala Thr 
            260                 265                 270         


Cys Ala Ala Arg Pro Pro Lys Pro Leu Thr Ala Ala Tyr Leu Asp Ser 
        275                 280                 285             


Ala Leu Ala Ala Val Ser Ala Arg Ile Arg Phe Lys Asn Gly Asn Gly 
    290                 295                 300                 


Tyr Leu Gly Lys Phe Lys Ala Thr Gly Cys Thr Gly Ser Ala Ser Glu 
305                 310                 315                 320 


Gly Leu Cys Val Glu Tyr Thr Ala Leu Thr Ala Ala Thr Met Gln Asn 
                325                 330                 335     


Phe Tyr Lys Ile Pro Trp Val Lys Glu Ile Ser Asn Val Ala Glu Ala 
            340                 345                 350         


Leu Lys Arg Thr Glu Lys Asp Ala Ala Glu Ser Thr Leu Leu Ser Thr 
        355                 360                 365             


Trp Leu Lys Ala Ser Glu Asn Gln Gly Asn Ser Val Ala Gln Lys Leu 
    370                 375                 380                 


Ile Lys Val Gly Asp Ser Lys Ala Val Pro Pro Ala Gln Arg Gln Thr 
385                 390                 395                 400 


Gln Asn Lys Pro Gly Ser Asn Cys Asn Lys Asn Leu Lys Lys Ser Glu 
                405                 410                 415     


Cys Lys Asp Ser Asp Gly Cys Lys Trp Asn Arg Thr Glu Glu Thr Glu 
            420                 425                 430         


Gly Asp Phe Cys Lys Pro Lys Glu Thr Gly Thr Glu Asn Pro Ala Ala 
        435                 440                 445             


Gly Thr Gly Glu Gly Ala Ala Gly Ala Asn Thr Glu Thr Lys Lys Cys 
    450                 455                 460                 


Ser Asp Lys Lys Thr Glu Gly Asp Cys Lys Asp Gly Cys Lys Trp Asp 
465                 470                 475                 480 


Gly Lys Glu Cys Lys Asp Ser Ser Ile Leu Ala Thr Lys Lys Phe Ala 
                485                 490                 495     


Leu Thr Val Val Ser Ala Ala Phe Val Ala Leu Leu Phe 
            500                 505                 


<210>  4
<211>  499
<212>  PRT
<213>  Trypanosoma brucei

<400>  4

Met Gln Arg Leu Gly Thr Ala Val Phe Phe Leu Leu Ala Phe Arg Tyr 
1               5                   10                  15      


Ser Thr Glu Gln Ala Val Gly Leu Lys Glu Pro Asn Ala Pro Cys Thr 
            20                  25                  30          


Thr Ala Cys Gly Cys Lys Ser Arg Leu Leu Lys Arg Leu Asp Leu Tyr 
        35                  40                  45              


Thr Ser Lys Tyr Ala Asp Gly Ile Asn Asn Glu Arg Glu Asn Ser Glu 
    50                  55                  60                  


Ala Tyr Ser Lys Leu Val Thr Ala Ala Leu Ala Ala Val Pro Thr Met 
65                  70                  75                  80  


Gln Arg Lys Ile Leu Pro Leu Leu Gly Ala Ala Ala Asp Ile Leu Asp 
                85                  90                  95      


Ile Cys Arg Arg Glu Leu Ala Thr Ala Arg Pro Leu Val Gln Ala Ala 
            100                 105                 110         


Ile Ser Lys Ile Glu Glu Ala Ala Gly Val Tyr Asn Thr Leu His Lys 
        115                 120                 125             


Leu Glu Arg Gly Leu Gly Glu Ala Lys Ile Glu Phe Gly Gly Thr Asp 
    130                 135                 140                 


Leu Arg Leu Thr Lys Thr Lys Phe Arg Ala Thr Ser Leu Gly Thr Ile 
145                 150                 155                 160 


His Thr Ala Asp Cys Pro Asn Ala Asp Pro Gly Glu Thr Asn Val Lys 
                165                 170                 175     


Ile Gly Leu Glu His Glu Glu Asn Glu Pro Glu Pro Ala Lys Leu Ile 
            180                 185                 190         


Thr His Gly His Leu Asp Ala Thr Cys Ala Ser Gly Val Gly Gln Ser 
        195                 200                 205             


Ser Ser Cys His Thr Thr Ala Val Glu Ala Asn Thr His Leu Thr Leu 
    210                 215                 220                 


Gly Leu Thr Phe Ser Gly Ser Ser Lys Asp Glu Ser Ala Thr Trp Asn 
225                 230                 235                 240 


Ala Ala Thr Asn Asn Lys Arg Ala Ile His Ser Asn Asp Ala Asp Phe 
                245                 250                 255     


Leu Gly Ser Asn Ala Thr Val Ala His Glu Ala Leu Lys Ala Ile Arg 
            260                 265                 270         


Ser Ala Gly Ala Ser Thr Pro Cys Ser Ser Leu Ile Thr Asp Phe Asn 
        275                 280                 285             


Ala Val Arg Ala Asn Pro Lys Phe Lys Leu Met Val Ile Lys Ala Leu 
    290                 295                 300                 


Leu Asn Lys Pro Thr Ala Glu Lys Glu Ser Asp Ala Pro Ala Asp Glu 
305                 310                 315                 320 


Val Asn Asn Ala Ile Asn Ser Ala Tyr Gly Arg Glu Gly Ser Glu Tyr 
                325                 330                 335     


Asn Thr Lys Thr Trp Lys Asp Ile Gly Ser Thr Arg Ile Pro Lys Ala 
            340                 345                 350         


Asp Pro Pro Gly Glu Lys Thr Asp Thr Ile Asp Lys Leu Ser Ser Leu 
        355                 360                 365             


Pro Gln Trp Gly Asp Ala Ile Ala Arg Leu Leu Leu Gln Glu Ile Thr 
    370                 375                 380                 


Lys Gln Glu Glu Gln Ser Ile Lys Thr Ser Ser Asp Glu Ala Thr Asn 
385                 390                 395                 400 


Lys Glu Cys Asp Lys His Thr Ala Lys Thr Glu Gly Glu Cys Thr Lys 
                405                 410                 415     


Leu Gly Cys Asp Tyr Asp Ala Glu Asn Lys Lys Cys Lys Pro Lys Ser 
            420                 425                 430         


Glu Lys Glu Thr Thr Ala Ala Gly Lys Lys Asp Arg Ala Ala Gly Glu 
        435                 440                 445             


Thr Gly Cys Ala Lys His Gly Thr Asp Lys Asp Lys Cys Glu Asn Asp 
    450                 455                 460                 


Lys Ser Cys Lys Trp Glu Asn Asn Ala Cys Lys Asp Ser Ser Ile Leu 
465                 470                 475                 480 


Ala Thr Lys Lys Phe Ala Leu Ser Met Val Ser Ala Ala Phe Val Thr 
                485                 490                 495     


Leu Leu Phe 
            


<210>  5
<211>  514
<212>  PRT
<213>  Trypanosoma brucei

<400>  5

Met Val Tyr Arg Asn Ile Leu Gln Leu Ser Val Leu Lys Val Leu Leu 
1               5                   10                  15      


Ile Val Leu Ile Val Glu Ala Thr His Phe Gly Val Lys Tyr Glu Leu 
            20                  25                  30          


Trp Gln Pro Glu Cys Glu Leu Thr Ala Glu Leu Arg Lys Thr Ala Gly 
        35                  40                  45              


Val Ala Lys Met Lys Val Asn Ser Asp Leu Asn Ser Phe Lys Thr Leu 
    50                  55                  60                  


Glu Leu Thr Lys Met Lys Leu Leu Thr Phe Ala Ala Lys Phe Pro Glu 
65                  70                  75                  80  


Ser Lys Glu Ala Leu Thr Leu Arg Ala Leu Glu Ala Ala Leu Asn Thr 
                85                  90                  95      


Asp Leu Arg Ala Leu Arg Asp Asn Ile Ala Asn Gly Ile Asp Arg Ala 
            100                 105                 110         


Val Arg Ala Thr Ala Tyr Ala Ser Glu Ala Ala Gly Ala Leu Phe Ser 
        115                 120                 125             


Gly Ile Gln Thr Leu His Asp Ala Thr Asp Gly Thr Thr Tyr Cys Leu 
    130                 135                 140                 


Ser Ala Ser Gly Gln Gly Ser Asn Gly Asn Ala Ala Met Ala Ser Gln 
145                 150                 155                 160 


Gly Cys Lys Pro Leu Ala Leu Pro Glu Leu Leu Thr Glu Asp Ser Tyr 
                165                 170                 175     


Asn Thr Asp Val Ile Ser Asp Lys Gly Phe Pro Lys Ile Ser Pro Leu 
            180                 185                 190         


Thr Asn Ala Gln Gly Gln Gly Lys Ser Gly Glu Cys Gly Leu Phe Gln 
        195                 200                 205             


Ala Ala Ser Gly Ala Gln Ala Thr Asn Thr Gly Val Gln Phe Ser Gly 
    210                 215                 220                 


Gly Ser Arg Ile Asn Leu Gly Leu Gly Ala Ile Val Ala Ser Ala Ala 
225                 230                 235                 240 


Gln Gln Pro Thr Arg Pro Asp Leu Ser Asp Phe Ser Gly Thr Ala Arg 
                245                 250                 255     


Asn Gln Ala Asp Thr Leu Tyr Gly Lys Ala His Ala Ser Ile Thr Glu 
            260                 265                 270         


Leu Leu Gln Leu Ala Gln Gly Pro Lys Pro Gly Gln Thr Glu Val Glu 
        275                 280                 285             


Thr Met Lys Leu Leu Ala Gln Lys Thr Ala Ala Leu Asp Ser Ile Lys 
    290                 295                 300                 


Phe Gln Leu Ala Ala Ser Thr Gly Lys Lys Thr Ser Asp Tyr Lys Glu 
305                 310                 315                 320 


Asp Glu Asn Leu Lys Thr Glu Tyr Phe Gly Lys Thr Glu Ser Asn Ile 
                325                 330                 335     


Glu Ala Leu Trp Asn Lys Val Lys Glu Glu Lys Val Lys Gly Ala Asp 
            340                 345                 350         


Pro Glu Asp Pro Ser Lys Glu Ser Lys Ile Ser Asp Leu Asn Thr Glu 
        355                 360                 365             


Glu Gln Leu Gln Arg Val Leu Asp Tyr Tyr Ala Val Ala Thr Met Leu 
    370                 375                 380                 


Lys Leu Ala Lys Gln Ala Glu Asp Ile Ala Lys Leu Glu Thr Glu Ile 
385                 390                 395                 400 


Ala Asp Gln Arg Gly Lys Ser Pro Glu Ala Glu Cys Asn Lys Ile Thr 
                405                 410                 415     


Glu Glu Pro Lys Cys Ser Glu Glu Lys Ile Cys Ser Trp His Lys Glu 
            420                 425                 430         


Val Lys Ala Gly Glu Lys Asn Cys Gln Phe Asn Ser Thr Lys Ala Ser 
        435                 440                 445             


Lys Ser Gly Val Pro Val Thr Gln Thr Gln Thr Ala Gly Ala Asp Thr 
    450                 455                 460                 


Thr Ala Glu Lys Cys Lys Gly Lys Gly Glu Lys Asp Cys Lys Ser Pro 
465                 470                 475                 480 


Asp Cys Lys Trp Glu Gly Gly Thr Cys Lys Asp Ser Ser Ile Leu Ala 
                485                 490                 495     


Asn Lys Gln Phe Ala Leu Ser Val Ala Ser Ala Ala Phe Val Ala Leu 
            500                 505                 510         


Leu Phe 
        


<210>  6
<211>  7204
<212>  DNA
<213>  artificial

<220>
<223>  pHH-VSG3-G4S-Hyg plasmid DNA sequence

<400>  6
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca       60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg      120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc      180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc      240

attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat      300

tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt      360

tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt cgagctcggt acccggggat      420

ccgatatcga aggcagcgga aagtgtgcca atgcatttta aatgacagat ttattttatt      480

aaaaatgaga acacttagaa tatttcatgt cgattcaagc tagcagtaat atgccagtaa      540

atagtttcca tttacaagca caatttcatc ctccttacgt gattctgtag catttagttc      600

aagaaaaagc aataaaagaa ttcggagacg aagagccggt tagattgcat ttggacaccg      660

ctatacatat gttaagacac acaagcattc tatacgtaaa agatctagta tataggagca      720

acgctctgcc aaaacataat ggcaagacaa acggccgtgt ttgccgctga tgctacagaa      780

ccagcttaat ttccagaaga cgaaaatttg catgttttcc cacaatattt taattactct      840

tgaagattgt agttattcct acgcgacacg tacgcggcat gcaagcggca gcactgcttt      900

tattagtttt gcgcgcaata accagcatcg aagctgcagc cggtggcggt ggctcaggtg      960

gtggcggttc aggcggtggt ggctcagatg acgtcaatcc agatgacaac aaggaagact     1020

ttgcagtctt gtgcgcacta gctgcgctgg ccaacctcca gaccacggtg ccctcaatag     1080

acacgtcagg acttgcagcc tacgacaact tgcaacagct caacctaagc ctaagcagca     1140

aagaatggaa aagcctgttc aacaaagcgg ctgactcaaa cggatctccc aagcagccgc     1200

cggaaggatt tcaatcggac cctacttggc ggaagcagtg gcctatatgg gtaacagcag     1260

cagcagcatt aaaggccgaa aacaaagagg cagctgtcct agcgagggcg ggactaacaa     1320

acgcgccaga ggaactcaga aacagggccc ggctggcgct aataccctta ttagcccaag     1380

ccgagcaaat ccgggaccgg ctcagtgaaa tacaaaaaca aaacgaagac acgacaccaa     1440

cggcaatagc gaaggcactt aataaagccg tctacggcca ggacaaagaa acgggcgcgg     1500

tgtacaattc agcggattgc ttcagcggta acgttgcaga ctcaacccaa aactcctgca     1560

aagccgggaa ccaagcctcc aaagcgacga cagtagccgc aacgatagtt tgtgtttgcc     1620

acaaaaaaaa cggcggcaac gacgccgcaa acgcctgcgg tagactgatt aatcaccaat     1680

ccgacgctgg tgccaaccta gccaccgcca gctcagactt cggcgacata attgctacat     1740

gcgcagctcg cccgccaaaa ccattgaccg ctgcctatct agacagcgca ctagccgcgg     1800

tgagcgcgag gataaggttc aaaaacggca acggttacct gggcaaattc aaagcgacag     1860

gctgcacagg cgccgcaagt gaaggcttat gtgtcgaata cactgcccta acagcggcaa     1920

cgatgcaaaa tttttacaaa atcccgtggg taaaggagat ctcaaacgta gcggaagccc     1980

taaagaggac agaaaaagac gcagcagaat caacactgtt aagcacttgg cttaaagcca     2040

gcgaaaacca aggaaatagc gtcgctcaga agcttataaa ggtaggagac agcaaagcgg     2100

taccaccggc acagcgacag acacaaaata agccaggatc aaactgcaat aagaacctta     2160

aaaaaagcga atgcaaagac agtgatggtt gcaaatggaa caggactgag gagaccgaag     2220

gtgatttctg caaacctaaa gagacaggaa cagaaaaccc agcagcagga acaggagagg     2280

gagctgcagg agcaaatacg gaaaccaaaa agtgctcaga taagaaaact gaaggcgact     2340

gcaaagatgg atgcaaatgg gatggaaaag aatgcaaaga ttcctctatt ctagcaacca     2400

agaaattcgc cctcaccgtg gtttctgctg catttgtggc cttgcttttt taatttcccc     2460

cctcaaattt cccccctcct tttaaaattt tccttgctac ttgaaaactt tttgatatat     2520

tttaacacca aaaccagccg agattttgtg ttctgtgttt tgtaagttga ctgtctgatt     2580

gtctagaaat attttctggc aactaaaatt tttttctttt ttcctgtttt ttttgtaggt     2640

aggtaggaat gggggggggg gggtagttag gtaggttagt taggttagtt agggggttag     2700

ttaggggggt taggcttagg attaggcaca gcaaggtctt ctgaaattca tgtttttttt     2760

ttttttactc tgcattgcag tctccgctct tatttagttt tgctttacgt aaggtctcgt     2820

tgctgccata aaataagcta ctagtagctt accatgaaaa agcctgaact caccgcgacg     2880

tctgtcgaga agtttctgat cgaaaagttc gacagcgtct ccgacctgat gcagctctcg     2940

gagggcgaag aatctcgtgc tttcagcttc gatgtaggag ggcgtggata tgtcctgcgg     3000

gtaaatagct gcgccgatgg tttctacaaa gatcgttatg tttatcggca ctttgcatcg     3060

gccgcgctcc cgattccgga agtgcttgac attggggaat tcagcgagag cctgacctat     3120

tgcatctccc gccgtgcaca gggtgtcacg ttgcaagacc tgcctgaaac cgaactgccc     3180

gctgttctgc agccggtcgc ggaggccatg gatgcgatcg ctgcggccga tcttagccag     3240

acgagcgggt tcggcccatt cggaccgcaa ggaatcggtc aatacactac atggcgtgat     3300

ttcatatgcg cgattgctga tccccatgtg tatcactggc aaactgtgat ggacgacacc     3360

gtcagtgcgt ccgtcgcgca ggctctcgat gagctgatgc tttgggccga ggactgcccc     3420

gaagtccggc acctcgtgca cgcggatttc ggctccaaca atgtcctgac ggacaatggc     3480

cgcataacag cggtcattga ctggagcgag gcgatgttcg gggattccca atacgaggtc     3540

gccaacatct tcttctggag gccgtggttg gcttgtatgg agcagcagac gcgctacttc     3600

gagcggaggc atccggagct tgcaggatcg ccgcggctcc gggcgtatat gctccgcatt     3660

ggtcttgacc aactctatca gagcttggtt gacggcaatt tcgatgatgc agcttgggcg     3720

cagggtcgat gcgacgcaat cgtccgatcc ggagccggga ctgtcgggcg tacacaaatc     3780

gcccgcagaa gcgcggccgt ctggaccgat ggctgtgtag aagtactcgc cgatagtgga     3840

aaccgacgcc ccagcactcg tccgagggca aaggaatagg gatcgatcct gcccatttag     3900

ttagttggct tttcccttgt ctcgtgtctt ttccgtggaa aggttcccgg agtaatctga     3960

tggcacagca gggaggtgcg cctgcaggtt ggttaggaag gggggatgat gtaaaagaag     4020

aaaatggggg gatattagac ttaggcttag gattaggatt aggattagga ttagggttaa     4080

ttttttcctc ttttttttta actcacacct ctatcctgga tttttaattt ttttttttag     4140

ccattcgcgg ctcctttttt tttttttgcg ccaatgttta attttttatt gtgttttcaa     4200

tttttttgtc aaccatgcag cggctgtttt gttatgcgga ccctaaccct cctccccccc     4260

ccccgcccgc gcacctccat ttttaaaaat ttttttaccg cgtccttcaa ccagaatttt     4320

tttaaatttt ttaatttttt ttattttccg tggttttgaa tcttaatttt tcgacggcat     4380

gcccgctact cttttttggc tttttgtttt ttcgtttttt tttgacgacg ccttttttta     4440

aatttctttt cctcgatttt tttcgttcat tttttttggt ttagtattca ttttttgaac     4500

tttagttttg catttaaatt tttaacgggt ttttgcttac attttttttt tacatcctct     4560

ttttcttttt gctttttagt tttcgacatt tttcagattt ttttcttttt tgaatttttt     4620

ttttgttaca accaggcatc gttttttttg gcggcgcccc tttttggtaa caccggcggc     4680

cacggtgttt cggattaagg ccgcgggaat tcgattaggg ttagggttag ggttagggtt     4740

agggttaggg ttagggttag ggttagggtt agggttaggg ttagggttag ggttagggtt     4800

agggttaggg ttagggttag ggttagggtt agggttaggg ttagggttag ggttagggtt     4860

agggttaggg ttagggttag ggttagggtt agggttaggg ttagggttag ggttaatcac     4920

tagctagtgg atccgatatc tctagagtcg acctgcaggc atgcaagctt ggcgtaatca     4980

tggtcatagc tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga     5040

gccggaagca taaagtgtaa agcctggggt gcctaatgag tgagctaact cacattaatt     5100

gcgttgcgct cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga     5160

atcggccaac gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc     5220

actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg     5280

gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc     5340

cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc     5400

ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga     5460

ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc     5520

ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat     5580

agctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg     5640

cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc     5700

aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga     5760

gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact     5820

agaagaacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt     5880

ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag     5940

cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg     6000

tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa     6060

aggatcttca cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata     6120

tatgagtaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg     6180

atctgtctat ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata     6240

cgggagggct taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg     6300

gctccagatt tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct     6360

gcaactttat ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt     6420

tcgccagtta atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc     6480

tcgtcgtttg gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga     6540

tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt     6600

aagttggccg cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc     6660

atgccatccg taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa     6720

tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca     6780

catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca     6840

aggatcttac cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct     6900

tcagcatctt ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc     6960

gcaaaaaagg gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa     7020

tattattgaa gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt     7080

tagaaaaata aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc     7140

taagaaacca ttattatcat gacattaacc tataaaaata ggcgtatcac gaggcccttt     7200

cgtc                                                                  7204


<210>  7
<211>  7219
<212>  DNA
<213>  Artificial

<220>
<223>  pHH-ILTat1.24-G4S-Hyg plasmid DNA sequence

<400>  7
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca       60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg      120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc      180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc      240

attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat      300

tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt      360

tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt cgagctcggt acccggggat      420

ccgatatcga aggcagcgga aagtgtgcca atgcatttta aatgacagat ttattttatt      480

aaaaatgaga acacttagaa tatttcatgt cgattcaagc tagcagtaat atgccagtaa      540

atagtttcca tttacaagca caatttcatc ctccttacgt gattctgtag catttagttc      600

aagaaaaagc aataaaagaa ttcggagacg aagagccggt tagattgcat ttggacaccg      660

ctatacatat gttaagacac acaagcattc tatacgtaaa agatctagta tataggagca      720

acgctctgcc aaaacataat ggcaagacaa acggccgtgt ttgccgctga tgctacagaa      780

ccagcttaat ttccagaaga cgaaaatttg catgttttcc cacaatattt taattactct      840

tgaagattgt agttattcct acgcgacacg tacgcggcat ggtatacaga aatatactgc      900

aattaagcgt cctaaaagta ctacttatag tgcttatagt ggaagcaggt ggcggtggct      960

caggtggtgg cggttcaggc ggtggtggct caacgcactt cggtgtaaaa tacgagctct     1020

ggcagccaga atgcgaactg acagcggaat tgcggaaaac tgcaggggtg gcaaaaatga     1080

aagttaatag cgatttgaac tcctttaaaa cgcttgaact cacaaagatg aaattgctaa     1140

ccttcgccgc aaaatttccc gaaagcaaag aggcactaac gttacgcgct ctagaagcgg     1200

cactaaacac tgatctacga gcactacgag ataatatagc aaatggcatc gacagggctg     1260

tccgggcaac agcgtacgca tcagaggcgg caggcgcttt attttctggc atacagacgc     1320

tccatgacgc caccgacggc acgacctatt gccttagcgc aagcgggcaa ggatccaacg     1380

gcaacgctgc aatggcatca cagggctgca aaccactagc gttaccagaa cttctaacag     1440

aagactcata caacaccgac gttatatcgg acaaagggtt cccgaagatt tcgccactaa     1500

caaatgccca aggacagggc aaaagcggcg aatgcggcct ttttcaagcc gcaagcggcg     1560

ctcaggcgac aaacacaggt gtgcagttct cagggggcag caggataaac ttaggccttg     1620

gcgccatagt agcaagcgca gcccagcagc cgacacgccc ggacctaagt gatttttccg     1680

gcacagcacg aaaccaagca gatacgctct acggcaaagc acatgcttcc atcacagagt     1740

tactgcagct cgcacagggg ccgaaaccag gacagaccga agtagaaaca atgaagcttc     1800

tagcacaaaa gacagcggca ttggacagca tcaagttcca actagcagca agcacaggaa     1860

agaaaacatc agactacaaa gaagacgaaa acttgaaaac ggaatacttt ggaaagacag     1920

aaagcaatat agaagcactt tggaacaaag taaaggaaga gaaagtgaaa ggagccgacc     1980

cggaggaccc aagcaaggag tccaagatta gtgacctcaa caccgaagag cagcttcaga     2040

gagttttaga ttactacgca gtggctacaa tgttaaagtt agctaaacaa gcggaggata     2100

ttgcaaaact cgaaactgaa atagcggatc aaagaggcaa atccccagaa gccgaatgca     2160

ataaaataac cgaggaaccc aaatgcagcg aggaaaagat ttgcagttgg cataaggagg     2220

ttaaagcggg agaaaagaac tgccaattta actcaacaaa agcctcaaaa agtggtgtgc     2280

ctgtaacaca aactcaaact gcaggagccg acacgacagc agaaaagtgc aaaggcaaag     2340

gagagaaaga ttgcaaatct ccggattgca aatgggaggg cggaacttgc aaagattcct     2400

ctattctagc aaacaaacaa tttgccctca gcgtggcttc tgccgcattt gtggccttgc     2460

ttttctaatt tcccccctca aatttccccc ctccttttaa aattttcctt gctacttgaa     2520

aactttttga tatattttaa caccaaaacc agccgagatt ttgtgttctg tgttttgtaa     2580

gttgactgtc tgattgtcta gaaatatttt ctggcaacta aaattttttt cttttttcct     2640

gttttttttg taggtaggta ggaatggggg ggggggggta gttaggtagg ttagttaggt     2700

tagttagggg gttagttagg ggggttaggc ttaggattag gcacagcaag gtcttctgaa     2760

attcatgttt tttttttttt tactctgcat tgcagtctcc gctcttattt agttttgctt     2820

tacgtaaggt ctcgttgctg ccataaaata agctactagt agcttaccat gaaaaagcct     2880

gaactcaccg cgacgtctgt cgagaagttt ctgatcgaaa agttcgacag cgtctccgac     2940

ctgatgcagc tctcggaggg cgaagaatct cgtgctttca gcttcgatgt aggagggcgt     3000

ggatatgtcc tgcgggtaaa tagctgcgcc gatggtttct acaaagatcg ttatgtttat     3060

cggcactttg catcggccgc gctcccgatt ccggaagtgc ttgacattgg ggaattcagc     3120

gagagcctga cctattgcat ctcccgccgt gcacagggtg tcacgttgca agacctgcct     3180

gaaaccgaac tgcccgctgt tctgcagccg gtcgcggagg ccatggatgc gatcgctgcg     3240

gccgatctta gccagacgag cgggttcggc ccattcggac cgcaaggaat cggtcaatac     3300

actacatggc gtgatttcat atgcgcgatt gctgatcccc atgtgtatca ctggcaaact     3360

gtgatggacg acaccgtcag tgcgtccgtc gcgcaggctc tcgatgagct gatgctttgg     3420

gccgaggact gccccgaagt ccggcacctc gtgcacgcgg atttcggctc caacaatgtc     3480

ctgacggaca atggccgcat aacagcggtc attgactgga gcgaggcgat gttcggggat     3540

tcccaatacg aggtcgccaa catcttcttc tggaggccgt ggttggcttg tatggagcag     3600

cagacgcgct acttcgagcg gaggcatccg gagcttgcag gatcgccgcg gctccgggcg     3660

tatatgctcc gcattggtct tgaccaactc tatcagagct tggttgacgg caatttcgat     3720

gatgcagctt gggcgcaggg tcgatgcgac gcaatcgtcc gatccggagc cgggactgtc     3780

gggcgtacac aaatcgcccg cagaagcgcg gccgtctgga ccgatggctg tgtagaagta     3840

ctcgccgata gtggaaaccg acgccccagc actcgtccga gggcaaagga atagggatcg     3900

atcctgccca tttagttagt tggcttttcc cttgtctcgt gtcttttccg tggaaaggtt     3960

cccggagtaa tctgatggca cagcagggag gtgcgcctgc aggttggtta ggaagggggg     4020

atgatgtaaa agaagaaaat ggggggatat tagacttagg cttaggatta ggattaggat     4080

taggattagg gttaattttt tcctcttttt ttttaactca cacctctatc ctggattttt     4140

aatttttttt tttagccatt cgcggctcct tttttttttt ttgcgccaat gtttaatttt     4200

ttattgtgtt ttcaattttt ttgtcaacca tgcagcggct gttttgttat gcggacccta     4260

accctcctcc cccccccccg cccgcgcacc tccattttta aaaatttttt taccgcgtcc     4320

ttcaaccaga atttttttaa attttttaat tttttttatt ttccgtggtt ttgaatctta     4380

atttttcgac ggcatgcccg ctactctttt ttggcttttt gttttttcgt ttttttttga     4440

cgacgccttt ttttaaattt cttttcctcg atttttttcg ttcatttttt ttggtttagt     4500

attcattttt tgaactttag ttttgcattt aaatttttaa cgggtttttg cttacatttt     4560

ttttttacat cctctttttc tttttgcttt ttagttttcg acatttttca gatttttttc     4620

ttttttgaat tttttttttg ttacaaccag gcatcgtttt ttttggcggc gccccttttt     4680

ggtaacaccg gcggccacgg tgtttcggat taaggccgcg ggaattcgat tagggttagg     4740

gttagggtta gggttagggt tagggttagg gttagggtta gggttagggt tagggttagg     4800

gttagggtta gggttagggt tagggttagg gttagggtta gggttagggt tagggttagg     4860

gttagggtta gggttagggt tagggttagg gttagggtta gggttagggt tagggttagg     4920

gttagggtta atcactagct agtggatccg atatctctag agtcgacctg caggcatgca     4980

agcttggcgt aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt     5040

ccacacaaca tacgagccgg aagcataaag tgtaaagcct ggggtgccta atgagtgagc     5100

taactcacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc     5160

cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct     5220

tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca     5280

gctcactcaa aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac     5340

atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt     5400

ttccataggc tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg     5460

cgaaacccga caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc     5520

tctcctgttc cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc     5580

gtggcgcttt ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc     5640

aagctgggct gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac     5700

tatcgtcttg agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt     5760

aacaggatta gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct     5820

aactacggct acactagaag aacagtattt ggtatctgcg ctctgctgaa gccagttacc     5880

ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt     5940

ttttttgttt gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg     6000

atcttttcta cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc     6060

atgagattat caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa     6120

tcaatctaaa gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag     6180

gcacctatct cagcgatctg tctatttcgt tcatccatag ttgcctgact ccccgtcgtg     6240

tagataacta cgatacggga gggcttacca tctggcccca gtgctgcaat gataccgcga     6300

gacccacgct caccggctcc agatttatca gcaataaacc agccagccgg aagggccgag     6360

cgcagaagtg gtcctgcaac tttatccgcc tccatccagt ctattaattg ttgccgggaa     6420

gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat tgctacaggc     6480

atcgtggtgt cacgctcgtc gtttggtatg gcttcattca gctccggttc ccaacgatca     6540

aggcgagtta catgatcccc catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg     6600

atcgttgtca gaagtaagtt ggccgcagtg ttatcactca tggttatggc agcactgcat     6660

aattctctta ctgtcatgcc atccgtaaga tgcttttctg tgactggtga gtactcaacc     6720

aagtcattct gagaatagtg tatgcggcga ccgagttgct cttgcccggc gtcaatacgg     6780

gataataccg cgccacatag cagaacttta aaagtgctca tcattggaaa acgttcttcg     6840

gggcgaaaac tctcaaggat cttaccgctg ttgagatcca gttcgatgta acccactcgt     6900

gcacccaact gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca     6960

ggaaggcaaa atgccgcaaa aaagggaata agggcgacac ggaaatgttg aatactcata     7020

ctcttccttt ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac     7080

atatttgaat gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa     7140

gtgccacctg acgtctaaga aaccattatt atcatgacat taacctataa aaataggcgt     7200

atcacgaggc cctttcgtc                                                  7219


<210>  8
<211>  15
<212>  PRT
<213>  Artificial

<220>
<223>  4GS linker

<400>  8

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
1               5                   10                  15  


<210>  9
<211>  524
<212>  PRT
<213>  Artificial

<220>
<223>  VSG3.G4S (S317A) Protein

<400>  9

Met Gln Ala Ala Ala Leu Leu Leu Leu Val Leu Arg Ala Ile Thr Ser 
1               5                   10                  15      


Ile Glu Ala Ala Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 
            20                  25                  30          


Gly Gly Gly Ser Asp Asp Val Asn Pro Asp Asp Asn Lys Glu Asp Phe 
        35                  40                  45              


Ala Val Leu Cys Ala Leu Ala Ala Leu Ala Asn Leu Gln Thr Thr Val 
    50                  55                  60                  


Pro Ser Ile Asp Thr Ser Gly Leu Ala Ala Tyr Asp Asn Leu Gln Gln 
65                  70                  75                  80  


Leu Asn Leu Ser Leu Ser Ser Lys Glu Trp Lys Ser Leu Phe Asn Lys 
                85                  90                  95      


Ala Ala Asp Ser Asn Gly Ser Pro Lys Gln Pro Pro Glu Gly Phe Gln 
            100                 105                 110         


Ser Asp Pro Thr Trp Arg Lys Gln Trp Pro Ile Trp Val Thr Ala Ala 
        115                 120                 125             


Ala Ala Leu Lys Ala Glu Asn Lys Glu Ala Ala Val Leu Ala Arg Ala 
    130                 135                 140                 


Gly Leu Thr Asn Ala Pro Glu Glu Leu Arg Asn Arg Ala Arg Leu Ala 
145                 150                 155                 160 


Leu Ile Pro Leu Leu Ala Gln Ala Glu Gln Ile Arg Asp Arg Leu Ser 
                165                 170                 175     


Glu Ile Gln Lys Gln Asn Glu Asp Thr Thr Pro Thr Ala Ile Ala Lys 
            180                 185                 190         


Ala Leu Asn Lys Ala Val Tyr Gly Gln Asp Lys Glu Thr Gly Ala Val 
        195                 200                 205             


Tyr Asn Ser Ala Asp Cys Phe Ser Gly Asn Val Ala Asp Ser Thr Gln 
    210                 215                 220                 


Asn Ser Cys Lys Ala Gly Asn Gln Ala Ser Lys Ala Thr Thr Val Ala 
225                 230                 235                 240 


Ala Thr Ile Val Cys Val Cys His Lys Lys Asn Gly Gly Asn Asp Ala 
                245                 250                 255     


Ala Asn Ala Cys Gly Arg Leu Ile Asn His Gln Ser Asp Ala Gly Ala 
            260                 265                 270         


Asn Leu Ala Thr Ala Ser Ser Asp Phe Gly Asp Ile Ile Ala Thr Cys 
        275                 280                 285             


Ala Ala Arg Pro Pro Lys Pro Leu Thr Ala Ala Tyr Leu Asp Ser Ala 
    290                 295                 300                 


Leu Ala Ala Val Ser Ala Arg Ile Arg Phe Lys Asn Gly Asn Gly Tyr 
305                 310                 315                 320 


Leu Gly Lys Phe Lys Ala Thr Gly Cys Thr Gly Ala Ala Ser Glu Gly 
                325                 330                 335     


Leu Cys Val Glu Tyr Thr Ala Leu Thr Ala Ala Thr Met Gln Asn Phe 
            340                 345                 350         


Tyr Lys Ile Pro Trp Val Lys Glu Ile Ser Asn Val Ala Glu Ala Leu 
        355                 360                 365             


Lys Arg Thr Glu Lys Asp Ala Ala Glu Ser Thr Leu Leu Ser Thr Trp 
    370                 375                 380                 


Leu Lys Ala Ser Glu Asn Gln Gly Asn Ser Val Ala Gln Lys Leu Ile 
385                 390                 395                 400 


Lys Val Gly Asp Ser Lys Ala Val Pro Pro Ala Gln Arg Gln Thr Gln 
                405                 410                 415     


Asn Lys Pro Gly Ser Asn Cys Asn Lys Asn Leu Lys Lys Ser Glu Cys 
            420                 425                 430         


Lys Asp Ser Asp Gly Cys Lys Trp Asn Arg Thr Glu Glu Thr Glu Gly 
        435                 440                 445             


Asp Phe Cys Lys Pro Lys Glu Thr Gly Thr Glu Asn Pro Ala Ala Gly 
    450                 455                 460                 


Thr Gly Glu Gly Ala Ala Gly Ala Asn Thr Glu Thr Lys Lys Cys Ser 
465                 470                 475                 480 


Asp Lys Lys Thr Glu Gly Asp Cys Lys Asp Gly Cys Lys Trp Asp Gly 
                485                 490                 495     


Lys Glu Cys Lys Asp Ser Ser Ile Leu Ala Thr Lys Lys Phe Ala Leu 
            500                 505                 510         


Thr Val Val Ser Ala Ala Phe Val Ala Leu Leu Phe 
        515                 520                 


<210>  10
<211>  488
<212>  PRT
<213>  Artificial

<220>
<223>  VSG2-1DK Protein

<400>  10

Met Pro Ser Asn Gln Glu Ala Arg Leu Phe Leu Ala Val Leu Val Leu 
1               5                   10                  15      


Ala Gln Val Leu Pro Ile Leu Val Asp Ser Ala Ala Gly Gly Gly Glu 
            20                  25                  30          


Asn Leu Tyr Phe Gln Gly Gly Gly Gly Gly Gly Phe Lys Gln Ala Phe 
        35                  40                  45              


Trp Gln Pro Leu Cys Gln Val Ser Glu Glu Leu Asp Asp Gln Pro Lys 
    50                  55                  60                  


Gly Ala Leu Phe Thr Leu Gln Ala Ala Ala Ser Lys Ile Gln Lys Met 
65                  70                  75                  80  


Arg Asp Ala Ala Leu Arg Ala Ser Ile Tyr Ala Glu Ile Asn His Gly 
                85                  90                  95      


Thr Asn Arg Ala Lys Ala Ala Val Ile Val Ala Asn His Tyr Ala Met 
            100                 105                 110         


Lys Ala Asp Ser Gly Leu Glu Ala Leu Lys Gln Thr Leu Ser Ser Gln 
        115                 120                 125             


Glu Val Thr Ala Thr Ala Thr Ala Ser Tyr Leu Lys Gly Arg Ile Asp 
    130                 135                 140                 


Glu Tyr Leu Asn Leu Leu Leu Gln Thr Lys Glu Ser Gly Thr Ser Gly 
145                 150                 155                 160 


Cys Met Met Asp Thr Ser Gly Thr Asn Thr Val Thr Lys Ala Gly Gly 
                165                 170                 175     


Thr Ile Gly Gly Val Pro Cys Lys Leu Gln Leu Ser Pro Ile Gln Pro 
            180                 185                 190         


Lys Arg Pro Ala Ala Thr Tyr Leu Gly Lys Ala Gly Tyr Val Gly Leu 
        195                 200                 205             


Thr Arg Gln Ala Asp Ala Ala Asn Asn Phe His Asp Asn Asp Ala Glu 
    210                 215                 220                 


Cys Arg Leu Ala Ser Gly His Asn Thr Asn Gly Leu Gly Lys Ser Gly 
225                 230                 235                 240 


Gln Leu Ser Ala Ala Val Thr Met Ala Ala Gly Tyr Val Thr Val Ala 
                245                 250                 255     


Asn Ser Gln Thr Ala Val Thr Val Gln Ala Leu Asp Ala Leu Gln Glu 
            260                 265                 270         


Ala Ser Gly Ala Ala His Gln Pro Trp Ile Asp Ala Trp Lys Ala Lys 
        275                 280                 285             


Lys Ala Leu Thr Gly Ala Glu Thr Ala Glu Phe Arg Asn Glu Thr Ala 
    290                 295                 300                 


Gly Ile Ala Gly Lys Thr Gly Val Thr Lys Leu Val Glu Glu Ala Leu 
305                 310                 315                 320 


Leu Lys Lys Lys Asp Ser Glu Ala Ser Glu Ile Gln Thr Glu Leu Lys 
                325                 330                 335     


Lys Tyr Phe Ser Gly His Glu Asn Glu Gln Trp Thr Ala Ile Glu Lys 
            340                 345                 350         


Leu Ile Ser Glu Gln Pro Val Ala Gln Asn Leu Val Gly Asp Asn Gln 
        355                 360                 365             


Pro Thr Lys Leu Gly Glu Leu Glu Gly Asn Ala Lys Leu Thr Thr Ile 
    370                 375                 380                 


Leu Ala Tyr Tyr Arg Met Glu Thr Ala Gly Lys Phe Glu Val Leu Thr 
385                 390                 395                 400 


Gln Lys His Lys Pro Ala Glu Ser Gln Gln Gln Ala Ala Glu Thr Glu 
                405                 410                 415     


Gly Ser Cys Asn Lys Lys Asp Gln Asn Glu Cys Lys Ser Pro Cys Lys 
            420                 425                 430         


Trp His Asn Asp Ala Glu Asn Lys Lys Cys Thr Leu Asp Lys Glu Glu 
        435                 440                 445             


Ala Lys Lys Val Ala Asp Glu Thr Ala Lys Asp Gly Lys Thr Gly Asn 
    450                 455                 460                 


Thr Asn Thr Thr Gly Ser Ser Asn Ser Phe Val Ile Ser Lys Thr Pro 
465                 470                 475                 480 


Leu Trp Leu Ala Val Leu Leu Phe 
                485             


<210>  11
<211>  529
<212>  PRT
<213>  Artificial

<220>
<223>  ILTat1.24-G4S Protein

<400>  11

Met Val Tyr Arg Asn Ile Leu Gln Leu Ser Val Leu Lys Val Leu Leu 
1               5                   10                  15      


Ile Val Leu Ile Val Glu Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly 
            20                  25                  30          


Ser Gly Gly Gly Gly Ser Thr His Phe Gly Val Lys Tyr Glu Leu Trp 
        35                  40                  45              


Gln Pro Glu Cys Glu Leu Thr Ala Glu Leu Arg Lys Thr Ala Gly Val 
    50                  55                  60                  


Ala Lys Met Lys Val Asn Ser Asp Leu Asn Ser Phe Lys Thr Leu Glu 
65                  70                  75                  80  


Leu Thr Lys Met Lys Leu Leu Thr Phe Ala Ala Lys Phe Pro Glu Ser 
                85                  90                  95      


Lys Glu Ala Leu Thr Leu Arg Ala Leu Glu Ala Ala Leu Asn Thr Asp 
            100                 105                 110         


Leu Arg Ala Leu Arg Asp Asn Ile Ala Asn Gly Ile Asp Arg Ala Val 
        115                 120                 125             


Arg Ala Thr Ala Tyr Ala Ser Glu Ala Ala Gly Ala Leu Phe Ser Gly 
    130                 135                 140                 


Ile Gln Thr Leu His Asp Ala Thr Asp Gly Thr Thr Tyr Cys Leu Ser 
145                 150                 155                 160 


Ala Ser Gly Gln Gly Ser Asn Gly Asn Ala Ala Met Ala Ser Gln Gly 
                165                 170                 175     


Cys Lys Pro Leu Ala Leu Pro Glu Leu Leu Thr Glu Asp Ser Tyr Asn 
            180                 185                 190         


Thr Asp Val Ile Ser Asp Lys Gly Phe Pro Lys Ile Ser Pro Leu Thr 
        195                 200                 205             


Asn Ala Gln Gly Gln Gly Lys Ser Gly Glu Cys Gly Leu Phe Gln Ala 
    210                 215                 220                 


Ala Ser Gly Ala Gln Ala Thr Asn Thr Gly Val Gln Phe Ser Gly Gly 
225                 230                 235                 240 


Ser Arg Ile Asn Leu Gly Leu Gly Ala Ile Val Ala Ser Ala Ala Gln 
                245                 250                 255     


Gln Pro Thr Arg Pro Asp Leu Ser Asp Phe Ser Gly Thr Ala Arg Asn 
            260                 265                 270         


Gln Ala Asp Thr Leu Tyr Gly Lys Ala His Ala Ser Ile Thr Glu Leu 
        275                 280                 285             


Leu Gln Leu Ala Gln Gly Pro Lys Pro Gly Gln Thr Glu Val Glu Thr 
    290                 295                 300                 


Met Lys Leu Leu Ala Gln Lys Thr Ala Ala Leu Asp Ser Ile Lys Phe 
305                 310                 315                 320 


Gln Leu Ala Ala Ser Thr Gly Lys Lys Thr Ser Asp Tyr Lys Glu Asp 
                325                 330                 335     


Glu Asn Leu Lys Thr Glu Tyr Phe Gly Lys Thr Glu Ser Asn Ile Glu 
            340                 345                 350         


Ala Leu Trp Asn Lys Val Lys Glu Glu Lys Val Lys Gly Ala Asp Pro 
        355                 360                 365             


Glu Asp Pro Ser Lys Glu Ser Lys Ile Ser Asp Leu Asn Thr Glu Glu 
    370                 375                 380                 


Gln Leu Gln Arg Val Leu Asp Tyr Tyr Ala Val Ala Thr Met Leu Lys 
385                 390                 395                 400 


Leu Ala Lys Gln Ala Glu Asp Ile Ala Lys Leu Glu Thr Glu Ile Ala 
                405                 410                 415     


Asp Gln Arg Gly Lys Ser Pro Glu Ala Glu Cys Asn Lys Ile Thr Glu 
            420                 425                 430         


Glu Pro Lys Cys Ser Glu Glu Lys Ile Cys Ser Trp His Lys Glu Val 
        435                 440                 445             


Lys Ala Gly Glu Lys Asn Cys Gln Phe Asn Ser Thr Lys Ala Ser Lys 
    450                 455                 460                 


Ser Gly Val Pro Val Thr Gln Thr Gln Thr Ala Gly Ala Asp Thr Thr 
465                 470                 475                 480 


Ala Glu Lys Cys Lys Gly Lys Gly Glu Lys Asp Cys Lys Ser Pro Asp 
                485                 490                 495     


Cys Lys Trp Glu Gly Gly Thr Cys Lys Asp Ser Ser Ile Leu Ala Asn 
            500                 505                 510         


Lys Gln Phe Ala Leu Ser Val Ala Ser Ala Ala Phe Val Ala Leu Leu 
        515                 520                 525             


Phe 
    


<210>  12
<211>  5
<212>  PRT
<213>  Staphylococcus aureus


<220>
<221>  misc_feature
<222>  (3)..(3)
<223>  Xaa can be any naturally occurring amino acid

<400>  12

Leu Pro Xaa Thr Gly 
1               5   


<210>  13
<211>  5
<212>  PRT
<213>  Staphylococcus aureus


<220>
<221>  misc_feature
<222>  (3)..(3)
<223>  Xaa can be any naturally occurring amino acid

<400>  13

Leu Pro Xaa Thr Ala 
1               5   


<210>  14
<211>  10
<212>  PRT
<213>  artificial

<220>
<223>  sortagging donor sequence

<400>  14

Gly Gly Gly Ser Leu Pro Ser Thr Gly Gly 
1               5                   10  


<210>  15
<211>  10
<212>  PRT
<213>  artificial

<220>
<223>  sortagging donor sequence and the sortagging acceptor sequence

<400>  15

Gly Gly Gly Ser Leu Pro Ser Thr Ala Ala 
1               5                   10  


<210>  16
<211>  7
<212>  PRT
<213>  artificial

<220>
<223>  part of sortase donor sequence

<400>  16

Ser Leu Pro Ser Thr Gly Gly 
1               5           


<210>  17
<211>  7
<212>  PRT
<213>  artificial

<220>
<223>  sortagging linker sequence

<400>  17

Ser Leu Pro Ser Thr Ala Ala 
1               5           


