                               SEQUENCE LISTING

<110> DUKE UNIVERSITY
 
<120> TRIMERIC HIV-1 ENVELOPES COMPOSITIONS AND USES THEREOF

<130> 1579-2054

<140> PCT/US2015/016663
<141> 2015-02-19

<150> 61/973,414
<151> 2014-04-01

<150> 61/941,902
<151> 2014-02-19

<160> 14

<170> PatentIn version 3.5

<210> 1
<211> 847
<212> PRT
<213> Human immunodeficiency virus 1

<400> 1
Met Arg Val Lys Gly Ile Arg Lys Ser Tyr Gln Tyr Leu Trp Lys Gly 
1               5                   10                  15      


Gly Thr Leu Leu Leu Gly Ile Leu Met Ile Cys Ser Ala Val Glu Lys 
            20                  25                  30          


Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr 
        35                  40                  45              


Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val 
    50                  55                  60                  


His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro 
65                  70                  75                  80  


Gln Glu Val Val Leu Glu Asn Val Thr Glu His Phe Asn Met Trp Lys 
                85                  90                  95      


Asn Asn Met Val Glu Gln Met Gln Glu Asp Ile Ile Ser Leu Trp Asp 
            100                 105                 110         


Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 
        115                 120                 125             


Asn Cys Lys Asp Val Asn Ala Thr Asn Thr Thr Asn Asp Ser Glu Gly 
    130                 135                 140                 


Thr Met Glu Arg Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr 
145                 150                 155                 160 


Ser Ile Arg Asp Glu Val Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu 
                165                 170                 175     


Asp Val Val Pro Ile Asp Asn Asn Asn Thr Ser Tyr Arg Leu Ile Ser 
            180                 185                 190         


Cys Asp Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Ile Ser Phe Glu 
        195                 200                 205             


Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys 
    210                 215                 220                 


Cys Asn Asp Lys Thr Phe Asn Gly Lys Gly Pro Cys Lys Asn Val Ser 
225                 230                 235                 240 


Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu 
                245                 250                 255     


Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Asp 
            260                 265                 270         


Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Lys Glu Ser 
        275                 280                 285             


Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile 
    290                 295                 300                 


His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly Glu Ile Ile Gly 
305                 310                 315                 320 


Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Ala Lys Trp Asn Asp 
                325                 330                 335     


Thr Leu Lys Gln Ile Val Ile Lys Leu Arg Glu Gln Phe Glu Asn Lys 
            340                 345                 350         


Thr Ile Val Phe Asn His Ser Ser Gly Gly Asp Pro Glu Ile Val Met 
        355                 360                 365             


His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln 
    370                 375                 380                 


Leu Phe Asn Ser Thr Trp Asn Asn Asn Thr Glu Gly Ser Asn Asn Thr 
385                 390                 395                 400 


Glu Gly Asn Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn 
                405                 410                 415     


Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly 
            420                 425                 430         


Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp 
        435                 440                 445             


Gly Gly Ile Asn Glu Asn Gly Thr Glu Ile Phe Arg Pro Gly Gly Gly 
    450                 455                 460                 


Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val 
465                 470                 475                 480 


Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val 
                485                 490                 495     


Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly 
            500                 505                 510         


Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Met Thr Leu 
        515                 520                 525             


Thr Val Gln Ala Arg Leu Leu Leu Ser Gly Ile Val Gln Gln Gln Asn 
    530                 535                 540                 


Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln Arg Met Leu Gln Leu Thr 
545                 550                 555                 560 


Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg 
                565                 570                 575     


Tyr Leu Gly Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys 
            580                 585                 590         


Leu Ile Cys Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys 
        595                 600                 605             


Ser Leu Asp Arg Ile Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg 
    610                 615                 620                 


Glu Ile Asp Asn Tyr Thr Ser Glu Ile Tyr Thr Leu Ile Glu Glu Ser 
625                 630                 635                 640 


Gln Asn Gln Gln Glu Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Lys 
                645                 650                 655     


Trp Ala Ser Leu Trp Asn Trp Phe Asp Ile Thr Lys Trp Leu Trp Tyr 
            660                 665                 670         


Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu Val Gly Leu Arg Leu 
        675                 680                 685             


Val Phe Thr Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser 
    690                 695                 700                 


Pro Leu Ser Phe Gln Thr Leu Leu Pro Ala Pro Arg Gly Pro Asp Arg 
705                 710                 715                 720 


Pro Glu Gly Ile Glu Glu Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser 
                725                 730                 735     


Gly Arg Leu Val Asn Gly Phe Leu Ala Leu Ile Trp Val Asp Leu Arg 
            740                 745                 750         


Ser Leu Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Leu Leu Thr 
        755                 760                 765             


Val Thr Arg Ile Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Val Leu 
    770                 775                 780                 


Lys Tyr Trp Trp Asn Leu Leu Gln Tyr Trp Ser Gln Glu Leu Lys Asn 
785                 790                 795                 800 


Ser Ala Val Ser Leu Leu Asn Ala Thr Ala Ile Ala Val Ala Glu Gly 
                805                 810                 815     


Thr Asp Arg Ile Ile Glu Ala Leu Gln Arg Thr Tyr Arg Ala Ile Leu 
            820                 825                 830         


His Ile Pro Thr Arg Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu 
        835                 840                 845         


<210> 2
<211> 705
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 2
Met Pro Met Gly Ser Leu Gln Pro Leu Ala Thr Leu Tyr Leu Leu Gly 
1               5                   10                  15      


Met Leu Val Ala Ser Val Leu Ala Val Glu Lys Leu Trp Val Thr Val 
            20                  25                  30          


Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys 
        35                  40                  45              


Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala 
    50                  55                  60                  


Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu 
65                  70                  75                  80  


Glu Asn Val Thr Glu His Phe Asn Met Trp Lys Asn Asn Met Val Glu 
                85                  90                  95      


Gln Met Gln Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro 
            100                 105                 110         


Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys Lys Asp Val 
        115                 120                 125             


Asn Ala Thr Asn Thr Thr Asn Asp Ser Glu Gly Thr Met Glu Arg Gly 
    130                 135                 140                 


Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp Glu 
145                 150                 155                 160 


Val Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu Asp Val Val Pro Ile 
                165                 170                 175     


Asp Asn Asn Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asp Thr Ser Val 
            180                 185                 190         


Ile Thr Gln Ala Cys Pro Lys Ile Ser Phe Glu Pro Ile Pro Ile His 
        195                 200                 205             


Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Thr 
    210                 215                 220                 


Phe Asn Gly Lys Gly Pro Cys Lys Asn Val Ser Thr Val Gln Cys Thr 
225                 230                 235                 240 


His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser 
                245                 250                 255     


Leu Ala Glu Glu Glu Val Val Ile Arg Ser Asp Asn Phe Thr Asn Asn 
            260                 265                 270         


Ala Lys Thr Ile Ile Val Gln Leu Lys Glu Ser Val Glu Ile Asn Cys 
        275                 280                 285             


Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile His Ile Gly Pro Gly 
    290                 295                 300                 


Arg Ala Phe Tyr Thr Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln Ala 
305                 310                 315                 320 


His Cys Asn Ile Ser Arg Ala Lys Trp Asn Asp Thr Leu Lys Gln Ile 
                325                 330                 335     


Val Ile Lys Leu Arg Glu Gln Phe Glu Asn Lys Thr Ile Val Phe Asn 
            340                 345                 350         


His Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys 
        355                 360                 365             


Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln Leu Phe Asn Ser Thr 
    370                 375                 380                 


Trp Asn Asn Asn Thr Glu Gly Ser Asn Asn Thr Glu Gly Asn Thr Ile 
385                 390                 395                 400 


Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val 
                405                 410                 415     


Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile Arg Cys Ser 
            420                 425                 430         


Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Ile Asn Glu 
        435                 440                 445             


Asn Gly Thr Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn 
    450                 455                 460                 


Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu 
465                 470                 475                 480 


Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Ser Glu Lys 
                485                 490                 495     


Ser Ala Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala 
            500                 505                 510         


Gly Ser Thr Met Gly Ala Ala Ser Met Thr Leu Thr Val Gln Ala Arg 
        515                 520                 525             


Leu Leu Leu Ser Gly Ile Val Gln Gln Gln Asn Asn Leu Leu Arg Ala 
    530                 535                 540                 


Ile Glu Ala Gln Gln Arg Met Leu Gln Leu Thr Val Trp Gly Ile Lys 
545                 550                 555                 560 


Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Gly Asp Gln 
                565                 570                 575     


Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr 
            580                 585                 590         


Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Asp Arg Ile 
        595                 600                 605             


Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg Glu Ile Asp Asn Tyr 
    610                 615                 620                 


Thr Ser Glu Ile Tyr Thr Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu 
625                 630                 635                 640 


Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp 
                645                 650                 655     


Asn Trp Phe Asp Ile Thr Lys Trp Leu Trp Tyr Ile Lys Ile Phe Ile 
            660                 665                 670         


Met Ile Val Gly Gly Leu Val Gly Leu Arg Leu Val Phe Thr Val Leu 
        675                 680                 685             


Ser Ile Val Asn Arg Val Arg Gln Gly Gly Gly His His His His His 
    690                 695                 700                 


His 
705 


<210> 3
<211> 802
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 3
Met Pro Met Gly Ser Leu Gln Pro Leu Ala Thr Leu Tyr Leu Leu Gly 
1               5                   10                  15      


Met Leu Val Ala Ser Val Leu Ala Val Glu Lys Leu Trp Val Thr Val 
            20                  25                  30          


Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys 
        35                  40                  45              


Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala 
    50                  55                  60                  


Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu 
65                  70                  75                  80  


Glu Asn Val Thr Glu His Phe Asn Met Trp Lys Asn Asn Met Val Glu 
                85                  90                  95      


Gln Met Gln Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro 
            100                 105                 110         


Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys Lys Asp Val 
        115                 120                 125             


Asn Ala Thr Asn Thr Thr Asn Asp Ser Glu Gly Thr Met Glu Arg Gly 
    130                 135                 140                 


Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp Glu 
145                 150                 155                 160 


Val Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu Asp Val Val Pro Ile 
                165                 170                 175     


Asp Asn Asn Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asp Thr Ser Val 
            180                 185                 190         


Ile Thr Gln Ala Cys Pro Lys Ile Ser Phe Glu Pro Ile Pro Ile His 
        195                 200                 205             


Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Thr 
    210                 215                 220                 


Phe Asn Gly Lys Gly Pro Cys Lys Asn Val Ser Thr Val Gln Cys Thr 
225                 230                 235                 240 


His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser 
                245                 250                 255     


Leu Ala Glu Glu Glu Val Val Ile Arg Ser Asp Asn Phe Thr Asn Asn 
            260                 265                 270         


Ala Lys Thr Ile Ile Val Gln Leu Lys Glu Ser Val Glu Ile Asn Cys 
        275                 280                 285             


Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile His Ile Gly Pro Gly 
    290                 295                 300                 


Arg Ala Phe Tyr Thr Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln Ala 
305                 310                 315                 320 


His Cys Asn Ile Ser Arg Ala Lys Trp Asn Asp Thr Leu Lys Gln Ile 
                325                 330                 335     


Val Ile Lys Leu Arg Glu Gln Phe Glu Asn Lys Thr Ile Val Phe Asn 
            340                 345                 350         


His Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys 
        355                 360                 365             


Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln Leu Phe Asn Ser Thr 
    370                 375                 380                 


Trp Asn Asn Asn Thr Glu Gly Ser Asn Asn Thr Glu Gly Asn Thr Ile 
385                 390                 395                 400 


Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val 
                405                 410                 415     


Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile Arg Cys Ser 
            420                 425                 430         


Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Ile Asn Glu 
        435                 440                 445             


Asn Gly Thr Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn 
    450                 455                 460                 


Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu 
465                 470                 475                 480 


Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Ser Glu Lys 
                485                 490                 495     


Ser Ala Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala 
            500                 505                 510         


Gly Ser Thr Met Gly Ala Ala Ser Met Thr Leu Thr Val Gln Ala Arg 
        515                 520                 525             


Leu Leu Leu Ser Gly Ile Val Gln Gln Gln Asn Asn Leu Leu Arg Ala 
    530                 535                 540                 


Ile Glu Ala Gln Gln Arg Met Leu Gln Leu Thr Val Trp Gly Ile Lys 
545                 550                 555                 560 


Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Gly Asp Gln 
                565                 570                 575     


Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr 
            580                 585                 590         


Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Asp Arg Ile 
        595                 600                 605             


Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg Glu Ile Asp Asn Tyr 
    610                 615                 620                 


Thr Ser Glu Ile Tyr Thr Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu 
625                 630                 635                 640 


Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp 
                645                 650                 655     


Asn Trp Phe Asp Ile Thr Lys Trp Leu Trp Tyr Ile Lys Ile Phe Ile 
            660                 665                 670         


Met Ile Val Gly Gly Leu Val Gly Leu Arg Leu Val Phe Thr Val Leu 
        675                 680                 685             


Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln 
    690                 695                 700                 


Thr Leu Leu Pro Ala Pro Arg Gly Pro Asp Arg Pro Glu Gly Ile Glu 
705                 710                 715                 720 


Glu Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser Gly Arg Leu Val Asn 
                725                 730                 735     


Gly Phe Leu Ala Leu Ile Trp Val Asp Leu Arg Ser Leu Cys Leu Phe 
            740                 745                 750         


Ser Tyr His Arg Leu Arg Asp Leu Leu Leu Thr Val Thr Arg Ile Val 
        755                 760                 765             


Glu Leu Leu Gly Arg Arg Gly Trp Glu Val Leu Lys Tyr Trp Trp Asn 
    770                 775                 780                 


Leu Leu Gln Tyr Trp Ser Gln Glu Leu Gly Gly Gly His His His His 
785                 790                 795                 800 


His His 
        


<210> 4
<211> 802
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 4
Met Pro Met Gly Ser Leu Gln Pro Leu Ala Thr Leu Tyr Leu Leu Gly 
1               5                   10                  15      


Met Leu Val Ala Ser Val Leu Ala Val Glu Lys Leu Trp Val Thr Val 
            20                  25                  30          


Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys 
        35                  40                  45              


Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala 
    50                  55                  60                  


Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu 
65                  70                  75                  80  


Glu Asn Val Thr Glu His Phe Asn Met Trp Lys Asn Asn Met Val Glu 
                85                  90                  95      


Gln Met Gln Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro 
            100                 105                 110         


Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys Lys Asp Val 
        115                 120                 125             


Asn Ala Thr Asn Thr Thr Asn Asp Ser Glu Gly Thr Met Glu Arg Gly 
    130                 135                 140                 


Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp Glu 
145                 150                 155                 160 


Val Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu Asp Val Val Pro Ile 
                165                 170                 175     


Asp Asn Asn Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asp Thr Ser Val 
            180                 185                 190         


Ile Thr Gln Ala Cys Pro Lys Ile Ser Phe Glu Pro Ile Pro Ile His 
        195                 200                 205             


Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Thr 
    210                 215                 220                 


Phe Asn Gly Lys Gly Pro Cys Lys Asn Val Ser Thr Val Gln Cys Thr 
225                 230                 235                 240 


His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser 
                245                 250                 255     


Leu Ala Glu Glu Glu Val Val Ile Arg Ser Asp Asn Phe Thr Asn Asn 
            260                 265                 270         


Ala Lys Thr Ile Ile Val Gln Leu Lys Glu Ser Val Glu Ile Asn Cys 
        275                 280                 285             


Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile His Ile Gly Pro Gly 
    290                 295                 300                 


Arg Ala Phe Tyr Thr Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln Ala 
305                 310                 315                 320 


His Cys Asn Ile Ser Arg Ala Lys Trp Asn Asp Thr Leu Lys Gln Ile 
                325                 330                 335     


Val Ile Lys Leu Arg Glu Gln Phe Glu Asn Lys Thr Ile Val Phe Asn 
            340                 345                 350         


His Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys 
        355                 360                 365             


Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln Leu Phe Asn Ser Thr 
    370                 375                 380                 


Trp Asn Asn Asn Thr Glu Gly Ser Asn Asn Thr Glu Gly Asn Thr Ile 
385                 390                 395                 400 


Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val 
                405                 410                 415     


Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile Arg Cys Ser 
            420                 425                 430         


Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Ile Asn Glu 
        435                 440                 445             


Asn Gly Thr Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn 
    450                 455                 460                 


Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu 
465                 470                 475                 480 


Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Ser Glu Lys 
                485                 490                 495     


Ser Ala Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala 
            500                 505                 510         


Gly Ser Thr Met Gly Ala Ala Ser Met Thr Leu Thr Val Gln Ala Arg 
        515                 520                 525             


Leu Leu Leu Ser Gly Ile Val Gln Gln Gln Asn Asn Leu Leu Arg Ala 
    530                 535                 540                 


Ile Glu Ala Gln Gln Arg Met Leu Gln Leu Thr Val Trp Gly Ile Lys 
545                 550                 555                 560 


Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Gly Asp Gln 
                565                 570                 575     


Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr 
            580                 585                 590         


Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Asp Arg Ile 
        595                 600                 605             


Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg Glu Ile Asp Asn Tyr 
    610                 615                 620                 


Thr Ser Glu Ile Tyr Thr Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu 
625                 630                 635                 640 


Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp 
                645                 650                 655     


Asn Trp Phe Asp Ile Thr Lys Trp Leu Trp Tyr Ile Lys Ile Phe Ile 
            660                 665                 670         


Met Ile Val Gly Gly Leu Val Gly Leu Arg Leu Val Phe Thr Val Leu 
        675                 680                 685             


Ser Ile Val Asn Arg Val Arg Gln Gly Ala Ser Pro Leu Ser Phe Gln 
    690                 695                 700                 


Thr Leu Leu Pro Ala Pro Arg Gly Pro Asp Arg Pro Glu Gly Ile Glu 
705                 710                 715                 720 


Glu Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser Gly Arg Leu Val Asn 
                725                 730                 735     


Gly Phe Leu Ala Leu Ile Trp Val Asp Leu Arg Ser Leu Cys Leu Phe 
            740                 745                 750         


Ser Tyr His Arg Leu Arg Asp Leu Leu Leu Thr Val Thr Arg Ile Val 
        755                 760                 765             


Glu Leu Leu Gly Arg Arg Gly Trp Glu Val Leu Lys Tyr Trp Trp Asn 
    770                 775                 780                 


Leu Leu Gln Tyr Trp Ser Gln Glu Leu Gly Gly Gly His His His His 
785                 790                 795                 800 


His His 
        


<210> 5
<211> 846
<212> PRT
<213> Human immunodeficiency virus 1

<400> 5
Met Arg Val Met Gly Ile Gln Arg Asn Tyr Pro Gln Trp Trp Ile Trp 
1               5                   10                  15      


Ser Met Leu Gly Phe Trp Met Leu Met Ile Cys Asn Gly Met Trp Val 
            20                  25                  30          


Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Lys Thr Thr Leu 
        35                  40                  45              


Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Lys Glu Val His Asn Val 
    50                  55                  60                  


Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Met 
65                  70                  75                  80  


Val Leu Lys Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asp Met 
                85                  90                  95      


Val Asp Gln Met His Glu Asp Val Ile Ser Leu Trp Asp Gln Ser Leu 
            100                 105                 110         


Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys Thr 
        115                 120                 125             


Asn Ala Thr Ala Ser Asn Ser Ser Ile Ile Glu Gly Met Lys Asn Cys 
    130                 135                 140                 


Ser Phe Asn Ile Thr Thr Glu Leu Arg Asp Lys Arg Glu Lys Lys Asn 
145                 150                 155                 160 


Ala Leu Phe Tyr Lys Leu Asp Ile Val Gln Leu Asp Gly Asn Ser Ser 
                165                 170                 175     


Gln Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Thr Gln Ala Cys 
            180                 185                 190         


Pro Lys Val Ser Phe Asp Pro Ile Pro Ile His Tyr Cys Ala Pro Ala 
        195                 200                 205             


Gly Tyr Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Thr Gly Thr Gly 
    210                 215                 220                 


Pro Cys Asn Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro 
225                 230                 235                 240 


Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Gly Glu 
                245                 250                 255     


Ile Ile Ile Arg Ser Glu Asn Ile Thr Asn Asn Val Lys Thr Ile Ile 
            260                 265                 270         


Val His Leu Asn Glu Ser Val Lys Ile Glu Cys Thr Arg Pro Asn Asn 
        275                 280                 285             


Lys Thr Arg Thr Ser Ile Arg Ile Gly Pro Gly Gln Ala Phe Tyr Ala 
    290                 295                 300                 


Thr Gly Gln Val Ile Gly Asp Ile Arg Glu Ala Tyr Cys Asn Ile Asn 
305                 310                 315                 320 


Glu Ser Lys Trp Asn Glu Thr Leu Gln Arg Val Ser Lys Lys Leu Lys 
                325                 330                 335     


Glu Tyr Phe Pro His Lys Asn Ile Thr Phe Gln Pro Ser Ser Gly Gly 
            340                 345                 350         


Asp Leu Glu Ile Thr Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe 
        355                 360                 365             


Tyr Cys Asn Thr Ser Ser Leu Phe Asn Arg Thr Tyr Met Ala Asn Ser 
    370                 375                 380                 


Thr Asp Met Ala Asn Ser Thr Glu Thr Asn Ser Thr Arg Thr Ile Thr 
385                 390                 395                 400 


Ile His Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly 
                405                 410                 415     


Arg Ala Met Tyr Ala Pro Pro Ile Ala Gly Asn Ile Thr Cys Ile Ser 
            420                 425                 430         


Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Lys Asn Asn Thr 
        435                 440                 445             


Glu Thr Phe Arg Pro Gly Gly Gly Asn Met Lys Asp Asn Trp Arg Ser 
    450                 455                 460                 


Glu Leu Tyr Lys Tyr Lys Val Val Glu Val Lys Pro Leu Gly Val Ala 
465                 470                 475                 480 


Pro Thr Asn Ala Arg Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val 
                485                 490                 495     


Gly Met Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
            500                 505                 510         


Met Gly Ala Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu Leu 
        515                 520                 525             


Ser Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Lys Ala Ile Glu Ala 
    530                 535                 540                 


Gln Gln His Met Leu Lys Leu Thr Val Trp Gly Ile Lys Gln Leu Gln 
545                 550                 555                 560 


Ala Arg Val Leu Ala Leu Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu 
                565                 570                 575     


Gly Met Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Asn Val Tyr 
            580                 585                 590         


Trp Asn Ser Ser Trp Ser Asn Lys Thr Tyr Gly Asp Ile Trp Asp Asn 
        595                 600                 605             


Met Thr Trp Met Gln Trp Glu Arg Glu Ile Ser Asn Tyr Thr Glu Ile 
    610                 615                 620                 


Ile Tyr Glu Leu Leu Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu 
625                 630                 635                 640 


Gln Asp Leu Leu Ala Leu Asp Arg Trp Asn Ser Leu Trp Asn Trp Phe 
                645                 650                 655     


Asn Ile Thr Asn Trp Leu Gly Tyr Ile Lys Ile Phe Ile Met Ile Val 
            660                 665                 670         


Gly Gly Leu Ile Gly Leu Arg Ile Ile Phe Ala Val Leu Ser Leu Val 
        675                 680                 685             


Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Leu Gln Thr Leu Ile 
    690                 695                 700                 


Pro Ser Pro Arg Gly Pro Asp Arg Pro Gly Gly Ile Glu Glu Glu Gly 
705                 710                 715                 720 


Gly Glu Gln Asp Arg Asn Arg Ser Thr Arg Leu Val Ser Gly Phe Leu 
                725                 730                 735     


Ala Leu Val Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe Ile Tyr His 
            740                 745                 750         


Arg Leu Arg Asp Phe Ile Leu Ile Ala Ala Arg Ala Gly Glu Leu Leu 
        755                 760                 765             


Gly Arg Ser Ser Leu Lys Gly Leu Arg Arg Gly Trp Glu Ala Leu Lys 
    770                 775                 780                 


Tyr Leu Gly Ser Leu Val Gln Tyr Trp Gly Leu Glu Leu Lys Arg Ser 
785                 790                 795                 800 


Ala Ile Ser Leu Leu Asp Thr Leu Ala Ile Ala Val Gly Glu Gly Thr 
                805                 810                 815     


Asp Arg Ile Leu Glu Phe Val Leu Gly Ile Cys Arg Ala Ile Arg Asn 
            820                 825                 830         


Ile Pro Thr Arg Ile Arg Gln Gly Phe Glu Thr Ala Leu Leu 
        835                 840                 845     


<210> 6
<211> 2562
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 6
gtcgacgcta gcaccatgcg cgtgatgggc atccagcgca actaccccca gtggtggatc       60

tggtccatgc tgggcttctg gatgctgatg atctgcaacg gcatgtgggt gaccgtgtac      120

tacggcgtgc ccgtgtggaa ggaggccaag accaccctgt tctgcgcctc cgacgccaag      180

gcctacgaga aggaggtgca caacgtgtgg gccacccacg cctgcgtgcc caccgacccc      240

aacccccagg agatggtgct gaagaacgtg accgagaact tcaacatgtg gaagaacgac      300

atggtggacc agatgcacga ggacgtgatc tccctgtggg accagtccct gaagccctgc      360

gtgaagctga cccccctgtg cgtgaccctg aactgcacca acgccaccgc ctccaactcc      420

tccatcatcg agggcatgaa gaactgctcc ttcaacatca ccaccgagct gcgcgacaag      480

cgcgagaaga agaacgccct gttctacaag ctggacatcg tgcagctgga cggcaactcc      540

tcccagtacc gcctgatcaa ctgcaacacc tccgtgatca cccaggcctg ccccaaggtg      600

tccttcgacc ccatccccat ccactactgc gcccccgccg gctacgccat cctgaagtgc      660

aacaacaaga ccttcaccgg caccggcccc tgcaacaacg tgtccaccgt gcagtgcacc      720

cacggcatca agcccgtggt gtccacccag ctgctgctga acggctccct ggccgagggc      780

gagatcatca tccgctccga gaacatcacc aacaacgtga agaccatcat cgtgcacctg      840

aacgagtccg tgaagatcga gtgcacccgc cccaacaaca agacccgcac ctccatccgc      900

atcggccccg gccaggcctt ctacgccacc ggccaggtga tcggcgacat ccgcgaggcc      960

tactgcaaca tcaacgagtc caagtggaac gagaccctgc agcgcgtgtc caagaagctg     1020

aaggagtact tcccccacaa gaacatcacc ttccagccct cctccggcgg cgacctggag     1080

atcaccaccc actccttcaa ctgcggcggc gagttcttct actgcaacac ctcctccctg     1140

ttcaaccgca cctacatggc caactccacc gacatggcca actccaccga gaccaactcc     1200

acccgcacca tcaccatcca ctgccgcatc aagcagatca tcaacatgtg gcaggaggtg     1260

ggccgcgcca tgtacgcccc ccccatcgcc ggcaacatca cctgcatctc caacatcacc     1320

ggcctgctgc tgacccgcga cggcggcaag aacaacaccg agaccttccg ccccggcggc     1380

ggcaacatga aggacaactg gcgctccgag ctgtacaagt acaaggtggt ggaggtgaag     1440

cccctgggcg tggcccccac caacgcccgc cgccgcgtgg tggagcgcga gaagcgcgcc     1500

gtgggcatgg gcgccgtgtt cctgggcttc ctgggcgccg ccggctccac catgggcgcc     1560

gcctccatca ccctgaccgt gcaggcccgc cagctgctgt ccggcatcgt gcagcagcag     1620

tccaacctgc tgaaggccat cgaggcccag cagcacatgc tgaagctgac cgtgtggggc     1680

atcaagcagc tgcaggcccg cgtgctggcc ctggagcgct acctgaagga ccagcagctg     1740

ctgggcatgt ggggctgctc cggcaagctg atctgcacca ccaacgtgta ctggaactcc     1800

tcctggtcca acaagaccta cggcgacatc tgggacaaca tgacctggat gcagtgggag     1860

cgcgagatct ccaactacac cgagatcatc tacgagctgc tggaggagtc ccagaaccag     1920

caggagaaga acgagcagga cctgctggcc ctggaccgct ggaactccct gtggaactgg     1980

ttcaacatca ccaactggct gggctacatc aagatcttca tcatgatcgt gggcggcctg     2040

atcggcctgc gcatcatctt cgccgtgctg tccctggtga accgcgtgcg ccagggctac     2100

tcccccctgt ccctgcagac cctgatcccc tccccccgcg gccccgaccg ccccggcggc     2160

atcgaggagg agggcggcga gcaggaccgc aaccgctcca cccgcctggt gtccggcttc     2220

ctggccctgg tgtgggacga cctgcgctcc ctgtgcctgt tcatctacca ccgcctgcgc     2280

gacttcatcc tgatcgccgc ccgcgccggc gagctgctgg gccgctcctc cctgaagggc     2340

ctgcgccgcg gctgggaggc cctgaagtac ctgggctccc tggtgcagta ctggggcctg     2400

gagctgaagc gctccgccat ctccctgctg gacaccctgg ccatcgccgt gggcgagggc     2460

accgaccgca tcctggagtt cgtgctgggc atctgccgcg ccatccgcaa catccccacc     2520

cgcatccgcc agggcttcga gaccgccctg ctgtagggat cc                        2562


<210> 7
<211> 859
<212> PRT
<213> Human immunodeficiency virus 1

<400> 7
Met Arg Val Met Gly Ile Gln Arg Asn Tyr Pro Gln Trp Trp Ile Trp 
1               5                   10                  15      


Ser Met Leu Gly Phe Trp Met Leu Met Ile Cys Asn Gly Met Trp Val 
            20                  25                  30          


Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Lys Thr Thr Leu 
        35                  40                  45              


Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Lys Glu Val His Asn Val 
    50                  55                  60                  


Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Met 
65                  70                  75                  80  


Val Leu Lys Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asp Met 
                85                  90                  95      


Val Asp Gln Met His Glu Asp Val Ile Ser Leu Trp Asp Gln Ser Leu 
            100                 105                 110         


Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys Thr 
        115                 120                 125             


Asn Ala Asn Ala Thr Ala Ser Asn Ser Ser Ile Ile Glu Gly Met Asn 
    130                 135                 140                 


Ser Ser Ile Ile Glu Gly Met Lys Asn Cys Ser Phe Asn Ile Thr Thr 
145                 150                 155                 160 


Glu Leu Arg Asp Lys Arg Glu Lys Lys Asn Ala Leu Phe Tyr Lys Leu 
                165                 170                 175     


Asp Ile Val Gln Leu Asp Gly Asn Ser Ser Gln Tyr Arg Leu Ile Asn 
            180                 185                 190         


Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Asp 
        195                 200                 205             


Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Tyr Ala Ile Leu Lys 
    210                 215                 220                 


Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Asn Asn Val Ser 
225                 230                 235                 240 


Thr Val Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu 
                245                 250                 255     


Leu Leu Asn Gly Ser Leu Ala Glu Gly Glu Ile Ile Ile Arg Ser Glu 
            260                 265                 270         


Asn Ile Thr Asp Asn Gly Lys Thr Ile Ile Val His Leu Asn Glu Ser 
        275                 280                 285             


Val Lys Ile Glu Cys Thr Arg Pro Ser Asn Asn Thr Arg Thr Ser Ile 
    290                 295                 300                 


Arg Ile Gly Pro Gly Gln Ala Phe Tyr Ala Thr Gly Gln Val Ile Gly 
305                 310                 315                 320 


Asp Ile Arg Glu Ala His Cys Asn Ile Ser Glu Ser Lys Trp Asn Glu 
                325                 330                 335     


Thr Leu Gln Arg Val Ser Glu Lys Leu Lys Glu Tyr Phe Pro His Lys 
            340                 345                 350         


Asn Ile Thr Phe Gln Pro Ser Ser Gly Gly Asp Leu Glu Ile Thr Thr 
        355                 360                 365             


His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Ser Ser 
    370                 375                 380                 


Leu Phe Asn Arg Thr Tyr Met Ala Thr Ser Thr Asp Met Ala Asn Ser 
385                 390                 395                 400 


Thr Glu Thr Asn Ser Thr Arg Ile Ile Thr Ile Arg Cys Arg Ile Lys 
                405                 410                 415     


Gln Ile Ile Asn Met Trp Gln Glu Val Gly Arg Ala Met Tyr Ala Pro 
            420                 425                 430         


Pro Ile Ala Gly Asn Ile Thr Cys Ile Ser Asn Ile Thr Gly Leu Leu 
        435                 440                 445             


Leu Thr Arg Asp Gly Gly Lys Asn Asn Thr Glu Thr Phe Glu Thr Phe 
    450                 455                 460                 


Arg Pro Gly Gly Gly Asn Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr 
465                 470                 475                 480 


Lys Tyr Lys Val Val Glu Val Lys Pro Leu Gly Val Ala Pro Thr Asn 
                485                 490                 495     


Ala Arg Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly Met Gly 
            500                 505                 510         


Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala 
        515                 520                 525             


Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser Gly Ile 
    530                 535                 540                 


Val Gln Gln Gln Ser Asn Leu Leu Lys Ala Ile Glu Ala Gln Gln His 
545                 550                 555                 560 


Met Leu Lys Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val 
                565                 570                 575     


Leu Ala Leu Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly Met Trp 
            580                 585                 590         


Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Asn Val Tyr Trp Asn Ser 
        595                 600                 605             


Ser Trp Ser Asn Lys Thr Tyr Gly Asp Ile Trp Asp Asn Met Thr Trp 
    610                 615                 620                 


Met Gln Trp Glu Arg Glu Ile Ser Asn Tyr Thr Glu Ile Ile Tyr Glu 
625                 630                 635                 640 


Leu Leu Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Gln Asp Leu 
                645                 650                 655     


Leu Ala Leu Asp Arg Trp Asn Ser Leu Trp Asn Trp Phe Asn Ile Thr 
            660                 665                 670         


Asn Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu 
        675                 680                 685             


Ile Gly Leu Arg Ile Ile Phe Ala Val Leu Ser Leu Val Asn Arg Val 
    690                 695                 700                 


Arg Gln Gly Tyr Ser Pro Leu Ser Leu Gln Thr Leu Ile Pro Ser Pro 
705                 710                 715                 720 


Arg Gly Pro Asp Arg Pro Gly Gly Ile Glu Glu Glu Gly Gly Glu Gln 
                725                 730                 735     


Asp Arg Asn Arg Ser Thr Arg Leu Val Ser Gly Phe Leu Ala Leu Ala 
            740                 745                 750         


Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe Ile Tyr His Arg Leu Arg 
        755                 760                 765             


Asp Phe Ile Leu Ile Ala Ala Arg Ala Gly Glu Leu Leu Gly Arg Ser 
    770                 775                 780                 


Ser Leu Lys Gly Leu Arg Arg Gly Trp Glu Ala Leu Lys Tyr Leu Gly 
785                 790                 795                 800 


Ser Leu Val Gln Tyr Trp Gly Leu Glu Leu Lys Arg Ser Ala Ile Ser 
                805                 810                 815     


Leu Leu Asp Thr Leu Ala Ile Ala Val Gly Glu Gly Thr Asp Arg Ile 
            820                 825                 830         


Leu Glu Phe Val Leu Gly Ile Cys Arg Ala Ile Arg Asn Ile Pro Thr 
        835                 840                 845             


Arg Ile Arg Gln Gly Phe Glu Thr Ala Leu Leu 
    850                 855                 


<210> 8
<211> 2601
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 8
gtcgacgcta gcaccatgcg cgtgatgggc atccagcgca actaccccca gtggtggatc       60

tggtccatgc tgggcttctg gatgctgatg atctgcaacg gcatgtgggt gaccgtgtac      120

tacggcgtgc ccgtgtggaa ggaggccaag accaccctgt tctgcgcctc cgacgccaag      180

gcctacgaga aggaggtgca caacgtgtgg gccacccacg cctgcgtgcc caccgacccc      240

aacccccagg agatggtgct gaagaacgtg accgagaact tcaacatgtg gaagaacgac      300

atggtggacc agatgcacga ggacgtgatc tccctgtggg accagtccct gaagccctgc      360

gtgaagctga cccccctgtg cgtgaccctg aactgcacca acgccaacgc caccgcctcc      420

aactcctcca tcatcgaggg catgaactcc tccatcatcg agggcatgaa gaactgctcc      480

ttcaacatca ccaccgagct gcgcgacaag cgcgagaaga agaacgccct gttctacaag      540

ctggacatcg tgcagctgga cggcaactcc tcccagtacc gcctgatcaa ctgcaacacc      600

tccgtgatca cccaggcctg ccccaaggtg tccttcgacc ccatccccat ccactactgc      660

gcccccgccg gctacgccat cctgaagtgc aacaacaaga ccttcaacgg caccggcccc      720

tgcaacaacg tgtccaccgt gcagtgcacc cacggcatca agcccgtggt gtccacccag      780

ctgctgctga acggctccct ggccgagggc gagatcatca tccgctccga gaacatcacc      840

gacaacggca agaccatcat cgtgcacctg aacgagtccg tgaagatcga gtgcacccgc      900

ccctccaaca acacccgcac ctccatccgc atcggccccg gccaggcctt ctacgccacc      960

ggccaggtga tcggcgacat ccgcgaggcc cactgcaaca tctccgagtc caagtggaac     1020

gagaccctgc agcgcgtgtc cgagaagctg aaggagtact tcccccacaa gaacatcacc     1080

ttccagccct cctccggcgg cgacctggag atcaccaccc actccttcaa ctgcggcggc     1140

gagttcttct actgcaacac ctcctccctg ttcaaccgca cctacatggc cacctccacc     1200

gacatggcca actccaccga gaccaactcc acccgcatca tcaccatccg ctgccgcatc     1260

aagcagatca tcaacatgtg gcaggaggtg ggccgcgcca tgtacgcccc ccccatcgcc     1320

ggcaacatca cctgcatctc caacatcacc ggcctgctgc tgacccgcga cggcggcaag     1380

aacaacaccg agaccttcga gaccttccgc cccggcggcg gcaacatgaa ggacaactgg     1440

cgctccgagc tgtacaagta caaggtggtg gaggtgaagc ccctgggcgt ggcccccacc     1500

aacgcccgcc gccgcgtggt ggagcgcgag aagcgcgccg tgggcatggg cgccgtgttc     1560

ctgggcttcc tgggcgccgc cggctccacc atgggcgccg cctccatcac cctgaccgtg     1620

caggcccgcc agctgctgtc cggcatcgtg cagcagcagt ccaacctgct gaaggccatc     1680

gaggcccagc agcacatgct gaagctgacc gtgtggggca tcaagcagct gcaggcccgc     1740

gtgctggccc tggagcgcta cctgaaggac cagcagctgc tgggcatgtg gggctgctcc     1800

ggcaagctga tctgcaccac caacgtgtac tggaactcct cctggtccaa caagacctac     1860

ggcgacatct gggacaacat gacctggatg cagtgggagc gcgagatctc caactacacc     1920

gagatcatct acgagctgct ggaggagtcc cagaaccagc aggagaagaa cgagcaggac     1980

ctgctggccc tggaccgctg gaactccctg tggaactggt tcaacatcac caactggctg     2040

tggtacatca agatcttcat catgatcgtg ggcggcctga tcggcctgcg catcatcttc     2100

gccgtgctgt ccctggtgaa ccgcgtgcgc cagggctact cccccctgtc cctgcagacc     2160

ctgatcccct ccccccgcgg ccccgaccgc cccggcggca tcgaggagga gggcggcgag     2220

caggaccgca accgctccac ccgcctggtg tccggcttcc tggccctggc ctgggacgac     2280

ctgcgctccc tgtgcctgtt catctaccac cgcctgcgcg acttcatcct gatcgccgcc     2340

cgcgccggcg agctgctggg ccgctcctcc ctgaagggcc tgcgccgcgg ctgggaggcc     2400

ctgaagtacc tgggctccct ggtgcagtac tggggcctgg agctgaagcg ctccgccatc     2460

tccctgctgg acaccctggc catcgccgtg ggcgagggca ccgaccgcat cctggagttc     2520

gtgctgggca tctgccgcgc catccgcaac atccccaccc gcatccgcca gggcttcgag     2580

accgccctgc tgtagggatc c                                               2601


<210> 9
<211> 863
<212> PRT
<213> Human immunodeficiency virus 1

<400> 9
Met Lys Val Arg Gly Ile Gln Arg Asn Tyr Pro Gln Trp Trp Ile Trp 
1               5                   10                  15      


Ser Met Leu Gly Leu Trp Met Leu Met Ile Cys Asn Gly Met Trp Val 
            20                  25                  30          


Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Lys Thr Thr Leu 
        35                  40                  45              


Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Lys Glu Val His Asn Val 
    50                  55                  60                  


Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Met 
65                  70                  75                  80  


Val Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asp Met 
                85                  90                  95      


Ala Asp Gln Met His Glu Asp Val Ile Ser Leu Trp Asp Gln Ser Leu 
            100                 105                 110         


Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys Thr 
        115                 120                 125             


Asp Ala Asn Ala Thr Ala Ser Asn Thr Asn Ala Thr Ala Ser Asn Ile 
    130                 135                 140                 


Asn Ala Thr Ala Ser Lys Ser Ser Ile Ile Glu Glu Met Lys Asn Cys 
145                 150                 155                 160 


Ser Phe Asn Ile Thr Thr Glu Leu Arg Asp Lys Arg Glu Lys Lys Tyr 
                165                 170                 175     


Ala Leu Phe Tyr Lys Leu Asp Ile Val Gln Leu Asp Gly Asn Ser Ser 
            180                 185                 190         


Gln Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Thr Gln Ala Cys 
        195                 200                 205             


Pro Lys Val Ser Phe Asp Pro Ile Pro Ile His Tyr Cys Ala Pro Ala 
    210                 215                 220                 


Gly Tyr Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly 
225                 230                 235                 240 


Pro Cys Asn Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro 
                245                 250                 255     


Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Gly Glu 
            260                 265                 270         


Ile Ile Ile Arg Ser Glu Asn Ile Thr Asp Asn Ser Lys Thr Ile Ile 
        275                 280                 285             


Val His Leu Asn Glu Ser Val Lys Ile Glu Cys Thr Arg Pro Ser Asn 
    290                 295                 300                 


Asn Thr Arg Thr Ser Ile Arg Ile Gly Pro Gly Gln Ala Phe Tyr Ala 
305                 310                 315                 320 


Thr Gly Gln Val Ile Gly Asp Ile Arg Glu Ala His Cys Asn Ile Ser 
                325                 330                 335     


Glu Ser Lys Trp Asn Glu Thr Leu Gln Arg Val Ser Lys Lys Leu Lys 
            340                 345                 350         


Glu Tyr Phe Pro Asp Lys Asn Ile Thr Phe Gln Pro Ser Ser Gly Gly 
        355                 360                 365             


Asp Pro Glu Ile Thr Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe 
    370                 375                 380                 


Tyr Cys Asn Thr Ser Ser Leu Phe Asn Arg Thr Tyr Met Ala Asn Ser 
385                 390                 395                 400 


Thr Glu Thr Asn Ser Thr Arg Thr Ile Thr Leu His Cys Arg Ile Lys 
                405                 410                 415     


Gln Ile Ile Asn Met Trp Gln Glu Val Gly Arg Ala Met Tyr Ala Pro 
            420                 425                 430         


Pro Ile Ala Gly Asn Ile Thr Cys Ile Ser Asn Ile Thr Gly Leu Leu 
        435                 440                 445             


Leu Thr Arg Asp Gly Gly Glu Asn Thr Arg Asp Gly Gly Asn Asn Asn 
    450                 455                 460                 


Thr Glu Thr Phe Arg Pro Glu Gly Gly Asn Met Lys Asp Asn Trp Arg 
465                 470                 475                 480 


Ser Glu Leu Tyr Lys Tyr Lys Val Val Glu Val Lys Pro Leu Gly Val 
                485                 490                 495     


Ala Pro Thr Lys Ala Arg Arg Arg Val Val Glu Arg Glu Lys Arg Ala 
            500                 505                 510         


Val Gly Met Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser 
        515                 520                 525             


Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu 
    530                 535                 540                 


Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Lys Ala Ile Glu 
545                 550                 555                 560 


Ala Gln Gln His Met Leu Lys Leu Thr Val Trp Gly Ile Lys Gln Leu 
                565                 570                 575     


Gln Ala Arg Val Leu Ala Leu Glu Arg Tyr Leu Lys Asp Gln Gln Leu 
            580                 585                 590         


Leu Gly Met Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Asn Val 
        595                 600                 605             


Tyr Trp Asn Ser Ser Trp Ser Asn Lys Thr Tyr Gly Asp Ile Trp Asp 
    610                 615                 620                 


Asn Met Thr Trp Met Gln Trp Glu Arg Glu Ile Ser Asn Tyr Thr Asp 
625                 630                 635                 640 


Ile Ile Tyr Asp Leu Leu Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn 
                645                 650                 655     


Glu Gln Asp Leu Leu Ala Leu Asp Arg Trp Asn Ser Leu Trp Asn Trp 
            660                 665                 670         


Phe Asn Ile Thr Lys Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile 
        675                 680                 685             


Val Gly Gly Leu Ile Gly Leu Arg Ile Ile Phe Ala Val Leu Ser Leu 
    690                 695                 700                 


Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Leu Gln Thr Leu 
705                 710                 715                 720 


Ile Pro Ser Pro Arg Gly Pro Asp Arg Pro Gly Gly Ile Glu Glu Glu 
                725                 730                 735     


Gly Gly Glu Gln Asp Arg Asn Arg Ser Thr Arg Leu Val Ser Gly Phe 
            740                 745                 750         


Leu Ala Leu Ala Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe Ile Tyr 
        755                 760                 765             


His Arg Leu Arg Asp Phe Ile Leu Ile Ala Ala Arg Ala Gly Glu Leu 
    770                 775                 780                 


Leu Gly Arg Ser Ser Leu Lys Gly Leu Arg Arg Gly Trp Glu Ala Leu 
785                 790                 795                 800 


Lys Tyr Leu Gly Gly Leu Val Gln Tyr Trp Gly Leu Glu Leu Lys Arg 
                805                 810                 815     


Ser Ala Ile Ser Leu Leu Asp Thr Leu Ala Ile Ala Val Gly Glu Gly 
            820                 825                 830         


Thr Asp Arg Ile Leu Glu Phe Val Leu Gly Ile Cys Arg Ala Ile Arg 
        835                 840                 845             


Asn Ile Pro Thr Arg Ile Arg Gln Gly Phe Glu Thr Ala Leu Leu 
    850                 855                 860             


<210> 10
<211> 2613
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 10
gtcgacgcta gcaccatgaa ggtgcgcggc atccagcgca actaccccca gtggtggatc       60

tggtccatgc tgggcctgtg gatgctgatg atctgcaacg gcatgtgggt gaccgtgtac      120

tacggcgtgc ccgtgtggaa ggaggccaag accaccctgt tctgcgcctc cgacgccaag      180

gcctacgaga aggaggtgca caacgtgtgg gccacccacg cctgcgtgcc caccgacccc      240

aacccccagg agatggtgct ggagaacgtg accgagaact tcaacatgtg gaagaacgac      300

atggccgacc agatgcacga ggacgtgatc tccctgtggg accagtccct gaagccctgc      360

gtgaagctga cccccctgtg cgtgaccctg aactgcaccg acgccaacgc caccgcctcc      420

aacaccaacg ccaccgcctc caacatcaac gccaccgcct ccaagtcctc catcatcgag      480

gagatgaaga actgctcctt caacatcacc accgagctgc gcgacaagcg cgagaagaag      540

tacgccctgt tctacaagct ggacatcgtg cagctggacg gcaactcctc ccagtaccgc      600

ctgatcaact gcaacacctc cgtgatcacc caggcctgcc ccaaggtgtc cttcgacccc      660

atccccatcc actactgcgc ccccgccggc tacgccatcc tgaagtgcaa caacaagacc      720

ttcaacggca ccggcccctg caacaacgtg tccaccgtgc agtgcaccca cggcatcaag      780

cccgtggtgt ccacccagct gctgctgaac ggctccctgg ccgagggcga gatcatcatc      840

cgctccgaga acatcaccga caactccaag accatcatcg tgcacctgaa cgagtccgtg      900

aagatcgagt gcacccgccc ctccaacaac acccgcacct ccatccgcat cggccccggc      960

caggccttct acgccaccgg ccaggtgatc ggcgacatcc gcgaggccca ctgcaacatc     1020

tccgagtcca agtggaacga gaccctgcag cgcgtgtcca agaagctgaa ggagtacttc     1080

cccgacaaga acatcacctt ccagccctcc tccggcggcg accccgagat caccacccac     1140

tccttcaact gcggcggcga gttcttctac tgcaacacct cctccctgtt caaccgcacc     1200

tacatggcca actccaccga gaccaactcc acccgcacca tcaccctgca ctgccgcatc     1260

aagcagatca tcaacatgtg gcaggaggtg ggccgcgcca tgtacgcccc ccccatcgcc     1320

ggcaacatca cctgcatctc caacatcacc ggcctgctgc tgacccgcga cggcggcgag     1380

aacacccgcg acggcggcaa caacaacacc gagaccttcc gccccgaggg cggcaacatg     1440

aaggacaact ggcgctccga gctgtacaag tacaaggtgg tggaggtgaa gcccctgggc     1500

gtggccccca ccaaggcccg ccgccgcgtg gtggagcgcg agaagcgcgc cgtgggcatg     1560

ggcgccgtgt tcctgggctt cctgggcgcc gccggctcca ccatgggcgc cgcctccatc     1620

accctgaccg tgcaggcccg ccagctgctg tccggcatcg tgcagcagca gtccaacctg     1680

ctgaaggcca tcgaggccca gcagcacatg ctgaagctga ccgtgtgggg catcaagcag     1740

ctgcaggccc gcgtgctggc cctggagcgc tacctgaagg accagcagct gctgggcatg     1800

tggggctgct ccggcaagct gatctgcacc accaacgtgt actggaactc ctcctggtcc     1860

aacaagacct acggcgacat ctgggacaac atgacctgga tgcagtggga gcgcgagatc     1920

tccaactaca ccgacatcat ctacgacctg ctggaggagt cccagaacca gcaggagaag     1980

aacgagcagg acctgctggc cctggaccgc tggaactccc tgtggaactg gttcaacatc     2040

accaagtggc tgtggtacat caagatcttc atcatgatcg tgggcggcct gatcggcctg     2100

cgcatcatct tcgccgtgct gtccctggtg aaccgcgtgc gccagggcta ctcccccctg     2160

tccctgcaga ccctgatccc ctccccccgc ggccccgacc gccccggcgg catcgaggag     2220

gagggcggcg agcaggaccg caaccgctcc acccgcctgg tgtccggctt cctggccctg     2280

gcctgggacg acctgcgctc cctgtgcctg ttcatctacc accgcctgcg cgacttcatc     2340

ctgatcgccg cccgcgccgg cgagctgctg ggccgctcct ccctgaaggg cctgcgccgc     2400

ggctgggagg ccctgaagta cctgggcggc ctggtgcagt actggggcct ggagctgaag     2460

cgctccgcca tctccctgct ggacaccctg gccatcgccg tgggcgaggg caccgaccgc     2520

atcctggagt tcgtgctggg catctgccgc gccatccgca acatccccac ccgcatccgc     2580

cagggcttcg agaccgccct gctgtaggga tcc                                  2613


<210> 11
<211> 848
<212> PRT
<213> Human immunodeficiency virus 1

<400> 11
Met Arg Val Thr Gly Ile Gln Arg Asn Tyr Pro Gln Trp Trp Ile Trp 
1               5                   10                  15      


Ser Met Leu Gly Leu Trp Met Leu Met Ile Cys Asn Gly Met Trp Val 
            20                  25                  30          


Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Lys Thr Thr Leu 
        35                  40                  45              


Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Lys Glu Val His Asn Val 
    50                  55                  60                  


Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Met 
65                  70                  75                  80  


Val Leu Lys Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asp Met 
                85                  90                  95      


Ala Asp Gln Met His Glu Asp Val Ile Ser Leu Trp Asp Gln Ser Leu 
            100                 105                 110         


Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys Ile 
        115                 120                 125             


Asp Ala Asn Ala Thr Ala Ser Asn Ala Thr Ala Ser Asn Ser Ser Ile 
    130                 135                 140                 


Ile Glu Gly Met Lys Asn Cys Ser Phe Asn Ile Thr Thr Glu Leu Arg 
145                 150                 155                 160 


Asp Lys Ile Glu Lys Lys Asn Ala Leu Phe Tyr Lys Leu Asp Ile Val 
                165                 170                 175     


Gln Leu Asp Gly Asn Ser Ser Gln Tyr Arg Leu Ile Asn Cys Asn Thr 
            180                 185                 190         


Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Asp Pro Ile Pro 
        195                 200                 205             


Ile His Tyr Cys Ala Pro Ala Gly Tyr Ala Ile Leu Lys Cys Asn Asn 
    210                 215                 220                 


Lys Thr Phe Asn Gly Thr Gly Pro Cys Asn Asn Val Ser Thr Val Gln 
225                 230                 235                 240 


Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Asn 
                245                 250                 255     


Gly Ser Leu Ala Glu Gly Glu Ile Ile Ile Arg Ser Glu Asn Ile Thr 
            260                 265                 270         


Asn Ser Ala Lys Thr Ile Ile Val His Leu Asn Glu Ser Val Lys Ile 
        275                 280                 285             


Glu Cys Thr Arg Pro Ser Asn Asn Thr Arg Thr Ser Ile Arg Ile Gly 
    290                 295                 300                 


Pro Gly Gln Ala Phe Tyr Ala Thr Gly Gln Val Ile Gly Asp Ile Arg 
305                 310                 315                 320 


Lys Ala His Cys Asn Ile Ser Glu Ser Lys Trp Asn Glu Thr Leu Gln 
                325                 330                 335     


Arg Val Ser Lys Lys Leu Lys Glu Tyr Phe Pro His Lys Asn Ile Thr 
            340                 345                 350         


Phe Gln Pro Ser Ser Gly Gly Asp Leu Glu Ile Thr Thr His Ser Phe 
        355                 360                 365             


Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Ser Ser Leu Phe Asn 
    370                 375                 380                 


Arg Thr Tyr Met Ala Asn Ser Thr Glu Thr Asn Ser Thr Arg Thr Ile 
385                 390                 395                 400 


Thr Leu His Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val 
                405                 410                 415     


Gly Arg Ala Met Tyr Ala Pro Pro Ile Ala Gly Asn Ile Thr Cys Ile 
            420                 425                 430         


Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Asn Asn 
        435                 440                 445             


Thr Thr Glu Thr Phe Arg Pro Gly Gly Gly Asn Met Lys Asp Asn Trp 
    450                 455                 460                 


Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Glu Ile Lys Pro Leu Gly 
465                 470                 475                 480 


Val Ala Pro Thr Asn Ala Arg Arg Arg Val Val Glu Arg Glu Lys Arg 
                485                 490                 495     


Ala Val Gly Met Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly 
            500                 505                 510         


Ser Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln 
        515                 520                 525             


Leu Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Lys Ala Ile 
    530                 535                 540                 


Glu Ala Gln Gln His Met Leu Lys Leu Thr Val Trp Gly Ile Lys Gln 
545                 550                 555                 560 


Leu Gln Ala Arg Val Leu Ala Leu Glu Arg Tyr Leu Lys Asp Gln Gln 
                565                 570                 575     


Leu Leu Gly Met Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Asn 
            580                 585                 590         


Val Tyr Trp Asn Ser Ser Trp Ser Asn Lys Thr Tyr Gly Asp Ile Trp 
        595                 600                 605             


Asp Asn Met Thr Trp Met Gln Trp Glu Arg Glu Ile Ser Asp Tyr Thr 
    610                 615                 620                 


Glu Ile Ile Tyr Glu Leu Leu Glu Glu Ser Gln Asn Gln Gln Glu Lys 
625                 630                 635                 640 


Asn Glu Gln Asp Leu Leu Ala Leu Asp Arg Trp Asn Ser Leu Trp Asn 
                645                 650                 655     


Trp Phe Asn Ile Thr Asn Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met 
            660                 665                 670         


Ile Val Gly Gly Leu Ile Gly Leu Arg Ile Ile Phe Ala Val Leu Ser 
        675                 680                 685             


Leu Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Leu Gln Thr 
    690                 695                 700                 


Leu Thr Pro Ser Pro Arg Gly Pro Asp Arg Pro Gly Gly Ile Glu Glu 
705                 710                 715                 720 


Glu Gly Gly Glu Gln Asp Arg Asn Arg Ser Thr Arg Leu Val Ser Gly 
                725                 730                 735     


Phe Leu Ala Leu Ala Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe Ile 
            740                 745                 750         


Tyr His Arg Leu Arg Asp Phe Ile Leu Ile Ala Ala Arg Ala Gly Glu 
        755                 760                 765             


Leu Leu Gly Arg Ser Ser Leu Lys Gly Leu Arg Arg Gly Trp Glu Ala 
    770                 775                 780                 


Leu Lys Tyr Leu Gly Gly Leu Val Gln Tyr Trp Gly Leu Glu Leu Lys 
785                 790                 795                 800 


Arg Ser Ala Ile Ser Leu Leu Asp Thr Leu Ala Ile Ala Val Gly Glu 
                805                 810                 815     


Gly Thr Asp Arg Ile Leu Glu Phe Val Leu Gly Ile Cys Arg Ala Ile 
            820                 825                 830         


Arg Asn Ile Pro Thr Arg Ile Arg Gln Gly Phe Glu Thr Ala Leu Leu 
        835                 840                 845             


<210> 12
<211> 2568
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 12
gtcgacgcta gcaccatgcg cgtgaccggc atccagcgca actaccccca gtggtggatc       60

tggtccatgc tgggcctgtg gatgctgatg atctgcaacg gcatgtgggt gaccgtgtac      120

tacggcgtgc ccgtgtggaa ggaggccaag accaccctgt tctgcgcctc cgacgccaag      180

gcctacgaga aggaggtgca caacgtgtgg gccacccacg cctgcgtgcc caccgacccc      240

aacccccagg agatggtgct gaagaacgtg accgagaact tcaacatgtg gaagaacgac      300

atggccgacc agatgcacga ggacgtgatc tccctgtggg accagtccct gaagccctgc      360

gtgaagctga cccccctgtg cgtgaccctg aactgcatcg acgccaacgc caccgcctcc      420

aacgccaccg cctccaactc ctccatcatc gagggcatga agaactgctc cttcaacatc      480

accaccgagc tgcgcgacaa gatcgagaag aagaacgccc tgttctacaa gctggacatc      540

gtgcagctgg acggcaactc ctcccagtac cgcctgatca actgcaacac ctccgtgatc      600

acccaggcct gccccaaggt gtccttcgac cccatcccca tccactactg cgcccccgcc      660

ggctacgcca tcctgaagtg caacaacaag accttcaacg gcaccggccc ctgcaacaac      720

gtgtccaccg tgcagtgcac ccacggcatc aagcccgtgg tgtccaccca gctgctgctg      780

aacggctccc tggccgaggg cgagatcatc atccgctccg agaacatcac caactccgcc      840

aagaccatca tcgtgcacct gaacgagtcc gtgaagatcg agtgcacccg cccctccaac      900

aacacccgca cctccatccg catcggcccc ggccaggcct tctacgccac cggccaggtg      960

atcggcgaca tccgcaaggc ccactgcaac atctccgagt ccaagtggaa cgagaccctg     1020

cagcgcgtgt ccaagaagct gaaggagtac ttcccccaca agaacatcac cttccagccc     1080

tcctccggcg gcgacctgga gatcaccacc cactccttca actgcggcgg cgagttcttc     1140

tactgcaaca cctcctccct gttcaaccgc acctacatgg ccaactccac cgagaccaac     1200

tccacccgca ccatcaccct gcactgccgc atcaagcaga tcatcaacat gtggcaggag     1260

gtgggccgcg ccatgtacgc cccccccatc gccggcaaca tcacctgcat ctccaacatc     1320

accggcctgc tgctgacccg cgacggcggc aacaacaaca ccaccgagac cttccgcccc     1380

ggcggcggca acatgaagga caactggcgc tccgagctgt acaagtacaa ggtggtggag     1440

atcaagcccc tgggcgtggc ccccaccaac gcccgccgcc gcgtggtgga gcgcgagaag     1500

cgcgccgtgg gcatgggcgc cgtgttcctg ggcttcctgg gcgccgccgg ctccaccatg     1560

ggcgccgcct ccatcaccct gaccgtgcag gcccgccagc tgctgtccgg catcgtgcag     1620

cagcagtcca acctgctgaa ggccatcgag gcccagcagc acatgctgaa gctgaccgtg     1680

tggggcatca agcagctgca ggcccgcgtg ctggccctgg agcgctacct gaaggaccag     1740

cagctgctgg gcatgtgggg ctgctccggc aagctgatct gcaccaccaa cgtgtactgg     1800

aactcctcct ggtccaacaa gacctacggc gacatctggg acaacatgac ctggatgcag     1860

tgggagcgcg agatctccga ctacaccgag atcatctacg agctgctgga ggagtcccag     1920

aaccagcagg agaagaacga gcaggacctg ctggccctgg accgctggaa ctccctgtgg     1980

aactggttca acatcaccaa ctggctgtgg tacatcaaga tcttcatcat gatcgtgggc     2040

ggcctgatcg gcctgcgcat catcttcgcc gtgctgtccc tggtgaaccg cgtgcgccag     2100

ggctactccc ccctgtccct gcagaccctg accccctccc cccgcggccc cgaccgcccc     2160

ggcggcatcg aggaggaggg cggcgagcag gaccgcaacc gctccacccg cctggtgtcc     2220

ggcttcctgg ccctggcctg ggacgacctg cgctccctgt gcctgttcat ctaccaccgc     2280

ctgcgcgact tcatcctgat cgccgcccgc gccggcgagc tgctgggccg ctcctccctg     2340

aagggcctgc gccgcggctg ggaggccctg aagtacctgg gcggcctggt gcagtactgg     2400

ggcctggagc tgaagcgctc cgccatctcc ctgctggaca ccctggccat cgccgtgggc     2460

gagggcaccg accgcatcct ggagttcgtg ctgggcatct gccgcgccat ccgcaacatc     2520

cccacccgca tccgccaggg cttcgagacc gccctgctgt agggatcc                  2568


<210> 13
<211> 801
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 13
Met Pro Met Gly Ser Leu Gln Pro Leu Ala Thr Leu Tyr Leu Leu Gly 
1               5                   10                  15      


Met Leu Val Ala Ser Cys Leu Gly Met Trp Val Thr Val Tyr Tyr Gly 
            20                  25                  30          


Val Pro Val Trp Lys Glu Ala Lys Thr Thr Leu Phe Cys Ala Ser Asp 
        35                  40                  45              


Ala Lys Ala Tyr Glu Lys Glu Val His Asn Val Trp Ala Thr His Ala 
    50                  55                  60                  


Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Met Val Leu Lys Asn Val 
65                  70                  75                  80  


Thr Glu Asn Phe Asn Met Trp Lys Asn Asp Met Val Asp Gln Met His 
                85                  90                  95      


Glu Asp Val Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys 
            100                 105                 110         


Leu Thr Pro Leu Cys Val Thr Leu Asn Cys Thr Asn Ala Thr Ala Ser 
        115                 120                 125             


Asn Ser Ser Ile Ile Glu Gly Met Lys Asn Cys Ser Phe Asn Ile Thr 
    130                 135                 140                 


Thr Glu Leu Arg Asp Lys Arg Glu Lys Lys Asn Ala Leu Phe Tyr Lys 
145                 150                 155                 160 


Leu Asp Ile Val Gln Leu Asp Gly Asn Ser Ser Gln Tyr Arg Leu Ile 
                165                 170                 175     


Asn Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe 
            180                 185                 190         


Asp Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Tyr Ala Ile Leu 
        195                 200                 205             


Lys Cys Asn Asn Lys Thr Phe Thr Gly Thr Gly Pro Cys Asn Asn Val 
    210                 215                 220                 


Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln 
225                 230                 235                 240 


Leu Leu Leu Asn Gly Ser Leu Ala Glu Gly Glu Ile Ile Ile Arg Ser 
                245                 250                 255     


Glu Asn Ile Thr Asn Asn Val Lys Thr Ile Ile Val His Leu Asn Glu 
            260                 265                 270         


Ser Val Lys Ile Glu Cys Thr Arg Pro Asn Asn Lys Thr Arg Thr Ser 
        275                 280                 285             


Ile Arg Ile Gly Pro Gly Gln Ala Phe Tyr Ala Thr Gly Gln Val Ile 
    290                 295                 300                 


Gly Asp Ile Arg Glu Ala Tyr Cys Asn Ile Asn Glu Ser Lys Trp Asn 
305                 310                 315                 320 


Glu Thr Leu Gln Arg Val Ser Lys Lys Leu Lys Glu Tyr Phe Pro His 
                325                 330                 335     


Lys Asn Ile Thr Phe Gln Pro Ser Ser Gly Gly Asp Leu Glu Ile Thr 
            340                 345                 350         


Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Ser 
        355                 360                 365             


Ser Leu Phe Asn Arg Thr Tyr Met Ala Asn Ser Thr Asp Met Ala Asn 
    370                 375                 380                 


Ser Thr Glu Thr Asn Ser Thr Arg Thr Ile Thr Ile His Cys Arg Ile 
385                 390                 395                 400 


Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Arg Ala Met Tyr Ala 
                405                 410                 415     


Pro Pro Ile Ala Gly Asn Ile Thr Cys Ile Ser Asn Ile Thr Gly Leu 
            420                 425                 430         


Leu Leu Thr Arg Asp Gly Gly Lys Asn Asn Thr Glu Thr Phe Arg Pro 
        435                 440                 445             


Gly Gly Gly Asn Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr 
    450                 455                 460                 


Lys Val Val Glu Val Lys Pro Leu Gly Val Ala Pro Thr Asn Ala Arg 
465                 470                 475                 480 


Arg Arg Val Val Glu Ser Glu Lys Ser Ala Val Gly Met Gly Ala Val 
                485                 490                 495     


Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser 
            500                 505                 510         


Ile Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser Gly Ile Val Gln 
        515                 520                 525             


Gln Gln Ser Asn Leu Leu Lys Ala Ile Glu Ala Gln Gln His Met Leu 
    530                 535                 540                 


Lys Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala 
545                 550                 555                 560 


Leu Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly Met Trp Gly Cys 
                565                 570                 575     


Ser Gly Lys Leu Ile Cys Thr Thr Asn Val Tyr Trp Asn Ser Ser Trp 
            580                 585                 590         


Ser Asn Lys Thr Tyr Gly Asp Ile Trp Asp Asn Met Thr Trp Met Gln 
        595                 600                 605             


Trp Glu Arg Glu Ile Ser Asn Tyr Thr Glu Ile Ile Tyr Glu Leu Leu 
    610                 615                 620                 


Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Gln Asp Leu Leu Ala 
625                 630                 635                 640 


Leu Asp Arg Trp Asn Ser Leu Trp Asn Trp Phe Asn Ile Thr Asn Trp 
                645                 650                 655     


Leu Gly Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu Ile Gly 
            660                 665                 670         


Leu Arg Ile Ile Phe Ala Val Leu Ser Leu Val Asn Arg Val Arg Gln 
        675                 680                 685             


Gly Ala Ser Pro Leu Ser Leu Gln Thr Leu Ile Pro Ser Pro Arg Gly 
    690                 695                 700                 


Pro Asp Arg Pro Gly Gly Ile Glu Glu Glu Gly Gly Glu Gln Asp Arg 
705                 710                 715                 720 


Asn Arg Ser Thr Arg Leu Val Ser Gly Phe Leu Ala Leu Val Trp Asp 
                725                 730                 735     


Asp Leu Arg Ser Leu Cys Leu Phe Ile Tyr His Arg Leu Arg Asp Phe 
            740                 745                 750         


Ile Leu Ile Ala Ala Arg Ala Gly Glu Leu Leu Gly Arg Ser Ser Leu 
        755                 760                 765             


Lys Gly Leu Arg Arg Gly Trp Glu Ala Leu Lys Tyr Leu Gly Ser Leu 
    770                 775                 780                 


Val Gln Tyr Trp Gly Leu Glu Leu Gly Gly Gly His His His His His 
785                 790                 795                 800 


His 
    


<210> 14
<211> 2424
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 14
ggatccgcca ccatgcccat gggctccctg cagcccctgg ccaccctgta cctgctgggc       60

atgctggtgg cctcctgcct gggcatgtgg gtgaccgtgt actacggcgt gcccgtgtgg      120

aaggaggcca agaccaccct gttctgcgcc tccgacgcca aggcctacga gaaggaggtg      180

cacaacgtgt gggccaccca cgcctgcgtg cccaccgacc ccaaccccca ggagatggtg      240

ctgaagaacg tgaccgagaa cttcaacatg tggaagaacg acatggtgga ccagatgcac      300

gaggacgtga tctccctgtg ggaccagtcc ctgaagccct gcgtgaagct gacccccctg      360

tgcgtgaccc tgaactgcac caacgccacc gcctccaact cctccatcat cgagggcatg      420

aagaactgct ccttcaacat caccaccgag ctgcgcgaca agcgcgagaa gaagaacgcc      480

ctgttctaca agctggacat cgtgcagctg gacggcaact cctcccagta ccgcctgatc      540

aactgcaaca cctccgtgat cacccaggcc tgccccaagg tgtccttcga ccccatcccc      600

atccactact gcgcccccgc cggctacgcc atcctgaagt gcaacaacaa gaccttcacc      660

ggcaccggcc cctgcaacaa cgtgtccacc gtgcagtgca cccacggcat caagcccgtg      720

gtgtccaccc agctgctgct gaacggctcc ctggccgagg gcgagatcat catccgctcc      780

gagaacatca ccaacaacgt gaagaccatc atcgtgcacc tgaacgagtc cgtgaagatc      840

gagtgcaccc gccccaacaa caagacccgc acctccatcc gcatcggccc cggccaggcc      900

ttctacgcca ccggccaggt gatcggcgac atccgcgagg cctactgcaa catcaacgag      960

tccaagtgga acgagaccct gcagcgcgtg tccaagaagc tgaaggagta cttcccccac     1020

aagaacatca ccttccagcc ctcctccggc ggcgacctgg agatcaccac ccactccttc     1080

aactgcggcg gcgagttctt ctactgcaac acctcctccc tgttcaaccg cacctacatg     1140

gccaactcca ccgacatggc caactccacc gagaccaact ccacccgcac catcaccatc     1200

cactgccgca tcaagcagat catcaacatg tggcaggagg tgggccgcgc catgtacgcc     1260

ccccccatcg ccggcaacat cacctgcatc tccaacatca ccggcctgct gctgacccgc     1320

gacggcggca agaacaacac cgagaccttc cgccccggcg gcggcaacat gaaggacaac     1380

tggcgctccg agctgtacaa gtacaaggtg gtggaggtga agcccctggg cgtggccccc     1440

accaacgccc gccgccgcgt ggtggagtcc gagaagtccg ccgtgggcat gggcgccgtg     1500

ttcctgggct tcctgggcgc cgccggctcc accatgggcg ccgcctccat caccctgacc     1560

gtgcaggccc gccagctgct gtccggcatc gtgcagcagc agtccaacct gctgaaggcc     1620

atcgaggccc agcagcacat gctgaagctg accgtgtggg gcatcaagca gctgcaggcc     1680

cgcgtgctgg ccctggagcg ctacctgaag gaccagcagc tgctgggcat gtggggctgc     1740

tccggcaagc tgatctgcac caccaacgtg tactggaact cctcctggtc caacaagacc     1800

tacggcgaca tctgggacaa catgacctgg atgcagtggg agcgcgagat ctccaactac     1860

accgagatca tctacgagct gctggaggag tcccagaacc agcaggagaa gaacgagcag     1920

gacctgctgg ccctggaccg ctggaactcc ctgtggaact ggttcaacat caccaactgg     1980

ctgggctaca tcaagatctt catcatgatc gtgggcggcc tgatcggcct gcgcatcatc     2040

ttcgccgtgc tgtccctggt gaaccgcgtg cgccagggcg cctcccccct gtccctgcag     2100

accctgatcc cctccccccg cggccccgac cgccccggcg gcatcgagga ggagggcggc     2160

gagcaggacc gcaaccgctc cacccgcctg gtgtccggct tcctggccct ggtgtgggac     2220

gacctgcgct ccctgtgcct gttcatctac caccgcctgc gcgacttcat cctgatcgcc     2280

gcccgcgccg gcgagctgct gggccgctcc tccctgaagg gcctgcgccg cggctgggag     2340

gccctgaagt acctgggctc cctggtgcag tactggggcc tggagctggg cggcggccac     2400

caccaccacc accactaact cgag                                            2424


