                         SEQUENCE LISTING

<110>  DANA-FARBER CANCER INSTITUTE, INC.
 
<120>  Compositions And Methods For Conformationally Stabilizing Primate
       Immunodeficiency Virus Envelope Glycoprotein Trimers

<130>  DFS-109.25

<140>  PCT/US2013/052855
<141>  2013-07-31

<150>  61/742,139
<151>  2012-08-03

<160>  18    

<170>  PatentIn version 3.5

<210>  1
<211>  2571
<212>  DNA
<213>  Human immunodeficiency virus type 1

<400>  1
atgagagtga aggagaaata tcagcacttg tggagatggg ggtggagatg gggcaccatg       60

ctccttggga tgttgatgat ctgtagtgct acagaaaaat tgtgggtcac agtctattat      120

ggggtacctg tgtggaagga agcaaccacc actctatttt gtgcatcaga tgctaaagca      180

tatgatacag aggtacataa tgtttgggcc acacatgcct gtgtacccac agaccccaac      240

ccacaagaag tagtattggt aaatgtgaca gaaaatttta acatgtggaa aaatgacatg      300

gtagaacaga tgcatgagga tataatcagt ttatgggatc aaagcctaaa gccatgtgta      360

aaattaaccc cactctgtgt tagtttaaag tgcactgatt tgaagaatga tactaatacc      420

aatagtagta gcgggagaat gataatggag aaaggagaga taaaaaactg ctctttcaat      480

atcagcacaa gcataagagg taaggtgcag aaagaatatg cattttttta taaacttgat      540

ataataccaa tagataatga tactaccagc tataagttga caagttgtaa cacctcagtc      600

attacacagg cctgtccaaa ggtatccttt gagccaattc ccatacatta ttgtgccccg      660

gctggttttg cgattctaaa atgtaataat aagacgttca atggaacagg accatgtaca      720

aatgtcagca cagtacaatg tacacatgga attaggccag tagtatcaac tcaactgctg      780

ttaaatggca gtctagcaga agaagaggta gtaattagat ctgtcaattt cacggacaat      840

gctaaaacca taatagtaca gctgaacaca tctgtagaaa ttaattgtac aagacccaac      900

aacaatacaa gaaaaagaat ccgtatccag agaggaccag ggagagcatt tgttacaata      960

ggaaaaatag gaaatatgag acaagcacat tgtaacatta gtagagcaaa atggaataac     1020

actttaaaac agatagctag caaattaaga gaacaatttg gaaataataa aacaataatc     1080

tttaagcaat cctcaggagg ggacccagaa attgtaacgc acagttttaa ttgtggaggg     1140

gaatttttct actgtaattc aacacaactg tttaatagta cttggtttaa tagtacttgg     1200

agtactgaag ggtcaaataa cactgaagga agtgacacaa tcaccctccc atgcagaata     1260

aaacaaatta taaacatgtg gcagaaagta ggaaaagcaa tgtatgcccc tcccatcagt     1320

ggacaaatta gatgttcatc aaatattaca gggctgctat taacaagaga tggtggtaat     1380

agcaacaatg agtccgagat cttcagacct ggaggaggag atatgaggga caattggaga     1440

agtgaattat ataaatataa agtagtaaaa attgaaccat taggagtagc acccaccaag     1500

gcaaagagaa gagtggtgca gagagaaaaa agagcagtgg gaataggagc tttgttcctt     1560

gggttcttgg gagcagcagg aagcactatg ggcgcagcct caatgacgct gacggtacag     1620

gccagacaat tattgtctgg tatagtgcag cagcagaaca atttgctgag ggctattgag     1680

gcgcaacagc atctgttgca actcacagtc tggggcatca agcagctcca ggcaagaatc     1740

ctggctgtgg aaagatacct aaaggatcaa cagctcctgg ggatttgggg ttgctctgga     1800

aaactcattt gcaccactgc tgtgccttgg aatgctagtt ggagtaataa atctctggaa     1860

cagatttgga atcacacgac ctggatggag tgggacagag aaattaacaa ttacacaagc     1920

ttaatacact ccttaattga agaatcgcaa aaccagcaag aaaagaatga acaagaatta     1980

ttggaattag ataaatgggc aagtttgtgg aattggttta acataacaaa ttggctgtgg     2040

tatataaaat tattcataat gatagtagga ggcttggtag gtttaagaat agtttttgct     2100

gtactttcta tagtgaatag agttaggcag ggatattcac cattatcgtt tcagacccac     2160

ctcccaaccc cgaggggacc cgacaggccc gaaggaatag aagaagaagg tggagagaga     2220

gacagagaca gatccattcg attagtgaac ggatccttgg cacttatctg ggacgatctg     2280

cggagcctgt gcctcttcag ctaccaccgc ttgagagact tactcttgat tgtaacgagg     2340

attgtggaac ttctgggacg cagggggtgg gaagccctca aatattggtg gaatctccta     2400

cagtattgga gtcaggaact aaagaatagt gctgttagct tgctcaatgc cacagccata     2460

gcagtagctg aggggacaga tagggttata gaagtagtac aaggagcttg tagagctatt     2520

cgccacatac ctagaagaat aagacagggc ttggaaagga ttttgctata a              2571


<210>  2
<211>  856
<212>  PRT
<213>  Human immunodeficiency virus type 1

<400>  2

Met Arg Val Lys Glu Lys Tyr Gln His Leu Trp Arg Trp Gly Trp Arg 
1               5                   10                  15      


Trp Gly Thr Met Leu Leu Gly Met Leu Met Ile Cys Ser Ala Thr Glu 
            20                  25                  30          


Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala 
        35                  40                  45              


Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu 
    50                  55                  60                  


Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn 
65                  70                  75                  80  


Pro Gln Glu Val Val Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp 
                85                  90                  95      


Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp 
            100                 105                 110         


Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Ser 
        115                 120                 125             


Leu Lys Cys Thr Asp Leu Lys Asn Asp Thr Asn Thr Asn Ser Ser Ser 
    130                 135                 140                 


Gly Arg Met Ile Met Glu Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn 
145                 150                 155                 160 


Ile Ser Thr Ser Ile Arg Gly Lys Val Gln Lys Glu Tyr Ala Phe Phe 
                165                 170                 175     


Tyr Lys Leu Asp Ile Ile Pro Ile Asp Asn Asp Thr Thr Ser Tyr Lys 
            180                 185                 190         


Leu Thr Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val 
        195                 200                 205             


Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala 
    210                 215                 220                 


Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr 
225                 230                 235                 240 


Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser 
                245                 250                 255     


Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile 
            260                 265                 270         


Arg Ser Val Asn Phe Thr Asp Asn Ala Lys Thr Ile Ile Val Gln Leu 
        275                 280                 285             


Asn Thr Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg 
    290                 295                 300                 


Lys Arg Ile Arg Ile Gln Arg Gly Pro Gly Arg Ala Phe Val Thr Ile 
305                 310                 315                 320 


Gly Lys Ile Gly Asn Met Arg Gln Ala His Cys Asn Ile Ser Arg Ala 
                325                 330                 335     


Lys Trp Asn Asn Thr Leu Lys Gln Ile Ala Ser Lys Leu Arg Glu Gln 
            340                 345                 350         


Phe Gly Asn Asn Lys Thr Ile Ile Phe Lys Gln Ser Ser Gly Gly Asp 
        355                 360                 365             


Pro Glu Ile Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr 
    370                 375                 380                 


Cys Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Trp 
385                 390                 395                 400 


Ser Thr Glu Gly Ser Asn Asn Thr Glu Gly Ser Asp Thr Ile Thr Leu 
                405                 410                 415     


Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Lys Val Gly Lys 
            420                 425                 430         


Ala Met Tyr Ala Pro Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser Asn 
        435                 440                 445             


Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Ser Asn Asn Glu 
    450                 455                 460                 


Ser Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg 
465                 470                 475                 480 


Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val 
                485                 490                 495     


Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala 
            500                 505                 510         


Val Gly Ile Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser 
        515                 520                 525             


Thr Met Gly Ala Ala Ser Met Thr Leu Thr Val Gln Ala Arg Gln Leu 
    530                 535                 540                 


Leu Ser Gly Ile Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Ile Glu 
545                 550                 555                 560 


Ala Gln Gln His Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu 
                565                 570                 575     


Gln Ala Arg Ile Leu Ala Val Glu Arg Tyr Leu Lys Asp Gln Gln Leu 
            580                 585                 590         


Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala Val 
        595                 600                 605             


Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Glu Gln Ile Trp Asn 
    610                 615                 620                 


His Thr Thr Trp Met Glu Trp Asp Arg Glu Ile Asn Asn Tyr Thr Ser 
625                 630                 635                 640 


Leu Ile His Ser Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn 
                645                 650                 655     


Glu Gln Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp 
            660                 665                 670         


Phe Asn Ile Thr Asn Trp Leu Trp Tyr Ile Lys Leu Phe Ile Met Ile 
        675                 680                 685             


Val Gly Gly Leu Val Gly Leu Arg Ile Val Phe Ala Val Leu Ser Ile 
    690                 695                 700                 


Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr His 
705                 710                 715                 720 


Leu Pro Thr Pro Arg Gly Pro Asp Arg Pro Glu Gly Ile Glu Glu Glu 
                725                 730                 735     


Gly Gly Glu Arg Asp Arg Asp Arg Ser Ile Arg Leu Val Asn Gly Ser 
            740                 745                 750         


Leu Ala Leu Ile Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe Ser Tyr 
        755                 760                 765             


His Arg Leu Arg Asp Leu Leu Leu Ile Val Thr Arg Ile Val Glu Leu 
    770                 775                 780                 


Leu Gly Arg Arg Gly Trp Glu Ala Leu Lys Tyr Trp Trp Asn Leu Leu 
785                 790                 795                 800 


Gln Tyr Trp Ser Gln Glu Leu Lys Asn Ser Ala Val Ser Leu Leu Asn 
                805                 810                 815     


Ala Thr Ala Ile Ala Val Ala Glu Gly Thr Asp Arg Val Ile Glu Val 
            820                 825                 830         


Val Gln Gly Ala Cys Arg Ala Ile Arg His Ile Pro Arg Arg Ile Arg 
        835                 840                 845             


Gln Gly Leu Glu Arg Ile Leu Leu 
    850                 855     


<210>  3
<211>  2544
<212>  DNA
<213>  Human immunodeficiency virus type 1

<400>  3
atgagagtga aggggatcag gaagagttat cagtacttgt ggaaaggggg caccttgctc       60

cttgggatat taatgatctg tagtgctgta gaaaagttgt gggtcacagt ctattatggg      120

gtacctgtgt ggaaagaagc aaccaccact ctattttgtg catcagatgc taaagcatat      180

gatacagagg tacataatgt ttgggccaca catgcctgtg tacccacaga ccccaaccca      240

caagaagtag tattggaaaa tgtaacagaa cattttaaca tgtggaaaaa taacatggta      300

gaacagatgc aggaggatat aatcagttta tgggatcaaa gcctaaagcc atgtgtaaaa      360

ttaaccccac tctgtgttac tttaaattgc aaggatgtga atgctactaa taccactaat      420

gatagcgagg gaacgatgga gagaggagaa ataaaaaact gctctttcaa tatcaccaca      480

agcataagag atgaggtgca gaaagaatat gctctttttt ataaacttga tgtagtacca      540

atagataata ataataccag ctataggttg ataagttgtg acacctcagt cattacacag      600

gcctgtccaa agatatcctt tgagccaatt cccatacatt attgtgcccc ggctggtttt      660

gcgattctaa agtgtaatga taagacgttc aatggaaaag gaccatgtaa aaatgtcagc      720

acagtacaat gtacacatgg aattaggcca gtagtatcaa ctcaactgct gctaaatggc      780

agtctagcag aagaagaggt agtaattaga tctgacaatt tcacgaacaa tgctaaaacc      840

ataatagtac agctgaaaga atctgtagaa attaattgta caagacccaa caacaataca      900

agaaaaagta tacatatagg accagggaga gcattttata ctacaggaga aataatagga      960

gatataagac aagcacattg taacattagt agagcaaaat ggaatgacac tttaaaacag     1020

atagttataa aattaagaga acaatttgag aataaaacaa tagtctttaa tcactcctca     1080

ggaggggacc cagaaattgt aatgcacagt tttaattgtg gaggagaatt tttctactgt     1140

aattcaacac aactgtttaa tagtacttgg aataataata ctgaagggtc aaataacact     1200

gaaggaaata ctatcacact cccatgcaga ataaaacaaa ttataaacat gtggcaggaa     1260

gtaggaaaag caatgtatgc ccctcccatc agaggacaaa ttagatgttc atcaaatatt     1320

acagggctgc tattaacaag agatggtggt attaatgaga atgggaccga gatcttcaga     1380

cctggaggag gagatatgag ggacaattgg agaagtgaat tatataaata taaagtagta     1440

aaaattgaac cattaggagt agcacccacc aaggcaaaga gaagagtggt gcaaagagaa     1500

aaaagagcag tgggaatagg agctgtgttc cttgggttct tgggagcagc aggaagcact     1560

atgggcgcag cgtcaatgac actgacggta caggccagac tattattgtc tggtatagtg     1620

caacagcaga acaatttgct gagggctatt gaggcgcaac agcgtatgtt gcaactcaca     1680

gtctggggca tcaagcagct ccaggcaaga gtcctggctg tggaaagata cctaggggat     1740

caacagctcc tggggatttg gggttgctct ggaaaactca tttgcaccac tgctgtgcct     1800

tggaatgcta gttggagtaa taaatctctg gataggattt ggaataacat gacctggatg     1860

gagtgggaaa gagaaattga caattacaca agcgaaatat acaccctaat tgaagaatcg     1920

cagaaccaac aagaaaagaa tgaacaagaa ttattggaat tagataaatg ggcaagtttg     1980

tggaattggt ttgacataac aaaatggctg tggtatataa aaatattcat aatgatagta     2040

ggaggcttag taggtttaag actagttttt actgtacttt ctatagtgaa tagagttagg     2100

cagggatact caccattatc gtttcagacc ctcctcccag ccccgagggg acccgacagg     2160

cccgaaggaa tcgaagaaga aggtggagag agagacagag acagatccgg acgattagtg     2220

aacggattct tagcacttat ctgggtcgac ctgcggagcc tgtgcctctt cagctaccac     2280

cgcttgagag acttactctt gactgtaacg aggattgtgg aacttctggg acgcaggggg     2340

tgggaagtcc tgaaatattg gtggaatctc ctacagtatt ggagtcagga actaaagaat     2400

agtgctgtta gcttgctcaa tgccacagcc atagcagtag ctgaggggac agataggatt     2460

atagaagcat tacaaagaac ttatagagct attctccaca tacctacaag aataagacag     2520

ggcttggaaa gggctttgct ataa                                            2544


<210>  4
<211>  847
<212>  PRT
<213>  Human immunodeficiency virus type 1

<400>  4

Met Arg Val Lys Gly Ile Arg Lys Ser Tyr Gln Tyr Leu Trp Lys Gly 
1               5                   10                  15      


Gly Thr Leu Leu Leu Gly Ile Leu Met Ile Cys Ser Ala Val Glu Lys 
            20                  25                  30          


Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr 
        35                  40                  45              


Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val 
    50                  55                  60                  


His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro 
65                  70                  75                  80  


Gln Glu Val Val Leu Glu Asn Val Thr Glu His Phe Asn Met Trp Lys 
                85                  90                  95      


Asn Asn Met Val Glu Gln Met Gln Glu Asp Ile Ile Ser Leu Trp Asp 
            100                 105                 110         


Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 
        115                 120                 125             


Asn Cys Lys Asp Val Asn Ala Thr Asn Thr Thr Asn Asp Ser Glu Gly 
    130                 135                 140                 


Thr Met Glu Arg Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr 
145                 150                 155                 160 


Ser Ile Arg Asp Glu Val Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu 
                165                 170                 175     


Asp Val Val Pro Ile Asp Asn Asn Asn Thr Ser Tyr Arg Leu Ile Ser 
            180                 185                 190         


Cys Asp Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Ile Ser Phe Glu 
        195                 200                 205             


Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys 
    210                 215                 220                 


Cys Asn Asp Lys Thr Phe Asn Gly Lys Gly Pro Cys Lys Asn Val Ser 
225                 230                 235                 240 


Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu 
                245                 250                 255     


Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Asp 
            260                 265                 270         


Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Lys Glu Ser 
        275                 280                 285             


Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile 
    290                 295                 300                 


His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly Glu Ile Ile Gly 
305                 310                 315                 320 


Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Ala Lys Trp Asn Asp 
                325                 330                 335     


Thr Leu Lys Gln Ile Val Ile Lys Leu Arg Glu Gln Phe Glu Asn Lys 
            340                 345                 350         


Thr Ile Val Phe Asn His Ser Ser Gly Gly Asp Pro Glu Ile Val Met 
        355                 360                 365             


His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln 
    370                 375                 380                 


Leu Phe Asn Ser Thr Trp Asn Asn Asn Thr Glu Gly Ser Asn Asn Thr 
385                 390                 395                 400 


Glu Gly Asn Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn 
                405                 410                 415     


Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly 
            420                 425                 430         


Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp 
        435                 440                 445             


Gly Gly Ile Asn Glu Asn Gly Thr Glu Ile Phe Arg Pro Gly Gly Gly 
    450                 455                 460                 


Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val 
465                 470                 475                 480 


Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val 
                485                 490                 495     


Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly 
            500                 505                 510         


Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Met Thr Leu 
        515                 520                 525             


Thr Val Gln Ala Arg Leu Leu Leu Ser Gly Ile Val Gln Gln Gln Asn 
    530                 535                 540                 


Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln Arg Met Leu Gln Leu Thr 
545                 550                 555                 560 


Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg 
                565                 570                 575     


Tyr Leu Gly Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys 
            580                 585                 590         


Leu Ile Cys Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys 
        595                 600                 605             


Ser Leu Asp Arg Ile Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg 
    610                 615                 620                 


Glu Ile Asp Asn Tyr Thr Ser Glu Ile Tyr Thr Leu Ile Glu Glu Ser 
625                 630                 635                 640 


Gln Asn Gln Gln Glu Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Lys 
                645                 650                 655     


Trp Ala Ser Leu Trp Asn Trp Phe Asp Ile Thr Lys Trp Leu Trp Tyr 
            660                 665                 670         


Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu Val Gly Leu Arg Leu 
        675                 680                 685             


Val Phe Thr Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser 
    690                 695                 700                 


Pro Leu Ser Phe Gln Thr Leu Leu Pro Ala Pro Arg Gly Pro Asp Arg 
705                 710                 715                 720 


Pro Glu Gly Ile Glu Glu Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser 
                725                 730                 735     


Gly Arg Leu Val Asn Gly Phe Leu Ala Leu Ile Trp Val Asp Leu Arg 
            740                 745                 750         


Ser Leu Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Leu Leu Thr 
        755                 760                 765             


Val Thr Arg Ile Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Val Leu 
    770                 775                 780                 


Lys Tyr Trp Trp Asn Leu Leu Gln Tyr Trp Ser Gln Glu Leu Lys Asn 
785                 790                 795                 800 


Ser Ala Val Ser Leu Leu Asn Ala Thr Ala Ile Ala Val Ala Glu Gly 
                805                 810                 815     


Thr Asp Arg Ile Ile Glu Ala Leu Gln Arg Thr Tyr Arg Ala Ile Leu 
            820                 825                 830         


His Ile Pro Thr Arg Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu 
        835                 840                 845         


<210>  5
<211>  2586
<212>  DNA
<213>  Human immunodeficiency virus type 1

<400>  5
atgagagtga tggggacaca gacgagttat cagcacttgt ggagatgggg aatcttaatt       60

ttggggatgc taataatttg taaagctaca gattggtggg tcacagtata ctatggagta      120

cctgtgtgga aagatgcaga aaccacctta ttttgcgcat cagatgataa agcatatgag      180

acagaagcgc ataatgtctg ggccacacat gcctgtgtac ccacagaccc caacccacaa      240

gaagtaaacc taaaaaatgt gacagaagat tttaacatgt ggaaaaataa tatggtagag      300

cagatgcatg aagatataat cagtctatgg gatcaaagcc taaagccatg tgtaaaatta      360

acccctctct gtgtcacgtt aaactgtagc aatgccaaca ccaatagcac caatagcact      420

agcgccccta gcatgggccc tggagaaata aaaaactgtt cttttaatgt taccacagaa      480

gtaagagata aagaaaagaa agtctatgca ctgttttata aacttgatgt agtacaaatt      540

aatgaaagtg acagtaatag tacaaaggat agtactcagt atagactaat aaattgtaat      600

acctcagcca tcacacaggc ttgtccaaag gtatcctttg agccaattcc tatacattat      660

tgtgccccag ctggttttgc gattctaaag tgtgaggatc cgagattcaa tggaacagga      720

ccatgcaata atgttagctc agtacaatgt acacatggaa ttatgccagt agcatcaact      780

caactgctgt tgaatggcag tctagcagaa aaagaggtga tgattagatc tgaaaatatt      840

acaaacaatg ccaaaaacat aatagtacag tttaatgaat cggtaccaat tacttgtatc      900

agacccaaca acaatacgag aaaaggtata cctattggac caggacaagt cttctataca      960

agtgacataa taggggatat aagacaagca tattgtagta tcaacaaaac aaaatgggat     1020

gcctctttac aaaaggtagc tgaacaatta agaaaacact tccctaataa aacaataaat     1080

tttaccaaac cctcaggagg ggatctagaa attacaacac atagttttaa ttgtggagga     1140

gaatttttct attgtaatac aacaagcctg tttaatagca catggaagaa tggcgccacc     1200

atacaggaga atagcacgga gacaaatgga attatgactc tcccatgcag aataaaacaa     1260

attgtagaca tgtggcagga agtaggacaa gcaatgtatg cccctcccat tgcaggagta     1320

atatattgta catcaaacat tacaggaata atattgacaa gagatggtgg gagcagtaac     1380

accaatagtg agatctttag gcctggagga ggagatatga gggacaattg gagaagtgaa     1440

ttatataagt ataaagtagt aaaaattgaa ccactaggag tagcaccctc cagggcaaag     1500

agaagagtgg tggagagaga aaaaagagca gtgggaatag gagctgtttt ccttgggttc     1560

ttgggagctg caggaagcac tatgggcgcg gcgtcaataa cgctgacggt acaggccaga     1620

cagttattat ctggcatagt gcaacagcaa agcaatttgc tgaaggctat agaggctcaa     1680

cagcatctgt tgaaactcac agtctggggc attaaacagc tccaggcaag agtcctggct     1740

ctggagagat acctacaaga tcaacagctc ctgggaattt ggggttgctc tggaaaactc     1800

atctgcacca ctactgtgcc ctggaactct agttggagta ataagactta cgaggagatt     1860

tggaacaaca tgacctggtt gcaatgggat agagaaattg acaattacac aaatataata     1920

tacaatctac ttgaagaatc gcagaaccag caggaaaaga atgaacaaga cttactggca     1980

ttagataaat gggcaagttt gtggaattgg tttagcataa caaactggct gtggtatata     2040

agaatattca taatgatagt aggaggcttg ataggattaa gaatagttat ggctataatt     2100

tctgtagtga atagagttag gcagggatac tcacctttgt catttcagat ccctacccca     2160

aacccagagg gtctcgacag gcacggaaga atcgaagaag gaggtggaga gcaagacaga     2220

accagatcga ttcgattagt gagcggattc ttgggacttg cctgggacga cctacggagc     2280

ctgtgcctct tcagctacca ccgattgaga gattgcatct tgattgtagc gaggactgtg     2340

gaacttctgg gacacagcag tctcaaggga ctgagactgg ggtgggaggg cctcaaatat     2400

ttggggaacc ttctactgta ttggggtcgg gaattgaaaa atagtgctat tagtttacta     2460

aattccacag caatagcagt agctgagtgg acagataggg ttatagaaat aggacaaaga     2520

gcttgcagag ctattctcaa catacctaga agaatcagac agggcttcga aagagcttta     2580

ctataa                                                                2586


<210>  6
<211>  861
<212>  PRT
<213>  Human immunodeficiency virus type 1

<400>  6

Met Arg Val Met Gly Thr Gln Thr Ser Tyr Gln His Leu Trp Arg Trp 
1               5                   10                  15      


Gly Ile Leu Ile Leu Gly Met Leu Ile Ile Cys Lys Ala Thr Asp Trp 
            20                  25                  30          


Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Asp Ala Glu Thr 
        35                  40                  45              


Thr Leu Phe Cys Ala Ser Asp Asp Lys Ala Tyr Glu Thr Glu Ala His 
    50                  55                  60                  


Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln 
65                  70                  75                  80  


Glu Val Asn Leu Lys Asn Val Thr Glu Asp Phe Asn Met Trp Lys Asn 
                85                  90                  95      


Asn Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln 
            100                 105                 110         


Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn 
        115                 120                 125             


Cys Ser Asn Ala Asn Thr Asn Ser Thr Asn Ser Thr Ser Ala Pro Ser 
    130                 135                 140                 


Met Gly Pro Gly Glu Ile Lys Asn Cys Ser Phe Asn Val Thr Thr Glu 
145                 150                 155                 160 


Val Arg Asp Lys Glu Lys Lys Val Tyr Ala Leu Phe Tyr Lys Leu Asp 
                165                 170                 175     


Val Val Gln Ile Asn Glu Ser Asp Ser Asn Ser Thr Lys Asp Ser Thr 
            180                 185                 190         


Gln Tyr Arg Leu Ile Asn Cys Asn Thr Ser Ala Ile Thr Gln Ala Cys 
        195                 200                 205             


Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala 
    210                 215                 220                 


Gly Phe Ala Ile Leu Lys Cys Glu Asp Pro Arg Phe Asn Gly Thr Gly 
225                 230                 235                 240 


Pro Cys Asn Asn Val Ser Ser Val Gln Cys Thr His Gly Ile Met Pro 
                245                 250                 255     


Val Ala Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Lys Glu 
            260                 265                 270         


Val Met Ile Arg Ser Glu Asn Ile Thr Asn Asn Ala Lys Asn Ile Ile 
        275                 280                 285             


Val Gln Phe Asn Glu Ser Val Pro Ile Thr Cys Ile Arg Pro Asn Asn 
    290                 295                 300                 


Asn Thr Arg Lys Gly Ile Pro Ile Gly Pro Gly Gln Val Phe Tyr Thr 
305                 310                 315                 320 


Ser Asp Ile Ile Gly Asp Ile Arg Gln Ala Tyr Cys Ser Ile Asn Lys 
                325                 330                 335     


Thr Lys Trp Asp Ala Ser Leu Gln Lys Val Ala Glu Gln Leu Arg Lys 
            340                 345                 350         


His Phe Pro Asn Lys Thr Ile Asn Phe Thr Lys Pro Ser Gly Gly Asp 
        355                 360                 365             


Leu Glu Ile Thr Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr 
    370                 375                 380                 


Cys Asn Thr Thr Ser Leu Phe Asn Ser Thr Trp Lys Asn Gly Ala Thr 
385                 390                 395                 400 


Ile Gln Glu Asn Ser Thr Glu Thr Asn Gly Ile Met Thr Leu Pro Cys 
                405                 410                 415     


Arg Ile Lys Gln Ile Val Asp Met Trp Gln Glu Val Gly Gln Ala Met 
            420                 425                 430         


Tyr Ala Pro Pro Ile Ala Gly Val Ile Tyr Cys Thr Ser Asn Ile Thr 
        435                 440                 445             


Gly Ile Ile Leu Thr Arg Asp Gly Gly Ser Ser Asn Thr Asn Ser Glu 
    450                 455                 460                 


Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu 
465                 470                 475                 480 


Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro 
                485                 490                 495     


Ser Arg Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly 
            500                 505                 510         


Ile Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
        515                 520                 525             


Gly Ala Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser 
    530                 535                 540                 


Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Lys Ala Ile Glu Ala Gln 
545                 550                 555                 560 


Gln His Leu Leu Lys Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala 
                565                 570                 575     


Arg Val Leu Ala Leu Glu Arg Tyr Leu Gln Asp Gln Gln Leu Leu Gly 
            580                 585                 590         


Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Thr Val Pro Trp 
        595                 600                 605             


Asn Ser Ser Trp Ser Asn Lys Thr Tyr Glu Glu Ile Trp Asn Asn Met 
    610                 615                 620                 


Thr Trp Leu Gln Trp Asp Arg Glu Ile Asp Asn Tyr Thr Asn Ile Ile 
625                 630                 635                 640 


Tyr Asn Leu Leu Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Gln 
                645                 650                 655     


Asp Leu Leu Ala Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Ser 
            660                 665                 670         


Ile Thr Asn Trp Leu Trp Tyr Ile Arg Ile Phe Ile Met Ile Val Gly 
        675                 680                 685             


Gly Leu Ile Gly Leu Arg Ile Val Met Ala Ile Ile Ser Val Val Asn 
    690                 695                 700                 


Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Ile Pro Thr Pro 
705                 710                 715                 720 


Asn Pro Glu Gly Leu Asp Arg His Gly Arg Ile Glu Glu Gly Gly Gly 
                725                 730                 735     


Glu Gln Asp Arg Thr Arg Ser Ile Arg Leu Val Ser Gly Phe Leu Gly 
            740                 745                 750         


Leu Ala Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe Ser Tyr His Arg 
        755                 760                 765             


Leu Arg Asp Cys Ile Leu Ile Val Ala Arg Thr Val Glu Leu Leu Gly 
    770                 775                 780                 


His Ser Ser Leu Lys Gly Leu Arg Leu Gly Trp Glu Gly Leu Lys Tyr 
785                 790                 795                 800 


Leu Gly Asn Leu Leu Leu Tyr Trp Gly Arg Glu Leu Lys Asn Ser Ala 
                805                 810                 815     


Ile Ser Leu Leu Asn Ser Thr Ala Ile Ala Val Ala Glu Trp Thr Asp 
            820                 825                 830         


Arg Val Ile Glu Ile Gly Gln Arg Ala Cys Arg Ala Ile Leu Asn Ile 
        835                 840                 845             


Pro Arg Arg Ile Arg Gln Gly Phe Glu Arg Ala Leu Leu 
    850                 855                 860     


<210>  7
<211>  2526
<212>  DNA
<213>  Human immunodeficiency virus type 1

<400>  7
atgagagtga aggggatatt gaggaattgt caacaatggt ggatatgggg catcttaggc       60

ttttggatgc taatgatttg taatgtggtg ggaaacttgt gggtcacagt ctattatggg      120

gtacctgtgt ggaaagaagc aaaaactact ctattctgtg catcagatgc taaatcatat      180

gagagagaag tgcataatgt atgggctaca catgcctgtg tacccacaga ccccgaccca      240

caagaactgg ttatggcaaa tgtaacagaa aattttaaca tgtggaaaaa tgacatggta      300

gatcagatgc atgaggatat aatcagttta tgggatcaaa gcctaaagcc atgtgtaaaa      360

ttgaccccac tctgtgtcac tttaaattgt acaagtcctg ctgcccacaa tgagagcgag      420

acaagagtaa aacattgctc tttcaatata accacagatg taaaagatag aaaacagaag      480

gtgaatgcaa ctttttatga ccttgatata gtaccactta gcagctctga caactctagc      540

aactctagtc tgtatagatt aataagttgt aatacctcaa ccataacaca agcctgtcca      600

aaggtctctt ttgacccaat tcctatacat tattgtgctc cagctggtta tgcgattcta      660

aaatgtaata ataagacatt cagtgggaaa ggaccatgtt ctaatgtcag tacagtacaa      720

tgtacacatg gaattaggcc agtggtatca actcaactgc tgttaaatgg tagcctagca      780

gaagaagaga tagtaattag atctgaaaat ctgacagaca atgccaaaac aataatagta      840

catcttaata aatctgtaga aattgagtgt ataagacctg gcaataatac aagaaaaagt      900

ataaggctag gaccaggaca aacattctat gcaacagggg atgtaatagg agacataaga      960

aaggcatatt gtaaaattaa tggaagtgag tggaatgaaa ctttaacaaa agtaagtgaa     1020

aaattaaaag aatattttaa taaaacaata agatttgccc agcactcggg aggggaccta     1080

gaagtgacaa cacatagctt taattgtaga ggagaatttt tctattgcaa tacatcagaa     1140

ctatttaata gtaatgcaac agaaagcaac atcacactcc catgcagaat aaaacaaatt     1200

ataaacatgt ggcagggggt aggacgagca atgtatgccc ctcccatcag aggagaaata     1260

aaatgtacat caaatatcac aggactacta ttaacacgcg atggaggcaa caacaataat     1320

tcaacagagg agatattcag acctgaagga ggaaatatga gggacaattg gagaagtgaa     1380

ttatacaaat ataaagtggt ggagattaag ccattgggaa tagcacccac tgaggcaaaa     1440

aggagagtgg tgcagagaga aaaaagagca gtgggaatag gagctgtgtt ccttgggttc     1500

ttgggagcag caggaagcac tatgggcgcg gcgtcaataa cgctgacggt acaggccaga     1560

caattattgt ctggtatagt gcaacagcaa agcaatttgc tgagggctat agaggcgcaa     1620

caacatctgt tgcaactcac agtctggggc attaagcagc tccaggcaag agtcctggct     1680

atagaaagat acctacagga tcaacagctc ctagggattt ggggctgctc tggaaaactc     1740

atctgcacca ctgcagtgcc ttggaactcc agttggagta ataaatctaa ggaagaaatt     1800

tggggcaaca tgacctggat gcagtgggat aaagaagtta gtaattacac attcacaata     1860

taccagttgc ttgaagaatc gcaataccag caggaacaaa atgaaaaaga actattagca     1920

ttgaacaagt ggaatgatct gtggagttgg tttaacataa caaattggtt gtggtatata     1980

aaaatattca taatgatagt aggaggctta ataggtttaa gaataatttt tgctgtactt     2040

tccatagtga atagagttag gcagggatac tcacctttgt cgtttcagac ccttaccccg     2100

aacccagggg gacccgacag gctcggaaga atcgaaggag aaggtggaga gcaagacaaa     2160

aacagatcca ttcgattagt gaacggattc ttagctctta tctgggacga cctgtggagc     2220

ctgtgccgct tcagctacca cctattgaga gacttcatat tgattgtagc gagagcggtg     2280

gaacttctgg gacgcagcag tctcaaggga ctacagaggg ggtgggaagc tcttaaatat     2340

ctgggaaatc ttatgcagta ttggggtctg gaactaaaaa gaagtgctat taatctgtta     2400

gatacaacag cagtagcagt agctgaagga acagatagga ttatagaatt agcacaaggc     2460

atttatagag ctatctgcaa catacctaga agaataagac agggctttga agcagcttta     2520

caataa                                                                2526


<210>  8
<211>  841
<212>  PRT
<213>  Human immunodeficiency virus type 1

<400>  8

Met Arg Val Lys Gly Ile Leu Arg Asn Cys Gln Gln Trp Trp Ile Trp 
1               5                   10                  15      


Gly Ile Leu Gly Phe Trp Met Leu Met Ile Cys Asn Val Val Gly Asn 
            20                  25                  30          


Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Lys 
        35                  40                  45              


Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ser Tyr Glu Arg Glu Val 
    50                  55                  60                  


His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asp Pro 
65                  70                  75                  80  


Gln Glu Leu Val Met Ala Asn Val Thr Glu Asn Phe Asn Met Trp Lys 
                85                  90                  95      


Asn Asp Met Val Asp Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp 
            100                 105                 110         


Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 
        115                 120                 125             


Asn Cys Thr Ser Pro Ala Ala His Asn Glu Ser Glu Thr Arg Val Lys 
    130                 135                 140                 


His Cys Ser Phe Asn Ile Thr Thr Asp Val Lys Asp Arg Lys Gln Lys 
145                 150                 155                 160 


Val Asn Ala Thr Phe Tyr Asp Leu Asp Ile Val Pro Leu Ser Ser Ser 
                165                 170                 175     


Asp Asn Ser Ser Asn Ser Ser Leu Tyr Arg Leu Ile Ser Cys Asn Thr 
            180                 185                 190         


Ser Thr Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Asp Pro Ile Pro 
        195                 200                 205             


Ile His Tyr Cys Ala Pro Ala Gly Tyr Ala Ile Leu Lys Cys Asn Asn 
    210                 215                 220                 


Lys Thr Phe Ser Gly Lys Gly Pro Cys Ser Asn Val Ser Thr Val Gln 
225                 230                 235                 240 


Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn 
                245                 250                 255     


Gly Ser Leu Ala Glu Glu Glu Ile Val Ile Arg Ser Glu Asn Leu Thr 
            260                 265                 270         


Asp Asn Ala Lys Thr Ile Ile Val His Leu Asn Lys Ser Val Glu Ile 
        275                 280                 285             


Glu Cys Ile Arg Pro Gly Asn Asn Thr Arg Lys Ser Ile Arg Leu Gly 
    290                 295                 300                 


Pro Gly Gln Thr Phe Tyr Ala Thr Gly Asp Val Ile Gly Asp Ile Arg 
305                 310                 315                 320 


Lys Ala Tyr Cys Lys Ile Asn Gly Ser Glu Trp Asn Glu Thr Leu Thr 
                325                 330                 335     


Lys Val Ser Glu Lys Leu Lys Glu Tyr Phe Asn Lys Thr Ile Arg Phe 
            340                 345                 350         


Ala Gln His Ser Gly Gly Asp Leu Glu Val Thr Thr His Ser Phe Asn 
        355                 360                 365             


Cys Arg Gly Glu Phe Phe Tyr Cys Asn Thr Ser Glu Leu Phe Asn Ser 
    370                 375                 380                 


Asn Ala Thr Glu Ser Asn Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile 
385                 390                 395                 400 


Ile Asn Met Trp Gln Gly Val Gly Arg Ala Met Tyr Ala Pro Pro Ile 
                405                 410                 415     


Arg Gly Glu Ile Lys Cys Thr Ser Asn Ile Thr Gly Leu Leu Leu Thr 
            420                 425                 430         


Arg Asp Gly Gly Asn Asn Asn Asn Ser Thr Glu Glu Ile Phe Arg Pro 
        435                 440                 445             


Glu Gly Gly Asn Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr 
    450                 455                 460                 


Lys Val Val Glu Ile Lys Pro Leu Gly Ile Ala Pro Thr Glu Ala Lys 
465                 470                 475                 480 


Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val 
                485                 490                 495     


Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser 
            500                 505                 510         


Ile Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser Gly Ile Val Gln 
        515                 520                 525             


Gln Gln Ser Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln His Leu Leu 
    530                 535                 540                 


Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala 
545                 550                 555                 560 


Ile Glu Arg Tyr Leu Gln Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys 
                565                 570                 575     


Ser Gly Lys Leu Ile Cys Thr Thr Ala Val Pro Trp Asn Ser Ser Trp 
            580                 585                 590         


Ser Asn Lys Ser Lys Glu Glu Ile Trp Gly Asn Met Thr Trp Met Gln 
        595                 600                 605             


Trp Asp Lys Glu Val Ser Asn Tyr Thr Phe Thr Ile Tyr Gln Leu Leu 
    610                 615                 620                 


Glu Glu Ser Gln Tyr Gln Gln Glu Gln Asn Glu Lys Glu Leu Leu Ala 
625                 630                 635                 640 


Leu Asn Lys Trp Asn Asp Leu Trp Ser Trp Phe Asn Ile Thr Asn Trp 
                645                 650                 655     


Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu Ile Gly 
            660                 665                 670         


Leu Arg Ile Ile Phe Ala Val Leu Ser Ile Val Asn Arg Val Arg Gln 
        675                 680                 685             


Gly Tyr Ser Pro Leu Ser Phe Gln Thr Leu Thr Pro Asn Pro Gly Gly 
    690                 695                 700                 


Pro Asp Arg Leu Gly Arg Ile Glu Gly Glu Gly Gly Glu Gln Asp Lys 
705                 710                 715                 720 


Asn Arg Ser Ile Arg Leu Val Asn Gly Phe Leu Ala Leu Ile Trp Asp 
                725                 730                 735     


Asp Leu Trp Ser Leu Cys Arg Phe Ser Tyr His Leu Leu Arg Asp Phe 
            740                 745                 750         


Ile Leu Ile Val Ala Arg Ala Val Glu Leu Leu Gly Arg Ser Ser Leu 
        755                 760                 765             


Lys Gly Leu Gln Arg Gly Trp Glu Ala Leu Lys Tyr Leu Gly Asn Leu 
    770                 775                 780                 


Met Gln Tyr Trp Gly Leu Glu Leu Lys Arg Ser Ala Ile Asn Leu Leu 
785                 790                 795                 800 


Asp Thr Thr Ala Val Ala Val Ala Glu Gly Thr Asp Arg Ile Ile Glu 
                805                 810                 815     


Leu Ala Gln Gly Ile Tyr Arg Ala Ile Cys Asn Ile Pro Arg Arg Ile 
            820                 825                 830         


Arg Gln Gly Phe Glu Ala Ala Leu Gln 
        835                 840     


<210>  9
<211>  2592
<212>  DNA
<213>  Human immunodeficiency virus type 1

<400>  9
atgatagtga ctatgaaagc aatggagaag aggaacaaga agttatggac cttgtactta       60

gccatggctt tgataacccc atgtttgagc cttagacagc tatatgcaac agtctatgct      120

ggggtgcctg tatgggaaga tgcaacacca gtactattct gtgcttcaga tgctaaccta      180

acaagcactg aaaagcataa tatttgggca tcacaagcct gtgttcctac agaccccact      240

ccatatgaat atccattgca caatgtgaca gatgacttta atatatggaa aaattacatg      300

gtagaacaaa tgcaggaaga cattattagt ttatgggacc agagtcttaa accttgtgtt      360

caaatgactt tcctgtgtgt acaaatggag tgtacaaaca tagctggaac aacaaatgaa      420

aaccttatga agaagtgtga gtttaatgta accactgtta tcaaagacaa aaaggagaaa      480

aaacaggctc tattctatgt atcagatttg atggaactga atgagacaag cagcacaaat      540

aagacaaaca gcaaaatgta tacattaact aattgtaact ccacaaccat cacgcaagcc      600

tgtccaaagg tatcttttga accaattcca atacactatt gtgctccagc aggatatgct      660

atctttaagt gtaacagcac agaatttaat ggaacaggca catgcagaaa cataacggta      720

gttacttgta cacatggcat taggccaaca gtaagtactc agctaatatt aaatgggaca      780

ctctctaaag gaaaaataag aatgatggca aaagatattt tggaaggtgg aaaaaatatc      840

atagtgaccc taaactctac cctaaacatg acctgtgaaa gaccacaaat agacatacaa      900

gagatgagaa taggtccaat ggcctggtac agcatgggaa tagggggaac agcaggaaac      960

agctcaaggg cagcttattg caagtataat gccactgatt ggggaaaaat attaaaacaa     1020

acagctgaaa ggtatttaga actagtaaac aatacaggta gtattaacat gacattcaat     1080

cacagcagcg gtggagatct agaggtaacc catttacact ttaactgtca tggagaattc     1140

ttttattgta acacagctaa gatgtttaat tatacctttt catgtaacgg aaccacctgt     1200

agtgttagta atgttagtca aggtaacaat ggcactctac cttgcaaact gagacaggtg     1260

gtaaggtcat ggataagggg acagtcggga ctctatgcac ctcccatcaa aggtaatcta     1320

acatgtatgt caaacataac tggaatgatc ctacaaatgg ataacacatg gaacagcagc     1380

aacaacaatg taacatttag accaataggg ggagacatga aagatatatg gagaactgaa     1440

ttgttcaact acaaagtagt aagggtaaaa ccttttagtg tggcacccac acgtattgca     1500

aggccagtca taagcactag aactcataga gaaaaaagag cagtaggatt gggaatgcta     1560

ttcttggggg ttctaagtgc agcaggtagc actatgggcg cagcggcaac aacgctggcg     1620

gtacagaccc acactttgct gaagggtata gtgcaacagc aggacaacct gctaagagca     1680

atacaggccc agcagcaatt gctgaggcta tctrtatggg gtatcagaca actccgagct     1740

cgcctgctag ccttagaaac cttactacag aatcagcaac tcctaagcct atggggctgt     1800

aaaggaaagc tagtctgcta cacatcagta aaatggaata gaacatggat aggaaacgaa     1860

agcatttggg acaccttaac atggcaggaa tgggatcggc agataagcaa cataagctcc     1920

accatatatg aggaaataca aaaggcacaa gtacagcagg aacaaaatga gaaaaagttg     1980

ctggagttag atgaatgggc ctctatttgg aattggcttg acataactaa atggttgtgg     2040

tatataaaaa tagcaataat catagtagga gcactagtag gggtgagagt tatcatgata     2100

gtacttaata tagtgaaaaa cattaggcag ggatatcaac ccctctcgtt acagatcccc     2160

aaccatcacc aagaggaagc aggaacgcca ggaagaacag gaggaggagg tggagaagaa     2220

ggcaggccca ggtggatacc ctcgccgcaa gggttcttgc cactgttgta cacggacctc     2280

agaacaataa tattgtggac ttaccacctc ttgagcaact tagcatcagg gatccagaag     2340

gtgatcagct atctgaggct tggactgtgg atcctagggc agaagataat taatgtttgc     2400

agaatttgtg cagctgtaac acaatactgg ctacaagaat tgcagaatag tgctacaagc     2460

ttgctagaca cacttgcagt ggcagtagcc aattggactg acggcataat cgcagggata     2520

caaagaatag gaacaggaat tcgtaacatc ccaaggagaa ttagacaggg cttagaaaga     2580

agtttattgt aa                                                         2592


<210>  10
<211>  863
<212>  PRT
<213>  Human immunodeficiency virus type 1


<220>
<221>  misc_feature
<222>  (572)..(572)
<223>  Xaa can be any naturally occurring amino acid

<400>  10

Met Ile Val Thr Met Lys Ala Met Glu Lys Arg Asn Lys Lys Leu Trp 
1               5                   10                  15      


Thr Leu Tyr Leu Ala Met Ala Leu Ile Thr Pro Cys Leu Ser Leu Arg 
            20                  25                  30          


Gln Leu Tyr Ala Thr Val Tyr Ala Gly Val Pro Val Trp Glu Asp Ala 
        35                  40                  45              


Thr Pro Val Leu Phe Cys Ala Ser Asp Ala Asn Leu Thr Ser Thr Glu 
    50                  55                  60                  


Lys His Asn Ile Trp Ala Ser Gln Ala Cys Val Pro Thr Asp Pro Thr 
65                  70                  75                  80  


Pro Tyr Glu Tyr Pro Leu His Asn Val Thr Asp Asp Phe Asn Ile Trp 
                85                  90                  95      


Lys Asn Tyr Met Val Glu Gln Met Gln Glu Asp Ile Ile Ser Leu Trp 
            100                 105                 110         


Asp Gln Ser Leu Lys Pro Cys Val Gln Met Thr Phe Leu Cys Val Gln 
        115                 120                 125             


Met Glu Cys Thr Asn Ile Ala Gly Thr Thr Asn Glu Asn Leu Met Lys 
    130                 135                 140                 


Lys Cys Glu Phe Asn Val Thr Thr Val Ile Lys Asp Lys Lys Glu Lys 
145                 150                 155                 160 


Lys Gln Ala Leu Phe Tyr Val Ser Asp Leu Met Glu Leu Asn Glu Thr 
                165                 170                 175     


Ser Ser Thr Asn Lys Thr Asn Ser Lys Met Tyr Thr Leu Thr Asn Cys 
            180                 185                 190         


Asn Ser Thr Thr Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro 
        195                 200                 205             


Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Tyr Ala Ile Phe Lys Cys 
    210                 215                 220                 


Asn Ser Thr Glu Phe Asn Gly Thr Gly Thr Cys Arg Asn Ile Thr Val 
225                 230                 235                 240 


Val Thr Cys Thr His Gly Ile Arg Pro Thr Val Ser Thr Gln Leu Ile 
                245                 250                 255     


Leu Asn Gly Thr Leu Ser Lys Gly Lys Ile Arg Met Met Ala Lys Asp 
            260                 265                 270         


Ile Leu Glu Gly Gly Lys Asn Ile Ile Val Thr Leu Asn Ser Thr Leu 
        275                 280                 285             


Asn Met Thr Cys Glu Arg Pro Gln Ile Asp Ile Gln Glu Met Arg Ile 
    290                 295                 300                 


Gly Pro Met Ala Trp Tyr Ser Met Gly Ile Gly Gly Thr Ala Gly Asn 
305                 310                 315                 320 


Ser Ser Arg Ala Ala Tyr Cys Lys Tyr Asn Ala Thr Asp Trp Gly Lys 
                325                 330                 335     


Ile Leu Lys Gln Thr Ala Glu Arg Tyr Leu Glu Leu Val Asn Asn Thr 
            340                 345                 350         


Gly Ser Ile Asn Met Thr Phe Asn His Ser Ser Gly Gly Asp Leu Glu 
        355                 360                 365             


Val Thr His Leu His Phe Asn Cys His Gly Glu Phe Phe Tyr Cys Asn 
    370                 375                 380                 


Thr Ala Lys Met Phe Asn Tyr Thr Phe Ser Cys Asn Gly Thr Thr Cys 
385                 390                 395                 400 


Ser Val Ser Asn Val Ser Gln Gly Asn Asn Gly Thr Leu Pro Cys Lys 
                405                 410                 415     


Leu Arg Gln Val Val Arg Ser Trp Ile Arg Gly Gln Ser Gly Leu Tyr 
            420                 425                 430         


Ala Pro Pro Ile Lys Gly Asn Leu Thr Cys Met Ser Asn Ile Thr Gly 
        435                 440                 445             


Met Ile Leu Gln Met Asp Asn Thr Trp Asn Ser Ser Asn Asn Asn Val 
    450                 455                 460                 


Thr Phe Arg Pro Ile Gly Gly Asp Met Lys Asp Ile Trp Arg Thr Glu 
465                 470                 475                 480 


Leu Phe Asn Tyr Lys Val Val Arg Val Lys Pro Phe Ser Val Ala Pro 
                485                 490                 495     


Thr Arg Ile Ala Arg Pro Val Ile Ser Thr Arg Thr His Arg Glu Lys 
            500                 505                 510         


Arg Ala Val Gly Leu Gly Met Leu Phe Leu Gly Val Leu Ser Ala Ala 
        515                 520                 525             


Gly Ser Thr Met Gly Ala Ala Ala Thr Thr Leu Ala Val Gln Thr His 
    530                 535                 540                 


Thr Leu Leu Lys Gly Ile Val Gln Gln Gln Asp Asn Leu Leu Arg Ala 
545                 550                 555                 560 


Ile Gln Ala Gln Gln Gln Leu Leu Arg Leu Ser Xaa Trp Gly Ile Arg 
                565                 570                 575     


Gln Leu Arg Ala Arg Leu Leu Ala Leu Glu Thr Leu Leu Gln Asn Gln 
            580                 585                 590         


Gln Leu Leu Ser Leu Trp Gly Cys Lys Gly Lys Leu Val Cys Tyr Thr 
        595                 600                 605             


Ser Val Lys Trp Asn Arg Thr Trp Ile Gly Asn Glu Ser Ile Trp Asp 
    610                 615                 620                 


Thr Leu Thr Trp Gln Glu Trp Asp Arg Gln Ile Ser Asn Ile Ser Ser 
625                 630                 635                 640 


Thr Ile Tyr Glu Glu Ile Gln Lys Ala Gln Val Gln Gln Glu Gln Asn 
                645                 650                 655     


Glu Lys Lys Leu Leu Glu Leu Asp Glu Trp Ala Ser Ile Trp Asn Trp 
            660                 665                 670         


Leu Asp Ile Thr Lys Trp Leu Trp Tyr Ile Lys Ile Ala Ile Ile Ile 
        675                 680                 685             


Val Gly Ala Leu Val Gly Val Arg Val Ile Met Ile Val Leu Asn Ile 
    690                 695                 700                 


Val Lys Asn Ile Arg Gln Gly Tyr Gln Pro Leu Ser Leu Gln Ile Pro 
705                 710                 715                 720 


Asn His His Gln Glu Glu Ala Gly Thr Pro Gly Arg Thr Gly Gly Gly 
                725                 730                 735     


Gly Gly Glu Glu Gly Arg Pro Arg Trp Ile Pro Ser Pro Gln Gly Phe 
            740                 745                 750         


Leu Pro Leu Leu Tyr Thr Asp Leu Arg Thr Ile Ile Leu Trp Thr Tyr 
        755                 760                 765             


His Leu Leu Ser Asn Leu Ala Ser Gly Ile Gln Lys Val Ile Ser Tyr 
    770                 775                 780                 


Leu Arg Leu Gly Leu Trp Ile Leu Gly Gln Lys Ile Ile Asn Val Cys 
785                 790                 795                 800 


Arg Ile Cys Ala Ala Val Thr Gln Tyr Trp Leu Gln Glu Leu Gln Asn 
                805                 810                 815     


Ser Ala Thr Ser Leu Leu Asp Thr Leu Ala Val Ala Val Ala Asn Trp 
            820                 825                 830         


Thr Asp Gly Ile Ile Ala Gly Ile Gln Arg Ile Gly Thr Gly Ile Arg 
        835                 840                 845             


Asn Ile Pro Arg Arg Ile Arg Gln Gly Leu Glu Arg Ser Leu Leu 
    850                 855                 860             


<210>  11
<211>  2577
<212>  DNA
<213>  Simian immunodeficiency virus


<220>
<221>  misc_feature
<222>  (295)..(295)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1473)..(1473)
<223>  n is a, c, g, or t

<400>  11
atgaggaagc cgatacatat tatttggggt ctggctttgc taatccagtt tatagagaag       60

gggacgaatg aagactatgt aacagtattc tatggagtcc ctgtctggag aaatgcgaca      120

cctactctat tttgtgccac aaatgcctcc atgacaagta cagaggtgca caatgtatgg      180

gcaactacca gttgtgtgcc aatagatcca gatcctattg tagttaggct caatacctca      240

gtctggttta atgcttataa aaattatatg gtagaaagta tgacagaaga tatgntacaa      300

ttattccaac aaagccataa gccatgtgta aaactaacac ctatgtgtat aaaaatgaat      360

tgtacaggat acaatggaac acctacaaca ccaagtacaa caacaagtac agtaacacca      420

aagacaacaa caccaatagt agatggcatg aagctacaag aatgtaactt taatcagagc      480

acaggattta aagataagaa acaaaaaatg aaagccatat tttataaagg agatcttatg      540

aagtgtcagg acaacaatga gactaactgc tattacttat ggcactgcaa caccacaact      600

atcacacaat cctgtgaaaa gtctactttt gaaccaattc ctatacatta ttgtgctcca      660

gcaggatatg ctatattgag atgtgaagat gaggatttta caggagtagg gatgtgtaaa      720

aatgtctcag tagtacattg cactcatgga ataagcccaa tggtggcaac atggttacta      780

ttaaatggaa cttaccaaac aaacacttca gtagtaatga atggtcgcaa aaatgaatct      840

gtgcttgtaa gatttggaaa agaattcgaa aacttaacaa ttacatgtat aagaccagga      900

aataggacag taagaaatct acaaatagga ccaggaatga ctttctataa cgtagaaata      960

gcaacaggag acactaggaa agcgttctgt acagtcaata agacgctatg ggaacaagca     1020

cgtaacaaaa cagagcacgt tcttgcggag cattggaaaa aagtagacaa caaaaccaat     1080

gcgaaaacaa tatggacatt ccaagatgga gatcctgaag taaaagtgca ttggtttaat     1140

tgccaaggag aattctttta ttgtgatata acaccttggt tcaatgccac atacacggga     1200

aacctcatca caaacggagc cctcatagca cattgcagaa ttaagcagat agttaatcat     1260

tggggcatag tttcaaaagg catttactta gcccctagga gagggaatgt ttcctgtact     1320

tccagcataa ctggaattat gttggaaggt caaatatata atgaaactgt taaagtgtca     1380

cctgctgcaa gagtagcaga ccaatggaga gcggagttgt ccaggtacca ggtggtagag     1440

attgrtccct tgtcagtagc cccaacaaca ggnaaaaggc cagaaataaa acaacactcc     1500

agacaaaaaa gaggcattgg aatagggctg ttcttcttgg gtcttctcag tgcagctggc     1560

agtacaatgg gcgcagcgtc aatagcgctg acggcacaga ccaggaattt gytccatggt     1620

attgtacaac agcaggccaa tctgctgcaa gccatagaga cacagcaaca tctgctacag     1680

ctctcggtct ggggagtaaa acaactccag gcaagaatgc ttgcagtcga gaagtaccta     1740

agagatcaac aactattgag cctctggggt tgtgctgaca aggtgacctg tcacactacg     1800

gtgccttgga ataattcctg ggtaaacttc acgcaaacat gtgcaaagaa cagcagtgat     1860

atacaatgta tttgggaaaa tatgacatgg caagaatggg acagattagt acagaattca     1920

acaggacaga tatataatat cttacaaata gcacatgagc aacaagagag aaataaaaag     1980

gaattatatg aactagacaa atggagctca ttatggaatt ggtttgacat aacacaatgg     2040

ctatggtata taaaaatatt tattatgata gtaggagcta ttgtaggact aagaattttg     2100

cttgtattag ttagttgctt aagaaaggtt aggcagggat atcatcctct gtcatttcag     2160

atccctaccc aaaaccagca ggatccagag cagccagaag aaataagaga agaaggtgga     2220

agaaaagaca ggatcaggtg gagggccttg cagcacgggt tcttcgcact cttgtgggtg     2280

gacctgacga gcataatcca gtggatctac cagatctgca gaacctgtct cttgaacctt     2340

tgggcagtcc tccaacacct ctgcagaatt actttcagac tgtgcaacca tctggagaac     2400

aatctcagca ccctctggac aataatcaga actgagatca ttaagaacat tgacagactt     2460

gctatttggg taggggaaaa aacagatagc atacttctag ctctccaaac tatagtcaga     2520

atcataaggg aagtacctag gcgcatcaga caagggttgg aaattgcatt aaattaa        2577


<210>  12
<211>  858
<212>  PRT
<213>  Simian immunodeficiency virus


<220>
<221>  misc_feature
<222>  (99)..(99)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (482)..(482)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (538)..(538)
<223>  Xaa can be any naturally occurring amino acid

<400>  12

Met Arg Lys Pro Ile His Ile Ile Trp Gly Leu Ala Leu Leu Ile Gln 
1               5                   10                  15      


Phe Ile Glu Lys Gly Thr Asn Glu Asp Tyr Val Thr Val Phe Tyr Gly 
            20                  25                  30          


Val Pro Val Trp Arg Asn Ala Thr Pro Thr Leu Phe Cys Ala Thr Asn 
        35                  40                  45              


Ala Ser Met Thr Ser Thr Glu Val His Asn Val Trp Ala Thr Thr Ser 
    50                  55                  60                  


Cys Val Pro Ile Asp Pro Asp Pro Ile Val Val Arg Leu Asn Thr Ser 
65                  70                  75                  80  


Val Trp Phe Asn Ala Tyr Lys Asn Tyr Met Val Glu Ser Met Thr Glu 
                85                  90                  95      


Asp Met Xaa Gln Leu Phe Gln Gln Ser His Lys Pro Cys Val Lys Leu 
            100                 105                 110         


Thr Pro Met Cys Ile Lys Met Asn Cys Thr Gly Tyr Asn Gly Thr Pro 
        115                 120                 125             


Thr Thr Pro Ser Thr Thr Thr Ser Thr Val Thr Pro Lys Thr Thr Thr 
    130                 135                 140                 


Pro Ile Val Asp Gly Met Lys Leu Gln Glu Cys Asn Phe Asn Gln Ser 
145                 150                 155                 160 


Thr Gly Phe Lys Asp Lys Lys Gln Lys Met Lys Ala Ile Phe Tyr Lys 
                165                 170                 175     


Gly Asp Leu Met Lys Cys Gln Asp Asn Asn Glu Thr Asn Cys Tyr Tyr 
            180                 185                 190         


Leu Trp His Cys Asn Thr Thr Thr Ile Thr Gln Ser Cys Glu Lys Ser 
        195                 200                 205             


Thr Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Tyr Ala 
    210                 215                 220                 


Ile Leu Arg Cys Glu Asp Glu Asp Phe Thr Gly Val Gly Met Cys Lys 
225                 230                 235                 240 


Asn Val Ser Val Val His Cys Thr His Gly Ile Ser Pro Met Val Ala 
                245                 250                 255     


Thr Trp Leu Leu Leu Asn Gly Thr Tyr Gln Thr Asn Thr Ser Val Val 
            260                 265                 270         


Met Asn Gly Arg Lys Asn Glu Ser Val Leu Val Arg Phe Gly Lys Glu 
        275                 280                 285             


Phe Glu Asn Leu Thr Ile Thr Cys Ile Arg Pro Gly Asn Arg Thr Val 
    290                 295                 300                 


Arg Asn Leu Gln Ile Gly Pro Gly Met Thr Phe Tyr Asn Val Glu Ile 
305                 310                 315                 320 


Ala Thr Gly Asp Thr Arg Lys Ala Phe Cys Thr Val Asn Lys Thr Leu 
                325                 330                 335     


Trp Glu Gln Ala Arg Asn Lys Thr Glu His Val Leu Ala Glu His Trp 
            340                 345                 350         


Lys Lys Val Asp Asn Lys Thr Asn Ala Lys Thr Ile Trp Thr Phe Gln 
        355                 360                 365             


Asp Gly Asp Pro Glu Val Lys Val His Trp Phe Asn Cys Gln Gly Glu 
    370                 375                 380                 


Phe Phe Tyr Cys Asp Ile Thr Pro Trp Phe Asn Ala Thr Tyr Thr Gly 
385                 390                 395                 400 


Asn Leu Ile Thr Asn Gly Ala Leu Ile Ala His Cys Arg Ile Lys Gln 
                405                 410                 415     


Ile Val Asn His Trp Gly Ile Val Ser Lys Gly Ile Tyr Leu Ala Pro 
            420                 425                 430         


Arg Arg Gly Asn Val Ser Cys Thr Ser Ser Ile Thr Gly Ile Met Leu 
        435                 440                 445             


Glu Gly Gln Ile Tyr Asn Glu Thr Val Lys Val Ser Pro Ala Ala Arg 
    450                 455                 460                 


Val Ala Asp Gln Trp Arg Ala Glu Leu Ser Arg Tyr Gln Val Val Glu 
465                 470                 475                 480 


Ile Xaa Pro Leu Ser Val Ala Pro Thr Thr Gly Lys Arg Pro Glu Ile 
                485                 490                 495     


Lys Gln His Ser Arg Gln Lys Arg Gly Ile Gly Ile Gly Leu Phe Phe 
            500                 505                 510         


Leu Gly Leu Leu Ser Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Ile 
        515                 520                 525             


Ala Leu Thr Ala Gln Thr Arg Asn Leu Xaa His Gly Ile Val Gln Gln 
    530                 535                 540                 


Gln Ala Asn Leu Leu Gln Ala Ile Glu Thr Gln Gln His Leu Leu Gln 
545                 550                 555                 560 


Leu Ser Val Trp Gly Val Lys Gln Leu Gln Ala Arg Met Leu Ala Val 
                565                 570                 575     


Glu Lys Tyr Leu Arg Asp Gln Gln Leu Leu Ser Leu Trp Gly Cys Ala 
            580                 585                 590         


Asp Lys Val Thr Cys His Thr Thr Val Pro Trp Asn Asn Ser Trp Val 
        595                 600                 605             


Asn Phe Thr Gln Thr Cys Ala Lys Asn Ser Ser Asp Ile Gln Cys Ile 
    610                 615                 620                 


Trp Glu Asn Met Thr Trp Gln Glu Trp Asp Arg Leu Val Gln Asn Ser 
625                 630                 635                 640 


Thr Gly Gln Ile Tyr Asn Ile Leu Gln Ile Ala His Glu Gln Gln Glu 
                645                 650                 655     


Arg Asn Lys Lys Glu Leu Tyr Glu Leu Asp Lys Trp Ser Ser Leu Trp 
            660                 665                 670         


Asn Trp Phe Asp Ile Thr Gln Trp Leu Trp Tyr Ile Lys Ile Phe Ile 
        675                 680                 685             


Met Ile Val Gly Ala Ile Val Gly Leu Arg Ile Leu Leu Val Leu Val 
    690                 695                 700                 


Ser Cys Leu Arg Lys Val Arg Gln Gly Tyr His Pro Leu Ser Phe Gln 
705                 710                 715                 720 


Ile Pro Thr Gln Asn Gln Gln Asp Pro Glu Gln Pro Glu Glu Ile Arg 
                725                 730                 735     


Glu Glu Gly Gly Arg Lys Asp Arg Ile Arg Trp Arg Ala Leu Gln His 
            740                 745                 750         


Gly Phe Phe Ala Leu Leu Trp Val Asp Leu Thr Ser Ile Ile Gln Trp 
        755                 760                 765             


Ile Tyr Gln Ile Cys Arg Thr Cys Leu Leu Asn Leu Trp Ala Val Leu 
    770                 775                 780                 


Gln His Leu Cys Arg Ile Thr Phe Arg Leu Cys Asn His Leu Glu Asn 
785                 790                 795                 800 


Asn Leu Ser Thr Leu Trp Thr Ile Ile Arg Thr Glu Ile Ile Lys Asn 
                805                 810                 815     


Ile Asp Arg Leu Ala Ile Trp Val Gly Glu Lys Thr Asp Ser Ile Leu 
            820                 825                 830         


Leu Ala Leu Gln Thr Ile Val Arg Ile Ile Arg Glu Val Pro Arg Arg 
        835                 840                 845             


Ile Arg Gln Gly Leu Glu Ile Ala Leu Asn 
    850                 855             


<210>  13
<211>  2589
<212>  DNA
<213>  Human immunodeficiency virus type 2

<400>  13
atgatgtcta gtagaaatca gctgcttgtt actatcttac tagctagtgc ttgcttagta       60

tattgtaaac aatatgtgac tgttttttat ggcgtgccag catggaaaaa tgcatccatt      120

cccctctttt gtgcaaccaa aaatagagat acttggggaa ccatacagtg cttaccagac      180

aatgatgatt atcaggaaat agctttgaat gtgacagagg ctttcgatgc atgggataat      240

acagtaacag aacaagcagt agaagatgtc tggagactat ttgagacatc aataaaacca      300

tgtgtcaagt taacaccttt atgtatagca atgaagtgta gcaacataag cacagagagc      360

acaaccacat ccccgagccc agggagcaca ctcaaacccc tgataaatga gagcgatcca      420

tgcataaagg cagacaactg ccccagggga ctaggggatg aagagatggt caattgtcgg      480

ttcaacatga caggattaca gagagataag ccaaaacagt ataatgaaac atggtactca      540

aaagatgtgg tttgtgaacc atttaacacc accacaaacc agaccaggtg ttacatgaac      600

cattgcaaca catcagtcat cacagagtca tgtgataagc actattggga tgctataagg      660

tttagatact gtgcaccacc tggttacgcc ctactaagat gcgatgatat caattattca      720

ggctttgcac ccaattgctc taaagtagta gctgctacat gcacaaggat gatggagacg      780

caaacttcta cttggtttgg ctttaatggc actagggcag aaaatagaac atatatctat      840

tggcatggta gagataatag aactatcatc agcttaaaca aacattataa tcttactatg      900

cattgtaaga ggccaggaaa taagacagtt gtaccaataa cacttatgtc agggttaata      960

tttcactccc agccaatcaa taaaagaccc agacaagcat ggtgctggtt caaaggcgaa     1020

tggaggaaag ccatgcagga ggtgaaggaa acccttgtaa aacatcccag gtataaagga     1080

accaatgaca caaaccaaat taactttaca aaaccaggaa gaggctcaga tgcagaagtg     1140

gtatatatgt ggactaactg cagaggagaa tttctccatt gcaacatgac ttggttcctc     1200

aattgggtgg aaaacaaaac gggtcaggaa cagcacaatt atgcaccgtg ccatataaag     1260

caaataatta atatctggca caaagcaggg aaaaatgtat atttgcctcc tagggaagga     1320

gagttgacct gcaactcaac agtaaccagc ttgattgcta acattgacac ggatggcaac     1380

cagacaaata ttacctttag tgcagaggtg gcagaactat accgattaga attgggggat     1440

tataaattag tagagataac accaattggc ttcgcaccta catcagaaag gagatactcc     1500

tctactccaa ggaggaataa aagaggtgtg ttcgtgctag ggttcttagg ttttctcgcg     1560

acagcaggtt ctgcaatggg cacggcagct ttaacgctgt ctgctcagtc tcggacttta     1620

ttggccggga tagtgcagca acagcaacag ctgttggacg tggtcaagag acaacaggaa     1680

atgttgcgac tgaccgtctg gggaacgaaa aatctccagg caagagtcac tgctatcgag     1740

aaatacttaa aggaccaggc gcggctaaat tcatggggat gtgcatttag acaagtctgc     1800

cacactactg taccatgggt aaataactcc ttaaaacctg attgggacaa catgacgtgg     1860

caagagtggg aacaacaagt ccgttaccta gaggcaaata tcagtgaaca gttagaacgg     1920

gcacaaattc agcaagaaaa gaatacgtat gaactacaaa aattaaatag ctgggatgtt     1980

tttaccaact ggcttgactt aaccgcctgg gtcaagtata ttcaatatgg agtttatata     2040

atagtaggaa tagtagctct tagaatagta atatatgtag tgcaaatgtt aagtagactc     2100

aggaagggct ataggcctgt tttctcctcc cctcccggtt acatccaaca gatccatatc     2160

cacaaggacc aggaacagcc aaccagagga gaaacagaag aagacgttgg agacaacgtt     2220

ggggacagat tgtggccctg gccgatcgca tatttacatt tcctgatcca cctgctagct     2280

cgcctcttga tcgggctgta cagcatctgc agggacttac tatccaggat ctccccgatc     2340

ctccaaccga tcttccggag tcttcagaga gcgctgacaa caatcaggga ctggctgaga     2400

cttaaagcag cctacctgca gtatgggtgc gagtggatcc aagaagcgtt ccgggccttt     2460

gcaaggattg cgagagagac tcttacaaac acctggagag acttgtgggg ggcagtgcag     2520

tgggtcggga ggaggatact cgcagtccca aggaggatca ggcagggggc agaaattgcc     2580

ctcctgtga                                                             2589


<210>  14
<211>  862
<212>  PRT
<213>  Human immunodeficiency virus type 2

<400>  14

Met Met Ser Ser Arg Asn Gln Leu Leu Val Thr Ile Leu Leu Ala Ser 
1               5                   10                  15      


Ala Cys Leu Val Tyr Cys Lys Gln Tyr Val Thr Val Phe Tyr Gly Val 
            20                  25                  30          


Pro Ala Trp Lys Asn Ala Ser Ile Pro Leu Phe Cys Ala Thr Lys Asn 
        35                  40                  45              


Arg Asp Thr Trp Gly Thr Ile Gln Cys Leu Pro Asp Asn Asp Asp Tyr 
    50                  55                  60                  


Gln Glu Ile Ala Leu Asn Val Thr Glu Ala Phe Asp Ala Trp Asp Asn 
65                  70                  75                  80  


Thr Val Thr Glu Gln Ala Val Glu Asp Val Trp Arg Leu Phe Glu Thr 
                85                  90                  95      


Ser Ile Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Ile Ala Met Lys 
            100                 105                 110         


Cys Ser Asn Ile Ser Thr Glu Ser Thr Thr Thr Ser Pro Ser Pro Gly 
        115                 120                 125             


Ser Thr Leu Lys Pro Leu Ile Asn Glu Ser Asp Pro Cys Ile Lys Ala 
    130                 135                 140                 


Asp Asn Cys Pro Arg Gly Leu Gly Asp Glu Glu Met Val Asn Cys Arg 
145                 150                 155                 160 


Phe Asn Met Thr Gly Leu Gln Arg Asp Lys Pro Lys Gln Tyr Asn Glu 
                165                 170                 175     


Thr Trp Tyr Ser Lys Asp Val Val Cys Glu Pro Phe Asn Thr Thr Thr 
            180                 185                 190         


Asn Gln Thr Arg Cys Tyr Met Asn His Cys Asn Thr Ser Val Ile Thr 
        195                 200                 205             


Glu Ser Cys Asp Lys His Tyr Trp Asp Ala Ile Arg Phe Arg Tyr Cys 
    210                 215                 220                 


Ala Pro Pro Gly Tyr Ala Leu Leu Arg Cys Asp Asp Ile Asn Tyr Ser 
225                 230                 235                 240 


Gly Phe Ala Pro Asn Cys Ser Lys Val Val Ala Ala Thr Cys Thr Arg 
                245                 250                 255     


Met Met Glu Thr Gln Thr Ser Thr Trp Phe Gly Phe Asn Gly Thr Arg 
            260                 265                 270         


Ala Glu Asn Arg Thr Tyr Ile Tyr Trp His Gly Arg Asp Asn Arg Thr 
        275                 280                 285             


Ile Ile Ser Leu Asn Lys His Tyr Asn Leu Thr Met His Cys Lys Arg 
    290                 295                 300                 


Pro Gly Asn Lys Thr Val Val Pro Ile Thr Leu Met Ser Gly Leu Ile 
305                 310                 315                 320 


Phe His Ser Gln Pro Ile Asn Lys Arg Pro Arg Gln Ala Trp Cys Trp 
                325                 330                 335     


Phe Lys Gly Glu Trp Arg Lys Ala Met Gln Glu Val Lys Glu Thr Leu 
            340                 345                 350         


Val Lys His Pro Arg Tyr Lys Gly Thr Asn Asp Thr Asn Gln Ile Asn 
        355                 360                 365             


Phe Thr Lys Pro Gly Arg Gly Ser Asp Ala Glu Val Val Tyr Met Trp 
    370                 375                 380                 


Thr Asn Cys Arg Gly Glu Phe Leu His Cys Asn Met Thr Trp Phe Leu 
385                 390                 395                 400 


Asn Trp Val Glu Asn Lys Thr Gly Gln Glu Gln His Asn Tyr Ala Pro 
                405                 410                 415     


Cys His Ile Lys Gln Ile Ile Asn Ile Trp His Lys Ala Gly Lys Asn 
            420                 425                 430         


Val Tyr Leu Pro Pro Arg Glu Gly Glu Leu Thr Cys Asn Ser Thr Val 
        435                 440                 445             


Thr Ser Leu Ile Ala Asn Ile Asp Thr Asp Gly Asn Gln Thr Asn Ile 
    450                 455                 460                 


Thr Phe Ser Ala Glu Val Ala Glu Leu Tyr Arg Leu Glu Leu Gly Asp 
465                 470                 475                 480 


Tyr Lys Leu Val Glu Ile Thr Pro Ile Gly Phe Ala Pro Thr Ser Glu 
                485                 490                 495     


Arg Arg Tyr Ser Ser Thr Pro Arg Arg Asn Lys Arg Gly Val Phe Val 
            500                 505                 510         


Leu Gly Phe Leu Gly Phe Leu Ala Thr Ala Gly Ser Ala Met Gly Thr 
        515                 520                 525             


Ala Ala Leu Thr Leu Ser Ala Gln Ser Arg Thr Leu Leu Ala Gly Ile 
    530                 535                 540                 


Val Gln Gln Gln Gln Gln Leu Leu Asp Val Val Lys Arg Gln Gln Glu 
545                 550                 555                 560 


Met Leu Arg Leu Thr Val Trp Gly Thr Lys Asn Leu Gln Ala Arg Val 
                565                 570                 575     


Thr Ala Ile Glu Lys Tyr Leu Lys Asp Gln Ala Arg Leu Asn Ser Trp 
            580                 585                 590         


Gly Cys Ala Phe Arg Gln Val Cys His Thr Thr Val Pro Trp Val Asn 
        595                 600                 605             


Asn Ser Leu Lys Pro Asp Trp Asp Asn Met Thr Trp Gln Glu Trp Glu 
    610                 615                 620                 


Gln Gln Val Arg Tyr Leu Glu Ala Asn Ile Ser Glu Gln Leu Glu Arg 
625                 630                 635                 640 


Ala Gln Ile Gln Gln Glu Lys Asn Thr Tyr Glu Leu Gln Lys Leu Asn 
                645                 650                 655     


Ser Trp Asp Val Phe Thr Asn Trp Leu Asp Leu Thr Ala Trp Val Lys 
            660                 665                 670         


Tyr Ile Gln Tyr Gly Val Tyr Ile Ile Val Gly Ile Val Ala Leu Arg 
        675                 680                 685             


Ile Val Ile Tyr Val Val Gln Met Leu Ser Arg Leu Arg Lys Gly Tyr 
    690                 695                 700                 


Arg Pro Val Phe Ser Ser Pro Pro Gly Tyr Ile Gln Gln Ile His Ile 
705                 710                 715                 720 


His Lys Asp Gln Glu Gln Pro Thr Arg Gly Glu Thr Glu Glu Asp Val 
                725                 730                 735     


Gly Asp Asn Val Gly Asp Arg Leu Trp Pro Trp Pro Ile Ala Tyr Leu 
            740                 745                 750         


His Phe Leu Ile His Leu Leu Ala Arg Leu Leu Ile Gly Leu Tyr Ser 
        755                 760                 765             


Ile Cys Arg Asp Leu Leu Ser Arg Ile Ser Pro Ile Leu Gln Pro Ile 
    770                 775                 780                 


Phe Arg Ser Leu Gln Arg Ala Leu Thr Thr Ile Arg Asp Trp Leu Arg 
785                 790                 795                 800 


Leu Lys Ala Ala Tyr Leu Gln Tyr Gly Cys Glu Trp Ile Gln Glu Ala 
                805                 810                 815     


Phe Arg Ala Phe Ala Arg Ile Ala Arg Glu Thr Leu Thr Asn Thr Trp 
            820                 825                 830         


Arg Asp Leu Trp Gly Ala Val Gln Trp Val Gly Arg Arg Ile Leu Ala 
        835                 840                 845             


Val Pro Arg Arg Ile Arg Gln Gly Ala Glu Ile Ala Leu Leu 
    850                 855                 860         


<210>  15
<211>  2640
<212>  DNA
<213>  Simian immunodeficiency virus

<400>  15
atgggatgtc ttgggaatca gctgcttatc gccatcttgc ttttaagtgt ctatgggatc       60

tattgtactc tatatgtcac agtcttttat ggtgtaccag cttggaggaa tgcgacaatt      120

cccctctttt gtgcaaccaa gaatagggat acttggggaa caactcagtg cctaccagat      180

aatggtgatt attcagaagt ggcccttaat gttacagaaa gctttgatgc ctggaataat      240

acagtcacag aacaggcaat agaggatgta tggcaactct ttgagacctc aataaagcct      300

tgtgtaaaat tatccccatt atgcattact atgagatgca ataaaagtga gacagataga      360

tggggattga caaaatcaat aacaacaaca gcatcaacaa catcaacgac agcatcagca      420

aaagtagaca tggtcaatga gactagttct tgtatagccc aggataattg cacaggcttg      480

gaacaagagc aaatgataag ctgtaaattc aacatgacag ggttaaaaag agacaagaaa      540

aaagagtaca atgaaacttg gtactctgca gatttggtat gtgaacaagg gaataacact      600

ggtaatgaaa gtagatgtta catgaaccac tgtaacactt ctgttatcca agagtcttgt      660

gacaaacatt attgggatgc tattagattt aggtattgtg cacctccagg ttatgctttg      720

cttagatgta atgacacaaa ttattcaggc tttatgccta aatgttctaa ggtggtggtc      780

tcttcatgca caaggatgat ggagacacag acttctactt ggtttggctt taatggaact      840

agagcagaaa atagaactta tatttactgg catggtaggg ataataggac tataattagt      900

ttaaataagt attataatct aacaatgaaa tgtagaagac caggaaataa gacagtttta      960

ccagtcacca ttatgtctgg attggttttc cactcacaac caatcaatga taggccaaag     1020

caggcatggt gttggtttgg aggaaaatgg aaggatgcaa taaaagaggt gaagcagacc     1080

attgtcaaac atcccaggta tactggaact aacaatactg ataaaatcaa tttgacggct     1140

cctggaggag gagatccgga agttaccttc atgtggacaa attgcagagg agagttcctc     1200

tactgtaaaa tgaattggtt tctaaattgg gtagaagata ggaatacagc taaccagaag     1260

ccaaaggaac agcataaaag gaattacgtg ccatgtcata ttagacaaat aatcaacact     1320

tggcataaag taggcaaaaa tgtttatttg cctccaagag agggagacct cacgtgtaac     1380

tccacagtga ccagtctcat agcaaacata gattggattg atggaaacca aactaatatc     1440

accatgagtg cagaggtggc agaactgtat cgattggaat tgggagatta taaattagta     1500

gagatcactc caattggctt ggcccccaca gatgtgaaga ggtacactac tggtggcacc     1560

tcaagaaata aaagaggggt ctttgtgcta gggttcttgg gttttctcgc aacggcaggt     1620

tctgcaatgg gcgcggcgtc gttgacgctg accgctcagt cccgaacttt attggctggg     1680

atagtgcagc aacagcaaca gctgttggac gtggtcaaga gacaacaaga attgttgcga     1740

ctgaccgtct ggggaacaaa gaacctccag actagggtca ctgccatcga gaagtactta     1800

aaggaccagg cgcagctgaa tgcttgggga tgtgcgttta gacaagtctg ccacactact     1860

gtaccatggc caaatgcaag tctaacacca aagtggaaca atgagacttg gcaagagtgg     1920

gagcgaaagg ttgacttctt ggaagaaaat ataacagccc tcctagagga ggcacaaatt     1980

caacaagaga agaacatgta tgaattacaa aagttgaata gctgggatgt gtttggcaat     2040

tggtttgacc ttgcttcttg gataaagtat atacaatatg gagtttatat agttgtagga     2100

gtaatactgt taagaatagt gatctatata gtacaaatgc tagctaagtt aaggcagggg     2160

tataggccag tgttctcttc cccaccctct tatttccagc agacccatat ccaacaggac     2220

ccggcactgc caaccagaga aggcaaagaa agagacggtg gagaaggcgg tggcaacagc     2280

tcctggcctt ggcagataga atatattcat ttcctgatcc gccaactgat acgcctcttg     2340

acttggctat tcagcaactg cagaaccttg ctatcgagag tataccagat cctccaacca     2400

atactccaga ggctctctgc gaccctacag aggattcgag aagtcctcag gactgaactg     2460

acctacctac aatatgggtg gagctatttc catgaggcgg tccaggccgt ctggagatct     2520

gcgacagaga ctcttgcggg cgcgtgggga gacttatggg agactcttag gagaggtgga     2580

agatggatac tcgcaatccc caggaggatt agacaagggc ttgagctcac tctcttgtga     2640


<210>  16
<211>  879
<212>  PRT
<213>  Simian immunodeficiency virus

<400>  16

Met Gly Cys Leu Gly Asn Gln Leu Leu Ile Ala Ile Leu Leu Leu Ser 
1               5                   10                  15      


Val Tyr Gly Ile Tyr Cys Thr Leu Tyr Val Thr Val Phe Tyr Gly Val 
            20                  25                  30          


Pro Ala Trp Arg Asn Ala Thr Ile Pro Leu Phe Cys Ala Thr Lys Asn 
        35                  40                  45              


Arg Asp Thr Trp Gly Thr Thr Gln Cys Leu Pro Asp Asn Gly Asp Tyr 
    50                  55                  60                  


Ser Glu Val Ala Leu Asn Val Thr Glu Ser Phe Asp Ala Trp Asn Asn 
65                  70                  75                  80  


Thr Val Thr Glu Gln Ala Ile Glu Asp Val Trp Gln Leu Phe Glu Thr 
                85                  90                  95      


Ser Ile Lys Pro Cys Val Lys Leu Ser Pro Leu Cys Ile Thr Met Arg 
            100                 105                 110         


Cys Asn Lys Ser Glu Thr Asp Arg Trp Gly Leu Thr Lys Ser Ile Thr 
        115                 120                 125             


Thr Thr Ala Ser Thr Thr Ser Thr Thr Ala Ser Ala Lys Val Asp Met 
    130                 135                 140                 


Val Asn Glu Thr Ser Ser Cys Ile Ala Gln Asp Asn Cys Thr Gly Leu 
145                 150                 155                 160 


Glu Gln Glu Gln Met Ile Ser Cys Lys Phe Asn Met Thr Gly Leu Lys 
                165                 170                 175     


Arg Asp Lys Lys Lys Glu Tyr Asn Glu Thr Trp Tyr Ser Ala Asp Leu 
            180                 185                 190         


Val Cys Glu Gln Gly Asn Asn Thr Gly Asn Glu Ser Arg Cys Tyr Met 
        195                 200                 205             


Asn His Cys Asn Thr Ser Val Ile Gln Glu Ser Cys Asp Lys His Tyr 
    210                 215                 220                 


Trp Asp Ala Ile Arg Phe Arg Tyr Cys Ala Pro Pro Gly Tyr Ala Leu 
225                 230                 235                 240 


Leu Arg Cys Asn Asp Thr Asn Tyr Ser Gly Phe Met Pro Lys Cys Ser 
                245                 250                 255     


Lys Val Val Val Ser Ser Cys Thr Arg Met Met Glu Thr Gln Thr Ser 
            260                 265                 270         


Thr Trp Phe Gly Phe Asn Gly Thr Arg Ala Glu Asn Arg Thr Tyr Ile 
        275                 280                 285             


Tyr Trp His Gly Arg Asp Asn Arg Thr Ile Ile Ser Leu Asn Lys Tyr 
    290                 295                 300                 


Tyr Asn Leu Thr Met Lys Cys Arg Arg Pro Gly Asn Lys Thr Val Leu 
305                 310                 315                 320 


Pro Val Thr Ile Met Ser Gly Leu Val Phe His Ser Gln Pro Ile Asn 
                325                 330                 335     


Asp Arg Pro Lys Gln Ala Trp Cys Trp Phe Gly Gly Lys Trp Lys Asp 
            340                 345                 350         


Ala Ile Lys Glu Val Lys Gln Thr Ile Val Lys His Pro Arg Tyr Thr 
        355                 360                 365             


Gly Thr Asn Asn Thr Asp Lys Ile Asn Leu Thr Ala Pro Gly Gly Gly 
    370                 375                 380                 


Asp Pro Glu Val Thr Phe Met Trp Thr Asn Cys Arg Gly Glu Phe Leu 
385                 390                 395                 400 


Tyr Cys Lys Met Asn Trp Phe Leu Asn Trp Val Glu Asp Arg Asn Thr 
                405                 410                 415     


Ala Asn Gln Lys Pro Lys Glu Gln His Lys Arg Asn Tyr Val Pro Cys 
            420                 425                 430         


His Ile Arg Gln Ile Ile Asn Thr Trp His Lys Val Gly Lys Asn Val 
        435                 440                 445             


Tyr Leu Pro Pro Arg Glu Gly Asp Leu Thr Cys Asn Ser Thr Val Thr 
    450                 455                 460                 


Ser Leu Ile Ala Asn Ile Asp Trp Ile Asp Gly Asn Gln Thr Asn Ile 
465                 470                 475                 480 


Thr Met Ser Ala Glu Val Ala Glu Leu Tyr Arg Leu Glu Leu Gly Asp 
                485                 490                 495     


Tyr Lys Leu Val Glu Ile Thr Pro Ile Gly Leu Ala Pro Thr Asp Val 
            500                 505                 510         


Lys Arg Tyr Thr Thr Gly Gly Thr Ser Arg Asn Lys Arg Gly Val Phe 
        515                 520                 525             


Val Leu Gly Phe Leu Gly Phe Leu Ala Thr Ala Gly Ser Ala Met Gly 
    530                 535                 540                 


Ala Ala Ser Leu Thr Leu Thr Ala Gln Ser Arg Thr Leu Leu Ala Gly 
545                 550                 555                 560 


Ile Val Gln Gln Gln Gln Gln Leu Leu Asp Val Val Lys Arg Gln Gln 
                565                 570                 575     


Glu Leu Leu Arg Leu Thr Val Trp Gly Thr Lys Asn Leu Gln Thr Arg 
            580                 585                 590         


Val Thr Ala Ile Glu Lys Tyr Leu Lys Asp Gln Ala Gln Leu Asn Ala 
        595                 600                 605             


Trp Gly Cys Ala Phe Arg Gln Val Cys His Thr Thr Val Pro Trp Pro 
    610                 615                 620                 


Asn Ala Ser Leu Thr Pro Lys Trp Asn Asn Glu Thr Trp Gln Glu Trp 
625                 630                 635                 640 


Glu Arg Lys Val Asp Phe Leu Glu Glu Asn Ile Thr Ala Leu Leu Glu 
                645                 650                 655     


Glu Ala Gln Ile Gln Gln Glu Lys Asn Met Tyr Glu Leu Gln Lys Leu 
            660                 665                 670         


Asn Ser Trp Asp Val Phe Gly Asn Trp Phe Asp Leu Ala Ser Trp Ile 
        675                 680                 685             


Lys Tyr Ile Gln Tyr Gly Val Tyr Ile Val Val Gly Val Ile Leu Leu 
    690                 695                 700                 


Arg Ile Val Ile Tyr Ile Val Gln Met Leu Ala Lys Leu Arg Gln Gly 
705                 710                 715                 720 


Tyr Arg Pro Val Phe Ser Ser Pro Pro Ser Tyr Phe Gln Gln Thr His 
                725                 730                 735     


Ile Gln Gln Asp Pro Ala Leu Pro Thr Arg Glu Gly Lys Glu Arg Asp 
            740                 745                 750         


Gly Gly Glu Gly Gly Gly Asn Ser Ser Trp Pro Trp Gln Ile Glu Tyr 
        755                 760                 765             


Ile His Phe Leu Ile Arg Gln Leu Ile Arg Leu Leu Thr Trp Leu Phe 
    770                 775                 780                 


Ser Asn Cys Arg Thr Leu Leu Ser Arg Val Tyr Gln Ile Leu Gln Pro 
785                 790                 795                 800 


Ile Leu Gln Arg Leu Ser Ala Thr Leu Gln Arg Ile Arg Glu Val Leu 
                805                 810                 815     


Arg Thr Glu Leu Thr Tyr Leu Gln Tyr Gly Trp Ser Tyr Phe His Glu 
            820                 825                 830         


Ala Val Gln Ala Val Trp Arg Ser Ala Thr Glu Thr Leu Ala Gly Ala 
        835                 840                 845             


Trp Gly Asp Leu Trp Glu Thr Leu Arg Arg Gly Gly Arg Trp Ile Leu 
    850                 855                 860                 


Ala Ile Pro Arg Arg Ile Arg Gln Gly Leu Glu Leu Thr Leu Leu 
865                 870                 875                 


<210>  17
<211>  2664
<212>  DNA
<213>  Simian immunodeficiency virus

<400>  17
atgggatgtc ttgggaatca gctgcttatc gcgctcttgc tagtaagtgt tttagagatt       60

tgttgtgttc aatatgtaac agtattctat ggtgtaccag catggaagaa tgcgacaatt      120

cccctcttct gtgcaaccag gaatagggac acttggggaa caacacaatg cttgcctgat      180

aatgatgatt actcagaatt ggcagtcaat atcacagagg cttttgatgc ttggaataat      240

acagtcacag aacaagcaat agaggatgtg tggaacctct ttgaaacatc cattaagccc      300

tgtgtaaaac ttaccccact atgtatagca atgaggtgta ataaaactga gacagatagg      360

tggggtttga caggaagagc agagacaaca acaacagcga aatcaacaac atcaacaaca      420

acaacaacag taacaccaaa ggtcataaat gaaggtgatt cttgcataaa agataatagt      480

tgtgcaggct tggaacagga gcccatgata ggttgtaaat ttaacatgac aggattaaag      540

agggacaaaa agatagaata taatgaaaca tggtattcaa gagatttaat ctgtgagcag      600

tcagcaaatg gaagtgagag taaatgttac atgcagcatt gtaacaccag tgttattcag      660

gaatcctgtg acaagcatta ttgggatgct attagattta gatactgtgc accgccaggt      720

tatgctttgc ttaggtgtaa tgattcaaat tattcaggct ttgctcctaa atgttctaag      780

gtagtggttt cttcatgcac aagaatgatg gagacgcaaa cctctacttg gtttggcttc      840

aatggtacta gggcagaaaa tagaacatac atttattggc atggcaatag taatagaacc      900

ataattagct taaataagta ttataatcta acaataagat gtaaaagacc aggaaataag      960

acagttttac cagtcaccat tatgtcaggg ttggtcttcc attcgcaaac cataaatacg     1020

agaccaaaac aggcctggtg ctggtttgaa ggaaactgga gcaaggccat ccaggaagtg     1080

aaggaaacct tggtcaaaca tcccaggtat acgggaacta atgatactag gaaaattaat     1140

ctaacagctc cagcaagagg aaatccagaa gtcactttta tgtggacaaa ttgtcgagga     1200

gaattcttat actgcaaaat gaattggttt ctcaattggg tagaggacag agaccaaaat     1260

agtaacagat ggaaacaaca aaaggagtca gagcaaaaga agagaaatta tgtgccatgt     1320

catattagac aaataatcaa cgcgtggcac aaagtaggca aaaatgtata tttgcctcct     1380

agggaaggag acctgacatg taattccact gtaactagtc tcatagcaaa gatagattgg     1440

atcaataaca atgagaccaa tatcaccatg agtgcagagg tggcagaact gtatcgattg     1500

gagttgggag attacaaatt agtagagatt actccaattg gcttggcccc cacaaatgta     1560

agaaggtaca ccacaactgg tgcctcaaga aataagagag gggtctttgt gctagggttc     1620

ttgggttttc tcgcgacagc aggttctgca atgggcgcgg cgtcgctgac gctgtcggct     1680

cagtcccgga ctttgttggc tgggatagtg cagcaacagc aacagctgtt ggatgtggtc     1740

aagagacaac aagaattgtt gcgactgacc gtctggggaa ctaagaacct ccagactaga     1800

gtcactgcta tcgagaagta cctgaaggat caggcgcggc taaattcatg gggatgtgct     1860

tttaggcaag tctgtcacac tactgtacca tggccaaatg actcattggt gcctaattgg     1920

gacaatatga cttggcaaga gtgggaagga aaggttaact tcctagaggc aaatataact     1980

caattattag aagaagcaca aattcagcaa gaaaagaata tgtatgaatt gcaaaaacta     2040

aatagctggg atatctttgg caattggttt gaccttactt cttggataag atatatacaa     2100

tatggtgtac taatagtttt aggagtagta gggttaagaa tagtgatata tgtagtgcaa     2160

atgctagcta ggttaagaca gggttatagg ccagtgttct cttcccctcc cgcttatgtt     2220

cagcagatcc ctatccacaa ggaccaggaa ccgccaacca aagaaggaga agaaggagaa     2280

ggtggagaca gaggtggcag cagatcttgg ccttggcaga tagaatatat tcatttccta     2340

atccgccaac tgatacgcct cttgacttgg ctattcagca gctgcaggga ttggctattg     2400

aggatctacc agatcctcca accagtgctc cagagactct caaggacgct gcaaagagtt     2460

cgtgaagtca tcagaattga aataacctac ctacaacatg ggtggagcta tttccaagaa     2520

gcagcacagg cgtggtggaa atttgcgcga gagactcttg cgagcgcgtg gagagacata     2580

tgggagactc tgggaagggt tggaagaggg atactcgcaa tccctaggcg cgtcaggcaa     2640

gggcttgagc tcactctctt gtga                                            2664


<210>  18
<211>  887
<212>  PRT
<213>  Simian immunodeficiency virus

<400>  18

Met Gly Cys Leu Gly Asn Gln Leu Leu Ile Ala Leu Leu Leu Val Ser 
1               5                   10                  15      


Val Leu Glu Ile Cys Cys Val Gln Tyr Val Thr Val Phe Tyr Gly Val 
            20                  25                  30          


Pro Ala Trp Lys Asn Ala Thr Ile Pro Leu Phe Cys Ala Thr Arg Asn 
        35                  40                  45              


Arg Asp Thr Trp Gly Thr Thr Gln Cys Leu Pro Asp Asn Asp Asp Tyr 
    50                  55                  60                  


Ser Glu Leu Ala Val Asn Ile Thr Glu Ala Phe Asp Ala Trp Asn Asn 
65                  70                  75                  80  


Thr Val Thr Glu Gln Ala Ile Glu Asp Val Trp Asn Leu Phe Glu Thr 
                85                  90                  95      


Ser Ile Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Ile Ala Met Arg 
            100                 105                 110         


Cys Asn Lys Thr Glu Thr Asp Arg Trp Gly Leu Thr Gly Arg Ala Glu 
        115                 120                 125             


Thr Thr Thr Thr Ala Lys Ser Thr Thr Ser Thr Thr Thr Thr Thr Val 
    130                 135                 140                 


Thr Pro Lys Val Ile Asn Glu Gly Asp Ser Cys Ile Lys Asp Asn Ser 
145                 150                 155                 160 


Cys Ala Gly Leu Glu Gln Glu Pro Met Ile Gly Cys Lys Phe Asn Met 
                165                 170                 175     


Thr Gly Leu Lys Arg Asp Lys Lys Ile Glu Tyr Asn Glu Thr Trp Tyr 
            180                 185                 190         


Ser Arg Asp Leu Ile Cys Glu Gln Ser Ala Asn Gly Ser Glu Ser Lys 
        195                 200                 205             


Cys Tyr Met Gln His Cys Asn Thr Ser Val Ile Gln Glu Ser Cys Asp 
    210                 215                 220                 


Lys His Tyr Trp Asp Ala Ile Arg Phe Arg Tyr Cys Ala Pro Pro Gly 
225                 230                 235                 240 


Tyr Ala Leu Leu Arg Cys Asn Asp Ser Asn Tyr Ser Gly Phe Ala Pro 
                245                 250                 255     


Lys Cys Ser Lys Val Val Val Ser Ser Cys Thr Arg Met Met Glu Thr 
            260                 265                 270         


Gln Thr Ser Thr Trp Phe Gly Phe Asn Gly Thr Arg Ala Glu Asn Arg 
        275                 280                 285             


Thr Tyr Ile Tyr Trp His Gly Asn Ser Asn Arg Thr Ile Ile Ser Leu 
    290                 295                 300                 


Asn Lys Tyr Tyr Asn Leu Thr Ile Arg Cys Lys Arg Pro Gly Asn Lys 
305                 310                 315                 320 


Thr Val Leu Pro Val Thr Ile Met Ser Gly Leu Val Phe His Ser Gln 
                325                 330                 335     


Thr Ile Asn Thr Arg Pro Lys Gln Ala Trp Cys Trp Phe Glu Gly Asn 
            340                 345                 350         


Trp Ser Lys Ala Ile Gln Glu Val Lys Glu Thr Leu Val Lys His Pro 
        355                 360                 365             


Arg Tyr Thr Gly Thr Asn Asp Thr Arg Lys Ile Asn Leu Thr Ala Pro 
    370                 375                 380                 


Ala Arg Gly Asn Pro Glu Val Thr Phe Met Trp Thr Asn Cys Arg Gly 
385                 390                 395                 400 


Glu Phe Leu Tyr Cys Lys Met Asn Trp Phe Leu Asn Trp Val Glu Asp 
                405                 410                 415     


Arg Asp Gln Asn Ser Asn Arg Trp Lys Gln Gln Lys Glu Ser Glu Gln 
            420                 425                 430         


Lys Lys Arg Asn Tyr Val Pro Cys His Ile Arg Gln Ile Ile Asn Ala 
        435                 440                 445             


Trp His Lys Val Gly Lys Asn Val Tyr Leu Pro Pro Arg Glu Gly Asp 
    450                 455                 460                 


Leu Thr Cys Asn Ser Thr Val Thr Ser Leu Ile Ala Lys Ile Asp Trp 
465                 470                 475                 480 


Ile Asn Asn Asn Glu Thr Asn Ile Thr Met Ser Ala Glu Val Ala Glu 
                485                 490                 495     


Leu Tyr Arg Leu Glu Leu Gly Asp Tyr Lys Leu Val Glu Ile Thr Pro 
            500                 505                 510         


Ile Gly Leu Ala Pro Thr Asn Val Arg Arg Tyr Thr Thr Thr Gly Ala 
        515                 520                 525             


Ser Arg Asn Lys Arg Gly Val Phe Val Leu Gly Phe Leu Gly Phe Leu 
    530                 535                 540                 


Ala Thr Ala Gly Ser Ala Met Gly Ala Ala Ser Leu Thr Leu Ser Ala 
545                 550                 555                 560 


Gln Ser Arg Thr Leu Leu Ala Gly Ile Val Gln Gln Gln Gln Gln Leu 
                565                 570                 575     


Leu Asp Val Val Lys Arg Gln Gln Glu Leu Leu Arg Leu Thr Val Trp 
            580                 585                 590         


Gly Thr Lys Asn Leu Gln Thr Arg Val Thr Ala Ile Glu Lys Tyr Leu 
        595                 600                 605             


Lys Asp Gln Ala Arg Leu Asn Ser Trp Gly Cys Ala Phe Arg Gln Val 
    610                 615                 620                 


Cys His Thr Thr Val Pro Trp Pro Asn Asp Ser Leu Val Pro Asn Trp 
625                 630                 635                 640 


Asp Asn Met Thr Trp Gln Glu Trp Glu Gly Lys Val Asn Phe Leu Glu 
                645                 650                 655     


Ala Asn Ile Thr Gln Leu Leu Glu Glu Ala Gln Ile Gln Gln Glu Lys 
            660                 665                 670         


Asn Met Tyr Glu Leu Gln Lys Leu Asn Ser Trp Asp Ile Phe Gly Asn 
        675                 680                 685             


Trp Phe Asp Leu Thr Ser Trp Ile Arg Tyr Ile Gln Tyr Gly Val Leu 
    690                 695                 700                 


Ile Val Leu Gly Val Val Gly Leu Arg Ile Val Ile Tyr Val Val Gln 
705                 710                 715                 720 


Met Leu Ala Arg Leu Arg Gln Gly Tyr Arg Pro Val Phe Ser Ser Pro 
                725                 730                 735     


Pro Ala Tyr Val Gln Gln Ile Pro Ile His Lys Asp Gln Glu Pro Pro 
            740                 745                 750         


Thr Lys Glu Gly Glu Glu Gly Glu Gly Gly Asp Arg Gly Gly Ser Arg 
        755                 760                 765             


Ser Trp Pro Trp Gln Ile Glu Tyr Ile His Phe Leu Ile Arg Gln Leu 
    770                 775                 780                 


Ile Arg Leu Leu Thr Trp Leu Phe Ser Ser Cys Arg Asp Trp Leu Leu 
785                 790                 795                 800 


Arg Ile Tyr Gln Ile Leu Gln Pro Val Leu Gln Arg Leu Ser Arg Thr 
                805                 810                 815     


Leu Gln Arg Val Arg Glu Val Ile Arg Ile Glu Ile Thr Tyr Leu Gln 
            820                 825                 830         


His Gly Trp Ser Tyr Phe Gln Glu Ala Ala Gln Ala Trp Trp Lys Phe 
        835                 840                 845             


Ala Arg Glu Thr Leu Ala Ser Ala Trp Arg Asp Ile Trp Glu Thr Leu 
    850                 855                 860                 


Gly Arg Val Gly Arg Gly Ile Leu Ala Ile Pro Arg Arg Val Arg Gln 
865                 870                 875                 880 


Gly Leu Glu Leu Thr Leu Leu 
                885






B4169315.1





