                         SEQUENCE LISTING
<110>  INSERM
 
<120>  Synthetic gene coding HIV-1 GAG protein

<130>  BET 09P1156

<150>  EP 08290849.2
<151>  2008-09-10 

<160>  8     

<170>  PatentIn version 3.5

<210>  1
<211>  1543
<212>  DNA
<213>  Artificial
<220>
<223>  synthetic GAG Gene
<400>  1
atgggcgcca gggccagcgt gctgagcgga ggcgagctgg acaggtggga gaagatcagg       60
ctgaggcctg gaggcaagaa gaagtataag ctgaagcaca tcgtgtgggc cagcagggag      120
ctggagaggt tcgccgtgaa ccctggcctg ctggagacca gcgagggctg caggcagatc      180
ctgggccagc tgcagcccag cctgcagacc ggcagcgagg agctgaggag cctgtacaac      240
accgtggcca ccctgtactg cgtgcaccag aggatcgaga tcaaggacac caaggaggcc      300
ctggacaaga tcgaggagga gcagaacaag tccaagaaga aggcccagca ggctgctgcc      360
gacaccggcc acagcagcca ggtgagccag aactacccta tcgtgcagaa catccagggc      420
cagatggtgc accaggccat cagccctagg accctgaacg cctgggtgaa ggtggtggag      480
gagaaggcct tcagccctga ggtgatccct atgttcagcg ccctgagcga gggagccaca      540
cctcaggacc tgaacaccat gctgaacacc gtgggaggcc accaggccgc catgcagatg      600
ctgaaggaga ccatcaacga ggaggctgcc gagtgggaca gggtgcaccc tgtgcacgct      660
ggacccatcg ctccaggcca gatgagggag cccagaggca gcgacatcgc cggcaccacc      720
agcaccctgc aggagcagat cggctggatg accaacaacc ctcccatccc tgtgggcgaa      780
atctacaaga ggtggatcat cctgggcctg aacaagatcg tgaggatgta cagccctacc      840
agcatcctgg atatcaggca gggccctaaa gagcccttca gggactacgt ggacaggttc      900
tacaagaccc tgagagccga gcaggccagc caggaggtga agaactggat gaccgagacc      960
ctgctggtgc agaacgccaa ccctgactgc aagaccatcc tgaaggccct gggacctgct     1020
gccaccctgg aggagatgat gaccgcctgc cagggcgtgg gaggcccagg ccacaaggcc     1080
agggtgctgg ccgaggccat gagccaggtg accaacaccg ccaccatcat gatgcagaga     1140
ggcaacttca ggaaccagag gaagatggtg aagtgcttca actgcggcaa ggagggccac     1200
accgccagga actgcagggc tcccaggaag aagggctgct ggaagtgcgg caaggagggc     1260
caccagatga aggactgcac cgagaggcag gccaacttcc tgggcaagat ctggcccagc     1320
tacaagggca ggccaggcaa cttcctgcag agcaggcccg agcccaccgc tccacctttc     1380
ctgcagagca ggcccgagcc caccgctcct cctgaggaga gcttcaggag cggcgtggag     1440
acaaccaccc ctcctcagaa gcaggagccc atcgacaagg agctgtaccc tctgaccagc     1500
ctgaggagcc tgttcggcaa cgaccctagc agccaggagt cga                       1543

<210>  2
<211>  2388
<212>  DNA
<213>  Artificial
<220>
<223>  synthetic gag -nef-pol

<220>
<221>  CDS
<222>  (1)..(2388)
<400>  2
atg ggc gcc agg gcc agc gtg ctg agc gga ggc gag ctg gac agg tgg         48
Met Gly Ala Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Arg Trp           
1               5                   10                  15                
gag aag atc agg ctg agg cct gga ggc aag aag aag tat aag ctg aag         96
Glu Lys Ile Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr Lys Leu Lys           
            20                  25                  30                    
cac atc gtg tgg gcc agc agg gag ctg gag agg ttc gcc gtg aac cct        144
His Ile Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Val Asn Pro           
        35                  40                  45                        
ggc ctg ctg gag acc agc gag ggc tgc agg cag atc ctg ggc cag ctg        192
Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gln Ile Leu Gly Gln Leu           
    50                  55                  60                            
cag ccc agc ctg cag acc ggc agc gag gag ctg agg agc ctg tac aac        240
Gln Pro Ser Leu Gln Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn           
65                  70                  75                  80            
acc gtg gcc acc ctg tac tgc gtg cac cag agg atc gag atc aag gac        288
Thr Val Ala Thr Leu Tyr Cys Val His Gln Arg Ile Glu Ile Lys Asp           
                85                  90                  95                
acc aag gag gcc ctg gac aag atc gag gag gag cag aac aag tcc aag        336
Thr Lys Glu Ala Leu Asp Lys Ile Glu Glu Glu Gln Asn Lys Ser Lys           
            100                 105                 110                   
aag aag gcc cag cag gct gct gcc gac acc ggc cac agc agc cag gtg        384
Lys Lys Ala Gln Gln Ala Ala Ala Asp Thr Gly His Ser Ser Gln Val           
        115                 120                 125                       
agc cag aac tac cct atc gtg cag aac atc cag ggc cag atg gtg cac        432
Ser Gln Asn Tyr Pro Ile Val Gln Asn Ile Gln Gly Gln Met Val His           
    130                 135                 140                           
cag gcc atc agc cct agg acc ctg aac gcc tgg gtg aag gtg gtg gag        480
Gln Ala Ile Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu           
145                 150                 155                 160           
gag aag gcc ttc agc cct gag gtg atc cct atg ttc agc gcc ctg agc        528
Glu Lys Ala Phe Ser Pro Glu Val Ile Pro Met Phe Ser Ala Leu Ser           
                165                 170                 175               
gag gga gcc aca cct cag gac ctg aac acc atg ctg aac acc gtg gga        576
Glu Gly Ala Thr Pro Gln Asp Leu Asn Thr Met Leu Asn Thr Val Gly           
            180                 185                 190                   
ggc cac cag gcc gcc atg cag atg ctg aag gag acc atc aac gag gag        624
Gly His Gln Ala Ala Met Gln Met Leu Lys Glu Thr Ile Asn Glu Glu           
        195                 200                 205                       
gct gcc gag tgg gac agg gtg cac cct gtg cac gct gga ccc atc gct        672
Ala Ala Glu Trp Asp Arg Val His Pro Val His Ala Gly Pro Ile Ala           
    210                 215                 220                           
cca ggc cag atg agg gag ccc aga ggc agc gac atc gcc ggc acc acc        720
Pro Gly Gln Met Arg Glu Pro Arg Gly Ser Asp Ile Ala Gly Thr Thr           
225                 230                 235                 240           
agc acc ctg cag gag cag atc ggc tgg atg acc aac aac cct ccc atc        768
Ser Thr Leu Gln Glu Gln Ile Gly Trp Met Thr Asn Asn Pro Pro Ile           
                245                 250                 255               
cct gtg ggc gaa atc tac aag agg tgg atc atc ctg ggc ctg aac aag        816
Pro Val Gly Glu Ile Tyr Lys Arg Trp Ile Ile Leu Gly Leu Asn Lys           
            260                 265                 270                   
atc gtg agg atg tac agc cct acc agc atc ctg gat atc agg cag ggc        864
Ile Val Arg Met Tyr Ser Pro Thr Ser Ile Leu Asp Ile Arg Gln Gly           
        275                 280                 285                       
cct aaa gag ccc ttc agg gac tac gtg gac agg ttc tac aag acc ctg        912
Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu           
    290                 295                 300                           
aga gcc gag cag gcc agc cag gag gtg aag aac tgg atg acc gag acc        960
Arg Ala Glu Gln Ala Ser Gln Glu Val Lys Asn Trp Met Thr Glu Thr           
305                 310                 315                 320           
ctg ctg gtg cag aac gcc aac cct gac tgc aag acc atc ctg aag gcc       1008
Leu Leu Val Gln Asn Ala Asn Pro Asp Cys Lys Thr Ile Leu Lys Ala           
                325                 330                 335               
ctg gga cct gct gcc acc ctg gag gag atg atg acc gcc tgc cag ggc       1056
Leu Gly Pro Ala Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gln Gly           
            340                 345                 350                   
gtg gga ggc cca ggc cac aag gcc agg gtg ctg gcc gag gcc atg agc       1104
Val Gly Gly Pro Gly His Lys Ala Arg Val Leu Ala Glu Ala Met Ser           
        355                 360                 365                       
cag gtg acc aac acc gcc acc atc atg atg cag aga ggc aac ttc agg       1152
Gln Val Thr Asn Thr Ala Thr Ile Met Met Gln Arg Gly Asn Phe Arg           
    370                 375                 380                           
aac cag agg aag atg gtg aag tgc ttc aac tgc ggc aag gag ggc cac       1200
Asn Gln Arg Lys Met Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His           
385                 390                 395                 400           
acc gcc agg aac tgc agg gct ccc agg aag aag ggc tgc tgg aag tgc       1248
Thr Ala Arg Asn Cys Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys           
                405                 410                 415               
ggc aag gag ggc cac cag atg aag gac tgc acc gag agg cag gcc aac       1296
Gly Lys Glu Gly His Gln Met Lys Asp Cys Thr Glu Arg Gln Ala Asn           
            420                 425                 430                   
ttc ctg ggc aag atc tgg ccc agc tac aag ggc agg cca ggc aac ttc       1344
Phe Leu Gly Lys Ile Trp Pro Ser Tyr Lys Gly Arg Pro Gly Asn Phe           
        435                 440                 445                       
ctg cag agc agg ccc gag ccc acc gct cca cct ttc ctg cag agc agg       1392
Leu Gln Ser Arg Pro Glu Pro Thr Ala Pro Pro Phe Leu Gln Ser Arg           
    450                 455                 460                           
ccc gag ccc acc gct cct cct gag gag agc ttc agg agc ggc gtg gag       1440
Pro Glu Pro Thr Ala Pro Pro Glu Glu Ser Phe Arg Ser Gly Val Glu           
465                 470                 475                 480           
aca acc acc cct cct cag aag cag gag ccc atc gac aag gag ctg tac       1488
Thr Thr Thr Pro Pro Gln Lys Gln Glu Pro Ile Asp Lys Glu Leu Tyr           
                485                 490                 495               
cct ctg acc agc ctg agg agc ctg ttc ggc aac gac cct agc agc cag       1536
Pro Leu Thr Ser Leu Arg Ser Leu Phe Gly Asn Asp Pro Ser Ser Gln           
            500                 505                 510                   
gag tcg acc ggg cca cta aca gaa gaa gca gag cta gaa ctg gca gaa       1584
Glu Ser Thr Gly Pro Leu Thr Glu Glu Ala Glu Leu Glu Leu Ala Glu           
        515                 520                 525                       
aac aga gag att cta aaa gaa cca gta cat gga gtg tat tat gac cca       1632
Asn Arg Glu Ile Leu Lys Glu Pro Val His Gly Val Tyr Tyr Asp Pro           
    530                 535                 540                           
tca aaa gac tta ata gca gaa ata cag aag cag ggg caa ggc caa tgg       1680
Ser Lys Asp Leu Ile Ala Glu Ile Gln Lys Gln Gly Gln Gly Gln Trp           
545                 550                 555                 560           
aca tat caa att tat caa gag cca ttt aaa aat ctg aaa aca gga atg       1728
Thr Tyr Gln Ile Tyr Gln Glu Pro Phe Lys Asn Leu Lys Thr Gly Met           
                565                 570                 575               
gag tgg aga ttt gat tct aga tta gca ttt cat cac gta gct aga gaa       1776
Glu Trp Arg Phe Asp Ser Arg Leu Ala Phe His His Val Ala Arg Glu           
            580                 585                 590                   
tta cat cct gaa tat ttt aaa aat tgt aag ctt atg gca ata ttc caa       1824
Leu His Pro Glu Tyr Phe Lys Asn Cys Lys Leu Met Ala Ile Phe Gln           
        595                 600                 605                       
agt agc atg aca aaa atc tta gag cct ttt aga aaa caa aat cca gac       1872
Ser Ser Met Thr Lys Ile Leu Glu Pro Phe Arg Lys Gln Asn Pro Asp           
    610                 615                 620                           
ata gtt atc tat caa tac atg gat gat ttg tat gta gga tct gac tta       1920
Ile Val Ile Tyr Gln Tyr Met Asp Asp Leu Tyr Val Gly Ser Asp Leu           
625                 630                 635                 640           
gaa ata ggg cag cat aga aca aaa ata gag gag ctg aga caa cat ctg       1968
Glu Ile Gly Gln His Arg Thr Lys Ile Glu Glu Leu Arg Gln His Leu           
                645                 650                 655               
ttg agg tgg gga ctt aca acc atg gta ggt ttt cca gta aca cct caa       2016
Leu Arg Trp Gly Leu Thr Thr Met Val Gly Phe Pro Val Thr Pro Gln           
            660                 665                 670                   
gta cct tta aga cca atg act tac aaa gca gct gta gat ctt tct cac       2064
Val Pro Leu Arg Pro Met Thr Tyr Lys Ala Ala Val Asp Leu Ser His           
        675                 680                 685                       
ttt tta aaa gaa aaa gga ggt tta gaa ggg cta att cat tct caa cga       2112
Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly Leu Ile His Ser Gln Arg           
    690                 695                 700                           
aga caa gat att ctt gat ttg tgg att tat cat aca caa gga tat ttt       2160
Arg Gln Asp Ile Leu Asp Leu Trp Ile Tyr His Thr Gln Gly Tyr Phe           
705                 710                 715                 720           
cct gat tgg cag aat tac aca cca gga cca gga gtc aga tac cca tta       2208
Pro Asp Trp Gln Asn Tyr Thr Pro Gly Pro Gly Val Arg Tyr Pro Leu           
                725                 730                 735               
acc ttt ggt tgg tgc tac aag cta gta cca atg att gag act gta cca       2256
Thr Phe Gly Trp Cys Tyr Lys Leu Val Pro Met Ile Glu Thr Val Pro           
            740                 745                 750                   
gta aaa tta aag cca gga atg gat ggc cca aaa gtt aaa caa tgg cca       2304
Val Lys Leu Lys Pro Gly Met Asp Gly Pro Lys Val Lys Gln Trp Pro           
        755                 760                 765                       
ttg aca gaa gaa aaa ata aaa gca tta gta gaa att tgt aca gag atg       2352
Leu Thr Glu Glu Lys Ile Lys Ala Leu Val Glu Ile Cys Thr Glu Met           
    770                 775                 780                           
gaa aag gaa ggg aaa att tca aaa att ggg cct taa                       2388
Glu Lys Glu Gly Lys Ile Ser Lys Ile Gly Pro                               
785                 790                 795                               

<210>  3
<211>  795
<212>  PRT
<213>  Artificial
<220>
<223>  Synthetic Construct
<400>  3
Met Gly Ala Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Arg Trp 
1               5                   10                  15      

Glu Lys Ile Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr Lys Leu Lys 
            20                  25                  30          

His Ile Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Val Asn Pro 
        35                  40                  45              

Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gln Ile Leu Gly Gln Leu 
    50                  55                  60                  

Gln Pro Ser Leu Gln Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn 
65                  70                  75                  80  

Thr Val Ala Thr Leu Tyr Cys Val His Gln Arg Ile Glu Ile Lys Asp 
                85                  90                  95      

Thr Lys Glu Ala Leu Asp Lys Ile Glu Glu Glu Gln Asn Lys Ser Lys 
            100                 105                 110         

Lys Lys Ala Gln Gln Ala Ala Ala Asp Thr Gly His Ser Ser Gln Val 
        115                 120                 125             

Ser Gln Asn Tyr Pro Ile Val Gln Asn Ile Gln Gly Gln Met Val His 
    130                 135                 140                 

Gln Ala Ile Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu 
145                 150                 155                 160 

Glu Lys Ala Phe Ser Pro Glu Val Ile Pro Met Phe Ser Ala Leu Ser 
                165                 170                 175     


Glu Gly Ala Thr Pro Gln Asp Leu Asn Thr Met Leu Asn Thr Val Gly 
            180                 185                 190         

Gly His Gln Ala Ala Met Gln Met Leu Lys Glu Thr Ile Asn Glu Glu 
        195                 200                 205             

Ala Ala Glu Trp Asp Arg Val His Pro Val His Ala Gly Pro Ile Ala 
    210                 215                 220                 

Pro Gly Gln Met Arg Glu Pro Arg Gly Ser Asp Ile Ala Gly Thr Thr 
225                 230                 235                 240 

Ser Thr Leu Gln Glu Gln Ile Gly Trp Met Thr Asn Asn Pro Pro Ile 
                245                 250                 255     

Pro Val Gly Glu Ile Tyr Lys Arg Trp Ile Ile Leu Gly Leu Asn Lys 
            260                 265                 270         

Ile Val Arg Met Tyr Ser Pro Thr Ser Ile Leu Asp Ile Arg Gln Gly 
        275                 280                 285             

Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu 
    290                 295                 300                 

Arg Ala Glu Gln Ala Ser Gln Glu Val Lys Asn Trp Met Thr Glu Thr 
305                 310                 315                 320 

Leu Leu Val Gln Asn Ala Asn Pro Asp Cys Lys Thr Ile Leu Lys Ala 
                325                 330                 335     

Leu Gly Pro Ala Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gln Gly 
            340                 345                 350         

Val Gly Gly Pro Gly His Lys Ala Arg Val Leu Ala Glu Ala Met Ser 
        355                 360                 365             

Gln Val Thr Asn Thr Ala Thr Ile Met Met Gln Arg Gly Asn Phe Arg 
    370                 375                 380                 

Asn Gln Arg Lys Met Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His 
385                 390                 395                 400 

Thr Ala Arg Asn Cys Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys 
                405                 410                 415     

Gly Lys Glu Gly His Gln Met Lys Asp Cys Thr Glu Arg Gln Ala Asn 
            420                 425                 430         

Phe Leu Gly Lys Ile Trp Pro Ser Tyr Lys Gly Arg Pro Gly Asn Phe 
        435                 440                 445             

Leu Gln Ser Arg Pro Glu Pro Thr Ala Pro Pro Phe Leu Gln Ser Arg 
    450                 455                 460                 

Pro Glu Pro Thr Ala Pro Pro Glu Glu Ser Phe Arg Ser Gly Val Glu 
465                 470                 475                 480 

Thr Thr Thr Pro Pro Gln Lys Gln Glu Pro Ile Asp Lys Glu Leu Tyr 
                485                 490                 495     


Pro Leu Thr Ser Leu Arg Ser Leu Phe Gly Asn Asp Pro Ser Ser Gln 
            500                 505                 510         

Glu Ser Thr Gly Pro Leu Thr Glu Glu Ala Glu Leu Glu Leu Ala Glu 
        515                 520                 525             

Asn Arg Glu Ile Leu Lys Glu Pro Val His Gly Val Tyr Tyr Asp Pro 
    530                 535                 540                 

Ser Lys Asp Leu Ile Ala Glu Ile Gln Lys Gln Gly Gln Gly Gln Trp 
545                 550                 555                 560 

Thr Tyr Gln Ile Tyr Gln Glu Pro Phe Lys Asn Leu Lys Thr Gly Met 
                565                 570                 575     

Glu Trp Arg Phe Asp Ser Arg Leu Ala Phe His His Val Ala Arg Glu 
            580                 585                 590         

Leu His Pro Glu Tyr Phe Lys Asn Cys Lys Leu Met Ala Ile Phe Gln 
        595                 600                 605             

Ser Ser Met Thr Lys Ile Leu Glu Pro Phe Arg Lys Gln Asn Pro Asp 
    610                 615                 620                 

Ile Val Ile Tyr Gln Tyr Met Asp Asp Leu Tyr Val Gly Ser Asp Leu 
625                 630                 635                 640 

Glu Ile Gly Gln His Arg Thr Lys Ile Glu Glu Leu Arg Gln His Leu 
                645                 650                 655     

Leu Arg Trp Gly Leu Thr Thr Met Val Gly Phe Pro Val Thr Pro Gln 
            660                 665                 670         

Val Pro Leu Arg Pro Met Thr Tyr Lys Ala Ala Val Asp Leu Ser His 
        675                 680                 685             

Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly Leu Ile His Ser Gln Arg 
    690                 695                 700                 

Arg Gln Asp Ile Leu Asp Leu Trp Ile Tyr His Thr Gln Gly Tyr Phe 
705                 710                 715                 720 

Pro Asp Trp Gln Asn Tyr Thr Pro Gly Pro Gly Val Arg Tyr Pro Leu 
                725                 730                 735     

Thr Phe Gly Trp Cys Tyr Lys Leu Val Pro Met Ile Glu Thr Val Pro 
            740                 745                 750         

Val Lys Leu Lys Pro Gly Met Asp Gly Pro Lys Val Lys Gln Trp Pro 
        755                 760                 765             

Leu Thr Glu Glu Lys Ile Lys Ala Leu Val Glu Ile Cys Thr Glu Met 
    770                 775                 780                 

Glu Lys Glu Gly Lys Ile Ser Lys Ile Gly Pro 
785                 790                 795 

<210>  4
<211>  6000
<212>  DNA
<213>  artificial
<220>
<223>  Fragment of vector pTG17401
<400>  4
aaataaatca tataaaaaat gatttcatga ttaaaccatg ttgtgaaaaa gtcaagaacg       60
ttcacattgg cggacaatct aaaaacaata cagtgattgc agatttgcca tatatggata      120
atgcggtatc cgatgtatgc aattcactgt ataaaaagaa tgtatcaaga atatccagat      180
ttgctaattt gataaagata gatgacgatg acaagactcc tactggtgta tataattatt      240
ttaaacctaa agatgccatt cctgttatta tatccatagg aaaggataga gatgtttgtg      300
aactattaat ctcatctgat aaagcgtgtg cgtgtataga gttaaattca tataaagtag      360
ccattcttcc catggatgtt tcctttttta ccaaaggaaa tgcatcattg attattctcc      420
tgtttgattt ctctatcgat gcggcacctc tcttaagaag tgtaaccgat aataatgtta      480
ttatatctag acaccagcgt ctacatgacg agcttccgag ttccaattgg ttcaagtttt      540
acataagtat aaagtccgac tattgttcta tattatatat ggttgttgat ggatctgtga      600
tgcatgcaat agctgataat agaacttacg caaatattag caaaaatata ttagacaata      660
ctacaattaa cgatgagtgt agatgctgtt attttgaacc acagattagg attcttgata      720
gagatgagat gctcaatgga tcatcgtgtg atatgaacag acattgtatt atgatgaatt      780
tacctgatgt aggcgaattt ggatctagta tgttggggaa atatgaacct gacatgatta      840
agattgctct ttcggtggct ggtgagctcg gatctaagct tgtcgacata aaaatatagt      900
agaatttcat ttgttttttt ctatgctata aataggatcg atccgataaa gtgaaaaata      960
attctaattt attgcacggt aaggaagtag aatcataaag aaaagcttct gcaggtcgac     1020
atggtgagca agggcgagga gctgttcacc ggggtggtgc ccatcctggt cgagctggac     1080
ggcgacgtaa acggccacaa gttcagcgtg tccggcgagg gcgagggcga tgccacctac     1140
ggcaagctga ccctgaagtt catctgcacc accggcaagc tgcccgtgcc ctggcccacc     1200
ctcgtgacca ccctgaccta cggcgtgcag tgcttcagcc gctaccccga ccacatgaag     1260
cagcacgact tcttcaagtc cgccatgccc gaaggctacg tccaggagcg caccatcttc     1320
ttcaaggacg acggcaacta caagacccgc gccgaggtga agttcgaggg cgacaccctg     1380
gtgaaccgca tcgagctgaa gggcatcgac ttcaaggagg acggcaacat cctggggcac     1440
aagctggagt acaactacaa cagccacaac gtctatatca tggccgacaa gcagaagaac     1500
ggcatcaagg tgaacttcaa gatccgccac aacatcgagg acggcagcgt gcagctcgcc     1560
gaccactacc agcagaacac ccccatcggc gacggccccg tgctgctgcc cgacaaccac     1620
tacctgagca cccagtccgc cctgagcaaa gaccccaacg agaagcgcga tcacatggtc     1680
ctgctggagt tcgtgaccgc cgccgggatc actctcggca tggacgagct gtacaagagc     1740
gaaaaataca tcgtcacctg ggacatgttg cagatccatg cacgtaaact cgcaagccga     1800
ctgatgcctt ctgaacaatg gaaaggcatt attgccgtaa gccgtggcgg tctggtaccg     1860
ggtgcgttac tggcgcgtga actgggtatt cgtcatgtcg ataccgtttg tatttccagc     1920
tacgatcacg acaaccagcg cgagcttaaa gtgctgaaac gcgcagaagg cgatggcgaa     1980
ggcttcatcg ttattgatga cctggtggat accggtggta ctgcggttgc gattcgtgaa     2040
atgtatccaa aagcgcactt tgtcaccatc ttcgcaaaac cggctggtcg tccgctggtt     2100
gatgactatg ttgttgatat cccgcaagat acctggattg aacagccgtg ggatatgggc     2160
gtcgtattcg tcccgccaat ctccggtcgc taatcttttc aacgcctggc actgccgggc     2220
gttgttcttt ttaacttccc tgcataatta acgatgagtg tagatgctgt tattttgaac     2280
cacagattag gattcttgat agagatgaga tgctcaatgg atcatcgtgt gatatgaaca     2340
gacattgtat tatgatgaat ttacctgatg taggcgaatt tggatctagt atgttgggga     2400
aatatgaacc tgacatgatt aagattgctc tttcggtggc tggtgagctc ggatctttta     2460
ttctatactt aaaaaatgaa aataaataca aaggttcttg agggttgtgt taaattgaaa     2520
gcgagaaata atcataaatt atttcattat cgcgatatcc gttaagtttg ctgcagctgg     2580
atccatgggc gccagggcca gcgtgctgag cggaggcgag ctggacaggt gggagaagat     2640
caggctgagg cctggaggca agaagaagta taagctgaag cacatcgtgt gggccagcag     2700
ggagctggag aggttcgccg tgaaccctgg cctgctggag accagcgagg gctgcaggca     2760
gatcctgggc cagctgcagc ccagcctgca gaccggcagc gaggagctga ggagcctgta     2820
caacaccgtg gccaccctgt actgcgtgca ccagaggatc gagatcaagg acaccaagga     2880
ggccctggac aagatcgagg aggagcagaa caagtccaag aagaaggccc agcaggctgc     2940
tgccgacacc ggccacagca gccaggtgag ccagaactac cctatcgtgc agaacatcca     3000
gggccagatg gtgcaccagg ccatcagccc taggaccctg aacgcctggg tgaaggtggt     3060
ggaggagaag gccttcagcc ctgaggtgat ccctatgttc agcgccctga gcgagggagc     3120
cacacctcag gacctgaaca ccatgctgaa caccgtggga ggccaccagg ccgccatgca     3180
gatgctgaag gagaccatca acgaggaggc tgccgagtgg gacagggtgc accctgtgca     3240
cgctggaccc atcgctccag gccagatgag ggagcccaga ggcagcgaca tcgccggcac     3300
caccagcacc ctgcaggagc agatcggctg gatgaccaac aaccctccca tccctgtggg     3360
cgaaatctac aagaggtgga tcatcctggg cctgaacaag atcgtgagga tgtacagccc     3420
taccagcatc ctggatatca ggcagggccc taaagagccc ttcagggact acgtggacag     3480
gttctacaag accctgagag ccgagcaggc cagccaggag gtgaagaact ggatgaccga     3540
gaccctgctg gtgcagaacg ccaaccctga ctgcaagacc atcctgaagg ccctgggacc     3600
tgctgccacc ctggaggaga tgatgaccgc ctgccagggc gtgggaggcc caggccacaa     3660
ggccagggtg ctggccgagg ccatgagcca ggtgaccaac accgccacca tcatgatgca     3720
gagaggcaac ttcaggaacc agaggaagat ggtgaagtgc ttcaactgcg gcaaggaggg     3780
ccacaccgcc aggaactgca gggctcccag gaagaagggc tgctggaagt gcggcaagga     3840
gggccaccag atgaaggact gcaccgagag gcaggccaac ttcctgggca agatctggcc     3900
cagctacaag ggcaggccag gcaacttcct gcagagcagg cccgagccca ccgctccacc     3960
tttcctgcag agcaggcccg agcccaccgc tcctcctgag gagagcttca ggagcggcgt     4020
ggagacaacc acccctcctc agaagcagga gcccatcgac aaggagctgt accctctgac     4080
cagcctgagg agcctgttcg gcaacgaccc tagcagccag gagtcgaccg ggccactaac     4140
agaagaagca gagctagaac tggcagaaaa cagagagatt ctaaaagaac cagtacatgg     4200
agtgtattat gacccatcaa aagacttaat agcagaaata cagaagcagg ggcaaggcca     4260
atggacatat caaatttatc aagagccatt taaaaatctg aaaacaggaa tggagtggag     4320
atttgattct agattagcat ttcatcacgt agctagagaa ttacatcctg aatattttaa     4380
aaattgtaag cttatggcaa tattccaaag tagcatgaca aaaatcttag agccttttag     4440
aaaacaaaat ccagacatag ttatctatca atacatggat gatttgtatg taggatctga     4500
cttagaaata gggcagcata gaacaaaaat agaggagctg agacaacatc tgttgaggtg     4560
gggacttaca accatggtag gttttccagt aacacctcaa gtacctttaa gaccaatgac     4620
ttacaaagca gctgtagatc tttctcactt tttaaaagaa aaaggaggtt tagaagggct     4680
aattcattct caacgaagac aagatattct tgatttgtgg atttatcata cacaaggata     4740
ttttcctgat tggcagaatt acacaccagg accaggagtc agatacccat taacctttgg     4800
ttggtgctac aagctagtac caatgattga gactgtacca gtaaaattaa agccaggaat     4860
ggatggccca aaagttaaac aatggccatt gacagaagaa aaaataaaag cattagtaga     4920
aatttgtaca gagatggaaa aggaagggaa aatttcaaaa attgggcctt aagcggccgc     4980
cccgggagat ctcgatccgg aaagttttat aggtagttga tagaacaaaa tacataattt     5040
tgtaaaaata aatcactttt tatactaata tgacacgatt accaatactt ttgttactaa     5100
tatcattagt atacgctaca ccttttcctc agacatctaa aaaaataggt gatgatgcaa     5160
ctttatcatg taatcgaaat aatacaaatg actacgttgt tatgagtgct tggtataagg     5220
agcccaattc cattattctt ttagctgcta aaagcgacgt cttgtatttt gataattata     5280
ccaaggataa aatatcttac gactctccat acgatgatct agttacaact atcacaatta     5340
aatcattgac tgctagagat gccggtactt atgtatgtgc attctttatg acatcgccta     5400
caaatgacac tgataaagta gattatgaag aatactccac agagttgatt gtaaatacag     5460
atagtgaatc gactatagac ataatactat ctggatctac acattcaccg gaaactagtt     5520
ctgagaaacc tgattatata gataattcta attgctcgtc ggtattcgaa atcgcgactc     5580
cggaaccaat tactgataat gtagaagatc atacagacac cgtcacatac actagtgata     5640
gcattaatac agtaagtgca tcatctggag aatccacaac agacgagact ccggaaccaa     5700
ttactgataa agaagaagat catacagtta cagacactgt ctcatacact acagtaagta     5760
catcatctgg aattgtcact actaaatcaa ccaccgatga tgcggatctt tatgatacgt     5820
acaatgataa tgatacagta ccatcaacta ctgtaggcgg tagtacaacc tctattagca     5880
attataaaac caaggacttt gtagaaatat ttggtattac cgcattaatt atattgtcgg     5940
ccgtggcaat attctgtatt acatattata tatataataa acgttcacgt aaatacaaag     6000

<210>  5
<211>  20
<212>  DNA
<213>  Artificial 

<220>
<223>  primer

<400>  5
catgacgagc ttccgagttc                                                   20


<210>  6
<211>  27
<212>  DNA
<213>  Artificial

<220>
<223>  primer

<400>  6
gttgaagcac ttcaccatct tcctctg                                           27


<210>  7
<211>  22
<212>  DNA
<213>  Artificial

<220>
<223>  primer

<400>  7
cctgaacaag atcgtgagga tg                                                22


<210>  8
<211>  20
<212>  DNA
<213>  Artifical

<400>  8
gctccttata ccaagcactc                                                   20


