                         SEQUENCE LISTING

<110>  Trustees of the University of Pennsylvania
 
<120>  AAV-REP-1 FOR GENE THERAPY FOR CHOROIDEREMIA AND ACHROMATOPSIA

<130>  UPN-16-7660

<150>  US 62/266,789
<151>  2015-12-14

<160>  29    

<170>  PatentIn version 3.5

<210>  1
<211>  1962
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  codon optimized sequence


<220>
<221>  CDS
<222>  (1)..(1962)

<400>  1
atg gct gat acc ctg ccc tct gaa ttc gac gtg att gtg att gga acc         48
Met Ala Asp Thr Leu Pro Ser Glu Phe Asp Val Ile Val Ile Gly Thr           
1               5                   10                  15                

gga ctc cct gaa tcg atc atc gcc gcg gcc tgt tcc cgg tcc ggt cgg         96
Gly Leu Pro Glu Ser Ile Ile Ala Ala Ala Cys Ser Arg Ser Gly Arg           
            20                  25                  30                    

cgc gtg ctg cac gtc gat tcg aga agc tac tac gga ggg aat tgg gcc        144
Arg Val Leu His Val Asp Ser Arg Ser Tyr Tyr Gly Gly Asn Trp Ala           
        35                  40                  45                        

tca ttc tcc ttc tcc gga ctg ctc tcc tgg ctg aag gag tat cag gag        192
Ser Phe Ser Phe Ser Gly Leu Leu Ser Trp Leu Lys Glu Tyr Gln Glu           
    50                  55                  60                            

aac tcc gac att gtc tcc gac tca cct gtg tgg cag gac cag atc ctg        240
Asn Ser Asp Ile Val Ser Asp Ser Pro Val Trp Gln Asp Gln Ile Leu           
65                  70                  75                  80            

gaa aac gag gaa gca ata gcc ctg agc cgg aag gac aag acc atc cag        288
Glu Asn Glu Glu Ala Ile Ala Leu Ser Arg Lys Asp Lys Thr Ile Gln           
                85                  90                  95                

cac gtg gag gtg ttc tgt tat gcc tcc caa gac ctc cat gag gac gtg        336
His Val Glu Val Phe Cys Tyr Ala Ser Gln Asp Leu His Glu Asp Val           
            100                 105                 110                   

gaa gag gct gga gcg ttg cag aag aat cat gcc ctc gtg acc tcc gct        384
Glu Glu Ala Gly Ala Leu Gln Lys Asn His Ala Leu Val Thr Ser Ala           
        115                 120                 125                       

aac tcc acc gag gca gcc gac agc gcc ttc ctg ccg acc gag gat gaa        432
Asn Ser Thr Glu Ala Ala Asp Ser Ala Phe Leu Pro Thr Glu Asp Glu           
    130                 135                 140                           

tcc ctg tca act atg tcg tgc gaa atg ctg acc gaa cag act ccg agc        480
Ser Leu Ser Thr Met Ser Cys Glu Met Leu Thr Glu Gln Thr Pro Ser           
145                 150                 155                 160           

tcc gac ccc gaa aac gcc ctg gaa gtg aac gga gcg gaa gtg acc ggc        528
Ser Asp Pro Glu Asn Ala Leu Glu Val Asn Gly Ala Glu Val Thr Gly           
                165                 170                 175               

gaa aag gag aac cat tgc gac gac aag act tgt gtc cca tcc act tcc        576
Glu Lys Glu Asn His Cys Asp Asp Lys Thr Cys Val Pro Ser Thr Ser           
            180                 185                 190                   

gcg gag gac atg tcc gag aat gtg cct atc gcc gag gac acc acc gaa        624
Ala Glu Asp Met Ser Glu Asn Val Pro Ile Ala Glu Asp Thr Thr Glu           
        195                 200                 205                       

cag ccc aag aag aac aga atc acg tac agc cag atc atc aag gag ggg        672
Gln Pro Lys Lys Asn Arg Ile Thr Tyr Ser Gln Ile Ile Lys Glu Gly           
    210                 215                 220                           

cgg agg ttt aac atc gat ctg gtg tcg aag ctg ctg tac agc cgc ggt        720
Arg Arg Phe Asn Ile Asp Leu Val Ser Lys Leu Leu Tyr Ser Arg Gly           
225                 230                 235                 240           

ctg ctg atc gat ctg ctc att aag tcg aac gtg tcg aga tac gcc gag        768
Leu Leu Ile Asp Leu Leu Ile Lys Ser Asn Val Ser Arg Tyr Ala Glu           
                245                 250                 255               

ttc aag aac atc aca agg att ctc gcc ttc cgg gaa gga aga gtg gaa        816
Phe Lys Asn Ile Thr Arg Ile Leu Ala Phe Arg Glu Gly Arg Val Glu           
            260                 265                 270                   

caa gtg ccg tgc tcc cgg gcc gac gtg ttc aac tca aag caa ctt acc        864
Gln Val Pro Cys Ser Arg Ala Asp Val Phe Asn Ser Lys Gln Leu Thr           
        275                 280                 285                       

atg gtg gaa aag cgc atg ctg atg aaa ttc ctg acc ttc tgc atg gag        912
Met Val Glu Lys Arg Met Leu Met Lys Phe Leu Thr Phe Cys Met Glu           
    290                 295                 300                           

tac gaa aag tac cct gat gag tac aag ggt tac gaa gaa att act ttc        960
Tyr Glu Lys Tyr Pro Asp Glu Tyr Lys Gly Tyr Glu Glu Ile Thr Phe           
305                 310                 315                 320           

tac gag tac ctc aag acc cag aag ctg acc ccg aat ctg cag tac att       1008
Tyr Glu Tyr Leu Lys Thr Gln Lys Leu Thr Pro Asn Leu Gln Tyr Ile           
                325                 330                 335               

gtg atg cac tca atc gca atg acc tcc gaa acc gcc tcc tcg acc atc       1056
Val Met His Ser Ile Ala Met Thr Ser Glu Thr Ala Ser Ser Thr Ile           
            340                 345                 350                   

gac ggg ctc aag gcc acc aag aac ttc ctg cac tgt ttg ggg cgc tac       1104
Asp Gly Leu Lys Ala Thr Lys Asn Phe Leu His Cys Leu Gly Arg Tyr           
        355                 360                 365                       

ggc aac act ccg ttc ctc ttc ccg ctg tac ggc cag gga gag ctg cct       1152
Gly Asn Thr Pro Phe Leu Phe Pro Leu Tyr Gly Gln Gly Glu Leu Pro           
    370                 375                 380                           

cag tgt ttc tgc cgg atg tgc gcc gtg ttc ggc gga atc tac tgt ctc       1200
Gln Cys Phe Cys Arg Met Cys Ala Val Phe Gly Gly Ile Tyr Cys Leu           
385                 390                 395                 400           

cgc cac tcg gtc cag tgc ctg gtg gtg gac aag gaa tcc agg aag tgc       1248
Arg His Ser Val Gln Cys Leu Val Val Asp Lys Glu Ser Arg Lys Cys           
                405                 410                 415               

aaa gcc att att gac cag ttc gga caa cgg atc att tcc gag cac ttt       1296
Lys Ala Ile Ile Asp Gln Phe Gly Gln Arg Ile Ile Ser Glu His Phe           
            420                 425                 430                   

ctt gtg gag gac tca tac ttc ccg gag aac atg tgc tct cgg gtc cag       1344
Leu Val Glu Asp Ser Tyr Phe Pro Glu Asn Met Cys Ser Arg Val Gln           
        435                 440                 445                       

tat cga cag att tcc agg gcg gtg ctc att act gac cgg agc gtc ctc       1392
Tyr Arg Gln Ile Ser Arg Ala Val Leu Ile Thr Asp Arg Ser Val Leu           
    450                 455                 460                           

aag acc gat agc gac cag cag atc tcc atc ctg acc gtg ccg gcg gaa       1440
Lys Thr Asp Ser Asp Gln Gln Ile Ser Ile Leu Thr Val Pro Ala Glu           
465                 470                 475                 480           

gaa ccc ggc act ttt gcc gtg cgc gtg atc gag ctt tgc tca tcc acc       1488
Glu Pro Gly Thr Phe Ala Val Arg Val Ile Glu Leu Cys Ser Ser Thr           
                485                 490                 495               

atg act tgc atg aaa ggc act tac ctg gtg cac ctg acg tgc acc tca       1536
Met Thr Cys Met Lys Gly Thr Tyr Leu Val His Leu Thr Cys Thr Ser           
            500                 505                 510                   

tcg aaa acc gct aga gag gac ctg gaa tcc gtc gtc caa aag ctg ttc       1584
Ser Lys Thr Ala Arg Glu Asp Leu Glu Ser Val Val Gln Lys Leu Phe           
        515                 520                 525                       

gtg cct tac acc gag atg gaa att gaa aac gaa caa gtg gag aag ccc       1632
Val Pro Tyr Thr Glu Met Glu Ile Glu Asn Glu Gln Val Glu Lys Pro           
    530                 535                 540                           

cgc atc ctt tgg gcc ctg tac ttt aac atg cgc gat tcc tcc gat atc       1680
Arg Ile Leu Trp Ala Leu Tyr Phe Asn Met Arg Asp Ser Ser Asp Ile           
545                 550                 555                 560           

tcg cgg tcc tgc tat aac gac ttg cct tcg aac gtc tac gtc tgc tcc       1728
Ser Arg Ser Cys Tyr Asn Asp Leu Pro Ser Asn Val Tyr Val Cys Ser           
                565                 570                 575               

ggg cca gac tgc ggt ctt ggc aac gac aat gcc gtg aag cag gcg gaa       1776
Gly Pro Asp Cys Gly Leu Gly Asn Asp Asn Ala Val Lys Gln Ala Glu           
            580                 585                 590                   

aca ctg ttc caa gag atc tgc cct aac gag gat ttt tgc ccg ccc ccc       1824
Thr Leu Phe Gln Glu Ile Cys Pro Asn Glu Asp Phe Cys Pro Pro Pro           
        595                 600                 605                       

cca aac ccc gag gat atc atc ttg gac gga gac agc ctg cag cca gaa       1872
Pro Asn Pro Glu Asp Ile Ile Leu Asp Gly Asp Ser Leu Gln Pro Glu           
    610                 615                 620                           

gca tcc gag tcc agc gcc atc ccg gag gcc aac agc gaa acc ttc aag       1920
Ala Ser Glu Ser Ser Ala Ile Pro Glu Ala Asn Ser Glu Thr Phe Lys           
625                 630                 635                 640           

gag agc act aac ctg ggc aac ctg gaa gag tcc agc gaa tga               1962
Glu Ser Thr Asn Leu Gly Asn Leu Glu Glu Ser Ser Glu                       
                645                 650                                   


<210>  2
<211>  653
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  2

Met Ala Asp Thr Leu Pro Ser Glu Phe Asp Val Ile Val Ile Gly Thr 
1               5                   10                  15      


Gly Leu Pro Glu Ser Ile Ile Ala Ala Ala Cys Ser Arg Ser Gly Arg 
            20                  25                  30          


Arg Val Leu His Val Asp Ser Arg Ser Tyr Tyr Gly Gly Asn Trp Ala 
        35                  40                  45              


Ser Phe Ser Phe Ser Gly Leu Leu Ser Trp Leu Lys Glu Tyr Gln Glu 
    50                  55                  60                  


Asn Ser Asp Ile Val Ser Asp Ser Pro Val Trp Gln Asp Gln Ile Leu 
65                  70                  75                  80  


Glu Asn Glu Glu Ala Ile Ala Leu Ser Arg Lys Asp Lys Thr Ile Gln 
                85                  90                  95      


His Val Glu Val Phe Cys Tyr Ala Ser Gln Asp Leu His Glu Asp Val 
            100                 105                 110         


Glu Glu Ala Gly Ala Leu Gln Lys Asn His Ala Leu Val Thr Ser Ala 
        115                 120                 125             


Asn Ser Thr Glu Ala Ala Asp Ser Ala Phe Leu Pro Thr Glu Asp Glu 
    130                 135                 140                 


Ser Leu Ser Thr Met Ser Cys Glu Met Leu Thr Glu Gln Thr Pro Ser 
145                 150                 155                 160 


Ser Asp Pro Glu Asn Ala Leu Glu Val Asn Gly Ala Glu Val Thr Gly 
                165                 170                 175     


Glu Lys Glu Asn His Cys Asp Asp Lys Thr Cys Val Pro Ser Thr Ser 
            180                 185                 190         


Ala Glu Asp Met Ser Glu Asn Val Pro Ile Ala Glu Asp Thr Thr Glu 
        195                 200                 205             


Gln Pro Lys Lys Asn Arg Ile Thr Tyr Ser Gln Ile Ile Lys Glu Gly 
    210                 215                 220                 


Arg Arg Phe Asn Ile Asp Leu Val Ser Lys Leu Leu Tyr Ser Arg Gly 
225                 230                 235                 240 


Leu Leu Ile Asp Leu Leu Ile Lys Ser Asn Val Ser Arg Tyr Ala Glu 
                245                 250                 255     


Phe Lys Asn Ile Thr Arg Ile Leu Ala Phe Arg Glu Gly Arg Val Glu 
            260                 265                 270         


Gln Val Pro Cys Ser Arg Ala Asp Val Phe Asn Ser Lys Gln Leu Thr 
        275                 280                 285             


Met Val Glu Lys Arg Met Leu Met Lys Phe Leu Thr Phe Cys Met Glu 
    290                 295                 300                 


Tyr Glu Lys Tyr Pro Asp Glu Tyr Lys Gly Tyr Glu Glu Ile Thr Phe 
305                 310                 315                 320 


Tyr Glu Tyr Leu Lys Thr Gln Lys Leu Thr Pro Asn Leu Gln Tyr Ile 
                325                 330                 335     


Val Met His Ser Ile Ala Met Thr Ser Glu Thr Ala Ser Ser Thr Ile 
            340                 345                 350         


Asp Gly Leu Lys Ala Thr Lys Asn Phe Leu His Cys Leu Gly Arg Tyr 
        355                 360                 365             


Gly Asn Thr Pro Phe Leu Phe Pro Leu Tyr Gly Gln Gly Glu Leu Pro 
    370                 375                 380                 


Gln Cys Phe Cys Arg Met Cys Ala Val Phe Gly Gly Ile Tyr Cys Leu 
385                 390                 395                 400 


Arg His Ser Val Gln Cys Leu Val Val Asp Lys Glu Ser Arg Lys Cys 
                405                 410                 415     


Lys Ala Ile Ile Asp Gln Phe Gly Gln Arg Ile Ile Ser Glu His Phe 
            420                 425                 430         


Leu Val Glu Asp Ser Tyr Phe Pro Glu Asn Met Cys Ser Arg Val Gln 
        435                 440                 445             


Tyr Arg Gln Ile Ser Arg Ala Val Leu Ile Thr Asp Arg Ser Val Leu 
    450                 455                 460                 


Lys Thr Asp Ser Asp Gln Gln Ile Ser Ile Leu Thr Val Pro Ala Glu 
465                 470                 475                 480 


Glu Pro Gly Thr Phe Ala Val Arg Val Ile Glu Leu Cys Ser Ser Thr 
                485                 490                 495     


Met Thr Cys Met Lys Gly Thr Tyr Leu Val His Leu Thr Cys Thr Ser 
            500                 505                 510         


Ser Lys Thr Ala Arg Glu Asp Leu Glu Ser Val Val Gln Lys Leu Phe 
        515                 520                 525             


Val Pro Tyr Thr Glu Met Glu Ile Glu Asn Glu Gln Val Glu Lys Pro 
    530                 535                 540                 


Arg Ile Leu Trp Ala Leu Tyr Phe Asn Met Arg Asp Ser Ser Asp Ile 
545                 550                 555                 560 


Ser Arg Ser Cys Tyr Asn Asp Leu Pro Ser Asn Val Tyr Val Cys Ser 
                565                 570                 575     


Gly Pro Asp Cys Gly Leu Gly Asn Asp Asn Ala Val Lys Gln Ala Glu 
            580                 585                 590         


Thr Leu Phe Gln Glu Ile Cys Pro Asn Glu Asp Phe Cys Pro Pro Pro 
        595                 600                 605             


Pro Asn Pro Glu Asp Ile Ile Leu Asp Gly Asp Ser Leu Gln Pro Glu 
    610                 615                 620                 


Ala Ser Glu Ser Ser Ala Ile Pro Glu Ala Asn Ser Glu Thr Phe Lys 
625                 630                 635                 640 


Glu Ser Thr Asn Leu Gly Asn Leu Glu Glu Ser Ser Glu 
                645                 650             


<210>  3
<211>  1962
<212>  DNA
<213>  Homo sapiens


<220>
<221>  CDS
<222>  (1)..(1962)

<400>  3
atg gcg gat act ctc cct tcg gag ttt gat gtg atc gta ata ggg acg         48
Met Ala Asp Thr Leu Pro Ser Glu Phe Asp Val Ile Val Ile Gly Thr           
1               5                   10                  15                

ggt ttg cct gaa tcc atc att gca gct gca tgt tca aga agt ggc cgg         96
Gly Leu Pro Glu Ser Ile Ile Ala Ala Ala Cys Ser Arg Ser Gly Arg           
            20                  25                  30                    

aga gtt ctg cat gtt gat tca aga agc tac tat gga gga aac tgg gcc        144
Arg Val Leu His Val Asp Ser Arg Ser Tyr Tyr Gly Gly Asn Trp Ala           
        35                  40                  45                        

agt ttt agc ttt tca gga cta ttg tcc tgg cta aag gaa tac cag gaa        192
Ser Phe Ser Phe Ser Gly Leu Leu Ser Trp Leu Lys Glu Tyr Gln Glu           
    50                  55                  60                            

aac agt gac att gta agt gac agt cca gtg tgg caa gac cag atc ctt        240
Asn Ser Asp Ile Val Ser Asp Ser Pro Val Trp Gln Asp Gln Ile Leu           
65                  70                  75                  80            

gaa aat gaa gaa gcc att gct ctt agc agg aag gac aaa act att caa        288
Glu Asn Glu Glu Ala Ile Ala Leu Ser Arg Lys Asp Lys Thr Ile Gln           
                85                  90                  95                

cat gtg gaa gta ttt tgt tat gcc agt cag gat ttg cat gaa gat gtc        336
His Val Glu Val Phe Cys Tyr Ala Ser Gln Asp Leu His Glu Asp Val           
            100                 105                 110                   

gaa gaa gct ggt gca ctg cag aaa aat cat gct ctt gtg aca tct gca        384
Glu Glu Ala Gly Ala Leu Gln Lys Asn His Ala Leu Val Thr Ser Ala           
        115                 120                 125                       

aac tcc aca gaa gct gca gat tct gcc ttc ctg cct acg gag gat gag        432
Asn Ser Thr Glu Ala Ala Asp Ser Ala Phe Leu Pro Thr Glu Asp Glu           
    130                 135                 140                           

tca tta agc act atg agc tgt gaa atg ctc aca gaa caa act cca agc        480
Ser Leu Ser Thr Met Ser Cys Glu Met Leu Thr Glu Gln Thr Pro Ser           
145                 150                 155                 160           

agc gat cca gag aat gcg cta gaa gta aat ggt gct gaa gtg aca ggg        528
Ser Asp Pro Glu Asn Ala Leu Glu Val Asn Gly Ala Glu Val Thr Gly           
                165                 170                 175               

gaa aaa gaa aac cat tgt gat gat aaa act tgt gtg cca tca act tca        576
Glu Lys Glu Asn His Cys Asp Asp Lys Thr Cys Val Pro Ser Thr Ser           
            180                 185                 190                   

gca gaa gac atg agt gaa aat gtg cct ata gca gaa gat acc aca gag        624
Ala Glu Asp Met Ser Glu Asn Val Pro Ile Ala Glu Asp Thr Thr Glu           
        195                 200                 205                       

caa cca aag aaa aac aga att act tac tca caa att att aaa gaa ggc        672
Gln Pro Lys Lys Asn Arg Ile Thr Tyr Ser Gln Ile Ile Lys Glu Gly           
    210                 215                 220                           

agg aga ttt aat att gat tta gta tca aag ctg ctg tat tct cga gga        720
Arg Arg Phe Asn Ile Asp Leu Val Ser Lys Leu Leu Tyr Ser Arg Gly           
225                 230                 235                 240           

tta cta att gat ctt cta atc aaa tct aat gtt agt cga tat gca gag        768
Leu Leu Ile Asp Leu Leu Ile Lys Ser Asn Val Ser Arg Tyr Ala Glu           
                245                 250                 255               

ttt aaa aat att acc agg att ctt gca ttt cga gaa gga cga gtg gaa        816
Phe Lys Asn Ile Thr Arg Ile Leu Ala Phe Arg Glu Gly Arg Val Glu           
            260                 265                 270                   

cag gtt ccg tgt tcc aga gca gat gtc ttt aat agc aaa caa ctt act        864
Gln Val Pro Cys Ser Arg Ala Asp Val Phe Asn Ser Lys Gln Leu Thr           
        275                 280                 285                       

atg gta gaa aag cga atg cta atg aaa ttt ctt aca ttt tgt atg gaa        912
Met Val Glu Lys Arg Met Leu Met Lys Phe Leu Thr Phe Cys Met Glu           
    290                 295                 300                           

tat gag aaa tat cct gat gaa tat aaa gga tat gaa gag atc aca ttt        960
Tyr Glu Lys Tyr Pro Asp Glu Tyr Lys Gly Tyr Glu Glu Ile Thr Phe           
305                 310                 315                 320           

tat gaa tat tta aag act caa aaa tta acc ccc aac ctc caa tat att       1008
Tyr Glu Tyr Leu Lys Thr Gln Lys Leu Thr Pro Asn Leu Gln Tyr Ile           
                325                 330                 335               

gtc atg cat tca att gca atg aca tca gag aca gcc agc agc acc ata       1056
Val Met His Ser Ile Ala Met Thr Ser Glu Thr Ala Ser Ser Thr Ile           
            340                 345                 350                   

gat ggt ctc aaa gct acc aaa aac ttt ctt cac tgt ctt ggg cgg tat       1104
Asp Gly Leu Lys Ala Thr Lys Asn Phe Leu His Cys Leu Gly Arg Tyr           
        355                 360                 365                       

ggc aac act cca ttt ttg ttt cct tta tat ggc caa gga gaa ctc ccc       1152
Gly Asn Thr Pro Phe Leu Phe Pro Leu Tyr Gly Gln Gly Glu Leu Pro           
    370                 375                 380                           

cag tgt ttc tgc agg atg tgt gct gtg ttt ggt gga att tat tgt ctt       1200
Gln Cys Phe Cys Arg Met Cys Ala Val Phe Gly Gly Ile Tyr Cys Leu           
385                 390                 395                 400           

cgc cat tca gta cag tgc ctt gta gtg gac aaa gaa tcc aga aaa tgt       1248
Arg His Ser Val Gln Cys Leu Val Val Asp Lys Glu Ser Arg Lys Cys           
                405                 410                 415               

aaa gca att ata gat cag ttt ggt cag aga ata atc tct gag cat ttc       1296
Lys Ala Ile Ile Asp Gln Phe Gly Gln Arg Ile Ile Ser Glu His Phe           
            420                 425                 430                   

ctc gtg gag gac agt tac ttt cct gag aac atg tgc tca cgt gtg caa       1344
Leu Val Glu Asp Ser Tyr Phe Pro Glu Asn Met Cys Ser Arg Val Gln           
        435                 440                 445                       

tac agg cag atc tcc agg gca gtg ctg att aca gat aga tct gtc cta       1392
Tyr Arg Gln Ile Ser Arg Ala Val Leu Ile Thr Asp Arg Ser Val Leu           
    450                 455                 460                           

aaa aca gat tca gat caa cag att tcc att ttg aca gtg cca gca gag       1440
Lys Thr Asp Ser Asp Gln Gln Ile Ser Ile Leu Thr Val Pro Ala Glu           
465                 470                 475                 480           

gaa cca gga act ttt gct gtt cgg gtc att gag tta tgt tct tca acg       1488
Glu Pro Gly Thr Phe Ala Val Arg Val Ile Glu Leu Cys Ser Ser Thr           
                485                 490                 495               

atg aca tgc atg aaa ggc acc tat ttg gtt cat ttg act tgc aca tct       1536
Met Thr Cys Met Lys Gly Thr Tyr Leu Val His Leu Thr Cys Thr Ser           
            500                 505                 510                   

tct aaa aca gca aga gaa gat tta gaa tca gtt gtg cag aaa ttg ttt       1584
Ser Lys Thr Ala Arg Glu Asp Leu Glu Ser Val Val Gln Lys Leu Phe           
        515                 520                 525                       

gtt cca tat act gaa atg gag ata gaa aat gaa caa gta gaa aag cca       1632
Val Pro Tyr Thr Glu Met Glu Ile Glu Asn Glu Gln Val Glu Lys Pro           
    530                 535                 540                           

aga att ctg tgg gct ctt tac ttc aat atg aga gat tcg tca gac atc       1680
Arg Ile Leu Trp Ala Leu Tyr Phe Asn Met Arg Asp Ser Ser Asp Ile           
545                 550                 555                 560           

agc agg agc tgt tat aat gat tta cca tcc aac gtt tat gtc tgc tct       1728
Ser Arg Ser Cys Tyr Asn Asp Leu Pro Ser Asn Val Tyr Val Cys Ser           
                565                 570                 575               

ggc cca gat tgt ggt tta gga aat gat aat gca gtc aaa cag gct gaa       1776
Gly Pro Asp Cys Gly Leu Gly Asn Asp Asn Ala Val Lys Gln Ala Glu           
            580                 585                 590                   

aca ctt ttc cag gaa atc tgc ccc aat gaa gat ttc tgt ccc cct cca       1824
Thr Leu Phe Gln Glu Ile Cys Pro Asn Glu Asp Phe Cys Pro Pro Pro           
        595                 600                 605                       

cca aat cct gaa gac att atc ctt gat gga gac agt tta cag cca gag       1872
Pro Asn Pro Glu Asp Ile Ile Leu Asp Gly Asp Ser Leu Gln Pro Glu           
    610                 615                 620                           

gct tca gaa tcc agt gcc ata cca gag gct aac tcg gag act ttc aag       1920
Ala Ser Glu Ser Ser Ala Ile Pro Glu Ala Asn Ser Glu Thr Phe Lys           
625                 630                 635                 640           

gaa agc aca aac ctt gga aac cta gag gag tcc tct gaa taa               1962
Glu Ser Thr Asn Leu Gly Asn Leu Glu Glu Ser Ser Glu                       
                645                 650                                   


<210>  4
<211>  653
<212>  PRT
<213>  Homo sapiens

<400>  4

Met Ala Asp Thr Leu Pro Ser Glu Phe Asp Val Ile Val Ile Gly Thr 
1               5                   10                  15      


Gly Leu Pro Glu Ser Ile Ile Ala Ala Ala Cys Ser Arg Ser Gly Arg 
            20                  25                  30          


Arg Val Leu His Val Asp Ser Arg Ser Tyr Tyr Gly Gly Asn Trp Ala 
        35                  40                  45              


Ser Phe Ser Phe Ser Gly Leu Leu Ser Trp Leu Lys Glu Tyr Gln Glu 
    50                  55                  60                  


Asn Ser Asp Ile Val Ser Asp Ser Pro Val Trp Gln Asp Gln Ile Leu 
65                  70                  75                  80  


Glu Asn Glu Glu Ala Ile Ala Leu Ser Arg Lys Asp Lys Thr Ile Gln 
                85                  90                  95      


His Val Glu Val Phe Cys Tyr Ala Ser Gln Asp Leu His Glu Asp Val 
            100                 105                 110         


Glu Glu Ala Gly Ala Leu Gln Lys Asn His Ala Leu Val Thr Ser Ala 
        115                 120                 125             


Asn Ser Thr Glu Ala Ala Asp Ser Ala Phe Leu Pro Thr Glu Asp Glu 
    130                 135                 140                 


Ser Leu Ser Thr Met Ser Cys Glu Met Leu Thr Glu Gln Thr Pro Ser 
145                 150                 155                 160 


Ser Asp Pro Glu Asn Ala Leu Glu Val Asn Gly Ala Glu Val Thr Gly 
                165                 170                 175     


Glu Lys Glu Asn His Cys Asp Asp Lys Thr Cys Val Pro Ser Thr Ser 
            180                 185                 190         


Ala Glu Asp Met Ser Glu Asn Val Pro Ile Ala Glu Asp Thr Thr Glu 
        195                 200                 205             


Gln Pro Lys Lys Asn Arg Ile Thr Tyr Ser Gln Ile Ile Lys Glu Gly 
    210                 215                 220                 


Arg Arg Phe Asn Ile Asp Leu Val Ser Lys Leu Leu Tyr Ser Arg Gly 
225                 230                 235                 240 


Leu Leu Ile Asp Leu Leu Ile Lys Ser Asn Val Ser Arg Tyr Ala Glu 
                245                 250                 255     


Phe Lys Asn Ile Thr Arg Ile Leu Ala Phe Arg Glu Gly Arg Val Glu 
            260                 265                 270         


Gln Val Pro Cys Ser Arg Ala Asp Val Phe Asn Ser Lys Gln Leu Thr 
        275                 280                 285             


Met Val Glu Lys Arg Met Leu Met Lys Phe Leu Thr Phe Cys Met Glu 
    290                 295                 300                 


Tyr Glu Lys Tyr Pro Asp Glu Tyr Lys Gly Tyr Glu Glu Ile Thr Phe 
305                 310                 315                 320 


Tyr Glu Tyr Leu Lys Thr Gln Lys Leu Thr Pro Asn Leu Gln Tyr Ile 
                325                 330                 335     


Val Met His Ser Ile Ala Met Thr Ser Glu Thr Ala Ser Ser Thr Ile 
            340                 345                 350         


Asp Gly Leu Lys Ala Thr Lys Asn Phe Leu His Cys Leu Gly Arg Tyr 
        355                 360                 365             


Gly Asn Thr Pro Phe Leu Phe Pro Leu Tyr Gly Gln Gly Glu Leu Pro 
    370                 375                 380                 


Gln Cys Phe Cys Arg Met Cys Ala Val Phe Gly Gly Ile Tyr Cys Leu 
385                 390                 395                 400 


Arg His Ser Val Gln Cys Leu Val Val Asp Lys Glu Ser Arg Lys Cys 
                405                 410                 415     


Lys Ala Ile Ile Asp Gln Phe Gly Gln Arg Ile Ile Ser Glu His Phe 
            420                 425                 430         


Leu Val Glu Asp Ser Tyr Phe Pro Glu Asn Met Cys Ser Arg Val Gln 
        435                 440                 445             


Tyr Arg Gln Ile Ser Arg Ala Val Leu Ile Thr Asp Arg Ser Val Leu 
    450                 455                 460                 


Lys Thr Asp Ser Asp Gln Gln Ile Ser Ile Leu Thr Val Pro Ala Glu 
465                 470                 475                 480 


Glu Pro Gly Thr Phe Ala Val Arg Val Ile Glu Leu Cys Ser Ser Thr 
                485                 490                 495     


Met Thr Cys Met Lys Gly Thr Tyr Leu Val His Leu Thr Cys Thr Ser 
            500                 505                 510         


Ser Lys Thr Ala Arg Glu Asp Leu Glu Ser Val Val Gln Lys Leu Phe 
        515                 520                 525             


Val Pro Tyr Thr Glu Met Glu Ile Glu Asn Glu Gln Val Glu Lys Pro 
    530                 535                 540                 


Arg Ile Leu Trp Ala Leu Tyr Phe Asn Met Arg Asp Ser Ser Asp Ile 
545                 550                 555                 560 


Ser Arg Ser Cys Tyr Asn Asp Leu Pro Ser Asn Val Tyr Val Cys Ser 
                565                 570                 575     


Gly Pro Asp Cys Gly Leu Gly Asn Asp Asn Ala Val Lys Gln Ala Glu 
            580                 585                 590         


Thr Leu Phe Gln Glu Ile Cys Pro Asn Glu Asp Phe Cys Pro Pro Pro 
        595                 600                 605             


Pro Asn Pro Glu Asp Ile Ile Leu Asp Gly Asp Ser Leu Gln Pro Glu 
    610                 615                 620                 


Ala Ser Glu Ser Ser Ala Ile Pro Glu Ala Asn Ser Glu Thr Phe Lys 
625                 630                 635                 640 


Glu Ser Thr Asn Leu Gly Asn Leu Glu Glu Ser Ser Glu 
                645                 650             


<210>  5
<211>  1985
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed plasmid


<220>
<221>  misc_feature
<222>  (1)..(8)
<223>  NotI restriction site for subcloning into proviral plasmid

<220>
<221>  misc_feature
<222>  (4)..(16)
<223>  Kozak consensus sequence

<220>
<221>  CDS
<222>  (13)..(1971)
<223>  codon-optimized open reading frame (ORF)

<220>
<221>  misc_feature
<222>  (1972)..(1977)
<223>  BclI restriction site with embedded stop codon/ site to add 
       optional epitope tag

<220>
<221>  misc_feature
<222>  (1980)..(1985)
<223>  BamHI restriction site for subcloning into proviral plasmid

<400>  5
gcggccgcca cc atg gct gat acc ctg ccc tct gaa ttc gac gtg att gtg       51
              Met Ala Asp Thr Leu Pro Ser Glu Phe Asp Val Ile Val         
              1               5                   10                      

att gga acc gga ctc cct gaa tcg atc atc gcc gcg gcc tgt tcc cgg         99
Ile Gly Thr Gly Leu Pro Glu Ser Ile Ile Ala Ala Ala Cys Ser Arg           
    15                  20                  25                            

tcc ggt cgg cgc gtg ctg cac gtc gat tcg aga agc tac tac gga ggg        147
Ser Gly Arg Arg Val Leu His Val Asp Ser Arg Ser Tyr Tyr Gly Gly           
30                  35                  40                  45            

aat tgg gcc tca ttc tcc ttc tcc gga ctg ctc tcc tgg ctg aag gag        195
Asn Trp Ala Ser Phe Ser Phe Ser Gly Leu Leu Ser Trp Leu Lys Glu           
                50                  55                  60                

tat cag gag aac tcc gac att gtc tcc gac tca cct gtg tgg cag gac        243
Tyr Gln Glu Asn Ser Asp Ile Val Ser Asp Ser Pro Val Trp Gln Asp           
            65                  70                  75                    

cag atc ctg gaa aac gag gaa gca ata gcc ctg agc cgg aag gac aag        291
Gln Ile Leu Glu Asn Glu Glu Ala Ile Ala Leu Ser Arg Lys Asp Lys           
        80                  85                  90                        

acc atc cag cac gtg gag gtg ttc tgt tat gcc tcc caa gac ctc cat        339
Thr Ile Gln His Val Glu Val Phe Cys Tyr Ala Ser Gln Asp Leu His           
    95                  100                 105                           

gag gac gtg gaa gag gct gga gcg ttg cag aag aat cat gcc ctc gtg        387
Glu Asp Val Glu Glu Ala Gly Ala Leu Gln Lys Asn His Ala Leu Val           
110                 115                 120                 125           

acc tcc gct aac tcc acc gag gca gcc gac agc gcc ttc ctg ccg acc        435
Thr Ser Ala Asn Ser Thr Glu Ala Ala Asp Ser Ala Phe Leu Pro Thr           
                130                 135                 140               

gag gat gaa tcc ctg tca act atg tcg tgc gaa atg ctg acc gaa cag        483
Glu Asp Glu Ser Leu Ser Thr Met Ser Cys Glu Met Leu Thr Glu Gln           
            145                 150                 155                   

act ccg agc tcc gac ccc gaa aac gcc ctg gaa gtg aac gga gcg gaa        531
Thr Pro Ser Ser Asp Pro Glu Asn Ala Leu Glu Val Asn Gly Ala Glu           
        160                 165                 170                       

gtg acc ggc gaa aag gag aac cat tgc gac gac aag act tgt gtc cca        579
Val Thr Gly Glu Lys Glu Asn His Cys Asp Asp Lys Thr Cys Val Pro           
    175                 180                 185                           

tcc act tcc gcg gag gac atg tcc gag aat gtg cct atc gcc gag gac        627
Ser Thr Ser Ala Glu Asp Met Ser Glu Asn Val Pro Ile Ala Glu Asp           
190                 195                 200                 205           

acc acc gaa cag ccc aag aag aac aga atc acg tac agc cag atc atc        675
Thr Thr Glu Gln Pro Lys Lys Asn Arg Ile Thr Tyr Ser Gln Ile Ile           
                210                 215                 220               

aag gag ggg cgg agg ttt aac atc gat ctg gtg tcg aag ctg ctg tac        723
Lys Glu Gly Arg Arg Phe Asn Ile Asp Leu Val Ser Lys Leu Leu Tyr           
            225                 230                 235                   

agc cgc ggt ctg ctg atc gat ctg ctc att aag tcg aac gtg tcg aga        771
Ser Arg Gly Leu Leu Ile Asp Leu Leu Ile Lys Ser Asn Val Ser Arg           
        240                 245                 250                       

tac gcc gag ttc aag aac atc aca agg att ctc gcc ttc cgg gaa gga        819
Tyr Ala Glu Phe Lys Asn Ile Thr Arg Ile Leu Ala Phe Arg Glu Gly           
    255                 260                 265                           

aga gtg gaa caa gtg ccg tgc tcc cgg gcc gac gtg ttc aac tca aag        867
Arg Val Glu Gln Val Pro Cys Ser Arg Ala Asp Val Phe Asn Ser Lys           
270                 275                 280                 285           

caa ctt acc atg gtg gaa aag cgc atg ctg atg aaa ttc ctg acc ttc        915
Gln Leu Thr Met Val Glu Lys Arg Met Leu Met Lys Phe Leu Thr Phe           
                290                 295                 300               

tgc atg gag tac gaa aag tac cct gat gag tac aag ggt tac gaa gaa        963
Cys Met Glu Tyr Glu Lys Tyr Pro Asp Glu Tyr Lys Gly Tyr Glu Glu           
            305                 310                 315                   

att act ttc tac gag tac ctc aag acc cag aag ctg acc ccg aat ctg       1011
Ile Thr Phe Tyr Glu Tyr Leu Lys Thr Gln Lys Leu Thr Pro Asn Leu           
        320                 325                 330                       

cag tac att gtg atg cac tca atc gca atg acc tcc gaa acc gcc tcc       1059
Gln Tyr Ile Val Met His Ser Ile Ala Met Thr Ser Glu Thr Ala Ser           
    335                 340                 345                           

tcg acc atc gac ggg ctc aag gcc acc aag aac ttc ctg cac tgt ttg       1107
Ser Thr Ile Asp Gly Leu Lys Ala Thr Lys Asn Phe Leu His Cys Leu           
350                 355                 360                 365           

ggg cgc tac ggc aac act ccg ttc ctc ttc ccg ctg tac ggc cag gga       1155
Gly Arg Tyr Gly Asn Thr Pro Phe Leu Phe Pro Leu Tyr Gly Gln Gly           
                370                 375                 380               

gag ctg cct cag tgt ttc tgc cgg atg tgc gcc gtg ttc ggc gga atc       1203
Glu Leu Pro Gln Cys Phe Cys Arg Met Cys Ala Val Phe Gly Gly Ile           
            385                 390                 395                   

tac tgt ctc cgc cac tcg gtc cag tgc ctg gtg gtg gac aag gaa tcc       1251
Tyr Cys Leu Arg His Ser Val Gln Cys Leu Val Val Asp Lys Glu Ser           
        400                 405                 410                       

agg aag tgc aaa gcc att att gac cag ttc gga caa cgg atc att tcc       1299
Arg Lys Cys Lys Ala Ile Ile Asp Gln Phe Gly Gln Arg Ile Ile Ser           
    415                 420                 425                           

gag cac ttt ctt gtg gag gac tca tac ttc ccg gag aac atg tgc tct       1347
Glu His Phe Leu Val Glu Asp Ser Tyr Phe Pro Glu Asn Met Cys Ser           
430                 435                 440                 445           

cgg gtc cag tat cga cag att tcc agg gcg gtg ctc att act gac cgg       1395
Arg Val Gln Tyr Arg Gln Ile Ser Arg Ala Val Leu Ile Thr Asp Arg           
                450                 455                 460               

agc gtc ctc aag acc gat agc gac cag cag atc tcc atc ctg acc gtg       1443
Ser Val Leu Lys Thr Asp Ser Asp Gln Gln Ile Ser Ile Leu Thr Val           
            465                 470                 475                   

ccg gcg gaa gaa ccc ggc act ttt gcc gtg cgc gtg atc gag ctt tgc       1491
Pro Ala Glu Glu Pro Gly Thr Phe Ala Val Arg Val Ile Glu Leu Cys           
        480                 485                 490                       

tca tcc acc atg act tgc atg aaa ggc act tac ctg gtg cac ctg acg       1539
Ser Ser Thr Met Thr Cys Met Lys Gly Thr Tyr Leu Val His Leu Thr           
    495                 500                 505                           

tgc acc tca tcg aaa acc gct aga gag gac ctg gaa tcc gtc gtc caa       1587
Cys Thr Ser Ser Lys Thr Ala Arg Glu Asp Leu Glu Ser Val Val Gln           
510                 515                 520                 525           

aag ctg ttc gtg cct tac acc gag atg gaa att gaa aac gaa caa gtg       1635
Lys Leu Phe Val Pro Tyr Thr Glu Met Glu Ile Glu Asn Glu Gln Val           
                530                 535                 540               

gag aag ccc cgc atc ctt tgg gcc ctg tac ttt aac atg cgc gat tcc       1683
Glu Lys Pro Arg Ile Leu Trp Ala Leu Tyr Phe Asn Met Arg Asp Ser           
            545                 550                 555                   

tcc gat atc tcg cgg tcc tgc tat aac gac ttg cct tcg aac gtc tac       1731
Ser Asp Ile Ser Arg Ser Cys Tyr Asn Asp Leu Pro Ser Asn Val Tyr           
        560                 565                 570                       

gtc tgc tcc ggg cca gac tgc ggt ctt ggc aac gac aat gcc gtg aag       1779
Val Cys Ser Gly Pro Asp Cys Gly Leu Gly Asn Asp Asn Ala Val Lys           
    575                 580                 585                           

cag gcg gaa aca ctg ttc caa gag atc tgc cct aac gag gat ttt tgc       1827
Gln Ala Glu Thr Leu Phe Gln Glu Ile Cys Pro Asn Glu Asp Phe Cys           
590                 595                 600                 605           

ccg ccc ccc cca aac ccc gag gat atc atc ttg gac gga gac agc ctg       1875
Pro Pro Pro Pro Asn Pro Glu Asp Ile Ile Leu Asp Gly Asp Ser Leu           
                610                 615                 620               

cag cca gaa gca tcc gag tcc agc gcc atc ccg gag gcc aac agc gaa       1923
Gln Pro Glu Ala Ser Glu Ser Ser Ala Ile Pro Glu Ala Asn Ser Glu           
            625                 630                 635                   

acc ttc aag gag agc act aac ctg ggc aac ctg gaa gag tcc agc gaa       1971
Thr Phe Lys Glu Ser Thr Asn Leu Gly Asn Leu Glu Glu Ser Ser Glu           
        640                 645                 650                       

tgatcatagg atcc                                                       1985


<210>  6
<211>  653
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  6

Met Ala Asp Thr Leu Pro Ser Glu Phe Asp Val Ile Val Ile Gly Thr 
1               5                   10                  15      


Gly Leu Pro Glu Ser Ile Ile Ala Ala Ala Cys Ser Arg Ser Gly Arg 
            20                  25                  30          


Arg Val Leu His Val Asp Ser Arg Ser Tyr Tyr Gly Gly Asn Trp Ala 
        35                  40                  45              


Ser Phe Ser Phe Ser Gly Leu Leu Ser Trp Leu Lys Glu Tyr Gln Glu 
    50                  55                  60                  


Asn Ser Asp Ile Val Ser Asp Ser Pro Val Trp Gln Asp Gln Ile Leu 
65                  70                  75                  80  


Glu Asn Glu Glu Ala Ile Ala Leu Ser Arg Lys Asp Lys Thr Ile Gln 
                85                  90                  95      


His Val Glu Val Phe Cys Tyr Ala Ser Gln Asp Leu His Glu Asp Val 
            100                 105                 110         


Glu Glu Ala Gly Ala Leu Gln Lys Asn His Ala Leu Val Thr Ser Ala 
        115                 120                 125             


Asn Ser Thr Glu Ala Ala Asp Ser Ala Phe Leu Pro Thr Glu Asp Glu 
    130                 135                 140                 


Ser Leu Ser Thr Met Ser Cys Glu Met Leu Thr Glu Gln Thr Pro Ser 
145                 150                 155                 160 


Ser Asp Pro Glu Asn Ala Leu Glu Val Asn Gly Ala Glu Val Thr Gly 
                165                 170                 175     


Glu Lys Glu Asn His Cys Asp Asp Lys Thr Cys Val Pro Ser Thr Ser 
            180                 185                 190         


Ala Glu Asp Met Ser Glu Asn Val Pro Ile Ala Glu Asp Thr Thr Glu 
        195                 200                 205             


Gln Pro Lys Lys Asn Arg Ile Thr Tyr Ser Gln Ile Ile Lys Glu Gly 
    210                 215                 220                 


Arg Arg Phe Asn Ile Asp Leu Val Ser Lys Leu Leu Tyr Ser Arg Gly 
225                 230                 235                 240 


Leu Leu Ile Asp Leu Leu Ile Lys Ser Asn Val Ser Arg Tyr Ala Glu 
                245                 250                 255     


Phe Lys Asn Ile Thr Arg Ile Leu Ala Phe Arg Glu Gly Arg Val Glu 
            260                 265                 270         


Gln Val Pro Cys Ser Arg Ala Asp Val Phe Asn Ser Lys Gln Leu Thr 
        275                 280                 285             


Met Val Glu Lys Arg Met Leu Met Lys Phe Leu Thr Phe Cys Met Glu 
    290                 295                 300                 


Tyr Glu Lys Tyr Pro Asp Glu Tyr Lys Gly Tyr Glu Glu Ile Thr Phe 
305                 310                 315                 320 


Tyr Glu Tyr Leu Lys Thr Gln Lys Leu Thr Pro Asn Leu Gln Tyr Ile 
                325                 330                 335     


Val Met His Ser Ile Ala Met Thr Ser Glu Thr Ala Ser Ser Thr Ile 
            340                 345                 350         


Asp Gly Leu Lys Ala Thr Lys Asn Phe Leu His Cys Leu Gly Arg Tyr 
        355                 360                 365             


Gly Asn Thr Pro Phe Leu Phe Pro Leu Tyr Gly Gln Gly Glu Leu Pro 
    370                 375                 380                 


Gln Cys Phe Cys Arg Met Cys Ala Val Phe Gly Gly Ile Tyr Cys Leu 
385                 390                 395                 400 


Arg His Ser Val Gln Cys Leu Val Val Asp Lys Glu Ser Arg Lys Cys 
                405                 410                 415     


Lys Ala Ile Ile Asp Gln Phe Gly Gln Arg Ile Ile Ser Glu His Phe 
            420                 425                 430         


Leu Val Glu Asp Ser Tyr Phe Pro Glu Asn Met Cys Ser Arg Val Gln 
        435                 440                 445             


Tyr Arg Gln Ile Ser Arg Ala Val Leu Ile Thr Asp Arg Ser Val Leu 
    450                 455                 460                 


Lys Thr Asp Ser Asp Gln Gln Ile Ser Ile Leu Thr Val Pro Ala Glu 
465                 470                 475                 480 


Glu Pro Gly Thr Phe Ala Val Arg Val Ile Glu Leu Cys Ser Ser Thr 
                485                 490                 495     


Met Thr Cys Met Lys Gly Thr Tyr Leu Val His Leu Thr Cys Thr Ser 
            500                 505                 510         


Ser Lys Thr Ala Arg Glu Asp Leu Glu Ser Val Val Gln Lys Leu Phe 
        515                 520                 525             


Val Pro Tyr Thr Glu Met Glu Ile Glu Asn Glu Gln Val Glu Lys Pro 
    530                 535                 540                 


Arg Ile Leu Trp Ala Leu Tyr Phe Asn Met Arg Asp Ser Ser Asp Ile 
545                 550                 555                 560 


Ser Arg Ser Cys Tyr Asn Asp Leu Pro Ser Asn Val Tyr Val Cys Ser 
                565                 570                 575     


Gly Pro Asp Cys Gly Leu Gly Asn Asp Asn Ala Val Lys Gln Ala Glu 
            580                 585                 590         


Thr Leu Phe Gln Glu Ile Cys Pro Asn Glu Asp Phe Cys Pro Pro Pro 
        595                 600                 605             


Pro Asn Pro Glu Asp Ile Ile Leu Asp Gly Asp Ser Leu Gln Pro Glu 
    610                 615                 620                 


Ala Ser Glu Ser Ser Ala Ile Pro Glu Ala Asn Ser Glu Thr Phe Lys 
625                 630                 635                 640 


Glu Ser Thr Asn Leu Gly Asn Leu Glu Glu Ser Ser Glu 
                645                 650             


<210>  7
<211>  9187
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed plasmid


<220>
<221>  misc_feature
<222>  (1)..(145)
<223>  5' ITR

<220>
<221>  promoter
<222>  (169)..(1786)
<223>  CMV.CBA promoter

<220>
<221>  misc_feature
<222>  (1787)..(1794)
<223>  Not I cloning site, cuts at 1789

<220>
<221>  misc_feature
<222>  (1805)..(1810)
<223>  BamHI cloning site, cuts at 1806

<220>
<221>  polyA_signal
<222>  (1850)..(2052)
<223>  BGH PolyA

<220>
<221>  misc_feature
<222>  (2109)..(2252)
<223>  3' ITR

<220>
<221>  misc_feature
<222>  (2571)..(6624)
<223>  lambda stuffer

<220>
<221>  misc_feature
<222>  (7314)..(8126)
<223>  Kanamycin resistance (complementary)

<220>
<221>  misc_feature
<222>  (8485)..(9128)
<223>  Origin of replication (complementary)

<400>  7
tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt cgggcgacct ttggtcgccc       60

ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc aactccatca ctaggggttc      120

ctgcggccta gtaggctcag aggcacacag gagtttctgc aaatctagtg caggcgttac      180

ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc cattgacgtc      240

aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac gtcaatgggt      300

ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata tgccaagtac      360

gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc agtacatgac      420

cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta ttaacatggt      480

cgaggtgagc cccacgttct gcttcactct ccccatctcc cccccctccc cacccccaat      540

tttgtattta tttatttttt aattattttg tgcagcgatg ggggcggggg gggggggggg      600

gcgcgcgcca ggcggggcgg ggcggggcga ggggcggggc ggggcgaggc ggagaggtgc      660

ggcggcagcc aatcagagcg gcgcgctccg aaagtttcct tttatggcga ggcggcggcg      720

gcggcggccc tataaaaagc gaagcgcgcg gcgggcgggg agtcgctgcg acgctgcctt      780

cgccccgtgc cccgctccgc cgccgcctcg cgccgcccgc cccggctctg actgaccgcg      840

ttactcccac aggtgagcgg gcgggacggc ccttctcctc cgggctgtaa ttagcgcttg      900

gtttaatgac ggcttgtttc ttttctgtgg ctgcgtgaaa gccttgaggg gctccgggag      960

ggccctttgt gcggggggag cggctcgggg ggtgcgtgcg tgtgtgtgtg cgtggggagc     1020

gccgcgtgcg gctccgcgct gcccggcggc tgtgagcgct gcgggcgcgg cgcggggctt     1080

tgtgcgctcc gcagtgtgcg cgaggggagc gcggccgggg gcggtgcccc gcggtgcggg     1140

gggggctgcg aggggaacaa aggctgcgtg cggggtgtgt gcgtgggggg gtgagcaggg     1200

ggtgtgggcg cgtcggtcgg gctgcaaccc cccctgcacc cccctccccg agttgctgag     1260

cacggcccgg cttcgggtgc ggggctccgt acggggcgtg gcgcggggct cgccgtgccg     1320

ggcggggggt ggcggcaggt gggggtgccg ggcggggcgg ggccgcctcg ggccggggag     1380

ggctcggggg aggggcgcgg cggcccccgg agcgccggcg gctgtcgagg cgcggcgagc     1440

cgcagccatt gccttttatg gtaatcgtgc gagagggcgc agggacttcc tttgtcccaa     1500

atctgtgcgg agccgaaatc tgggaggcgc cgccgcaccc cctctagcgg gcgcggggcg     1560

aagcggtgcg gcgccggcag gaaggaaatg ggcggggagg gccttcgtgc gtcgccgcgc     1620

cgccgtcccc ttctccctct ccagcctcgg ggctgtccgc ggggggacgg ctgccttcgg     1680

gggggacggg gcagggcggg gttcggcttc tggcgtgtga ccggcggctc tagacaattg     1740

tactaacctt cttctctttc ctctcctgac aggttggtgt acactagcgg ccgcatagta     1800

ctgcggatcc tgcagatctc gagccgaatt cctgcagccc gggggatcag cctcgactgt     1860

gccttctagt tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga     1920

aggtgccact cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag     1980

taggtgtcat tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga     2040

agacaatagc aggcatgctg gggatgcggt gggctctatg gcttctgagg cggaaagaac     2100

cagctggggc tcgagatcca ctagggccgc aggaacccct agtgatggag ttggccactc     2160

cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc cgacgcccgg     2220

gctttgcccg ggcggcctca gtgagcgagc gacctgcagg ggcagcttga aggaaatact     2280

aaggcaaagg tactgcaagt gctcgcaaca ttcgcttatg cggattattg ccgtagtgcc     2340

gcgacgccgg gggcaagatg cagagattgc catggtacag gccgtgcggt tgatattgcc     2400

aaaacagagc tgtgggggag agttgtcgag aaagagtgcg gaagatgcaa aggcgtcggc     2460

tattcaagga tgccagcaag cgcagcatat cgcgctgtga cgatgctaat cccaaacctt     2520

acccaaccca cctggtcacg cactgttaag ccgctgtatg acgctctggt ggtgcaatgc     2580

cacaaagaag agtcaatcgc agacaacatt ttgaatgcgg tcacacgtta gcagcatgat     2640

tgccacggat ggcaacatat taacggcatg atattgactt attgaataaa attgggtaaa     2700

tttgactcaa cgatgggtta attcgctcgt tgtggtagtg agatgaaaag aggcggcgct     2760

tactaccgat tccgcctagt tggtcacttc gacgtatcgt ctggaactcc aaccatcgca     2820

ggcagagagg tctgcaaaat gcaatcccga aacagttcgc aggtaatagt tagagcctgc     2880

ataacggttt cgggattttt tatatctgca caacaggtaa gagcattgag tcgataatcg     2940

tgaagagtcg gcgagcctgg ttagccagtg ctctttccgt tgtgctgaat taagcgaata     3000

ccggaagcag aaccggatca ccaaatgcgt acaggcgtca tcgccgccca gcaacagcac     3060

aacccaaact gagccgtagc cactgtctgt cctgaattca ttagtaatag ttacgctgcg     3120

gccttttaca catgaccttc gtgaaagcgg gtggcaggag gtcgcgctaa caacctcctg     3180

ccgttttgcc cgtgcatatc ggtcacgaac aaatctgatt actaaacaca gtagcctgga     3240

tttgttctat cagtaatcga ccttattcct aattaaatag agcaaatccc cttattgggg     3300

gtaagacatg aagatgccag aaaaacatga cctgttggcc gccattctcg cggcaaagga     3360

acaaggcatc ggggcaatcc ttgcgtttgc aatggcgtac cttcgcggca gatataatgg     3420

cggtgcgttt acaaaaacag taatcgacgc aacgatgtgc gccattatcg cctggttcat     3480

tcgtgacctt ctcgacttcg ccggactaag tagcaatctc gcttatataa cgagcgtgtt     3540

tatcggctac atcggtactg actcgattgg ttcgcttatc aaacgcttcg ctgctaaaaa     3600

agccggagta gaagatggta gaaatcaata atcaacgtaa ggcgttcctc gatatgctgg     3660

cgtggtcgga gggaactgat aacggacgtc agaaaaccag aaatcatggt tatgacgtca     3720

ttgtaggcgg agagctattt actgattact ccgatcaccc tcgcaaactt gtcacgctaa     3780

acccaaaact caaatcaaca ggcgccggac gctaccagct tctttcccgt tggtgggatg     3840

cctaccgcaa gcagcttggc ctgaaagact tctctccgaa aagtcaggac gctgtggcat     3900

tgcagcagat taaggagcgt ggcgctttac ctatgattga tcgtggtgat atccgtcagg     3960

caatcgaccg ttgcagcaat atctgggctt cactgccggg cgctggttat ggtcagttcg     4020

agcataaggc tgacagcctg attgcaaaat tcaaagaagc gggcggaacg gtcagagaga     4080

ttgatgtatg agcagagtca ccgcgattat ctccgctctg gttatctgca tcatcgtctg     4140

cctgtcatgg gctgttaatc attaccgtga taacgccatt acctacaaag cccagcgcga     4200

caaaaatgcc agagaactga agctggcgaa cgcggcaatt actgacatgc agatgcgtca     4260

gcgtgatgtt gctgcgctcg atgcaaaata cacgaaggag ttagctgatg ctaaagctga     4320

aaatgatgct ctgcgtgatg atgttgccgc tggtcgtcgt cggttgcaca tcaaagcagt     4380

ctgtcagtca gtgcgtgaag ccaccaccgc ctccggcgtg gataatgcag cctccccccg     4440

actggcagac accgctgaac gggattattt caccctcaga gagaggctga tcactatgca     4500

aaaacaactg gaaggaaccc agaagtatat taatgagcag tgcagataga gttgcccata     4560

tcgatgggca actcatgcaa ttattgtgag caatacacac gcgcttccag cggagtataa     4620

atgcctaaag taataaaacc gagcaatcca tttacgaatg tttgctgggt ttctgtttta     4680

acaacatttt ctgcgccgcc acaaattttg gctgcatcga cagttttctt ctgcccaatt     4740

ccagaaacga agaaatgatg ggtgatggtt tcctttggtg ctactgctgc cggtttgttt     4800

tgaacagtaa acgtctgttg agcacatcct gtaataagca gggccagcgc agtagcgagt     4860

agcatttttt tcatggtgtt attcccgatg ctttttgaag ttcgcagaat cgtatgtgta     4920

gaaaattaaa caaaccctaa acaatgagtt gaaatttcat attgttaata tttattaatg     4980

tatgtcaggt gcgatgaatc gtcattgtat tcccggatta actatgtcca cagccctgac     5040

ggggaacttc tctgcgggag tgtccgggaa taattaaaac gatgcacaca gggtttagcg     5100

cgtacacgta ttgcattatg ccaacgcccc ggtgctgaca cggaagaaac cggacgttat     5160

gatttagcgt ggaaagattt gtgtagtgtt ctgaatgctc tcagtaaata gtaatgaatt     5220

atcaaaggta tagtaatatc ttttatgttc atggatattt gtaacccatc ggaaaactcc     5280

tgctttagca agattttccc tgtattgctg aaatgtgatt tctcttgatt tcaacctatc     5340

ataggacgtt tctataagat gcgtgtttct tgagaattta acatttacaa cctttttaag     5400

tccttttatt aacacggtgt tatcgttttc taacacgatg tgaatattat ctgtggctag     5460

atagtaaata taatgtgaga cgttgtgacg ttttagttca gaataaaaca attcacagtc     5520

taaatctttt cgcacttgat cgaatatttc tttaaaaatg gcaacctgag ccattggtaa     5580

aaccttccat gtgatacgag ggcgcgtagt ttgcattatc gtttttatcg tttcaatctg     5640

gtctgacctc cttgtgtttt gttgatgatt tatgtcaaat attaggaatg ttttcactta     5700

atagtattgg ttgcgtaaca aagtgcggtc ctgctggcat tctggaggga aatacaaccg     5760

acagatgtat gtaaggccaa cgtgctcaaa tcttcataca gaaagatttg aagtaatatt     5820

ttaaccgcta gatgaagagc aagcgcatgg agcgacaaaa tgaataaaga acaatctgct     5880

gatgatccct ccgtggatct gattcgtgta aaaaatatgc ttaatagcac catttctatg     5940

agttaccctg atgttgtaat tgcatgtata gaacataagg tgtctctgga agcattcaga     6000

gcaattgagg cagcgttggt gaagcacgat aataatatga aggattattc cctggtggtt     6060

gactgatcac cataactgct aatcattcaa actatttagt ctgtgacaga gccaacacgc     6120

agtctgtcac tgtcaggaaa gtggtaaaac tgcaactcaa ttactgcaat gccctcgtaa     6180

ttaagtgaat ttacaatatc gtcctgttcg gagggaagaa cgcgggatgt tcattcttca     6240

tcacttttaa ttgatgtata tgctctcttt tctgacgtta gtctccgacg gcaggcttca     6300

atgacccagg ctgagaaatt cccggaccct ttttgctcaa gagcgatgtt aatttgttca     6360

atcatttggt taggaaagcg gatgttgcgg gttgttgttc tgcgggttct gttcttcgtt     6420

gacatgaggt tgccccgtat tcagtgtcgc tgatttgtat tgtctgaagt tgtttttacg     6480

ttaagttgat gcagatcaat taatacgata cctgcgtcat aattgattat ttgacgtggt     6540

ttgatggcct ccacgcacgt tgtgatatgt agatgataat cattatcact ttacgggtcc     6600

tttccggtga tccgacaggt tacggcctga tgcggtattt tctccttacg catctgtgcg     6660

gtatttcaca ccgcatacgt caaagcaacc atagtacgcg ccctgtagcg gcgcattaag     6720

cgcggcgggt gtggtggtta cgcgcagcgt gaccgctaca cttgccagcg ccctagcgcc     6780

cgctcctttc gctttcttcc cttcctttct cgccacgttc gccggctttc cccgtcaagc     6840

tctaaatcgg gggctccctt tagggttccg atttagtgct ttacggcacc tcgaccccaa     6900

aaaacttgat ttgggtgatg gttcacgtag tgggccatcg ccctgataga cggtttttcg     6960

ccctttgacg ttggagtcca cgttctttaa tagtggactc ttgttccaaa ctggaacaac     7020

actcaaccct atctcgggct attcttttga tttagacctg caggcatgca agcttactgg     7080

ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt aatcgccttg     7140

cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc gatcgccctt     7200

cccaacagtt gcgcagcctg aatggcgaat gcgatttatt caacaaagcc gccgtcccgt     7260

caagtcagcg taatgctctg ccagtgttac aaccaattaa ccaattctga ttagaaaaac     7320

tcatcgagca tcaaatgaaa ctgcaattta ttcatatcag gattatcaat accatatttt     7380

tgaaaaagcc gtttctgtaa tgaaggagaa aactcaccga ggcagttcca taggatggca     7440

agatcctggt atcggtctgc gattccgact cgtccaacat caatacaacc tattaatttc     7500

ccctcgtcaa aaataaggtt atcaagtgag aaatcaccat gagtgacgac tgaatccggt     7560

gagaatggca aaagcttatg catttctttc cagacttgtt caacaggcca gccattacgc     7620

tcgtcatcaa aatcactcgc atcaaccaaa ccgttattca ttcgtgattg cgcctgagcg     7680

agacgaaata cgcgatcgct gttaaaagga caattacaaa caggaatcga atgcaaccgg     7740

cgcaggaaca ctgccagcgc atcaacaata ttttcacctg aatcaggata ttcttctaat     7800

acctggaatg ctgttttccc ggggatcgca gtggtgagta accatgcatc atcaggagta     7860

cggataaaat gcttgatggt cggaagaggc ataaattccg tcagccagtt tagtctgacc     7920

atctcatctg taacatcatt ggcaacgcta cctttgccat gtttcagaaa caactctggc     7980

gcatcgggct tcccatacaa tcgatagatt gtcgcacctg attgcccgac attatcgcga     8040

gcccatttat acccatataa atcagcatcc atgttggaat ttaatcgcgg cttcgagcaa     8100

gacgtttccc gttgaatatg gctcataaca ccccttgtat tactgtttat gtaagcagac     8160

agttttattg ttcatgatga tatattttta tcttgtgcaa tgtaacatca gagattttga     8220

gacacaacgt ggctttgttg aataaatcga acttttgctg agttgaagga tcagatcacg     8280

catcttcccg acaacgcaga ccgttccgtg gcaaagcaaa agttcaaaat caccaactgg     8340

tccacctaca acaaagctct catcaaccgt ggctccctca ctttctggct ggatgatggg     8400

gcgattcagg cctggtatga gtcagcaaca ccttcttcac gaggcagacc tctcgacgga     8460

tcgttccact gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg agatcctttt     8520

tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac cgctaccagc ggtggtttgt     8580

ttgccggatc aagagctacc aactcttttt ccgaaggtaa ctggcttcag cagagcgcag     8640

ataccaaata ctgtccttct agtgtagccg tagttaggcc accacttcaa gaactctgta     8700

gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc cagtggcgat     8760

aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac cggataaggc gcagcggtcg     8820

ggctgaacgg ggggttcgtg cacacagccc agcttggagc gaacgaccta caccgaactg     8880

agatacctac agcgtgagct atgagaaagc gccacgcttc ccgaagggag aaaggcggac     8940

aggtatccgg taagcggcag ggtcggaaca ggagagcgca cgagggagct tccaggggga     9000

aacgcctggt atctttatag tcctgtcggg tttcgccacc tctgacttga gcgtcgattt     9060

ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc ggccttttta     9120

cggttcctgg ccttttgctg gccttttgct cacatgtcct gcaggcagct gcgcgccagc     9180

tgcgcgc                                                               9187


<210>  8
<211>  11148
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed plasmid

<400>  8
tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt cgggcgacct ttggtcgccc       60

ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc aactccatca ctaggggttc      120

ctgcggccta gtaggctcag aggcacacag gagtttctgc aaatctagtg caggcgttac      180

ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc cattgacgtc      240

aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac gtcaatgggt      300

ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata tgccaagtac      360

gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc agtacatgac      420

cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta ttaacatggt      480

cgaggtgagc cccacgttct gcttcactct ccccatctcc cccccctccc cacccccaat      540

tttgtattta tttatttttt aattattttg tgcagcgatg ggggcggggg gggggggggg      600

gcgcgcgcca ggcggggcgg ggcggggcga ggggcggggc ggggcgaggc ggagaggtgc      660

ggcggcagcc aatcagagcg gcgcgctccg aaagtttcct tttatggcga ggcggcggcg      720

gcggcggccc tataaaaagc gaagcgcgcg gcgggcgggg agtcgctgcg acgctgcctt      780

cgccccgtgc cccgctccgc cgccgcctcg cgccgcccgc cccggctctg actgaccgcg      840

ttactcccac aggtgagcgg gcgggacggc ccttctcctc cgggctgtaa ttagcgcttg      900

gtttaatgac ggcttgtttc ttttctgtgg ctgcgtgaaa gccttgaggg gctccgggag      960

ggccctttgt gcggggggag cggctcgggg ggtgcgtgcg tgtgtgtgtg cgtggggagc     1020

gccgcgtgcg gctccgcgct gcccggcggc tgtgagcgct gcgggcgcgg cgcggggctt     1080

tgtgcgctcc gcagtgtgcg cgaggggagc gcggccgggg gcggtgcccc gcggtgcggg     1140

gggggctgcg aggggaacaa aggctgcgtg cggggtgtgt gcgtgggggg gtgagcaggg     1200

ggtgtgggcg cgtcggtcgg gctgcaaccc cccctgcacc cccctccccg agttgctgag     1260

cacggcccgg cttcgggtgc ggggctccgt acggggcgtg gcgcggggct cgccgtgccg     1320

ggcggggggt ggcggcaggt gggggtgccg ggcggggcgg ggccgcctcg ggccggggag     1380

ggctcggggg aggggcgcgg cggcccccgg agcgccggcg gctgtcgagg cgcggcgagc     1440

cgcagccatt gccttttatg gtaatcgtgc gagagggcgc agggacttcc tttgtcccaa     1500

atctgtgcgg agccgaaatc tgggaggcgc cgccgcaccc cctctagcgg gcgcggggcg     1560

aagcggtgcg gcgccggcag gaaggaaatg ggcggggagg gccttcgtgc gtcgccgcgc     1620

cgccgtcccc ttctccctct ccagcctcgg ggctgtccgc ggggggacgg ctgccttcgg     1680

gggggacggg gcagggcggg gttcggcttc tggcgtgtga ccggcggctc tagacaattg     1740

tactaacctt cttctctttc ctctcctgac aggttggtgt acactagcgg ccgccaccat     1800

ggctgatacc ctgccctctg aattcgacgt gattgtgatt ggaaccggac tccctgaatc     1860

gatcatcgcc gcggcctgtt cccggtccgg tcggcgcgtg ctgcacgtcg attcgagaag     1920

ctactacgga gggaattggg cctcattctc cttctccgga ctgctctcct ggctgaagga     1980

gtatcaggag aactccgaca ttgtctccga ctcacctgtg tggcaggacc agatcctgga     2040

aaacgaggaa gcaatagccc tgagccggaa ggacaagacc atccagcacg tggaggtgtt     2100

ctgttatgcc tcccaagacc tccatgagga cgtggaagag gctggagcgt tgcagaagaa     2160

tcatgccctc gtgacctccg ctaactccac cgaggcagcc gacagcgcct tcctgccgac     2220

cgaggatgaa tccctgtcaa ctatgtcgtg cgaaatgctg accgaacaga ctccgagctc     2280

cgaccccgaa aacgccctgg aagtgaacgg agcggaagtg accggcgaaa aggagaacca     2340

ttgcgacgac aagacttgtg tcccatccac ttccgcggag gacatgtccg agaatgtgcc     2400

tatcgccgag gacaccaccg aacagcccaa gaagaacaga atcacgtaca gccagatcat     2460

caaggagggg cggaggttta acatcgatct ggtgtcgaag ctgctgtaca gccgcggtct     2520

gctgatcgat ctgctcatta agtcgaacgt gtcgagatac gccgagttca agaacatcac     2580

aaggattctc gccttccggg aaggaagagt ggaacaagtg ccgtgctccc gggccgacgt     2640

gttcaactca aagcaactta ccatggtgga aaagcgcatg ctgatgaaat tcctgacctt     2700

ctgcatggag tacgaaaagt accctgatga gtacaagggt tacgaagaaa ttactttcta     2760

cgagtacctc aagacccaga agctgacccc gaatctgcag tacattgtga tgcactcaat     2820

cgcaatgacc tccgaaaccg cctcctcgac catcgacggg ctcaaggcca ccaagaactt     2880

cctgcactgt ttggggcgct acggcaacac tccgttcctc ttcccgctgt acggccaggg     2940

agagctgcct cagtgtttct gccggatgtg cgccgtgttc ggcggaatct actgtctccg     3000

ccactcggtc cagtgcctgg tggtggacaa ggaatccagg aagtgcaaag ccattattga     3060

ccagttcgga caacggatca tttccgagca ctttcttgtg gaggactcat acttcccgga     3120

gaacatgtgc tctcgggtcc agtatcgaca gatttccagg gcggtgctca ttactgaccg     3180

gagcgtcctc aagaccgata gcgaccagca gatctccatc ctgaccgtgc cggcggaaga     3240

acccggcact tttgccgtgc gcgtgatcga gctttgctca tccaccatga cttgcatgaa     3300

aggcacttac ctggtgcacc tgacgtgcac ctcatcgaaa accgctagag aggacctgga     3360

atccgtcgtc caaaagctgt tcgtgcctta caccgagatg gaaattgaaa acgaacaagt     3420

ggagaagccc cgcatccttt gggccctgta ctttaacatg cgcgattcct ccgatatctc     3480

gcggtcctgc tataacgact tgccttcgaa cgtctacgtc tgctccgggc cagactgcgg     3540

tcttggcaac gacaatgccg tgaagcaggc ggaaacactg ttccaagaga tctgccctaa     3600

cgaggatttt tgcccgcccc ccccaaaccc cgaggatatc atcttggacg gagacagcct     3660

gcagccagaa gcatccgagt ccagcgccat cccggaggcc aacagcgaaa ccttcaagga     3720

gagcactaac ctgggcaacc tggaagagtc cagcgaatga tcataggatc ctgcagatct     3780

cgagccgaat tcctgcagcc cgggggatca gcctcgactg tgccttctag ttgccagcca     3840

tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg aaggtgccac tcccactgtc     3900

ctttcctaat aaaatgagga aattgcatcg cattgtctga gtaggtgtca ttctattctg     3960

gggggtgggg tggggcagga cagcaagggg gaggattggg aagacaatag caggcatgct     4020

ggggatgcgg tgggctctat ggcttctgag gcggaaagaa ccagctgggg ctcgagatcc     4080

actagggccg caggaacccc tagtgatgga gttggccact ccctctctgc gcgctcgctc     4140

gctcactgag gccgggcgac caaaggtcgc ccgacgcccg ggctttgccc gggcggcctc     4200

agtgagcgag cgacctgcag gggcagcttg aaggaaatac taaggcaaag gtactgcaag     4260

tgctcgcaac attcgcttat gcggattatt gccgtagtgc cgcgacgccg ggggcaagat     4320

gcagagattg ccatggtaca ggccgtgcgg ttgatattgc caaaacagag ctgtggggga     4380

gagttgtcga gaaagagtgc ggaagatgca aaggcgtcgg ctattcaagg atgccagcaa     4440

gcgcagcata tcgcgctgtg acgatgctaa tcccaaacct tacccaaccc acctggtcac     4500

gcactgttaa gccgctgtat gacgctctgg tggtgcaatg ccacaaagaa gagtcaatcg     4560

cagacaacat tttgaatgcg gtcacacgtt agcagcatga ttgccacgga tggcaacata     4620

ttaacggcat gatattgact tattgaataa aattgggtaa atttgactca acgatgggtt     4680

aattcgctcg ttgtggtagt gagatgaaaa gaggcggcgc ttactaccga ttccgcctag     4740

ttggtcactt cgacgtatcg tctggaactc caaccatcgc aggcagagag gtctgcaaaa     4800

tgcaatcccg aaacagttcg caggtaatag ttagagcctg cataacggtt tcgggatttt     4860

ttatatctgc acaacaggta agagcattga gtcgataatc gtgaagagtc ggcgagcctg     4920

gttagccagt gctctttccg ttgtgctgaa ttaagcgaat accggaagca gaaccggatc     4980

accaaatgcg tacaggcgtc atcgccgccc agcaacagca caacccaaac tgagccgtag     5040

ccactgtctg tcctgaattc attagtaata gttacgctgc ggccttttac acatgacctt     5100

cgtgaaagcg ggtggcagga ggtcgcgcta acaacctcct gccgttttgc ccgtgcatat     5160

cggtcacgaa caaatctgat tactaaacac agtagcctgg atttgttcta tcagtaatcg     5220

accttattcc taattaaata gagcaaatcc ccttattggg ggtaagacat gaagatgcca     5280

gaaaaacatg acctgttggc cgccattctc gcggcaaagg aacaaggcat cggggcaatc     5340

cttgcgtttg caatggcgta ccttcgcggc agatataatg gcggtgcgtt tacaaaaaca     5400

gtaatcgacg caacgatgtg cgccattatc gcctggttca ttcgtgacct tctcgacttc     5460

gccggactaa gtagcaatct cgcttatata acgagcgtgt ttatcggcta catcggtact     5520

gactcgattg gttcgcttat caaacgcttc gctgctaaaa aagccggagt agaagatggt     5580

agaaatcaat aatcaacgta aggcgttcct cgatatgctg gcgtggtcgg agggaactga     5640

taacggacgt cagaaaacca gaaatcatgg ttatgacgtc attgtaggcg gagagctatt     5700

tactgattac tccgatcacc ctcgcaaact tgtcacgcta aacccaaaac tcaaatcaac     5760

aggcgccgga cgctaccagc ttctttcccg ttggtgggat gcctaccgca agcagcttgg     5820

cctgaaagac ttctctccga aaagtcagga cgctgtggca ttgcagcaga ttaaggagcg     5880

tggcgcttta cctatgattg atcgtggtga tatccgtcag gcaatcgacc gttgcagcaa     5940

tatctgggct tcactgccgg gcgctggtta tggtcagttc gagcataagg ctgacagcct     6000

gattgcaaaa ttcaaagaag cgggcggaac ggtcagagag attgatgtat gagcagagtc     6060

accgcgatta tctccgctct ggttatctgc atcatcgtct gcctgtcatg ggctgttaat     6120

cattaccgtg ataacgccat tacctacaaa gcccagcgcg acaaaaatgc cagagaactg     6180

aagctggcga acgcggcaat tactgacatg cagatgcgtc agcgtgatgt tgctgcgctc     6240

gatgcaaaat acacgaagga gttagctgat gctaaagctg aaaatgatgc tctgcgtgat     6300

gatgttgccg ctggtcgtcg tcggttgcac atcaaagcag tctgtcagtc agtgcgtgaa     6360

gccaccaccg cctccggcgt ggataatgca gcctcccccc gactggcaga caccgctgaa     6420

cgggattatt tcaccctcag agagaggctg atcactatgc aaaaacaact ggaaggaacc     6480

cagaagtata ttaatgagca gtgcagatag agttgcccat atcgatgggc aactcatgca     6540

attattgtga gcaatacaca cgcgcttcca gcggagtata aatgcctaaa gtaataaaac     6600

cgagcaatcc atttacgaat gtttgctggg tttctgtttt aacaacattt tctgcgccgc     6660

cacaaatttt ggctgcatcg acagttttct tctgcccaat tccagaaacg aagaaatgat     6720

gggtgatggt ttcctttggt gctactgctg ccggtttgtt ttgaacagta aacgtctgtt     6780

gagcacatcc tgtaataagc agggccagcg cagtagcgag tagcattttt ttcatggtgt     6840

tattcccgat gctttttgaa gttcgcagaa tcgtatgtgt agaaaattaa acaaacccta     6900

aacaatgagt tgaaatttca tattgttaat atttattaat gtatgtcagg tgcgatgaat     6960

cgtcattgta ttcccggatt aactatgtcc acagccctga cggggaactt ctctgcggga     7020

gtgtccggga ataattaaaa cgatgcacac agggtttagc gcgtacacgt attgcattat     7080

gccaacgccc cggtgctgac acggaagaaa ccggacgtta tgatttagcg tggaaagatt     7140

tgtgtagtgt tctgaatgct ctcagtaaat agtaatgaat tatcaaaggt atagtaatat     7200

cttttatgtt catggatatt tgtaacccat cggaaaactc ctgctttagc aagattttcc     7260

ctgtattgct gaaatgtgat ttctcttgat ttcaacctat cataggacgt ttctataaga     7320

tgcgtgtttc ttgagaattt aacatttaca acctttttaa gtccttttat taacacggtg     7380

ttatcgtttt ctaacacgat gtgaatatta tctgtggcta gatagtaaat ataatgtgag     7440

acgttgtgac gttttagttc agaataaaac aattcacagt ctaaatcttt tcgcacttga     7500

tcgaatattt ctttaaaaat ggcaacctga gccattggta aaaccttcca tgtgatacga     7560

gggcgcgtag tttgcattat cgtttttatc gtttcaatct ggtctgacct ccttgtgttt     7620

tgttgatgat ttatgtcaaa tattaggaat gttttcactt aatagtattg gttgcgtaac     7680

aaagtgcggt cctgctggca ttctggaggg aaatacaacc gacagatgta tgtaaggcca     7740

acgtgctcaa atcttcatac agaaagattt gaagtaatat tttaaccgct agatgaagag     7800

caagcgcatg gagcgacaaa atgaataaag aacaatctgc tgatgatccc tccgtggatc     7860

tgattcgtgt aaaaaatatg cttaatagca ccatttctat gagttaccct gatgttgtaa     7920

ttgcatgtat agaacataag gtgtctctgg aagcattcag agcaattgag gcagcgttgg     7980

tgaagcacga taataatatg aaggattatt ccctggtggt tgactgatca ccataactgc     8040

taatcattca aactatttag tctgtgacag agccaacacg cagtctgtca ctgtcaggaa     8100

agtggtaaaa ctgcaactca attactgcaa tgccctcgta attaagtgaa tttacaatat     8160

cgtcctgttc ggagggaaga acgcgggatg ttcattcttc atcactttta attgatgtat     8220

atgctctctt ttctgacgtt agtctccgac ggcaggcttc aatgacccag gctgagaaat     8280

tcccggaccc tttttgctca agagcgatgt taatttgttc aatcatttgg ttaggaaagc     8340

ggatgttgcg ggttgttgtt ctgcgggttc tgttcttcgt tgacatgagg ttgccccgta     8400

ttcagtgtcg ctgatttgta ttgtctgaag ttgtttttac gttaagttga tgcagatcaa     8460

ttaatacgat acctgcgtca taattgatta tttgacgtgg tttgatggcc tccacgcacg     8520

ttgtgatatg tagatgataa tcattatcac tttacgggtc ctttccggtg atccgacagg     8580

ttacggcctg atgcggtatt ttctccttac gcatctgtgc ggtatttcac accgcatacg     8640

tcaaagcaac catagtacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt     8700

acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc     8760

ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg ggggctccct     8820

ttagggttcc gatttagtgc tttacggcac ctcgacccca aaaaacttga tttgggtgat     8880

ggttcacgta gtgggccatc gccctgatag acggtttttc gccctttgac gttggagtcc     8940

acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc tatctcgggc     9000

tattcttttg atttagacct gcaggcatgc aagcttactg gccgtcgttt tacaacgtcg     9060

tgactgggaa aaccctggcg ttacccaact taatcgcctt gcagcacatc cccctttcgc     9120

cagctggcgt aatagcgaag aggcccgcac cgatcgccct tcccaacagt tgcgcagcct     9180

gaatggcgaa tgcgatttat tcaacaaagc cgccgtcccg tcaagtcagc gtaatgctct     9240

gccagtgtta caaccaatta accaattctg attagaaaaa ctcatcgagc atcaaatgaa     9300

actgcaattt attcatatca ggattatcaa taccatattt ttgaaaaagc cgtttctgta     9360

atgaaggaga aaactcaccg aggcagttcc ataggatggc aagatcctgg tatcggtctg     9420

cgattccgac tcgtccaaca tcaatacaac ctattaattt cccctcgtca aaaataaggt     9480

tatcaagtga gaaatcacca tgagtgacga ctgaatccgg tgagaatggc aaaagcttat     9540

gcatttcttt ccagacttgt tcaacaggcc agccattacg ctcgtcatca aaatcactcg     9600

catcaaccaa accgttattc attcgtgatt gcgcctgagc gagacgaaat acgcgatcgc     9660

tgttaaaagg acaattacaa acaggaatcg aatgcaaccg gcgcaggaac actgccagcg     9720

catcaacaat attttcacct gaatcaggat attcttctaa tacctggaat gctgttttcc     9780

cggggatcgc agtggtgagt aaccatgcat catcaggagt acggataaaa tgcttgatgg     9840

tcggaagagg cataaattcc gtcagccagt ttagtctgac catctcatct gtaacatcat     9900

tggcaacgct acctttgcca tgtttcagaa acaactctgg cgcatcgggc ttcccataca     9960

atcgatagat tgtcgcacct gattgcccga cattatcgcg agcccattta tacccatata    10020

aatcagcatc catgttggaa tttaatcgcg gcttcgagca agacgtttcc cgttgaatat    10080

ggctcataac accccttgta ttactgttta tgtaagcaga cagttttatt gttcatgatg    10140

atatattttt atcttgtgca atgtaacatc agagattttg agacacaacg tggctttgtt    10200

gaataaatcg aacttttgct gagttgaagg atcagatcac gcatcttccc gacaacgcag    10260

accgttccgt ggcaaagcaa aagttcaaaa tcaccaactg gtccacctac aacaaagctc    10320

tcatcaaccg tggctccctc actttctggc tggatgatgg ggcgattcag gcctggtatg    10380

agtcagcaac accttcttca cgaggcagac ctctcgacgg atcgttccac tgagcgtcag    10440

accccgtaga aaagatcaaa ggatcttctt gagatccttt ttttctgcgc gtaatctgct    10500

gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg tttgccggat caagagctac    10560

caactctttt tccgaaggta actggcttca gcagagcgca gataccaaat actgtccttc    10620

tagtgtagcc gtagttaggc caccacttca agaactctgt agcaccgcct acatacctcg    10680

ctctgctaat cctgttacca gtggctgctg ccagtggcga taagtcgtgt cttaccgggt    10740

tggactcaag acgatagtta ccggataagg cgcagcggtc gggctgaacg gggggttcgt    10800

gcacacagcc cagcttggag cgaacgacct acaccgaact gagataccta cagcgtgagc    10860

tatgagaaag cgccacgctt cccgaaggga gaaaggcgga caggtatccg gtaagcggca    10920

gggtcggaac aggagagcgc acgagggagc ttccaggggg aaacgcctgg tatctttata    10980

gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg    11040

ggcggagcct atggaaaaac gccagcaacg cggccttttt acggttcctg gccttttgct    11100

ggccttttgc tcacatgtcc tgcaggcagc tgcgcgccag ctgcgcgc                 11148


<210>  9
<211>  2085
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  codon optimized sequence


<220>
<221>  CDS
<222>  (1)..(2085)
<223>  codon-optimized ORF

<400>  9
atg gct aag att aac acc cag tac tca cat cca tcc cgc act cac ctc         48
Met Ala Lys Ile Asn Thr Gln Tyr Ser His Pro Ser Arg Thr His Leu           
1               5                   10                  15                

aaa gtc aag acc tcc gat cgg gat ctg aac cgg gct gag aat ggg ctg         96
Lys Val Lys Thr Ser Asp Arg Asp Leu Asn Arg Ala Glu Asn Gly Leu           
            20                  25                  30                    

tcg cgc gcc cac tcg tcg tcc gag gaa acc agc agc gtg ctc cag ccg        144
Ser Arg Ala His Ser Ser Ser Glu Glu Thr Ser Ser Val Leu Gln Pro           
        35                  40                  45                        

ggc atc gcc atg gaa act agg ggg ctg gcg gac tcc gga cag gga tcc        192
Gly Ile Ala Met Glu Thr Arg Gly Leu Ala Asp Ser Gly Gln Gly Ser           
    50                  55                  60                            

ttc act gga cag ggt att gcc cgg ctg agc aga ctg atc ttc ctg ctt        240
Phe Thr Gly Gln Gly Ile Ala Arg Leu Ser Arg Leu Ile Phe Leu Leu           
65                  70                  75                  80            

cgc cgc tgg gcg gcc aga cac gtg cac cat cag gac cag gga cct gat        288
Arg Arg Trp Ala Ala Arg His Val His His Gln Asp Gln Gly Pro Asp           
                85                  90                  95                

agc ttc ccc gac cgc ttt agg gga gcc gag ctg aaa gaa gtg tca agc        336
Ser Phe Pro Asp Arg Phe Arg Gly Ala Glu Leu Lys Glu Val Ser Ser           
            100                 105                 110                   

cag gag tca aac gcg cag gcc aac gtc ggc agc caa gag cct gca gac        384
Gln Glu Ser Asn Ala Gln Ala Asn Val Gly Ser Gln Glu Pro Ala Asp           
        115                 120                 125                       

cgg gga cgc tcg gca tgg ccg ctc gca aag tgc aac act aac act tcc        432
Arg Gly Arg Ser Ala Trp Pro Leu Ala Lys Cys Asn Thr Asn Thr Ser           
    130                 135                 140                           

aac aac acc gaa gag gaa aag aaa acc aag aag aag gat gca att gtg        480
Asn Asn Thr Glu Glu Glu Lys Lys Thr Lys Lys Lys Asp Ala Ile Val           
145                 150                 155                 160           

gtg gac cct tcc tcc aac ctg tac tac cgc tgg ttg acc gcc atc gcc        528
Val Asp Pro Ser Ser Asn Leu Tyr Tyr Arg Trp Leu Thr Ala Ile Ala           
                165                 170                 175               

ctc ccg gtc ttt tac aat tgg tat ctc ctt atc tgc cgg gcc tgc ttc        576
Leu Pro Val Phe Tyr Asn Trp Tyr Leu Leu Ile Cys Arg Ala Cys Phe           
            180                 185                 190                   

gac gaa ctg caa tca gag tac ctg atg ctg tgg ctg gtg ctg gac tat        624
Asp Glu Leu Gln Ser Glu Tyr Leu Met Leu Trp Leu Val Leu Asp Tyr           
        195                 200                 205                       

agc gcc gat gtg ctc tac gtc ctg gat gtg ctc gtg cgc gcc cgg acc        672
Ser Ala Asp Val Leu Tyr Val Leu Asp Val Leu Val Arg Ala Arg Thr           
    210                 215                 220                           

gga ttc ttg gaa caa ggc ctg atg gtg tcc gac acg aat aga ctg tgg        720
Gly Phe Leu Glu Gln Gly Leu Met Val Ser Asp Thr Asn Arg Leu Trp           
225                 230                 235                 240           

cag cac tat aag acc aca acc cag ttc aag ctt gac gtg ctc agc ctt        768
Gln His Tyr Lys Thr Thr Thr Gln Phe Lys Leu Asp Val Leu Ser Leu           
                245                 250                 255               

gtg ccg act gac ctg gcc tac ctg aaa gtc gga act aac tac ccg gaa        816
Val Pro Thr Asp Leu Ala Tyr Leu Lys Val Gly Thr Asn Tyr Pro Glu           
            260                 265                 270                   

gtc aga ttc aac cga ctc ctg aag ttc agc agg ctg ttc gag ttc ttt        864
Val Arg Phe Asn Arg Leu Leu Lys Phe Ser Arg Leu Phe Glu Phe Phe           
        275                 280                 285                       

gac cgc acc gag act cgg acc aac tac cct aac atg ttc cgg atc gga        912
Asp Arg Thr Glu Thr Arg Thr Asn Tyr Pro Asn Met Phe Arg Ile Gly           
    290                 295                 300                           

aat ctg gtg ctc tac ata ctg att atc atc cat tgg aac gcc tgt atc        960
Asn Leu Val Leu Tyr Ile Leu Ile Ile Ile His Trp Asn Ala Cys Ile           
305                 310                 315                 320           

tat ttc gcc att tcg aag ttc atc ggt ttc gga acc gat tcc tgg gtg       1008
Tyr Phe Ala Ile Ser Lys Phe Ile Gly Phe Gly Thr Asp Ser Trp Val           
                325                 330                 335               

tac ccc aac atc tcg atc ccc gaa cac ggt cgc ctg tcc cgg aag tac       1056
Tyr Pro Asn Ile Ser Ile Pro Glu His Gly Arg Leu Ser Arg Lys Tyr           
            340                 345                 350                   

atc tac tcc ctg tac tgg tcc act ctg act ctg acc acg atc ggg gaa       1104
Ile Tyr Ser Leu Tyr Trp Ser Thr Leu Thr Leu Thr Thr Ile Gly Glu           
        355                 360                 365                       

acc cct cca ccc gtg aag gac gaa gag tac ctg ttc gtg gtg gtg gac       1152
Thr Pro Pro Pro Val Lys Asp Glu Glu Tyr Leu Phe Val Val Val Asp           
    370                 375                 380                           

ttc ctg gtc gga gtg ttg att ttc gcc acc att gtg gga aac gtg ggc       1200
Phe Leu Val Gly Val Leu Ile Phe Ala Thr Ile Val Gly Asn Val Gly           
385                 390                 395                 400           

tcc atg atc tcc aac atg aac gcg tcg aga gct gag ttc caa gcc aag       1248
Ser Met Ile Ser Asn Met Asn Ala Ser Arg Ala Glu Phe Gln Ala Lys           
                405                 410                 415               

atc gac tcc att aag cag tac atg cag ttc aga aag gtc acc aag gac       1296
Ile Asp Ser Ile Lys Gln Tyr Met Gln Phe Arg Lys Val Thr Lys Asp           
            420                 425                 430                   

ctg gaa acc agg gtc atc cgc tgg ttc gac tac ctg tgg gcc aac aaa       1344
Leu Glu Thr Arg Val Ile Arg Trp Phe Asp Tyr Leu Trp Ala Asn Lys           
        435                 440                 445                       

aag act gtg gac gaa aag gaa gtg ctg aag tcg ctg ccg gat aag ctg       1392
Lys Thr Val Asp Glu Lys Glu Val Leu Lys Ser Leu Pro Asp Lys Leu           
    450                 455                 460                           

aag gcc gaa atc gcc att aac gtg cac ctt gac acc ctg aag aaa gtc       1440
Lys Ala Glu Ile Ala Ile Asn Val His Leu Asp Thr Leu Lys Lys Val           
465                 470                 475                 480           

cgg atc ttc caa gac tgt gaa gcc ggc ctc ctg gtg gag ctc gtg ctc       1488
Arg Ile Phe Gln Asp Cys Glu Ala Gly Leu Leu Val Glu Leu Val Leu           
                485                 490                 495               

aag ctg cgg ccc acc gtg ttc agc ccg gga gat tac att tgc aag aag       1536
Lys Leu Arg Pro Thr Val Phe Ser Pro Gly Asp Tyr Ile Cys Lys Lys           
            500                 505                 510                   

ggc gat atc ggc aaa gag atg tac atc atc aac gag gga aag ctg gcc       1584
Gly Asp Ile Gly Lys Glu Met Tyr Ile Ile Asn Glu Gly Lys Leu Ala           
        515                 520                 525                       

gtg gtc gcg gac gac ggc gtg acc cag ttc gtg gtg ctg tcc gac gga       1632
Val Val Ala Asp Asp Gly Val Thr Gln Phe Val Val Leu Ser Asp Gly           
    530                 535                 540                           

tcc tac ttc ggt gaa atc tca atc ctc aac atc aag ggg tcc aag tcc       1680
Ser Tyr Phe Gly Glu Ile Ser Ile Leu Asn Ile Lys Gly Ser Lys Ser           
545                 550                 555                 560           

ggc aac cgg aga act gcc aac att cgc tcc atc gga tac agc gac ctg       1728
Gly Asn Arg Arg Thr Ala Asn Ile Arg Ser Ile Gly Tyr Ser Asp Leu           
                565                 570                 575               

ttt tgc ctg tcc aag gat gac ctg atg gag gct ctg act gag tac cct       1776
Phe Cys Leu Ser Lys Asp Asp Leu Met Glu Ala Leu Thr Glu Tyr Pro           
            580                 585                 590                   

gaa gcg aag aag gct ttg gag gaa aag ggg cgg cag att ctg atg aag       1824
Glu Ala Lys Lys Ala Leu Glu Glu Lys Gly Arg Gln Ile Leu Met Lys           
        595                 600                 605                       

gac aat ttg atc gac gag gag ctc gca cgg gcc ggc gcc gac ccc aag       1872
Asp Asn Leu Ile Asp Glu Glu Leu Ala Arg Ala Gly Ala Asp Pro Lys           
    610                 615                 620                           

gat ctc gaa gag aag gtc gaa cag ctg ggt tct tcg ctt gat acc ctg       1920
Asp Leu Glu Glu Lys Val Glu Gln Leu Gly Ser Ser Leu Asp Thr Leu           
625                 630                 635                 640           

caa acc cga ttc gcg cgg ctg ctc gcc gag tac aac gcg acc cag atg       1968
Gln Thr Arg Phe Ala Arg Leu Leu Ala Glu Tyr Asn Ala Thr Gln Met           
                645                 650                 655               

aag atg aag cag aga ctg tca cag ttg gaa tcc caa gtc aag ggc gga       2016
Lys Met Lys Gln Arg Leu Ser Gln Leu Glu Ser Gln Val Lys Gly Gly           
            660                 665                 670                   

ggc gac aag ccg ctg gcg gac ggg gaa gtg ccc ggg gac gcc acc aag       2064
Gly Asp Lys Pro Leu Ala Asp Gly Glu Val Pro Gly Asp Ala Thr Lys           
        675                 680                 685                       

act gag gac aag cag cag tga                                           2085
Thr Glu Asp Lys Gln Gln                                                   
    690                                                                   


<210>  10
<211>  694
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  10

Met Ala Lys Ile Asn Thr Gln Tyr Ser His Pro Ser Arg Thr His Leu 
1               5                   10                  15      


Lys Val Lys Thr Ser Asp Arg Asp Leu Asn Arg Ala Glu Asn Gly Leu 
            20                  25                  30          


Ser Arg Ala His Ser Ser Ser Glu Glu Thr Ser Ser Val Leu Gln Pro 
        35                  40                  45              


Gly Ile Ala Met Glu Thr Arg Gly Leu Ala Asp Ser Gly Gln Gly Ser 
    50                  55                  60                  


Phe Thr Gly Gln Gly Ile Ala Arg Leu Ser Arg Leu Ile Phe Leu Leu 
65                  70                  75                  80  


Arg Arg Trp Ala Ala Arg His Val His His Gln Asp Gln Gly Pro Asp 
                85                  90                  95      


Ser Phe Pro Asp Arg Phe Arg Gly Ala Glu Leu Lys Glu Val Ser Ser 
            100                 105                 110         


Gln Glu Ser Asn Ala Gln Ala Asn Val Gly Ser Gln Glu Pro Ala Asp 
        115                 120                 125             


Arg Gly Arg Ser Ala Trp Pro Leu Ala Lys Cys Asn Thr Asn Thr Ser 
    130                 135                 140                 


Asn Asn Thr Glu Glu Glu Lys Lys Thr Lys Lys Lys Asp Ala Ile Val 
145                 150                 155                 160 


Val Asp Pro Ser Ser Asn Leu Tyr Tyr Arg Trp Leu Thr Ala Ile Ala 
                165                 170                 175     


Leu Pro Val Phe Tyr Asn Trp Tyr Leu Leu Ile Cys Arg Ala Cys Phe 
            180                 185                 190         


Asp Glu Leu Gln Ser Glu Tyr Leu Met Leu Trp Leu Val Leu Asp Tyr 
        195                 200                 205             


Ser Ala Asp Val Leu Tyr Val Leu Asp Val Leu Val Arg Ala Arg Thr 
    210                 215                 220                 


Gly Phe Leu Glu Gln Gly Leu Met Val Ser Asp Thr Asn Arg Leu Trp 
225                 230                 235                 240 


Gln His Tyr Lys Thr Thr Thr Gln Phe Lys Leu Asp Val Leu Ser Leu 
                245                 250                 255     


Val Pro Thr Asp Leu Ala Tyr Leu Lys Val Gly Thr Asn Tyr Pro Glu 
            260                 265                 270         


Val Arg Phe Asn Arg Leu Leu Lys Phe Ser Arg Leu Phe Glu Phe Phe 
        275                 280                 285             


Asp Arg Thr Glu Thr Arg Thr Asn Tyr Pro Asn Met Phe Arg Ile Gly 
    290                 295                 300                 


Asn Leu Val Leu Tyr Ile Leu Ile Ile Ile His Trp Asn Ala Cys Ile 
305                 310                 315                 320 


Tyr Phe Ala Ile Ser Lys Phe Ile Gly Phe Gly Thr Asp Ser Trp Val 
                325                 330                 335     


Tyr Pro Asn Ile Ser Ile Pro Glu His Gly Arg Leu Ser Arg Lys Tyr 
            340                 345                 350         


Ile Tyr Ser Leu Tyr Trp Ser Thr Leu Thr Leu Thr Thr Ile Gly Glu 
        355                 360                 365             


Thr Pro Pro Pro Val Lys Asp Glu Glu Tyr Leu Phe Val Val Val Asp 
    370                 375                 380                 


Phe Leu Val Gly Val Leu Ile Phe Ala Thr Ile Val Gly Asn Val Gly 
385                 390                 395                 400 


Ser Met Ile Ser Asn Met Asn Ala Ser Arg Ala Glu Phe Gln Ala Lys 
                405                 410                 415     


Ile Asp Ser Ile Lys Gln Tyr Met Gln Phe Arg Lys Val Thr Lys Asp 
            420                 425                 430         


Leu Glu Thr Arg Val Ile Arg Trp Phe Asp Tyr Leu Trp Ala Asn Lys 
        435                 440                 445             


Lys Thr Val Asp Glu Lys Glu Val Leu Lys Ser Leu Pro Asp Lys Leu 
    450                 455                 460                 


Lys Ala Glu Ile Ala Ile Asn Val His Leu Asp Thr Leu Lys Lys Val 
465                 470                 475                 480 


Arg Ile Phe Gln Asp Cys Glu Ala Gly Leu Leu Val Glu Leu Val Leu 
                485                 490                 495     


Lys Leu Arg Pro Thr Val Phe Ser Pro Gly Asp Tyr Ile Cys Lys Lys 
            500                 505                 510         


Gly Asp Ile Gly Lys Glu Met Tyr Ile Ile Asn Glu Gly Lys Leu Ala 
        515                 520                 525             


Val Val Ala Asp Asp Gly Val Thr Gln Phe Val Val Leu Ser Asp Gly 
    530                 535                 540                 


Ser Tyr Phe Gly Glu Ile Ser Ile Leu Asn Ile Lys Gly Ser Lys Ser 
545                 550                 555                 560 


Gly Asn Arg Arg Thr Ala Asn Ile Arg Ser Ile Gly Tyr Ser Asp Leu 
                565                 570                 575     


Phe Cys Leu Ser Lys Asp Asp Leu Met Glu Ala Leu Thr Glu Tyr Pro 
            580                 585                 590         


Glu Ala Lys Lys Ala Leu Glu Glu Lys Gly Arg Gln Ile Leu Met Lys 
        595                 600                 605             


Asp Asn Leu Ile Asp Glu Glu Leu Ala Arg Ala Gly Ala Asp Pro Lys 
    610                 615                 620                 


Asp Leu Glu Glu Lys Val Glu Gln Leu Gly Ser Ser Leu Asp Thr Leu 
625                 630                 635                 640 


Gln Thr Arg Phe Ala Arg Leu Leu Ala Glu Tyr Asn Ala Thr Gln Met 
                645                 650                 655     


Lys Met Lys Gln Arg Leu Ser Gln Leu Glu Ser Gln Val Lys Gly Gly 
            660                 665                 670         


Gly Asp Lys Pro Leu Ala Asp Gly Glu Val Pro Gly Asp Ala Thr Lys 
        675                 680                 685             


Thr Glu Asp Lys Gln Gln 
    690                 


<210>  11
<211>  2250
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  codon optimized sequence


<220>
<221>  CDS
<222>  (1)..(2250)
<223>  codon-optimized ORF

<400>  11
atg gct aag att aac acc cag tac tca cat cca tcc cgc act cac ctc         48
Met Ala Lys Ile Asn Thr Gln Tyr Ser His Pro Ser Arg Thr His Leu           
1               5                   10                  15                

aaa gtc aag acc tcc gat cgg gat ctg aac cgg gct gag aat ggg ctg         96
Lys Val Lys Thr Ser Asp Arg Asp Leu Asn Arg Ala Glu Asn Gly Leu           
            20                  25                  30                    

tcg cgc gcc cac tcg tcg tcc gag gaa acc agc agc gtg ctc cag ccg        144
Ser Arg Ala His Ser Ser Ser Glu Glu Thr Ser Ser Val Leu Gln Pro           
        35                  40                  45                        

ggc atc gcc atg gaa act agg ggg ctg gcg gac tcc gga cag gga tcc        192
Gly Ile Ala Met Glu Thr Arg Gly Leu Ala Asp Ser Gly Gln Gly Ser           
    50                  55                  60                            

ttc act gga cag ggt att gcc cgg ttc ggg cgg att cag aag aag tcc        240
Phe Thr Gly Gln Gly Ile Ala Arg Phe Gly Arg Ile Gln Lys Lys Ser           
65                  70                  75                  80            

cag ccg gag aag gtc gtg cgg gct gcc agc agg ggc agg cca ctc att        288
Gln Pro Glu Lys Val Val Arg Ala Ala Ser Arg Gly Arg Pro Leu Ile           
                85                  90                  95                

ggt tgg aca cag tgg tgc gct gag gat ggt gga gat gaa tcg gaa atg        336
Gly Trp Thr Gln Trp Cys Ala Glu Asp Gly Gly Asp Glu Ser Glu Met           
            100                 105                 110                   

gca ctg gcc ggc tct ccc gga tgc agc tcg ggc ccc caa ggg aga ctg        384
Ala Leu Ala Gly Ser Pro Gly Cys Ser Ser Gly Pro Gln Gly Arg Leu           
        115                 120                 125                       

agc aga ctg atc ttc ctg ctt cgc cgc tgg gcg gcc aga cac gtg cac        432
Ser Arg Leu Ile Phe Leu Leu Arg Arg Trp Ala Ala Arg His Val His           
    130                 135                 140                           

cat cag gac cag gga cct gat agc ttc ccc gac cgc ttt agg gga gcc        480
His Gln Asp Gln Gly Pro Asp Ser Phe Pro Asp Arg Phe Arg Gly Ala           
145                 150                 155                 160           

gag ctg aaa gaa gtg tca agc cag gag tca aac gcg cag gcc aac gtc        528
Glu Leu Lys Glu Val Ser Ser Gln Glu Ser Asn Ala Gln Ala Asn Val           
                165                 170                 175               

ggc agc caa gag cct gca gac cgg gga cgc tcg gca tgg ccg ctc gca        576
Gly Ser Gln Glu Pro Ala Asp Arg Gly Arg Ser Ala Trp Pro Leu Ala           
            180                 185                 190                   

aag tgc aac act aac act tcc aac aac acc gaa gag gaa aag aaa acc        624
Lys Cys Asn Thr Asn Thr Ser Asn Asn Thr Glu Glu Glu Lys Lys Thr           
        195                 200                 205                       

aag aag aag gat gca att gtg gtg gac cct tcc tcc aac ctg tac tac        672
Lys Lys Lys Asp Ala Ile Val Val Asp Pro Ser Ser Asn Leu Tyr Tyr           
    210                 215                 220                           

cgc tgg ttg acc gcc atc gcc ctc ccg gtc ttt tac aat tgg tat ctc        720
Arg Trp Leu Thr Ala Ile Ala Leu Pro Val Phe Tyr Asn Trp Tyr Leu           
225                 230                 235                 240           

ctt atc tgc cgg gcc tgc ttc gac gaa ctg caa tca gag tac ctg atg        768
Leu Ile Cys Arg Ala Cys Phe Asp Glu Leu Gln Ser Glu Tyr Leu Met           
                245                 250                 255               

ctg tgg ctg gtg ctg gac tat agc gcc gat gtg ctc tac gtc ctg gat        816
Leu Trp Leu Val Leu Asp Tyr Ser Ala Asp Val Leu Tyr Val Leu Asp           
            260                 265                 270                   

gtg ctc gtg cgc gcc cgg acc gga ttc ttg gaa caa ggc ctg atg gtg        864
Val Leu Val Arg Ala Arg Thr Gly Phe Leu Glu Gln Gly Leu Met Val           
        275                 280                 285                       

tcc gac acg aat aga ctg tgg cag cac tat aag acc aca acc cag ttc        912
Ser Asp Thr Asn Arg Leu Trp Gln His Tyr Lys Thr Thr Thr Gln Phe           
    290                 295                 300                           

aag ctt gac gtg ctc agc ctt gtg ccg act gac ctg gcc tac ctg aaa        960
Lys Leu Asp Val Leu Ser Leu Val Pro Thr Asp Leu Ala Tyr Leu Lys           
305                 310                 315                 320           

gtc gga act aac tac ccg gaa gtc aga ttc aac cga ctc ctg aag ttc       1008
Val Gly Thr Asn Tyr Pro Glu Val Arg Phe Asn Arg Leu Leu Lys Phe           
                325                 330                 335               

agc agg ctg ttc gag ttc ttt gac cgc acc gag act cgg acc aac tac       1056
Ser Arg Leu Phe Glu Phe Phe Asp Arg Thr Glu Thr Arg Thr Asn Tyr           
            340                 345                 350                   

cct aac atg ttc cgg atc gga aat ctg gtg ctc tac ata ctg att atc       1104
Pro Asn Met Phe Arg Ile Gly Asn Leu Val Leu Tyr Ile Leu Ile Ile           
        355                 360                 365                       

atc cat tgg aac gcc tgt atc tat ttc gcc att tcg aag ttc atc ggt       1152
Ile His Trp Asn Ala Cys Ile Tyr Phe Ala Ile Ser Lys Phe Ile Gly           
    370                 375                 380                           

ttc gga acc gat tcc tgg gtg tac ccc aac atc tcg atc ccc gaa cac       1200
Phe Gly Thr Asp Ser Trp Val Tyr Pro Asn Ile Ser Ile Pro Glu His           
385                 390                 395                 400           

ggt cgc ctg tcc cgg aag tac atc tac tcc ctg tac tgg tcc act ctg       1248
Gly Arg Leu Ser Arg Lys Tyr Ile Tyr Ser Leu Tyr Trp Ser Thr Leu           
                405                 410                 415               

act ctg acc acg atc ggg gaa acc cct cca ccc gtg aag gac gaa gag       1296
Thr Leu Thr Thr Ile Gly Glu Thr Pro Pro Pro Val Lys Asp Glu Glu           
            420                 425                 430                   

tac ctg ttc gtg gtg gtg gac ttc ctg gtc gga gtg ttg att ttc gcc       1344
Tyr Leu Phe Val Val Val Asp Phe Leu Val Gly Val Leu Ile Phe Ala           
        435                 440                 445                       

acc att gtg gga aac gtg ggc tcc atg atc tcc aac atg aac gcg tcg       1392
Thr Ile Val Gly Asn Val Gly Ser Met Ile Ser Asn Met Asn Ala Ser           
    450                 455                 460                           

aga gct gag ttc caa gcc aag atc gac tcc att aag cag tac atg cag       1440
Arg Ala Glu Phe Gln Ala Lys Ile Asp Ser Ile Lys Gln Tyr Met Gln           
465                 470                 475                 480           

ttc aga aag gtc acc aag gac ctg gaa acc agg gtc atc cgc tgg ttc       1488
Phe Arg Lys Val Thr Lys Asp Leu Glu Thr Arg Val Ile Arg Trp Phe           
                485                 490                 495               

gac tac ctg tgg gcc aac aaa aag act gtg gac gaa aag gaa gtg ctg       1536
Asp Tyr Leu Trp Ala Asn Lys Lys Thr Val Asp Glu Lys Glu Val Leu           
            500                 505                 510                   

aag tcg ctg ccg gat aag ctg aag gcc gaa atc gcc att aac gtg cac       1584
Lys Ser Leu Pro Asp Lys Leu Lys Ala Glu Ile Ala Ile Asn Val His           
        515                 520                 525                       

ctt gac acc ctg aag aaa gtc cgg atc ttc caa gac tgt gaa gcc ggc       1632
Leu Asp Thr Leu Lys Lys Val Arg Ile Phe Gln Asp Cys Glu Ala Gly           
    530                 535                 540                           

ctc ctg gtg gag ctc gtg ctc aag ctg cgg ccc acc gtg ttc agc ccg       1680
Leu Leu Val Glu Leu Val Leu Lys Leu Arg Pro Thr Val Phe Ser Pro           
545                 550                 555                 560           

gga gat tac att tgc aag aag ggc gat atc ggc aaa gag atg tac atc       1728
Gly Asp Tyr Ile Cys Lys Lys Gly Asp Ile Gly Lys Glu Met Tyr Ile           
                565                 570                 575               

atc aac gag gga aag ctg gcc gtg gtc gcg gac gac ggc gtg acc cag       1776
Ile Asn Glu Gly Lys Leu Ala Val Val Ala Asp Asp Gly Val Thr Gln           
            580                 585                 590                   

ttc gtg gtg ctg tcc gac gga tcc tac ttc ggt gaa atc tca atc ctc       1824
Phe Val Val Leu Ser Asp Gly Ser Tyr Phe Gly Glu Ile Ser Ile Leu           
        595                 600                 605                       

aac atc aag ggg tcc aag tcc ggc aac cgg aga act gcc aac att cgc       1872
Asn Ile Lys Gly Ser Lys Ser Gly Asn Arg Arg Thr Ala Asn Ile Arg           
    610                 615                 620                           

tcc atc gga tac agc gac ctg ttt tgc ctg tcc aag gat gac ctg atg       1920
Ser Ile Gly Tyr Ser Asp Leu Phe Cys Leu Ser Lys Asp Asp Leu Met           
625                 630                 635                 640           

gag gct ctg act gag tac cct gaa gcg aag aag gct ttg gag gaa aag       1968
Glu Ala Leu Thr Glu Tyr Pro Glu Ala Lys Lys Ala Leu Glu Glu Lys           
                645                 650                 655               

ggg cgg cag att ctg atg aag gac aat ttg atc gac gag gag ctc gca       2016
Gly Arg Gln Ile Leu Met Lys Asp Asn Leu Ile Asp Glu Glu Leu Ala           
            660                 665                 670                   

cgg gcc ggc gcc gac ccc aag gat ctc gaa gag aag gtc gaa cag ctg       2064
Arg Ala Gly Ala Asp Pro Lys Asp Leu Glu Glu Lys Val Glu Gln Leu           
        675                 680                 685                       

ggt tct tcg ctt gat acc ctg caa acc cga ttc gcg cgg ctg ctc gcc       2112
Gly Ser Ser Leu Asp Thr Leu Gln Thr Arg Phe Ala Arg Leu Leu Ala           
    690                 695                 700                           

gag tac aac gcg acc cag atg aag atg aag cag aga ctg tca cag ttg       2160
Glu Tyr Asn Ala Thr Gln Met Lys Met Lys Gln Arg Leu Ser Gln Leu           
705                 710                 715                 720           

gaa tcc caa gtc aag ggc gga ggc gac aag ccg ctg gcg gac ggg gaa       2208
Glu Ser Gln Val Lys Gly Gly Gly Asp Lys Pro Leu Ala Asp Gly Glu           
                725                 730                 735               

gtg ccc ggg gac gcc acc aag act gag gac aag cag cag tga               2250
Val Pro Gly Asp Ala Thr Lys Thr Glu Asp Lys Gln Gln                       
            740                 745                                       


<210>  12
<211>  749
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  12

Met Ala Lys Ile Asn Thr Gln Tyr Ser His Pro Ser Arg Thr His Leu 
1               5                   10                  15      


Lys Val Lys Thr Ser Asp Arg Asp Leu Asn Arg Ala Glu Asn Gly Leu 
            20                  25                  30          


Ser Arg Ala His Ser Ser Ser Glu Glu Thr Ser Ser Val Leu Gln Pro 
        35                  40                  45              


Gly Ile Ala Met Glu Thr Arg Gly Leu Ala Asp Ser Gly Gln Gly Ser 
    50                  55                  60                  


Phe Thr Gly Gln Gly Ile Ala Arg Phe Gly Arg Ile Gln Lys Lys Ser 
65                  70                  75                  80  


Gln Pro Glu Lys Val Val Arg Ala Ala Ser Arg Gly Arg Pro Leu Ile 
                85                  90                  95      


Gly Trp Thr Gln Trp Cys Ala Glu Asp Gly Gly Asp Glu Ser Glu Met 
            100                 105                 110         


Ala Leu Ala Gly Ser Pro Gly Cys Ser Ser Gly Pro Gln Gly Arg Leu 
        115                 120                 125             


Ser Arg Leu Ile Phe Leu Leu Arg Arg Trp Ala Ala Arg His Val His 
    130                 135                 140                 


His Gln Asp Gln Gly Pro Asp Ser Phe Pro Asp Arg Phe Arg Gly Ala 
145                 150                 155                 160 


Glu Leu Lys Glu Val Ser Ser Gln Glu Ser Asn Ala Gln Ala Asn Val 
                165                 170                 175     


Gly Ser Gln Glu Pro Ala Asp Arg Gly Arg Ser Ala Trp Pro Leu Ala 
            180                 185                 190         


Lys Cys Asn Thr Asn Thr Ser Asn Asn Thr Glu Glu Glu Lys Lys Thr 
        195                 200                 205             


Lys Lys Lys Asp Ala Ile Val Val Asp Pro Ser Ser Asn Leu Tyr Tyr 
    210                 215                 220                 


Arg Trp Leu Thr Ala Ile Ala Leu Pro Val Phe Tyr Asn Trp Tyr Leu 
225                 230                 235                 240 


Leu Ile Cys Arg Ala Cys Phe Asp Glu Leu Gln Ser Glu Tyr Leu Met 
                245                 250                 255     


Leu Trp Leu Val Leu Asp Tyr Ser Ala Asp Val Leu Tyr Val Leu Asp 
            260                 265                 270         


Val Leu Val Arg Ala Arg Thr Gly Phe Leu Glu Gln Gly Leu Met Val 
        275                 280                 285             


Ser Asp Thr Asn Arg Leu Trp Gln His Tyr Lys Thr Thr Thr Gln Phe 
    290                 295                 300                 


Lys Leu Asp Val Leu Ser Leu Val Pro Thr Asp Leu Ala Tyr Leu Lys 
305                 310                 315                 320 


Val Gly Thr Asn Tyr Pro Glu Val Arg Phe Asn Arg Leu Leu Lys Phe 
                325                 330                 335     


Ser Arg Leu Phe Glu Phe Phe Asp Arg Thr Glu Thr Arg Thr Asn Tyr 
            340                 345                 350         


Pro Asn Met Phe Arg Ile Gly Asn Leu Val Leu Tyr Ile Leu Ile Ile 
        355                 360                 365             


Ile His Trp Asn Ala Cys Ile Tyr Phe Ala Ile Ser Lys Phe Ile Gly 
    370                 375                 380                 


Phe Gly Thr Asp Ser Trp Val Tyr Pro Asn Ile Ser Ile Pro Glu His 
385                 390                 395                 400 


Gly Arg Leu Ser Arg Lys Tyr Ile Tyr Ser Leu Tyr Trp Ser Thr Leu 
                405                 410                 415     


Thr Leu Thr Thr Ile Gly Glu Thr Pro Pro Pro Val Lys Asp Glu Glu 
            420                 425                 430         


Tyr Leu Phe Val Val Val Asp Phe Leu Val Gly Val Leu Ile Phe Ala 
        435                 440                 445             


Thr Ile Val Gly Asn Val Gly Ser Met Ile Ser Asn Met Asn Ala Ser 
    450                 455                 460                 


Arg Ala Glu Phe Gln Ala Lys Ile Asp Ser Ile Lys Gln Tyr Met Gln 
465                 470                 475                 480 


Phe Arg Lys Val Thr Lys Asp Leu Glu Thr Arg Val Ile Arg Trp Phe 
                485                 490                 495     


Asp Tyr Leu Trp Ala Asn Lys Lys Thr Val Asp Glu Lys Glu Val Leu 
            500                 505                 510         


Lys Ser Leu Pro Asp Lys Leu Lys Ala Glu Ile Ala Ile Asn Val His 
        515                 520                 525             


Leu Asp Thr Leu Lys Lys Val Arg Ile Phe Gln Asp Cys Glu Ala Gly 
    530                 535                 540                 


Leu Leu Val Glu Leu Val Leu Lys Leu Arg Pro Thr Val Phe Ser Pro 
545                 550                 555                 560 


Gly Asp Tyr Ile Cys Lys Lys Gly Asp Ile Gly Lys Glu Met Tyr Ile 
                565                 570                 575     


Ile Asn Glu Gly Lys Leu Ala Val Val Ala Asp Asp Gly Val Thr Gln 
            580                 585                 590         


Phe Val Val Leu Ser Asp Gly Ser Tyr Phe Gly Glu Ile Ser Ile Leu 
        595                 600                 605             


Asn Ile Lys Gly Ser Lys Ser Gly Asn Arg Arg Thr Ala Asn Ile Arg 
    610                 615                 620                 


Ser Ile Gly Tyr Ser Asp Leu Phe Cys Leu Ser Lys Asp Asp Leu Met 
625                 630                 635                 640 


Glu Ala Leu Thr Glu Tyr Pro Glu Ala Lys Lys Ala Leu Glu Glu Lys 
                645                 650                 655     


Gly Arg Gln Ile Leu Met Lys Asp Asn Leu Ile Asp Glu Glu Leu Ala 
            660                 665                 670         


Arg Ala Gly Ala Asp Pro Lys Asp Leu Glu Glu Lys Val Glu Gln Leu 
        675                 680                 685             


Gly Ser Ser Leu Asp Thr Leu Gln Thr Arg Phe Ala Arg Leu Leu Ala 
    690                 695                 700                 


Glu Tyr Asn Ala Thr Gln Met Lys Met Lys Gln Arg Leu Ser Gln Leu 
705                 710                 715                 720 


Glu Ser Gln Val Lys Gly Gly Gly Asp Lys Pro Leu Ala Asp Gly Glu 
                725                 730                 735     


Val Pro Gly Asp Ala Thr Lys Thr Glu Asp Lys Gln Gln 
            740                 745                 


<210>  13
<211>  2085
<212>  DNA
<213>  Homo sapiens


<220>
<221>  CDS
<222>  (1)..(2085)
<223>  native open reading frame (ORF)

<400>  13
atg gcc aag atc aac acc caa tac tcc cac ccc tcc agg acc cac ctc         48
Met Ala Lys Ile Asn Thr Gln Tyr Ser His Pro Ser Arg Thr His Leu           
1               5                   10                  15                

aag gta aag acc tca gac cgg gat ctc aat cgc gct gaa aat ggc ctc         96
Lys Val Lys Thr Ser Asp Arg Asp Leu Asn Arg Ala Glu Asn Gly Leu           
            20                  25                  30                    

agc aga gcc cac tcg tca agt gag gag aca tcg tca gtg ctg cag ccg        144
Ser Arg Ala His Ser Ser Ser Glu Glu Thr Ser Ser Val Leu Gln Pro           
        35                  40                  45                        

ggg atc gcc atg gag acc aga gga ctg gct gac tcc ggg cag ggc tcc        192
Gly Ile Ala Met Glu Thr Arg Gly Leu Ala Asp Ser Gly Gln Gly Ser           
    50                  55                  60                            

ttc acc ggc cag ggg atc gcc agg ctg tcg cgc ctc atc ttc ttg ctg        240
Phe Thr Gly Gln Gly Ile Ala Arg Leu Ser Arg Leu Ile Phe Leu Leu           
65                  70                  75                  80            

cgc agg tgg gct gcc agg cat gtg cac cac cag gac cag gga ccg gac        288
Arg Arg Trp Ala Ala Arg His Val His His Gln Asp Gln Gly Pro Asp           
                85                  90                  95                

tct ttt cct gat cgt ttc cgt gga gcc gag ctt aag gag gtg tcc agc        336
Ser Phe Pro Asp Arg Phe Arg Gly Ala Glu Leu Lys Glu Val Ser Ser           
            100                 105                 110                   

caa gaa agc aat gcc cag gca aat gtg ggc agc cag gag cca gca gac        384
Gln Glu Ser Asn Ala Gln Ala Asn Val Gly Ser Gln Glu Pro Ala Asp           
        115                 120                 125                       

aga ggg aga agc gcc tgg ccc ctg gcc aaa tgc aac act aac acc agc        432
Arg Gly Arg Ser Ala Trp Pro Leu Ala Lys Cys Asn Thr Asn Thr Ser           
    130                 135                 140                           

aac aac acg gag gag gag aag aag acg aaa aag aag gat gcg atc gtg        480
Asn Asn Thr Glu Glu Glu Lys Lys Thr Lys Lys Lys Asp Ala Ile Val           
145                 150                 155                 160           

gtg gac ccg tcc agc aac ctg tac tac cgc tgg ctg acc gcc atc gcc        528
Val Asp Pro Ser Ser Asn Leu Tyr Tyr Arg Trp Leu Thr Ala Ile Ala           
                165                 170                 175               

ctg cct gtc ttc tat aac tgg tat ctg ctt att tgc agg gcc tgt ttc        576
Leu Pro Val Phe Tyr Asn Trp Tyr Leu Leu Ile Cys Arg Ala Cys Phe           
            180                 185                 190                   

gat gag ctg cag tcc gag tac ctg atg ctg tgg ctg gtc ctg gac tac        624
Asp Glu Leu Gln Ser Glu Tyr Leu Met Leu Trp Leu Val Leu Asp Tyr           
        195                 200                 205                       

tcg gca gat gtc ctg tat gtc ttg gat gtg ctt gta cga gct cgg aca        672
Ser Ala Asp Val Leu Tyr Val Leu Asp Val Leu Val Arg Ala Arg Thr           
    210                 215                 220                           

ggt ttt ctt gag caa ggc tta atg gtc agt gat acc aac agg ctg tgg        720
Gly Phe Leu Glu Gln Gly Leu Met Val Ser Asp Thr Asn Arg Leu Trp           
225                 230                 235                 240           

cag cat tac aag acg acc acg cag ttc aag ctg gat gtg ttg tcc ctg        768
Gln His Tyr Lys Thr Thr Thr Gln Phe Lys Leu Asp Val Leu Ser Leu           
                245                 250                 255               

gtc ccc acc gac ctg gct tac tta aag gtg ggc aca aac tac cca gaa        816
Val Pro Thr Asp Leu Ala Tyr Leu Lys Val Gly Thr Asn Tyr Pro Glu           
            260                 265                 270                   

gtg agg ttc aac cgc cta ctg aag ttt tcc cgg ctc ttt gaa ttc ttt        864
Val Arg Phe Asn Arg Leu Leu Lys Phe Ser Arg Leu Phe Glu Phe Phe           
        275                 280                 285                       

gac cgc aca gag aca agg acc aac tac ccc aat atg ttc agg att ggg        912
Asp Arg Thr Glu Thr Arg Thr Asn Tyr Pro Asn Met Phe Arg Ile Gly           
    290                 295                 300                           

aac ttg gtc ttg tac att ctc atc atc atc cac tgg aat gcc tgc atc        960
Asn Leu Val Leu Tyr Ile Leu Ile Ile Ile His Trp Asn Ala Cys Ile           
305                 310                 315                 320           

tac ttt gcc att tcc aag ttc att ggt ttt ggg aca gac tcc tgg gtc       1008
Tyr Phe Ala Ile Ser Lys Phe Ile Gly Phe Gly Thr Asp Ser Trp Val           
                325                 330                 335               

tac cca aac atc tca atc cca gag cat ggg cgc ctc tcc agg aag tac       1056
Tyr Pro Asn Ile Ser Ile Pro Glu His Gly Arg Leu Ser Arg Lys Tyr           
            340                 345                 350                   

att tac agt ctc tac tgg tcc acc ttg acc ctt acc acc att ggt gag       1104
Ile Tyr Ser Leu Tyr Trp Ser Thr Leu Thr Leu Thr Thr Ile Gly Glu           
        355                 360                 365                       

acc cca ccc ccc gtg aaa gat gag gag tat ctc ttt gtg gtc gta gac       1152
Thr Pro Pro Pro Val Lys Asp Glu Glu Tyr Leu Phe Val Val Val Asp           
    370                 375                 380                           

ttc ttg gtg ggt gtt ctg att ttt gcc acc att gtg ggc aat gtg ggc       1200
Phe Leu Val Gly Val Leu Ile Phe Ala Thr Ile Val Gly Asn Val Gly           
385                 390                 395                 400           

tcc atg atc tcg aat atg aat gcc tca cgg gca gag ttc cag gcc aag       1248
Ser Met Ile Ser Asn Met Asn Ala Ser Arg Ala Glu Phe Gln Ala Lys           
                405                 410                 415               

att gat tcc atc aag cag tac atg cag ttc cgc aag gtc acc aag gac       1296
Ile Asp Ser Ile Lys Gln Tyr Met Gln Phe Arg Lys Val Thr Lys Asp           
            420                 425                 430                   

ttg gag acg cgg gtt atc cgg tgg ttt gac tac ctg tgg gcc aac aag       1344
Leu Glu Thr Arg Val Ile Arg Trp Phe Asp Tyr Leu Trp Ala Asn Lys           
        435                 440                 445                       

aag acg gtg gat gag aag gag gtg ctc aag agc ctc cca gac aag ctg       1392
Lys Thr Val Asp Glu Lys Glu Val Leu Lys Ser Leu Pro Asp Lys Leu           
    450                 455                 460                           

aag gct gag atc gcc atc aac gtg cac ctg gac acg ctg aag aag gtt       1440
Lys Ala Glu Ile Ala Ile Asn Val His Leu Asp Thr Leu Lys Lys Val           
465                 470                 475                 480           

cgc atc ttc cag gac tgt gag gca ggg ctg ctg gtg gag ctg gtg ctg       1488
Arg Ile Phe Gln Asp Cys Glu Ala Gly Leu Leu Val Glu Leu Val Leu           
                485                 490                 495               

aag ctg cga ccc act gtg ttc agc cct ggg gat tat atc tgc aag aag       1536
Lys Leu Arg Pro Thr Val Phe Ser Pro Gly Asp Tyr Ile Cys Lys Lys           
            500                 505                 510                   

gga gat att ggg aag gag atg tac atc atc aac gag ggc aag ctg gcc       1584
Gly Asp Ile Gly Lys Glu Met Tyr Ile Ile Asn Glu Gly Lys Leu Ala           
        515                 520                 525                       

gtg gtg gct gat gat ggg gtc acc cag ttc gtg gtc ctc agc gat ggc       1632
Val Val Ala Asp Asp Gly Val Thr Gln Phe Val Val Leu Ser Asp Gly           
    530                 535                 540                           

agc tac ttc ggg gag atc agc att ctg aac atc aag ggg agc aag tcg       1680
Ser Tyr Phe Gly Glu Ile Ser Ile Leu Asn Ile Lys Gly Ser Lys Ser           
545                 550                 555                 560           

ggg aac cgc agg acg gcc aac atc cgc agc att ggc tac tca gac ctg       1728
Gly Asn Arg Arg Thr Ala Asn Ile Arg Ser Ile Gly Tyr Ser Asp Leu           
                565                 570                 575               

ttc tgc ctc tca aag gac gat ctc atg gag gcc ctc acc gag tac ccc       1776
Phe Cys Leu Ser Lys Asp Asp Leu Met Glu Ala Leu Thr Glu Tyr Pro           
            580                 585                 590                   

gaa gcc aag aag gcc ctg gag gag aaa gga cgg cag atc ctg atg aaa       1824
Glu Ala Lys Lys Ala Leu Glu Glu Lys Gly Arg Gln Ile Leu Met Lys           
        595                 600                 605                       

gac aac ctg atc gat gag gag ctg gcc agg gcg ggc gcg gac ccc aag       1872
Asp Asn Leu Ile Asp Glu Glu Leu Ala Arg Ala Gly Ala Asp Pro Lys           
    610                 615                 620                           

gac ctt gag gag aaa gtg gag cag ctg ggg tcc tcc ctg gac acc ctg       1920
Asp Leu Glu Glu Lys Val Glu Gln Leu Gly Ser Ser Leu Asp Thr Leu           
625                 630                 635                 640           

cag acc agg ttt gca cgc ctc ctg gct gag tac aac gcc acc cag atg       1968
Gln Thr Arg Phe Ala Arg Leu Leu Ala Glu Tyr Asn Ala Thr Gln Met           
                645                 650                 655               

aag atg aag cag cgt ctc agc caa ctg gaa agc cag gtg aag ggt ggt       2016
Lys Met Lys Gln Arg Leu Ser Gln Leu Glu Ser Gln Val Lys Gly Gly           
            660                 665                 670                   

ggg gac aag ccc ctg gct gat ggg gaa gtt ccc ggg gat gct aca aaa       2064
Gly Asp Lys Pro Leu Ala Asp Gly Glu Val Pro Gly Asp Ala Thr Lys           
        675                 680                 685                       

aca gag gac aaa caa cag tga                                           2085
Thr Glu Asp Lys Gln Gln                                                   
    690                                                                   


<210>  14
<211>  694
<212>  PRT
<213>  Homo sapiens

<400>  14

Met Ala Lys Ile Asn Thr Gln Tyr Ser His Pro Ser Arg Thr His Leu 
1               5                   10                  15      


Lys Val Lys Thr Ser Asp Arg Asp Leu Asn Arg Ala Glu Asn Gly Leu 
            20                  25                  30          


Ser Arg Ala His Ser Ser Ser Glu Glu Thr Ser Ser Val Leu Gln Pro 
        35                  40                  45              


Gly Ile Ala Met Glu Thr Arg Gly Leu Ala Asp Ser Gly Gln Gly Ser 
    50                  55                  60                  


Phe Thr Gly Gln Gly Ile Ala Arg Leu Ser Arg Leu Ile Phe Leu Leu 
65                  70                  75                  80  


Arg Arg Trp Ala Ala Arg His Val His His Gln Asp Gln Gly Pro Asp 
                85                  90                  95      


Ser Phe Pro Asp Arg Phe Arg Gly Ala Glu Leu Lys Glu Val Ser Ser 
            100                 105                 110         


Gln Glu Ser Asn Ala Gln Ala Asn Val Gly Ser Gln Glu Pro Ala Asp 
        115                 120                 125             


Arg Gly Arg Ser Ala Trp Pro Leu Ala Lys Cys Asn Thr Asn Thr Ser 
    130                 135                 140                 


Asn Asn Thr Glu Glu Glu Lys Lys Thr Lys Lys Lys Asp Ala Ile Val 
145                 150                 155                 160 


Val Asp Pro Ser Ser Asn Leu Tyr Tyr Arg Trp Leu Thr Ala Ile Ala 
                165                 170                 175     


Leu Pro Val Phe Tyr Asn Trp Tyr Leu Leu Ile Cys Arg Ala Cys Phe 
            180                 185                 190         


Asp Glu Leu Gln Ser Glu Tyr Leu Met Leu Trp Leu Val Leu Asp Tyr 
        195                 200                 205             


Ser Ala Asp Val Leu Tyr Val Leu Asp Val Leu Val Arg Ala Arg Thr 
    210                 215                 220                 


Gly Phe Leu Glu Gln Gly Leu Met Val Ser Asp Thr Asn Arg Leu Trp 
225                 230                 235                 240 


Gln His Tyr Lys Thr Thr Thr Gln Phe Lys Leu Asp Val Leu Ser Leu 
                245                 250                 255     


Val Pro Thr Asp Leu Ala Tyr Leu Lys Val Gly Thr Asn Tyr Pro Glu 
            260                 265                 270         


Val Arg Phe Asn Arg Leu Leu Lys Phe Ser Arg Leu Phe Glu Phe Phe 
        275                 280                 285             


Asp Arg Thr Glu Thr Arg Thr Asn Tyr Pro Asn Met Phe Arg Ile Gly 
    290                 295                 300                 


Asn Leu Val Leu Tyr Ile Leu Ile Ile Ile His Trp Asn Ala Cys Ile 
305                 310                 315                 320 


Tyr Phe Ala Ile Ser Lys Phe Ile Gly Phe Gly Thr Asp Ser Trp Val 
                325                 330                 335     


Tyr Pro Asn Ile Ser Ile Pro Glu His Gly Arg Leu Ser Arg Lys Tyr 
            340                 345                 350         


Ile Tyr Ser Leu Tyr Trp Ser Thr Leu Thr Leu Thr Thr Ile Gly Glu 
        355                 360                 365             


Thr Pro Pro Pro Val Lys Asp Glu Glu Tyr Leu Phe Val Val Val Asp 
    370                 375                 380                 


Phe Leu Val Gly Val Leu Ile Phe Ala Thr Ile Val Gly Asn Val Gly 
385                 390                 395                 400 


Ser Met Ile Ser Asn Met Asn Ala Ser Arg Ala Glu Phe Gln Ala Lys 
                405                 410                 415     


Ile Asp Ser Ile Lys Gln Tyr Met Gln Phe Arg Lys Val Thr Lys Asp 
            420                 425                 430         


Leu Glu Thr Arg Val Ile Arg Trp Phe Asp Tyr Leu Trp Ala Asn Lys 
        435                 440                 445             


Lys Thr Val Asp Glu Lys Glu Val Leu Lys Ser Leu Pro Asp Lys Leu 
    450                 455                 460                 


Lys Ala Glu Ile Ala Ile Asn Val His Leu Asp Thr Leu Lys Lys Val 
465                 470                 475                 480 


Arg Ile Phe Gln Asp Cys Glu Ala Gly Leu Leu Val Glu Leu Val Leu 
                485                 490                 495     


Lys Leu Arg Pro Thr Val Phe Ser Pro Gly Asp Tyr Ile Cys Lys Lys 
            500                 505                 510         


Gly Asp Ile Gly Lys Glu Met Tyr Ile Ile Asn Glu Gly Lys Leu Ala 
        515                 520                 525             


Val Val Ala Asp Asp Gly Val Thr Gln Phe Val Val Leu Ser Asp Gly 
    530                 535                 540                 


Ser Tyr Phe Gly Glu Ile Ser Ile Leu Asn Ile Lys Gly Ser Lys Ser 
545                 550                 555                 560 


Gly Asn Arg Arg Thr Ala Asn Ile Arg Ser Ile Gly Tyr Ser Asp Leu 
                565                 570                 575     


Phe Cys Leu Ser Lys Asp Asp Leu Met Glu Ala Leu Thr Glu Tyr Pro 
            580                 585                 590         


Glu Ala Lys Lys Ala Leu Glu Glu Lys Gly Arg Gln Ile Leu Met Lys 
        595                 600                 605             


Asp Asn Leu Ile Asp Glu Glu Leu Ala Arg Ala Gly Ala Asp Pro Lys 
    610                 615                 620                 


Asp Leu Glu Glu Lys Val Glu Gln Leu Gly Ser Ser Leu Asp Thr Leu 
625                 630                 635                 640 


Gln Thr Arg Phe Ala Arg Leu Leu Ala Glu Tyr Asn Ala Thr Gln Met 
                645                 650                 655     


Lys Met Lys Gln Arg Leu Ser Gln Leu Glu Ser Gln Val Lys Gly Gly 
            660                 665                 670         


Gly Asp Lys Pro Leu Ala Asp Gly Glu Val Pro Gly Asp Ala Thr Lys 
        675                 680                 685             


Thr Glu Asp Lys Gln Gln 
    690                 


<210>  15
<211>  2085
<212>  DNA
<213>  Homo sapiens

<400>  15
atggccaaga tcaacaccca atactcccac ccctccagga cccacctcaa ggtaaagacc       60

tcagaccgag atctcaatcg cgctgaaaat ggcctcagca gagcccactc gtcaagtgag      120

gagacatcgt cagtgctgca gccggggatc gccatggaga ccagaggact ggctgactcc      180

gggcagggct ccttcaccgg ccaggggatc gccaggctgt cgcgcctcat cttcttgctg      240

cgcaggtggg ctgccaggca tgtgcaccac caggaccagg gaccggactc ttttcctgat      300

cgtttccgtg gagccgagct taaggaggtg tccagccaag aaagcaatgc ccaggcaaat      360

gtgggcagcc aggagccagc agacagaggg agaagcgcct ggcccctggc caaatgcaac      420

actaacacca gcaacaacac ggaggaggag aagaagacga aaaagaagga tgcgatcgtg      480

gtggacccgt ccagcaacct gtactaccgc tggctgaccg ccatcgccct gcctgtcttc      540

tataactggt atctgcttat ttgcagggcc tgtttcgatg agctgcagtc cgagtacctg      600

atgctgtggc tggtcctgga ctactcggca gatgtcctgt atgtcttgga tgtgcttgta      660

cgagctcgga caggttttct cgagcaaggc ttaatggtca gtgataccaa caggctgtgg      720

cagcattaca agacgaccac gcagttcaag ctggatgtgt tgtccctggt ccccaccgac      780

ctggcttact taaaggtggg cacaaactac ccagaagtga ggttcaaccg cctactgaag      840

ttttcccggc tctttgaatt ctttgaccgc acagagacaa ggaccaacta ccccaatatg      900

ttcaggattg ggaacttggt cttgtacatt ctcatcatca tccactggaa tgcctgcatc      960

tactttgcca tttccaagtt cattggtttt gggacagact cctgggtcta cccaaacatc     1020

tcaatcccag agcatgggcg cctctccagg aagtacattt acagtctcta ctggtccacc     1080

ttgaccctta ccaccattgg tgagacccca ccccccgtga aagatgagga gtatctcttt     1140

gtggtcgtag acttcttggt gggtgttctg atttttgcca ccattgtggg caatgtgggc     1200

tccatgatct cgaatatgaa tgcctcacgg gcagagttcc aggccaagat tgattccatc     1260

aagcagtaca tgcagttccg caaggtcacc aaggacttgg agacgcgggt tatccggtgg     1320

tttgactacc tgtgggccaa caagaagacg gtggatgaga aggaggtgct caagagcctc     1380

ccagacaagc tgaaggctga gatcgccatc aacgtgcacc tggacacgct gaagaaggtt     1440

cgcatcttcc aggactgtga ggcagggctg ctggtggagc tggtgctgaa gctgcgaccc     1500

actgtgttca gccctgggga ttatatctgc aagaagggag atattgggaa ggagatgtac     1560

atcatcaacg agggcaagct ggccgtggtg gctgatgatg gggtcaccca gttcgtggtc     1620

ctcagcgatg gcagctactt cggggagatc agcattctga acatcaaggg gagcaagtcg     1680

gggaaccgca ggacggccaa catccgcagc attggctact cagacctgtt ctgcctctca     1740

aaggacgatc tcatggaggc cctcaccgag taccccgaag ccaagaaggc cctggaggag     1800

aaaggacggc agatcctgat gaaagacaac ctgatcgatg aggagctggc cagggcgggc     1860

gcggacccca aggaccttga ggagaaagtg gagcagctgg ggtcctccct ggacaccctg     1920

cagaccaggt ttgcacgcct cctggctgag tacaacgcca cccagatgaa gatgaagcag     1980

cgtctcagcc aactggaaag ccaggtgaag ggtggtgggg acaagcccct ggctgatggg     2040

gaagttcccg gggatgctac aaaaacagag gacaaacaac agtga                     2085


<210>  16
<211>  2107
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  16
gcggccgcca ccatggctaa gattaacacc cagtactcac atccatcccg cactcacctc       60

aaagtcaaga cctccgatcg ggatctgaac cgggctgaga atgggctgtc gcgcgcccac      120

tcgtcgtccg aggaaaccag cagcgtgctc cagccgggca tcgccatgga aactaggggg      180

ctggcggact ccggacaggg atccttcact ggacagggta ttgcccggct gagcagactg      240

atcttcctgc ttcgccgctg ggcggccaga cacgtgcacc atcaggacca gggacctgat      300

agcttccccg accgctttag gggagccgag ctgaaagaag tgtcaagcca ggagtcaaac      360

gcgcaggcca acgtcggcag ccaagagcct gcagaccggg gacgctcggc atggccgctc      420

gcaaagtgca acactaacac ttccaacaac accgaagagg aaaagaaaac caagaagaag      480

gatgcaattg tggtggaccc ttcctccaac ctgtactacc gctggttgac cgccatcgcc      540

ctcccggtct tttacaattg gtatctcctt atctgccggg cctgcttcga cgaactgcaa      600

tcagagtacc tgatgctgtg gctggtgctg gactatagcg ccgatgtgct ctacgtcctg      660

gatgtgctcg tgcgcgcccg gaccggattc ttggaacaag gcctgatggt gtccgacacg      720

aatagactgt ggcagcacta taagaccaca acccagttca agcttgacgt gctcagcctt      780

gtgccgactg acctggccta cctgaaagtc ggaactaact acccggaagt cagattcaac      840

cgactcctga agttcagcag gctgttcgag ttctttgacc gcaccgagac tcggaccaac      900

taccctaaca tgttccggat cggaaatctg gtgctctaca tactgattat catccattgg      960

aacgcctgta tctatttcgc catttcgaag ttcatcggtt tcggaaccga ttcctgggtg     1020

taccccaaca tctcgatccc cgaacacggt cgcctgtccc ggaagtacat ctactccctg     1080

tactggtcca ctctgactct gaccacgatc ggggaaaccc ctccacccgt gaaggacgaa     1140

gagtacctgt tcgtggtggt ggacttcctg gtcggagtgt tgattttcgc caccattgtg     1200

ggaaacgtgg gctccatgat ctccaacatg aacgcgtcga gagctgagtt ccaagccaag     1260

atcgactcca ttaagcagta catgcagttc agaaaggtca ccaaggacct ggaaaccagg     1320

gtcatccgct ggttcgacta cctgtgggcc aacaaaaaga ctgtggacga aaaggaagtg     1380

ctgaagtcgc tgccggataa gctgaaggcc gaaatcgcca ttaacgtgca ccttgacacc     1440

ctgaagaaag tccggatctt ccaagactgt gaagccggcc tcctggtgga gctcgtgctc     1500

aagctgcggc ccaccgtgtt cagcccggga gattacattt gcaagaaggg cgatatcggc     1560

aaagagatgt acatcatcaa cgagggaaag ctggccgtgg tcgcggacga cggcgtgacc     1620

cagttcgtgg tgctgtccga cggatcctac ttcggtgaaa tctcaatcct caacatcaag     1680

gggtccaagt ccggcaaccg gagaactgcc aacattcgct ccatcggata cagcgacctg     1740

ttttgcctgt ccaaggatga cctgatggag gctctgactg agtaccctga agcgaagaag     1800

gctttggagg aaaaggggcg gcagattctg atgaaggaca atttgatcga cgaggagctc     1860

gcacgggccg gcgccgaccc caaggatctc gaagagaagg tcgaacagct gggttcttcg     1920

cttgataccc tgcaaacccg attcgcgcgg ctgctcgccg agtacaacgc gacccagatg     1980

aagatgaagc agagactgtc acagttggaa tcccaagtca agggcggagg cgacaagccg     2040

ctggcggacg gggaagtgcc cggggacgcc accaagactg aggacaagca gcagtgatca     2100

tagatct                                                               2107


<210>  17
<211>  2272
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  17
gcggccgcca ccatggctaa gattaacacc cagtactcac atccatcccg cactcacctc       60

aaagtcaaga cctccgatcg ggatctgaac cgggctgaga atgggctgtc gcgcgcccac      120

tcgtcgtccg aggaaaccag cagcgtgctc cagccgggca tcgccatgga aactaggggg      180

ctggcggact ccggacaggg atccttcact ggacagggta ttgcccggtt cgggcggatt      240

cagaagaagt cccagccgga gaaggtcgtg cgggctgcca gcaggggcag gccactcatt      300

ggttggacac agtggtgcgc tgaggatggt ggagatgaat cggaaatggc actggccggc      360

tctcccggat gcagctcggg cccccaaggg agactgagca gactgatctt cctgcttcgc      420

cgctgggcgg ccagacacgt gcaccatcag gaccagggac ctgatagctt ccccgaccgc      480

tttaggggag ccgagctgaa agaagtgtca agccaggagt caaacgcgca ggccaacgtc      540

ggcagccaag agcctgcaga ccggggacgc tcggcatggc cgctcgcaaa gtgcaacact      600

aacacttcca acaacaccga agaggaaaag aaaaccaaga agaaggatgc aattgtggtg      660

gacccttcct ccaacctgta ctaccgctgg ttgaccgcca tcgccctccc ggtcttttac      720

aattggtatc tccttatctg ccgggcctgc ttcgacgaac tgcaatcaga gtacctgatg      780

ctgtggctgg tgctggacta tagcgccgat gtgctctacg tcctggatgt gctcgtgcgc      840

gcccggaccg gattcttgga acaaggcctg atggtgtccg acacgaatag actgtggcag      900

cactataaga ccacaaccca gttcaagctt gacgtgctca gccttgtgcc gactgacctg      960

gcctacctga aagtcggaac taactacccg gaagtcagat tcaaccgact cctgaagttc     1020

agcaggctgt tcgagttctt tgaccgcacc gagactcgga ccaactaccc taacatgttc     1080

cggatcggaa atctggtgct ctacatactg attatcatcc attggaacgc ctgtatctat     1140

ttcgccattt cgaagttcat cggtttcgga accgattcct gggtgtaccc caacatctcg     1200

atccccgaac acggtcgcct gtcccggaag tacatctact ccctgtactg gtccactctg     1260

actctgacca cgatcgggga aacccctcca cccgtgaagg acgaagagta cctgttcgtg     1320

gtggtggact tcctggtcgg agtgttgatt ttcgccacca ttgtgggaaa cgtgggctcc     1380

atgatctcca acatgaacgc gtcgagagct gagttccaag ccaagatcga ctccattaag     1440

cagtacatgc agttcagaaa ggtcaccaag gacctggaaa ccagggtcat ccgctggttc     1500

gactacctgt gggccaacaa aaagactgtg gacgaaaagg aagtgctgaa gtcgctgccg     1560

gataagctga aggccgaaat cgccattaac gtgcaccttg acaccctgaa gaaagtccgg     1620

atcttccaag actgtgaagc cggcctcctg gtggagctcg tgctcaagct gcggcccacc     1680

gtgttcagcc cgggagatta catttgcaag aagggcgata tcggcaaaga gatgtacatc     1740

atcaacgagg gaaagctggc cgtggtcgcg gacgacggcg tgacccagtt cgtggtgctg     1800

tccgacggat cctacttcgg tgaaatctca atcctcaaca tcaaggggtc caagtccggc     1860

aaccggagaa ctgccaacat tcgctccatc ggatacagcg acctgttttg cctgtccaag     1920

gatgacctga tggaggctct gactgagtac cctgaagcga agaaggcttt ggaggaaaag     1980

gggcggcaga ttctgatgaa ggacaatttg atcgacgagg agctcgcacg ggccggcgcc     2040

gaccccaagg atctcgaaga gaaggtcgaa cagctgggtt cttcgcttga taccctgcaa     2100

acccgattcg cgcggctgct cgccgagtac aacgcgaccc agatgaagat gaagcagaga     2160

ctgtcacagt tggaatccca agtcaagggc ggaggcgaca agccgctggc ggacggggaa     2220

gtgcccgggg acgccaccaa gactgaggac aagcagcagt gatcatagat ct             2272


<210>  18
<211>  2107
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  18
gcggccgcca ccatggccaa gatcaacacc caatactccc acccctccag gacccacctc       60

aaggtaaaga cctcagaccg ggatctcaat cgcgctgaaa atggcctcag cagagcccac      120

tcgtcaagtg aggagacatc gtcagtgctg cagccgggga tcgccatgga gaccagagga      180

ctggctgact ccgggcaggg ctccttcacc ggccagggga tcgccaggct gtcgcgcctc      240

atcttcttgc tgcgcaggtg ggctgccagg catgtgcacc accaggacca gggaccggac      300

tcttttcctg atcgtttccg tggagccgag cttaaggagg tgtccagcca agaaagcaat      360

gcccaggcaa atgtgggcag ccaggagcca gcagacagag ggagaagcgc ctggcccctg      420

gccaaatgca acactaacac cagcaacaac acggaggagg agaagaagac gaaaaagaag      480

gatgcgatcg tggtggaccc gtccagcaac ctgtactacc gctggctgac cgccatcgcc      540

ctgcctgtct tctataactg gtatctgctt atttgcaggg cctgtttcga tgagctgcag      600

tccgagtacc tgatgctgtg gctggtcctg gactactcgg cagatgtcct gtatgtcttg      660

gatgtgcttg tacgagctcg gacaggtttt cttgagcaag gcttaatggt cagtgatacc      720

aacaggctgt ggcagcatta caagacgacc acgcagttca agctggatgt gttgtccctg      780

gtccccaccg acctggctta cttaaaggtg ggcacaaact acccagaagt gaggttcaac      840

cgcctactga agttttcccg gctctttgaa ttctttgacc gcacagagac aaggaccaac      900

taccccaata tgttcaggat tgggaacttg gtcttgtaca ttctcatcat catccactgg      960

aatgcctgca tctactttgc catttccaag ttcattggtt ttgggacaga ctcctgggtc     1020

tacccaaaca tctcaatccc agagcatggg cgcctctcca ggaagtacat ttacagtctc     1080

tactggtcca ccttgaccct taccaccatt ggtgagaccc caccccccgt gaaagatgag     1140

gagtatctct ttgtggtcgt agacttcttg gtgggtgttc tgatttttgc caccattgtg     1200

ggcaatgtgg gctccatgat ctcgaatatg aatgcctcac gggcagagtt ccaggccaag     1260

attgattcca tcaagcagta catgcagttc cgcaaggtca ccaaggactt ggagacgcgg     1320

gttatccggt ggtttgacta cctgtgggcc aacaagaaga cggtggatga gaaggaggtg     1380

ctcaagagcc tcccagacaa gctgaaggct gagatcgcca tcaacgtgca cctggacacg     1440

ctgaagaagg ttcgcatctt ccaggactgt gaggcagggc tgctggtgga gctggtgctg     1500

aagctgcgac ccactgtgtt cagccctggg gattatatct gcaagaaggg agatattggg     1560

aaggagatgt acatcatcaa cgagggcaag ctggccgtgg tggctgatga tggggtcacc     1620

cagttcgtgg tcctcagcga tggcagctac ttcggggaga tcagcattct gaacatcaag     1680

gggagcaagt cggggaaccg caggacggcc aacatccgca gcattggcta ctcagacctg     1740

ttctgcctct caaaggacga tctcatggag gccctcaccg agtaccccga agccaagaag     1800

gccctggagg agaaaggacg gcagatcctg atgaaagaca acctgatcga tgaggagctg     1860

gccagggcgg gcgcggaccc caaggacctt gaggagaaag tggagcagct ggggtcctcc     1920

ctggacaccc tgcagaccag gtttgcacgc ctcctggctg agtacaacgc cacccagatg     1980

aagatgaagc agcgtctcag ccaactggaa agccaggtga agggtggtgg ggacaagccc     2040

ctggctgatg gggaagttcc cggggatgct acaaaaacag aggacaaaca acagtgatca     2100

tagatct                                                               2107


<210>  19
<211>  2430
<212>  DNA
<213>  Homo sapiens


<220>
<221>  CDS
<222>  (1)..(2430)

<400>  19
atg ttt aaa tcg ctg aca aaa gtc aac aag gtg aag cct ata gga gag         48
Met Phe Lys Ser Leu Thr Lys Val Asn Lys Val Lys Pro Ile Gly Glu           
1               5                   10                  15                

aac aat gag aat gaa caa agt tct cgt cgg aat gaa gaa ggc tct cac         96
Asn Asn Glu Asn Glu Gln Ser Ser Arg Arg Asn Glu Glu Gly Ser His           
            20                  25                  30                    

cca agt aat cag tct cag caa acc aca gca cag gaa gaa aac aaa ggt        144
Pro Ser Asn Gln Ser Gln Gln Thr Thr Ala Gln Glu Glu Asn Lys Gly           
        35                  40                  45                        

gaa gag aaa tct ctc aaa acc aag tca act cca gtc acg tct gaa gag        192
Glu Glu Lys Ser Leu Lys Thr Lys Ser Thr Pro Val Thr Ser Glu Glu           
    50                  55                  60                            

cca cac acc aac ata caa gac aaa ctc tcc aag aaa aat tcc tct gga        240
Pro His Thr Asn Ile Gln Asp Lys Leu Ser Lys Lys Asn Ser Ser Gly           
65                  70                  75                  80            

gat ctg acc aca aac cct gac cct caa aat gca gca gaa cca act gga        288
Asp Leu Thr Thr Asn Pro Asp Pro Gln Asn Ala Ala Glu Pro Thr Gly           
                85                  90                  95                

aca gtg cca gag cag aag gaa atg gac ccc ggg aaa gaa ggt cca aac        336
Thr Val Pro Glu Gln Lys Glu Met Asp Pro Gly Lys Glu Gly Pro Asn           
            100                 105                 110                   

agc cca caa aac aaa ccg cct gca gct cct gtt ata aat gag tat gcc        384
Ser Pro Gln Asn Lys Pro Pro Ala Ala Pro Val Ile Asn Glu Tyr Ala           
        115                 120                 125                       

gat gcc cag cta cac aac ctg gtg aaa aga atg cgt caa aga aca gcc        432
Asp Ala Gln Leu His Asn Leu Val Lys Arg Met Arg Gln Arg Thr Ala           
    130                 135                 140                           

ctc tac aag aaa aag ttg gta gag gga gat ctc tcc tca ccc gaa gcc        480
Leu Tyr Lys Lys Lys Leu Val Glu Gly Asp Leu Ser Ser Pro Glu Ala           
145                 150                 155                 160           

agc cca caa act gca aag ccc acg gct gta cca cca gta aaa gaa agc        528
Ser Pro Gln Thr Ala Lys Pro Thr Ala Val Pro Pro Val Lys Glu Ser           
                165                 170                 175               

gat gat aag cca aca gaa cat tac tac agg ctg ttg tgg ttc aaa gtc        576
Asp Asp Lys Pro Thr Glu His Tyr Tyr Arg Leu Leu Trp Phe Lys Val           
            180                 185                 190                   

aaa aag atg cct tta aca gag tac tta aag cga att aaa ctt cca aac        624
Lys Lys Met Pro Leu Thr Glu Tyr Leu Lys Arg Ile Lys Leu Pro Asn           
        195                 200                 205                       

agc ata gat tca tac aca gat cga ctc tat ctc ctg tgg ctc ttg ctt        672
Ser Ile Asp Ser Tyr Thr Asp Arg Leu Tyr Leu Leu Trp Leu Leu Leu           
    210                 215                 220                           

gtc act ctt gcc tat aac tgg aac tgc tgt ttt ata cca ctg cgc ctc        720
Val Thr Leu Ala Tyr Asn Trp Asn Cys Cys Phe Ile Pro Leu Arg Leu           
225                 230                 235                 240           

gtc ttc cca tat caa acc gca gac aac ata cac tac tgg ctt att gcg        768
Val Phe Pro Tyr Gln Thr Ala Asp Asn Ile His Tyr Trp Leu Ile Ala           
                245                 250                 255               

gac atc ata tgt gat atc atc tac ctt tat gat atg cta ttt atc cag        816
Asp Ile Ile Cys Asp Ile Ile Tyr Leu Tyr Asp Met Leu Phe Ile Gln           
            260                 265                 270                   

ccc aga ctc cag ttt gta aga gga gga gac ata ata gtg gat tca aat        864
Pro Arg Leu Gln Phe Val Arg Gly Gly Asp Ile Ile Val Asp Ser Asn           
        275                 280                 285                       

gag cta agg aaa cac tac agg act tct aca aaa ttt cag ttg gat gtc        912
Glu Leu Arg Lys His Tyr Arg Thr Ser Thr Lys Phe Gln Leu Asp Val           
    290                 295                 300                           

gca tca ata ata cca ttt gat att tgc tac ctc ttc ttt ggg ttt aat        960
Ala Ser Ile Ile Pro Phe Asp Ile Cys Tyr Leu Phe Phe Gly Phe Asn           
305                 310                 315                 320           

cca atg ttt aga gca aat agg atg tta aag tac act tca ttt ttt gaa       1008
Pro Met Phe Arg Ala Asn Arg Met Leu Lys Tyr Thr Ser Phe Phe Glu           
                325                 330                 335               

ttt aat cat cac cta gag tct ata atg gac aaa gca tat atc tac aga       1056
Phe Asn His His Leu Glu Ser Ile Met Asp Lys Ala Tyr Ile Tyr Arg           
            340                 345                 350                   

gtt att cga aca act gga tac ttg ctg ttt att ctg cac att aat gcc       1104
Val Ile Arg Thr Thr Gly Tyr Leu Leu Phe Ile Leu His Ile Asn Ala           
        355                 360                 365                       

tgt gtt tat tac tgg gct tca aac tat gaa gga att ggc act act aga       1152
Cys Val Tyr Tyr Trp Ala Ser Asn Tyr Glu Gly Ile Gly Thr Thr Arg           
    370                 375                 380                           

tgg gtg tat gat ggg gaa gga aac gag tat ctg aga tgt tat tat tgg       1200
Trp Val Tyr Asp Gly Glu Gly Asn Glu Tyr Leu Arg Cys Tyr Tyr Trp           
385                 390                 395                 400           

gca gtt cga act tta att acc att ggt ggc ctt cca gaa cca caa act       1248
Ala Val Arg Thr Leu Ile Thr Ile Gly Gly Leu Pro Glu Pro Gln Thr           
                405                 410                 415               

tta ttt gaa att gtt ttt caa ctc ttg aat ttt ttt tct gga gtt ttt       1296
Leu Phe Glu Ile Val Phe Gln Leu Leu Asn Phe Phe Ser Gly Val Phe           
            420                 425                 430                   

gtg ttc tcc agt tta att ggt cag atg aga gat gtg att gga gca gct       1344
Val Phe Ser Ser Leu Ile Gly Gln Met Arg Asp Val Ile Gly Ala Ala           
        435                 440                 445                       

aca gcc aat cag aac tac ttc cgc gcc tgc atg gat gac acc att gcc       1392
Thr Ala Asn Gln Asn Tyr Phe Arg Ala Cys Met Asp Asp Thr Ile Ala           
    450                 455                 460                           

tac atg aac aat tac tcc att cct aaa ctt gtg caa aag cga gtt cgg       1440
Tyr Met Asn Asn Tyr Ser Ile Pro Lys Leu Val Gln Lys Arg Val Arg           
465                 470                 475                 480           

act tgg tat gaa tat aca tgg gac tct caa aga atg cta gat gag tct       1488
Thr Trp Tyr Glu Tyr Thr Trp Asp Ser Gln Arg Met Leu Asp Glu Ser           
                485                 490                 495               

gat ttg ctt aag acc cta cca act acg gtc cag tta gcc ctc gcc att       1536
Asp Leu Leu Lys Thr Leu Pro Thr Thr Val Gln Leu Ala Leu Ala Ile           
            500                 505                 510                   

gat gtg aac ttc agc atc atc agc aaa gtc gac ttg ttc aag ggt tgt       1584
Asp Val Asn Phe Ser Ile Ile Ser Lys Val Asp Leu Phe Lys Gly Cys           
        515                 520                 525                       

gat aca cag atg att tat gac atg ttg cta aga ttg aaa tcc gtt ctc       1632
Asp Thr Gln Met Ile Tyr Asp Met Leu Leu Arg Leu Lys Ser Val Leu           
    530                 535                 540                           

tat ttg cct ggt gac ttt gtc tgc aaa aag gga gaa att ggc aag gaa       1680
Tyr Leu Pro Gly Asp Phe Val Cys Lys Lys Gly Glu Ile Gly Lys Glu           
545                 550                 555                 560           

atg tat atc atc aag cat gga gaa gtc caa gtt ctt gga ggc cct gat       1728
Met Tyr Ile Ile Lys His Gly Glu Val Gln Val Leu Gly Gly Pro Asp           
                565                 570                 575               

ggt act aaa gtt ctg gtt act ctg aaa gct ggg tcg gtg ttt gga gaa       1776
Gly Thr Lys Val Leu Val Thr Leu Lys Ala Gly Ser Val Phe Gly Glu           
            580                 585                 590                   

atc agc ctt cta gca gca gga gga gga aac cgt cga act gcc aat gtg       1824
Ile Ser Leu Leu Ala Ala Gly Gly Gly Asn Arg Arg Thr Ala Asn Val           
        595                 600                 605                       

gtg gcc cac ggg ttt gcc aat ctt tta act cta gac aaa aag acc ctc       1872
Val Ala His Gly Phe Ala Asn Leu Leu Thr Leu Asp Lys Lys Thr Leu           
    610                 615                 620                           

caa gaa att cta gtg cat tat cca gat tct gaa agg atc ctc atg aag       1920
Gln Glu Ile Leu Val His Tyr Pro Asp Ser Glu Arg Ile Leu Met Lys           
625                 630                 635                 640           

aaa gcc aga gtg ctt tta aag cag aag gct aag acc gca gaa gca acc       1968
Lys Ala Arg Val Leu Leu Lys Gln Lys Ala Lys Thr Ala Glu Ala Thr           
                645                 650                 655               

cct cca aga aaa gat ctt gcc ctc ctc ttc cca ccg aaa gaa gag aca       2016
Pro Pro Arg Lys Asp Leu Ala Leu Leu Phe Pro Pro Lys Glu Glu Thr           
            660                 665                 670                   

ccc aaa ctg ttt aaa act ctc cta gga ggc aca gga aaa gca agt ctt       2064
Pro Lys Leu Phe Lys Thr Leu Leu Gly Gly Thr Gly Lys Ala Ser Leu           
        675                 680                 685                       

gca aga cta ctc aaa ttg aag cga gag caa gca gct cag aag aaa gaa       2112
Ala Arg Leu Leu Lys Leu Lys Arg Glu Gln Ala Ala Gln Lys Lys Glu           
    690                 695                 700                           

aat tct gaa gga gga gag gaa gaa gga aaa gaa aat gaa gat aaa caa       2160
Asn Ser Glu Gly Gly Glu Glu Glu Gly Lys Glu Asn Glu Asp Lys Gln           
705                 710                 715                 720           

aaa gaa aat gaa gat aaa caa aaa gaa aat gaa gat aaa gga aaa gaa       2208
Lys Glu Asn Glu Asp Lys Gln Lys Glu Asn Glu Asp Lys Gly Lys Glu           
                725                 730                 735               

aat gaa gat aaa gat aaa gga aga gag cca gaa gag aag cca ctg gac       2256
Asn Glu Asp Lys Asp Lys Gly Arg Glu Pro Glu Glu Lys Pro Leu Asp           
            740                 745                 750                   

aga cct gaa tgt aca gca agt cct att gca gtg gag gaa gaa ccc cac       2304
Arg Pro Glu Cys Thr Ala Ser Pro Ile Ala Val Glu Glu Glu Pro His           
        755                 760                 765                       

tca gtt aga agg aca gtt tta ccc aga ggg act tct cgt caa tca ctc       2352
Ser Val Arg Arg Thr Val Leu Pro Arg Gly Thr Ser Arg Gln Ser Leu           
    770                 775                 780                           

att atc agc atg gct cct tct gct gag ggc gga gaa gag gtt ctt act       2400
Ile Ile Ser Met Ala Pro Ser Ala Glu Gly Gly Glu Glu Val Leu Thr           
785                 790                 795                 800           

att gaa gtc aaa gaa aag gct aag caa taa                               2430
Ile Glu Val Lys Glu Lys Ala Lys Gln                                       
                805                                                       


<210>  20
<211>  809
<212>  PRT
<213>  Homo sapiens

<400>  20

Met Phe Lys Ser Leu Thr Lys Val Asn Lys Val Lys Pro Ile Gly Glu 
1               5                   10                  15      


Asn Asn Glu Asn Glu Gln Ser Ser Arg Arg Asn Glu Glu Gly Ser His 
            20                  25                  30          


Pro Ser Asn Gln Ser Gln Gln Thr Thr Ala Gln Glu Glu Asn Lys Gly 
        35                  40                  45              


Glu Glu Lys Ser Leu Lys Thr Lys Ser Thr Pro Val Thr Ser Glu Glu 
    50                  55                  60                  


Pro His Thr Asn Ile Gln Asp Lys Leu Ser Lys Lys Asn Ser Ser Gly 
65                  70                  75                  80  


Asp Leu Thr Thr Asn Pro Asp Pro Gln Asn Ala Ala Glu Pro Thr Gly 
                85                  90                  95      


Thr Val Pro Glu Gln Lys Glu Met Asp Pro Gly Lys Glu Gly Pro Asn 
            100                 105                 110         


Ser Pro Gln Asn Lys Pro Pro Ala Ala Pro Val Ile Asn Glu Tyr Ala 
        115                 120                 125             


Asp Ala Gln Leu His Asn Leu Val Lys Arg Met Arg Gln Arg Thr Ala 
    130                 135                 140                 


Leu Tyr Lys Lys Lys Leu Val Glu Gly Asp Leu Ser Ser Pro Glu Ala 
145                 150                 155                 160 


Ser Pro Gln Thr Ala Lys Pro Thr Ala Val Pro Pro Val Lys Glu Ser 
                165                 170                 175     


Asp Asp Lys Pro Thr Glu His Tyr Tyr Arg Leu Leu Trp Phe Lys Val 
            180                 185                 190         


Lys Lys Met Pro Leu Thr Glu Tyr Leu Lys Arg Ile Lys Leu Pro Asn 
        195                 200                 205             


Ser Ile Asp Ser Tyr Thr Asp Arg Leu Tyr Leu Leu Trp Leu Leu Leu 
    210                 215                 220                 


Val Thr Leu Ala Tyr Asn Trp Asn Cys Cys Phe Ile Pro Leu Arg Leu 
225                 230                 235                 240 


Val Phe Pro Tyr Gln Thr Ala Asp Asn Ile His Tyr Trp Leu Ile Ala 
                245                 250                 255     


Asp Ile Ile Cys Asp Ile Ile Tyr Leu Tyr Asp Met Leu Phe Ile Gln 
            260                 265                 270         


Pro Arg Leu Gln Phe Val Arg Gly Gly Asp Ile Ile Val Asp Ser Asn 
        275                 280                 285             


Glu Leu Arg Lys His Tyr Arg Thr Ser Thr Lys Phe Gln Leu Asp Val 
    290                 295                 300                 


Ala Ser Ile Ile Pro Phe Asp Ile Cys Tyr Leu Phe Phe Gly Phe Asn 
305                 310                 315                 320 


Pro Met Phe Arg Ala Asn Arg Met Leu Lys Tyr Thr Ser Phe Phe Glu 
                325                 330                 335     


Phe Asn His His Leu Glu Ser Ile Met Asp Lys Ala Tyr Ile Tyr Arg 
            340                 345                 350         


Val Ile Arg Thr Thr Gly Tyr Leu Leu Phe Ile Leu His Ile Asn Ala 
        355                 360                 365             


Cys Val Tyr Tyr Trp Ala Ser Asn Tyr Glu Gly Ile Gly Thr Thr Arg 
    370                 375                 380                 


Trp Val Tyr Asp Gly Glu Gly Asn Glu Tyr Leu Arg Cys Tyr Tyr Trp 
385                 390                 395                 400 


Ala Val Arg Thr Leu Ile Thr Ile Gly Gly Leu Pro Glu Pro Gln Thr 
                405                 410                 415     


Leu Phe Glu Ile Val Phe Gln Leu Leu Asn Phe Phe Ser Gly Val Phe 
            420                 425                 430         


Val Phe Ser Ser Leu Ile Gly Gln Met Arg Asp Val Ile Gly Ala Ala 
        435                 440                 445             


Thr Ala Asn Gln Asn Tyr Phe Arg Ala Cys Met Asp Asp Thr Ile Ala 
    450                 455                 460                 


Tyr Met Asn Asn Tyr Ser Ile Pro Lys Leu Val Gln Lys Arg Val Arg 
465                 470                 475                 480 


Thr Trp Tyr Glu Tyr Thr Trp Asp Ser Gln Arg Met Leu Asp Glu Ser 
                485                 490                 495     


Asp Leu Leu Lys Thr Leu Pro Thr Thr Val Gln Leu Ala Leu Ala Ile 
            500                 505                 510         


Asp Val Asn Phe Ser Ile Ile Ser Lys Val Asp Leu Phe Lys Gly Cys 
        515                 520                 525             


Asp Thr Gln Met Ile Tyr Asp Met Leu Leu Arg Leu Lys Ser Val Leu 
    530                 535                 540                 


Tyr Leu Pro Gly Asp Phe Val Cys Lys Lys Gly Glu Ile Gly Lys Glu 
545                 550                 555                 560 


Met Tyr Ile Ile Lys His Gly Glu Val Gln Val Leu Gly Gly Pro Asp 
                565                 570                 575     


Gly Thr Lys Val Leu Val Thr Leu Lys Ala Gly Ser Val Phe Gly Glu 
            580                 585                 590         


Ile Ser Leu Leu Ala Ala Gly Gly Gly Asn Arg Arg Thr Ala Asn Val 
        595                 600                 605             


Val Ala His Gly Phe Ala Asn Leu Leu Thr Leu Asp Lys Lys Thr Leu 
    610                 615                 620                 


Gln Glu Ile Leu Val His Tyr Pro Asp Ser Glu Arg Ile Leu Met Lys 
625                 630                 635                 640 


Lys Ala Arg Val Leu Leu Lys Gln Lys Ala Lys Thr Ala Glu Ala Thr 
                645                 650                 655     


Pro Pro Arg Lys Asp Leu Ala Leu Leu Phe Pro Pro Lys Glu Glu Thr 
            660                 665                 670         


Pro Lys Leu Phe Lys Thr Leu Leu Gly Gly Thr Gly Lys Ala Ser Leu 
        675                 680                 685             


Ala Arg Leu Leu Lys Leu Lys Arg Glu Gln Ala Ala Gln Lys Lys Glu 
    690                 695                 700                 


Asn Ser Glu Gly Gly Glu Glu Glu Gly Lys Glu Asn Glu Asp Lys Gln 
705                 710                 715                 720 


Lys Glu Asn Glu Asp Lys Gln Lys Glu Asn Glu Asp Lys Gly Lys Glu 
                725                 730                 735     


Asn Glu Asp Lys Asp Lys Gly Arg Glu Pro Glu Glu Lys Pro Leu Asp 
            740                 745                 750         


Arg Pro Glu Cys Thr Ala Ser Pro Ile Ala Val Glu Glu Glu Pro His 
        755                 760                 765             


Ser Val Arg Arg Thr Val Leu Pro Arg Gly Thr Ser Arg Gln Ser Leu 
    770                 775                 780                 


Ile Ile Ser Met Ala Pro Ser Ala Glu Gly Gly Glu Glu Val Leu Thr 
785                 790                 795                 800 


Ile Glu Val Lys Glu Lys Ala Lys Gln 
                805                 


<210>  21
<211>  2430
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence


<220>
<221>  CDS
<222>  (1)..(2430)

<400>  21
atg ttt aaa tcg ctg aca aaa gtc aac aag gtg aag cct ata gga gag         48
Met Phe Lys Ser Leu Thr Lys Val Asn Lys Val Lys Pro Ile Gly Glu           
1               5                   10                  15                

aac aat gag aat gaa caa agt tct cgt cgg aat gaa gaa ggc tct cac         96
Asn Asn Glu Asn Glu Gln Ser Ser Arg Arg Asn Glu Glu Gly Ser His           
            20                  25                  30                    

cca agt aat cag tct cag caa acc aca gca cag gaa gaa aac aaa ggt        144
Pro Ser Asn Gln Ser Gln Gln Thr Thr Ala Gln Glu Glu Asn Lys Gly           
        35                  40                  45                        

gaa gag aaa tct ctc aaa acc aag tca act cca gtc acg tct gaa gag        192
Glu Glu Lys Ser Leu Lys Thr Lys Ser Thr Pro Val Thr Ser Glu Glu           
    50                  55                  60                            

cca cac acc aac ata caa gac aaa ctc tcc aag aaa aat tcc tct gga        240
Pro His Thr Asn Ile Gln Asp Lys Leu Ser Lys Lys Asn Ser Ser Gly           
65                  70                  75                  80            

gat ctg acc aca aac cct gac cct caa aat gca gca gaa cca act gga        288
Asp Leu Thr Thr Asn Pro Asp Pro Gln Asn Ala Ala Glu Pro Thr Gly           
                85                  90                  95                

aca gtg cca gag cag aag gaa atg gac ccc ggg aaa gaa ggt cca aac        336
Thr Val Pro Glu Gln Lys Glu Met Asp Pro Gly Lys Glu Gly Pro Asn           
            100                 105                 110                   

agc cca caa aac aaa ccg cca gca gct cct gtt ata aat gag tat gcc        384
Ser Pro Gln Asn Lys Pro Pro Ala Ala Pro Val Ile Asn Glu Tyr Ala           
        115                 120                 125                       

gat gcc cag cta cac aac ctg gtg aaa aga atg cgt caa aga aca gcc        432
Asp Ala Gln Leu His Asn Leu Val Lys Arg Met Arg Gln Arg Thr Ala           
    130                 135                 140                           

ctc tac aag aaa aag ttg gta gag gga gat ctc tcc tca ccc gaa gcc        480
Leu Tyr Lys Lys Lys Leu Val Glu Gly Asp Leu Ser Ser Pro Glu Ala           
145                 150                 155                 160           

agc cca caa act gca aag ccc acg gct gta cca cca gta aaa gaa agc        528
Ser Pro Gln Thr Ala Lys Pro Thr Ala Val Pro Pro Val Lys Glu Ser           
                165                 170                 175               

gat gat aag cca aca gaa cat tac tac agg ctg ttg tgg ttc aaa gtc        576
Asp Asp Lys Pro Thr Glu His Tyr Tyr Arg Leu Leu Trp Phe Lys Val           
            180                 185                 190                   

aaa aag atg cct tta aca gag tac tta aag cga att aaa ctt cca aac        624
Lys Lys Met Pro Leu Thr Glu Tyr Leu Lys Arg Ile Lys Leu Pro Asn           
        195                 200                 205                       

agc ata gat tca tac aca gat cga ctc tat ctc ctg tgg ctc ttg ctt        672
Ser Ile Asp Ser Tyr Thr Asp Arg Leu Tyr Leu Leu Trp Leu Leu Leu           
    210                 215                 220                           

gtc act ctt gcc tat aac tgg aac tgc tgt ttt ata cca ctg cgc ctc        720
Val Thr Leu Ala Tyr Asn Trp Asn Cys Cys Phe Ile Pro Leu Arg Leu           
225                 230                 235                 240           

gtc ttc cca tat caa acc gca gac aac ata cac tac tgg ctt att gcg        768
Val Phe Pro Tyr Gln Thr Ala Asp Asn Ile His Tyr Trp Leu Ile Ala           
                245                 250                 255               

gac atc atc tgt gat atc atc tac ctt tat gat atg cta ttt atc cag        816
Asp Ile Ile Cys Asp Ile Ile Tyr Leu Tyr Asp Met Leu Phe Ile Gln           
            260                 265                 270                   

ccc aga ctc cag ttt gta aga gga gga gac ata ata gtg gat tca aat        864
Pro Arg Leu Gln Phe Val Arg Gly Gly Asp Ile Ile Val Asp Ser Asn           
        275                 280                 285                       

gag cta agg aaa cac tac agg act tct aca aaa ttt cag ttg gat gtc        912
Glu Leu Arg Lys His Tyr Arg Thr Ser Thr Lys Phe Gln Leu Asp Val           
    290                 295                 300                           

gca tca ata ata cca ttt gat att tgc tac ctc ttc ttt ggg ttt aat        960
Ala Ser Ile Ile Pro Phe Asp Ile Cys Tyr Leu Phe Phe Gly Phe Asn           
305                 310                 315                 320           

cca atg ttt aga gca aat agg atg tta aag tac act tca ttt ttt gaa       1008
Pro Met Phe Arg Ala Asn Arg Met Leu Lys Tyr Thr Ser Phe Phe Glu           
                325                 330                 335               

ttt aat cat cac cta gag tct ata atg gac aaa gca tat atc tac aga       1056
Phe Asn His His Leu Glu Ser Ile Met Asp Lys Ala Tyr Ile Tyr Arg           
            340                 345                 350                   

gtt att cga aca act gga tac ttg ctg ttt att ctg cac att aat gcc       1104
Val Ile Arg Thr Thr Gly Tyr Leu Leu Phe Ile Leu His Ile Asn Ala           
        355                 360                 365                       

tgt gtt tat tac tgg gct tca aac tat gaa gga att ggc act act aga       1152
Cys Val Tyr Tyr Trp Ala Ser Asn Tyr Glu Gly Ile Gly Thr Thr Arg           
    370                 375                 380                           

tgg gtg tat gat ggg gaa gga aac gag tat ctg aga tgt tat tat tgg       1200
Trp Val Tyr Asp Gly Glu Gly Asn Glu Tyr Leu Arg Cys Tyr Tyr Trp           
385                 390                 395                 400           

gca gtt cga act tta att acc att ggt ggc ctt cca gaa cca caa act       1248
Ala Val Arg Thr Leu Ile Thr Ile Gly Gly Leu Pro Glu Pro Gln Thr           
                405                 410                 415               

tta ttt gaa att gtt ttt caa ctc ttg aat ttt ttt tct gga gtt ttt       1296
Leu Phe Glu Ile Val Phe Gln Leu Leu Asn Phe Phe Ser Gly Val Phe           
            420                 425                 430                   

gtg ttc tcc agt tta att ggt cag atg aga gat gtg att gga gca gct       1344
Val Phe Ser Ser Leu Ile Gly Gln Met Arg Asp Val Ile Gly Ala Ala           
        435                 440                 445                       

aca gcc aat cag aac tac ttc cgc gcc tgc atg gat gac acc att gcc       1392
Thr Ala Asn Gln Asn Tyr Phe Arg Ala Cys Met Asp Asp Thr Ile Ala           
    450                 455                 460                           

tac atg aac aat tac tcc att cct aaa ctt gtg caa aag cga gtt cgg       1440
Tyr Met Asn Asn Tyr Ser Ile Pro Lys Leu Val Gln Lys Arg Val Arg           
465                 470                 475                 480           

act tgg tat gaa tat aca tgg gac tct caa aga atg cta gat gag tct       1488
Thr Trp Tyr Glu Tyr Thr Trp Asp Ser Gln Arg Met Leu Asp Glu Ser           
                485                 490                 495               

gat ttg ctt aag acc cta cca act acg gtc cag tta gcc ctc gcc att       1536
Asp Leu Leu Lys Thr Leu Pro Thr Thr Val Gln Leu Ala Leu Ala Ile           
            500                 505                 510                   

gat gtg aac ttc agc atc atc agc aaa gtt gac ttg ttc aag ggt tgt       1584
Asp Val Asn Phe Ser Ile Ile Ser Lys Val Asp Leu Phe Lys Gly Cys           
        515                 520                 525                       

gat aca cag atg att tat gac atg ttg cta aga ttg aaa tcc gtt ctc       1632
Asp Thr Gln Met Ile Tyr Asp Met Leu Leu Arg Leu Lys Ser Val Leu           
    530                 535                 540                           

tat ttg cct ggt gac ttt gtc tgc aaa aag gga gaa att ggc aag gaa       1680
Tyr Leu Pro Gly Asp Phe Val Cys Lys Lys Gly Glu Ile Gly Lys Glu           
545                 550                 555                 560           

atg tat atc atc aag cat gga gaa gtc caa gtt ctt gga ggc cct gat       1728
Met Tyr Ile Ile Lys His Gly Glu Val Gln Val Leu Gly Gly Pro Asp           
                565                 570                 575               

ggt act aaa gtt ctg gtt act ctg aaa gct ggg tcg gtg ttt gga gaa       1776
Gly Thr Lys Val Leu Val Thr Leu Lys Ala Gly Ser Val Phe Gly Glu           
            580                 585                 590                   

atc agc ctt cta gca gca gga gga gga aac cgt cga act gcc aat gtg       1824
Ile Ser Leu Leu Ala Ala Gly Gly Gly Asn Arg Arg Thr Ala Asn Val           
        595                 600                 605                       

gtg gcc cac ggg ttt gcc aat ctt tta act cta gac aaa aag acc ctc       1872
Val Ala His Gly Phe Ala Asn Leu Leu Thr Leu Asp Lys Lys Thr Leu           
    610                 615                 620                           

caa gaa att cta gtg cat tat cca gat tct gaa aga atc ctc atg aag       1920
Gln Glu Ile Leu Val His Tyr Pro Asp Ser Glu Arg Ile Leu Met Lys           
625                 630                 635                 640           

aaa gcc aga gtg ctt tta aag cag aag gct aag acc gca gaa gca acc       1968
Lys Ala Arg Val Leu Leu Lys Gln Lys Ala Lys Thr Ala Glu Ala Thr           
                645                 650                 655               

cct cca aga aaa gat ctt gcc ctc ctc ttc cca ccg aaa gaa gag aca       2016
Pro Pro Arg Lys Asp Leu Ala Leu Leu Phe Pro Pro Lys Glu Glu Thr           
            660                 665                 670                   

ccc aaa ctg ttt aaa act ctc cta gga ggc aca gga aaa gca agt ctt       2064
Pro Lys Leu Phe Lys Thr Leu Leu Gly Gly Thr Gly Lys Ala Ser Leu           
        675                 680                 685                       

gca aga cta ctc aaa ttg aag cga gag caa gca gct cag aag aaa gaa       2112
Ala Arg Leu Leu Lys Leu Lys Arg Glu Gln Ala Ala Gln Lys Lys Glu           
    690                 695                 700                           

aat tct gaa gga gga gag gaa gaa gga aaa gaa aat gaa gat aaa caa       2160
Asn Ser Glu Gly Gly Glu Glu Glu Gly Lys Glu Asn Glu Asp Lys Gln           
705                 710                 715                 720           

aaa gaa aat gaa gat aaa caa aaa gaa aat gaa gat aaa gga aaa gaa       2208
Lys Glu Asn Glu Asp Lys Gln Lys Glu Asn Glu Asp Lys Gly Lys Glu           
                725                 730                 735               

aat gaa gat aaa gat aaa gga aga gag cca gaa gag aag cca ctg gac       2256
Asn Glu Asp Lys Asp Lys Gly Arg Glu Pro Glu Glu Lys Pro Leu Asp           
            740                 745                 750                   

aga cct gaa tgt aca gca agt cct att gca gtg gag gaa gaa ccc cac       2304
Arg Pro Glu Cys Thr Ala Ser Pro Ile Ala Val Glu Glu Glu Pro His           
        755                 760                 765                       

tca gtt aga agg aca gtt tta ccc aga ggg act tct cgt caa tca ctc       2352
Ser Val Arg Arg Thr Val Leu Pro Arg Gly Thr Ser Arg Gln Ser Leu           
    770                 775                 780                           

att atc agc atg gct cct tct gct gag ggc gga gaa gag gtt ctt act       2400
Ile Ile Ser Met Ala Pro Ser Ala Glu Gly Gly Glu Glu Val Leu Thr           
785                 790                 795                 800           

att gaa gtc aaa gaa aag gct aag caa tga                               2430
Ile Glu Val Lys Glu Lys Ala Lys Gln                                       
                805                                                       


<210>  22
<211>  809
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  22

Met Phe Lys Ser Leu Thr Lys Val Asn Lys Val Lys Pro Ile Gly Glu 
1               5                   10                  15      


Asn Asn Glu Asn Glu Gln Ser Ser Arg Arg Asn Glu Glu Gly Ser His 
            20                  25                  30          


Pro Ser Asn Gln Ser Gln Gln Thr Thr Ala Gln Glu Glu Asn Lys Gly 
        35                  40                  45              


Glu Glu Lys Ser Leu Lys Thr Lys Ser Thr Pro Val Thr Ser Glu Glu 
    50                  55                  60                  


Pro His Thr Asn Ile Gln Asp Lys Leu Ser Lys Lys Asn Ser Ser Gly 
65                  70                  75                  80  


Asp Leu Thr Thr Asn Pro Asp Pro Gln Asn Ala Ala Glu Pro Thr Gly 
                85                  90                  95      


Thr Val Pro Glu Gln Lys Glu Met Asp Pro Gly Lys Glu Gly Pro Asn 
            100                 105                 110         


Ser Pro Gln Asn Lys Pro Pro Ala Ala Pro Val Ile Asn Glu Tyr Ala 
        115                 120                 125             


Asp Ala Gln Leu His Asn Leu Val Lys Arg Met Arg Gln Arg Thr Ala 
    130                 135                 140                 


Leu Tyr Lys Lys Lys Leu Val Glu Gly Asp Leu Ser Ser Pro Glu Ala 
145                 150                 155                 160 


Ser Pro Gln Thr Ala Lys Pro Thr Ala Val Pro Pro Val Lys Glu Ser 
                165                 170                 175     


Asp Asp Lys Pro Thr Glu His Tyr Tyr Arg Leu Leu Trp Phe Lys Val 
            180                 185                 190         


Lys Lys Met Pro Leu Thr Glu Tyr Leu Lys Arg Ile Lys Leu Pro Asn 
        195                 200                 205             


Ser Ile Asp Ser Tyr Thr Asp Arg Leu Tyr Leu Leu Trp Leu Leu Leu 
    210                 215                 220                 


Val Thr Leu Ala Tyr Asn Trp Asn Cys Cys Phe Ile Pro Leu Arg Leu 
225                 230                 235                 240 


Val Phe Pro Tyr Gln Thr Ala Asp Asn Ile His Tyr Trp Leu Ile Ala 
                245                 250                 255     


Asp Ile Ile Cys Asp Ile Ile Tyr Leu Tyr Asp Met Leu Phe Ile Gln 
            260                 265                 270         


Pro Arg Leu Gln Phe Val Arg Gly Gly Asp Ile Ile Val Asp Ser Asn 
        275                 280                 285             


Glu Leu Arg Lys His Tyr Arg Thr Ser Thr Lys Phe Gln Leu Asp Val 
    290                 295                 300                 


Ala Ser Ile Ile Pro Phe Asp Ile Cys Tyr Leu Phe Phe Gly Phe Asn 
305                 310                 315                 320 


Pro Met Phe Arg Ala Asn Arg Met Leu Lys Tyr Thr Ser Phe Phe Glu 
                325                 330                 335     


Phe Asn His His Leu Glu Ser Ile Met Asp Lys Ala Tyr Ile Tyr Arg 
            340                 345                 350         


Val Ile Arg Thr Thr Gly Tyr Leu Leu Phe Ile Leu His Ile Asn Ala 
        355                 360                 365             


Cys Val Tyr Tyr Trp Ala Ser Asn Tyr Glu Gly Ile Gly Thr Thr Arg 
    370                 375                 380                 


Trp Val Tyr Asp Gly Glu Gly Asn Glu Tyr Leu Arg Cys Tyr Tyr Trp 
385                 390                 395                 400 


Ala Val Arg Thr Leu Ile Thr Ile Gly Gly Leu Pro Glu Pro Gln Thr 
                405                 410                 415     


Leu Phe Glu Ile Val Phe Gln Leu Leu Asn Phe Phe Ser Gly Val Phe 
            420                 425                 430         


Val Phe Ser Ser Leu Ile Gly Gln Met Arg Asp Val Ile Gly Ala Ala 
        435                 440                 445             


Thr Ala Asn Gln Asn Tyr Phe Arg Ala Cys Met Asp Asp Thr Ile Ala 
    450                 455                 460                 


Tyr Met Asn Asn Tyr Ser Ile Pro Lys Leu Val Gln Lys Arg Val Arg 
465                 470                 475                 480 


Thr Trp Tyr Glu Tyr Thr Trp Asp Ser Gln Arg Met Leu Asp Glu Ser 
                485                 490                 495     


Asp Leu Leu Lys Thr Leu Pro Thr Thr Val Gln Leu Ala Leu Ala Ile 
            500                 505                 510         


Asp Val Asn Phe Ser Ile Ile Ser Lys Val Asp Leu Phe Lys Gly Cys 
        515                 520                 525             


Asp Thr Gln Met Ile Tyr Asp Met Leu Leu Arg Leu Lys Ser Val Leu 
    530                 535                 540                 


Tyr Leu Pro Gly Asp Phe Val Cys Lys Lys Gly Glu Ile Gly Lys Glu 
545                 550                 555                 560 


Met Tyr Ile Ile Lys His Gly Glu Val Gln Val Leu Gly Gly Pro Asp 
                565                 570                 575     


Gly Thr Lys Val Leu Val Thr Leu Lys Ala Gly Ser Val Phe Gly Glu 
            580                 585                 590         


Ile Ser Leu Leu Ala Ala Gly Gly Gly Asn Arg Arg Thr Ala Asn Val 
        595                 600                 605             


Val Ala His Gly Phe Ala Asn Leu Leu Thr Leu Asp Lys Lys Thr Leu 
    610                 615                 620                 


Gln Glu Ile Leu Val His Tyr Pro Asp Ser Glu Arg Ile Leu Met Lys 
625                 630                 635                 640 


Lys Ala Arg Val Leu Leu Lys Gln Lys Ala Lys Thr Ala Glu Ala Thr 
                645                 650                 655     


Pro Pro Arg Lys Asp Leu Ala Leu Leu Phe Pro Pro Lys Glu Glu Thr 
            660                 665                 670         


Pro Lys Leu Phe Lys Thr Leu Leu Gly Gly Thr Gly Lys Ala Ser Leu 
        675                 680                 685             


Ala Arg Leu Leu Lys Leu Lys Arg Glu Gln Ala Ala Gln Lys Lys Glu 
    690                 695                 700                 


Asn Ser Glu Gly Gly Glu Glu Glu Gly Lys Glu Asn Glu Asp Lys Gln 
705                 710                 715                 720 


Lys Glu Asn Glu Asp Lys Gln Lys Glu Asn Glu Asp Lys Gly Lys Glu 
                725                 730                 735     


Asn Glu Asp Lys Asp Lys Gly Arg Glu Pro Glu Glu Lys Pro Leu Asp 
            740                 745                 750         


Arg Pro Glu Cys Thr Ala Ser Pro Ile Ala Val Glu Glu Glu Pro His 
        755                 760                 765             


Ser Val Arg Arg Thr Val Leu Pro Arg Gly Thr Ser Arg Gln Ser Leu 
    770                 775                 780                 


Ile Ile Ser Met Ala Pro Ser Ala Glu Gly Gly Glu Glu Val Leu Thr 
785                 790                 795                 800 


Ile Glu Val Lys Glu Lys Ala Lys Gln 
                805                 


<210>  23
<211>  2454
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence


<220>
<221>  misc_feature
<222>  (1)..(12)
<223>  modified end with NotI site and Kozak

<220>
<221>  misc_feature
<222>  (1)..(8)
<223>  NotI site for subcloning

<220>
<221>  CDS
<222>  (13)..(2448)
<223>  ORF with silent mutations (stop codon and restriction sites 
       BamHI, PstI, SalI, and NdeI)

<220>
<221>  misc_feature
<222>  (2440)..(2442)
<223>  modifed stop codon

<220>
<221>  misc_feature
<222>  (2440)..(2445)
<223>  BclI site to facilitate addition of epitope tag

<220>
<221>  misc_feature
<222>  (2446)..(2448)
<223>  additional stop codon

<220>
<221>  misc_feature
<222>  (2449)..(2454)
<223>  PstI site for subcloning

<400>  23
gcggccgcca cc atg ttt aaa tcg ctg aca aaa gtc aac aag gtg aag cct       51
              Met Phe Lys Ser Leu Thr Lys Val Asn Lys Val Lys Pro         
              1               5                   10                      

ata gga gag aac aat gag aat gaa caa agt tct cgt cgg aat gaa gaa         99
Ile Gly Glu Asn Asn Glu Asn Glu Gln Ser Ser Arg Arg Asn Glu Glu           
    15                  20                  25                            

ggc tct cac cca agt aat cag tct cag caa acc aca gca cag gaa gaa        147
Gly Ser His Pro Ser Asn Gln Ser Gln Gln Thr Thr Ala Gln Glu Glu           
30                  35                  40                  45            

aac aaa ggt gaa gag aaa tct ctc aaa acc aag tca act cca gtc acg        195
Asn Lys Gly Glu Glu Lys Ser Leu Lys Thr Lys Ser Thr Pro Val Thr           
                50                  55                  60                

tct gaa gag cca cac acc aac ata caa gac aaa ctc tcc aag aaa aat        243
Ser Glu Glu Pro His Thr Asn Ile Gln Asp Lys Leu Ser Lys Lys Asn           
            65                  70                  75                    

tcc tct gga gat ctg acc aca aac cct gac cct caa aat gca gca gaa        291
Ser Ser Gly Asp Leu Thr Thr Asn Pro Asp Pro Gln Asn Ala Ala Glu           
        80                  85                  90                        

cca act gga aca gtg cca gag cag aag gaa atg gac ccc ggg aaa gaa        339
Pro Thr Gly Thr Val Pro Glu Gln Lys Glu Met Asp Pro Gly Lys Glu           
    95                  100                 105                           

ggt cca aac agc cca caa aac aaa ccg cca gca gct cct gtt ata aat        387
Gly Pro Asn Ser Pro Gln Asn Lys Pro Pro Ala Ala Pro Val Ile Asn           
110                 115                 120                 125           

gag tat gcc gat gcc cag cta cac aac ctg gtg aaa aga atg cgt caa        435
Glu Tyr Ala Asp Ala Gln Leu His Asn Leu Val Lys Arg Met Arg Gln           
                130                 135                 140               

aga aca gcc ctc tac aag aaa aag ttg gta gag gga gat ctc tcc tca        483
Arg Thr Ala Leu Tyr Lys Lys Lys Leu Val Glu Gly Asp Leu Ser Ser           
            145                 150                 155                   

ccc gaa gcc agc cca caa act gca aag ccc acg gct gta cca cca gta        531
Pro Glu Ala Ser Pro Gln Thr Ala Lys Pro Thr Ala Val Pro Pro Val           
        160                 165                 170                       

aaa gaa agc gat gat aag cca aca gaa cat tac tac agg ctg ttg tgg        579
Lys Glu Ser Asp Asp Lys Pro Thr Glu His Tyr Tyr Arg Leu Leu Trp           
    175                 180                 185                           

ttc aaa gtc aaa aag atg cct tta aca gag tac tta aag cga att aaa        627
Phe Lys Val Lys Lys Met Pro Leu Thr Glu Tyr Leu Lys Arg Ile Lys           
190                 195                 200                 205           

ctt cca aac agc ata gat tca tac aca gat cga ctc tat ctc ctg tgg        675
Leu Pro Asn Ser Ile Asp Ser Tyr Thr Asp Arg Leu Tyr Leu Leu Trp           
                210                 215                 220               

ctc ttg ctt gtc act ctt gcc tat aac tgg aac tgc tgt ttt ata cca        723
Leu Leu Leu Val Thr Leu Ala Tyr Asn Trp Asn Cys Cys Phe Ile Pro           
            225                 230                 235                   

ctg cgc ctc gtc ttc cca tat caa acc gca gac aac ata cac tac tgg        771
Leu Arg Leu Val Phe Pro Tyr Gln Thr Ala Asp Asn Ile His Tyr Trp           
        240                 245                 250                       

ctt att gcg gac atc atc tgt gat atc atc tac ctt tat gat atg cta        819
Leu Ile Ala Asp Ile Ile Cys Asp Ile Ile Tyr Leu Tyr Asp Met Leu           
    255                 260                 265                           

ttt atc cag ccc aga ctc cag ttt gta aga gga gga gac ata ata gtg        867
Phe Ile Gln Pro Arg Leu Gln Phe Val Arg Gly Gly Asp Ile Ile Val           
270                 275                 280                 285           

gat tca aat gag cta agg aaa cac tac agg act tct aca aaa ttt cag        915
Asp Ser Asn Glu Leu Arg Lys His Tyr Arg Thr Ser Thr Lys Phe Gln           
                290                 295                 300               

ttg gat gtc gca tca ata ata cca ttt gat att tgc tac ctc ttc ttt        963
Leu Asp Val Ala Ser Ile Ile Pro Phe Asp Ile Cys Tyr Leu Phe Phe           
            305                 310                 315                   

ggg ttt aat cca atg ttt aga gca aat agg atg tta aag tac act tca       1011
Gly Phe Asn Pro Met Phe Arg Ala Asn Arg Met Leu Lys Tyr Thr Ser           
        320                 325                 330                       

ttt ttt gaa ttt aat cat cac cta gag tct ata atg gac aaa gca tat       1059
Phe Phe Glu Phe Asn His His Leu Glu Ser Ile Met Asp Lys Ala Tyr           
    335                 340                 345                           

atc tac aga gtt att cga aca act gga tac ttg ctg ttt att ctg cac       1107
Ile Tyr Arg Val Ile Arg Thr Thr Gly Tyr Leu Leu Phe Ile Leu His           
350                 355                 360                 365           

att aat gcc tgt gtt tat tac tgg gct tca aac tat gaa gga att ggc       1155
Ile Asn Ala Cys Val Tyr Tyr Trp Ala Ser Asn Tyr Glu Gly Ile Gly           
                370                 375                 380               

act act aga tgg gtg tat gat ggg gaa gga aac gag tat ctg aga tgt       1203
Thr Thr Arg Trp Val Tyr Asp Gly Glu Gly Asn Glu Tyr Leu Arg Cys           
            385                 390                 395                   

tat tat tgg gca gtt cga act tta att acc att ggt ggc ctt cca gaa       1251
Tyr Tyr Trp Ala Val Arg Thr Leu Ile Thr Ile Gly Gly Leu Pro Glu           
        400                 405                 410                       

cca caa act tta ttt gaa att gtt ttt caa ctc ttg aat ttt ttt tct       1299
Pro Gln Thr Leu Phe Glu Ile Val Phe Gln Leu Leu Asn Phe Phe Ser           
    415                 420                 425                           

gga gtt ttt gtg ttc tcc agt tta att ggt cag atg aga gat gtg att       1347
Gly Val Phe Val Phe Ser Ser Leu Ile Gly Gln Met Arg Asp Val Ile           
430                 435                 440                 445           

gga gca gct aca gcc aat cag aac tac ttc cgc gcc tgc atg gat gac       1395
Gly Ala Ala Thr Ala Asn Gln Asn Tyr Phe Arg Ala Cys Met Asp Asp           
                450                 455                 460               

acc att gcc tac atg aac aat tac tcc att cct aaa ctt gtg caa aag       1443
Thr Ile Ala Tyr Met Asn Asn Tyr Ser Ile Pro Lys Leu Val Gln Lys           
            465                 470                 475                   

cga gtt cgg act tgg tat gaa tat aca tgg gac tct caa aga atg cta       1491
Arg Val Arg Thr Trp Tyr Glu Tyr Thr Trp Asp Ser Gln Arg Met Leu           
        480                 485                 490                       

gat gag tct gat ttg ctt aag acc cta cca act acg gtc cag tta gcc       1539
Asp Glu Ser Asp Leu Leu Lys Thr Leu Pro Thr Thr Val Gln Leu Ala           
    495                 500                 505                           

ctc gcc att gat gtg aac ttc agc atc atc agc aaa gtt gac ttg ttc       1587
Leu Ala Ile Asp Val Asn Phe Ser Ile Ile Ser Lys Val Asp Leu Phe           
510                 515                 520                 525           

aag ggt tgt gat aca cag atg att tat gac atg ttg cta aga ttg aaa       1635
Lys Gly Cys Asp Thr Gln Met Ile Tyr Asp Met Leu Leu Arg Leu Lys           
                530                 535                 540               

tcc gtt ctc tat ttg cct ggt gac ttt gtc tgc aaa aag gga gaa att       1683
Ser Val Leu Tyr Leu Pro Gly Asp Phe Val Cys Lys Lys Gly Glu Ile           
            545                 550                 555                   

ggc aag gaa atg tat atc atc aag cat gga gaa gtc caa gtt ctt gga       1731
Gly Lys Glu Met Tyr Ile Ile Lys His Gly Glu Val Gln Val Leu Gly           
        560                 565                 570                       

ggc cct gat ggt act aaa gtt ctg gtt act ctg aaa gct ggg tcg gtg       1779
Gly Pro Asp Gly Thr Lys Val Leu Val Thr Leu Lys Ala Gly Ser Val           
    575                 580                 585                           

ttt gga gaa atc agc ctt cta gca gca gga gga gga aac cgt cga act       1827
Phe Gly Glu Ile Ser Leu Leu Ala Ala Gly Gly Gly Asn Arg Arg Thr           
590                 595                 600                 605           

gcc aat gtg gtg gcc cac ggg ttt gcc aat ctt tta act cta gac aaa       1875
Ala Asn Val Val Ala His Gly Phe Ala Asn Leu Leu Thr Leu Asp Lys           
                610                 615                 620               

aag acc ctc caa gaa att cta gtg cat tat cca gat tct gaa aga atc       1923
Lys Thr Leu Gln Glu Ile Leu Val His Tyr Pro Asp Ser Glu Arg Ile           
            625                 630                 635                   

ctc atg aag aaa gcc aga gtg ctt tta aag cag aag gct aag acc gca       1971
Leu Met Lys Lys Ala Arg Val Leu Leu Lys Gln Lys Ala Lys Thr Ala           
        640                 645                 650                       

gaa gca acc cct cca aga aaa gat ctt gcc ctc ctc ttc cca ccg aaa       2019
Glu Ala Thr Pro Pro Arg Lys Asp Leu Ala Leu Leu Phe Pro Pro Lys           
    655                 660                 665                           

gaa gag aca ccc aaa ctg ttt aaa act ctc cta gga ggc aca gga aaa       2067
Glu Glu Thr Pro Lys Leu Phe Lys Thr Leu Leu Gly Gly Thr Gly Lys           
670                 675                 680                 685           

gca agt ctt gca aga cta ctc aaa ttg aag cga gag caa gca gct cag       2115
Ala Ser Leu Ala Arg Leu Leu Lys Leu Lys Arg Glu Gln Ala Ala Gln           
                690                 695                 700               

aag aaa gaa aat tct gaa gga gga gag gaa gaa gga aaa gaa aat gaa       2163
Lys Lys Glu Asn Ser Glu Gly Gly Glu Glu Glu Gly Lys Glu Asn Glu           
            705                 710                 715                   

gat aaa caa aaa gaa aat gaa gat aaa caa aaa gaa aat gaa gat aaa       2211
Asp Lys Gln Lys Glu Asn Glu Asp Lys Gln Lys Glu Asn Glu Asp Lys           
        720                 725                 730                       

gga aaa gaa aat gaa gat aaa gat aaa gga aga gag cca gaa gag aag       2259
Gly Lys Glu Asn Glu Asp Lys Asp Lys Gly Arg Glu Pro Glu Glu Lys           
    735                 740                 745                           

cca ctg gac aga cct gaa tgt aca gca agt cct att gca gtg gag gaa       2307
Pro Leu Asp Arg Pro Glu Cys Thr Ala Ser Pro Ile Ala Val Glu Glu           
750                 755                 760                 765           

gaa ccc cac tca gtt aga agg aca gtt tta ccc aga ggg act tct cgt       2355
Glu Pro His Ser Val Arg Arg Thr Val Leu Pro Arg Gly Thr Ser Arg           
                770                 775                 780               

caa tca ctc att atc agc atg gct cct tct gct gag ggc gga gaa gag       2403
Gln Ser Leu Ile Ile Ser Met Ala Pro Ser Ala Glu Gly Gly Glu Glu           
            785                 790                 795                   

gtt ctt act att gaa gtc aaa gaa aag gct aag caa tga tca taa           2448
Val Leu Thr Ile Glu Val Lys Glu Lys Ala Lys Gln     Ser                   
        800                 805                     810                   

ctgcag                                                                2454


<210>  24
<211>  809
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  24

Met Phe Lys Ser Leu Thr Lys Val Asn Lys Val Lys Pro Ile Gly Glu 
1               5                   10                  15      


Asn Asn Glu Asn Glu Gln Ser Ser Arg Arg Asn Glu Glu Gly Ser His 
            20                  25                  30          


Pro Ser Asn Gln Ser Gln Gln Thr Thr Ala Gln Glu Glu Asn Lys Gly 
        35                  40                  45              


Glu Glu Lys Ser Leu Lys Thr Lys Ser Thr Pro Val Thr Ser Glu Glu 
    50                  55                  60                  


Pro His Thr Asn Ile Gln Asp Lys Leu Ser Lys Lys Asn Ser Ser Gly 
65                  70                  75                  80  


Asp Leu Thr Thr Asn Pro Asp Pro Gln Asn Ala Ala Glu Pro Thr Gly 
                85                  90                  95      


Thr Val Pro Glu Gln Lys Glu Met Asp Pro Gly Lys Glu Gly Pro Asn 
            100                 105                 110         


Ser Pro Gln Asn Lys Pro Pro Ala Ala Pro Val Ile Asn Glu Tyr Ala 
        115                 120                 125             


Asp Ala Gln Leu His Asn Leu Val Lys Arg Met Arg Gln Arg Thr Ala 
    130                 135                 140                 


Leu Tyr Lys Lys Lys Leu Val Glu Gly Asp Leu Ser Ser Pro Glu Ala 
145                 150                 155                 160 


Ser Pro Gln Thr Ala Lys Pro Thr Ala Val Pro Pro Val Lys Glu Ser 
                165                 170                 175     


Asp Asp Lys Pro Thr Glu His Tyr Tyr Arg Leu Leu Trp Phe Lys Val 
            180                 185                 190         


Lys Lys Met Pro Leu Thr Glu Tyr Leu Lys Arg Ile Lys Leu Pro Asn 
        195                 200                 205             


Ser Ile Asp Ser Tyr Thr Asp Arg Leu Tyr Leu Leu Trp Leu Leu Leu 
    210                 215                 220                 


Val Thr Leu Ala Tyr Asn Trp Asn Cys Cys Phe Ile Pro Leu Arg Leu 
225                 230                 235                 240 


Val Phe Pro Tyr Gln Thr Ala Asp Asn Ile His Tyr Trp Leu Ile Ala 
                245                 250                 255     


Asp Ile Ile Cys Asp Ile Ile Tyr Leu Tyr Asp Met Leu Phe Ile Gln 
            260                 265                 270         


Pro Arg Leu Gln Phe Val Arg Gly Gly Asp Ile Ile Val Asp Ser Asn 
        275                 280                 285             


Glu Leu Arg Lys His Tyr Arg Thr Ser Thr Lys Phe Gln Leu Asp Val 
    290                 295                 300                 


Ala Ser Ile Ile Pro Phe Asp Ile Cys Tyr Leu Phe Phe Gly Phe Asn 
305                 310                 315                 320 


Pro Met Phe Arg Ala Asn Arg Met Leu Lys Tyr Thr Ser Phe Phe Glu 
                325                 330                 335     


Phe Asn His His Leu Glu Ser Ile Met Asp Lys Ala Tyr Ile Tyr Arg 
            340                 345                 350         


Val Ile Arg Thr Thr Gly Tyr Leu Leu Phe Ile Leu His Ile Asn Ala 
        355                 360                 365             


Cys Val Tyr Tyr Trp Ala Ser Asn Tyr Glu Gly Ile Gly Thr Thr Arg 
    370                 375                 380                 


Trp Val Tyr Asp Gly Glu Gly Asn Glu Tyr Leu Arg Cys Tyr Tyr Trp 
385                 390                 395                 400 


Ala Val Arg Thr Leu Ile Thr Ile Gly Gly Leu Pro Glu Pro Gln Thr 
                405                 410                 415     


Leu Phe Glu Ile Val Phe Gln Leu Leu Asn Phe Phe Ser Gly Val Phe 
            420                 425                 430         


Val Phe Ser Ser Leu Ile Gly Gln Met Arg Asp Val Ile Gly Ala Ala 
        435                 440                 445             


Thr Ala Asn Gln Asn Tyr Phe Arg Ala Cys Met Asp Asp Thr Ile Ala 
    450                 455                 460                 


Tyr Met Asn Asn Tyr Ser Ile Pro Lys Leu Val Gln Lys Arg Val Arg 
465                 470                 475                 480 


Thr Trp Tyr Glu Tyr Thr Trp Asp Ser Gln Arg Met Leu Asp Glu Ser 
                485                 490                 495     


Asp Leu Leu Lys Thr Leu Pro Thr Thr Val Gln Leu Ala Leu Ala Ile 
            500                 505                 510         


Asp Val Asn Phe Ser Ile Ile Ser Lys Val Asp Leu Phe Lys Gly Cys 
        515                 520                 525             


Asp Thr Gln Met Ile Tyr Asp Met Leu Leu Arg Leu Lys Ser Val Leu 
    530                 535                 540                 


Tyr Leu Pro Gly Asp Phe Val Cys Lys Lys Gly Glu Ile Gly Lys Glu 
545                 550                 555                 560 


Met Tyr Ile Ile Lys His Gly Glu Val Gln Val Leu Gly Gly Pro Asp 
                565                 570                 575     


Gly Thr Lys Val Leu Val Thr Leu Lys Ala Gly Ser Val Phe Gly Glu 
            580                 585                 590         


Ile Ser Leu Leu Ala Ala Gly Gly Gly Asn Arg Arg Thr Ala Asn Val 
        595                 600                 605             


Val Ala His Gly Phe Ala Asn Leu Leu Thr Leu Asp Lys Lys Thr Leu 
    610                 615                 620                 


Gln Glu Ile Leu Val His Tyr Pro Asp Ser Glu Arg Ile Leu Met Lys 
625                 630                 635                 640 


Lys Ala Arg Val Leu Leu Lys Gln Lys Ala Lys Thr Ala Glu Ala Thr 
                645                 650                 655     


Pro Pro Arg Lys Asp Leu Ala Leu Leu Phe Pro Pro Lys Glu Glu Thr 
            660                 665                 670         


Pro Lys Leu Phe Lys Thr Leu Leu Gly Gly Thr Gly Lys Ala Ser Leu 
        675                 680                 685             


Ala Arg Leu Leu Lys Leu Lys Arg Glu Gln Ala Ala Gln Lys Lys Glu 
    690                 695                 700                 


Asn Ser Glu Gly Gly Glu Glu Glu Gly Lys Glu Asn Glu Asp Lys Gln 
705                 710                 715                 720 


Lys Glu Asn Glu Asp Lys Gln Lys Glu Asn Glu Asp Lys Gly Lys Glu 
                725                 730                 735     


Asn Glu Asp Lys Asp Lys Gly Arg Glu Pro Glu Glu Lys Pro Leu Asp 
            740                 745                 750         


Arg Pro Glu Cys Thr Ala Ser Pro Ile Ala Val Glu Glu Glu Pro His 
        755                 760                 765             


Ser Val Arg Arg Thr Val Leu Pro Arg Gly Thr Ser Arg Gln Ser Leu 
    770                 775                 780                 


Ile Ile Ser Met Ala Pro Ser Ala Glu Gly Gly Glu Glu Val Leu Thr 
785                 790                 795                 800 


Ile Glu Val Lys Glu Lys Ala Lys Gln 
                805                 


<210>  25
<211>  11714
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence


<220>
<221>  misc_feature
<222>  (1)..(130)
<223>  5' ITR

<220>
<221>  misc_feature
<222>  (241)..(544)
<223>  CMV enhancer

<220>
<221>  misc_feature
<222>  (546)..(823)
<223>  chicken beta-actin promoter

<220>
<221>  misc_feature
<222>  (824)..(1795)
<223>  CBA exon 1 and intron

<220>
<221>  misc_feature
<222>  (1859)..(1864)
<223>  kozak

<220>
<221>  misc_feature
<222>  (1865)..(3826)
<223>  human codon optimized CHM (REP-1)

<220>
<221>  misc_feature
<222>  (3847)..(4054)
<223>  bGH poly(A) signal

<220>
<221>  misc_feature
<222>  (4104)..(4233)
<223>  3' ITR

<400>  25
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg      240

cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt      300

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      360

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      420

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      480

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattaa      540

catggtcgag gtgagcccca cgttctgctt cactctcccc atctcccccc cctccccacc      600

cccaattttg tatttattta ttttttaatt attttgtgca gcgatggggg cggggggggg      660

gggggggcgc gcgccaggcg gggcggggcg gggcgagggg cggggcgggg cgaggcggag      720

aggtgcggcg gcagccaatc agagcggcgc gctccgaaag tttcctttta tggcgaggcg      780

gcggcggcgg cggccctata aaaagcgaag cgcgcggcgg gcggggagtc gctgcgacgc      840

tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg      900

accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag      960

cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc     1020

cgggagggcc ctttgtgcgg ggggagcggc tcggggggtg cgtgcgtgtg tgtgtgcgtg     1080

gggagcgccg cgtgcggctc cgcgctgccc ggcggctgtg agcgctgcgg gcgcggcgcg     1140

gggctttgtg cgctccgcag tgtgcgcgag gggagcgcgg ccgggggcgg tgccccgcgg     1200

tgcggggggg gctgcgaggg gaacaaaggc tgcgtgcggg gtgtgtgcgt gggggggtga     1260

gcagggggtg tgggcgcgtc ggtcgggctg caaccccccc tgcacccccc tccccgagtt     1320

gctgagcacg gcccggcttc gggtgcgggg ctccgtacgg ggcgtggcgc ggggctcgcc     1380

gtgccgggcg gggggtggcg gcaggtgggg gtgccgggcg gggcggggcc gcctcgggcc     1440

ggggagggct cgggggaggg gcgcggcggc ccccggagcg ccggcggctg tcgaggcgcg     1500

gcgagccgca gccattgcct tttatggtaa tcgtgcgaga gggcgcaggg acttcctttg     1560

tcccaaatct gtgcggagcc gaaatctggg aggcgccgcc gcaccccctc tagcgggcgc     1620

ggggcgaagc ggtgcggcgc cggcaggaag gaaatgggcg gggagggcct tcgtgcgtcg     1680

ccgcgccgcc gtccccttct ccctctccag cctcggggct gtccgcgggg ggacggctgc     1740

cttcgggggg gacggggcag ggcggggttc ggcttctggc gtgtgaccgg cggctctaga     1800

caattgtact aaccttcttc tctttcctct cctgacaggt tggtgtacac tagcggccgc     1860

caccatggct gataccctgc cctctgaatt cgacgtgatt gtgattggaa ccggactccc     1920

tgaatcgatc atcgccgcgg cctgttcccg gtccggtcgg cgcgtgctgc acgtcgattc     1980

gagaagctac tacggaggga attgggcctc attctccttc tccggactgc tctcctggct     2040

gaaggagtat caggagaact ccgacattgt ctccgactca cctgtgtggc aggaccagat     2100

cctggaaaac gaggaagcaa tagccctgag ccggaaggac aagaccatcc agcacgtgga     2160

ggtgttctgt tatgcctccc aagacctcca tgaggacgtg gaagaggctg gagcgttgca     2220

gaagaatcat gccctcgtga cctccgctaa ctccaccgag gcagccgaca gcgccttcct     2280

gccgaccgag gatgaatccc tgtcaactat gtcgtgcgaa atgctgaccg aacagactcc     2340

gagctccgac cccgaaaacg ccctggaagt gaacggagcg gaagtgaccg gcgaaaagga     2400

gaaccattgc gacgacaaga cttgtgtccc atccacttcc gcggaggaca tgtccgagaa     2460

tgtgcctatc gccgaggaca ccaccgaaca gcccaagaag aacagaatca cgtacagcca     2520

gatcatcaag gaggggcgga ggtttaacat cgatctggtg tcgaagctgc tgtacagccg     2580

cggtctgctg atcgatctgc tcattaagtc gaacgtgtcg agatacgccg agttcaagaa     2640

catcacaagg attctcgcct tccgggaagg aagagtggaa caagtgccgt gctcccgggc     2700

cgacgtgttc aactcaaagc aacttaccat ggtggaaaag cgcatgctga tgaaattcct     2760

gaccttctgc atggagtacg aaaagtaccc tgatgagtac aagggttacg aagaaattac     2820

tttctacgag tacctcaaga cccagaagct gaccccgaat ctgcagtaca ttgtgatgca     2880

ctcaatcgca atgacctccg aaaccgcctc ctcgaccatc gacgggctca aggccaccaa     2940

gaacttcctg cactgtttgg ggcgctacgg caacactccg ttcctcttcc cgctgtacgg     3000

ccagggagag ctgcctcagt gtttctgccg gatgtgcgcc gtgttcggcg gaatctactg     3060

tctccgccac tcggtccagt gcctggtggt ggacaaggaa tccaggaagt gcaaagccat     3120

tattgaccag ttcggacaac ggatcatttc cgagcacttt cttgtggagg actcatactt     3180

cccggagaac atgtgctctc gggtccagta tcgacagatt tccagggcgg tgctcattac     3240

tgaccggagc gtcctcaaga ccgatagcga ccagcagatc tccatcctga ccgtgccggc     3300

ggaagaaccc ggcacttttg ccgtgcgcgt gatcgagctt tgctcatcca ccatgacttg     3360

catgaaaggc acttacctgg tgcacctgac gtgcacctca tcgaaaaccg ctagagagga     3420

cctggaatcc gtcgtccaaa agctgttcgt gccttacacc gagatggaaa ttgaaaacga     3480

acaagtggag aagccccgca tcctttgggc cctgtacttt aacatgcgcg attcctccga     3540

tatctcgcgg tcctgctata acgacttgcc ttcgaacgtc tacgtctgct ccgggccaga     3600

ctgcggtctt ggcaacgaca atgccgtgaa gcaggcggaa acactgttcc aagagatctg     3660

ccctaacgag gatttttgcc cgcccccccc aaaccccgag gatatcatct tggacggaga     3720

cagcctgcag ccagaagcat ccgagtccag cgccatcccg gaggccaaca gcgaaacctt     3780

caaggagagc actaacctgg gcaacctgga agagtccagc gaatgatcat aggatctctg     3840

cctcgactgt gccttctagt tgccagccat ctgttgtttg cccctccccc gtgccttcct     3900

tgaccctgga aggtgccact cccactgtcc tttcctaata aaatgaggaa attgcatcgc     3960

attgtctgag taggtgtcat tctattctgg ggggtggggt ggggcaggac agcaaggggg     4020

aggattggga agacaatagc aggcatgctg gggactcgag ttctacgtag ataagtagca     4080

tggcgggtta atcattaact acaaggaacc cctagtgatg gagttggcca ctccctctct     4140

gcgcgctcgc tcgctcactg aggccgggcg accaaaggtc gcccgacgcc cgggctttgc     4200

ccgggcggcc tcagtgagcg agcgagcgcg cagccttata aggatatggt gcactctcag     4260

tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa cacccgctga     4320

cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc     4380

cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga gacgaaaggg     4440

cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt cttagacgtc     4500

aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca     4560

ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa     4620

aaggaagagt atgagccata ttcaacggga aacgtcgagg ccgcgattaa attccaacat     4680

ggatgctgat ttatatgggt ataaatgggc tcgcgataat gtcgggcaat caggtgcgac     4740

aatctatcgc ttgtatggga agcccgatgc gccagagttg tttctgaaac atggcaaagg     4800

tagcgttgcc aatgatgtta cagatgagat ggtcagacta aactggctga cggaatttat     4860

gccacttccg accatcaagc attttatccg tactcctgat gatgcatggt tactcaccac     4920

tgcgatcccc ggaaaaacag cgttccaggt attagaagaa tatcctgatt caggtgaaaa     4980

tattgttgat gcgctggcag tgttcctgcg ccggttgcac tcgattcctg tttgtaattg     5040

tccttttaac agcgatcgcg tatttcgcct cgctcaggcg caatcacgaa tgaataacgg     5100

tttggttgat gcgagtgatt ttgatgacga gcgtaatggc tggcctgttg aacaagtctg     5160

gaaagaaatg cataaacttt tgccattctc accggattca gtcgtcactc atggtgattt     5220

ctcacttgat aaccttattt ttgacgaggg gaaattaata ggttgtattg atgttggacg     5280

agtcggaatc gcagaccgat accaggatct tgccatccta tggaactgcc tcggtgagtt     5340

ttctccttca ttacagaaac ggctttttca aaaatatggt attgataatc ctgatatgaa     5400

taaattgcag tttcatttga tgctcgatga gtttttctaa actgtcagac caagtttact     5460

catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga     5520

tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt     5580

cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct     5640

gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc     5700

taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgttc     5760

ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc     5820

tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg     5880

ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt     5940

cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg     6000

agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg     6060

gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt     6120

atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag     6180

gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt     6240

gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta     6300

ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt     6360

cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc     6420

cgattcatta atgcaggcgc ctgttgattt gagttttggg tttagcgtga caagtttgcg     6480

agggtgatcg gagtaatcag taaatagctc tccgcctaca atgacgtcat aaccatgatt     6540

tctggttttc tgacgtccgt tatcagttcc ctccgaccac gccagcatat cgaggaacgc     6600

cttacgttga ttattgattt ctaccatctt ctactccggc ttttttagca gcgaagcgtt     6660

tgataagcga accaatcgag tcagtaccga tgtagccgat aaacacgctc gttatataag     6720

cgagattgct acttagtccg gcgaagtcga gaaggtcacg aatgaaccag gcgataatgg     6780

cgcacatcgt tgcgtcgatt actgtttttg taaacgcacc gccattatat ctgccgcgaa     6840

ggtacgccat tgcaaacgca aggattgccc cgatgccttg ttcctttgcc gcgagaatgg     6900

cggccaacag gtcatgtttt tctggcatct tcatgtctta cccccaataa ggggatttgc     6960

tctatttaat taggaataag gtcgattact gatagaacaa atccaggcta ctgtgtttag     7020

taatcagatt tgttcgtgac cgatatgcac gggcaaaacg gcaggaggtt gttagcgcga     7080

cctcctgcca cccgctttca cgaaggtcat gtgtaaaagg ccgcagcgta actattacta     7140

atgaattcag gacagacagt ggctacggct cagtttgggt tgtgctgttg ctgggcggcg     7200

atgacgcctg tacgcatttg gtgatccggt tctgcttccg gtattcgctt aattcagcac     7260

aacggaaaga gcactggcta accaggctcg ccgactcttc acgattatcg actcaatgct     7320

cttacctgtt gtgcagatat aaaaaatccc gaaaccgtta tgcaggctct aactattacc     7380

tgcgaactgt ttcgggattg cattttgcag acctctctgc ctgcgatggt tggagttcca     7440

gacgatacgt cgaagtgacc aactaggcgg aatcggtagt aagcgccgcc tcttttcatc     7500

tcactaccac aacgagcgaa ttaacccatc gttgagtcaa atttacccaa ttttattcaa     7560

taagtcaata tcatgccgtt aatatgttgc catccgtggc aatcatgctg ctaacgtgtg     7620

accgcattca aaatgttgtc tgcgattgac tcttctttgt ggcattgcac caccagagcg     7680

tcatacagcg gcttaacagt gcgtgaccag gtgggttggg taaggtttgg gattagcatc     7740

gtcacagcgc gatatgctgc gcttgctggc atccttgaat agccgacgcc tttgcatctt     7800

ccgcactctt tctcgacaac tctcccccac agctctgttt tggcaatatc aaccgcacgg     7860

cctgtaccat ggcaatctct gcatcttgcc cccggcgtcg cggcactacg gcaataatcc     7920

gcataagcga atgttgcgag cacttgcagt acctttgcct tagtatttcc ttcaagcttt     7980

gccacaccac ggtatttccc cgataccttg tgtgcaaatt gcatcagata gttgatagcc     8040

ttttgtttgt cgttctggct gagttcgtgc ttaccgcaga atgcagccat accgaatccg     8100

gcttgtgatt gcgccatccc catagcagcc atcacatcag taccggaaag agagtcagaa     8160

gccgtggccc gtggtgagtc gctcatcatc gggctttttg gcgaatgaaa tttagctacg     8220

ctttcgagtc tcatgcgcct tctccctgta cctgaatcaa tgttaggttt ccgcagaaca     8280

ctgcgccggt atcgatatac atttggttgg caaacttgag tggtttcact gctggcgtat     8340

gaccaaagat gaacgtgtcc gcgcctttga tttctttcac gatcccgttt tgtgagttgc     8400

tgattcgttc gcggttccag attacctgct gatgatcaac tggctttcca aactcgtatt     8460

cgtcaaaggg ataatcggcg tggcagataa catatttttt atctttgctc accagttcga     8520

tgattaacgg aagttcatct gctttatggg caagagcttt agccagaatt tctttgtcgt     8580

aatcgagatt aaagaaccag ccaccgccat taagcagcca gtgattaacg tttccacgct     8640

ctgataagcc atcaatcatc atttgctcat ggtttccacg tacagctctg aaccagggga     8700

atgtgattaa ttccaggcat tcaacgttct ctgcaccacg atcaaccaaa tcgcccaccg     8760

agataagcag gtcttttttg ttgtcgaatc caatcgtatc cagtttgttc atcaggttcg     8820

tgtagcatcc gtgcagatcg ccaactaccc aaatatttcg gtatttgctg ccatcaattt     8880

tttcgtaata gcgcatctct ttcactccat ccgcgatgaa ccatgagaac gtcgttgacg     8940

atggcgtgca ttttcccgtc tttatcatca acgtattttc tgaccgtacc gcgactacat     9000

ttcagtctgc gtgctacttc tgtctgattt ccgtatgctt caacgagcat gtctggaatg     9060

gtttttactg agaacgtcat gcggcctcac ttctgctatt tcgcaggtct ttgagtttct     9120

gttggtactc tgccttgatc gccttgcact cttcgatagt ccagcgatgg cggttatggt     9180

ttgattcgat ttcgtctact gcttcctgcc cgatgcggct aatcagttcg acgcgatacg     9240

gaacgagatt tccgcttttg tgctggttgc acaccacgca ttgcttgtga atattgcgtt     9300

cattaaatcg gagttgaggt gccgcagcag ttgtccggta atgtccggca tcccactgag     9360

cagacgtgag cgttccgcac gagatacatg gtaagtcgcg gtctctttct ctgatgaagg     9420

cgtttacggc ttgttgggct tgtttaatcc agtaactgcg gggctttaag gcgagttttc     9480

gaatcttaag tttatctttc tgtttctgct cctctcgtcg tcgtttcttc tctgctgctt     9540

tttccgcttt ttcgcgttct ttacttcgtc gttcgagtgc tatcttggtt ccacactctg     9600

gagagcacca ccactgatta gcgaatgcag ggtgaaacca ttcccggcat tcatcgtttt     9660

tacatcgtct tcgcgctggt ttagccatca tcttcttcct cgtgcatcga gctattcgga     9720

tcgctcatca gttctgcgca gcagtgctca cacacgtgaa cttccagcac atgcagcttc     9780

tgaccgcagt tagcgcacgt taaagctcgc tcgacgcttt cttgttcgta acttcgattt     9840

tggtcaatca ccttgttttc ctcgcacgac gtcttagcca ccggatatcc cacaggtgag     9900

ccgtgtagtt gaaggttttt acgtcagatt cttttgggat tggcttgggt ttatttctgg     9960

tgcgtttcgt tggaaggtat ttgcagtttt cgcagattat gtcggtgata cttcgtcgct    10020

gtctcgccac acgtcctcct tttcctgcgg tagtggtaac acccctgttg gtgttctttc    10080

acaccggaga caccatcgat tccagtaagg ttgatttggt cggaagcggt tatcttcttt    10140

gcattcaccg caccgataac atcgcatcat gcagcttccc tcccgaagtc gaaatcaagc    10200

tgccctccaa atatttcgca tgactcagaa caagagccgg tatcgaatct tttagctcgt    10260

accatgtcct gatacagggc ttgataatca ttttctgaat acattttcgc gataccgtcc    10320

agcgacattc ttcctcggta cataatctcc tttggcgttt cccgatgtcc gtcacgcaca    10380

tgggatcccg tgatgacctc attaaaaaca cgctgcaatc cctcctcatc tttgcaggca    10440

agtccgattt tttgcgttga ttttttaatg cagaatatgc agttaccgag atgttccggt    10500

atttgcaaat cgaatggttg ttgcttccac catgcgagga tatcttcctt ctcaaagtct    10560

gacagttcag caagatatct gattccaggc tttggcttta gccgcttcgg ttcatcagct    10620

ctgatgccaa tccacgtggt gtaattccct cgcccgaaat ggtcatcaca gtatttggtg    10680

aagggaacga gttttaatct gtcagtgcag aacgcgccgc cgacgtatgg agtgccatat    10740

ttctttacca tatcgataaa tggcttcaga acaggcattc gcgtctgaat atcctttggt    10800

tcccataccg tataaccatt tggctgtcca agctccgggt tgatatcaac ctgcaatacg    10860

gtgagcggta tatcccagaa cttcacaact tccctgacaa accgatatgt cattggatgt    10920

tcacaacctg tatccatgaa aacgtaatgc acgtctttac ctgcccgtcg cttttgctcc    10980

attagccaga gcaaatatgc tgacgtcctg ccaccggaga aactaacgac atttatcatg    11040

cagccctgtc tccccatctc gctttccact ccagagccag tctcgcttcg tctgaccact    11100

taacgccacg ctctgtaccg aatgcctgta taagctctaa tagctccgca aattcgccta    11160

cacgcatcct gctggttgac tggcctatta ccacaaagcc attcccggca aggttaggaa    11220

caacatcctg ctgctttaat gctgcggtaa acacacactt ccagctttct gcatccagcc    11280

agcgaccatg ccattcaacc tgacgagaga cgtcacctaa gcaggcccat agcttcctgt    11340

tttggtctaa gctgcggttg cgttcctgaa tggttactac gattggtttg gttgggtctg    11400

gaaggatttg ctgtactgcg tgaatagcgt tttgctgatg tgctggagat cgaatttcaa    11460

aggttagttt tttcatgact tccctctccc ccaaataaaa aggctggcac gacaggtttc    11520

ccgactggaa agcgggcagt gagcgcaacg caattaatgt gagttagctc actcattagg    11580

caccccaggc tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt gtgagcggat    11640

aacaatttca cacaggaaac agctatgacc atgattacgc caagctgtcg actctagagg    11700

atcccctaat aagg                                                      11714


<210>  26
<211>  6647
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence


<220>
<221>  misc_feature
<222>  (1)..(130)
<223>  5' ITR

<220>
<221>  misc_feature
<222>  (241)..(544)
<223>  CMV enhancer

<220>
<221>  misc_feature
<222>  (546)..(823)
<223>  chicken beta-actin promoter

<220>
<221>  misc_feature
<222>  (824)..(1795)
<223>  CBA exon 1 and intron

<220>
<221>  misc_feature
<222>  (1859)..(1864)
<223>  Kozak

<220>
<221>  misc_feature
<222>  (1865)..(3826)
<223>  human codon optimized CHM (REM-1)

<220>
<221>  misc_feature
<222>  (3847)..(4054)
<223>  bGH poly(A) signal

<220>
<221>  misc_feature
<222>  (4104)..(4233)
<223>  3' ITR

<400>  26
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg      240

cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt      300

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      360

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      420

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      480

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattaa      540

catggtcgag gtgagcccca cgttctgctt cactctcccc atctcccccc cctccccacc      600

cccaattttg tatttattta ttttttaatt attttgtgca gcgatggggg cggggggggg      660

gggggggcgc gcgccaggcg gggcggggcg gggcgagggg cggggcgggg cgaggcggag      720

aggtgcggcg gcagccaatc agagcggcgc gctccgaaag tttcctttta tggcgaggcg      780

gcggcggcgg cggccctata aaaagcgaag cgcgcggcgg gcggggagtc gctgcgacgc      840

tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg      900

accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag      960

cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc     1020

cgggagggcc ctttgtgcgg ggggagcggc tcggggggtg cgtgcgtgtg tgtgtgcgtg     1080

gggagcgccg cgtgcggctc cgcgctgccc ggcggctgtg agcgctgcgg gcgcggcgcg     1140

gggctttgtg cgctccgcag tgtgcgcgag gggagcgcgg ccgggggcgg tgccccgcgg     1200

tgcggggggg gctgcgaggg gaacaaaggc tgcgtgcggg gtgtgtgcgt gggggggtga     1260

gcagggggtg tgggcgcgtc ggtcgggctg caaccccccc tgcacccccc tccccgagtt     1320

gctgagcacg gcccggcttc gggtgcgggg ctccgtacgg ggcgtggcgc ggggctcgcc     1380

gtgccgggcg gggggtggcg gcaggtgggg gtgccgggcg gggcggggcc gcctcgggcc     1440

ggggagggct cgggggaggg gcgcggcggc ccccggagcg ccggcggctg tcgaggcgcg     1500

gcgagccgca gccattgcct tttatggtaa tcgtgcgaga gggcgcaggg acttcctttg     1560

tcccaaatct gtgcggagcc gaaatctggg aggcgccgcc gcaccccctc tagcgggcgc     1620

ggggcgaagc ggtgcggcgc cggcaggaag gaaatgggcg gggagggcct tcgtgcgtcg     1680

ccgcgccgcc gtccccttct ccctctccag cctcggggct gtccgcgggg ggacggctgc     1740

cttcgggggg gacggggcag ggcggggttc ggcttctggc gtgtgaccgg cggctctaga     1800

caattgtact aaccttcttc tctttcctct cctgacaggt tggtgtacac tagcggccgc     1860

caccatggct gataccctgc cctctgaatt cgacgtgatt gtgattggaa ccggactccc     1920

tgaatcgatc atcgccgcgg cctgttcccg gtccggtcgg cgcgtgctgc acgtcgattc     1980

gagaagctac tacggaggga attgggcctc attctccttc tccggactgc tctcctggct     2040

gaaggagtat caggagaact ccgacattgt ctccgactca cctgtgtggc aggaccagat     2100

cctggaaaac gaggaagcaa tagccctgag ccggaaggac aagaccatcc agcacgtgga     2160

ggtgttctgt tatgcctccc aagacctcca tgaggacgtg gaagaggctg gagcgttgca     2220

gaagaatcat gccctcgtga cctccgctaa ctccaccgag gcagccgaca gcgccttcct     2280

gccgaccgag gatgaatccc tgtcaactat gtcgtgcgaa atgctgaccg aacagactcc     2340

gagctccgac cccgaaaacg ccctggaagt gaacggagcg gaagtgaccg gcgaaaagga     2400

gaaccattgc gacgacaaga cttgtgtccc atccacttcc gcggaggaca tgtccgagaa     2460

tgtgcctatc gccgaggaca ccaccgaaca gcccaagaag aacagaatca cgtacagcca     2520

gatcatcaag gaggggcgga ggtttaacat cgatctggtg tcgaagctgc tgtacagccg     2580

cggtctgctg atcgatctgc tcattaagtc gaacgtgtcg agatacgccg agttcaagaa     2640

catcacaagg attctcgcct tccgggaagg aagagtggaa caagtgccgt gctcccgggc     2700

cgacgtgttc aactcaaagc aacttaccat ggtggaaaag cgcatgctga tgaaattcct     2760

gaccttctgc atggagtacg aaaagtaccc tgatgagtac aagggttacg aagaaattac     2820

tttctacgag tacctcaaga cccagaagct gaccccgaat ctgcagtaca ttgtgatgca     2880

ctcaatcgca atgacctccg aaaccgcctc ctcgaccatc gacgggctca aggccaccaa     2940

gaacttcctg cactgtttgg ggcgctacgg caacactccg ttcctcttcc cgctgtacgg     3000

ccagggagag ctgcctcagt gtttctgccg gatgtgcgcc gtgttcggcg gaatctactg     3060

tctccgccac tcggtccagt gcctggtggt ggacaaggaa tccaggaagt gcaaagccat     3120

tattgaccag ttcggacaac ggatcatttc cgagcacttt cttgtggagg actcatactt     3180

cccggagaac atgtgctctc gggtccagta tcgacagatt tccagggcgg tgctcattac     3240

tgaccggagc gtcctcaaga ccgatagcga ccagcagatc tccatcctga ccgtgccggc     3300

ggaagaaccc ggcacttttg ccgtgcgcgt gatcgagctt tgctcatcca ccatgacttg     3360

catgaaaggc acttacctgg tgcacctgac gtgcacctca tcgaaaaccg ctagagagga     3420

cctggaatcc gtcgtccaaa agctgttcgt gccttacacc gagatggaaa ttgaaaacga     3480

acaagtggag aagccccgca tcctttgggc cctgtacttt aacatgcgcg attcctccga     3540

tatctcgcgg tcctgctata acgacttgcc ttcgaacgtc tacgtctgct ccgggccaga     3600

ctgcggtctt ggcaacgaca atgccgtgaa gcaggcggaa acactgttcc aagagatctg     3660

ccctaacgag gatttttgcc cgcccccccc aaaccccgag gatatcatct tggacggaga     3720

cagcctgcag ccagaagcat ccgagtccag cgccatcccg gaggccaaca gcgaaacctt     3780

caaggagagc actaacctgg gcaacctgga agagtccagc gaatgatcat aggatctctg     3840

cctcgactgt gccttctagt tgccagccat ctgttgtttg cccctccccc gtgccttcct     3900

tgaccctgga aggtgccact cccactgtcc tttcctaata aaatgaggaa attgcatcgc     3960

attgtctgag taggtgtcat tctattctgg ggggtggggt ggggcaggac agcaaggggg     4020

aggattggga agacaatagc aggcatgctg gggactcgag ttctacgtag ataagtagca     4080

tggcgggtta atcattaact acaaggaacc cctagtgatg gagttggcca ctccctctct     4140

gcgcgctcgc tcgctcactg aggccgggcg accaaaggtc gcccgacgcc cgggctttgc     4200

ccgggcggcc tcagtgagcg agcgagcgcg cagccttata aggatatggt gcactctcag     4260

tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa cacccgctga     4320

cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc     4380

cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga gacgaaaggg     4440

cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt cttagacgtc     4500

aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca     4560

ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa     4620

aaggaagagt atgagccata ttcaacggga aacgtcgagg ccgcgattaa attccaacat     4680

ggatgctgat ttatatgggt ataaatgggc tcgcgataat gtcgggcaat caggtgcgac     4740

aatctatcgc ttgtatggga agcccgatgc gccagagttg tttctgaaac atggcaaagg     4800

tagcgttgcc aatgatgtta cagatgagat ggtcagacta aactggctga cggaatttat     4860

gccacttccg accatcaagc attttatccg tactcctgat gatgcatggt tactcaccac     4920

tgcgatcccc ggaaaaacag cgttccaggt attagaagaa tatcctgatt caggtgaaaa     4980

tattgttgat gcgctggcag tgttcctgcg ccggttgcac tcgattcctg tttgtaattg     5040

tccttttaac agcgatcgcg tatttcgcct cgctcaggcg caatcacgaa tgaataacgg     5100

tttggttgat gcgagtgatt ttgatgacga gcgtaatggc tggcctgttg aacaagtctg     5160

gaaagaaatg cataaacttt tgccattctc accggattca gtcgtcactc atggtgattt     5220

ctcacttgat aaccttattt ttgacgaggg gaaattaata ggttgtattg atgttggacg     5280

agtcggaatc gcagaccgat accaggatct tgccatccta tggaactgcc tcggtgagtt     5340

ttctccttca ttacagaaac ggctttttca aaaatatggt attgataatc ctgatatgaa     5400

taaattgcag tttcatttga tgctcgatga gtttttctaa actgtcagac caagtttact     5460

catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga     5520

tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt     5580

cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct     5640

gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc     5700

taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgttc     5760

ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc     5820

tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg     5880

ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt     5940

cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg     6000

agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg     6060

gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt     6120

atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag     6180

gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt     6240

gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta     6300

ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt     6360

cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc     6420

cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca     6480

acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc     6540

cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg     6600

accatgatta cgccaagctg tcgactctag aggatcccct aataagg                   6647


<210>  27
<211>  11971
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence


<220>
<221>  misc_feature
<222>  (1)..(130)
<223>  5' ITR

<220>
<221>  misc_feature
<222>  (241)..(544)
<223>  CMV Enhancer

<220>
<221>  misc_feature
<222>  (546)..(823)
<223>  chicken beta-actin promoter

<220>
<221>  misc_feature
<222>  (824)..(1795)
<223>  CBA exon 1 and intron

<220>
<221>  misc_feature
<222>  (1859)..(1864)
<223>  kozak

<220>
<221>  misc_feature
<222>  (1865)..(3826)
<223>  human codon optimized CHM (REP-1)

<220>
<221>  misc_feature
<222>  (3847)..(4054)
<223>  bGH poly(A) signal

<220>
<221>  misc_feature
<222>  (4104)..(4233)
<223>  3' ITR

<400>  27
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg      240

cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt      300

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      360

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      420

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      480

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattaa      540

catggtcgag gtgagcccca cgttctgctt cactctcccc atctcccccc cctccccacc      600

cccaattttg tatttattta ttttttaatt attttgtgca gcgatggggg cggggggggg      660

gggggggcgc gcgccaggcg gggcggggcg gggcgagggg cggggcgggg cgaggcggag      720

aggtgcggcg gcagccaatc agagcggcgc gctccgaaag tttcctttta tggcgaggcg      780

gcggcggcgg cggccctata aaaagcgaag cgcgcggcgg gcggggagtc gctgcgacgc      840

tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg      900

accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag      960

cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc     1020

cgggagggcc ctttgtgcgg ggggagcggc tcggggggtg cgtgcgtgtg tgtgtgcgtg     1080

gggagcgccg cgtgcggctc cgcgctgccc ggcggctgtg agcgctgcgg gcgcggcgcg     1140

gggctttgtg cgctccgcag tgtgcgcgag gggagcgcgg ccgggggcgg tgccccgcgg     1200

tgcggggggg gctgcgaggg gaacaaaggc tgcgtgcggg gtgtgtgcgt gggggggtga     1260

gcagggggtg tgggcgcgtc ggtcgggctg caaccccccc tgcacccccc tccccgagtt     1320

gctgagcacg gcccggcttc gggtgcgggg ctccgtacgg ggcgtggcgc ggggctcgcc     1380

gtgccgggcg gggggtggcg gcaggtgggg gtgccgggcg gggcggggcc gcctcgggcc     1440

ggggagggct cgggggaggg gcgcggcggc ccccggagcg ccggcggctg tcgaggcgcg     1500

gcgagccgca gccattgcct tttatggtaa tcgtgcgaga gggcgcaggg acttcctttg     1560

tcccaaatct gtgcggagcc gaaatctggg aggcgccgcc gcaccccctc tagcgggcgc     1620

ggggcgaagc ggtgcggcgc cggcaggaag gaaatgggcg gggagggcct tcgtgcgtcg     1680

ccgcgccgcc gtccccttct ccctctccag cctcggggct gtccgcgggg ggacggctgc     1740

cttcgggggg gacggggcag ggcggggttc ggcttctggc gtgtgaccgg cggctctaga     1800

caattgtact aaccttcttc tctttcctct cctgacaggt tggtgtacac tagcggccgc     1860

caccatggct gataccctgc cctctgaatt cgacgtgatt gtgattggaa ccggactccc     1920

tgaatcgatc atcgccgcgg cctgttcccg gtccggtcgg cgcgtgctgc acgtcgattc     1980

gagaagctac tacggaggga attgggcctc attctccttc tccggactgc tctcctggct     2040

gaaggagtat caggagaact ccgacattgt ctccgactca cctgtgtggc aggaccagat     2100

cctggaaaac gaggaagcaa tagccctgag ccggaaggac aagaccatcc agcacgtgga     2160

ggtgttctgt tatgcctccc aagacctcca tgaggacgtg gaagaggctg gagcgttgca     2220

gaagaatcat gccctcgtga cctccgctaa ctccaccgag gcagccgaca gcgccttcct     2280

gccgaccgag gatgaatccc tgtcaactat gtcgtgcgaa atgctgaccg aacagactcc     2340

gagctccgac cccgaaaacg ccctggaagt gaacggagcg gaagtgaccg gcgaaaagga     2400

gaaccattgc gacgacaaga cttgtgtccc atccacttcc gcggaggaca tgtccgagaa     2460

tgtgcctatc gccgaggaca ccaccgaaca gcccaagaag aacagaatca cgtacagcca     2520

gatcatcaag gaggggcgga ggtttaacat cgatctggtg tcgaagctgc tgtacagccg     2580

cggtctgctg atcgatctgc tcattaagtc gaacgtgtcg agatacgccg agttcaagaa     2640

catcacaagg attctcgcct tccgggaagg aagagtggaa caagtgccgt gctcccgggc     2700

cgacgtgttc aactcaaagc aacttaccat ggtggaaaag cgcatgctga tgaaattcct     2760

gaccttctgc atggagtacg aaaagtaccc tgatgagtac aagggttacg aagaaattac     2820

tttctacgag tacctcaaga cccagaagct gaccccgaat ctgcagtaca ttgtgatgca     2880

ctcaatcgca atgacctccg aaaccgcctc ctcgaccatc gacgggctca aggccaccaa     2940

gaacttcctg cactgtttgg ggcgctacgg caacactccg ttcctcttcc cgctgtacgg     3000

ccagggagag ctgcctcagt gtttctgccg gatgtgcgcc gtgttcggcg gaatctactg     3060

tctccgccac tcggtccagt gcctggtggt ggacaaggaa tccaggaagt gcaaagccat     3120

tattgaccag ttcggacaac ggatcatttc cgagcacttt cttgtggagg actcatactt     3180

cccggagaac atgtgctctc gggtccagta tcgacagatt tccagggcgg tgctcattac     3240

tgaccggagc gtcctcaaga ccgatagcga ccagcagatc tccatcctga ccgtgccggc     3300

ggaagaaccc ggcacttttg ccgtgcgcgt gatcgagctt tgctcatcca ccatgacttg     3360

catgaaaggc acttacctgg tgcacctgac gtgcacctca tcgaaaaccg ctagagagga     3420

cctggaatcc gtcgtccaaa agctgttcgt gccttacacc gagatggaaa ttgaaaacga     3480

acaagtggag aagccccgca tcctttgggc cctgtacttt aacatgcgcg attcctccga     3540

tatctcgcgg tcctgctata acgacttgcc ttcgaacgtc tacgtctgct ccgggccaga     3600

ctgcggtctt ggcaacgaca atgccgtgaa gcaggcggaa acactgttcc aagagatctg     3660

ccctaacgag gatttttgcc cgcccccccc aaaccccgag gatatcatct tggacggaga     3720

cagcctgcag ccagaagcat ccgagtccag cgccatcccg gaggccaaca gcgaaacctt     3780

caaggagagc actaacctgg gcaacctgga agagtccagc gaatgatcat aggatctctg     3840

cctcgactgt gccttctagt tgccagccat ctgttgtttg cccctccccc gtgccttcct     3900

tgaccctgga aggtgccact cccactgtcc tttcctaata aaatgaggaa attgcatcgc     3960

attgtctgag taggtgtcat tctattctgg ggggtggggt ggggcaggac agcaaggggg     4020

aggattggga agacaatagc aggcatgctg gggactcgag ttctacgtag ataagtagca     4080

tggcgggtta atcattaact acaaggaacc cctagtgatg gagttggcca ctccctctct     4140

gcgcgctcgc tcgctcactg aggccgggcg accaaaggtc gcccgacgcc cgggctttgc     4200

ccgggcggcc tcagtgagcg agcgagcgcg cagccttaat taacctaata aggaaaatga     4260

agggaagttc ctatactttc tagagaatag gaacttctat agggagtcga ataagggcga     4320

cacaaaaggt attctaaatg cataataaat actgataaca tcttatagtt tgtattatat     4380

tttgtattat cgttgacatg tataattttg atatcaaaaa ctgattttcc ctttattatt     4440

ttcgagattt attttcttaa ttctctttaa caaactagaa atattgtata tacaaaaaat     4500

cataaataat agatgaatag tttaattata ggtgttcatc aatcgaaaaa gcaacgtatc     4560

ttatttaaag tgcgttgctt ttttctcatt tataaggtta aataattctc atatatcaag     4620

caaagtgaca ggcgccctta aatattctga caaatgctct ttccctaaac tccccccata     4680

aaaaaacccg ccgaagcggg tttttacgtt atttgcggat taacgattac tcgttatcag     4740

aaccgcccag gatgcctggc agttccctac tctcgccgct gcgctcggtc gttcggctgc     4800

gggacctcag cgctagcgga gtgtatactg gcttactatg ttggcactga tgagggtgtc     4860

agtgaagtgc ttcatgtggc aggagaaaaa aggctgcacc ggtgcgtcag cagaatatgt     4920

gatacaggat atattccgct tcctcgctca ctgactcgct acgctcggtc gttcgactgc     4980

ggcgagcgga aatggcttac gaacggggcg gagatttcct ggaagatgcc aggaagatac     5040

ttaacaggga agtgagaggg ccgcggcaaa gccgtttttc cataggctcc gcccccctga     5100

caagcatcac gaaatctgac gctcaaatca gtggtggcga aacccgacag gactataaag     5160

ataccaggcg tttccccctg gcggctccct cgtgcgctct cctgttcctg cctttcggtt     5220

taccggtgtc attccgctgt tatggccgcg tttgtctcat tccacgcctg acactcagtt     5280

ccgggtaggc agttcgctcc aagctggact gtatgcacga accccccgtt cagtccgacc     5340

gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggaaagacat gcaaaagcac     5400

cactggcagc agccactggt aattgattta gaggagttag tcttgaagtc atgcgccggt     5460

taaggctaaa ctgaaaggac aagttttggt gactgcgctc ctccaagcca gttacctcgg     5520

ttcaaagagt tggtagctca gagaaccttc gaaaaaccgc cctgcaaggc ggttttttcg     5580

ttttcagagc aagagattac gcgcagacca aaacgatctc aagaagatca tcttattaag     5640

ctccttttta tttgggggag agggaagtca tgaaaaaact aacctttgaa attcgatctc     5700

cagcacatca gcaaaacgct attcacgcag tacagcaaat ccttccagac ccaaccaaac     5760

caatcgtagt aaccattcag gaacgcaacc gcagcttaga ccaaaacagg aagctatggg     5820

cctgcttagg tgacgtctct cgtcaggttg aatggcatgg tcgctggctg gatgcagaaa     5880

gctggaagtg tgtgtttacc gcagcattaa agcagcagga tgttgttcct aaccttgccg     5940

ggaatggctt tgtggtaata ggccagtcaa ccagcaggat gcgtgtaggc gaatttgcgg     6000

agctattaga gcttatacag gcattcggta cagagcgtgg cgttaagtgg tcagacgaag     6060

cgagactggc tctggagtgg aaagcgagat ggggagacag ggctgcatga taaatgtcgt     6120

tagtttctcc ggtggcagga cgtcagcata tttgctctgg ctaatggagc aaaagcgacg     6180

ggcaggtaaa gacgtgcatt acgttttcat ggatacaggt tgtgaacatc caatgacata     6240

tcggtttgtc agggaagttg tgaagttctg ggatataccg ctcaccgtat tgcaggttga     6300

tatcaacccg gagcttggac agccaaatgg ttatacggta tgggaaccaa aggatattca     6360

gacgcgaatg cctgttctga agccatttat cgatatggta aagaaatatg gcactccata     6420

cgtcggcggc gcgttctgca ctgacagatt aaaactcgtt cccttcacca aatactgtga     6480

tgaccatttc gggcgaggga attacaccac gtggattggc atcagagctg atgaaccgaa     6540

gcggctaaag ccaaagcctg gaatcagata tcttgctgaa ctgtcagact ttgagaagga     6600

agatatcctc gcatggtgga agcaacaacc attcgatttg caaataccgg aacatctcgg     6660

taactgcata ttctgcatta aaaaatcaac gcaaaaaatc ggacttgcct gcaaagatga     6720

ggagggattg cagcgtgttt ttaatgaggt catcacggga tcccatgtgc gtgacggaca     6780

tcgggaaacg ccaaaggaga ttatgtaccg aggaagaatg tcgctggacg gtatcgcgaa     6840

aatgtattca gaaaatgatt atcaagccct gtatcaggac atggtacgag ctaaaagatt     6900

cgataccggc tcttgttctg agtcatgcga aatatttgga gggcagcttg atttcgactt     6960

cgggagggaa gctgcatgat gcgatgttat cggtgcggtg aatgcaaaga agataaccgc     7020

ttccgaccaa atcaacctta ctggaatcga tggtgtctcc ggtgtgaaag aacaccaaca     7080

ggggtgttac cactaccgca ggaaaaggag gacgtgtggc gagacagcga cgaagtatca     7140

ccgacataat ctgcgaaaac tgcaaatacc ttccaacgaa acgcaccaga aataaaccca     7200

agccaatccc aaaagaatct gacgtaaaaa ccttcaacta cacggctcac ctgtgggata     7260

tccggtggct aagacgtcgt gcgaggaaaa caaggtgatt gaccaaaatc gaagttacga     7320

acaagaaagc gtcgagcgag ctttaacgtg cgctaactgc ggtcagaagc tgcatgtgct     7380

ggaagttcac gtgtgtgagc actgctgcgc agaactgatg agcgatccga atagctcgat     7440

gcacgaggaa gaagatgatg gctaaaccag cgcgaagacg atgtaaaaac gatgaatgcc     7500

gggaatggtt tcaccctgca ttcgctaatc agtggtggtg ctctccagag tgtggaacca     7560

agatagcact cgaacgacga agtaaagaac gcgaaaaagc ggaaaaagca gcagagaaga     7620

aacgacgacg agaggagcag aaacagaaag ataaacttaa gattcgaaaa ctcgccttaa     7680

agccccgcag ttactggatt aaacaagccc aacaagccgt aaacgccttc atcagagaaa     7740

gagaccgcga cttaccatgt atctcgtgcg gaacgctcac gtctgctcag tgggatgccg     7800

gacattaccg gacaactgct gcggcacctc aactccgatt taatgaacgc aatattcaca     7860

agcaatgcgt ggtgtgcaac cagcacaaaa gcggaaatct cgttccgtat cgcgtcgaac     7920

tgattagccg catcgggcag gaagcagtag acgaaatcga atcaaaccat aaccgccatc     7980

gctggactat cgaagagtgc aaggcgatca aggcagagta ccaacagaaa ctcaaagacc     8040

tgcgaaatag cagaagtgag gccgcatgac gttctcagta aaaaccattc cagacatgct     8100

cgttgaagca tacggaaatc agacagaagt agcacgcaga ctgaaatgta gtcgcggtac     8160

ggtcagaaaa tacgttgatg ataaagacgg gaaaatgcac gccatcgtca acgacgttct     8220

catggttcat cgcggatgga gtgaaagaga tgcgctatta cgaaaaaatt gatggcagca     8280

aataccgaaa tatttgggta gttggcgatc tgcacggatg ctacacgaac ctgatgaaca     8340

aactggatac gattggattc gacaacaaaa aagacctgct tatctcggtg ggcgatttgg     8400

ttgatcgtgg tgcagagaac gttgaatgcc tggaattaat cacattcccc tggttcagag     8460

ctgtacgtgg aaaccatgag caaatgatga ttgatggctt atcagagcgt ggaaacgtta     8520

atcactggct gcttaatggc ggtggctggt tctttaatct cgattacgac aaagaaattc     8580

tggctaaagc tcttgcccat aaagcagatg aacttccgtt aatcatcgaa ctggtgagca     8640

aagataaaaa atatgttatc tgccacgccg attatccctt tgacgaatac gagtttggaa     8700

agccagttga tcatcagcag gtaatctgga accgcgaacg aatcagcaac tcacaaaacg     8760

ggatcgtgaa agaaatcaaa ggcgcggaca cgttcatctt tggtcatacg ccagcagtga     8820

aaccactcaa gtttgccaac caaatgtata tcgataccgg cgcagtgttc tgcggaaacc     8880

taacattgat tcaggtacag ggagaaggcg catgagactc gaaagcgtag ctaaatttca     8940

ttcgccaaaa agcccgatga tgagcgactc accacgggcc acggcttctg actctctttc     9000

cggtactgat gtgatggctg ctatggggat ggcgcaatca caagccggat tcggtatggc     9060

tgcattctgc ggtaagcacg aactcagcca gaacgacaaa caaaaggcta tcaactatct     9120

gatgcaattt gcacacaagg tatcggggaa ataccgtggt gtggcaaagc ttgaaggaaa     9180

tactaaggca aaggtactgc aagtgctcgc aacattcgct tatgcggatt attgccgtag     9240

tgccgcgacg ccgggggcaa gatgcagaga ttgccatggt acaggccgtg cggttgatat     9300

tgccaaaaca gagctgtggg ggagagttgt cgagaaagag tgcggaagat gcaaaggcgt     9360

cggctattca aggatgccag caagcgcagc atatcgcgct gtgacgatgc taatcccaaa     9420

ccttacccaa cccacctggt cacgcactgt taagccgctg tatgacgctc tggtggtgca     9480

atgccacaaa gaagagtcaa tcgcagacaa cattttgaat gcggtcacac gttagcagca     9540

tgattgccac ggatggcaac atattaacgg catgatattg acttattgaa taaaattggg     9600

taaatttgac tcaacgatgg gttaattcgc tcgttgtggt agtgagatga aaagaggcgg     9660

cgcttactac cgattccgcc tagttggtca cttcgacgta tcgtctggaa ctccaaccat     9720

cgcaggcaga gaggtctgca aaatgcaatc ccgaaacagt tcgcaggtaa tagttagagc     9780

ctgcataacg gtttcgggat tttttatatc tgcacaacag gtaagagcat tgagtcgata     9840

atcgtgaaga gtcggcgagc ctggttagcc agtgctcttt ccgttgtgct gaattaagcg     9900

aataccggaa gcagaaccgg atcaccaaat gcgtacaggc gtcatcgccg cccagcaaca     9960

gcacaaccca aactgagccg tagccactgt ctgtcctgaa ttcattagta atagttacgc    10020

tgcggccttt tacacatgac cttcgtgaaa gcgggtggca ggaggtcgcg ctaacaacct    10080

cctgccgttt tgcccgtgca tatcggtcac gaacaaatct gattactaaa cacagtagcc    10140

tggatttgtt ctatcagtaa tcgaccttat tcctaattaa atagagcaaa tccccttatt    10200

gggggtaaga catgaagatg ccagaaaaac atgacctgtt ggccgccatt ctcgcggcaa    10260

aggaacaagg catcggggca atccttgcgt ttgcaatggc gtaccttcgc ggcagatata    10320

atggcggtgc gtttacaaaa acagtaatcg acgcaacgat gtgcgccatt atcgcctggt    10380

tcattcgtga ccttctcgac ttcgccggac taagtagcaa tctcgcttat ataacgagcg    10440

tgtttatcgg ctacatcggt actgactcga ttggttcgct tatcaaacgc ttcgctgcta    10500

aaaaagccgg agtagaagat ggtagaaatc aataatcaac gtaaggcgtt cctcgatatg    10560

ctggcgtggt cggagggaac tgataacgga cgtcagaaaa ccagaaatca tggttatgac    10620

gtcattgtag gcggagagct atttactgat tactccgatc accctcgcaa acttgtcacg    10680

ctaaacccaa aactcaaatc aacaggcgca gcttttagaa aaactcatcg agcatcaaat    10740

gaaactgcaa tttattcata tcaggattat caataccata tttttgaaaa agccgtttct    10800

gtaatgaagg agaaaactca ccgaggcagt tccataggat ggcaagatcc tggtatcggt    10860

ctgcgattcc gactcgtcca acatcaatac aacctattaa tttcccctcg tcaaaaataa    10920

ggttatcaag tgagaaatca ccatgagtga cgactgaatc cggtgagaat ggcaaaagtt    10980

tatgcatttc tttccagact tgttcaacag gccagccatt acgctcgtca tcaaaatcac    11040

tcgcatcaac caaaccgtta ttcattcgtg attgcgcctg agcgaggcga aatacgcgat    11100

cgctgttaaa aggacaatta caaacaggaa tcgagtgcaa ccggcgcagg aacactgcca    11160

gcgcatcaac aatattttca cctgaatcag gatattcttc taatacctgg aacgctgttt    11220

ttccggggat cgcagtggtg agtaaccatg catcatcagg agtacggata aaatgcttga    11280

tggtcggaag tggcataaat tccgtcagcc agtttagtct gaccatctca tctgtaacat    11340

cattggcaac gctacctttg ccatgtttca gaaacaactc tggcgcatcg ggcttcccat    11400

acaagcgata gattgtcgca cctgattgcc cgacattatc gcgagcccat ttatacccat    11460

ataaatcagc atccatgttg gaatttaatc gcggcctcga cgtttcccgt tgaatatggc    11520

tcatattctt cctttttcaa tattattgaa gcatttatca gggttattgt ctcatgagcg    11580

gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggtcagtgtt acaaccaatt    11640

aaccaattct gaacattatc gcgagcccat ttatacctga atatggctca taacacccct    11700

tgtttgcctg gcggcagtag cgcggtggtc ccacctgacc ccatgccgaa ctcagaagtg    11760

aaacgccgta gcgccgatgg tagtgtgggg actccccatg cgagagtagg gaactgccag    11820

gcatcaaata aaacgaaagg ctcagtcgaa agactgggcc tttcgcccgg gctaattagg    11880

gggtgtcgcc cttattcgac tctataggga agttcctatt ctctagaaag tataggaact    11940

tctgaagggg ggtcgatcga cttaattaag g                                   11971


<210>  28
<211>  6900
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence


<220>
<221>  misc_feature
<222>  (1)..(130)
<223>  5' ITR

<220>
<221>  misc_feature
<222>  (241)..(544)
<223>  CMV enhancer

<220>
<221>  misc_feature
<222>  (546)..(823)
<223>  chicken beta actin promoter

<220>
<221>  misc_feature
<222>  (824)..(1795)
<223>  CBA exon 1 and intron

<220>
<221>  misc_feature
<222>  (1859)..(1864)
<223>  kozak

<220>
<221>  misc_feature
<222>  (1865)..(3826)
<223>  human codon optimized CHM (REP-1)

<220>
<221>  misc_feature
<222>  (3847)..(4054)
<223>  bGH poly(A) signal

<220>
<221>  misc_feature
<222>  (4104)..(4233)
<223>  3' ITR

<400>  28
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg      240

cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt      300

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      360

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      420

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      480

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattaa      540

catggtcgag gtgagcccca cgttctgctt cactctcccc atctcccccc cctccccacc      600

cccaattttg tatttattta ttttttaatt attttgtgca gcgatggggg cggggggggg      660

gggggggcgc gcgccaggcg gggcggggcg gggcgagggg cggggcgggg cgaggcggag      720

aggtgcggcg gcagccaatc agagcggcgc gctccgaaag tttcctttta tggcgaggcg      780

gcggcggcgg cggccctata aaaagcgaag cgcgcggcgg gcggggagtc gctgcgacgc      840

tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg      900

accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag      960

cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc     1020

cgggagggcc ctttgtgcgg ggggagcggc tcggggggtg cgtgcgtgtg tgtgtgcgtg     1080

gggagcgccg cgtgcggctc cgcgctgccc ggcggctgtg agcgctgcgg gcgcggcgcg     1140

gggctttgtg cgctccgcag tgtgcgcgag gggagcgcgg ccgggggcgg tgccccgcgg     1200

tgcggggggg gctgcgaggg gaacaaaggc tgcgtgcggg gtgtgtgcgt gggggggtga     1260

gcagggggtg tgggcgcgtc ggtcgggctg caaccccccc tgcacccccc tccccgagtt     1320

gctgagcacg gcccggcttc gggtgcgggg ctccgtacgg ggcgtggcgc ggggctcgcc     1380

gtgccgggcg gggggtggcg gcaggtgggg gtgccgggcg gggcggggcc gcctcgggcc     1440

ggggagggct cgggggaggg gcgcggcggc ccccggagcg ccggcggctg tcgaggcgcg     1500

gcgagccgca gccattgcct tttatggtaa tcgtgcgaga gggcgcaggg acttcctttg     1560

tcccaaatct gtgcggagcc gaaatctggg aggcgccgcc gcaccccctc tagcgggcgc     1620

ggggcgaagc ggtgcggcgc cggcaggaag gaaatgggcg gggagggcct tcgtgcgtcg     1680

ccgcgccgcc gtccccttct ccctctccag cctcggggct gtccgcgggg ggacggctgc     1740

cttcgggggg gacggggcag ggcggggttc ggcttctggc gtgtgaccgg cggctctaga     1800

caattgtact aaccttcttc tctttcctct cctgacaggt tggtgtacac tagcggccgc     1860

caccatggct gataccctgc cctctgaatt cgacgtgatt gtgattggaa ccggactccc     1920

tgaatcgatc atcgccgcgg cctgttcccg gtccggtcgg cgcgtgctgc acgtcgattc     1980

gagaagctac tacggaggga attgggcctc attctccttc tccggactgc tctcctggct     2040

gaaggagtat caggagaact ccgacattgt ctccgactca cctgtgtggc aggaccagat     2100

cctggaaaac gaggaagcaa tagccctgag ccggaaggac aagaccatcc agcacgtgga     2160

ggtgttctgt tatgcctccc aagacctcca tgaggacgtg gaagaggctg gagcgttgca     2220

gaagaatcat gccctcgtga cctccgctaa ctccaccgag gcagccgaca gcgccttcct     2280

gccgaccgag gatgaatccc tgtcaactat gtcgtgcgaa atgctgaccg aacagactcc     2340

gagctccgac cccgaaaacg ccctggaagt gaacggagcg gaagtgaccg gcgaaaagga     2400

gaaccattgc gacgacaaga cttgtgtccc atccacttcc gcggaggaca tgtccgagaa     2460

tgtgcctatc gccgaggaca ccaccgaaca gcccaagaag aacagaatca cgtacagcca     2520

gatcatcaag gaggggcgga ggtttaacat cgatctggtg tcgaagctgc tgtacagccg     2580

cggtctgctg atcgatctgc tcattaagtc gaacgtgtcg agatacgccg agttcaagaa     2640

catcacaagg attctcgcct tccgggaagg aagagtggaa caagtgccgt gctcccgggc     2700

cgacgtgttc aactcaaagc aacttaccat ggtggaaaag cgcatgctga tgaaattcct     2760

gaccttctgc atggagtacg aaaagtaccc tgatgagtac aagggttacg aagaaattac     2820

tttctacgag tacctcaaga cccagaagct gaccccgaat ctgcagtaca ttgtgatgca     2880

ctcaatcgca atgacctccg aaaccgcctc ctcgaccatc gacgggctca aggccaccaa     2940

gaacttcctg cactgtttgg ggcgctacgg caacactccg ttcctcttcc cgctgtacgg     3000

ccagggagag ctgcctcagt gtttctgccg gatgtgcgcc gtgttcggcg gaatctactg     3060

tctccgccac tcggtccagt gcctggtggt ggacaaggaa tccaggaagt gcaaagccat     3120

tattgaccag ttcggacaac ggatcatttc cgagcacttt cttgtggagg actcatactt     3180

cccggagaac atgtgctctc gggtccagta tcgacagatt tccagggcgg tgctcattac     3240

tgaccggagc gtcctcaaga ccgatagcga ccagcagatc tccatcctga ccgtgccggc     3300

ggaagaaccc ggcacttttg ccgtgcgcgt gatcgagctt tgctcatcca ccatgacttg     3360

catgaaaggc acttacctgg tgcacctgac gtgcacctca tcgaaaaccg ctagagagga     3420

cctggaatcc gtcgtccaaa agctgttcgt gccttacacc gagatggaaa ttgaaaacga     3480

acaagtggag aagccccgca tcctttgggc cctgtacttt aacatgcgcg attcctccga     3540

tatctcgcgg tcctgctata acgacttgcc ttcgaacgtc tacgtctgct ccgggccaga     3600

ctgcggtctt ggcaacgaca atgccgtgaa gcaggcggaa acactgttcc aagagatctg     3660

ccctaacgag gatttttgcc cgcccccccc aaaccccgag gatatcatct tggacggaga     3720

cagcctgcag ccagaagcat ccgagtccag cgccatcccg gaggccaaca gcgaaacctt     3780

caaggagagc actaacctgg gcaacctgga agagtccagc gaatgatcat aggatctctg     3840

cctcgactgt gccttctagt tgccagccat ctgttgtttg cccctccccc gtgccttcct     3900

tgaccctgga aggtgccact cccactgtcc tttcctaata aaatgaggaa attgcatcgc     3960

attgtctgag taggtgtcat tctattctgg ggggtggggt ggggcaggac agcaaggggg     4020

aggattggga agacaatagc aggcatgctg gggactcgag ttctacgtag ataagtagca     4080

tggcgggtta atcattaact acaaggaacc cctagtgatg gagttggcca ctccctctct     4140

gcgcgctcgc tcgctcactg aggccgggcg accaaaggtc gcccgacgcc cgggctttgc     4200

ccgggcggcc tcagtgagcg agcgagcgcg cagccttaat taacctaata aggaaaatga     4260

agggaagttc ctatactttc tagagaatag gaacttctat agggagtcga ataagggcga     4320

cacaaaaggt attctaaatg cataataaat actgataaca tcttatagtt tgtattatat     4380

tttgtattat cgttgacatg tataattttg atatcaaaaa ctgattttcc ctttattatt     4440

ttcgagattt attttcttaa ttctctttaa caaactagaa atattgtata tacaaaaaat     4500

cataaataat agatgaatag tttaattata ggtgttcatc aatcgaaaaa gcaacgtatc     4560

ttatttaaag tgcgttgctt ttttctcatt tataaggtta aataattctc atatatcaag     4620

caaagtgaca ggcgccctta aatattctga caaatgctct ttccctaaac tccccccata     4680

aaaaaacccg ccgaagcggg tttttacgtt atttgcggat taacgattac tcgttatcag     4740

aaccgcccag gatgcctggc agttccctac tctcgccgct gcgctcggtc gttcggctgc     4800

gggacctcag cgctagcgga gtgtatactg gcttactatg ttggcactga tgagggtgtc     4860

agtgaagtgc ttcatgtggc aggagaaaaa aggctgcacc ggtgcgtcag cagaatatgt     4920

gatacaggat atattccgct tcctcgctca ctgactcgct acgctcggtc gttcgactgc     4980

ggcgagcgga aatggcttac gaacggggcg gagatttcct ggaagatgcc aggaagatac     5040

ttaacaggga agtgagaggg ccgcggcaaa gccgtttttc cataggctcc gcccccctga     5100

caagcatcac gaaatctgac gctcaaatca gtggtggcga aacccgacag gactataaag     5160

ataccaggcg tttccccctg gcggctccct cgtgcgctct cctgttcctg cctttcggtt     5220

taccggtgtc attccgctgt tatggccgcg tttgtctcat tccacgcctg acactcagtt     5280

ccgggtaggc agttcgctcc aagctggact gtatgcacga accccccgtt cagtccgacc     5340

gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggaaagacat gcaaaagcac     5400

cactggcagc agccactggt aattgattta gaggagttag tcttgaagtc atgcgccggt     5460

taaggctaaa ctgaaaggac aagttttggt gactgcgctc ctccaagcca gttacctcgg     5520

ttcaaagagt tggtagctca gagaaccttc gaaaaaccgc cctgcaaggc ggttttttcg     5580

ttttcagagc aagagattac gcgcagacca aaacgatctc aagaagatca tcttattaag     5640

cttttagaaa aactcatcga gcatcaaatg aaactgcaat ttattcatat caggattatc     5700

aataccatat ttttgaaaaa gccgtttctg taatgaagga gaaaactcac cgaggcagtt     5760

ccataggatg gcaagatcct ggtatcggtc tgcgattccg actcgtccaa catcaataca     5820

acctattaat ttcccctcgt caaaaataag gttatcaagt gagaaatcac catgagtgac     5880

gactgaatcc ggtgagaatg gcaaaagttt atgcatttct ttccagactt gttcaacagg     5940

ccagccatta cgctcgtcat caaaatcact cgcatcaacc aaaccgttat tcattcgtga     6000

ttgcgcctga gcgaggcgaa atacgcgatc gctgttaaaa ggacaattac aaacaggaat     6060

cgagtgcaac cggcgcagga acactgccag cgcatcaaca atattttcac ctgaatcagg     6120

atattcttct aatacctgga acgctgtttt tccggggatc gcagtggtga gtaaccatgc     6180

atcatcagga gtacggataa aatgcttgat ggtcggaagt ggcataaatt ccgtcagcca     6240

gtttagtctg accatctcat ctgtaacatc attggcaacg ctacctttgc catgtttcag     6300

aaacaactct ggcgcatcgg gcttcccata caagcgatag attgtcgcac ctgattgccc     6360

gacattatcg cgagcccatt tatacccata taaatcagca tccatgttgg aatttaatcg     6420

cggcctcgac gtttcccgtt gaatatggct catattcttc ctttttcaat attattgaag     6480

catttatcag ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa     6540

acaaataggg gtcagtgtta caaccaatta accaattctg aacattatcg cgagcccatt     6600

tatacctgaa tatggctcat aacacccctt gtttgcctgg cggcagtagc gcggtggtcc     6660

cacctgaccc catgccgaac tcagaagtga aacgccgtag cgccgatggt agtgtgggga     6720

ctccccatgc gagagtaggg aactgccagg catcaaataa aacgaaaggc tcagtcgaaa     6780

gactgggcct ttcgcccggg ctaattaggg ggtgtcgccc ttattcgact ctatagggaa     6840

gttcctattc tctagaaagt ataggaactt ctgaaggggg gtcgatcgac ttaattaagg     6900


<210>  29
<211>  12074
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  29
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg      240

cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt      300

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      360

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      420

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      480

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattaa      540

catggtcgag gtgagcccca cgttctgctt cactctcccc atctcccccc cctccccacc      600

cccaattttg tatttattta ttttttaatt attttgtgca gcgatggggg cggggggggg      660

gggggggcgc gcgccaggcg gggcggggcg gggcgagggg cggggcgggg cgaggcggag      720

aggtgcggcg gcagccaatc agagcggcgc gctccgaaag tttcctttta tggcgaggcg      780

gcggcggcgg cggccctata aaaagcgaag cgcgcggcgg gcggggagtc gctgcgacgc      840

tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg      900

accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag      960

cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc     1020

cgggagggcc ctttgtgcgg ggggagcggc tcggggggtg cgtgcgtgtg tgtgtgcgtg     1080

gggagcgccg cgtgcggctc cgcgctgccc ggcggctgtg agcgctgcgg gcgcggcgcg     1140

gggctttgtg cgctccgcag tgtgcgcgag gggagcgcgg ccgggggcgg tgccccgcgg     1200

tgcggggggg gctgcgaggg gaacaaaggc tgcgtgcggg gtgtgtgcgt gggggggtga     1260

gcagggggtg tgggcgcgtc ggtcgggctg caaccccccc tgcacccccc tccccgagtt     1320

gctgagcacg gcccggcttc gggtgcgggg ctccgtacgg ggcgtggcgc ggggctcgcc     1380

gtgccgggcg gggggtggcg gcaggtgggg gtgccgggcg gggcggggcc gcctcgggcc     1440

ggggagggct cgggggaggg gcgcggcggc ccccggagcg ccggcggctg tcgaggcgcg     1500

gcgagccgca gccattgcct tttatggtaa tcgtgcgaga gggcgcaggg acttcctttg     1560

tcccaaatct gtgcggagcc gaaatctggg aggcgccgcc gcaccccctc tagcgggcgc     1620

ggggcgaagc ggtgcggcgc cggcaggaag gaaatgggcg gggagggcct tcgtgcgtcg     1680

ccgcgccgcc gtccccttct ccctctccag cctcggggct gtccgcgggg ggacggctgc     1740

cttcgggggg gacggggcag ggcggggttc ggcttctggc gtgtgaccgg cggctctaga     1800

caattgtact aaccttcttc tctttcctct cctgacaggt tggtgtacac tagcggccgc     1860

atggcggata ctctcccttc ggagtttgat gtgatcgtaa tagggacggg tttgcctgaa     1920

tccatcattg cagctgcatg ttcaagaagt ggccggagag ttctgcatgt tgattcaaga     1980

agctactatg gaggaaactg ggccagtttt agcttttcag gactattgtc ctggctaaag     2040

gaataccagg aaaacagtga cattgtaagt gacagtccag tgtggcaaga ccagatcctt     2100

gaaaatgaag aagccattgc tcttagcagg aaggacaaaa ctattcaaca tgtggaagta     2160

ttttgttatg ccagtcagga tttgcatgaa gatgtcgaag aagctggtgc actgcagaaa     2220

aatcatgctc ttgtgacatc tgcaaactcc acagaagctg cagattctgc cttcctgcct     2280

acggaggatg agtcattaag cactatgagc tgtgaaatgc tcacagaaca aactccaagc     2340

agcgatccag agaatgcgct agaagtaaat ggtgctgaag tgacagggga aaaagaaaac     2400

cattgtgatg ataaaacttg tgtgccatca acttcagcag aagacatgag tgaaaatgtg     2460

cctatagcag aagataccac agagcaacca aagaaaaaca gaattactta ctcacaaatt     2520

attaaagaag gcaggagatt taatattgat ttagtatcaa agctgctgta ttctcgagga     2580

ttactaattg atcttctaat caaatctaat gttagtcgat atgcagagtt taaaaatatt     2640

accaggattc ttgcatttcg agaaggacga gtggaacagg ttccgtgttc cagagcagat     2700

gtctttaata gcaaacaact tactatggta gaaaagcgaa tgctaatgaa atttcttaca     2760

ttttgtatgg aatatgagaa atatcctgat gaatataaag gatatgaaga gatcacattt     2820

tatgaatatt taaagactca aaaattaacc cccaacctcc aatatattgt catgcattca     2880

attgcaatga catcagagac agccagcagc accatagatg gtctcaaagc taccaaaaac     2940

tttcttcact gtcttgggcg gtatggcaac actccatttt tgtttccttt atatggccaa     3000

ggagaactcc cccagtgttt ctgcaggatg tgtgctgtgt ttggtggaat ttattgtctt     3060

cgccattcag tacagtgcct tgtagtggac aaagaatcca gaaaatgtaa agcaattata     3120

gatcagtttg gtcagagaat aatctctgag catttcctcg tggaggacag ttactttcct     3180

gagaacatgt gctcacgtgt gcaatacagg cagatctcca gggcagtgct gattacagat     3240

agatctgtcc taaaaacaga ttcagatcaa cagatttcca ttttgacagt gccagcagag     3300

gaaccaggaa cttttgctgt tcgggtcatt gagttatgtt cttcaacgat gacatgcatg     3360

aaaggcacct atttggttca tttgacttgc acatcttcta aaacagcaag agaagattta     3420

gaatcagttg tgcagaaatt gtttgttcca tatactgaaa tggagataga aaatgaacaa     3480

gtagaaaagc caagaattct gtgggctctt tacttcaata tgagagattc gtcagacatc     3540

agcaggagct gttataatga tttaccatcc aacgtttatg tctgctctgg cccagattgt     3600

ggtttaggaa atgataatgc agtcaaacag gctgaaacac ttttccagga aatctgcccc     3660

aatgaagatt tctgtccccc tccaccaaat cctgaagaca ttatccttga tggagacagt     3720

ttacagccag aggcttcaga atccagtgcc ataccagagg ctaactcgga gactttcaag     3780

gaaagcacaa accttggaaa cctagaggag tcctctgaat aaggatctgc ctcgactgtg     3840

ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa     3900

ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt     3960

aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa     4020

gacaatagca ggcatgctgg ggactcgagt tctacgtaga taagtagcat ggcgggttaa     4080

tcattaacta caaggaaccc ctagtgatgg agttggccac tccctctctg cgcgctcgct     4140

cgctcactga ggccgggcga ccaaaggtcg cccgacgccc gggctttgcc cgggcggcct     4200

cagtgagcga gcgagcgcgc agccttaatt aacctaagga aaatgaagtg aagttcctat     4260

actttctaga gaataggaac ttctatagtg agtcgaataa gggcgacaca aaatttattc     4320

taaatgcata ataaatactg ataacatctt atagtttgta ttatattttg tattatcgtt     4380

gacatgtata attttgatat caaaaactga ttttcccttt attattttcg agatttattt     4440

tcttaattct ctttaacaaa ctagaaatat tgtatataca aaaaatcata aataatagat     4500

gaatagttta attataggtg ttcatcaatc gaaaaagcaa cgtatcttat ttaaagtgcg     4560

ttgctttttt ctcatttata aggttaaata attctcatat atcaagcaaa gtgacaggcg     4620

cccttaaata ttctgacaaa tgctctttcc ctaaactccc cccataaaaa aacccgccga     4680

agcgggtttt tacgttattt gcggattaac gattactcgt tatcagaacc gcccaggggg     4740

cccgagctta acctttttat ttgggggaga gggaagtcat gaaaaaacta acctttgaaa     4800

ttcgatctcc agcacatcag caaaacgcta ttcacgcagt acagcaaatc cttccagacc     4860

caaccaaacc aatcgtagta accattcagg aacgcaaccg cagcttagac caaaacagga     4920

agctatgggc ctgcttaggt gacgtctctc gtcaggttga atggcatggt cgctggctgg     4980

atgcagaaag ctggaagtgt gtgtttaccg cagcattaaa gcagcaggat gttgttccta     5040

accttgccgg gaatggcttt gtggtaatag gccagtcaac cagcaggatg cgtgtaggcg     5100

aatttgcgga gctattagag cttatacagg cattcggtac agagcgtggc gttaagtggt     5160

cagacgaagc gagactggct ctggagtgga aagcgagatg gggagacagg gctgcatgat     5220

aaatgtcgtt agtttctccg gtggcaggac gtcagcatat ttgctctggc taatggagca     5280

aaagcgacgg gcaggtaaag acgtgcatta cgttttcatg gatacaggtt gtgaacatcc     5340

aatgacatat cggtttgtca gggaagttgt gaagttctgg gatataccgc tcaccgtatt     5400

gcaggttgat atcaacccgg agcttggaca gccaaatggt tatacggtat gggaaccaaa     5460

ggatattcag acgcgaatgc ctgttctgaa gccatttatc gatatggtaa agaaatatgg     5520

cactccatac gtcggcggcg cgttctgcac tgacagatta aaactcgttc ccttcaccaa     5580

atactgtgat gaccatttcg ggcgagggaa ttacaccacg tggattggca tcagagctga     5640

tgaaccgaag cggctaaagc caaagcctgg aatcagatat cttgctgaac tgtcagactt     5700

tgagaaggaa gatatcctcg catggtggaa gcaacaacca ttcgatttgc aaataccgga     5760

acatctcggt aactgcatat tctgcattaa aaaatcaacg caaaaaatcg gacttgcctg     5820

caaagatgag gagggattgc agcgtgtttt taatgaggtc atcacgggat cccatgtgcg     5880

tgacggacat cgggaaacgc caaaggagat tatgtaccga ggaagaatgt cgctggacgg     5940

tatcgcgaaa atgtattcag aaaatgatta tcaagccctg tatcaggaca tggtacgagc     6000

taaaagattc gataccggct cttgttctga gtcatgcgaa atatttggag ggcagcttga     6060

tttcgacttc gggagggaag ctgcatgatg cgatgttatc ggtgcggtga atgcaaagaa     6120

gataaccgct tccgaccaaa tcaaccttac tggaatcgat ggtgtctccg gtgtgaaaga     6180

acaccaacag gggtgttacc actaccgcag gaaaaggagg acgtgtggcg agacagcgac     6240

gaagtatcac cgacataatc tgcgaaaact gcaaatacct tccaacgaaa cgcaccagaa     6300

ataaacccaa gccaatccca aaagaatctg acgtaaaaac cttcaactac acggctcacc     6360

tgtgggatat ccggtggcta agacgtcgtg cgaggaaaac aaggtgattg accaaaatcg     6420

aagttacgaa caagaaagcg tcgagcgagc tttaacgtgc gctaactgcg gtcagaagct     6480

gcatgtgctg gaagttcacg tgtgtgagca ctgctgcgca gaactgatga gcgatccgaa     6540

tagctcgatg cacgaggaag aagatgatgg ctaaaccagc gcgaagacga tgtaaaaacg     6600

atgaatgccg ggaatggttt caccctgcat tcgctaatca gtggtggtgc tctccagagt     6660

gtggaaccaa gatagcactc gaacgacgaa gtaaagaacg cgaaaaagcg gaaaaagcag     6720

cagagaagaa acgacgacga gaggagcaga aacagaaaga taaacttaag attcgaaaac     6780

tcgccttaaa gccccgcagt tactggatta aacaagccca acaagccgta aacgccttca     6840

tcagagaaag agaccgcgac ttaccatgta tctcgtgcgg aacgctcacg tctgctcagt     6900

gggatgccgg acattaccgg acaactgctg cggcacctca actccgattt aatgaacgca     6960

atattcacaa gcaatgcgtg gtgtgcaacc agcacaaaag cggaaatctc gttccgtatc     7020

gcgtcgaact gattagccgc atcgggcagg aagcagtaga cgaaatcgaa tcaaaccata     7080

accgccatcg ctggactatc gaagagtgca aggcgatcaa ggcagagtac caacagaaac     7140

tcaaagacct gcgaaatagc agaagtgagg ccgcatgacg ttctcagtaa aaaccattcc     7200

agacatgctc gttgaagcat acggaaatca gacagaagta gcacgcagac tgaaatgtag     7260

tcgcggtacg gtcagaaaat acgttgatga taaagacggg aaaatgcacg ccatcgtcaa     7320

cgacgttctc atggttcatc gcggatggag tgaaagagat gcgctattac gaaaaaattg     7380

atggcagcaa ataccgaaat atttgggtag ttggcgatct gcacggatgc tacacgaacc     7440

tgatgaacaa actggatacg attggattcg acaacaaaaa agacctgctt atctcggtgg     7500

gcgatttggt tgatcgtggt gcagagaacg ttgaatgcct ggaattaatc acattcccct     7560

ggttcagagc tgtacgtgga aaccatgagc aaatgatgat tgatggctta tcagagcgtg     7620

gaaacgttaa tcactggctg cttaatggcg gtggctggtt ctttaatctc gattacgaca     7680

aagaaattct ggctaaagct cttgcccata aagcagatga acttccgtta atcatcgaac     7740

tggtgagcaa agataaaaaa tatgttatct gccacgccga ttatcccttt gacgaatacg     7800

agtttggaaa gccagttgat catcagcagg taatctggaa ccgcgaacga atcagcaact     7860

cacaaaacgg gatcgtgaaa gaaatcaaag gcgcggacac gttcatcttt ggtcatacgc     7920

cagcagtgaa accactcaag tttgccaacc aaatgtatat cgataccggc gcagtgttct     7980

gcggaaacct aacattgatt caggtacagg gagaaggcgc atgagactcg aaagcgtagc     8040

taaatttcat tcgccaaaaa gcccgatgat gagcgactca ccacgggcca cggcttctga     8100

ctctctttcc ggtactgatg tgatggctgc tatggggatg gcgcaatcac aagccggatt     8160

cggtatggct gcattctgcg gtaagcacga actcagccag aacgacaaac aaaaggctat     8220

caactatctg atgcaatttg cacacaaggt atcggggaaa taccgtggtg tggcaaagct     8280

tgaaggaaat actaaggcaa aggtactgca agtgctcgca acattcgctt atgcggatta     8340

ttgccgtagt gccgcgacgc cgggggcaag atgcagagat tgccatggta caggccgtgc     8400

ggttgatatt gccaaaacag agctgtgggg gagagttgtc gagaaagagt gcggaagatg     8460

caaaggcgtc ggctattcaa ggatgccagc aagcgcagca tatcgcgctg tgacgatgct     8520

aatcccaaac cttacccaac ccacctggtc acgcactgtt aagccgctgt atgacgctct     8580

ggtggtgcaa tgccacaaag aagagtcaat cgcagacaac attttgaatg cggtcacacg     8640

ttagcagcat gattgccacg gatggcaaca tattaacggc atgatattga cttattgaat     8700

aaaattgggt aaatttgact caacgatggg ttaattcgct cgttgtggta gtgagatgaa     8760

aagaggcggc gcttactacc gattccgcct agttggtcac ttcgacgtat cgtctggaac     8820

tccaaccatc gcaggcagag aggtctgcaa aatgcaatcc cgaaacagtt cgcaggtaat     8880

agttagagcc tgcataacgg tttcgggatt ttttatatct gcacaacagg taagagcatt     8940

gagtcgataa tcgtgaagag tcggcgagcc tggttagcca gtgctctttc cgttgtgctg     9000

aattaagcga ataccggaag cagaaccgga tcaccaaatg cgtacaggcg tcatcgccgc     9060

ccagcaacag cacaacccaa actgagccgt agccactgtc tgtcctgaat tcattagtaa     9120

tagttacgct gcggcctttt acacatgacc ttcgtgaaag cgggtggcag gaggtcgcgc     9180

taacaacctc ctgccgtttt gcccgtgcat atcggtcacg aacaaatctg attactaaac     9240

acagtagcct ggatttgttc tatcagtaat cgaccttatt cctaattaaa tagagcaaat     9300

ccccttattg ggggtaagac atgaagatgc cagaaaaaca tgacctgttg gccgccattc     9360

tcgcggcaaa ggaacaaggc atcggggcaa tccttgcgtt tgcaatggcg taccttcgcg     9420

gcagatataa tggcggtgcg tttacaaaaa cagtaatcga cgcaacgatg tgcgccatta     9480

tcgcctggtt cattcgtgac cttctcgact tcgccggact aagtagcaat ctcgcttata     9540

taacgagcgt gtttatcggc tacatcggta ctgactcgat tggttcgctt atcaaacgct     9600

tcgctgctaa aaaagccgga gtagaagatg gtagaaatca ataatcaacg taaggcgttc     9660

ctcgatatgc tggcgtggtc ggagggaact gataacggac gtcagaaaac cagaaatcat     9720

ggttatgacg tcattgtagg cggagagcta tttactgatt actccgatca ccctcgcaaa     9780

cttgtcacgc taaacccaaa actcaaatca acaggcgctt aagactggcc gtcgttttac     9840

aacacagaaa gagtttgtag aaacgcaaaa aggccatccg tcaggggcct tctgcttagt     9900

ttgatgcctg gcagttccct actctcgcct tccgcttcct cgctcactga ctcgctgcgc     9960

tcggtcgttc ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc    10020

acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg    10080

aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat    10140

cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag    10200

gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga    10260

tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg    10320

tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt    10380

cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac    10440

gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc    10500

ggtgctacag agttcttgaa gtggtgggct aactacggct acactagaag aacagtattt    10560

ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc    10620

ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc    10680

agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg    10740

aacgacgcgc gcgtaactca cgttaaggga ttttggtcat gagcttgcgc cgtcccgtca    10800

agtcagcgta atgctctgct tttagaaaaa ctcatcgagc atcaaatgaa actgcaattt    10860

attcatatca ggattatcaa taccatattt ttgaaaaagc cgtttctgta atgaaggaga    10920

aaactcaccg aggcagttcc ataggatggc aagatcctgg tatcggtctg cgattccgac    10980

tcgtccaaca tcaatacaac ctattaattt cccctcgtca aaaataaggt tatcaagtga    11040

gaaatcacca tgagtgacga ctgaatccgg tgagaatggc aaaagtttat gcatttcttt    11100

ccagacttgt tcaacaggcc agccattacg ctcgtcatca aaatcactcg catcaaccaa    11160

accgttattc attcgtgatt gcgcctgagc gaggcgaaat acgcgatcgc tgttaaaagg    11220

acaattacaa acaggaatcg agtgcaaccg gcgcaggaac actgccagcg catcaacaat    11280

attttcacct gaatcaggat attcttctaa tacctggaac gctgtttttc cggggatcgc    11340

agtggtgagt aaccatgcat catcaggagt acggataaaa tgcttgatgg tcggaagtgg    11400

cataaattcc gtcagccagt ttagtctgac catctcatct gtaacatcat tggcaacgct    11460

acctttgcca tgtttcagaa acaactctgg cgcatcgggc ttcccataca agcgatagat    11520

tgtcgcacct gattgcccga cattatcgcg agcccattta tacccatata aatcagcatc    11580

catgttggaa tttaatcgcg gcctcgacgt ttcccgttga atatggctca tattcttcct    11640

ttttcaatat tattgaagca tttatcaggg ttattgtctc atgagcggat acatatttga    11700

atgtatttag aaaaataaac aaataggggt cagtgttaca accaattaac caattctgaa    11760

cattatcgcg agcccattta tacctgaata tggctcataa caccccttgt ttgcctggcg    11820

gcagtagcgc ggtggtccca cctgacccca tgccgaactc agaagtgaaa cgccgtagcg    11880

ccgatggtag tgtggggact ccccatgcga gagtagggaa ctgccaggca tcaaataaaa    11940

cgaaaggctc agtcgaaaga ctgggccttt cgcccgggct aattaggggg tgtcgccctt    12000

attcgactct atagtgaagt tcctattctc tagaaagtat aggaacttct gaagtggggt    12060

cgacttaatt aagg                                                      12074


