                         SEQUENCE LISTING

<110>  Universitaetsklinikum Freiburg
 
<120>  Animal model for cancer

<130>  LSD1

<160>  11    

<170>  PatentIn version 3.5

<210>  1
<211>  2559
<212>  DNA
<213>  homo sapiens

<400>  1
atgttatctg ggaagaaggc ggcagccgcg gcggcggcgg ctgcagcggc agcaaccggg       60

acggaggctg gccctgggac agcaggcggc tccgagaacg ggtctgaggt ggccgcgcag      120

cccgcgggcc tgtcgggccc agccgaggtc gggccggggg cggtggggga gcgcacaccc      180

cgcaagaaag agcctccgcg ggcctcgccc cccgggggcc tggcggaacc gccggggtcc      240

gcagggcctc aggccggccc tactgtcgtg cctgggtctg cgacccccat ggaaactgga      300

atagcagaga ctccggaggg gcgtcggacc agccggcgca agcgggcgaa ggtagagtac      360

agagagatgg atgaaagctt ggccaacctc tcagaagatg agtattattc agaagaagag      420

agaaatgcca aagcagagaa ggaaaagaag cttcccccac caccccctca agccccacct      480

gaggaagaaa atgaaagtga gcctgaagaa ccatcgggtg tggagggcgc agctttccag      540

agccgacttc ctcatgaccg gatgacttct caagaagcag cctgttttcc agatattatc      600

agtggaccac aacagaccca gaaggttttt cttttcatta gaaaccgcac actgcagttg      660

tggttggata atccaaagat tcagctgaca tttgaggcta ctctccaaca attagaagca      720

ccttataaca gtgatactgt gcttgtccac cgagttcaca gttatttaga gcgtcatggt      780

cttatcaact tcggcatcta taagaggata aaacccctac caactaaaaa gacaggaaag      840

gtaattatta taggctctgg ggtctcaggc ttggcagcag ctcgacagtt acaaagtttt      900

ggaatggatg tcacactttt ggaagccagg gatcgtgtgg gtggacgagt tgccacattt      960

cgcaaaggaa actatgtagc tgatcttgga gccatggtgg taacaggtct tggagggaat     1020

cctatggctg tggtcagcaa acaagtaaat atggaactgg ccaagatcaa gcaaaaatgc     1080

ccactttatg aagccaacgg acaagctgtt cctaaagaga aagatgaaat ggtagagcaa     1140

gagtttaacc ggttgctaga agctacatct taccttagtc atcaactaga cttcaatgtc     1200

ctcaataata agcctgtgtc ccttggccag gcattggaag ttgtcattca gttacaagag     1260

aagcatgtca aagatgagca gattgaacat tggaagaaga tagtgaaaac tcaggaagaa     1320

ttgaaagaac ttcttaataa gatggtaaat ttgaaagaga aaattaaaga actccatcag     1380

caatacaaag aagcatctga agtaaagcca cccagagata ttactgccga gttcttagtg     1440

aaaagcaaac acagggatct gaccgcccta tgcaaggaat atgatgaatt agctgaaaca     1500

caaggaaagc tagaagaaaa acttcaggag ttggaagcga atcccccaag tgatgtatat     1560

ctctcatcaa gagacagaca aatacttgat tggcattttg caaatcttga atttgctaat     1620

gccacacctc tctcaactct ctcccttaag cactgggatc aggatgatga ctttgagttc     1680

actggcagcc acctgacagt aaggaatggc tactcgtgtg tgcctgtggc tttagcagaa     1740

ggcctagaca ttaaactgaa tacagcagtg cgacaggttc gctacacggc ttcaggatgt     1800

gaagtgatag ctgtgaatac ccgctccacg agtcaaacct ttatttataa atgcgacgca     1860

gttctctgta cccttcccct gggtgtgctg aagcagcagc caccagccgt tcagtttgtg     1920

ccacctctcc ctgagtggaa aacatctgca gtccaaagga tgggatttgg caaccttaac     1980

aaggtggtgt tgtgttttga tcgggtgttc tgggatccaa gtgtcaattt gttcgggcat     2040

gttggcagta cgactgccag caggggtgag ctcttcctct tctggaacct ctataaagct     2100

ccaatactgt tggcactagt ggcaggagaa gctgctggta tcatggaaaa cataagtgac     2160

gatgtgattg ttggccgatg cctggccatt ctcaaaggga tttttggtag cagtgcagta     2220

cctcagccca aagaaactgt ggtgtctcgt tggcgtgctg atccctgggc tcggggctct     2280

tattcctatg ttgctgcagg atcatctgga aatgactatg atttaatggc tcagccaatc     2340

actcctggcc cctcgattcc aggtgcccca cagccgattc cacgactctt ctttgcggga     2400

gaacatacga tccgtaacta cccagccaca gtgcatggtg ctctgctgag tgggctgcga     2460

gaagcgggaa gaattgcaga ccagtttttg ggggccatgt atacgctgcc tcgccaggcc     2520

acaccaggtg ttcctgcaca gcagtcccca agcatgtga                            2559


<210>  2
<211>  3053
<212>  DNA
<213>  homo sapiens


<220>
<221>  CDS
<222>  (150)..(2708)

<400>  2
ggcgcggcgg gagcgcgctt ggcgcgtgcg tacgcgacgg cggttggcgg cgcgcgggca       60

gcgtgaagcg aggcgaggca aggcttttcg gacccacgga gcgacagagc gagcggcccc      120

tacggccgtc ggcggcccgg cggcccgag atg tta tct ggg aag aag gcg gca        173
                                Met Leu Ser Gly Lys Lys Ala Ala           
                                1               5                         

gcc gcg gcg gcg gcg gct gca gcg gca gca acc ggg acg gag gct ggc        221
Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Thr Gly Thr Glu Ala Gly           
    10                  15                  20                            

cct ggg aca gca ggc ggc tcc gag aac ggg tct gag gtg gcc gcg cag        269
Pro Gly Thr Ala Gly Gly Ser Glu Asn Gly Ser Glu Val Ala Ala Gln           
25                  30                  35                  40            

ccc gcg ggc ctg tcg ggc cca gcc gag gtc ggg ccg ggg gcg gtg ggg        317
Pro Ala Gly Leu Ser Gly Pro Ala Glu Val Gly Pro Gly Ala Val Gly           
                45                  50                  55                

gag cgc aca ccc cgc aag aaa gag cct ccg cgg gcc tcg ccc ccc ggg        365
Glu Arg Thr Pro Arg Lys Lys Glu Pro Pro Arg Ala Ser Pro Pro Gly           
            60                  65                  70                    

ggc ctg gcg gaa ccg ccg ggg tcc gca ggg cct cag gcc ggc cct act        413
Gly Leu Ala Glu Pro Pro Gly Ser Ala Gly Pro Gln Ala Gly Pro Thr           
        75                  80                  85                        

gtc gtg cct ggg tct gcg acc ccc atg gaa act gga ata gca gag act        461
Val Val Pro Gly Ser Ala Thr Pro Met Glu Thr Gly Ile Ala Glu Thr           
    90                  95                  100                           

ccg gag ggg cgt cgg acc agc cgg cgc aag cgg gcg aag gta gag tac        509
Pro Glu Gly Arg Arg Thr Ser Arg Arg Lys Arg Ala Lys Val Glu Tyr           
105                 110                 115                 120           

aga gag atg gat gaa agc ttg gcc aac ctc tca gaa gat gag tat tat        557
Arg Glu Met Asp Glu Ser Leu Ala Asn Leu Ser Glu Asp Glu Tyr Tyr           
                125                 130                 135               

tca gaa gaa gag aga aat gcc aaa gca gag aag gaa aag aag ctt ccc        605
Ser Glu Glu Glu Arg Asn Ala Lys Ala Glu Lys Glu Lys Lys Leu Pro           
            140                 145                 150                   

cca cca ccc cct caa gcc cca cct gag gaa gaa aat gaa agt gag cct        653
Pro Pro Pro Pro Gln Ala Pro Pro Glu Glu Glu Asn Glu Ser Glu Pro           
        155                 160                 165                       

gaa gaa cca tcg ggt gtg gag ggc gca gct ttc cag agc cga ctt cct        701
Glu Glu Pro Ser Gly Val Glu Gly Ala Ala Phe Gln Ser Arg Leu Pro           
    170                 175                 180                           

cat gac cgg atg act tct caa gaa gca gcc tgt ttt cca gat att atc        749
His Asp Arg Met Thr Ser Gln Glu Ala Ala Cys Phe Pro Asp Ile Ile           
185                 190                 195                 200           

agt gga cca caa cag acc cag aag gtt ttt ctt ttc att aga aac cgc        797
Ser Gly Pro Gln Gln Thr Gln Lys Val Phe Leu Phe Ile Arg Asn Arg           
                205                 210                 215               

aca ctg cag ttg tgg ttg gat aat cca aag att cag ctg aca ttt gag        845
Thr Leu Gln Leu Trp Leu Asp Asn Pro Lys Ile Gln Leu Thr Phe Glu           
            220                 225                 230                   

gct act ctc caa caa tta gaa gca cct tat aac agt gat act gtg ctt        893
Ala Thr Leu Gln Gln Leu Glu Ala Pro Tyr Asn Ser Asp Thr Val Leu           
        235                 240                 245                       

gtc cac cga gtt cac agt tat tta gag cgt cat ggt ctt atc aac ttc        941
Val His Arg Val His Ser Tyr Leu Glu Arg His Gly Leu Ile Asn Phe           
    250                 255                 260                           

ggc atc tat aag agg ata aaa ccc cta cca act aaa aag aca gga aag        989
Gly Ile Tyr Lys Arg Ile Lys Pro Leu Pro Thr Lys Lys Thr Gly Lys           
265                 270                 275                 280           

gta att att ata ggc tct ggg gtc tca ggc ttg gca gca gct cga cag       1037
Val Ile Ile Ile Gly Ser Gly Val Ser Gly Leu Ala Ala Ala Arg Gln           
                285                 290                 295               

tta caa agt ttt gga atg gat gtc aca ctt ttg gaa gcc agg gat cgt       1085
Leu Gln Ser Phe Gly Met Asp Val Thr Leu Leu Glu Ala Arg Asp Arg           
            300                 305                 310                   

gtg ggt gga cga gtt gcc aca ttt cgc aaa gga aac tat gta gct gat       1133
Val Gly Gly Arg Val Ala Thr Phe Arg Lys Gly Asn Tyr Val Ala Asp           
        315                 320                 325                       

ctt gga gcc atg gtg gta aca ggt ctt gga ggg aat cct atg gct gtg       1181
Leu Gly Ala Met Val Val Thr Gly Leu Gly Gly Asn Pro Met Ala Val           
    330                 335                 340                           

gtc agc aaa caa gta aat atg gaa ctg gcc aag atc aag caa aaa tgc       1229
Val Ser Lys Gln Val Asn Met Glu Leu Ala Lys Ile Lys Gln Lys Cys           
345                 350                 355                 360           

cca ctt tat gaa gcc aac gga caa gct gtt cct aaa gag aaa gat gaa       1277
Pro Leu Tyr Glu Ala Asn Gly Gln Ala Val Pro Lys Glu Lys Asp Glu           
                365                 370                 375               

atg gta gag caa gag ttt aac cgg ttg cta gaa gct aca tct tac ctt       1325
Met Val Glu Gln Glu Phe Asn Arg Leu Leu Glu Ala Thr Ser Tyr Leu           
            380                 385                 390                   

agt cat caa cta gac ttc aat gtc ctc aat aat aag cct gtg tcc ctt       1373
Ser His Gln Leu Asp Phe Asn Val Leu Asn Asn Lys Pro Val Ser Leu           
        395                 400                 405                       

ggc cag gca ttg gaa gtt gtc att cag tta caa gag aag cat gtc aaa       1421
Gly Gln Ala Leu Glu Val Val Ile Gln Leu Gln Glu Lys His Val Lys           
    410                 415                 420                           

gat gag cag att gaa cat tgg aag aag ata gtg aaa act cag gaa gaa       1469
Asp Glu Gln Ile Glu His Trp Lys Lys Ile Val Lys Thr Gln Glu Glu           
425                 430                 435                 440           

ttg aaa gaa ctt ctt aat aag atg gta aat ttg aaa gag aaa att aaa       1517
Leu Lys Glu Leu Leu Asn Lys Met Val Asn Leu Lys Glu Lys Ile Lys           
                445                 450                 455               

gaa ctc cat cag caa tac aaa gaa gca tct gaa gta aag cca ccc aga       1565
Glu Leu His Gln Gln Tyr Lys Glu Ala Ser Glu Val Lys Pro Pro Arg           
            460                 465                 470                   

gat att act gcc gag ttc tta gtg aaa agc aaa cac agg gat ctg acc       1613
Asp Ile Thr Ala Glu Phe Leu Val Lys Ser Lys His Arg Asp Leu Thr           
        475                 480                 485                       

gcc cta tgc aag gaa tat gat gaa tta gct gaa aca caa gga aag cta       1661
Ala Leu Cys Lys Glu Tyr Asp Glu Leu Ala Glu Thr Gln Gly Lys Leu           
    490                 495                 500                           

gaa gaa aaa ctt cag gag ttg gaa gcg aat ccc cca agt gat gta tat       1709
Glu Glu Lys Leu Gln Glu Leu Glu Ala Asn Pro Pro Ser Asp Val Tyr           
505                 510                 515                 520           

ctc tca tca aga gac aga caa ata ctt gat tgg cat ttt gca aat ctt       1757
Leu Ser Ser Arg Asp Arg Gln Ile Leu Asp Trp His Phe Ala Asn Leu           
                525                 530                 535               

gaa ttt gct aat gcc aca cct ctc tca act ctc tcc ctt aag cac tgg       1805
Glu Phe Ala Asn Ala Thr Pro Leu Ser Thr Leu Ser Leu Lys His Trp           
            540                 545                 550                   

gat cag gat gat gac ttt gag ttc act ggc agc cac ctg aca gta agg       1853
Asp Gln Asp Asp Asp Phe Glu Phe Thr Gly Ser His Leu Thr Val Arg           
        555                 560                 565                       

aat ggc tac tcg tgt gtg cct gtg gct tta gca gaa ggc cta gac att       1901
Asn Gly Tyr Ser Cys Val Pro Val Ala Leu Ala Glu Gly Leu Asp Ile           
    570                 575                 580                           

aaa ctg aat aca gca gtg cga cag gtt cgc tac acg gct tca gga tgt       1949
Lys Leu Asn Thr Ala Val Arg Gln Val Arg Tyr Thr Ala Ser Gly Cys           
585                 590                 595                 600           

gaa gtg ata gct gtg aat acc cgc tcc acg agt caa acc ttt att tat       1997
Glu Val Ile Ala Val Asn Thr Arg Ser Thr Ser Gln Thr Phe Ile Tyr           
                605                 610                 615               

aaa tgc gac gca gtt ctc tgt acc ctt ccc ctg ggt gtg ctg aag cag       2045
Lys Cys Asp Ala Val Leu Cys Thr Leu Pro Leu Gly Val Leu Lys Gln           
            620                 625                 630                   

cag cca cca gcc gtt cag ttt gtg cca cct ctc cct gag tgg aaa aca       2093
Gln Pro Pro Ala Val Gln Phe Val Pro Pro Leu Pro Glu Trp Lys Thr           
        635                 640                 645                       

tct gca gtc caa agg atg gga ttt ggc aac ctt aac aag gtg gtg ttg       2141
Ser Ala Val Gln Arg Met Gly Phe Gly Asn Leu Asn Lys Val Val Leu           
    650                 655                 660                           

tgt ttt gat cgg gtg ttc tgg gat cca agt gtc aat ttg ttc ggg cat       2189
Cys Phe Asp Arg Val Phe Trp Asp Pro Ser Val Asn Leu Phe Gly His           
665                 670                 675                 680           

gtt ggc agt acg act gcc agc agg ggt gag ctc ttc ctc ttc tgg aac       2237
Val Gly Ser Thr Thr Ala Ser Arg Gly Glu Leu Phe Leu Phe Trp Asn           
                685                 690                 695               

ctc tat aaa gct cca ata ctg ttg gca cta gtg gca gga gaa gct gct       2285
Leu Tyr Lys Ala Pro Ile Leu Leu Ala Leu Val Ala Gly Glu Ala Ala           
            700                 705                 710                   

ggt atc atg gaa aac ata agt gac gat gtg att gtt ggc cga tgc ctg       2333
Gly Ile Met Glu Asn Ile Ser Asp Asp Val Ile Val Gly Arg Cys Leu           
        715                 720                 725                       

gcc att ctc aaa ggg att ttt ggt agc agt gca gta cct cag ccc aaa       2381
Ala Ile Leu Lys Gly Ile Phe Gly Ser Ser Ala Val Pro Gln Pro Lys           
    730                 735                 740                           

gaa act gtg gtg tct cgt tgg cgt gct gat ccc tgg gct cgg ggc tct       2429
Glu Thr Val Val Ser Arg Trp Arg Ala Asp Pro Trp Ala Arg Gly Ser           
745                 750                 755                 760           

tat tcc tat gtt gct gca gga tca tct gga aat gac tat gat tta atg       2477
Tyr Ser Tyr Val Ala Ala Gly Ser Ser Gly Asn Asp Tyr Asp Leu Met           
                765                 770                 775               

gct cag cca atc act cct ggc ccc tcg att cca ggt gcc cca cag ccg       2525
Ala Gln Pro Ile Thr Pro Gly Pro Ser Ile Pro Gly Ala Pro Gln Pro           
            780                 785                 790                   

att cca cga ctc ttc ttt gcg gga gaa cat acg atc cgt aac tac cca       2573
Ile Pro Arg Leu Phe Phe Ala Gly Glu His Thr Ile Arg Asn Tyr Pro           
        795                 800                 805                       

gcc aca gtg cat ggt gct ctg ctg agt ggg ctg cga gaa gcg gga aga       2621
Ala Thr Val His Gly Ala Leu Leu Ser Gly Leu Arg Glu Ala Gly Arg           
    810                 815                 820                           

att gca gac cag ttt ttg ggg gcc atg tat acg ctg cct cgc cag gcc       2669
Ile Ala Asp Gln Phe Leu Gly Ala Met Tyr Thr Leu Pro Arg Gln Ala           
825                 830                 835                 840           

aca cca ggt gtt cct gca cag cag tcc cca agc atg tga gacagatgca        2718
Thr Pro Gly Val Pro Ala Gln Gln Ser Pro Ser Met                           
                845                 850                                   

ttctaaggga agaggcccat gtgcctgttt ctgccatgta aggaaggctc ttctagcaat     2778

actagatccc actgagaaaa tccaccctgg catctgggct cctgatcagc tgatggagct     2838

cctgatttga caaaggagct tgcctccttt gaatgaccta gagcacaggg aggaacttgt     2898

ccattagttt ggaattgtgt tcttcgtaaa gactgaggca agcaagtgct gtgaaataac     2958

atcatcttag tcccttggtg tgtggggttt ttgttttttt tttatatttt gagaataaaa     3018

cttcatataa aattggcaaa aaaaaaaaaa aaaaa                                3053


<210>  3
<211>  852
<212>  PRT
<213>  homo sapiens

<400>  3

Met Leu Ser Gly Lys Lys Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala 
1               5                   10                  15      


Ala Ala Thr Gly Thr Glu Ala Gly Pro Gly Thr Ala Gly Gly Ser Glu 
            20                  25                  30          


Asn Gly Ser Glu Val Ala Ala Gln Pro Ala Gly Leu Ser Gly Pro Ala 
        35                  40                  45              


Glu Val Gly Pro Gly Ala Val Gly Glu Arg Thr Pro Arg Lys Lys Glu 
    50                  55                  60                  


Pro Pro Arg Ala Ser Pro Pro Gly Gly Leu Ala Glu Pro Pro Gly Ser 
65                  70                  75                  80  


Ala Gly Pro Gln Ala Gly Pro Thr Val Val Pro Gly Ser Ala Thr Pro 
                85                  90                  95      


Met Glu Thr Gly Ile Ala Glu Thr Pro Glu Gly Arg Arg Thr Ser Arg 
            100                 105                 110         


Arg Lys Arg Ala Lys Val Glu Tyr Arg Glu Met Asp Glu Ser Leu Ala 
        115                 120                 125             


Asn Leu Ser Glu Asp Glu Tyr Tyr Ser Glu Glu Glu Arg Asn Ala Lys 
    130                 135                 140                 


Ala Glu Lys Glu Lys Lys Leu Pro Pro Pro Pro Pro Gln Ala Pro Pro 
145                 150                 155                 160 


Glu Glu Glu Asn Glu Ser Glu Pro Glu Glu Pro Ser Gly Val Glu Gly 
                165                 170                 175     


Ala Ala Phe Gln Ser Arg Leu Pro His Asp Arg Met Thr Ser Gln Glu 
            180                 185                 190         


Ala Ala Cys Phe Pro Asp Ile Ile Ser Gly Pro Gln Gln Thr Gln Lys 
        195                 200                 205             


Val Phe Leu Phe Ile Arg Asn Arg Thr Leu Gln Leu Trp Leu Asp Asn 
    210                 215                 220                 


Pro Lys Ile Gln Leu Thr Phe Glu Ala Thr Leu Gln Gln Leu Glu Ala 
225                 230                 235                 240 


Pro Tyr Asn Ser Asp Thr Val Leu Val His Arg Val His Ser Tyr Leu 
                245                 250                 255     


Glu Arg His Gly Leu Ile Asn Phe Gly Ile Tyr Lys Arg Ile Lys Pro 
            260                 265                 270         


Leu Pro Thr Lys Lys Thr Gly Lys Val Ile Ile Ile Gly Ser Gly Val 
        275                 280                 285             


Ser Gly Leu Ala Ala Ala Arg Gln Leu Gln Ser Phe Gly Met Asp Val 
    290                 295                 300                 


Thr Leu Leu Glu Ala Arg Asp Arg Val Gly Gly Arg Val Ala Thr Phe 
305                 310                 315                 320 


Arg Lys Gly Asn Tyr Val Ala Asp Leu Gly Ala Met Val Val Thr Gly 
                325                 330                 335     


Leu Gly Gly Asn Pro Met Ala Val Val Ser Lys Gln Val Asn Met Glu 
            340                 345                 350         


Leu Ala Lys Ile Lys Gln Lys Cys Pro Leu Tyr Glu Ala Asn Gly Gln 
        355                 360                 365             


Ala Val Pro Lys Glu Lys Asp Glu Met Val Glu Gln Glu Phe Asn Arg 
    370                 375                 380                 


Leu Leu Glu Ala Thr Ser Tyr Leu Ser His Gln Leu Asp Phe Asn Val 
385                 390                 395                 400 


Leu Asn Asn Lys Pro Val Ser Leu Gly Gln Ala Leu Glu Val Val Ile 
                405                 410                 415     


Gln Leu Gln Glu Lys His Val Lys Asp Glu Gln Ile Glu His Trp Lys 
            420                 425                 430         


Lys Ile Val Lys Thr Gln Glu Glu Leu Lys Glu Leu Leu Asn Lys Met 
        435                 440                 445             


Val Asn Leu Lys Glu Lys Ile Lys Glu Leu His Gln Gln Tyr Lys Glu 
    450                 455                 460                 


Ala Ser Glu Val Lys Pro Pro Arg Asp Ile Thr Ala Glu Phe Leu Val 
465                 470                 475                 480 


Lys Ser Lys His Arg Asp Leu Thr Ala Leu Cys Lys Glu Tyr Asp Glu 
                485                 490                 495     


Leu Ala Glu Thr Gln Gly Lys Leu Glu Glu Lys Leu Gln Glu Leu Glu 
            500                 505                 510         


Ala Asn Pro Pro Ser Asp Val Tyr Leu Ser Ser Arg Asp Arg Gln Ile 
        515                 520                 525             


Leu Asp Trp His Phe Ala Asn Leu Glu Phe Ala Asn Ala Thr Pro Leu 
    530                 535                 540                 


Ser Thr Leu Ser Leu Lys His Trp Asp Gln Asp Asp Asp Phe Glu Phe 
545                 550                 555                 560 


Thr Gly Ser His Leu Thr Val Arg Asn Gly Tyr Ser Cys Val Pro Val 
                565                 570                 575     


Ala Leu Ala Glu Gly Leu Asp Ile Lys Leu Asn Thr Ala Val Arg Gln 
            580                 585                 590         


Val Arg Tyr Thr Ala Ser Gly Cys Glu Val Ile Ala Val Asn Thr Arg 
        595                 600                 605             


Ser Thr Ser Gln Thr Phe Ile Tyr Lys Cys Asp Ala Val Leu Cys Thr 
    610                 615                 620                 


Leu Pro Leu Gly Val Leu Lys Gln Gln Pro Pro Ala Val Gln Phe Val 
625                 630                 635                 640 


Pro Pro Leu Pro Glu Trp Lys Thr Ser Ala Val Gln Arg Met Gly Phe 
                645                 650                 655     


Gly Asn Leu Asn Lys Val Val Leu Cys Phe Asp Arg Val Phe Trp Asp 
            660                 665                 670         


Pro Ser Val Asn Leu Phe Gly His Val Gly Ser Thr Thr Ala Ser Arg 
        675                 680                 685             


Gly Glu Leu Phe Leu Phe Trp Asn Leu Tyr Lys Ala Pro Ile Leu Leu 
    690                 695                 700                 


Ala Leu Val Ala Gly Glu Ala Ala Gly Ile Met Glu Asn Ile Ser Asp 
705                 710                 715                 720 


Asp Val Ile Val Gly Arg Cys Leu Ala Ile Leu Lys Gly Ile Phe Gly 
                725                 730                 735     


Ser Ser Ala Val Pro Gln Pro Lys Glu Thr Val Val Ser Arg Trp Arg 
            740                 745                 750         


Ala Asp Pro Trp Ala Arg Gly Ser Tyr Ser Tyr Val Ala Ala Gly Ser 
        755                 760                 765             


Ser Gly Asn Asp Tyr Asp Leu Met Ala Gln Pro Ile Thr Pro Gly Pro 
    770                 775                 780                 


Ser Ile Pro Gly Ala Pro Gln Pro Ile Pro Arg Leu Phe Phe Ala Gly 
785                 790                 795                 800 


Glu His Thr Ile Arg Asn Tyr Pro Ala Thr Val His Gly Ala Leu Leu 
                805                 810                 815     


Ser Gly Leu Arg Glu Ala Gly Arg Ile Ala Asp Gln Phe Leu Gly Ala 
            820                 825                 830         


Met Tyr Thr Leu Pro Arg Gln Ala Thr Pro Gly Val Pro Ala Gln Gln 
        835                 840                 845             


Ser Pro Ser Met 
    850         


<210>  4
<211>  3030
<212>  DNA
<213>  mus musculus


<220>
<221>  CDS
<222>  (139)..(2700)

<400>  4
gggcgcgtgc gcacgcgggg gtgtttggct tcgcacggag cgtgagaggt gcggggcgga       60

gaggcgcgag gcggctgcgg acccacggag cggcagaccg atcggcccct gcggcccgcg      120

gcggccaggc ggcccgag atg ttg tct ggg aag aag gcg gcg gcg gcg gca        171
                    Met Leu Ser Gly Lys Lys Ala Ala Ala Ala Ala           
                    1               5                   10                

gcg gca gcg gcg gcg gcg gcg gct gct ggg acc gag gcc ggg tcc ggg        219
Ala Ala Ala Ala Ala Ala Ala Ala Ala Gly Thr Glu Ala Gly Ser Gly           
            15                  20                  25                    

gcg gcg ggc ggt gcc gag aac ggc tct gag gtg gcc gcg ccg ccc gcg        267
Ala Ala Gly Gly Ala Glu Asn Gly Ser Glu Val Ala Ala Pro Pro Ala           
        30                  35                  40                        

ggc ctg acg ggc ccc acc gac atg gct acg ggg gcg gcg ggc gag cgc        315
Gly Leu Thr Gly Pro Thr Asp Met Ala Thr Gly Ala Ala Gly Glu Arg           
    45                  50                  55                            

act ccc cga aag aag gag cct ccg cgg gcc tcg ccg ccc ggg ggc cta        363
Thr Pro Arg Lys Lys Glu Pro Pro Arg Ala Ser Pro Pro Gly Gly Leu           
60                  65                  70                  75            

gcc gag ccg ccg ggg tct gct ggg ccc cag gcg ggg ccc aca gcc ggg        411
Ala Glu Pro Pro Gly Ser Ala Gly Pro Gln Ala Gly Pro Thr Ala Gly           
                80                  85                  90                

ccc ggc tcc gcg acg ccc atg gag acc gga ata gcc gag acc ccg gag        459
Pro Gly Ser Ala Thr Pro Met Glu Thr Gly Ile Ala Glu Thr Pro Glu           
            95                  100                 105                   

ggc cga cgg acc agc cgg cgc aag cgg gcc aag gta gaa tac aga gaa        507
Gly Arg Arg Thr Ser Arg Arg Lys Arg Ala Lys Val Glu Tyr Arg Glu           
        110                 115                 120                       

atg gat gaa agc ttg gcc aac ctc tca gaa gat gaa tat tat tcg gaa        555
Met Asp Glu Ser Leu Ala Asn Leu Ser Glu Asp Glu Tyr Tyr Ser Glu           
    125                 130                 135                           

gaa gaa aga aat gct aaa gca gag aag gaa aag aag ctt ccc cca cca        603
Glu Glu Arg Asn Ala Lys Ala Glu Lys Glu Lys Lys Leu Pro Pro Pro           
140                 145                 150                 155           

cct cct caa gcc cca cct gag gaa gaa aat gaa agt gag ccg gaa gag        651
Pro Pro Gln Ala Pro Pro Glu Glu Glu Asn Glu Ser Glu Pro Glu Glu           
                160                 165                 170               

ccg tct ggt gtg gag ggt gca gct ttt caa agc cga ctt ccc cat gac        699
Pro Ser Gly Val Glu Gly Ala Ala Phe Gln Ser Arg Leu Pro His Asp           
            175                 180                 185                   

cga atg acc tct cag gaa gca gcc tgt ttc cca gac atc atc agt ggg        747
Arg Met Thr Ser Gln Glu Ala Ala Cys Phe Pro Asp Ile Ile Ser Gly           
        190                 195                 200                       

cct cag cag aca cag aag gtt ttt ctg ttc atc agg aat cgc aca ttg        795
Pro Gln Gln Thr Gln Lys Val Phe Leu Phe Ile Arg Asn Arg Thr Leu           
    205                 210                 215                           

cag tta tgg ctg gac aac cca aag atc cag ctg acg ttt gaa gcc act        843
Gln Leu Trp Leu Asp Asn Pro Lys Ile Gln Leu Thr Phe Glu Ala Thr           
220                 225                 230                 235           

ctc cag cag ctg gaa gcg cct tac aac agc gat act gtg ctt gtc cac        891
Leu Gln Gln Leu Glu Ala Pro Tyr Asn Ser Asp Thr Val Leu Val His           
                240                 245                 250               

cga gtt cac agt tac tta gag cgc cat ggt ctt atc aac ttc ggc atc        939
Arg Val His Ser Tyr Leu Glu Arg His Gly Leu Ile Asn Phe Gly Ile           
            255                 260                 265                   

tac aag agg ata aaa ccc tta cca att aaa aag aca gga aag gtg att        987
Tyr Lys Arg Ile Lys Pro Leu Pro Ile Lys Lys Thr Gly Lys Val Ile           
        270                 275                 280                       

att ata ggt tca ggt gtt tct ggc ttg gca gca gct cga cag cta cag       1035
Ile Ile Gly Ser Gly Val Ser Gly Leu Ala Ala Ala Arg Gln Leu Gln           
    285                 290                 295                           

agt ttt ggg atg gat gtc aca ctt ctg gaa gcc agg gat cga gta ggt       1083
Ser Phe Gly Met Asp Val Thr Leu Leu Glu Ala Arg Asp Arg Val Gly           
300                 305                 310                 315           

gga cga gtt gct aca ttt cga aaa gga aac tat gta gct gat ctt ggc       1131
Gly Arg Val Ala Thr Phe Arg Lys Gly Asn Tyr Val Ala Asp Leu Gly           
                320                 325                 330               

gcc atg gtt gta aca ggt ctt gga ggg aat ccc atg gct gtc gtc agc       1179
Ala Met Val Val Thr Gly Leu Gly Gly Asn Pro Met Ala Val Val Ser           
            335                 340                 345                   

aaa caa gta aat atg gaa ctg gcc aag atc aag caa aaa tgc cca ctt       1227
Lys Gln Val Asn Met Glu Leu Ala Lys Ile Lys Gln Lys Cys Pro Leu           
        350                 355                 360                       

tat gaa gcc aat gga caa gct gtt cca aaa gaa aaa gat gaa atg gta       1275
Tyr Glu Ala Asn Gly Gln Ala Val Pro Lys Glu Lys Asp Glu Met Val           
    365                 370                 375                           

gaa caa gaa ttt aac cgg ttg cta gaa gcc act tct tac ctt agt cac       1323
Glu Gln Glu Phe Asn Arg Leu Leu Glu Ala Thr Ser Tyr Leu Ser His           
380                 385                 390                 395           

cag tta gac ttc aac gtc ctc aat aat aaa cct gta tcc ctt ggc cag       1371
Gln Leu Asp Phe Asn Val Leu Asn Asn Lys Pro Val Ser Leu Gly Gln           
                400                 405                 410               

gca ttg gag gtt gtc att cag ctg caa gaa aag cat gtc aaa gat gag       1419
Ala Leu Glu Val Val Ile Gln Leu Gln Glu Lys His Val Lys Asp Glu           
            415                 420                 425                   

cag att gaa cat tgg aag aag ata gtg aaa act cag gag gag ttg aaa       1467
Gln Ile Glu His Trp Lys Lys Ile Val Lys Thr Gln Glu Glu Leu Lys           
        430                 435                 440                       

gag ctt ctt aat aag atg gta aat ttg aag gag aaa att aaa gag ctc       1515
Glu Leu Leu Asn Lys Met Val Asn Leu Lys Glu Lys Ile Lys Glu Leu           
    445                 450                 455                           

cat cag caa tac aaa gaa gct tca gaa gtg aag ccg ccc aga gat atc       1563
His Gln Gln Tyr Lys Glu Ala Ser Glu Val Lys Pro Pro Arg Asp Ile           
460                 465                 470                 475           

aca gcc gag ttc ctg gtg aag agc aag cac agg gac ctg act gcc ctc       1611
Thr Ala Glu Phe Leu Val Lys Ser Lys His Arg Asp Leu Thr Ala Leu           
                480                 485                 490               

tgc aag gaa tat gat gaa tta gct gaa aca caa gga aag cta gaa gaa       1659
Cys Lys Glu Tyr Asp Glu Leu Ala Glu Thr Gln Gly Lys Leu Glu Glu           
            495                 500                 505                   

aaa ctt caa gaa ttg gaa gcc aat ccc cca agt gat gta tac ctc tca       1707
Lys Leu Gln Glu Leu Glu Ala Asn Pro Pro Ser Asp Val Tyr Leu Ser           
        510                 515                 520                       

tca aga gac aga caa ata ctt gac tgg cat ttt gca aat ctt gaa ttt       1755
Ser Arg Asp Arg Gln Ile Leu Asp Trp His Phe Ala Asn Leu Glu Phe           
    525                 530                 535                           

gcc aac gcc aca cct ctc tct acc ctc tct ctt aaa cat tgg gat cag       1803
Ala Asn Ala Thr Pro Leu Ser Thr Leu Ser Leu Lys His Trp Asp Gln           
540                 545                 550                 555           

gat gat gac ttt gag ttt act gga agc cac ctg aca gta agg aat ggc       1851
Asp Asp Asp Phe Glu Phe Thr Gly Ser His Leu Thr Val Arg Asn Gly           
                560                 565                 570               

tac tca tgt gtg cct gtg gct tta gct gaa ggc ttg gac att aaa ctg       1899
Tyr Ser Cys Val Pro Val Ala Leu Ala Glu Gly Leu Asp Ile Lys Leu           
            575                 580                 585                   

aac aca gca gtg cgg cag gtt cgc tac aca gcc tca gga tgt gaa gtg       1947
Asn Thr Ala Val Arg Gln Val Arg Tyr Thr Ala Ser Gly Cys Glu Val           
        590                 595                 600                       

att gct gtg aac aca cgt tcc aca agt caa acc ttt att tat aag tgt       1995
Ile Ala Val Asn Thr Arg Ser Thr Ser Gln Thr Phe Ile Tyr Lys Cys           
    605                 610                 615                           

gat gca gtt ctc tgt aca ctt cct ttg gga gtg ttg aag cag cag cca       2043
Asp Ala Val Leu Cys Thr Leu Pro Leu Gly Val Leu Lys Gln Gln Pro           
620                 625                 630                 635           

cca gct gtt cag ttt gtg cca cct ctt cct gag tgg aaa aca tct gca       2091
Pro Ala Val Gln Phe Val Pro Pro Leu Pro Glu Trp Lys Thr Ser Ala           
                640                 645                 650               

gtc caa agg atg gga ttt ggc aac ctt aac aag gtg gtg tta tgc ttt       2139
Val Gln Arg Met Gly Phe Gly Asn Leu Asn Lys Val Val Leu Cys Phe           
            655                 660                 665                   

gac cgt gtg ttc tgg gac cca agt gtc aat ttg ttt ggg cac gtt ggc       2187
Asp Arg Val Phe Trp Asp Pro Ser Val Asn Leu Phe Gly His Val Gly           
        670                 675                 680                       

agt aca act gct agc agg ggt gag ctc ttc ctc ttc tgg aac cta tat       2235
Ser Thr Thr Ala Ser Arg Gly Glu Leu Phe Leu Phe Trp Asn Leu Tyr           
    685                 690                 695                           

aaa gct cca ata cta ttg gcc ctg gta gca gga gaa gct gct ggc att       2283
Lys Ala Pro Ile Leu Leu Ala Leu Val Ala Gly Glu Ala Ala Gly Ile           
700                 705                 710                 715           

atg gag aac att agt gat gat gtg att gtc ggc cgg tgc ctg gcc att       2331
Met Glu Asn Ile Ser Asp Asp Val Ile Val Gly Arg Cys Leu Ala Ile           
                720                 725                 730               

ctc aaa ggg att ttt ggc agc agt gca gtc cca cag ccc aag gaa act       2379
Leu Lys Gly Ile Phe Gly Ser Ser Ala Val Pro Gln Pro Lys Glu Thr           
            735                 740                 745                   

gtg gta tct cgt tgg cgt gct gat ccg tgg gcc cgg ggc tcc tat tct       2427
Val Val Ser Arg Trp Arg Ala Asp Pro Trp Ala Arg Gly Ser Tyr Ser           
        750                 755                 760                       

tat gtg gct gca gga tcc tct gga aat gac tat gat tta atg gct cag       2475
Tyr Val Ala Ala Gly Ser Ser Gly Asn Asp Tyr Asp Leu Met Ala Gln           
    765                 770                 775                           

ccg atc act cct ggc ccc tca att cca ggt gcc cca cag cca atc cca       2523
Pro Ile Thr Pro Gly Pro Ser Ile Pro Gly Ala Pro Gln Pro Ile Pro           
780                 785                 790                 795           

aga ctc ttc ttt gct gga gaa cac aca atc cgg aac tac cca gct aca       2571
Arg Leu Phe Phe Ala Gly Glu His Thr Ile Arg Asn Tyr Pro Ala Thr           
                800                 805                 810               

gtc cat ggt gct ctg ttg agt ggg ctt cga gaa gca gga agg att gcc       2619
Val His Gly Ala Leu Leu Ser Gly Leu Arg Glu Ala Gly Arg Ile Ala           
            815                 820                 825                   

gac cag ttt ttg gga gcc atg tac act ttg cct cgt cag gcc aca cca       2667
Asp Gln Phe Leu Gly Ala Met Tyr Thr Leu Pro Arg Gln Ala Thr Pro           
        830                 835                 840                       

ggt gtc cct gca cag cag tcc cca agt atg tga gacagatggt tctgaacaga     2720
Gly Val Pro Ala Gln Gln Ser Pro Ser Met                                   
    845                 850                                               

gagatccaac ggcatgtcat ctgccacgta agcaagctct tctagcaata ctagatccta     2780

ctgagaaact ccatgtcatc agctactggg actcctagtt tgacagcaga ggctggctcc     2840

tttggctgac agcaacttac ccattgattt ggaagtacag ctccataaag actgctcgag     2900

aagcaagtgg tgtgagataa cctcttagtc tatggtgttt gtttgttttt gttttttttt     2960

aatatatttt gagaataaaa ctttaaaata attttatatg aaaatttatt tttaaaaaaa     3020

aaaaaaaaaa                                                            3030


<210>  5
<211>  853
<212>  PRT
<213>  mus musculus

<400>  5

Met Leu Ser Gly Lys Lys Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala 
1               5                   10                  15      


Ala Ala Ala Ala Gly Thr Glu Ala Gly Ser Gly Ala Ala Gly Gly Ala 
            20                  25                  30          


Glu Asn Gly Ser Glu Val Ala Ala Pro Pro Ala Gly Leu Thr Gly Pro 
        35                  40                  45              


Thr Asp Met Ala Thr Gly Ala Ala Gly Glu Arg Thr Pro Arg Lys Lys 
    50                  55                  60                  


Glu Pro Pro Arg Ala Ser Pro Pro Gly Gly Leu Ala Glu Pro Pro Gly 
65                  70                  75                  80  


Ser Ala Gly Pro Gln Ala Gly Pro Thr Ala Gly Pro Gly Ser Ala Thr 
                85                  90                  95      


Pro Met Glu Thr Gly Ile Ala Glu Thr Pro Glu Gly Arg Arg Thr Ser 
            100                 105                 110         


Arg Arg Lys Arg Ala Lys Val Glu Tyr Arg Glu Met Asp Glu Ser Leu 
        115                 120                 125             


Ala Asn Leu Ser Glu Asp Glu Tyr Tyr Ser Glu Glu Glu Arg Asn Ala 
    130                 135                 140                 


Lys Ala Glu Lys Glu Lys Lys Leu Pro Pro Pro Pro Pro Gln Ala Pro 
145                 150                 155                 160 


Pro Glu Glu Glu Asn Glu Ser Glu Pro Glu Glu Pro Ser Gly Val Glu 
                165                 170                 175     


Gly Ala Ala Phe Gln Ser Arg Leu Pro His Asp Arg Met Thr Ser Gln 
            180                 185                 190         


Glu Ala Ala Cys Phe Pro Asp Ile Ile Ser Gly Pro Gln Gln Thr Gln 
        195                 200                 205             


Lys Val Phe Leu Phe Ile Arg Asn Arg Thr Leu Gln Leu Trp Leu Asp 
    210                 215                 220                 


Asn Pro Lys Ile Gln Leu Thr Phe Glu Ala Thr Leu Gln Gln Leu Glu 
225                 230                 235                 240 


Ala Pro Tyr Asn Ser Asp Thr Val Leu Val His Arg Val His Ser Tyr 
                245                 250                 255     


Leu Glu Arg His Gly Leu Ile Asn Phe Gly Ile Tyr Lys Arg Ile Lys 
            260                 265                 270         


Pro Leu Pro Ile Lys Lys Thr Gly Lys Val Ile Ile Ile Gly Ser Gly 
        275                 280                 285             


Val Ser Gly Leu Ala Ala Ala Arg Gln Leu Gln Ser Phe Gly Met Asp 
    290                 295                 300                 


Val Thr Leu Leu Glu Ala Arg Asp Arg Val Gly Gly Arg Val Ala Thr 
305                 310                 315                 320 


Phe Arg Lys Gly Asn Tyr Val Ala Asp Leu Gly Ala Met Val Val Thr 
                325                 330                 335     


Gly Leu Gly Gly Asn Pro Met Ala Val Val Ser Lys Gln Val Asn Met 
            340                 345                 350         


Glu Leu Ala Lys Ile Lys Gln Lys Cys Pro Leu Tyr Glu Ala Asn Gly 
        355                 360                 365             


Gln Ala Val Pro Lys Glu Lys Asp Glu Met Val Glu Gln Glu Phe Asn 
    370                 375                 380                 


Arg Leu Leu Glu Ala Thr Ser Tyr Leu Ser His Gln Leu Asp Phe Asn 
385                 390                 395                 400 


Val Leu Asn Asn Lys Pro Val Ser Leu Gly Gln Ala Leu Glu Val Val 
                405                 410                 415     


Ile Gln Leu Gln Glu Lys His Val Lys Asp Glu Gln Ile Glu His Trp 
            420                 425                 430         


Lys Lys Ile Val Lys Thr Gln Glu Glu Leu Lys Glu Leu Leu Asn Lys 
        435                 440                 445             


Met Val Asn Leu Lys Glu Lys Ile Lys Glu Leu His Gln Gln Tyr Lys 
    450                 455                 460                 


Glu Ala Ser Glu Val Lys Pro Pro Arg Asp Ile Thr Ala Glu Phe Leu 
465                 470                 475                 480 


Val Lys Ser Lys His Arg Asp Leu Thr Ala Leu Cys Lys Glu Tyr Asp 
                485                 490                 495     


Glu Leu Ala Glu Thr Gln Gly Lys Leu Glu Glu Lys Leu Gln Glu Leu 
            500                 505                 510         


Glu Ala Asn Pro Pro Ser Asp Val Tyr Leu Ser Ser Arg Asp Arg Gln 
        515                 520                 525             


Ile Leu Asp Trp His Phe Ala Asn Leu Glu Phe Ala Asn Ala Thr Pro 
    530                 535                 540                 


Leu Ser Thr Leu Ser Leu Lys His Trp Asp Gln Asp Asp Asp Phe Glu 
545                 550                 555                 560 


Phe Thr Gly Ser His Leu Thr Val Arg Asn Gly Tyr Ser Cys Val Pro 
                565                 570                 575     


Val Ala Leu Ala Glu Gly Leu Asp Ile Lys Leu Asn Thr Ala Val Arg 
            580                 585                 590         


Gln Val Arg Tyr Thr Ala Ser Gly Cys Glu Val Ile Ala Val Asn Thr 
        595                 600                 605             


Arg Ser Thr Ser Gln Thr Phe Ile Tyr Lys Cys Asp Ala Val Leu Cys 
    610                 615                 620                 


Thr Leu Pro Leu Gly Val Leu Lys Gln Gln Pro Pro Ala Val Gln Phe 
625                 630                 635                 640 


Val Pro Pro Leu Pro Glu Trp Lys Thr Ser Ala Val Gln Arg Met Gly 
                645                 650                 655     


Phe Gly Asn Leu Asn Lys Val Val Leu Cys Phe Asp Arg Val Phe Trp 
            660                 665                 670         


Asp Pro Ser Val Asn Leu Phe Gly His Val Gly Ser Thr Thr Ala Ser 
        675                 680                 685             


Arg Gly Glu Leu Phe Leu Phe Trp Asn Leu Tyr Lys Ala Pro Ile Leu 
    690                 695                 700                 


Leu Ala Leu Val Ala Gly Glu Ala Ala Gly Ile Met Glu Asn Ile Ser 
705                 710                 715                 720 


Asp Asp Val Ile Val Gly Arg Cys Leu Ala Ile Leu Lys Gly Ile Phe 
                725                 730                 735     


Gly Ser Ser Ala Val Pro Gln Pro Lys Glu Thr Val Val Ser Arg Trp 
            740                 745                 750         


Arg Ala Asp Pro Trp Ala Arg Gly Ser Tyr Ser Tyr Val Ala Ala Gly 
        755                 760                 765             


Ser Ser Gly Asn Asp Tyr Asp Leu Met Ala Gln Pro Ile Thr Pro Gly 
    770                 775                 780                 


Pro Ser Ile Pro Gly Ala Pro Gln Pro Ile Pro Arg Leu Phe Phe Ala 
785                 790                 795                 800 


Gly Glu His Thr Ile Arg Asn Tyr Pro Ala Thr Val His Gly Ala Leu 
                805                 810                 815     


Leu Ser Gly Leu Arg Glu Ala Gly Arg Ile Ala Asp Gln Phe Leu Gly 
            820                 825                 830         


Ala Met Tyr Thr Leu Pro Arg Gln Ala Thr Pro Gly Val Pro Ala Gln 
        835                 840                 845             


Gln Ser Pro Ser Met 
    850             


<210>  6
<211>  8717
<212>  DNA
<213>  artificial sequence

<220>
<223>  pBS-ROSA26-AOF2


<220>
<221>  promoter
<222>  (625)..(645)
<223>  T7 promoter

<220>
<221>  misc_feature
<222>  (1018)..(1381)
<223>  HS4 insulator

<220>
<221>  promoter
<222>  (1414)..(2225)
<223>  ROSA26 promoter

<220>
<221>  Intron
<222>  (2311)..(2878)
<223>  rabbit beta-globin intron

<220>
<221>  misc_feature
<222>  (3079)..(3106)
<223>  sequence encoding FLAG tag

<220>
<221>  CDS
<222>  (3079)..(5667)

<220>
<221>  misc_feature
<222>  (3112)..(5667)
<223>  seqence encoding human LSD1

<220>
<221>  polyA_site
<222>  (5686)..(5819)
<223>  SV40 poly A site

<220>
<221>  misc_feature
<222>  (5896)..(6434)
<223>  HS4 insulator

<220>
<221>  misc_feature
<222>  (6914)..(7581)
<223>  pUC origin

<220>
<221>  misc_feature
<222>  (8589)..(7729)
<223>  sequence encoding beta-lactamase

<400>  6
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc       60

attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga      120

gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc      180

caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc      240

ctaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag      300

cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa      360

agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac      420

cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg      480

caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg      540

gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg      600

taaaacgacg gccagtgagc gcgcgtaata cgactcacta tagggcgaat tgggtaccgg      660

gccccccctc gaggtcgacg gtatcgataa gcttgattcg agctctgtac atgtccgcgg      720

tcgcgacgta cgcgtatcga tggcgccagc tgcaggcggc cgccatatgc atcctaggcc      780

tattaatatt ccggagtata cgtagccggc taacgttaac aaccggtacc gagttggcgc      840

gcctgggagc tcacggggac agcccccccc caaagccccc agggatgtaa ttacgtccct      900

cccccgctag ggggcagcag cgagccgccc ggggctccgc tccggtccgg cgctcccccc      960

gcatccccga gccggcagcg tgcggggaca gcccgggcac ggggaaggtg gcacgggatc     1020

gctttcctct gaacgcttct cgctgctctt tgagcctgca gacacctggg gggatacggg     1080

gaaaaagctt taggctgaaa gagagattta gaatgacggg cgcgcctggg agctcacggg     1140

gacagccccc ccccaaagcc cccagggatg taattacgtc cctcccccgc tagggggcag     1200

cagcgagccg cccggggctc cgctccggtc cggcgctccc cccgcatccc cgagccggca     1260

gcgtgcgggg acagcccggg cacggggaag gtggcacggg atcgctttcc tctgaacgct     1320

tctcgctgct ctttgagcct gcagacacct ggggggatac ggggaaaaag ctgggcgcgc     1380

caattaaccc tcactaaagg gggtacctct agtcgactag atgaaggaga gcctttctct     1440

ctgggcaaga gcggtgcaat ggtgtgtaaa ggtagctgag aagacgaaaa gggcaagcat     1500

cttcctgcta ccaggctggg gaggcccagg cccacgaccc cgaggagagg gaacgcaggg     1560

agactgaggt gacccttctt tcccccgggg cccggtcgtg tggttcggtg tctcttttct     1620

gttggaccct taccttgacc caggcgctgc cggggcctgg gcccgggctg cggcgcacgg     1680

cactcccggg aggcagcgag actcgagtta ggcccaacgc ggcgccacgg cgtttcctgg     1740

ccgggaatgg cccgtacccg tgaggtgggg gtggggggca gaaaaggcgg agcgagcccg     1800

aggcggggag ggggagggcc aggggcggag ggggccggca ctactgtgtt ggcggactgg     1860

cgggactagg gctgcgtgag tctctgagcg caggcgggcg gcggccgccc ctcccccggc     1920

ggcggcagcg gcggcagcgg cggcagctca ctcagcccgc tgcccgagcg gaaacgccac     1980

tgaccgcacg gggattccca gtgccggcgc caggggcacg cgggacacgc cccctcccgc     2040

cgcgccattg gcctctccgc ccaccgcccc acacttattg gccggtgcgc cgccaatcag     2100

cggaggctgc cggggccgcc taaagaagag gctgtgcttt ggggctccgg ctcctcagag     2160

agcctcggct aggtagggga tcgggactct ggcgggaggg cggcttggtg cgtttgcggg     2220

gatccactag ttctagaact atagctagca tgcgcaaatt taaagcgctg atatcgatcg     2280

cgcgcagatc ctaagaactt ccaggggagg tttggggacc cttgattgtt ctttcttttt     2340

cgctattgta aaattcatgt tatatggagg gggcaaagtt ttcagggtgt tgtttagaat     2400

gggaagatgt cccttgtatc accatggacc ctcatgataa ttttgtttct ttcactttct     2460

actctgttga caaccattgt ctcctcttat tttcttttca ttttctgtaa ctttttcgtt     2520

aaactttagc ttgcatttgt aacgaatttt taaattcact tttgtttatt tgtcagattg     2580

taagtacttt ctctaatcac ttttttttca aggcaatcag ggtatattat attgtacttc     2640

agcacagttt tagagaacaa ttgttataat taaatgataa ggtagaatat ttctgcatat     2700

aaattctggc tggcgtggaa atattcttat tggtagaaac aactacaccc tggtcatcat     2760

cctgcctttc tctttatggt tacaatgata tacactgttt gagatgagga taaaatactc     2820

tgagtccaaa ccgggcccct ctgctaacca tgttcatgcc ttcttctctt tcctacagct     2880

cctgggcaac gtgctggtta tgtgctgtct catcaaatgg caaagaattc atggctggtg     2940

accacgtcgt ggaatgcctt cgaattcagc acctgcacat gggacgtcga cctgaggtaa     3000

ttataacccg ggccctatat atggatccag atcgatcatc aggatcggta ccgggccccc     3060

cctcgagaag cttccacc atg gac tac aag gac gac gat gac aag gaa ttc       3111
                    Met Asp Tyr Lys Asp Asp Asp Asp Lys Glu Phe           
                    1               5                   10                

tta tct ggg aag aag gcg gca gcc gcg gcg gcg gcg gct gca gcg gca       3159
Leu Ser Gly Lys Lys Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala           
            15                  20                  25                    

gca acc ggg acg gag gct ggc cct ggg aca gca ggc ggc tcc gag aac       3207
Ala Thr Gly Thr Glu Ala Gly Pro Gly Thr Ala Gly Gly Ser Glu Asn           
        30                  35                  40                        

ggg tct gag gtg gcc gcg cag ccc gcg ggc ctg tcg ggc cca gcc gag       3255
Gly Ser Glu Val Ala Ala Gln Pro Ala Gly Leu Ser Gly Pro Ala Glu           
    45                  50                  55                            

gtc ggg ccg ggg gcg gtg ggg gag cgc aca ccc cgc aag aaa gag cct       3303
Val Gly Pro Gly Ala Val Gly Glu Arg Thr Pro Arg Lys Lys Glu Pro           
60                  65                  70                  75            

ccg cgg gcc tcg ccc ccc ggg ggc ctg gcg gaa ccg ccg ggg tcc gca       3351
Pro Arg Ala Ser Pro Pro Gly Gly Leu Ala Glu Pro Pro Gly Ser Ala           
                80                  85                  90                

ggg cct cag gcc ggc cct act gtc gtg cct ggg tct gcg acc ccc atg       3399
Gly Pro Gln Ala Gly Pro Thr Val Val Pro Gly Ser Ala Thr Pro Met           
            95                  100                 105                   

gaa act gga ata gca gag act ccg gag ggg cgt cgg acc agc cgg cgc       3447
Glu Thr Gly Ile Ala Glu Thr Pro Glu Gly Arg Arg Thr Ser Arg Arg           
        110                 115                 120                       

aag cgg gcg aag gta gag tac aga gag atg gat gaa agc ttg gcc aac       3495
Lys Arg Ala Lys Val Glu Tyr Arg Glu Met Asp Glu Ser Leu Ala Asn           
    125                 130                 135                           

ctc tca gaa gat gag tat tat tca gaa gaa gag aga aat gcc aaa gca       3543
Leu Ser Glu Asp Glu Tyr Tyr Ser Glu Glu Glu Arg Asn Ala Lys Ala           
140                 145                 150                 155           

gag aag gaa aag aag ctt ccc cca cca ccc cct caa gcc cca cct gag       3591
Glu Lys Glu Lys Lys Leu Pro Pro Pro Pro Pro Gln Ala Pro Pro Glu           
                160                 165                 170               

gaa gaa aat gaa agt gag cct gaa gaa cca tcg ggt gtg gag ggc gca       3639
Glu Glu Asn Glu Ser Glu Pro Glu Glu Pro Ser Gly Val Glu Gly Ala           
            175                 180                 185                   

gct ttc cag agc cga ctt cct cat gac cgg atg act tct caa gaa gca       3687
Ala Phe Gln Ser Arg Leu Pro His Asp Arg Met Thr Ser Gln Glu Ala           
        190                 195                 200                       

gcc tgt ttt cca gat att atc agt gga cca caa cag acc cag aag gtt       3735
Ala Cys Phe Pro Asp Ile Ile Ser Gly Pro Gln Gln Thr Gln Lys Val           
    205                 210                 215                           

ttt ctt ttc att aga aac cgc aca ctg cag ttg tgg ttg gat aat cca       3783
Phe Leu Phe Ile Arg Asn Arg Thr Leu Gln Leu Trp Leu Asp Asn Pro           
220                 225                 230                 235           

aag att cag ctg aca ttt gag gct act ctc caa caa tta gaa gca cct       3831
Lys Ile Gln Leu Thr Phe Glu Ala Thr Leu Gln Gln Leu Glu Ala Pro           
                240                 245                 250               

tat aac agt gat act gtg ctt gtc cac cga gtt cac agt tat tta gag       3879
Tyr Asn Ser Asp Thr Val Leu Val His Arg Val His Ser Tyr Leu Glu           
            255                 260                 265                   

cgt cat ggt ctt atc aac ttc ggc atc tat aag agg ata aaa ccc cta       3927
Arg His Gly Leu Ile Asn Phe Gly Ile Tyr Lys Arg Ile Lys Pro Leu           
        270                 275                 280                       

cca act aaa aag aca gga aag gta att att ata ggc tct ggg gtc tca       3975
Pro Thr Lys Lys Thr Gly Lys Val Ile Ile Ile Gly Ser Gly Val Ser           
    285                 290                 295                           

ggc ttg gca gca gct cga cag tta caa agt ttt gga atg gat gtc aca       4023
Gly Leu Ala Ala Ala Arg Gln Leu Gln Ser Phe Gly Met Asp Val Thr           
300                 305                 310                 315           

ctt ttg gaa gcc agg gat cgt gtg ggt gga cga gtt gcc aca ttt cgc       4071
Leu Leu Glu Ala Arg Asp Arg Val Gly Gly Arg Val Ala Thr Phe Arg           
                320                 325                 330               

aaa gga aac tat gta gct gat ctt gga gcc atg gtg gta aca ggt ctt       4119
Lys Gly Asn Tyr Val Ala Asp Leu Gly Ala Met Val Val Thr Gly Leu           
            335                 340                 345                   

gga ggg aat cct atg gct gtg gtc agc aaa caa gta aat atg gaa ctg       4167
Gly Gly Asn Pro Met Ala Val Val Ser Lys Gln Val Asn Met Glu Leu           
        350                 355                 360                       

gcc aag atc aag caa aaa tgc cca ctt tat gaa gcc aac gga caa gct       4215
Ala Lys Ile Lys Gln Lys Cys Pro Leu Tyr Glu Ala Asn Gly Gln Ala           
    365                 370                 375                           

gtt cct aaa gag aaa gat gaa atg gta gag caa gag ttt aac cgg ttg       4263
Val Pro Lys Glu Lys Asp Glu Met Val Glu Gln Glu Phe Asn Arg Leu           
380                 385                 390                 395           

cta gaa gct aca tct tac ctt agt cat caa cta gac ttc aat gtc ctc       4311
Leu Glu Ala Thr Ser Tyr Leu Ser His Gln Leu Asp Phe Asn Val Leu           
                400                 405                 410               

aat aat aag cct gtg tcc ctt ggc cag gca ttg gaa gtt gtc att cag       4359
Asn Asn Lys Pro Val Ser Leu Gly Gln Ala Leu Glu Val Val Ile Gln           
            415                 420                 425                   

tta caa gag aag cat gtc aaa gat gag cag att gaa cat tgg aag aag       4407
Leu Gln Glu Lys His Val Lys Asp Glu Gln Ile Glu His Trp Lys Lys           
        430                 435                 440                       

ata gtg aaa act cag gaa gaa ttg aaa gaa ctt ctt aat aag atg gta       4455
Ile Val Lys Thr Gln Glu Glu Leu Lys Glu Leu Leu Asn Lys Met Val           
    445                 450                 455                           

aat ttg aaa gag aaa att aaa gaa ctc cat cag caa tac aaa gaa gca       4503
Asn Leu Lys Glu Lys Ile Lys Glu Leu His Gln Gln Tyr Lys Glu Ala           
460                 465                 470                 475           

tct gaa gta aag cca ccc aga gat att act gcc gag ttc tta gtg aaa       4551
Ser Glu Val Lys Pro Pro Arg Asp Ile Thr Ala Glu Phe Leu Val Lys           
                480                 485                 490               

agc aaa cac agg gat ctg acc gcc cta tgc aag gaa tat gat gaa tta       4599
Ser Lys His Arg Asp Leu Thr Ala Leu Cys Lys Glu Tyr Asp Glu Leu           
            495                 500                 505                   

gct gaa aca caa gga aag cta gaa gaa aaa ctt cag gag ttg gaa gcg       4647
Ala Glu Thr Gln Gly Lys Leu Glu Glu Lys Leu Gln Glu Leu Glu Ala           
        510                 515                 520                       

aat ccc cca agt gat gta tat ctc tca tca aga gac aga caa ata ctt       4695
Asn Pro Pro Ser Asp Val Tyr Leu Ser Ser Arg Asp Arg Gln Ile Leu           
    525                 530                 535                           

gat tgg cat ttt gca aat ctt gaa ttt gct aat gcc aca cct ctc tca       4743
Asp Trp His Phe Ala Asn Leu Glu Phe Ala Asn Ala Thr Pro Leu Ser           
540                 545                 550                 555           

act ctc tcc ctt aag cac tgg gat cag gat gat gac ttt gag ttc act       4791
Thr Leu Ser Leu Lys His Trp Asp Gln Asp Asp Asp Phe Glu Phe Thr           
                560                 565                 570               

ggc agc cac ctg aca gta agg aat ggc tac tcg tgt gtg cct gtg gct       4839
Gly Ser His Leu Thr Val Arg Asn Gly Tyr Ser Cys Val Pro Val Ala           
            575                 580                 585                   

tta gca gaa ggc cta gac att aaa ctg aat aca gca gtg cga cag gtt       4887
Leu Ala Glu Gly Leu Asp Ile Lys Leu Asn Thr Ala Val Arg Gln Val           
        590                 595                 600                       

cgc tac acg gct tca gga tgt gaa gtg ata gct gtg aat acc cgc tcc       4935
Arg Tyr Thr Ala Ser Gly Cys Glu Val Ile Ala Val Asn Thr Arg Ser           
    605                 610                 615                           

acg agt caa acc ttt att tat aaa tgc gac gca gtt ctc tgt acc ctt       4983
Thr Ser Gln Thr Phe Ile Tyr Lys Cys Asp Ala Val Leu Cys Thr Leu           
620                 625                 630                 635           

ccc ctg ggt gtg ctg aag cag cag cca cca gcc gtt cag ttt gtg cca       5031
Pro Leu Gly Val Leu Lys Gln Gln Pro Pro Ala Val Gln Phe Val Pro           
                640                 645                 650               

cct ctc cct gag tgg aaa aca tct gca gtc caa agg atg gga ttt ggc       5079
Pro Leu Pro Glu Trp Lys Thr Ser Ala Val Gln Arg Met Gly Phe Gly           
            655                 660                 665                   

aac ctt aac aag gtg gtg ttg tgt ttt gat cgg gtg ttc tgg gat cca       5127
Asn Leu Asn Lys Val Val Leu Cys Phe Asp Arg Val Phe Trp Asp Pro           
        670                 675                 680                       

agt gtc aat ttg ttc ggg cat gtt ggc agt acg act gcc agc agg ggt       5175
Ser Val Asn Leu Phe Gly His Val Gly Ser Thr Thr Ala Ser Arg Gly           
    685                 690                 695                           

gag ctc ttc ctc ttc tgg aac ctc tat aaa gct cca ata ctg ttg gca       5223
Glu Leu Phe Leu Phe Trp Asn Leu Tyr Lys Ala Pro Ile Leu Leu Ala           
700                 705                 710                 715           

cta gtg gca gga gaa gct gct ggt atc atg gaa aac ata agt gac gat       5271
Leu Val Ala Gly Glu Ala Ala Gly Ile Met Glu Asn Ile Ser Asp Asp           
                720                 725                 730               

gtg att gtt ggc cga tgc ctg gcc att ctc aaa ggg att ttt ggt agc       5319
Val Ile Val Gly Arg Cys Leu Ala Ile Leu Lys Gly Ile Phe Gly Ser           
            735                 740                 745                   

agt gca gta cct cag ccc aaa gaa act gtg gtg tct cgt tgg cgt gct       5367
Ser Ala Val Pro Gln Pro Lys Glu Thr Val Val Ser Arg Trp Arg Ala           
        750                 755                 760                       

gat ccc tgg gct cgg ggc tct tat tcc tat gtt gct gca gga tca tct       5415
Asp Pro Trp Ala Arg Gly Ser Tyr Ser Tyr Val Ala Ala Gly Ser Ser           
    765                 770                 775                           

gga aat gac tat gat tta atg gct cag cca atc act cct ggc ccc tcg       5463
Gly Asn Asp Tyr Asp Leu Met Ala Gln Pro Ile Thr Pro Gly Pro Ser           
780                 785                 790                 795           

att cca ggt gcc cca cag ccg att cca cga ctc ttc ttt gcg gga gaa       5511
Ile Pro Gly Ala Pro Gln Pro Ile Pro Arg Leu Phe Phe Ala Gly Glu           
                800                 805                 810               

cat acg atc cgt aac tac cca gcc aca gtg cat ggt gct ctg ctg agt       5559
His Thr Ile Arg Asn Tyr Pro Ala Thr Val His Gly Ala Leu Leu Ser           
            815                 820                 825                   

ggg ctg cga gaa gcg gga aga att gca gac cag ttt ttg ggg gcc atg       5607
Gly Leu Arg Glu Ala Gly Arg Ile Ala Asp Gln Phe Leu Gly Ala Met           
        830                 835                 840                       

tat acg ctg cct cgc cag gcc aca cca ggt gtt cct gca cag cag tcc       5655
Tyr Thr Leu Pro Arg Gln Ala Thr Pro Gly Val Pro Ala Gln Gln Ser           
    845                 850                 855                           

cca agc atg tga gctaggatct tattaaagca gaacttgttt attgcagctt           5707
Pro Ser Met                                                               
860                                                                       

ataatggtta caaataaagc aatagcatca caaatttcac aaataaagca tttttttcac     5767

tgcattctag ttgtggtttg tccaaactca tcaatgtatc ttatcatgtc tggtcgactc     5827

tagcagtgaa agtctgcaat gaattcgagt tggcgcgcct gtcattctaa atctctcttt     5887

cagcctaaag ctttttcccc gtatcccccc aggtgtctgc aggctcaaag agcagcgaga     5947

agcgttcaga ggaaagcgat cccgtgccac cttccccgtg cccgggctgt ccccgcacgc     6007

tgccggctcg gggatgcggg gggagcgccg gaccggagcg gagccccggg cggctcgctg     6067

ctgcccccta gcgggggagg gacgtaatta catccctggt gggctttggg aggggggctg     6127

tccccgtgag ctcccaggcg cgcctgtcat tctaaatctc tctttcagcc taaagctttt     6187

tccccgtatc cccccaggtg tctgcaggct caaagagcag cgagaagcgt tcagaggaaa     6247

gcgatcccgt gccaccttcc ccgtgcccgg gctgtccccg cacgctgccg gctcggggat     6307

gcggggggag cgccggaccg gagcggagcc ccgggcggct cgctgctgcc ccctagcggg     6367

ggagggacgt aattacatcc ctgggggctt tggggggggg ctgtccccgt gagctcccag     6427

gcgcgccaac tcgctagagg taccggttgt taacgttagc cggctacgta tactccggaa     6487

tattaatagg cctaggatgc atatggcggc cgccaccgcg gtggagctcc agcttttgtt     6547

gcgcgcttgg cgtaatcatg gtcatagctg tttcctgtgt gaaattgtta tccgctcaca     6607

attccacaca acatacgagc cggaagcata aagtgtaaag cctggggtgc ctaatgagtg     6667

agctaactca cattaattgc gttgcgctca ctgcccgctt tccagtcggg aaacctgtcg     6727

tgccagctgc attaatgaat cggccaacgc gcggggagag gcggtttgcg tattgggcgc     6787

tcttccgctt cctcgctcac tgactcgctg cgctcggtcg ttcggctgcg gcgagcggta     6847

tcagctcact caaaggcggt aatacggtta tccacagaat caggggataa cgcaggaaag     6907

aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg     6967

tttttccata ggctccgccc ccctgacgag catcacaaaa atcgacgctc aagtcagagg     7027

tggcgaaacc cgacaggact ataaagatac caggcgtttc cccctggaag ctccctcgtg     7087

cgctctcctg ttccgaccct gccgcttacc ggatacctgt ccgcctttct cccttcggga     7147

agcgtggcgc tttctcatag ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc     7207

tccaagctgg gctgtgtgca cgaacccccc gttcagcccg accgctgcgc cttatccggt     7267

aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc agcagccact     7327

ggtaacagga ttagcagagc gaggtatgta ggcggtgcta cagagttctt gaagtggtgg     7387

cctaactacg gctacactag aaggacagta tttggtatct gcgctctgct gaagccagtt     7447

accttcggaa aaagagttgg tagctcttga tccggcaaac aaaccaccgc tggtagcggt     7507

ggtttttttg tttgcaagca gcagattacg cgcagaaaaa aaggatctca agaagatcct     7567

ttgatctttt ctacggggtc tgacgctcag tggaacgaaa actcacgtta agggattttg     7627

gtcatgagat tatcaaaaag gatcttcacc tagatccttt taaattaaaa atgaagtttt     7687

aaatcaatct aaagtatata tgagtaaact tggtctgaca gttaccaatg cttaatcagt     7747

gaggcaccta tctcagcgat ctgtctattt cgttcatcca tagttgcctg actccccgtc     7807

gtgtagataa ctacgatacg ggagggctta ccatctggcc ccagtgctgc aatgataccg     7867

cgagacccac gctcaccggc tccagattta tcagcaataa accagccagc cggaagggcc     7927

gagcgcagaa gtggtcctgc aactttatcc gcctccatcc agtctattaa ttgttgccgg     7987

gaagctagag taagtagttc gccagttaat agtttgcgca acgttgttgc cattgctaca     8047

ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat tcagctccgg ttcccaacga     8107

tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag cggttagctc cttcggtcct     8167

ccgatcgttg tcagaagtaa gttggccgca gtgttatcac tcatggttat ggcagcactg     8227

cataattctc ttactgtcat gccatccgta agatgctttt ctgtgactgg tgagtactca     8287

accaagtcat tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaata     8347

cgggataata ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg aaaacgttct     8407

tcggggcgaa aactctcaag gatcttaccg ctgttgagat ccagttcgat gtaacccact     8467

cgtgcaccca actgatcttc agcatctttt actttcacca gcgtttctgg gtgagcaaaa     8527

acaggaaggc aaaatgccgc aaaaaaggga ataagggcga cacggaaatg ttgaatactc     8587

atactcttcc tttttcaata ttattgaagc atttatcagg gttattgtct catgagcgga     8647

tacatatttg aatgtattta gaaaaataaa caaatagggg ttccgcgcac atttccccga     8707

aaagtgccac                                                            8717


<210>  7
<211>  862
<212>  PRT
<213>  artificial sequence

<220>
<223>  Synthetic Construct

<400>  7

Met Asp Tyr Lys Asp Asp Asp Asp Lys Glu Phe Leu Ser Gly Lys Lys 
1               5                   10                  15      


Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Thr Gly Thr Glu 
            20                  25                  30          


Ala Gly Pro Gly Thr Ala Gly Gly Ser Glu Asn Gly Ser Glu Val Ala 
        35                  40                  45              


Ala Gln Pro Ala Gly Leu Ser Gly Pro Ala Glu Val Gly Pro Gly Ala 
    50                  55                  60                  


Val Gly Glu Arg Thr Pro Arg Lys Lys Glu Pro Pro Arg Ala Ser Pro 
65                  70                  75                  80  


Pro Gly Gly Leu Ala Glu Pro Pro Gly Ser Ala Gly Pro Gln Ala Gly 
                85                  90                  95      


Pro Thr Val Val Pro Gly Ser Ala Thr Pro Met Glu Thr Gly Ile Ala 
            100                 105                 110         


Glu Thr Pro Glu Gly Arg Arg Thr Ser Arg Arg Lys Arg Ala Lys Val 
        115                 120                 125             


Glu Tyr Arg Glu Met Asp Glu Ser Leu Ala Asn Leu Ser Glu Asp Glu 
    130                 135                 140                 


Tyr Tyr Ser Glu Glu Glu Arg Asn Ala Lys Ala Glu Lys Glu Lys Lys 
145                 150                 155                 160 


Leu Pro Pro Pro Pro Pro Gln Ala Pro Pro Glu Glu Glu Asn Glu Ser 
                165                 170                 175     


Glu Pro Glu Glu Pro Ser Gly Val Glu Gly Ala Ala Phe Gln Ser Arg 
            180                 185                 190         


Leu Pro His Asp Arg Met Thr Ser Gln Glu Ala Ala Cys Phe Pro Asp 
        195                 200                 205             


Ile Ile Ser Gly Pro Gln Gln Thr Gln Lys Val Phe Leu Phe Ile Arg 
    210                 215                 220                 


Asn Arg Thr Leu Gln Leu Trp Leu Asp Asn Pro Lys Ile Gln Leu Thr 
225                 230                 235                 240 


Phe Glu Ala Thr Leu Gln Gln Leu Glu Ala Pro Tyr Asn Ser Asp Thr 
                245                 250                 255     


Val Leu Val His Arg Val His Ser Tyr Leu Glu Arg His Gly Leu Ile 
            260                 265                 270         


Asn Phe Gly Ile Tyr Lys Arg Ile Lys Pro Leu Pro Thr Lys Lys Thr 
        275                 280                 285             


Gly Lys Val Ile Ile Ile Gly Ser Gly Val Ser Gly Leu Ala Ala Ala 
    290                 295                 300                 


Arg Gln Leu Gln Ser Phe Gly Met Asp Val Thr Leu Leu Glu Ala Arg 
305                 310                 315                 320 


Asp Arg Val Gly Gly Arg Val Ala Thr Phe Arg Lys Gly Asn Tyr Val 
                325                 330                 335     


Ala Asp Leu Gly Ala Met Val Val Thr Gly Leu Gly Gly Asn Pro Met 
            340                 345                 350         


Ala Val Val Ser Lys Gln Val Asn Met Glu Leu Ala Lys Ile Lys Gln 
        355                 360                 365             


Lys Cys Pro Leu Tyr Glu Ala Asn Gly Gln Ala Val Pro Lys Glu Lys 
    370                 375                 380                 


Asp Glu Met Val Glu Gln Glu Phe Asn Arg Leu Leu Glu Ala Thr Ser 
385                 390                 395                 400 


Tyr Leu Ser His Gln Leu Asp Phe Asn Val Leu Asn Asn Lys Pro Val 
                405                 410                 415     


Ser Leu Gly Gln Ala Leu Glu Val Val Ile Gln Leu Gln Glu Lys His 
            420                 425                 430         


Val Lys Asp Glu Gln Ile Glu His Trp Lys Lys Ile Val Lys Thr Gln 
        435                 440                 445             


Glu Glu Leu Lys Glu Leu Leu Asn Lys Met Val Asn Leu Lys Glu Lys 
    450                 455                 460                 


Ile Lys Glu Leu His Gln Gln Tyr Lys Glu Ala Ser Glu Val Lys Pro 
465                 470                 475                 480 


Pro Arg Asp Ile Thr Ala Glu Phe Leu Val Lys Ser Lys His Arg Asp 
                485                 490                 495     


Leu Thr Ala Leu Cys Lys Glu Tyr Asp Glu Leu Ala Glu Thr Gln Gly 
            500                 505                 510         


Lys Leu Glu Glu Lys Leu Gln Glu Leu Glu Ala Asn Pro Pro Ser Asp 
        515                 520                 525             


Val Tyr Leu Ser Ser Arg Asp Arg Gln Ile Leu Asp Trp His Phe Ala 
    530                 535                 540                 


Asn Leu Glu Phe Ala Asn Ala Thr Pro Leu Ser Thr Leu Ser Leu Lys 
545                 550                 555                 560 


His Trp Asp Gln Asp Asp Asp Phe Glu Phe Thr Gly Ser His Leu Thr 
                565                 570                 575     


Val Arg Asn Gly Tyr Ser Cys Val Pro Val Ala Leu Ala Glu Gly Leu 
            580                 585                 590         


Asp Ile Lys Leu Asn Thr Ala Val Arg Gln Val Arg Tyr Thr Ala Ser 
        595                 600                 605             


Gly Cys Glu Val Ile Ala Val Asn Thr Arg Ser Thr Ser Gln Thr Phe 
    610                 615                 620                 


Ile Tyr Lys Cys Asp Ala Val Leu Cys Thr Leu Pro Leu Gly Val Leu 
625                 630                 635                 640 


Lys Gln Gln Pro Pro Ala Val Gln Phe Val Pro Pro Leu Pro Glu Trp 
                645                 650                 655     


Lys Thr Ser Ala Val Gln Arg Met Gly Phe Gly Asn Leu Asn Lys Val 
            660                 665                 670         


Val Leu Cys Phe Asp Arg Val Phe Trp Asp Pro Ser Val Asn Leu Phe 
        675                 680                 685             


Gly His Val Gly Ser Thr Thr Ala Ser Arg Gly Glu Leu Phe Leu Phe 
    690                 695                 700                 


Trp Asn Leu Tyr Lys Ala Pro Ile Leu Leu Ala Leu Val Ala Gly Glu 
705                 710                 715                 720 


Ala Ala Gly Ile Met Glu Asn Ile Ser Asp Asp Val Ile Val Gly Arg 
                725                 730                 735     


Cys Leu Ala Ile Leu Lys Gly Ile Phe Gly Ser Ser Ala Val Pro Gln 
            740                 745                 750         


Pro Lys Glu Thr Val Val Ser Arg Trp Arg Ala Asp Pro Trp Ala Arg 
        755                 760                 765             


Gly Ser Tyr Ser Tyr Val Ala Ala Gly Ser Ser Gly Asn Asp Tyr Asp 
    770                 775                 780                 


Leu Met Ala Gln Pro Ile Thr Pro Gly Pro Ser Ile Pro Gly Ala Pro 
785                 790                 795                 800 


Gln Pro Ile Pro Arg Leu Phe Phe Ala Gly Glu His Thr Ile Arg Asn 
                805                 810                 815     


Tyr Pro Ala Thr Val His Gly Ala Leu Leu Ser Gly Leu Arg Glu Ala 
            820                 825                 830         


Gly Arg Ile Ala Asp Gln Phe Leu Gly Ala Met Tyr Thr Leu Pro Arg 
        835                 840                 845             


Gln Ala Thr Pro Gly Val Pro Ala Gln Gln Ser Pro Ser Met 
    850                 855                 860         


<210>  8
<211>  20
<212>  DNA
<213>  artificial sequence

<220>
<223>  primer 1

<400>  8
aatgccttcg aattcagcac                                                   20


<210>  9
<211>  20
<212>  DNA
<213>  artificial sequence

<220>
<223>  primer 2

<400>  9
ccttgtcatc gtcgtccttg                                                   20


<210>  10
<211>  18
<212>  DNA
<213>  artificial sequence

<220>
<223>  primer 3

<400>  10
gactacaagg acgacgat                                                     18


<210>  11
<211>  31
<212>  DNA
<213>  artificial sequence

<220>
<223>  primer 4

<400>  11
ccgctcgagt cagctttcat ccatctctct g                                      31


