                         SEQUENCE LISTING

<110>  Cancer Research Technology Limited and Azeria Therapeutics 
       Limited
 
<120>  Screen for inhibitors

<130>  P256015WO

<150>  GB 1909228.7
<151>  2019-06-27

<160>  41    

<170>  PatentIn version 3.5

<210>  1
<211>  245
<212>  PRT
<213>  Enterobacteria phage T4

<400>  1

Met Lys Ser Gly Ile Tyr Gln Ile Lys Asn Thr Leu Asn Asn Lys Val 
1               5                   10                  15      


Tyr Val Gly Ser Ala Lys Asp Phe Glu Lys Arg Trp Lys Arg His Phe 
            20                  25                  30          


Lys Asp Leu Glu Lys Gly Cys His Ser Ser Ile Lys Leu Gln Arg Ser 
        35                  40                  45              


Phe Asn Lys His Gly Asn Val Phe Glu Cys Ser Ile Leu Glu Glu Ile 
    50                  55                  60                  


Pro Tyr Glu Lys Asp Leu Ile Ile Glu Arg Glu Asn Phe Trp Ile Lys 
65                  70                  75                  80  


Glu Leu Asn Ser Lys Ile Asn Gly Tyr Asn Ile Ala Asp Ala Thr Phe 
                85                  90                  95      


Gly Asp Thr Cys Ser Thr His Pro Leu Lys Glu Glu Ile Ile Lys Lys 
            100                 105                 110         


Arg Ser Glu Thr Val Lys Ala Lys Met Leu Lys Leu Gly Pro Asp Gly 
        115                 120                 125             


Arg Lys Ala Leu Tyr Ser Lys Pro Gly Ser Lys Asn Gly Arg Trp Asn 
    130                 135                 140                 


Pro Glu Thr His Lys Phe Cys Lys Cys Gly Val Arg Ile Gln Thr Ser 
145                 150                 155                 160 


Ala Tyr Thr Cys Ser Lys Cys Arg Asn Arg Ser Gly Glu Asn Asn Ser 
                165                 170                 175     


Phe Phe Asn His Lys His Ser Asp Ile Thr Lys Ser Lys Ile Ser Glu 
            180                 185                 190         


Lys Met Lys Gly Lys Lys Pro Ser Asn Ile Lys Lys Ile Ser Cys Asp 
        195                 200                 205             


Gly Val Ile Phe Asp Cys Ala Ala Asp Ala Ala Arg His Phe Lys Ile 
    210                 215                 220                 


Ser Ser Gly Leu Val Thr Tyr Arg Val Lys Ser Asp Lys Trp Asn Trp 
225                 230                 235                 240 


Phe Tyr Ile Asn Ala 
                245 


<210>  2
<211>  201
<212>  PRT
<213>  Enterobacteria phage T4

<400>  2

Met Lys Ser Gly Ile Tyr Gln Ile Lys Asn Thr Leu Asn Asn Lys Val 
1               5                   10                  15      


Tyr Val Gly Ser Ala Lys Asp Phe Glu Lys Arg Trp Lys Arg His Phe 
            20                  25                  30          


Lys Asp Leu Glu Lys Gly Cys His Ser Ser Ile Lys Leu Gln Arg Ser 
        35                  40                  45              


Phe Asn Lys His Gly Asn Val Phe Glu Cys Ser Ile Leu Glu Glu Ile 
    50                  55                  60                  


Pro Tyr Glu Lys Asp Leu Ile Ile Glu Arg Glu Asn Phe Trp Ile Lys 
65                  70                  75                  80  


Glu Leu Asn Ser Lys Ile Asn Gly Tyr Asn Ile Ala Asp Ala Thr Phe 
                85                  90                  95      


Gly Asp Thr Cys Ser Thr His Pro Leu Lys Glu Glu Ile Ile Lys Lys 
            100                 105                 110         


Arg Ser Glu Thr Val Lys Ala Lys Met Leu Lys Leu Gly Pro Asp Gly 
        115                 120                 125             


Arg Lys Ala Leu Tyr Ser Lys Pro Gly Ser Lys Asn Gly Arg Trp Asn 
    130                 135                 140                 


Pro Glu Thr His Lys Phe Cys Lys Cys Gly Val Arg Ile Gln Thr Ser 
145                 150                 155                 160 


Ala Tyr Thr Cys Ser Lys Cys Arg Asn Arg Ser Gly Glu Asn Asn Ser 
                165                 170                 175     


Phe Phe Asn His Lys His Ser Asp Ile Thr Lys Ser Lys Ile Ser Glu 
            180                 185                 190         


Lys Met Lys Gly Lys Lys Pro Ser Asn 
        195                 200     


<210>  3
<211>  266
<212>  PRT
<213>  Bacillus mojavensis

<400>  3

Met Lys Ser Gly Val Tyr Lys Ile Thr Asn Lys Asn Thr Gly Lys Phe 
1               5                   10                  15      


Tyr Ile Gly Ser Ser Glu Asp Cys Glu Ser Arg Leu Lys Val His Phe 
            20                  25                  30          


Arg Asn Leu Lys Asn Asn Arg His Ile Asn Arg Tyr Leu Asn Asn Ser 
        35                  40                  45              


Phe Asn Lys His Gly Glu Gln Val Phe Ile Gly Glu Val Ile His Ile 
    50                  55                  60                  


Leu Pro Ile Glu Glu Ala Ile Ala Lys Glu Gln Trp Tyr Ile Asp Asn 
65                  70                  75                  80  


Phe Tyr Glu Glu Met Tyr Asn Ile Ser Lys Ser Ala Tyr His Gly Gly 
                85                  90                  95      


Asp Leu Thr Ser Tyr His Pro Asp Lys Arg Asn Ile Ile Leu Lys Arg 
            100                 105                 110         


Ala Asp Ser Leu Lys Lys Val Tyr Leu Lys Met Thr Ser Glu Glu Lys 
        115                 120                 125             


Ala Lys Arg Trp Gln Cys Val Gln Gly Glu Asn Asn Pro Met Phe Gly 
    130                 135                 140                 


Arg Lys His Thr Glu Thr Thr Lys Leu Lys Ile Ser Asn His Asn Lys 
145                 150                 155                 160 


Leu Tyr Tyr Ser Thr His Lys Asn Pro Phe Lys Gly Lys Lys His Ser 
                165                 170                 175     


Glu Glu Ser Lys Thr Lys Leu Ser Glu Tyr Ala Ser Gln Arg Val Gly 
            180                 185                 190         


Glu Lys Asn Pro Phe Tyr Gly Lys Thr His Ser Asp Glu Phe Lys Thr 
        195                 200                 205             


Tyr Met Ser Lys Lys Phe Lys Gly Arg Lys Pro Lys Asn Ser Arg Pro 
    210                 215                 220                 


Val Ile Ile Asp Gly Thr Glu Tyr Glu Ser Ala Thr Glu Ala Ser Arg 
225                 230                 235                 240 


Gln Leu Asn Val Val Pro Ala Thr Ile Leu His Arg Ile Lys Ser Lys 
                245                 250                 255     


Asn Glu Lys Tyr Ser Gly Tyr Phe Tyr Lys 
            260                 265     


<210>  4
<211>  245
<212>  PRT
<213>  Enterobacteria phage TulA

<400>  4

Met Lys Ser Gly Ile Tyr Gln Ile Lys Asn Thr Leu Asn Gly Lys Val 
1               5                   10                  15      


Tyr Val Gly Ser Ala Lys Asp Phe Glu Lys Arg Trp Lys Arg His Phe 
            20                  25                  30          


Lys Asp Leu Glu Asn Gly Val His Ser Ser Ile Lys Phe Gln Arg Ser 
        35                  40                  45              


Phe Asn Lys His Gly Asn Val Phe Glu Cys Ser Val Leu Glu Glu Ile 
    50                  55                  60                  


Pro Tyr Glu Lys Asp Leu Ile Ile Glu Arg Glu Asn Phe Trp Ile Lys 
65                  70                  75                  80  


Glu Leu Asn Ser Lys Ile Asn Gly Tyr Asn Ile Ala Asp Ala Ser Phe 
                85                  90                  95      


Gly Asp Val Leu Ser Asn His Pro Leu Lys Glu Glu Ile Ala Lys Lys 
            100                 105                 110         


Arg Ala Glu Thr Val Lys Ala Lys Met Leu Lys Leu Gly Pro Asp Gly 
        115                 120                 125             


Arg Lys Ala Leu Tyr Gly Lys His Gly Ser Lys Asn Gly Arg Trp Asn 
    130                 135                 140                 


Pro Glu Asn His Lys Phe Cys Lys Cys Gly Val Arg Ile Pro Ser Ser 
145                 150                 155                 160 


Ala Asp Thr Cys Gly Lys Cys Arg Lys Arg Ser Gly Glu Asn Asn Pro 
                165                 170                 175     


Phe Phe Asn His Lys His Ser Glu Lys Thr Lys Thr Lys Leu Ser Glu 
            180                 185                 190         


Lys Met Lys Gly Lys Lys Pro Ala Asn Ile Lys Lys Ile Ser Cys Asp 
        195                 200                 205             


Gly Ile Ile Phe Glu Cys Ala Ala Asp Ala Ala Arg His Phe Glu Ile 
    210                 215                 220                 


Ser Ser Gly Leu Val Thr Tyr Arg Val Lys Ser Asp Lys Trp Asn Trp 
225                 230                 235                 240 


Phe Tyr Ile Asn Ala 
                245 


<210>  5
<211>  174
<212>  PRT
<213>  Bacteriophage SP01

<400>  5

Met Glu Trp Lys Asp Ile Lys Gly Tyr Glu Gly His Tyr Gln Val Ser 
1               5                   10                  15      


Asn Thr Gly Glu Val Tyr Ser Ile Lys Ser Gly Lys Thr Leu Lys His 
            20                  25                  30          


Gln Ile Pro Lys Asp Gly Tyr His Arg Ile Gly Leu Phe Lys Gly Gly 
        35                  40                  45              


Lys Gly Lys Thr Phe Gln Val His Arg Leu Val Ala Ile His Phe Cys 
    50                  55                  60                  


Glu Gly Tyr Glu Glu Gly Leu Val Val Asp His Lys Asp Gly Asn Lys 
65                  70                  75                  80  


Asp Asn Asn Leu Ser Thr Asn Leu Arg Trp Val Thr Gln Lys Ile Asn 
                85                  90                  95      


Val Glu Asn Gln Met Ser Arg Gly Thr Leu Asn Val Ser Lys Ala Gln 
            100                 105                 110         


Gln Ile Ala Lys Ile Lys Asn Gln Lys Pro Ile Ile Val Ile Ser Pro 
        115                 120                 125             


Asp Gly Ile Glu Lys Glu Tyr Pro Ser Thr Lys Cys Ala Cys Glu Glu 
    130                 135                 140                 


Leu Gly Leu Thr Arg Gly Lys Val Thr Asp Val Leu Lys Gly His Arg 
145                 150                 155                 160 


Ile His His Lys Gly Tyr Thr Phe Arg Tyr Lys Leu Asn Gly 
                165                 170                 


<210>  6
<211>  579
<212>  PRT
<213>  Escherichia coli

<400>  6

Met Lys Met Gly Lys Ile Ala Val Thr Pro Asn Asn Asp Lys Ala Ala 
1               5                   10                  15      


Ser His Ser Lys Asp Glu Asn Leu Ala Thr Asn Thr Ile Lys Lys Asn 
            20                  25                  30          


Val Lys Pro Lys Tyr Gly Asp Ala Ala Phe Leu Glu Tyr Thr Arg Leu 
        35                  40                  45              


Ile Val Asn His Pro Asn Tyr Phe Gly Met Pro Asp Pro Phe Gly Glu 
    50                  55                  60                  


Lys Gly Glu Ile Gln Trp Glu Ala Pro Ser Asn Arg Ala Ser Gly Lys 
65                  70                  75                  80  


Phe Lys His Thr His Gln Arg Arg Tyr Glu Trp Trp Lys Asn Lys Ala 
                85                  90                  95      


Arg Ser Ile Gly Ile Asp Pro Asp Thr Glu Lys Ala Trp Ile Ser Lys 
            100                 105                 110         


Thr Ala Lys Leu Ile His Pro Leu Gly Val Lys Pro Cys Lys Lys Cys 
        115                 120                 125             


Gly Lys Glu Met Glu Leu Ser Tyr Ser Tyr Pro Asn Glu His Phe Phe 
    130                 135                 140                 


Ser Arg Val Arg Lys Leu Asn Tyr Ile Asp Glu Thr Phe Glu Leu Ser 
145                 150                 155                 160 


Gln Asn Glu His Ile Val Asp Leu Leu Thr Arg Leu Asp Asp Arg Phe 
                165                 170                 175     


Gly Glu Arg Ile Tyr Leu Asp Leu Pro His Leu Phe Ser Thr Lys Ser 
            180                 185                 190         


Ile Thr Ile Pro Asp Ile Ser Ser Asn Leu Glu Ala Trp Ile Glu Tyr 
        195                 200                 205             


Leu Lys Glu Gln Tyr Ile Pro Gln Glu Ser Arg Met Leu Ser Pro Gly 
    210                 215                 220                 


Ala Met Ala Asn Pro Pro Asp Arg Phe Asp Gly Phe His Ser Phe Asn 
225                 230                 235                 240 


Arg Cys Cys Arg Ser Ile Ala Asp Lys Gly Arg Thr Lys Glu Asn Leu 
                245                 250                 255     


Lys Ser Tyr Val Thr Asp Arg Arg Val Phe Glu Tyr Trp Val Asp Gly 
            260                 265                 270         


Asp Trp Val Ala Ala Asp Arg Leu Met Gly Gln Val Arg Thr Asn Asn 
        275                 280                 285             


Ile Phe Ile Asn Glu Glu Cys Leu Asn Ala Gly Asn Gly Gly Leu His 
    290                 295                 300                 


Pro Thr Pro Cys Gln Ala Asp His Ile Gly Pro Ile Ser Leu Gly Phe 
305                 310                 315                 320 


Ser His Arg Pro Gln Phe Gln Leu Leu Cys Lys Ser Cys Asn Ser Ala 
                325                 330                 335     


Lys Asn Asn Arg Met Tyr Leu Ser Asp Ile Ile Ser Leu Leu Glu Ala 
            340                 345                 350         


Glu Asn Glu Gly His Thr Val Ile Ser Trp Phe Ala Glu Glu Val Trp 
        355                 360                 365             


Asn Arg Leu Lys His Ser Val Asp Asp Ser Glu Lys Ala Leu Arg Leu 
    370                 375                 380                 


Ser Lys Ile Leu Arg Asp Asn Arg His Thr Tyr Met Asn Leu Leu Lys 
385                 390                 395                 400 


Lys Ile Met Asp Glu Gly Tyr Tyr Thr Phe Leu Ala Ser Leu Leu His 
                405                 410                 415     


Leu Glu Val Ala Asn Tyr Asn Pro Ile Phe Glu Gly Leu Cys Ile Ser 
            420                 425                 430         


Asn His Leu Thr His Tyr Lys Ser Leu Lys Lys Ile Lys Arg Glu Ser 
        435                 440                 445             


Lys Tyr Ala Ala Val Gln Lys Thr Arg Arg Ile Arg Ile Ala Phe Thr 
    450                 455                 460                 


Ser Leu Asn Asp Tyr His Arg Lys Glu Asn Arg Asn Ala Phe Ile Val 
465                 470                 475                 480 


Ser Asn Glu Leu Ser Glu Lys Phe Phe Ser Glu Ala Met Asp Asn Leu 
                485                 490                 495     


Lys Ser Leu Ser Glu Ile Thr Ser Cys Leu Asp Glu Lys Ile Ser Gly 
            500                 505                 510         


Ile Ile Ser Glu Asn Ser Asp Ser Lys Asn Glu Phe Arg Thr Ile Ile 
        515                 520                 525             


Thr Asp Leu Arg Glu Ile Val Thr Asn Asn Lys Glu Lys Phe Asn Leu 
    530                 535                 540                 


Ile Leu Lys Tyr Leu Ile Ser Gly Met Ser Glu Ile Gly Lys Glu Leu 
545                 550                 555                 560 


Glu Ser Tyr Trp Glu Asn Asp Arg Tyr Val Arg Ser Ile Pro Glu Glu 
                565                 570                 575     


Phe Ile Glu 
            


<210>  7
<211>  1782
<212>  DNA
<213>  Homo sapiens

<400>  7
accatgaccc tccacaccaa agcatctggg atggccctac tgcatcagat ccaagggaac       60

gagctggagc ccctgaaccg tccgcagctc aagatccccc tggagcggcc cctgggcgag      120

gtgtacctgg acagcagcaa gcccgccgtg tacaactacc ccgagggcgc cgcctacgag      180

ttcaacgccg cggccgccgc caacgcgcag gtctacggtc agaccggcct cccctacggc      240

cccgggtctg aggctgcggc gttcggctcc aacggcctgg ggggtttccc cccactcaac      300

agcgtgtctc cgagcccgct gatgctactg cacccgccgc cgcagctgtc gcctttcctg      360

cagccccacg gccagcaggt gccctactac ctggagaacg agcccagcgg ctacacggtg      420

cgcgaggcag gcccgccggc attctacagg ccaaattcag ataatcgacg ccagggtggc      480

agagaaagat tggccagtac caatgacaag ggaagtatgg ctatggaatc tgccaaggag      540

actcgctact gtgcagtgtg caatgactat gcttcaggct accattatgg agtctggtcc      600

tgtgagggct gcaaggcctt cttcaagaga agtattcaag gacataacga ctatatgtgt      660

ccagccacca accagtgcac cattgataaa aacaggagga agagctgcca ggcctgccgg      720

ctccgtaaat gctacgaagt gggaatgatg aaaggtggga tacgaaaaga ccgaagagga      780

gggagaatgt tgaaacacaa gcgccagaga gatgatgggg agggcagggg tgaagtgggg      840

tctgctggag acatgagagc tgccaacctt tggccaagcc cgctcatgat caaacgctct      900

aagaagaaca gcctggcctt gtccctgacg gccgaccaga tggtcagtgc cttgttggat      960

gctgagcccc cgatactcta ttccgagtat gatcctacca gacccttcag tgaagcttcg     1020

atgatgggct tactgaccaa cctggcagac agggagctgg ttcacatgat caactgggcg     1080

aagagggtgc caggctttgt ggatttgacc ctccatgatc aggtccacct tctagaatgt     1140

gcctggctag agatcctgat gattggtctc gtctggcgct ccatggagca cccagggaag     1200

ctactgtttg ctcctaactt gctcttggac aggaaccagg gaaaatgtgt agagggcatg     1260

gtggagatct tcgacatgct gctggctaca tcatctcggt tccgcatgat gaatctgcag     1320

ggagaggagt ttgtgtgcct caaatctatt attttgctta attctggagt gtacacattt     1380

ctgtccagca ccctgaagtc tctggaagag aaggaccata tccaccgagt cctggacaag     1440

atcacagaca ctttgatcca cctgatggcc aaggcaggcc tgaccctgca gcagcagcac     1500

cagcggctgg cccagctcct cctcatcctc tcccacatca ggcacatgag taacaaaggc     1560

atggagcatc tgtacagcat gaagtgcaag aacgtggtgc ccctctatga cctgctgctg     1620

gagatgctgg acgcccaccg cctacatgcg cccactagcc gtggaggggc atccgtggag     1680

gagacggacc aaagccactt ggccactgcg ggctctactt catcgcattc cttgcaaaag     1740

tattacatca cgggggaggc agagggtttc cctgccacgg tc                        1782


<210>  8
<211>  1413
<212>  DNA
<213>  Homo sapiens

<400>  8
ttaggaactg tgaagatgga agggcatgaa accagcgact ggaacagcta ctacgcagac       60

acgcaggagg cctactcctc cgtcccggtc agcaacatga actcaggcct gggctccatg      120

aactccatga acacctacat gaccatgaac accatgacta cgagcggcaa catgaccccg      180

gcgtccttca acatgtccta tgccaacccg ggcctagggg ccggactgag tcccggcgca      240

gtagccggca tgccgggggg ctcggcgggc gccatgaaca gcatgactgc ggccggcgtg      300

acggccatgg gtacggcgct gagcccgagc ggcatgggcg ccatgggtgc gcagcaggcg      360

gcctccatga atggcctggg cccctacgcg gccgccatga acccgtgcat gagccccatg      420

gcgtacgcgc cgtccaacct gggccgcagc cgcgcgggcg gcggcggcga cgccaagacg      480

ttcaagcgca gctacccgca cgccaagccg ccctactcgt acatctcgct catcaccatg      540

gccatccagc aggcgcccag caagatgctc acgctgagcg agatctacca gtggatcatg      600

gacctcttcc cctattaccg gcagaaccag cagcgctggc agaactccat ccgccactcg      660

ctgtccttca atgactgctt cgtcaaggtg gcacgctccc cggacaagcc gggcaagggc      720

tcctactgga cgctgcaccc ggactccggc aacatgttcg agaacggctg ctacttgcgc      780

cgccagaagc gcttcaagtg cgagaagcag ccgggggccg gcggcggggg cgggagcgga      840

agcgggggca gcggcgccaa gggcggccct gagagccgca aggacccctc tggcgcctct      900

aaccccagcg ccgactcgcc cctccatcgg ggtgtgcacg ggaagaccgg ccagctagag      960

ggcgcgccgg cccccgggcc cgccgccagc ccccagactc tggaccacag tggggcgacg     1020

gcgacagggg gcgcctcgga gttgaagact ccagcctcct caactgcgcc ccccataagc     1080

tccgggcccg gggcgctggc ctctgtgccc gcctctcacc cggcacacgg cttggcaccc     1140

cacgagtccc agctgcacct gaaaggggac ccccactact ccttcaacca cccgttctcc     1200

atcaacaacc tcatgtcctc ctcggagcag cagcataagc tggacttcaa ggcatacgaa     1260

caggcactgc aatactcgcc ttacggctct acgttgcccg ccagcctgcc tctaggcagc     1320

gcctcggtga ccaccaggag ccccatcgag ccctcagccc tggagccggc gtactaccaa     1380

ggtgtgtatt ccagacccgt cctaaacact tcc                                  1413


<210>  9
<211>  1257
<212>  DNA
<213>  Homo sapiens

<400>  9
ttaggaactg tgaagatgga agggcatgaa accagcgact ggaacagcta ctacgcagac       60

acgcaggagg cctactcctc cgtcccggtc agcaacatga actcaggcct gggctccatg      120

aactccatga acacctacat gaccatgaac accatgacta cgagcggcaa catgaccccg      180

gcgtccttca acatgtccta tgccaacccg ggcctagggg ccggactgag tcccggcgca      240

gtagccggca tgccgggggg ctcggcgggc gccatgaaca gcatgactgc ggccggcgtg      300

acggccatgg gtacggcgct gagcccgagc ggcatgggcg ccatgggtgc gcagcaggcg      360

gcctccatga atggcctggg cccctacgcg gccgccatga acccgtgcat gagccccatg      420

gcgtacgcgc cgtccaacct gggccgcagc cgcgcgggcg gcggcggcga cgccaagacg      480

ttcaagcgca gctacccgca cgccaagccg ccctactcgt acatctcgct catcaccatg      540

gccatccagc aggcgcccag caagatgctc acgctgagcg agatctacca gtggatcatg      600

gacctcttcc cctattaccg gcagaaccag cagcgcttca agtgcgagaa gcagccgggg      660

gccggcggcg ggggcgggag cggaagcggg ggcagcggcg ccaagggcgg ccctgagagc      720

cgcaaggacc cctctggcgc ctctaacccc agcgccgact cgcccctcca tcggggtgtg      780

cacgggaaga ccggccagct agagggcgcg ccggcccccg ggcccgccgc cagcccccag      840

actctggacc acagtggggc gacggcgaca gggggcgcct cggagttgaa gactccagcc      900

tcctcaactg cgccccccat aagctccggg cccggggcgc tggcctctgt gcccgcctct      960

cacccggcac acggcttggc accccacgag tcccagctgc acctgaaagg ggacccccac     1020

tactccttca accacccgtt ctccatcaac aacctcatgt cctcctcgga gcagcagcat     1080

aagctggact tcaaggcata cgaacaggca ctgcaatact cgccttacgg ctctacgttg     1140

cccgccagcc tgcctctagg cagcgcctcg gtgaccacca ggagccccat cgagccctca     1200

gccctggagc cggcgtacta ccaaggtgtg tattccagac ccgtcctaaa cacttcc        1257


<210>  10
<211>  472
<212>  PRT
<213>  Homo sapiens

<400>  10

Met Leu Gly Thr Val Lys Met Glu Gly His Glu Thr Ser Asp Trp Asn 
1               5                   10                  15      


Ser Tyr Tyr Ala Asp Thr Gln Glu Ala Tyr Ser Ser Val Pro Val Ser 
            20                  25                  30          


Asn Met Asn Ser Gly Leu Gly Ser Met Asn Ser Met Asn Thr Tyr Met 
        35                  40                  45              


Thr Met Asn Thr Met Thr Thr Ser Gly Asn Met Thr Pro Ala Ser Phe 
    50                  55                  60                  


Asn Met Ser Tyr Ala Asn Pro Gly Leu Gly Ala Gly Leu Ser Pro Gly 
65                  70                  75                  80  


Ala Val Ala Gly Met Pro Gly Gly Ser Ala Gly Ala Met Asn Ser Met 
                85                  90                  95      


Thr Ala Ala Gly Val Thr Ala Met Gly Thr Ala Leu Ser Pro Ser Gly 
            100                 105                 110         


Met Gly Ala Met Gly Ala Gln Gln Ala Ala Ser Met Asn Gly Leu Gly 
        115                 120                 125             


Pro Tyr Ala Ala Ala Met Asn Pro Cys Met Ser Pro Met Ala Tyr Ala 
    130                 135                 140                 


Pro Ser Asn Leu Gly Arg Ser Arg Ala Gly Gly Gly Gly Asp Ala Lys 
145                 150                 155                 160 


Thr Phe Lys Arg Ser Tyr Pro His Ala Lys Pro Pro Tyr Ser Tyr Ile 
                165                 170                 175     


Ser Leu Ile Thr Met Ala Ile Gln Gln Ala Pro Ser Lys Met Leu Thr 
            180                 185                 190         


Leu Ser Glu Ile Tyr Gln Trp Ile Met Asp Leu Phe Pro Tyr Tyr Arg 
        195                 200                 205             


Gln Asn Gln Gln Arg Trp Gln Asn Ser Ile Arg His Ser Leu Ser Phe 
    210                 215                 220                 


Asn Asp Cys Phe Val Lys Val Ala Arg Ser Pro Asp Lys Pro Gly Lys 
225                 230                 235                 240 


Gly Ser Tyr Trp Thr Leu His Pro Asp Ser Gly Asn Met Phe Glu Asn 
                245                 250                 255     


Gly Cys Tyr Leu Arg Arg Gln Lys Arg Phe Lys Cys Glu Lys Gln Pro 
            260                 265                 270         


Gly Ala Gly Gly Gly Gly Gly Ser Gly Ser Gly Gly Ser Gly Ala Lys 
        275                 280                 285             


Gly Gly Pro Glu Ser Arg Lys Asp Pro Ser Gly Ala Ser Asn Pro Ser 
    290                 295                 300                 


Ala Asp Ser Pro Leu His Arg Gly Val His Gly Lys Thr Gly Gln Leu 
305                 310                 315                 320 


Glu Gly Ala Pro Ala Pro Gly Pro Ala Ala Ser Pro Gln Thr Leu Asp 
                325                 330                 335     


His Ser Gly Ala Thr Ala Thr Gly Gly Ala Ser Glu Leu Lys Thr Pro 
            340                 345                 350         


Ala Ser Ser Thr Ala Pro Pro Ile Ser Ser Gly Pro Gly Ala Leu Ala 
        355                 360                 365             


Ser Val Pro Ala Ser His Pro Ala His Gly Leu Ala Pro His Glu Ser 
    370                 375                 380                 


Gln Leu His Leu Lys Gly Asp Pro His Tyr Ser Phe Asn His Pro Phe 
385                 390                 395                 400 


Ser Ile Asn Asn Leu Met Ser Ser Ser Glu Gln Gln His Lys Leu Asp 
                405                 410                 415     


Phe Lys Ala Tyr Glu Gln Ala Leu Gln Tyr Ser Pro Tyr Gly Ser Thr 
            420                 425                 430         


Leu Pro Ala Ser Leu Pro Leu Gly Ser Ala Ser Val Thr Thr Arg Ser 
        435                 440                 445             


Pro Ile Glu Pro Ser Ala Leu Glu Pro Ala Tyr Tyr Gln Gly Val Tyr 
    450                 455                 460                 


Ser Arg Pro Val Leu Asn Thr Ser 
465                 470         


<210>  11
<211>  419
<212>  PRT
<213>  Homo sapiens

<400>  11

Leu Gly Thr Val Lys Met Glu Gly His Glu Thr Ser Asp Trp Asn Ser 
1               5                   10                  15      


Tyr Tyr Ala Asp Thr Gln Glu Ala Tyr Ser Ser Val Pro Val Ser Asn 
            20                  25                  30          


Met Asn Ser Gly Leu Gly Ser Met Asn Ser Met Asn Thr Tyr Met Thr 
        35                  40                  45              


Met Asn Thr Met Thr Thr Ser Gly Asn Met Thr Pro Ala Ser Phe Asn 
    50                  55                  60                  


Met Ser Tyr Ala Asn Pro Gly Leu Gly Ala Gly Leu Ser Pro Gly Ala 
65                  70                  75                  80  


Val Ala Gly Met Pro Gly Gly Ser Ala Gly Ala Met Asn Ser Met Thr 
                85                  90                  95      


Ala Ala Gly Val Thr Ala Met Gly Thr Ala Leu Ser Pro Ser Gly Met 
            100                 105                 110         


Gly Ala Met Gly Ala Gln Gln Ala Ala Ser Met Asn Gly Leu Gly Pro 
        115                 120                 125             


Tyr Ala Ala Ala Met Asn Pro Cys Met Ser Pro Met Ala Tyr Ala Pro 
    130                 135                 140                 


Ser Asn Leu Gly Arg Ser Arg Ala Gly Gly Gly Gly Asp Ala Lys Thr 
145                 150                 155                 160 


Phe Lys Arg Ser Tyr Pro His Ala Lys Pro Pro Tyr Ser Tyr Ile Ser 
                165                 170                 175     


Leu Ile Thr Met Ala Ile Gln Gln Ala Pro Ser Lys Met Leu Thr Leu 
            180                 185                 190         


Ser Glu Ile Tyr Gln Trp Ile Met Asp Leu Phe Pro Tyr Tyr Arg Gln 
        195                 200                 205             


Asn Gln Gln Arg Phe Lys Cys Glu Lys Gln Pro Gly Ala Gly Gly Gly 
    210                 215                 220                 


Gly Gly Ser Gly Ser Gly Gly Ser Gly Ala Lys Gly Gly Pro Glu Ser 
225                 230                 235                 240 


Arg Lys Asp Pro Ser Gly Ala Ser Asn Pro Ser Ala Asp Ser Pro Leu 
                245                 250                 255     


His Arg Gly Val His Gly Lys Thr Gly Gln Leu Glu Gly Ala Pro Ala 
            260                 265                 270         


Pro Gly Pro Ala Ala Ser Pro Gln Thr Leu Asp His Ser Gly Ala Thr 
        275                 280                 285             


Ala Thr Gly Gly Ala Ser Glu Leu Lys Thr Pro Ala Ser Ser Thr Ala 
    290                 295                 300                 


Pro Pro Ile Ser Ser Gly Pro Gly Ala Leu Ala Ser Val Pro Ala Ser 
305                 310                 315                 320 


His Pro Ala His Gly Leu Ala Pro His Glu Ser Gln Leu His Leu Lys 
                325                 330                 335     


Gly Asp Pro His Tyr Ser Phe Asn His Pro Phe Ser Ile Asn Asn Leu 
            340                 345                 350         


Met Ser Ser Ser Glu Gln Gln His Lys Leu Asp Phe Lys Ala Tyr Glu 
        355                 360                 365             


Gln Ala Leu Gln Tyr Ser Pro Tyr Gly Ser Thr Leu Pro Ala Ser Leu 
    370                 375                 380                 


Pro Leu Gly Ser Ala Ser Val Thr Thr Arg Ser Pro Ile Glu Pro Ser 
385                 390                 395                 400 


Ala Leu Glu Pro Ala Tyr Tyr Gln Gly Val Tyr Ser Arg Pro Val Leu 
                405                 410                 415     


Asn Thr Ser 
            


<210>  12
<211>  595
<212>  PRT
<213>  Homo sapiens

<400>  12

Met Thr Met Thr Leu His Thr Lys Ala Ser Gly Met Ala Leu Leu His 
1               5                   10                  15      


Gln Ile Gln Gly Asn Glu Leu Glu Pro Leu Asn Arg Pro Gln Leu Lys 
            20                  25                  30          


Ile Pro Leu Glu Arg Pro Leu Gly Glu Val Tyr Leu Asp Ser Ser Lys 
        35                  40                  45              


Pro Ala Val Tyr Asn Tyr Pro Glu Gly Ala Ala Tyr Glu Phe Asn Ala 
    50                  55                  60                  


Ala Ala Ala Ala Asn Ala Gln Val Tyr Gly Gln Thr Gly Leu Pro Tyr 
65                  70                  75                  80  


Gly Pro Gly Ser Glu Ala Ala Ala Phe Gly Ser Asn Gly Leu Gly Gly 
                85                  90                  95      


Phe Pro Pro Leu Asn Ser Val Ser Pro Ser Pro Leu Met Leu Leu His 
            100                 105                 110         


Pro Pro Pro Gln Leu Ser Pro Phe Leu Gln Pro His Gly Gln Gln Val 
        115                 120                 125             


Pro Tyr Tyr Leu Glu Asn Glu Pro Ser Gly Tyr Thr Val Arg Glu Ala 
    130                 135                 140                 


Gly Pro Pro Ala Phe Tyr Arg Pro Asn Ser Asp Asn Arg Arg Gln Gly 
145                 150                 155                 160 


Gly Arg Glu Arg Leu Ala Ser Thr Asn Asp Lys Gly Ser Met Ala Met 
                165                 170                 175     


Glu Ser Ala Lys Glu Thr Arg Tyr Cys Ala Val Cys Asn Asp Tyr Ala 
            180                 185                 190         


Ser Gly Tyr His Tyr Gly Val Trp Ser Cys Glu Gly Cys Lys Ala Phe 
        195                 200                 205             


Phe Lys Arg Ser Ile Gln Gly His Asn Asp Tyr Met Cys Pro Ala Thr 
    210                 215                 220                 


Asn Gln Cys Thr Ile Asp Lys Asn Arg Arg Lys Ser Cys Gln Ala Cys 
225                 230                 235                 240 


Arg Leu Arg Lys Cys Tyr Glu Val Gly Met Met Lys Gly Gly Ile Arg 
                245                 250                 255     


Lys Asp Arg Arg Gly Gly Arg Met Leu Lys His Lys Arg Gln Arg Asp 
            260                 265                 270         


Asp Gly Glu Gly Arg Gly Glu Val Gly Ser Ala Gly Asp Met Arg Ala 
        275                 280                 285             


Ala Asn Leu Trp Pro Ser Pro Leu Met Ile Lys Arg Ser Lys Lys Asn 
    290                 295                 300                 


Ser Leu Ala Leu Ser Leu Thr Ala Asp Gln Met Val Ser Ala Leu Leu 
305                 310                 315                 320 


Asp Ala Glu Pro Pro Ile Leu Tyr Ser Glu Tyr Asp Pro Thr Arg Pro 
                325                 330                 335     


Phe Ser Glu Ala Ser Met Met Gly Leu Leu Thr Asn Leu Ala Asp Arg 
            340                 345                 350         


Glu Leu Val His Met Ile Asn Trp Ala Lys Arg Val Pro Gly Phe Val 
        355                 360                 365             


Asp Leu Thr Leu His Asp Gln Val His Leu Leu Glu Cys Ala Trp Leu 
    370                 375                 380                 


Glu Ile Leu Met Ile Gly Leu Val Trp Arg Ser Met Glu His Pro Gly 
385                 390                 395                 400 


Lys Leu Leu Phe Ala Pro Asn Leu Leu Leu Asp Arg Asn Gln Gly Lys 
                405                 410                 415     


Cys Val Glu Gly Met Val Glu Ile Phe Asp Met Leu Leu Ala Thr Ser 
            420                 425                 430         


Ser Arg Phe Arg Met Met Asn Leu Gln Gly Glu Glu Phe Val Cys Leu 
        435                 440                 445             


Lys Ser Ile Ile Leu Leu Asn Ser Gly Val Tyr Thr Phe Leu Ser Ser 
    450                 455                 460                 


Thr Leu Lys Ser Leu Glu Glu Lys Asp His Ile His Arg Val Leu Asp 
465                 470                 475                 480 


Lys Ile Thr Asp Thr Leu Ile His Leu Met Ala Lys Ala Gly Leu Thr 
                485                 490                 495     


Leu Gln Gln Gln His Gln Arg Leu Ala Gln Leu Leu Leu Ile Leu Ser 
            500                 505                 510         


His Ile Arg His Met Ser Asn Lys Gly Met Glu His Leu Tyr Ser Met 
        515                 520                 525             


Lys Cys Lys Asn Val Val Pro Leu Tyr Asp Leu Leu Leu Glu Met Leu 
    530                 535                 540                 


Asp Ala His Arg Leu His Ala Pro Thr Ser Arg Gly Gly Ala Ser Val 
545                 550                 555                 560 


Glu Glu Thr Asp Gln Ser His Leu Ala Thr Ala Gly Ser Thr Ser Ser 
                565                 570                 575     


His Ser Leu Gln Lys Tyr Tyr Ile Thr Gly Glu Ala Glu Gly Phe Pro 
            580                 585                 590         


Ala Thr Val 
        595 


<210>  13
<211>  595
<212>  PRT
<213>  Homo sapiens

<400>  13

Met Thr Met Thr Leu His Thr Lys Ala Ser Gly Met Ala Leu Leu His 
1               5                   10                  15      


Gln Ile Gln Gly Asn Glu Leu Glu Pro Leu Asn Arg Pro Gln Leu Lys 
            20                  25                  30          


Ile Pro Leu Glu Arg Pro Leu Gly Glu Val Tyr Leu Asp Ser Ser Lys 
        35                  40                  45              


Pro Ala Val Tyr Asn Tyr Pro Glu Gly Ala Ala Tyr Glu Phe Asn Ala 
    50                  55                  60                  


Ala Ala Ala Ala Asn Ala Gln Val Tyr Gly Gln Thr Gly Leu Pro Tyr 
65                  70                  75                  80  


Gly Pro Gly Ser Glu Ala Ala Ala Phe Gly Ser Asn Gly Leu Gly Gly 
                85                  90                  95      


Phe Pro Pro Leu Asn Ser Val Ser Pro Ser Pro Leu Met Leu Leu His 
            100                 105                 110         


Pro Pro Pro Gln Leu Ser Pro Phe Leu Gln Pro His Gly Gln Gln Val 
        115                 120                 125             


Pro Tyr Tyr Leu Glu Asn Glu Pro Ser Gly Tyr Thr Val Arg Glu Ala 
    130                 135                 140                 


Gly Pro Pro Ala Phe Tyr Arg Pro Asn Ser Asp Asn Arg Arg Gln Gly 
145                 150                 155                 160 


Gly Arg Glu Arg Leu Ala Ser Thr Asn Asp Lys Gly Ser Met Ala Met 
                165                 170                 175     


Glu Ser Ala Lys Glu Thr Arg Tyr Cys Ala Val Cys Asn Asp Tyr Ala 
            180                 185                 190         


Ser Gly Tyr His Tyr Gly Val Trp Ser Cys Glu Gly Cys Lys Ala Phe 
        195                 200                 205             


Phe Lys Arg Ser Ile Gln Gly His Asn Asp Tyr Met Cys Pro Ala Thr 
    210                 215                 220                 


Asn Gln Cys Thr Ile Asp Lys Asn Arg Arg Lys Ser Cys Gln Ala Cys 
225                 230                 235                 240 


Arg Leu Arg Lys Cys Tyr Glu Val Gly Met Met Lys Gly Gly Ile Arg 
                245                 250                 255     


Lys Asp Arg Arg Gly Gly Arg Met Leu Lys His Lys Arg Gln Arg Asp 
            260                 265                 270         


Asp Gly Glu Gly Arg Gly Glu Val Gly Ser Ala Gly Asp Met Arg Ala 
        275                 280                 285             


Ala Asn Leu Trp Pro Ser Pro Leu Met Ile Lys Arg Ser Lys Lys Asn 
    290                 295                 300                 


Ser Leu Ala Leu Ser Leu Thr Ala Asp Gln Met Val Ser Ala Leu Leu 
305                 310                 315                 320 


Asp Ala Glu Pro Pro Ile Leu Tyr Ser Glu Tyr Asp Pro Thr Arg Pro 
                325                 330                 335     


Phe Ser Glu Ala Ser Met Met Gly Leu Leu Thr Asn Leu Ala Asp Arg 
            340                 345                 350         


Glu Leu Val His Met Ile Asn Trp Ala Lys Arg Val Pro Gly Phe Val 
        355                 360                 365             


Asp Leu Thr Leu His Asp Gln Val His Leu Leu Glu Cys Ala Trp Leu 
    370                 375                 380                 


Glu Ile Leu Met Ile Gly Leu Val Trp Arg Ser Met Glu His Pro Gly 
385                 390                 395                 400 


Lys Leu Leu Phe Ala Pro Asn Leu Leu Leu Asp Arg Asn Gln Gly Lys 
                405                 410                 415     


Cys Val Glu Gly Met Val Glu Ile Phe Asp Met Leu Leu Ala Thr Ser 
            420                 425                 430         


Ser Arg Phe Arg Met Met Asn Leu Gln Gly Glu Glu Phe Val Cys Leu 
        435                 440                 445             


Lys Ser Ile Ile Leu Leu Asn Ser Gly Val Tyr Thr Phe Leu Ser Ser 
    450                 455                 460                 


Thr Leu Lys Ser Leu Glu Glu Lys Asp His Ile His Arg Val Leu Asp 
465                 470                 475                 480 


Lys Ile Thr Asp Thr Leu Ile His Leu Met Ala Lys Ala Gly Leu Thr 
                485                 490                 495     


Leu Gln Gln Gln His Gln Arg Leu Ala Gln Leu Leu Leu Ile Leu Ser 
            500                 505                 510         


His Ile Arg His Met Ser Asn Lys Gly Met Glu His Leu Tyr Ser Met 
        515                 520                 525             


Lys Cys Lys Asn Val Val Pro Leu Tyr Gly Leu Leu Leu Glu Met Leu 
    530                 535                 540                 


Asp Ala His Arg Leu His Ala Pro Thr Ser Arg Gly Gly Ala Ser Val 
545                 550                 555                 560 


Glu Glu Thr Asp Gln Ser His Leu Ala Thr Ala Gly Ser Thr Ser Ser 
                565                 570                 575     


His Ser Leu Gln Lys Tyr Tyr Ile Thr Gly Glu Ala Glu Gly Phe Pro 
            580                 585                 590         


Ala Thr Val 
        595 


<210>  14
<211>  595
<212>  PRT
<213>  Homo sapiens

<400>  14

Met Thr Met Thr Leu His Thr Lys Ala Ser Gly Met Ala Leu Leu His 
1               5                   10                  15      


Gln Ile Gln Gly Asn Glu Leu Glu Pro Leu Asn Arg Pro Gln Leu Lys 
            20                  25                  30          


Ile Pro Leu Glu Arg Pro Leu Gly Glu Val Tyr Leu Asp Ser Ser Lys 
        35                  40                  45              


Pro Ala Val Tyr Asn Tyr Pro Glu Gly Ala Ala Tyr Glu Phe Asn Ala 
    50                  55                  60                  


Ala Ala Ala Ala Asn Ala Gln Val Tyr Gly Gln Thr Gly Leu Pro Tyr 
65                  70                  75                  80  


Gly Pro Gly Ser Glu Ala Ala Ala Phe Gly Ser Asn Gly Leu Gly Gly 
                85                  90                  95      


Phe Pro Pro Leu Asn Ser Val Ser Pro Ser Pro Leu Met Leu Leu His 
            100                 105                 110         


Pro Pro Pro Gln Leu Ser Pro Phe Leu Gln Pro His Gly Gln Gln Val 
        115                 120                 125             


Pro Tyr Tyr Leu Glu Asn Glu Pro Ser Gly Tyr Thr Val Arg Glu Ala 
    130                 135                 140                 


Gly Pro Pro Ala Phe Tyr Arg Pro Asn Ser Asp Asn Arg Arg Gln Gly 
145                 150                 155                 160 


Gly Arg Glu Arg Leu Ala Ser Thr Asn Asp Lys Gly Ser Met Ala Met 
                165                 170                 175     


Glu Ser Ala Lys Glu Thr Arg Tyr Cys Ala Val Cys Asn Asp Tyr Ala 
            180                 185                 190         


Ser Gly Tyr His Tyr Gly Val Trp Ser Cys Glu Gly Cys Lys Ala Phe 
        195                 200                 205             


Phe Lys Arg Ser Ile Gln Gly His Asn Asp Tyr Met Cys Pro Ala Thr 
    210                 215                 220                 


Asn Gln Cys Thr Ile Asp Lys Asn Arg Arg Lys Ser Cys Gln Ala Cys 
225                 230                 235                 240 


Arg Leu Arg Lys Cys Tyr Glu Val Gly Met Met Lys Gly Gly Ile Arg 
                245                 250                 255     


Lys Asp Arg Arg Gly Gly Arg Met Leu Lys His Lys Arg Gln Arg Asp 
            260                 265                 270         


Asp Gly Glu Gly Arg Gly Glu Val Gly Ser Ala Gly Asp Met Arg Ala 
        275                 280                 285             


Ala Asn Leu Trp Pro Ser Pro Leu Met Ile Lys Arg Ser Lys Lys Asn 
    290                 295                 300                 


Ser Leu Ala Leu Ser Leu Thr Ala Asp Gln Met Val Ser Ala Leu Leu 
305                 310                 315                 320 


Asp Ala Glu Pro Pro Ile Leu Tyr Ser Glu Tyr Asp Pro Thr Arg Pro 
                325                 330                 335     


Phe Ser Glu Ala Ser Met Met Gly Leu Leu Thr Asn Leu Ala Asp Arg 
            340                 345                 350         


Glu Leu Val His Met Ile Asn Trp Ala Lys Arg Val Pro Gly Phe Val 
        355                 360                 365             


Asp Leu Thr Leu His Asp Gln Val His Leu Leu Glu Cys Ala Trp Leu 
    370                 375                 380                 


Glu Ile Leu Met Ile Gly Leu Val Trp Arg Ser Met Glu His Pro Gly 
385                 390                 395                 400 


Lys Leu Leu Phe Ala Pro Asn Leu Leu Leu Asp Arg Asn Gln Gly Lys 
                405                 410                 415     


Cys Val Glu Gly Met Val Glu Ile Phe Asp Met Leu Leu Ala Thr Ser 
            420                 425                 430         


Ser Arg Phe Arg Met Met Asn Leu Gln Gly Glu Glu Phe Val Cys Leu 
        435                 440                 445             


Lys Ser Ile Ile Leu Leu Asn Ser Gly Val Tyr Thr Phe Leu Ser Ser 
    450                 455                 460                 


Thr Leu Lys Ser Leu Glu Glu Lys Asp His Ile His Arg Val Leu Asp 
465                 470                 475                 480 


Lys Ile Thr Asp Thr Leu Ile His Leu Met Ala Lys Ala Gly Leu Thr 
                485                 490                 495     


Leu Gln Gln Gln His Gln Arg Leu Ala Gln Leu Leu Leu Ile Leu Ser 
            500                 505                 510         


His Ile Arg His Met Ser Asn Lys Gly Met Glu His Leu Tyr Ser Met 
        515                 520                 525             


Lys Cys Lys Asn Val Val Pro Leu Ser Asp Leu Leu Leu Glu Met Leu 
    530                 535                 540                 


Asp Ala His Arg Leu His Ala Pro Thr Ser Arg Gly Gly Ala Ser Val 
545                 550                 555                 560 


Glu Glu Thr Asp Gln Ser His Leu Ala Thr Ala Gly Ser Thr Ser Ser 
                565                 570                 575     


His Ser Leu Gln Lys Tyr Tyr Ile Thr Gly Glu Ala Glu Gly Phe Pro 
            580                 585                 590         


Ala Thr Val 
        595 


<210>  15
<211>  644
<212>  PRT
<213>  Homo sapiens

<400>  15

Met Glu Val Gln Leu Gly Leu Gly Arg Val Tyr Pro Arg Pro Pro Ser 
1               5                   10                  15      


Lys Thr Tyr Arg Gly Ala Phe Gln Asn Leu Phe Gln Ser Val Arg Glu 
            20                  25                  30          


Val Ile Gln Asn Pro Gly Pro Arg His Pro Glu Ala Ala Ser Ala Ala 
        35                  40                  45              


Pro Pro Gly Ala Ser Leu Leu Leu Leu Gln Gln Gln Gln Gln Gln Gln 
    50                  55                  60                  


Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln 
65                  70                  75                  80  


Glu Thr Ser Pro Arg Gln Gln Gln Gln Gln Gln Gly Glu Asp Gly Ser 
                85                  90                  95      


Pro Gln Ala His Arg Arg Gly Pro Thr Gly Tyr Leu Val Leu Asp Glu 
            100                 105                 110         


Glu Gln Gln Pro Ser Gln Pro Gln Ser Ala Leu Glu Cys His Pro Glu 
        115                 120                 125             


Arg Gly Cys Val Pro Glu Pro Gly Ala Ala Val Ala Ala Ser Lys Gly 
    130                 135                 140                 


Leu Pro Gln Gln Leu Pro Ala Pro Pro Asp Glu Asp Asp Ser Ala Ala 
145                 150                 155                 160 


Pro Ser Thr Leu Ser Leu Leu Gly Pro Thr Phe Pro Gly Leu Ser Ser 
                165                 170                 175     


Cys Ser Ala Asp Leu Lys Asp Ile Leu Ser Glu Ala Ser Thr Met Gln 
            180                 185                 190         


Leu Leu Gln Gln Gln Gln Gln Glu Ala Val Ser Glu Gly Ser Ser Ser 
        195                 200                 205             


Gly Arg Ala Arg Glu Ala Ser Gly Ala Pro Thr Ser Ser Lys Asp Asn 
    210                 215                 220                 


Tyr Leu Gly Gly Thr Ser Thr Ile Ser Asp Asn Ala Lys Glu Leu Cys 
225                 230                 235                 240 


Lys Ala Val Ser Val Ser Met Gly Leu Gly Val Glu Ala Leu Glu His 
                245                 250                 255     


Leu Ser Pro Gly Glu Gln Leu Arg Gly Asp Cys Met Tyr Ala Pro Leu 
            260                 265                 270         


Leu Gly Val Pro Pro Ala Val Arg Pro Thr Pro Cys Ala Pro Leu Ala 
        275                 280                 285             


Glu Cys Lys Gly Ser Leu Leu Asp Asp Ser Ala Gly Lys Ser Thr Glu 
    290                 295                 300                 


Asp Thr Ala Glu Tyr Ser Pro Phe Lys Gly Gly Tyr Thr Lys Gly Leu 
305                 310                 315                 320 


Glu Gly Glu Ser Leu Gly Cys Ser Gly Ser Ala Ala Ala Gly Ser Ser 
                325                 330                 335     


Gly Thr Leu Glu Leu Pro Ser Thr Leu Ser Leu Tyr Lys Ser Gly Ala 
            340                 345                 350         


Leu Asp Glu Ala Ala Ala Tyr Gln Ser Arg Asp Tyr Tyr Asn Phe Pro 
        355                 360                 365             


Leu Ala Leu Ala Gly Pro Pro Pro Pro Pro Pro Pro Pro His Pro His 
    370                 375                 380                 


Ala Arg Ile Lys Leu Glu Asn Pro Leu Asp Tyr Gly Ser Ala Trp Ala 
385                 390                 395                 400 


Ala Ala Ala Ala Gln Cys Arg Tyr Gly Asp Leu Ala Ser Leu His Gly 
                405                 410                 415     


Ala Gly Ala Ala Gly Pro Gly Ser Gly Ser Pro Ser Ala Ala Ala Ser 
            420                 425                 430         


Ser Ser Trp His Thr Leu Phe Thr Ala Glu Glu Gly Gln Leu Tyr Gly 
        435                 440                 445             


Pro Cys Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
    450                 455                 460                 


Gly Gly Gly Gly Gly Gly Gly Gly Gly Glu Ala Gly Ala Val Ala Pro 
465                 470                 475                 480 


Tyr Gly Tyr Thr Arg Pro Pro Gln Gly Leu Ala Gly Gln Glu Ser Asp 
                485                 490                 495     


Phe Thr Ala Pro Asp Val Trp Tyr Pro Gly Gly Met Val Ser Arg Val 
            500                 505                 510         


Pro Tyr Pro Ser Pro Thr Cys Val Lys Ser Glu Met Gly Pro Trp Met 
        515                 520                 525             


Asp Ser Tyr Ser Gly Pro Tyr Gly Asp Met Arg Leu Glu Thr Ala Arg 
    530                 535                 540                 


Asp His Val Leu Pro Ile Asp Tyr Tyr Phe Pro Pro Gln Lys Thr Cys 
545                 550                 555                 560 


Leu Ile Cys Gly Asp Glu Ala Ser Gly Cys His Tyr Gly Ala Leu Thr 
                565                 570                 575     


Cys Gly Ser Cys Lys Val Phe Phe Lys Arg Ala Ala Glu Gly Lys Gln 
            580                 585                 590         


Lys Tyr Leu Cys Ala Ser Arg Asn Asp Cys Thr Ile Asp Lys Phe Arg 
        595                 600                 605             


Arg Lys Asn Cys Pro Ser Cys Arg Leu Arg Lys Cys Tyr Glu Ala Gly 
    610                 615                 620                 


Met Thr Leu Gly Glu Lys Phe Arg Val Gly Asn Cys Lys His Leu Lys 
625                 630                 635                 640 


Met Thr Arg Pro 
                


<210>  16
<211>  439
<212>  PRT
<213>  Homo sapiens

<400>  16

Met Asn Ser Gly Leu Gly Ser Met Asn Ser Met Asn Thr Tyr Met Thr 
1               5                   10                  15      


Met Asn Thr Met Thr Thr Ser Gly Asn Met Thr Pro Ala Ser Phe Asn 
            20                  25                  30          


Met Ser Tyr Ala Asn Pro Gly Leu Gly Ala Gly Leu Ser Pro Gly Ala 
        35                  40                  45              


Val Ala Gly Met Pro Gly Gly Ser Ala Gly Ala Met Asn Ser Met Thr 
    50                  55                  60                  


Ala Ala Gly Val Thr Ala Met Gly Thr Ala Leu Ser Pro Ser Gly Met 
65                  70                  75                  80  


Gly Ala Met Gly Ala Gln Gln Ala Ala Ser Met Asn Gly Leu Gly Pro 
                85                  90                  95      


Tyr Ala Ala Ala Met Asn Pro Cys Met Ser Pro Met Ala Tyr Ala Pro 
            100                 105                 110         


Ser Asn Leu Gly Arg Ser Arg Ala Gly Gly Gly Gly Asp Ala Lys Thr 
        115                 120                 125             


Phe Lys Arg Ser Tyr Pro His Ala Lys Pro Pro Tyr Ser Tyr Ile Ser 
    130                 135                 140                 


Leu Ile Thr Met Ala Ile Gln Gln Ala Pro Ser Lys Met Leu Thr Leu 
145                 150                 155                 160 


Ser Glu Ile Tyr Gln Trp Ile Met Asp Leu Phe Pro Tyr Tyr Arg Gln 
                165                 170                 175     


Asn Gln Gln Arg Trp Gln Asn Ser Ile Arg His Ser Leu Ser Phe Asn 
            180                 185                 190         


Asp Cys Phe Val Lys Val Ala Arg Ser Pro Asp Lys Pro Gly Lys Gly 
        195                 200                 205             


Ser Tyr Trp Thr Leu His Pro Asp Ser Gly Asn Met Phe Glu Asn Gly 
    210                 215                 220                 


Cys Tyr Leu Arg Arg Gln Lys Arg Phe Lys Cys Glu Lys Gln Pro Gly 
225                 230                 235                 240 


Ala Gly Gly Gly Gly Gly Ser Gly Ser Gly Gly Ser Gly Ala Lys Gly 
                245                 250                 255     


Gly Pro Glu Ser Arg Lys Asp Pro Ser Gly Ala Ser Asn Pro Ser Ala 
            260                 265                 270         


Asp Ser Pro Leu His Arg Gly Val His Gly Lys Thr Gly Gln Leu Glu 
        275                 280                 285             


Gly Ala Pro Ala Pro Gly Pro Ala Ala Ser Pro Gln Thr Leu Asp His 
    290                 295                 300                 


Ser Gly Ala Thr Ala Thr Gly Gly Ala Ser Glu Leu Lys Thr Pro Ala 
305                 310                 315                 320 


Ser Ser Thr Ala Pro Pro Ile Ser Ser Gly Pro Gly Ala Leu Ala Ser 
                325                 330                 335     


Val Pro Ala Ser His Pro Ala His Gly Leu Ala Pro His Glu Ser Gln 
            340                 345                 350         


Leu His Leu Lys Gly Asp Pro His Tyr Ser Phe Asn His Pro Phe Ser 
        355                 360                 365             


Ile Asn Asn Leu Met Ser Ser Ser Glu Gln Gln His Lys Leu Asp Phe 
    370                 375                 380                 


Lys Ala Tyr Glu Gln Ala Leu Gln Tyr Ser Pro Tyr Gly Ser Thr Leu 
385                 390                 395                 400 


Pro Ala Ser Leu Pro Leu Gly Ser Ala Ser Val Thr Thr Arg Ser Pro 
                405                 410                 415     


Ile Glu Pro Ser Ala Leu Glu Pro Ala Tyr Tyr Gln Gly Val Tyr Ser 
            420                 425                 430         


Arg Pro Val Leu Asn Thr Ser 
        435                 


<210>  17
<211>  18
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Linker nucleotide sequence used in the endo construct

<400>  17
ggcggatcag gcggaagc                                                     18


<210>  18
<211>  6
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Linker peptide sequence used in the endo construct

<400>  18

Gly Gly Ser Gly Gly Ser 
1               5       


<210>  19
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Linker peptide sequence

<400>  19

Gly Gly Gly Ser 
1               


<210>  20
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Linker peptide sequence

<400>  20

Gly Gly Gly Gly Ser 
1               5   


<210>  21
<211>  10
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Linker peptide sequence

<400>  21

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
1               5                   10  


<210>  22
<211>  5
<212>  DNA
<213>  Enterobacteria phage T4


<220>
<221>  misc_feature
<222>  (2)..(4)
<223>  n is a, c, g, or t

<400>  22
cnnng                                                                    5


<210>  23
<211>  9
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Motif defining homing endonuclease class

<400>  23

Leu Ala Gly Leu Ile Asp Ala Asp Gly 
1               5                   


<210>  24
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  FLAG-tag coding sequence

<400>  24
gactacaagg acgacgatga caag                                              24


<210>  25
<211>  66
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  P2A peptide coding sequence for ribosome skipping

<400>  25
ggaagcggag ctactaactt cagcctgctg aagcaggctg gcgacgtgga ggagaaccct       60

ggacct                                                                  66


<210>  26
<211>  681
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Humanised MonsterGFP coding sequence

<400>  26
atgggcgtga tcaagcccga catgaagatc aagctgcgga tggagggcgc cgtgaacggc       60

cacaaattcg tgatcgaggg cgacgggaaa ggcaagccct ttgagggtaa gcagactatg      120

gacctgaccg tgatcgaggg cgcccccctg cccttcgctt atgacattct caccaccgtg      180

ttcgactacg gtaaccgtgt cttcgccaag taccccaagg acatccctga ctacttcaag      240

cagaccttcc ccgagggcta ctcgtgggag cgaagcatga catacgagga ccagggaatc      300

tgtatcgcta caaacgacat caccatgatg aagggtgtgg acgactgctt cgtgtacaaa      360

atccgcttcg acggggtcaa cttccctgct aatggcccgg tgatgcagcg caagacccta      420

aagtgggagc ccagtaccga gaagatgtac gtgcgggacg gcgtactgaa gggcgatgtt      480

aatatggcac tgctcttgga gggaggcggc cactaccgct gcgacttcaa gaccacctac      540

aaagccaaga aggtggtgca gcttcccgac taccacttcg tggaccaccg catcgagatc      600

gtgagccacg acaaggacta caacaaagtc aagctgtacg agcacgccga agcccacagc      660

ggactacccc gccaggccgg c                                                681


<210>  27
<211>  15
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Nucleotide sequence with STOP codons in all three reading frames

<400>  27
tgaataacta gatga                                                        15


<210>  28
<211>  1428
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Endo Cassette nucleotide sequence as Sal1-BamH1 restriction 
       endonuclease fragment

<400>  28
gtcgacgcca ccatgaaaag cggaatttat cagattaaaa atactttaaa caataaagta       60

tatgtaggaa gtgctaaaga ttttgaaaag agatggaaga ggcattttaa agatttagaa      120

aaaggatgcc attcttctat aaaacttcag aggtctttta acaaacatgg taatgtgttt      180

gaatgttcta ttttggaaga aattccatat gagaaagatt tgattattga acgagaaaat      240

ttttggatta aagagcttaa ttctaaaatt aatggataca atattgctga tgcaacgttt      300

ggtgatacat gttctacgca tccattaaaa gaagaaatta ttaagaaacg ttctgaaact      360

gttaaagcta agatgcttaa acttggacct gatggtcgga aagctcttta cagtaaaccc      420

ggaagtaaaa acgggcgttg gaatccagaa acccataagt tttgtaagtg cggtgttcgc      480

atacaaactt ctgcttatac ttgtagtaaa tgcagaaatc gttcaggtga aaataattca      540

ttctttaatc ataagcattc agacataact aaatctaaaa tatcagaaaa gatgaaaggt      600

aaaaagccta gtaatggcgg atcaggcgga agcgctgact acaaggacga cgatgacaag      660

ggaagcggag ctactaactt cagcctgctg aagcaggctg gcgacgtgga ggagaaccct      720

ggacctatgg gcgtgatcaa gcccgacatg aagatcaagc tgcggatgga gggcgccgtg      780

aacggccaca aattcgtgat cgagggcgac gggaaaggca agccctttga gggtaagcag      840

actatggacc tgaccgtgat cgagggcgcc cccctgccct tcgcttatga cattctcacc      900

accgtgttcg actacggtaa ccgtgtcttc gccaagtacc ccaaggacat ccctgactac      960

ttcaagcaga ccttccccga gggctactcg tgggagcgaa gcatgacata cgaggaccag     1020

ggaatctgta tcgctacaaa cgacatcacc atgatgaagg gtgtggacga ctgcttcgtg     1080

tacaaaatcc gcttcgacgg ggtcaacttc cctgctaatg gcccggtgat gcagcgcaag     1140

accctaaagt gggagcccag taccgagaag atgtacgtgc gggacggcgt actgaagggc     1200

gatgttaata tggcactgct cttggaggga ggcggccact accgctgcga cttcaagacc     1260

acctacaaag ccaagaaggt ggtgcagctt cccgactacc acttcgtgga ccaccgcatc     1320

gagatcgtga gccacgacaa ggactacaac aaagtcaagc tgtacgagca cgccgaagcc     1380

cacagcggac taccccgcca ggccggctga ataactagat gaggatcc                  1428


<210>  29
<211>  3706
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pMK-RQ Endo cassette

<400>  29
ctaaattgta agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc       60

attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga      120

gatagggttg agtggccgct acagggcgct cccattcgcc attcaggctg cgcaactgtt      180

gggaagggcg tttcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt      240

gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg      300

acggccagtg agcgcgacgt aatacgactc actatagggc gaattgaagg aaggccgtca      360

aggccgcatg tcgacgccac catgaaaagc ggaatttatc agattaaaaa tactttaaac      420

aataaagtat atgtaggaag tgctaaagat tttgaaaaga gatggaagag gcattttaaa      480

gatttagaaa aaggatgcca ttcttctata aaacttcaga ggtcttttaa caaacatggt      540

aatgtgtttg aatgttctat tttggaagaa attccatatg agaaagattt gattattgaa      600

cgagaaaatt tttggattaa agagcttaat tctaaaatta atggatacaa tattgctgat      660

gcaacgtttg gtgatacatg ttctacgcat ccattaaaag aagaaattat taagaaacgt      720

tctgaaactg ttaaagctaa gatgcttaaa cttggacctg atggtcggaa agctctttac      780

agtaaacccg gaagtaaaaa cgggcgttgg aatccagaaa cccataagtt ttgtaagtgc      840

ggtgttcgca tacaaacttc tgcttatact tgtagtaaat gcagaaatcg ttcaggtgaa      900

aataattcat tctttaatca taagcattca gacataacta aatctaaaat atcagaaaag      960

atgaaaggta aaaagcctag taatggcgga tcaggcggaa gcgctgacta caaggacgac     1020

gatgacaagg gaagcggagc tactaacttc agcctgctga agcaggctgg cgacgtggag     1080

gagaaccctg gacctatggg cgtgatcaag cccgacatga agatcaagct gcggatggag     1140

ggcgccgtga acggccacaa attcgtgatc gagggcgacg ggaaaggcaa gccctttgag     1200

ggtaagcaga ctatggacct gaccgtgatc gagggcgccc ccctgccctt cgcttatgac     1260

attctcacca ccgtgttcga ctacggtaac cgtgtcttcg ccaagtaccc caaggacatc     1320

cctgactact tcaagcagac cttccccgag ggctactcgt gggagcgaag catgacatac     1380

gaggaccagg gaatctgtat cgctacaaac gacatcacca tgatgaaggg tgtggacgac     1440

tgcttcgtgt acaaaatccg cttcgacggg gtcaacttcc ctgctaatgg cccggtgatg     1500

cagcgcaaga ccctaaagtg ggagcccagt accgagaaga tgtacgtgcg ggacggcgta     1560

ctgaagggcg atgttaatat ggcactgctc ttggagggag gcggccacta ccgctgcgac     1620

ttcaagacca cctacaaagc caagaaggtg gtgcagcttc ccgactacca cttcgtggac     1680

caccgcatcg agatcgtgag ccacgacaag gactacaaca aagtcaagct gtacgagcac     1740

gccgaagccc acagcggact accccgccag gccggctgaa taactagatg aggatccctg     1800

ggcctcatgg gccttccttt cactgcccgc tttccagtcg ggaaacctgt cgtgccagct     1860

gcattaacat ggtcatagct gtttccttgc gtattgggcg ctctccgctt cctcgctcac     1920

tgactcgctg cgctcggtcg ttcgggtaaa gcctggggtg cctaatgagc aaaaggccag     1980

caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag gctccgcccc     2040

cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta     2100

taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg     2160

ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc     2220

tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac     2280

gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac     2340

ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg     2400

aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg ctacactaga     2460

agaacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt     2520

agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag     2580

cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct     2640

gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt atcaaaaagg     2700

atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta aagtatatat     2760

gagtaaactt ggtctgacag ttattagaaa aattcatcca gcagacgata aaacgcaata     2820

cgctggctat ccggtgccgc aatgccatac agcaccagaa aacgatccgc ccattcgccg     2880

cccagttctt ccgcaatatc acgggtggcc agcgcaatat cctgataacg atccgccacg     2940

cccagacggc cgcaatcaat aaagccgcta aaacggccat tttccaccat aatgttcggc     3000

aggcacgcat caccatgggt caccaccaga tcttcgccat ccggcatgct cgctttcaga     3060

cgcgcaaaca gctctgccgg tgccaggccc tgatgttctt catccagatc atcctgatcc     3120

accaggcccg cttccatacg ggtacgcgca cgttcaatac gatgtttcgc ctgatgatca     3180

aacggacagg tcgccgggtc cagggtatgc agacgacgca tggcatccgc cataatgctc     3240

actttttctg ccggcgccag atggctagac agcagatcct gacccggcac ttcgcccagc     3300

agcagccaat cacggcccgc ttcggtcacc acatccagca ccgccgcaca cggaacaccg     3360

gtggtggcca gccagctcag acgcgccgct tcatcctgca gctcgttcag cgcaccgctc     3420

agatcggttt tcacaaacag caccggacga ccctgcgcgc tcagacgaaa caccgccgca     3480

tcagagcagc caatggtctg ctgcgcccaa tcatagccaa acagacgttc cacccacgct     3540

gccgggctac ccgcatgcag gccatcctgt tcaatcatac tcttcctttt tcaatattat     3600

tgaagcattt atcagggtta ttgtctcatg agcggataca tatttgaatg tatttagaaa     3660

aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccac                    3706


<210>  30
<211>  4789
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pTRE3G Endo cassette

<400>  30
ctcgagttta ctccctatca gtgatagaga acgtatgaag agtttactcc ctatcagtga       60

tagagaacgt atgcagactt tactccctat cagtgataga gaacgtataa ggagtttact      120

ccctatcagt gatagagaac gtatgaccag tttactccct atcagtgata gagaacgtat      180

ctacagttta ctccctatca gtgatagaga acgtatatcc agtttactcc ctatcagtga      240

tagagaacgt ataagcttta ggcgtgtacg gtgggcgcct ataaaagcag agctcgttta      300

gtgaaccgtc agatcgcctg gagcaattcc acaacacttt tgtcttatac caactttccg      360

taccacttcc taccctcgta aagtcgacgc caccatgaaa agcggaattt atcagattaa      420

aaatacttta aacaataaag tatatgtagg aagtgctaaa gattttgaaa agagatggaa      480

gaggcatttt aaagatttag aaaaaggatg ccattcttct ataaaacttc agaggtcttt      540

taacaaacat ggtaatgtgt ttgaatgttc tattttggaa gaaattccat atgagaaaga      600

tttgattatt gaacgagaaa atttttggat taaagagctt aattctaaaa ttaatggata      660

caatattgct gatgcaacgt ttggtgatac atgttctacg catccattaa aagaagaaat      720

tattaagaaa cgttctgaaa ctgttaaagc taagatgctt aaacttggac ctgatggtcg      780

gaaagctctt tacagtaaac ccggaagtaa aaacgggcgt tggaatccag aaacccataa      840

gttttgtaag tgcggtgttc gcatacaaac ttctgcttat acttgtagta aatgcagaaa      900

tcgttcaggt gaaaataatt cattctttaa tcataagcat tcagacataa ctaaatctaa      960

aatatcagaa aagatgaaag gtaaaaagcc tagtaatggc ggatcaggcg gaagcgctga     1020

ctacaaggac gacgatgaca agggaagcgg agctactaac ttcagcctgc tgaagcaggc     1080

tggcgacgtg gaggagaacc ctggacctat gggcgtgatc aagcccgaca tgaagatcaa     1140

gctgcggatg gagggcgccg tgaacggcca caaattcgtg atcgagggcg acgggaaagg     1200

caagcccttt gagggtaagc agactatgga cctgaccgtg atcgagggcg cccccctgcc     1260

cttcgcttat gacattctca ccaccgtgtt cgactacggt aaccgtgtct tcgccaagta     1320

ccccaaggac atccctgact acttcaagca gaccttcccc gagggctact cgtgggagcg     1380

aagcatgaca tacgaggacc agggaatctg tatcgctaca aacgacatca ccatgatgaa     1440

gggtgtggac gactgcttcg tgtacaaaat ccgcttcgac ggggtcaact tccctgctaa     1500

tggcccggtg atgcagcgca agaccctaaa gtgggagccc agtaccgaga agatgtacgt     1560

gcgggacggc gtactgaagg gcgatgttaa tatggcactg ctcttggagg gaggcggcca     1620

ctaccgctgc gacttcaaga ccacctacaa agccaagaag gtggtgcagc ttcccgacta     1680

ccacttcgtg gaccaccgca tcgagatcgt gagccacgac aaggactaca acaaagtcaa     1740

gctgtacgag cacgccgaag cccacagcgg actaccccgc caggccggct gaataactag     1800

atgaggatcc aatgtaactg tattcagcga tgacgaaatt cttagctatt gtaatactct     1860

agaggatctt tgtgaaggaa ccttacttct gtggtgtgac ataattggac aaactaccta     1920

cagagattta aagctctaag gtaaatataa aatttttaag tgtataatgt gttaaactac     1980

tgattctaat tgtttgtgta ttttagattc caacctatgg aactgatgaa tgggagcagt     2040

ggtggaatgc ctttaatgag gaaaacctgt tttgctcaga agaaatgcca tctagtgatg     2100

atgaggctac tgctgactct caacattcta ctcctccaaa aaagaagaga aaggtagaag     2160

accccaagga ctttccttca gaattgctaa gttttttgag tcatgctgtg tttagtaata     2220

gaactcttgc ttgctttgct atttacacca caaaggaaaa agctgcactg ctatacaaga     2280

aaattatgga aaaatattct gtaaccttta taagtaggca taacagttat aatcataaca     2340

tactgttttt tcttactcca cacaggcata gagtgtctgc tattaataac tatgctcaaa     2400

aattgtgtac ctttagcttt ttaatttgta aaggggttaa taaggaatat ttgatgtata     2460

gtgccttgac tagagatcat aatcagccat accacatttg tagaggtttt acttgcttta     2520

aaaaacctcc cacacctccc cctgaacctg aaacataaaa tgaatgcaat tgttgttgtt     2580

aacttgttta ttgcagctta taatggttac aaataaagca atagcatcac aaatttcaca     2640

aataaagcat ttttttcact gcattctagt tgtggtttgt ccaaactcat caatgtatct     2700

tatcatgtct gcggctctag agctgcatta atgaatcggc caacgcgcgg ggagaggcgg     2760

tttgcgtatt gggcgctctt ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg     2820

gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca cagaatcagg     2880

ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa     2940

ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg     3000

acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc     3060

tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc     3120

ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atctcagttc     3180

ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg     3240

ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc     3300

actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga     3360

gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg gtatctgcgc     3420

tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac     3480

caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg     3540

atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc     3600

acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa     3660

ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt ctgacagtta     3720

ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt catccatagt     3780

tgcctgactc cccgtcgtgt agataactac gatacgggag ggcttaccat ctggccccag     3840

tgctgcaatg ataccgcgag acccacgctc accggctcca gatttatcag caataaacca     3900

gccagccgga agggccgagc gcagaagtgg tcctgcaact ttatccgcct ccatccagtc     3960

tattaattgt tgccgggaag ctagagtaag tagttcgcca gttaatagtt tgcgcaacgt     4020

tgttgccatt gctacaggca tcgtggtgtc acgctcgtcg tttggtatgg cttcattcag     4080

ctccggttcc caacgatcaa ggcgagttac atgatccccc atgttgtgca aaaaagcggt     4140

tagctccttc ggtcctccga tcgttgtcag aagtaagttg gccgcagtgt tatcactcat     4200

ggttatggca gcactgcata attctcttac tgtcatgcca tccgtaagat gcttttctgt     4260

gactggtgag tactcaacca agtcattctg agaatagtgt atgcggcgac cgagttgctc     4320

ttgcccggcg tcaatacggg ataataccgc gccacatagc agaactttaa aagtgctcat     4380

cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc ttaccgctgt tgagatccag     4440

ttcgatgtaa cccactcgtg cacccaactg atcttcagca tcttttactt tcaccagcgt     4500

ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa aagggaataa gggcgacacg     4560

gaaatgttga atactcatac tcttcctttt tcaatattat tgaagcattt atcagggtta     4620

ttgtctcatg agcggataca tatttgaatg tatttagaaa aataaacaaa taggggttcc     4680

gcgcacattt ccccgaaaag tgccacctga cgtctaagaa accattatta tcatgacatt     4740

aacctataaa aataggcgta tcacgaggcc ctttcgtctt caagaattc                 4789


<210>  31
<211>  44
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ER-Gibson_F PCR primer sequence

<400>  31
ggcggatcag gcggaagcac catgaccctc cacaccaaag catc                        44


<210>  32
<211>  42
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ER-Gibson R PCR primer sequence

<400>  32
gtcgtccttg tagtcagcga ccgtggcagg gaaaccctct gc                          42


<210>  33
<211>  41
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  FOXA1-Gibson_F PCR primer sequence

<400>  33
ggcggatcag gcggaagctt aggaactgtg aagatggaag g                           41


<210>  34
<211>  41
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  FOXA1-Gibson_R PCR primer sequence

<400>  34
gtcgtccttg tagtcagcgg aagtgtttag gacgggtctg g                           41


<210>  35
<211>  17
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  TRE3G-seqF sequencing primer

<400>  35
tttaggcgtg tacggtg                                                      17


<210>  36
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  GIY-seqF sequencing primer

<400>  36
gttcgcatac aaacttctgc                                                   20


<210>  37
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  hMGFP-seqR sequencing primer

<400>  37
caggtccata gtctgcttac                                                   20


<210>  38
<211>  19
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  TRE3G-seqR sequencing primer

<400>  38
ctctgtaggt agtttgtcc                                                    19


<210>  39
<211>  6571
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pTRE3G Endo ER nucleotide sequence

<400>  39
ctcgagttta ctccctatca gtgatagaga acgtatgaag agtttactcc ctatcagtga       60

tagagaacgt atgcagactt tactccctat cagtgataga gaacgtataa ggagtttact      120

ccctatcagt gatagagaac gtatgaccag tttactccct atcagtgata gagaacgtat      180

ctacagttta ctccctatca gtgatagaga acgtatatcc agtttactcc ctatcagtga      240

tagagaacgt ataagcttta ggcgtgtacg gtgggcgcct ataaaagcag agctcgttta      300

gtgaaccgtc agatcgcctg gagcaattcc acaacacttt tgtcttatac caactttccg      360

taccacttcc taccctcgta aagtcgacgc caccatgaaa agcggaattt atcagattaa      420

aaatacttta aacaataaag tatatgtagg aagtgctaaa gattttgaaa agagatggaa      480

gaggcatttt aaagatttag aaaaaggatg ccattcttct ataaaacttc agaggtcttt      540

taacaaacat ggtaatgtgt ttgaatgttc tattttggaa gaaattccat atgagaaaga      600

tttgattatt gaacgagaaa atttttggat taaagagctt aattctaaaa ttaatggata      660

caatattgct gatgcaacgt ttggtgatac atgttctacg catccattaa aagaagaaat      720

tattaagaaa cgttctgaaa ctgttaaagc taagatgctt aaacttggac ctgatggtcg      780

gaaagctctt tacagtaaac ccggaagtaa aaacgggcgt tggaatccag aaacccataa      840

gttttgtaag tgcggtgttc gcatacaaac ttctgcttat acttgtagta aatgcagaaa      900

tcgttcaggt gaaaataatt cattctttaa tcataagcat tcagacataa ctaaatctaa      960

aatatcagaa aagatgaaag gtaaaaagcc tagtaatggc ggatcaggcg gaagcaccat     1020

gaccctccac accaaagcat ctgggatggc cctactgcat cagatccaag ggaacgagct     1080

ggagcccctg aaccgtccgc agctcaagat ccccctggag cggcccctgg gcgaggtgta     1140

cctggacagc agcaagcccg ccgtgtacaa ctaccccgag ggcgccgcct acgagttcaa     1200

cgccgcggcc gccgccaacg cgcaggtcta cggtcagacc ggcctcccct acggccccgg     1260

gtctgaggct gcggcgttcg gctccaacgg cctggggggt ttccccccac tcaacagcgt     1320

gtctccgagc ccgctgatgc tactgcaccc gccgccgcag ctgtcgcctt tcctgcagcc     1380

ccacggccag caggtgccct actacctgga gaacgagccc agcggctaca cggtgcgcga     1440

ggcaggcccg ccggcattct acaggccaaa ttcagataat cgacgccagg gtggcagaga     1500

aagattggcc agtaccaatg acaagggaag tatggctatg gaatctgcca aggagactcg     1560

ctactgtgca gtgtgcaatg actatgcttc aggctaccat tatggagtct ggtcctgtga     1620

gggctgcaag gccttcttca agagaagtat tcaaggacat aacgactata tgtgtccagc     1680

caccaaccag tgcaccattg ataaaaacag gaggaagagc tgccaggcct gccggctccg     1740

taaatgctac gaagtgggaa tgatgaaagg tgggatacga aaagaccgaa gaggagggag     1800

aatgttgaaa cacaagcgcc agagagatga tggggagggc aggggtgaag tggggtctgc     1860

tggagacatg agagctgcca acctttggcc aagcccgctc atgatcaaac gctctaagaa     1920

gaacagcctg gccttgtccc tgacggccga ccagatggtc agtgccttgt tggatgctga     1980

gcccccgata ctctattccg agtatgatcc taccagaccc ttcagtgaag cttcgatgat     2040

gggcttactg accaacctgg cagacaggga gctggttcac atgatcaact gggcgaagag     2100

ggtgccaggc tttgtggatt tgaccctcca tgatcaggtc caccttctag aatgtgcctg     2160

gctagagatc ctgatgattg gtctcgtctg gcgctccatg gagcacccag ggaagctact     2220

gtttgctcct aacttgctct tggacaggaa ccagggaaaa tgtgtagagg gcatggtgga     2280

gatcttcgac atgctgctgg ctacatcatc tcggttccgc atgatgaatc tgcagggaga     2340

ggagtttgtg tgcctcaaat ctattatttt gcttaattct ggagtgtaca catttctgtc     2400

cagcaccctg aagtctctgg aagagaagga ccatatccac cgagtcctgg acaagatcac     2460

agacactttg atccacctga tggccaaggc aggcctgacc ctgcagcagc agcaccagcg     2520

gctggcccag ctcctcctca tcctctccca catcaggcac atgagtaaca aaggcatgga     2580

gcatctgtac agcatgaagt gcaagaacgt ggtgcccctc tatgacctgc tgctggagat     2640

gctggacgcc caccgcctac atgcgcccac tagccgtgga ggggcatccg tggaggagac     2700

ggaccaaagc cacttggcca ctgcgggctc tacttcatcg cattccttgc aaaagtatta     2760

catcacgggg gaggcagagg gtttccctgc cacggtcgct gactacaagg acgacgatga     2820

caagggaagc ggagctacta acttcagcct gctgaagcag gctggcgacg tggaggagaa     2880

ccctggacct atgggcgtga tcaagcccga catgaagatc aagctgcgga tggagggcgc     2940

cgtgaacggc cacaaattcg tgatcgaggg cgacgggaaa ggcaagccct ttgagggtaa     3000

gcagactatg gacctgaccg tgatcgaggg cgcccccctg cccttcgctt atgacattct     3060

caccaccgtg ttcgactacg gtaaccgtgt cttcgccaag taccccaagg acatccctga     3120

ctacttcaag cagaccttcc ccgagggcta ctcgtgggag cgaagcatga catacgagga     3180

ccagggaatc tgtatcgcta caaacgacat caccatgatg aagggtgtgg acgactgctt     3240

cgtgtacaaa atccgcttcg acggggtcaa cttccctgct aatggcccgg tgatgcagcg     3300

caagacccta aagtgggagc ccagtaccga gaagatgtac gtgcgggacg gcgtactgaa     3360

gggcgatgtt aatatggcac tgctcttgga gggaggcggc cactaccgct gcgacttcaa     3420

gaccacctac aaagccaaga aggtggtgca gcttcccgac taccacttcg tggaccaccg     3480

catcgagatc gtgagccacg acaaggacta caacaaagtc aagctgtacg agcacgccga     3540

agcccacagc ggactacccc gccaggccgg ctgaataact agatgaggat ccaatgtaac     3600

tgtattcagc gatgacgaaa ttcttagcta ttgtaatact ctagaggatc tttgtgaagg     3660

aaccttactt ctgtggtgtg acataattgg acaaactacc tacagagatt taaagctcta     3720

aggtaaatat aaaattttta agtgtataat gtgttaaact actgattcta attgtttgtg     3780

tattttagat tccaacctat ggaactgatg aatgggagca gtggtggaat gcctttaatg     3840

aggaaaacct gttttgctca gaagaaatgc catctagtga tgatgaggct actgctgact     3900

ctcaacattc tactcctcca aaaaagaaga gaaaggtaga agaccccaag gactttcctt     3960

cagaattgct aagttttttg agtcatgctg tgtttagtaa tagaactctt gcttgctttg     4020

ctatttacac cacaaaggaa aaagctgcac tgctatacaa gaaaattatg gaaaaatatt     4080

ctgtaacctt tataagtagg cataacagtt ataatcataa catactgttt tttcttactc     4140

cacacaggca tagagtgtct gctattaata actatgctca aaaattgtgt acctttagct     4200

ttttaatttg taaaggggtt aataaggaat atttgatgta tagtgccttg actagagatc     4260

ataatcagcc ataccacatt tgtagaggtt ttacttgctt taaaaaacct cccacacctc     4320

cccctgaacc tgaaacataa aatgaatgca attgttgttg ttaacttgtt tattgcagct     4380

tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc atttttttca     4440

ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt ctgcggctct     4500

agagctgcat taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc     4560

ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc     4620

agctcactca aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa     4680

catgtgagca aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt     4740

tttccatagg ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg     4800

gcgaaacccg acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg     4860

ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag     4920

cgtggcgctt tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc     4980

caagctgggc tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa     5040

ctatcgtctt gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg     5100

taacaggatt agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc     5160

taactacggc tacactagaa gaacagtatt tggtatctgc gctctgctga agccagttac     5220

cttcggaaaa agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg     5280

tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt     5340

gatcttttct acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt     5400

catgagatta tcaaaaagga tcttcaccta gatcctttta aattaaaaat gaagttttaa     5460

atcaatctaa agtatatatg agtaaacttg gtctgacagt taccaatgct taatcagtga     5520

ggcacctatc tcagcgatct gtctatttcg ttcatccata gttgcctgac tccccgtcgt     5580

gtagataact acgatacggg agggcttacc atctggcccc agtgctgcaa tgataccgcg     5640

agacccacgc tcaccggctc cagatttatc agcaataaac cagccagccg gaagggccga     5700

gcgcagaagt ggtcctgcaa ctttatccgc ctccatccag tctattaatt gttgccggga     5760

agctagagta agtagttcgc cagttaatag tttgcgcaac gttgttgcca ttgctacagg     5820

catcgtggtg tcacgctcgt cgtttggtat ggcttcattc agctccggtt cccaacgatc     5880

aaggcgagtt acatgatccc ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc     5940

gatcgttgtc agaagtaagt tggccgcagt gttatcactc atggttatgg cagcactgca     6000

taattctctt actgtcatgc catccgtaag atgcttttct gtgactggtg agtactcaac     6060

caagtcattc tgagaatagt gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg     6120

ggataatacc gcgccacata gcagaacttt aaaagtgctc atcattggaa aacgttcttc     6180

ggggcgaaaa ctctcaagga tcttaccgct gttgagatcc agttcgatgt aacccactcg     6240

tgcacccaac tgatcttcag catcttttac tttcaccagc gtttctgggt gagcaaaaac     6300

aggaaggcaa aatgccgcaa aaaagggaat aagggcgaca cggaaatgtt gaatactcat     6360

actcttcctt tttcaatatt attgaagcat ttatcagggt tattgtctca tgagcggata     6420

catatttgaa tgtatttaga aaaataaaca aataggggtt ccgcgcacat ttccccgaaa     6480

agtgccacct gacgtctaag aaaccattat tatcatgaca ttaacctata aaaataggcg     6540

tatcacgagg ccctttcgtc ttcaagaatt c                                    6571


<210>  40
<211>  6202
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pTRE3G Endo FOXA1 nucleotide sequence

<400>  40
ctcgagttta ctccctatca gtgatagaga acgtatgaag agtttactcc ctatcagtga       60

tagagaacgt atgcagactt tactccctat cagtgataga gaacgtataa ggagtttact      120

ccctatcagt gatagagaac gtatgaccag tttactccct atcagtgata gagaacgtat      180

ctacagttta ctccctatca gtgatagaga acgtatatcc agtttactcc ctatcagtga      240

tagagaacgt ataagcttta ggcgtgtacg gtgggcgcct ataaaagcag agctcgttta      300

gtgaaccgtc agatcgcctg gagcaattcc acaacacttt tgtcttatac caactttccg      360

taccacttcc taccctcgta aagtcgacgc caccatgaaa agcggaattt atcagattaa      420

aaatacttta aacaataaag tatatgtagg aagtgctaaa gattttgaaa agagatggaa      480

gaggcatttt aaagatttag aaaaaggatg ccattcttct ataaaacttc agaggtcttt      540

taacaaacat ggtaatgtgt ttgaatgttc tattttggaa gaaattccat atgagaaaga      600

tttgattatt gaacgagaaa atttttggat taaagagctt aattctaaaa ttaatggata      660

caatattgct gatgcaacgt ttggtgatac atgttctacg catccattaa aagaagaaat      720

tattaagaaa cgttctgaaa ctgttaaagc taagatgctt aaacttggac ctgatggtcg      780

gaaagctctt tacagtaaac ccggaagtaa aaacgggcgt tggaatccag aaacccataa      840

gttttgtaag tgcggtgttc gcatacaaac ttctgcttat acttgtagta aatgcagaaa      900

tcgttcaggt gaaaataatt cattctttaa tcataagcat tcagacataa ctaaatctaa      960

aatatcagaa aagatgaaag gtaaaaagcc tagtaatggc ggatcaggcg gaagcttagg     1020

aactgtgaag atggaagggc atgaaaccag cgactggaac agctactacg cagacacgca     1080

ggaggcctac tcctccgtcc cggtcagcaa catgaactca ggcctgggct ccatgaactc     1140

catgaacacc tacatgacca tgaacaccat gactacgagc ggcaacatga ccccggcgtc     1200

cttcaacatg tcctatgcca acccgggcct aggggccgga ctgagtcccg gcgcagtagc     1260

cggcatgccg gggggctcgg cgggcgccat gaacagcatg actgcggccg gcgtgacggc     1320

catgggtacg gcgctgagcc cgagcggcat gggcgccatg ggtgcgcagc aggcggcctc     1380

catgaatggc ctgggcccct acgcggccgc catgaacccg tgcatgagcc ccatggcgta     1440

cgcgccgtcc aacctgggcc gcagccgcgc gggcggcggc ggcgacgcca agacgttcaa     1500

gcgcagctac ccgcacgcca agccgcccta ctcgtacatc tcgctcatca ccatggccat     1560

ccagcaggcg cccagcaaga tgctcacgct gagcgagatc taccagtgga tcatggacct     1620

cttcccctat taccggcaga accagcagcg ctggcagaac tccatccgcc actcgctgtc     1680

cttcaatgac tgcttcgtca aggtggcacg ctccccggac aagccgggca agggctccta     1740

ctggacgctg cacccggact ccggcaacat gttcgagaac ggctgctact tgcgccgcca     1800

gaagcgcttc aagtgcgaga agcagccggg ggccggcggc gggggcggga gcggaagcgg     1860

gggcagcggc gccaagggcg gccctgagag ccgcaaggac ccctctggcg cctctaaccc     1920

cagcgccgac tcgcccctcc atcggggtgt gcacgggaag accggccagc tagagggcgc     1980

gccggccccc gggcccgccg ccagccccca gactctggac cacagtgggg cgacggcgac     2040

agggggcgcc tcggagttga agactccagc ctcctcaact gcgcccccca taagctccgg     2100

gcccggggcg ctggcctctg tgcccgcctc tcacccggca cacggcttgg caccccacga     2160

gtcccagctg cacctgaaag gggaccccca ctactccttc aaccacccgt tctccatcaa     2220

caacctcatg tcctcctcgg agcagcagca taagctggac ttcaaggcat acgaacaggc     2280

actgcaatac tcgccttacg gctctacgtt gcccgccagc ctgcctctag gcagcgcctc     2340

ggtgaccacc aggagcccca tcgagccctc agccctggag ccggcgtact accaaggtgt     2400

gtattccaga cccgtcctaa acacttccgc tgactacaag gacgacgatg acaagggaag     2460

cggagctact aacttcagcc tgctgaagca ggctggcgac gtggaggaga accctggacc     2520

tatgggcgtg atcaagcccg acatgaagat caagctgcgg atggagggcg ccgtgaacgg     2580

ccacaaattc gtgatcgagg gcgacgggaa aggcaagccc tttgagggta agcagactat     2640

ggacctgacc gtgatcgagg gcgcccccct gcccttcgct tatgacattc tcaccaccgt     2700

gttcgactac ggtaaccgtg tcttcgccaa gtaccccaag gacatccctg actacttcaa     2760

gcagaccttc cccgagggct actcgtggga gcgaagcatg acatacgagg accagggaat     2820

ctgtatcgct acaaacgaca tcaccatgat gaagggtgtg gacgactgct tcgtgtacaa     2880

aatccgcttc gacggggtca acttccctgc taatggcccg gtgatgcagc gcaagaccct     2940

aaagtgggag cccagtaccg agaagatgta cgtgcgggac ggcgtactga agggcgatgt     3000

taatatggca ctgctcttgg agggaggcgg ccactaccgc tgcgacttca agaccaccta     3060

caaagccaag aaggtggtgc agcttcccga ctaccacttc gtggaccacc gcatcgagat     3120

cgtgagccac gacaaggact acaacaaagt caagctgtac gagcacgccg aagcccacag     3180

cggactaccc cgccaggccg gctgaataac tagatgagga tccaatgtaa ctgtattcag     3240

cgatgacgaa attcttagct attgtaatac tctagaggat ctttgtgaag gaaccttact     3300

tctgtggtgt gacataattg gacaaactac ctacagagat ttaaagctct aaggtaaata     3360

taaaattttt aagtgtataa tgtgttaaac tactgattct aattgtttgt gtattttaga     3420

ttccaaccta tggaactgat gaatgggagc agtggtggaa tgcctttaat gaggaaaacc     3480

tgttttgctc agaagaaatg ccatctagtg atgatgaggc tactgctgac tctcaacatt     3540

ctactcctcc aaaaaagaag agaaaggtag aagaccccaa ggactttcct tcagaattgc     3600

taagtttttt gagtcatgct gtgtttagta atagaactct tgcttgcttt gctatttaca     3660

ccacaaagga aaaagctgca ctgctataca agaaaattat ggaaaaatat tctgtaacct     3720

ttataagtag gcataacagt tataatcata acatactgtt ttttcttact ccacacaggc     3780

atagagtgtc tgctattaat aactatgctc aaaaattgtg tacctttagc tttttaattt     3840

gtaaaggggt taataaggaa tatttgatgt atagtgcctt gactagagat cataatcagc     3900

cataccacat ttgtagaggt tttacttgct ttaaaaaacc tcccacacct ccccctgaac     3960

ctgaaacata aaatgaatgc aattgttgtt gttaacttgt ttattgcagc ttataatggt     4020

tacaaataaa gcaatagcat cacaaatttc acaaataaag catttttttc actgcattct     4080

agttgtggtt tgtccaaact catcaatgta tcttatcatg tctgcggctc tagagctgca     4140

ttaatgaatc ggccaacgcg cggggagagg cggtttgcgt attgggcgct cttccgcttc     4200

ctcgctcact gactcgctgc gctcggtcgt tcggctgcgg cgagcggtat cagctcactc     4260

aaaggcggta atacggttat ccacagaatc aggggataac gcaggaaaga acatgtgagc     4320

aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag     4380

gctccgcccc cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc     4440

gacaggacta taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt     4500

tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct     4560

ttctcatagc tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg     4620

ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct     4680

tgagtccaac ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat     4740

tagcagagcg aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg     4800

ctacactaga agaacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa     4860

aagagttggt agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt     4920

ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc     4980

tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt     5040

atcaaaaagg atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta     5100

aagtatatat gagtaaactt ggtctgacag ttaccaatgc ttaatcagtg aggcacctat     5160

ctcagcgatc tgtctatttc gttcatccat agttgcctga ctccccgtcg tgtagataac     5220

tacgatacgg gagggcttac catctggccc cagtgctgca atgataccgc gagacccacg     5280

ctcaccggct ccagatttat cagcaataaa ccagccagcc ggaagggccg agcgcagaag     5340

tggtcctgca actttatccg cctccatcca gtctattaat tgttgccggg aagctagagt     5400

aagtagttcg ccagttaata gtttgcgcaa cgttgttgcc attgctacag gcatcgtggt     5460

gtcacgctcg tcgtttggta tggcttcatt cagctccggt tcccaacgat caaggcgagt     5520

tacatgatcc cccatgttgt gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt     5580

cagaagtaag ttggccgcag tgttatcact catggttatg gcagcactgc ataattctct     5640

tactgtcatg ccatccgtaa gatgcttttc tgtgactggt gagtactcaa ccaagtcatt     5700

ctgagaatag tgtatgcggc gaccgagttg ctcttgcccg gcgtcaatac gggataatac     5760

cgcgccacat agcagaactt taaaagtgct catcattgga aaacgttctt cggggcgaaa     5820

actctcaagg atcttaccgc tgttgagatc cagttcgatg taacccactc gtgcacccaa     5880

ctgatcttca gcatctttta ctttcaccag cgtttctggg tgagcaaaaa caggaaggca     5940

aaatgccgca aaaaagggaa taagggcgac acggaaatgt tgaatactca tactcttcct     6000

ttttcaatat tattgaagca tttatcaggg ttattgtctc atgagcggat acatatttga     6060

atgtatttag aaaaataaac aaataggggt tccgcgcaca tttccccgaa aagtgccacc     6120

tgacgtctaa gaaaccatta ttatcatgac attaacctat aaaaataggc gtatcacgag     6180

gccctttcgt cttcaagaat tc                                              6202


<210>  41
<211>  6046
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  pTRE3G Endo FOXA1 delDBD nucleotide sequence

<400>  41
ctcgagttta ctccctatca gtgatagaga acgtatgaag agtttactcc ctatcagtga       60

tagagaacgt atgcagactt tactccctat cagtgataga gaacgtataa ggagtttact      120

ccctatcagt gatagagaac gtatgaccag tttactccct atcagtgata gagaacgtat      180

ctacagttta ctccctatca gtgatagaga acgtatatcc agtttactcc ctatcagtga      240

tagagaacgt ataagcttta ggcgtgtacg gtgggcgcct ataaaagcag agctcgttta      300

gtgaaccgtc agatcgcctg gagcaattcc acaacacttt tgtcttatac caactttccg      360

taccacttcc taccctcgta aagtcgacgc caccatgaaa agcggaattt atcagattaa      420

aaatacttta aacaataaag tatatgtagg aagtgctaaa gattttgaaa agagatggaa      480

gaggcatttt aaagatttag aaaaaggatg ccattcttct ataaaacttc agaggtcttt      540

taacaaacat ggtaatgtgt ttgaatgttc tattttggaa gaaattccat atgagaaaga      600

tttgattatt gaacgagaaa atttttggat taaagagctt aattctaaaa ttaatggata      660

caatattgct gatgcaacgt ttggtgatac atgttctacg catccattaa aagaagaaat      720

tattaagaaa cgttctgaaa ctgttaaagc taagatgctt aaacttggac ctgatggtcg      780

gaaagctctt tacagtaaac ccggaagtaa aaacgggcgt tggaatccag aaacccataa      840

gttttgtaag tgcggtgttc gcatacaaac ttctgcttat acttgtagta aatgcagaaa      900

tcgttcaggt gaaaataatt cattctttaa tcataagcat tcagacataa ctaaatctaa      960

aatatcagaa aagatgaaag gtaaaaagcc tagtaatggc ggatcaggcg gaagcttagg     1020

aactgtgaag atggaagggc atgaaaccag cgactggaac agctactacg cagacacgca     1080

ggaggcctac tcctccgtcc cggtcagcaa catgaactca ggcctgggct ccatgaactc     1140

catgaacacc tacatgacca tgaacaccat gactacgagc ggcaacatga ccccggcgtc     1200

cttcaacatg tcctatgcca acccgggcct aggggccgga ctgagtcccg gcgcagtagc     1260

cggcatgccg gggggctcgg cgggcgccat gaacagcatg actgcggccg gcgtgacggc     1320

catgggtacg gcgctgagcc cgagcggcat gggcgccatg ggtgcgcagc aggcggcctc     1380

catgaatggc ctgggcccct acgcggccgc catgaacccg tgcatgagcc ccatggcgta     1440

cgcgccgtcc aacctgggcc gcagccgcgc gggcggcggc ggcgacgcca agacgttcaa     1500

gcgcagctac ccgcacgcca agccgcccta ctcgtacatc tcgctcatca ccatggccat     1560

ccagcaggcg cccagcaaga tgctcacgct gagcgagatc taccagtgga tcatggacct     1620

cttcccctat taccggcaga accagcagcg cttcaagtgc gagaagcagc cgggggccgg     1680

cggcgggggc gggagcggaa gcgggggcag cggcgccaag ggcggccctg agagccgcaa     1740

ggacccctct ggcgcctcta accccagcgc cgactcgccc ctccatcggg gtgtgcacgg     1800

gaagaccggc cagctagagg gcgcgccggc ccccgggccc gccgccagcc cccagactct     1860

ggaccacagt ggggcgacgg cgacaggggg cgcctcggag ttgaagactc cagcctcctc     1920

aactgcgccc cccataagct ccgggcccgg ggcgctggcc tctgtgcccg cctctcaccc     1980

ggcacacggc ttggcacccc acgagtccca gctgcacctg aaaggggacc cccactactc     2040

cttcaaccac ccgttctcca tcaacaacct catgtcctcc tcggagcagc agcataagct     2100

ggacttcaag gcatacgaac aggcactgca atactcgcct tacggctcta cgttgcccgc     2160

cagcctgcct ctaggcagcg cctcggtgac caccaggagc cccatcgagc cctcagccct     2220

ggagccggcg tactaccaag gtgtgtattc cagacccgtc ctaaacactt ccgctgacta     2280

caaggacgac gatgacaagg gaagcggagc tactaacttc agcctgctga agcaggctgg     2340

cgacgtggag gagaaccctg gacctatggg cgtgatcaag cccgacatga agatcaagct     2400

gcggatggag ggcgccgtga acggccacaa attcgtgatc gagggcgacg ggaaaggcaa     2460

gccctttgag ggtaagcaga ctatggacct gaccgtgatc gagggcgccc ccctgccctt     2520

cgcttatgac attctcacca ccgtgttcga ctacggtaac cgtgtcttcg ccaagtaccc     2580

caaggacatc cctgactact tcaagcagac cttccccgag ggctactcgt gggagcgaag     2640

catgacatac gaggaccagg gaatctgtat cgctacaaac gacatcacca tgatgaaggg     2700

tgtggacgac tgcttcgtgt acaaaatccg cttcgacggg gtcaacttcc ctgctaatgg     2760

cccggtgatg cagcgcaaga ccctaaagtg ggagcccagt accgagaaga tgtacgtgcg     2820

ggacggcgta ctgaagggcg atgttaatat ggcactgctc ttggagggag gcggccacta     2880

ccgctgcgac ttcaagacca cctacaaagc caagaaggtg gtgcagcttc ccgactacca     2940

cttcgtggac caccgcatcg agatcgtgag ccacgacaag gactacaaca aagtcaagct     3000

gtacgagcac gccgaagccc acagcggact accccgccag gccggctgaa taactagatg     3060

aggatccaat gtaactgtat tcagcgatga cgaaattctt agctattgta atactctaga     3120

ggatctttgt gaaggaacct tacttctgtg gtgtgacata attggacaaa ctacctacag     3180

agatttaaag ctctaaggta aatataaaat ttttaagtgt ataatgtgtt aaactactga     3240

ttctaattgt ttgtgtattt tagattccaa cctatggaac tgatgaatgg gagcagtggt     3300

ggaatgcctt taatgaggaa aacctgtttt gctcagaaga aatgccatct agtgatgatg     3360

aggctactgc tgactctcaa cattctactc ctccaaaaaa gaagagaaag gtagaagacc     3420

ccaaggactt tccttcagaa ttgctaagtt ttttgagtca tgctgtgttt agtaatagaa     3480

ctcttgcttg ctttgctatt tacaccacaa aggaaaaagc tgcactgcta tacaagaaaa     3540

ttatggaaaa atattctgta acctttataa gtaggcataa cagttataat cataacatac     3600

tgttttttct tactccacac aggcatagag tgtctgctat taataactat gctcaaaaat     3660

tgtgtacctt tagcttttta atttgtaaag gggttaataa ggaatatttg atgtatagtg     3720

ccttgactag agatcataat cagccatacc acatttgtag aggttttact tgctttaaaa     3780

aacctcccac acctccccct gaacctgaaa cataaaatga atgcaattgt tgttgttaac     3840

ttgtttattg cagcttataa tggttacaaa taaagcaata gcatcacaaa tttcacaaat     3900

aaagcatttt tttcactgca ttctagttgt ggtttgtcca aactcatcaa tgtatcttat     3960

catgtctgcg gctctagagc tgcattaatg aatcggccaa cgcgcgggga gaggcggttt     4020

gcgtattggg cgctcttccg cttcctcgct cactgactcg ctgcgctcgg tcgttcggct     4080

gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga     4140

taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc     4200

cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg     4260

ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg     4320

aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt     4380

tctcccttcg ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt     4440

gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg     4500

cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact     4560

ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt     4620

cttgaagtgg tggcctaact acggctacac tagaagaaca gtatttggta tctgcgctct     4680

gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac     4740

cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc     4800

tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg     4860

ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta     4920

aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca     4980

atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc     5040

ctgactcccc gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc     5100

tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc     5160

agccggaagg gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat     5220

taattgttgc cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt     5280

tgccattgct acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc     5340

cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag     5400

ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt     5460

tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac     5520

tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg     5580

cccggcgtca atacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat     5640

tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc     5700

gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc     5760

tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa     5820

atgttgaata ctcatactct tcctttttca atattattga agcatttatc agggttattg     5880

tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg     5940

cacatttccc cgaaaagtgc cacctgacgt ctaagaaacc attattatca tgacattaac     6000

ctataaaaat aggcgtatca cgaggccctt tcgtcttcaa gaattc                    6046


