               SEQUENCE LISTING

<110> NovioGendix Research B.V.
      Smit, Franciscus Petrus

<120> Molecular markers in prostate cancer

<130> 4/2MP66/7P

<150> PCT/EP2011/057714
<151> 2011-05-12

<160> 16

<170> BiSSAP 1.0

<210> 1
<211> 1681
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..1681
<223> /mol_type="DNA"
      /note="HOXC6"
      /organism="Homo sapiens"

<400> 1
ttttgtctgt cctggattgg agccgtccct ataaccatct agttccgagt acaaactgga      60

gacagaaata aatattaaag aaatcataga ccgaccaggt aaaggcaaag ggatgaattc     120

ctacttcact aacccttcct tatcctgcca cctcgccggg ggccaggacg tcctccccaa     180

cgtcgccctc aattccaccg cctatgatcc agtgaggcat ttctcgacct atggagcggc     240

cgttgcccag aaccggatct actcgactcc cttttattcg ccacaggaga atgtcgtgtt     300

cagttccagc cgggggccgt atgactatgg atctaattcc ttttaccagg agaaagacat     360

gctctcaaac tgcagacaaa acaccttagg acataacaca cagacctcaa tcgctcagga     420

ttttagttct gagcagggca ggactgcgcc ccaggaccag aaagccagta tccagattta     480

cccctggatg cagcgaatga attcgcacag tggggtcggc tacggagcgg accggaggcg     540

cggccgccag atctactcgc ggtaccagac cctggaactg gagaaggaat ttcacttcaa     600

tcgctaccta acgcggcgcc ggcgcatcga gatcgccaac gcgctttgcc tgaccgagcg     660

acagatcaaa atctggttcc agaaccgccg gatgaagtgg aaaaaagaat ctaatctcac     720

atccactctc tcggggggcg gcggaggggc caccgccgac agcctgggcg gaaaagagga     780

aaagcgggaa gagacagaag aggagaagca gaaagagtga ccaggactgt ccctgccacc     840

cctctctccc tttctccctc gctccccacc aactctcccc taatcacaca ctctgtattt     900

atcactggca caattgatgt gttttgattc cctaaaacaa aattagggag tcaaacgtgg     960

acctgaaagt cagctctgga ccccctccct caccgcacaa ctctctttca ccacgcgcct    1020

cctcctcctc gctcccttgc tagctcgttc tcggcttgtc tacaggccct tttccccgtc    1080

caggccttgg gggctcggac cctgaactca gactctacag attgccctcc aagtgaggac    1140

ttggctcccc cactccttcg acgcccccac ccccgccccc cgtgcagaga gccggctcct    1200

gggcctgctg gggcctctgc tccagggcct cagggcccgg cctggcagcc ggggagggcc    1260

ggaggcccaa ggagggcgcg ccttggcccc acaccaaccc ccagggcctc cccgcagtcc    1320

ctgcctagcc cctctgcccc agcaaatgcc cagcccaggc aaattgtatt taaagaatcc    1380

tgggggtcat tatggcattt tacaaactgt gaccgtttct gtgtgaagat ttttagctgt    1440

atttgtggtc tctgtattta tatttatgtt tagcaccgtc agtgttccta tccaatttca    1500

aaaaaggaaa aaaaagaggg aaaattacaa aaagagagaa aaaaagtgaa tgacgtttgt    1560

ttagccagta ggagaaaata aataaataaa taaatccctt cgtgttaccc tcctgtataa    1620

atccaacctc tgggtccgtt ctcgaatatt taataaaact gatattattt ttaaaacttt    1680

a                                                                    1681


<210> 2
<211> 235
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..235
<223> /mol_type="protein"
      /note="HOXC6"
      /organism="Homo sapiens"

<400> 2
Met Asn Ser Tyr Phe Thr Asn Pro Ser Leu Ser Cys His Leu Ala Gly 
1               5                   10                   15    
Gly Gln Asp Val Leu Pro Asn Val Ala Leu Asn Ser Thr Ala Tyr Asp 
            20                   25                  30        
Pro Val Arg His Phe Ser Thr Tyr Gly Ala Ala Val Ala Gln Asn Arg 
        35                   40                  45            
Ile Tyr Ser Thr Pro Phe Tyr Ser Pro Gln Glu Asn Val Val Phe Ser 
    50                   55                  60                
Ser Ser Arg Gly Pro Tyr Asp Tyr Gly Ser Asn Ser Phe Tyr Gln Glu 
65                   70                  75                  80
Lys Asp Met Leu Ser Asn Cys Arg Gln Asn Thr Leu Gly His Asn Thr 
                85                   90                  95    
Gln Thr Ser Ile Ala Gln Asp Phe Ser Ser Glu Gln Gly Arg Thr Ala 
            100                  105                110        
Pro Gln Asp Gln Lys Ala Ser Ile Gln Ile Tyr Pro Trp Met Gln Arg 
        115                  120                125            
Met Asn Ser His Ser Gly Val Gly Tyr Gly Ala Asp Arg Arg Arg Gly 
    130                  135                140                
Arg Gln Ile Tyr Ser Arg Tyr Gln Thr Leu Glu Leu Glu Lys Glu Phe 
145                  150                155                  160
His Phe Asn Arg Tyr Leu Thr Arg Arg Arg Arg Ile Glu Ile Ala Asn 
                165                  170                175    
Ala Leu Cys Leu Thr Glu Arg Gln Ile Lys Ile Trp Phe Gln Asn Arg 
            180                  185                190        
Arg Met Lys Trp Lys Lys Glu Ser Asn Leu Thr Ser Thr Leu Ser Gly 
        195                  200                205            
Gly Gly Gly Gly Ala Thr Ala Asp Ser Leu Gly Gly Lys Glu Glu Lys 
    210                  215                220                
Arg Glu Glu Thr Glu Glu Glu Lys Gln Lys Glu 
225                  230                235

<210> 3
<211> 2005
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..2005
<223> /mol_type="DNA"
      /note="SFRP2"
      /organism="Homo sapiens"

<400> 3
caacggctca ttctgctccc ccgggtcgga gccccccgga gctgcgcgcg ggcttgcagc      60

gcctcgcccg cgctgtcctc ccggtgtccc gcttctccgc gccccagccg ccggctgcca     120

gcttttcggg gccccgagtc gcacccagcg aagagagcgg gcccgggaca agctcgaact     180

ccggccgcct cgcccttccc cggctccgct ccctctgccc cctcggggtc gcgcgcccac     240

gatgctgcag ggccctggct cgctgctgct gctcttcctc gcctcgcact gctgcctggg     300

ctcggcgcgc gggctcttcc tctttggcca gcccgacttc tcctacaagc gcagcaattg     360

caagcccatc cctgccaacc tgcagctgtg ccacggcatc gaataccaga acatgcggct     420

gcccaacctg ctgggccacg agaccatgaa ggaggtgctg gagcaggccg gcgcttggat     480

cccgctggtc atgaagcagt gccacccgga caccaagaag ttcctgtgct cgctcttcgc     540

ccccgtctgc ctcgatgacc tagacgagac catccagcca tgccactcgc tctgcgtgca     600

ggtgaaggac cgctgcgccc cggtcatgtc cgccttcggc ttcccctggc ccgacatgct     660

tgagtgcgac cgtttccccc aggacaacga cctttgcatc cccctcgcta gcagcgacca     720

cctcctgcca gccaccgagg aagctccaaa ggtatgtgaa gcctgcaaaa ataaaaatga     780

tgatgacaac gacataatgg aaacgctttg taaaaatgat tttgcactga aaataaaagt     840

gaaggagata acctacatca accgagatac caaaatcatc ctggagacca agagcaagac     900

catttacaag ctgaacggtg tgtccgaaag ggacctgaag aaatcggtgc tgtggctcaa     960

agacagcttg cagtgcacct gtgaggagat gaacgacatc aacgcgccct atctggtcat    1020

gggacagaaa cagggtgggg agctggtgat cacctcggtg aagcggtggc agaaggggca    1080

gagagagttc aagcgcatct cccgcagcat ccgcaagctg cagtgctagt cccggcatcc    1140

tgatggctcc gacaggcctg ctccagagca cggctgacca tttctgctcc gggatctcag    1200

ctcccgttcc ccaagcacac tcctagctgc tccagtctca gcctgggcag cttccccctg    1260

ccttttgcac gtttgcatcc ccagcatttc ctgagttata aggccacagg agtggatagc    1320

tgttttcacc taaaggaaaa gcccacccga atcttgtaga aatattcaaa ctaataaaat    1380

catgaatatt tttatgaagt ttaaaaatag ctcactttaa agctagtttt gaataggtgc    1440

aactgtgact tgggtctggt tggttgttgt ttgttgtttt gagtcagctg attttcactt    1500

cccactgagg ttgtcataac atgcaaattg cttcaatttt ctctgtggcc caaacttgtg    1560

ggtcacaaac cctgttgaga taaagctggc tgttatctca acatcttcat cagctccaga    1620

ctgagactca gtgtctaagt cttacaacaa ttcatcattt tataccttca atgggaactt    1680

aaactgttac atgtatcaca ttccagctac aatacttcca tttattagaa gcacattaac    1740

catttctata gcatgatttc ttcaagtaaa aggcaaaaga tataaatttt ataattgact    1800

tgagtacttt aagccttgtt taaaacattt cttacttaac ttttgcaaat taaacccatt    1860

gtagcttacc tgtaatatac atagtagttt acctttaaaa gttgtaaaaa tattgcttta    1920

accaacactg taaatatttc agataaacat tatattcttg tatataaact ttacatcctg    1980

ttttacctat aaaaaaaaaa aaaaa                                          2005


<210> 4
<211> 295
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..295
<223> /mol_type="protein"
      /note="SFRP2"
      /organism="Homo sapiens"

<400> 4
Met Leu Gln Gly Pro Gly Ser Leu Leu Leu Leu Phe Leu Ala Ser His 
1               5                   10                   15    
Cys Cys Leu Gly Ser Ala Arg Gly Leu Phe Leu Phe Gly Gln Pro Asp 
            20                   25                  30        
Phe Ser Tyr Lys Arg Ser Asn Cys Lys Pro Ile Pro Ala Asn Leu Gln 
        35                   40                  45            
Leu Cys His Gly Ile Glu Tyr Gln Asn Met Arg Leu Pro Asn Leu Leu 
    50                   55                  60                
Gly His Glu Thr Met Lys Glu Val Leu Glu Gln Ala Gly Ala Trp Ile 
65                   70                  75                  80
Pro Leu Val Met Lys Gln Cys His Pro Asp Thr Lys Lys Phe Leu Cys 
                85                   90                  95    
Ser Leu Phe Ala Pro Val Cys Leu Asp Asp Leu Asp Glu Thr Ile Gln 
            100                  105                110        
Pro Cys His Ser Leu Cys Val Gln Val Lys Asp Arg Cys Ala Pro Val 
        115                  120                125            
Met Ser Ala Phe Gly Phe Pro Trp Pro Asp Met Leu Glu Cys Asp Arg 
    130                  135                140                
Phe Pro Gln Asp Asn Asp Leu Cys Ile Pro Leu Ala Ser Ser Asp His 
145                  150                155                  160
Leu Leu Pro Ala Thr Glu Glu Ala Pro Lys Val Cys Glu Ala Cys Lys 
                165                  170                175    
Asn Lys Asn Asp Asp Asp Asn Asp Ile Met Glu Thr Leu Cys Lys Asn 
            180                  185                190        
Asp Phe Ala Leu Lys Ile Lys Val Lys Glu Ile Thr Tyr Ile Asn Arg 
        195                  200                205            
Asp Thr Lys Ile Ile Leu Glu Thr Lys Ser Lys Thr Ile Tyr Lys Leu 
    210                  215                220                
Asn Gly Val Ser Glu Arg Asp Leu Lys Lys Ser Val Leu Trp Leu Lys 
225                  230                235                  240
Asp Ser Leu Gln Cys Thr Cys Glu Glu Met Asn Asp Ile Asn Ala Pro 
                245                  250                255    
Tyr Leu Val Met Gly Gln Lys Gln Gly Gly Glu Leu Val Ile Thr Ser 
            260                  265                270        
Val Lys Arg Trp Gln Lys Gly Gln Arg Glu Phe Lys Arg Ile Ser Arg 
        275                  280                285            
Ser Ile Arg Lys Leu Gln Cys 
    290                  295

<210> 5
<211> 1814
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..1814
<223> /mol_type="DNA"
      /note="HOXD10"
      /organism="Homo sapiens"

<400> 5
cggggaatgt tttcctagag atgtcagcct acaaaggaca caatctctct tcttcaaatt      60

cttccccaaa atgtcctttc ccaacagctc tcctgctgct aatacttttt tagtagattc     120

cttgatcagt gcctgcagga gtgacagttt ttattccagc agcgccagca tgtacatgcc     180

accacctagc gcagacatgg ggacctatgg aatgcaaacc tgtggactgc tcccgtctct     240

ggccaaaaga gaagtgaacc accaaaatat gggtatgaat gtgcatcctt atatacctca     300

agtagacagt tggacagatc cgaacagatc ttgtcgaata gagcaacctg ttacacagca     360

agtccccact tgctccttca ccaccaacat taaggaagaa tccaattgct gcatgtattc     420

tgataagcgc aacaaactca tttcggccga ggtcccttcg taccagaggc tggtccctga     480

gtcttgtccc gttgagaacc ctgaggttcc cgtccctgga tattttagac tgagtcagac     540

ctacgccacc gggaaaaccc aagagtacaa taatagcccc gaaggcagct ccactgtcat     600

gctccagctc aaccctcgtg gcgcggccaa gccgcagctc tccgctgccc agctgcagat     660

ggaaaagaag atgaacgagc ccgtgagcgg ccaggagccc accaaagtct cccaggtgga     720

gagccccgag gccaaaggcg gccttcccga agagaggagc tgcctggctg aggtctccgt     780

gtccagtccc gaagtgcagg agaaggaaag caaagaggaa atcaagtctg atacaccaac     840

cagcaattgg ctcactgcaa agagtggcag aaagaagagg tgcccttaca ctaagcacca     900

aacgctggaa ttagaaaaag agttcttgtt caatatgtac ctcacccgcg agcgccgcct     960

agagatcagt aagagcgtta acctcaccga caggcaggtc aagatttggt ttcaaaaccg    1020

ccgaatgaaa ctcaagaaga tgagccgaga gaaccggatc cgagaactga ccgccaacct    1080

cacgttttct taggtctgag gccggtctga ggccggtcag aggccaggat tggagagggg    1140

gcaccgcgtt ccagggccca gtgctggagg actgggaaag cggaaacaaa accttcaccg    1200

ctctttgttt gttgttttgt tgtattttgt tttcctgcta gaatgtgact ttggggtcat    1260

tatgttcgtg ctgcaagtga tctgtaatcc ctatgagtat atatatatat atatatatat    1320

atatataaaa acttagcacg tgtaatttat tattttttca tcgtaatgca gggtaactat    1380

tattgcgcat tttcatttgg gtcttaactt attggaactg tagagcatcc atccatccat    1440

ccatccagca atgtgacttt ttcatgtctt tcctaacaca aaaggtctat gtgtgtggtt    1500

agtccatgaa ctcatggcat tttgaataca tccagtactt taaaaatgac atatatattt    1560

aaaaaaaaaa gattaagaaa acccacaagt tggagggagg gggacttaaa aagcacatta    1620

caatgtatct tttcacaaat gaatttagca gttgtccttg gtgagatggg atattggcga    1680

tttatgcctt gtagcctttc ccttgtggtg catctgtggt ttggtagaag tacaacagca    1740

acctgtcctt tctgtgcatg ttctggtcgc atgtataatg caataaactc tggaaatgag    1800

ttcaaaaaaa aaaa                                                      1814


<210> 6
<211> 340
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..340
<223> /mol_type="protein"
      /note="HOXD10"
      /organism="Homo sapiens"

<400> 6
Met Ser Phe Pro Asn Ser Ser Pro Ala Ala Asn Thr Phe Leu Val Asp 
1               5                   10                   15    
Ser Leu Ile Ser Ala Cys Arg Ser Asp Ser Phe Tyr Ser Ser Ser Ala 
            20                   25                  30        
Ser Met Tyr Met Pro Pro Pro Ser Ala Asp Met Gly Thr Tyr Gly Met 
        35                   40                  45            
Gln Thr Cys Gly Leu Leu Pro Ser Leu Ala Lys Arg Glu Val Asn His 
    50                   55                  60                
Gln Asn Met Gly Met Asn Val His Pro Tyr Ile Pro Gln Val Asp Ser 
65                   70                  75                  80
Trp Thr Asp Pro Asn Arg Ser Cys Arg Ile Glu Gln Pro Val Thr Gln 
                85                   90                  95    
Gln Val Pro Thr Cys Ser Phe Thr Thr Asn Ile Lys Glu Glu Ser Asn 
            100                  105                110        
Cys Cys Met Tyr Ser Asp Lys Arg Asn Lys Leu Ile Ser Ala Glu Val 
        115                  120                125            
Pro Ser Tyr Gln Arg Leu Val Pro Glu Ser Cys Pro Val Glu Asn Pro 
    130                  135                140                
Glu Val Pro Val Pro Gly Tyr Phe Arg Leu Ser Gln Thr Tyr Ala Thr 
145                  150                155                  160
Gly Lys Thr Gln Glu Tyr Asn Asn Ser Pro Glu Gly Ser Ser Thr Val 
                165                  170                175    
Met Leu Gln Leu Asn Pro Arg Gly Ala Ala Lys Pro Gln Leu Ser Ala 
            180                  185                190        
Ala Gln Leu Gln Met Glu Lys Lys Met Asn Glu Pro Val Ser Gly Gln 
        195                  200                205            
Glu Pro Thr Lys Val Ser Gln Val Glu Ser Pro Glu Ala Lys Gly Gly 
    210                  215                220                
Leu Pro Glu Glu Arg Ser Cys Leu Ala Glu Val Ser Val Ser Ser Pro 
225                  230                235                  240
Glu Val Gln Glu Lys Glu Ser Lys Glu Glu Ile Lys Ser Asp Thr Pro 
                245                  250                255    
Thr Ser Asn Trp Leu Thr Ala Lys Ser Gly Arg Lys Lys Arg Cys Pro 
            260                  265                270        
Tyr Thr Lys His Gln Thr Leu Glu Leu Glu Lys Glu Phe Leu Phe Asn 
        275                  280                285            
Met Tyr Leu Thr Arg Glu Arg Arg Leu Glu Ile Ser Lys Ser Val Asn 
    290                  295                300                
Leu Thr Asp Arg Gln Val Lys Ile Trp Phe Gln Asn Arg Arg Met Lys 
305                  310                315                  320
Leu Lys Lys Met Ser Arg Glu Asn Arg Ile Arg Glu Leu Thr Ala Asn 
                325                  330                335    
Leu Thr Phe Ser 
            340

<210> 7
<211> 3604
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..3604
<223> /mol_type="DNA"
      /note="RORB"
      /organism="Homo sapiens"

<400> 7
tctctcccct ctctttctct ctcgctgctc ccttcctccc tgtaactgaa cagtgaaaat       60

tcacattgtg gatccgctaa caggcacaga tgtcatgtga aaacgcacat gctctgccat      120

ccacaccgcc tttctttctt ttctttctgt ttcctttttt cccccttgtt ccttctccct      180

cttctttgta actaacaaaa ccaccaccaa ctcctcctcc tgctgctgcc cttcctcctc      240

ctcctcagtc caagtgatca caaaagaaat cttctgagcc ggaggcggtg gcatttttta      300

aaaagcaagc acattggaga gaaagaaaaa gaaaaacaaa accaaaacaa aacccaggca      360

ccagacagcc agaacatttt tttttcaccc ttcctgaaaa caaacaaaca aacaaacaat      420

catcaaaaca gtcaccacca acatcaaaac tgttaacata gcggcggcgg cggcaaacgt      480

caccctgcag ccacggcgtc cgcctaaagg gatggttttc tcggcagagc agctcttcgc      540

cgaccacctt cttcactcgt gctgagcggg atttttgggc tctccggggt tcgggctggg      600

agcagcttca tgactacgcg gagcgggaga gcggccacac catgcgagca caaattgaag      660

tgataccatg caaaatttgt ggcgataagt cctctgggat ccactacgga gtcatcacat      720

gtgaaggctg caagggattc tttaggagga gccagcagaa caatgcttct tattcctgcc      780

caaggcagag aaactgttta attgacagaa cgaacagaaa ccgttgccaa cactgccgac      840

tgcagaagtg tcttgcccta ggaatgtcaa gagatgctgt gaagtttggg aggatgtcca      900

agaagcaaag ggacagcctg tatgctgagg tgcagaagca ccagcagcgg ctgcaggaac      960

agcggcagca gcagagtggg gaggcagaag cccttgccag ggtgtacagc agcagcatta     1020

gcaacggcct gagcaacctg aacaacgaga ccagcggcac ttatgccaac gggcacgtca     1080

ttgacctgcc caagtctgag ggttattaca acgtcgattc cggtcagccg tcccctgatc     1140

agtcaggact tgacatgact ggaatcaaac agataaagca agaacctatc tatgacctca     1200

catccgtacc caacttgttt acctatagct ctttcaacaa tgggcagtta gcaccaggga     1260

taaccatgac tgaaatcgac cgaattgcac agaacatcat taagtcccat ttggagacat     1320

gtcaatacac catggaagag ctgcaccagc tggcgtggca gacccacacc tatgaagaaa     1380

ttaaagcata tcaaagcaag tccagggaag cactgtggca acaatgtgcc atccagatca     1440

ctcacgccat ccaatacgtg gtggagtttg caaagcggat aacaggcttc atggagctct     1500

gtcaaaatga tcaaattcta cttctgaagt caggttgctt ggaagtggtt ttagtgagaa     1560

tgtgccgtgc cttcaaccca ttaaacaaca ctgttctgtt tgaaggaaaa tatggaggaa     1620

tgcaaatgtt caaagcctta ggttctgatg acctagtgaa tgaagcattt gactttgcaa     1680

agaatttgtg ttccttgcag ctgaccgagg aggagatcgc tttgttctca tctgctgttc     1740

tgatatctcc agaccgagcc tggcttatag aaccaaggaa agtccagaag cttcaggaaa     1800

aaatttattt tgcacttcaa catgtgattc agaagaatca cctggatgat gagaccttgg     1860

caaagttaat agccaagata ccaaccatca cggcagtttg caacttgcac ggggagaagc     1920

tgcaggtatt taagcaatct catccagaga tagtgaatac actgtttcct ccgttataca     1980

aggagctctt taatcctgac tgtgccaccg gctgcaaatg aaggggacaa gagaactgtc     2040

tcatagtcat ggaatgcatc accattaaga caaaagcaat gtgttcatga agacttaaga     2100

aaaatgtcac tactgcaaca ttaggaatgt cctgcactta atagaattat ttttcaccgc     2160

tacagtttga agaatgtaaa tatgcacctg agtggggctc ttttatttgt ttgtttgttt     2220

ttgaaatgac cataaatata caaatatagg acactgggtg ttatcctttt tttaatttta     2280

ttcgggtatg ttttgggaga caactgttta tagaatttta ttgtagatat atacaagaaa     2340

agagcggtac tttacatgat tacttttcct gttgattgtt caaatataat ttaagaaaat     2400

tccacttaat aggcttacct atttctatgt ttttaggtag ttgatgcatg tgtaaatttg     2460

tagctgtctt ggaaagtact gtgcatgtat gtaataagta tataatatgt gagaatatta     2520

tatatgacta ttacttatac atgcacatgc actgtggctt aaataccata cctactagca     2580

atggaggttc agtcaggctc tcttctatga tttaccttct gtgttatatg ttacctttat     2640

gttagacaat caggattttg ttttcccagc cagagttttc atctatagtc aatggcagga     2700

cggtaccaac tcagagttaa gtctacaaag gaataaacat aatgtgtggc ctctatatac     2760

aaactctatt tctgtcaatg acatcaaagc cttgtcaaga tggttcatat tgggaaggag     2820

acagtatttt aagccatttt cctgtttcaa gaattaggcc acagataaca ttgcaaggtc     2880

caagactttt ttgaccaaac agtagatatt ttctattttt caccagaaca cataaaaaca     2940

ctttttttct tttggatttc tggttgtgaa acaagcttga tttcagtgct tattgtgtct     3000

tcaactgaaa aatacaatct gtggattatg actaccagca atttttttct aggaaagtta     3060

aaagaataaa tcagaaccca gggcaacaat gccatttcat gtaaacattt tctctctcac     3120

catgttttgg caagaaaagg tagaaagaga agacccagag tgaagaagta attctttata     3180

ttcctttctt taatgtattt gttaggaaaa gtggcaataa agggggaggc atattataaa     3240

atgctataat ataaaaatgt agcaaaaact tgacagacta gaaaaaaaaa gatctgtgtt     3300

attctaggga actaatgtac cccaaagcca aaactaattc ctgtgaagtt tacagttaca     3360

tcatccattt accctagaat tattttttta gcaactttta gaaataaaga atacaactgt     3420

gacattagga tcagagattt tagacttcct tgtacaaatt ctcacttctc cacctgctca     3480

ccaatgaaat taatcataag aaaagcatat attccaagaa atttgttctg cctgtgtcct     3540

ggaggcctat acctctgtta ttttctgata caaaataaaa cttaaaaaaa agaaaacaag     3600

ctaa                                                                  3604


<210> 8
<211> 459
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..459
<223> /mol_type="protein"
      /note="RORB"
      /organism="Homo sapiens"

<400> 8
Met Arg Ala Gln Ile Glu Val Ile Pro Cys Lys Ile Cys Gly Asp Lys 
1               5                   10                   15    
Ser Ser Gly Ile His Tyr Gly Val Ile Thr Cys Glu Gly Cys Lys Gly 
            20                   25                  30        
Phe Phe Arg Arg Ser Gln Gln Asn Asn Ala Ser Tyr Ser Cys Pro Arg 
        35                   40                  45            
Gln Arg Asn Cys Leu Ile Asp Arg Thr Asn Arg Asn Arg Cys Gln His 
    50                   55                  60                
Cys Arg Leu Gln Lys Cys Leu Ala Leu Gly Met Ser Arg Asp Ala Val 
65                   70                  75                  80
Lys Phe Gly Arg Met Ser Lys Lys Gln Arg Asp Ser Leu Tyr Ala Glu 
                85                   90                  95    
Val Gln Lys His Gln Gln Arg Leu Gln Glu Gln Arg Gln Gln Gln Ser 
            100                  105                110        
Gly Glu Ala Glu Ala Leu Ala Arg Val Tyr Ser Ser Ser Ile Ser Asn 
        115                  120                125            
Gly Leu Ser Asn Leu Asn Asn Glu Thr Ser Gly Thr Tyr Ala Asn Gly 
    130                  135                140                
His Val Ile Asp Leu Pro Lys Ser Glu Gly Tyr Tyr Asn Val Asp Ser 
145                  150                155                  160
Gly Gln Pro Ser Pro Asp Gln Ser Gly Leu Asp Met Thr Gly Ile Lys 
                165                  170                175    
Gln Ile Lys Gln Glu Pro Ile Tyr Asp Leu Thr Ser Val Pro Asn Leu 
            180                  185                190        
Phe Thr Tyr Ser Ser Phe Asn Asn Gly Gln Leu Ala Pro Gly Ile Thr 
        195                  200                205            
Met Thr Glu Ile Asp Arg Ile Ala Gln Asn Ile Ile Lys Ser His Leu 
    210                  215                220                
Glu Thr Cys Gln Tyr Thr Met Glu Glu Leu His Gln Leu Ala Trp Gln 
225                  230                235                  240
Thr His Thr Tyr Glu Glu Ile Lys Ala Tyr Gln Ser Lys Ser Arg Glu 
                245                  250                255    
Ala Leu Trp Gln Gln Cys Ala Ile Gln Ile Thr His Ala Ile Gln Tyr 
            260                  265                270        
Val Val Glu Phe Ala Lys Arg Ile Thr Gly Phe Met Glu Leu Cys Gln 
        275                  280                285            
Asn Asp Gln Ile Leu Leu Leu Lys Ser Gly Cys Leu Glu Val Val Leu 
    290                  295                300                
Val Arg Met Cys Arg Ala Phe Asn Pro Leu Asn Asn Thr Val Leu Phe 
305                  310                315                  320
Glu Gly Lys Tyr Gly Gly Met Gln Met Phe Lys Ala Leu Gly Ser Asp 
                325                  330                335    
Asp Leu Val Asn Glu Ala Phe Asp Phe Ala Lys Asn Leu Cys Ser Leu 
            340                  345                350        
Gln Leu Thr Glu Glu Glu Ile Ala Leu Phe Ser Ser Ala Val Leu Ile 
        355                  360                365            
Ser Pro Asp Arg Ala Trp Leu Ile Glu Pro Arg Lys Val Gln Lys Leu 
    370                  375                380                
Gln Glu Lys Ile Tyr Phe Ala Leu Gln His Val Ile Gln Lys Asn His 
385                  390                395                  400
Leu Asp Asp Glu Thr Leu Ala Lys Leu Ile Ala Lys Ile Pro Thr Ile 
                405                  410                415    
Thr Ala Val Cys Asn Leu His Gly Glu Lys Leu Gln Val Phe Lys Gln 
            420                  425                430        
Ser His Pro Glu Ile Val Asn Thr Leu Phe Pro Pro Leu Tyr Lys Glu 
        435                  440                445            
Leu Phe Asn Pro Asp Cys Ala Thr Gly Cys Lys 
    450                  455                

<210> 9
<211> 3412
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..3412
<223> /mol_type="DNA"
      /note="RRM2"
      /organism="Homo sapiens"

<400> 9
aggcgcagcc aatgggaagg gtcggaggca tggcacagcc aatgggaagg gccggggcac       60

caaagccaat gggaagggcc gggagcgcgc ggcgcgggag atttaaaggc tgctggagtg      120

aggggtcgcc cgtgcaccct gtcccagccg tcctgtcctg gctgctcgct ctgcttcgct      180

gcgcctccac tatgctctcc ctccgtgtcc cgctcgcgcc catcacggac ccgcagcagc      240

tgcagctctc gccgctgaag gggctcagct tggtcgacaa ggagaacacg ccgccggccc      300

tgagcgggac ccgcgtcctg gccagcaaga ccgcgaggag gatcttccag gagcccacgg      360

agccgaaaac taaagcagct gcccccggcg tggaggatga gccgctgctg agagaaaacc      420

cccgccgctt tgtcatcttc cccatcgagt accatgatat ctggcagatg tataagaagg      480

cagaggcttc cttttggacc gccgaggagg tggacctctc caaggacatt cagcactggg      540

aatccctgaa acccgaggag agatatttta tatcccatgt tctggctttc tttgcagcaa      600

gcgatggcat agtaaatgaa aacttggtgg agcgatttag ccaagaagtt cagattacag      660

aagcccgctg tttctatggc ttccaaattg ccatggaaaa catacattct gaaatgtata      720

gtcttcttat tgacacttac ataaaagatc ccaaagaaag ggaatttctc ttcaatgcca      780

ttgaaacgat gccttgtgtc aagaagaagg cagactgggc cttgcgctgg attggggaca      840

aagaggctac ctatggtgaa cgtgttgtag cctttgctgc agtggaaggc attttctttt      900

ccggttcttt tgcgtcgata ttctggctca agaaacgagg actgatgcct ggcctcacat      960

tttctaatga acttattagc agagatgagg gtttacactg tgattttgct tgcctgatgt     1020

tcaaacacct ggtacacaaa ccatcggagg agagagtaag agaaataatt atcaatgctg     1080

ttcggataga acaggagttc ctcactgagg ccttgcctgt gaagctcatt gggatgaatt     1140

gcactctaat gaagcaatac attgagtttg tggcagacag acttatgctg gaactgggtt     1200

ttagcaaggt tttcagagta gagaacccat ttgactttat ggagaatatt tcactggaag     1260

gaaagactaa cttctttgag aagagagtag gcgagtatca gaggatggga gtgatgtcaa     1320

gtccaacaga gaattctttt accttggatg ctgacttcta aatgaactga agatgtgccc     1380

ttacttggct gatttttttt ttccatctca taagaaaaat cagctgaagt gttaccaact     1440

agccacacca tgaattgtcc gtaatgttca ttaacagcat ctttaaaact gtgtagctac     1500

ctcacaacca gtcctgtctg tttatagtgc tggtagtatc accttttgcc agaaggcctg     1560

gctggctgtg acttaccata gcagtgacaa tggcagtctt ggctttaaag tgaggggtga     1620

ccctttagtg agcttagcac agcgggatta aacagtcctt taaccagcac agccagttaa     1680

aagatgcagc ctcactgctt caacgcagat tttaatgttt acttaaatat aaacctggca     1740

ctttacaaac aaataaacat tgtttgtact cacaaggcga taatagcttg atttatttgg     1800

tttctacacc aaatacattc tcctgaccac taatgggagc caattcacaa ttcactaagt     1860

gactaaagta agttaaactt gtgtagacta agcatgtaat ttttaagttt tattttaatg     1920

aattaaaata tttgttaacc aactttaaag tcagtcctgt gtatacctag atattagtca     1980

gttggtgcca gatagaagac aggttgtgtt tttatcctgt ggcttgtgta gtgtcctggg     2040

attctctgcc ccctctgagt agagtgttgt gggataaagg aatctctcag ggcaaggagc     2100

ttcttaagtt aaatcactag aaatttaggg gtgatctggg ccttcatatg tgtgagaagc     2160

cgtttcattt tatttctcac tgtattttcc tcaacgtctg gttgatgaga aaaaattctt     2220

gaagagtttt catatgtggg agctaaggta gtattgtaaa atttcaagtc atccttaaac     2280

aaaatgatcc acctaagatc ttgcccctgt taagtggtga aatcaactag aggtggttcc     2340

tacaagttgt tcattctagt tttgtttggt gtaagtaggt tgtgtgagtt aattcattta     2400

tatttactat gtctgttaaa tcagaaattt tttattatct atgttcttct agattttacc     2460

tgtagttcat acttcagtca cccagtgtct tattctggca ttgtctaaat ctgagcattg     2520

tctaggggga tcttaaactt tagtaggaaa ccatgagctg ttaatacagt ttccattcaa     2580

atattaattt cagaatgaaa cataattttt tttttttttt ttgagatgga gtctcgctct     2640

gttgcccagg ctggagtgca gtggcgcgat tttggctcac tgtaacctcc atctcctggg     2700

ttcaagcaat tctcctgtct cagcctccct agtagctggg actgcaggta tgtgctacca     2760

cacctggcta atttttgtat ttttagtaga gatggagttt caccatattg gtcaggctgg     2820

tcttgaactc ctgacctcag gtgatccacc cacctcggcc tcccaaagtg ctgggattgc     2880

aggcgtgata aacaaatatt cttaataggg ctactttgaa ttaatctgcc tttatgtttg     2940

ggagaagaaa gctgagacat tgcatgaaag atgatgagag ataaatgttg atcttttggc     3000

cccatttgtt aattgtattc agtatttgaa cgtcgtcctg tttattgtta gttttcttca     3060

tcatttattg tatagacaat ttttaaatct ctgtaatatg atacattttc ctatctttta     3120

agttattgtt acctaaagtt aatccagatt atatggtcct tatatgtgta caacattaaa     3180

atgaaaggct ttgtcttgca ttgtgaggta caggcggaag ttggaatcag gttttaggat     3240

tctgtctctc attagctgaa taatgtgagg attaacttct gccagctcag accatttcct     3300

aatcagttga aagggaaaca agtatttcag tctcaaaatt gaataatgca caagtcttaa     3360

gtgattaaaa taaaactgtt cttatgtcag tttcaaaaaa aaaaaaaaaa aa             3412


<210> 10
<211> 389
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..389
<223> /mol_type="protein"
      /note="RRM2"
      /organism="Homo sapiens"

<400> 10
Met Leu Ser Leu Arg Val Pro Leu Ala Pro Ile Thr Asp Pro Gln Gln 
1               5                   10                   15    
Leu Gln Leu Ser Pro Leu Lys Gly Leu Ser Leu Val Asp Lys Glu Asn 
            20                   25                  30        
Thr Pro Pro Ala Leu Ser Gly Thr Arg Val Leu Ala Ser Lys Thr Ala 
        35                   40                  45            
Arg Arg Ile Phe Gln Glu Pro Thr Glu Pro Lys Thr Lys Ala Ala Ala 
    50                   55                  60                
Pro Gly Val Glu Asp Glu Pro Leu Leu Arg Glu Asn Pro Arg Arg Phe 
65                   70                  75                  80
Val Ile Phe Pro Ile Glu Tyr His Asp Ile Trp Gln Met Tyr Lys Lys 
                85                   90                  95    
Ala Glu Ala Ser Phe Trp Thr Ala Glu Glu Val Asp Leu Ser Lys Asp 
            100                  105                110        
Ile Gln His Trp Glu Ser Leu Lys Pro Glu Glu Arg Tyr Phe Ile Ser 
        115                  120                125            
His Val Leu Ala Phe Phe Ala Ala Ser Asp Gly Ile Val Asn Glu Asn 
    130                  135                140                
Leu Val Glu Arg Phe Ser Gln Glu Val Gln Ile Thr Glu Ala Arg Cys 
145                  150                155                  160
Phe Tyr Gly Phe Gln Ile Ala Met Glu Asn Ile His Ser Glu Met Tyr 
                165                  170                175    
Ser Leu Leu Ile Asp Thr Tyr Ile Lys Asp Pro Lys Glu Arg Glu Phe 
            180                  185                190        
Leu Phe Asn Ala Ile Glu Thr Met Pro Cys Val Lys Lys Lys Ala Asp 
        195                  200                205            
Trp Ala Leu Arg Trp Ile Gly Asp Lys Glu Ala Thr Tyr Gly Glu Arg 
    210                  215                220                
Val Val Ala Phe Ala Ala Val Glu Gly Ile Phe Phe Ser Gly Ser Phe 
225                  230                235                  240
Ala Ser Ile Phe Trp Leu Lys Lys Arg Gly Leu Met Pro Gly Leu Thr 
                245                  250                255    
Phe Ser Asn Glu Leu Ile Ser Arg Asp Glu Gly Leu His Cys Asp Phe 
            260                  265                270        
Ala Cys Leu Met Phe Lys His Leu Val His Lys Pro Ser Glu Glu Arg 
        275                  280                285            
Val Arg Glu Ile Ile Ile Asn Ala Val Arg Ile Glu Gln Glu Phe Leu 
    290                  295                300                
Thr Glu Ala Leu Pro Val Lys Leu Ile Gly Met Asn Cys Thr Leu Met 
305                  310                315                  320
Lys Gln Tyr Ile Glu Phe Val Ala Asp Arg Leu Met Leu Glu Leu Gly 
                325                  330                335    
Phe Ser Lys Val Phe Arg Val Glu Asn Pro Phe Asp Phe Met Glu Asn 
            340                  345                350        
Ile Ser Leu Glu Gly Lys Thr Asn Phe Phe Glu Lys Arg Val Gly Glu 
        355                  360                365            
Tyr Gln Arg Met Gly Val Met Ser Ser Pro Thr Glu Asn Ser Phe Thr 
    370                  375                380                
Leu Asp Ala Asp Phe 
385                

<210> 11
<211> 3027
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..3027
<223> /mol_type="DNA"
      /note="TGM4"
      /organism="Homo sapiens"

<400> 11
ggaccgactg tgtggaagca ccaggcatca gagatagagt cttccctggc attgcaggag      60

agaatctgaa gggatgatgg atgcatcaaa agagctgcaa gttctccaca ttgacttctt     120

gaatcaggac aacgccgttt ctcaccacac atgggagttc caaacgagca gtcctgtgtt     180

ccggcgagga caggtgtttc acctgcggct ggtgctgaac cagcccctac aatcctacca     240

ccaactgaaa ctggaattca gcacagggcc gaatcctagc atcgccaaac acaccctggt     300

ggtgctcgac ccgaggacgc cctcagacca ctacaactgg caggcaaccc ttcaaaatga     360

gtctggcaaa gaggtcacag tggctgtcac cagttccccc aatgccatcc tgggcaagta     420

ccaactaaac gtgaaaactg gaaaccacat ccttaagtct gaagaaaaca tcctatacct     480

tctcttcaac ccatggtgta aagaggacat ggttttcatg cctgatgagg acgagcgcaa     540

agagtacatc ctcaatgaca cgggctgcca ttacgtgggg gctgccagaa gtatcaaatg     600

caaaccctgg aactttggtc agtttgagaa aaatgtcctg gactgctgca tttccctgct     660

gactgagagc tccctcaagc ccacagatag gagggacccc gtgctggtgt gcagggccat     720

gtgtgctatg atgagctttg agaaaggcca gggcgtgctc attgggaatt ggactgggga     780

ctacgaaggt ggcacagccc catacaagtg gacaggcagt gccccgatcc tgcagcagta     840

ctacaacacg aagcaggctg tgtgctttgg ccagtgctgg gtgtttgctg ggatcctgac     900

tacagtgctg agagcgttgg gcatcccagc acgcagtgtg acaggcttcg attcagctca     960

cgacacagaa aggaacctca cggtggacac ctatgtgaat gagaatggcg agaaaatcac    1020

cagtatgacc cacgactctg tctggaattt ccatgtgtgg acggatgcct ggatgaagcg    1080

accggatctg cccaagggct acgacggctg gcaggctgtg gacgcaacgc cgcaggagcg    1140

aagccagggt gtcttctgct gtgggccatc accactgacc gccatccgca aaggtgacat    1200

ctttattgtc tatgacacca gattcgtctt ctcagaagtg aatggtgaca ggctcatctg    1260

gttggtgaag atggtgaatg ggcaggagga gttacacgta atttcaatgg agaccacaag    1320

catcgggaaa aacatcagca ccaaggcagt gggccaagac aggcggagag atatcaccta    1380

tgagtacaag tatccagaag gctcctctga ggagaggcag gtcatggatc atgccttcct    1440

ccttctcagt tctgagaggg agcacagacg acctgtaaaa gagaactttc ttcacatgtc    1500

ggtacaatca gatgatgtgc tgctgggaaa ctctgttaat ttcaccgtga ttcttaaaag    1560

gaagaccgct gccctacaga atgtcaacat cttgggctcc tttgaactac agttgtacac    1620

tggcaagaag atggcaaaac tgtgtgacct caataagacc tcgcagatcc aaggtcaagt    1680

atcagaagtg actctgacct tggactccaa gacctacatc aacagcctgg ctatattaga    1740

tgatgagcca gttatcagag gtttcatcat tgcggaaatt gtggagtcta aggaaatcat    1800

ggcctctgaa gtattcacgt ctttccagta ccctgagttc tctatagagt tgcctaacac    1860

aggcagaatt ggccagctac ttgtctgcaa ttgtatcttc aagaataccc tggccatccc    1920

tttgactgac gtcaagttct ctttggaaag cctgggcatc tcctcactac agacctctga    1980

ccatgggacg gtgcagcctg gtgagaccat ccaatcccaa ataaaatgca ccccaataaa    2040

aactggaccc aagaaattta tcgtcaagtt aagttccaaa caagtgaaag agattaatgc    2100

tcagaagatt gttctcatca ccaagtagcc ttgtctgatg ctgtggagcc ttagttgaga    2160

tttcagcatt tcctaccttg tgcttagctt tcagattatg gatgattaaa tttgatgact    2220

tatatgaggg cagattcaag agccagcagg tcaaaaaggc caacacaacc ataagcagcc    2280

agacccacaa ggccaggtcc tgtgctatca cagggtcacc tcttttacag ttagaaacac    2340

cagccgaggc cacagaatcc catccctttc ctgagtcatg gcctcaaaaa tcagggccac    2400

cattgtctca attcaaatcc atagatttcg aagccacaga gtctctccct ggagcagcag    2460

actatgggca gcccagtgct gccacctgct gacgaccctt gagaagctgc catatcttca    2520

ggccatgggt tcaccagccc tgaaggcacc tgtcaactgg agtgctctct cagcactggg    2580

atgggcctga tagaagtgca ttctcctcct attgcctcca ttctcctctc tctatccctg    2640

aaatccagga agtccctctc ctggtgctcc aagcagtttg aagcccaatc tgcaaggaca    2700

tttctcaagg gccatgtggt tttgcagaca accctgtcct caggcctgaa ctcaccatag    2760

agacccatgt cagcaaacgg tgaccagcaa atcctcttcc cttattctaa agctgcccct    2820

tgggagactc cagggagaag gcattgcttc ctccctggtg tgaactcttt ctttggtatt    2880

ccatccacta tcctggcaac tcaaggctgc ttctgttaac tgaagcctgc tccttcttgt    2940

tctgccctcc agagatttgc tcaaatgatc aataagcttt aaattaaact ctacttcaaa    3000

aaaaaaaaaa aaaaaaaaaa aaaaaaa                                        3027


<210> 12
<211> 684
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..684
<223> /mol_type="protein"
      /note="TGM4"
      /organism="Homo sapiens"

<400> 12
Met Met Asp Ala Ser Lys Glu Leu Gln Val Leu His Ile Asp Phe Leu 
1               5                   10                   15    
Asn Gln Asp Asn Ala Val Ser His His Thr Trp Glu Phe Gln Thr Ser 
            20                   25                  30        
Ser Pro Val Phe Arg Arg Gly Gln Val Phe His Leu Arg Leu Val Leu 
        35                   40                  45            
Asn Gln Pro Leu Gln Ser Tyr His Gln Leu Lys Leu Glu Phe Ser Thr 
    50                   55                  60                
Gly Pro Asn Pro Ser Ile Ala Lys His Thr Leu Val Val Leu Asp Pro 
65                   70                  75                  80
Arg Thr Pro Ser Asp His Tyr Asn Trp Gln Ala Thr Leu Gln Asn Glu 
                85                   90                  95    
Ser Gly Lys Glu Val Thr Val Ala Val Thr Ser Ser Pro Asn Ala Ile 
            100                  105                110        
Leu Gly Lys Tyr Gln Leu Asn Val Lys Thr Gly Asn His Ile Leu Lys 
        115                  120                125            
Ser Glu Glu Asn Ile Leu Tyr Leu Leu Phe Asn Pro Trp Cys Lys Glu 
    130                  135                140                
Asp Met Val Phe Met Pro Asp Glu Asp Glu Arg Lys Glu Tyr Ile Leu 
145                  150                155                  160
Asn Asp Thr Gly Cys His Tyr Val Gly Ala Ala Arg Ser Ile Lys Cys 
                165                  170                175    
Lys Pro Trp Asn Phe Gly Gln Phe Glu Lys Asn Val Leu Asp Cys Cys 
            180                  185                190        
Ile Ser Leu Leu Thr Glu Ser Ser Leu Lys Pro Thr Asp Arg Arg Asp 
        195                  200                205            
Pro Val Leu Val Cys Arg Ala Met Cys Ala Met Met Ser Phe Glu Lys 
    210                  215                220                
Gly Gln Gly Val Leu Ile Gly Asn Trp Thr Gly Asp Tyr Glu Gly Gly 
225                  230                235                  240
Thr Ala Pro Tyr Lys Trp Thr Gly Ser Ala Pro Ile Leu Gln Gln Tyr 
                245                  250                255    
Tyr Asn Thr Lys Gln Ala Val Cys Phe Gly Gln Cys Trp Val Phe Ala 
            260                  265                270        
Gly Ile Leu Thr Thr Val Leu Arg Ala Leu Gly Ile Pro Ala Arg Ser 
        275                  280                285            
Val Thr Gly Phe Asp Ser Ala His Asp Thr Glu Arg Asn Leu Thr Val 
    290                  295                300                
Asp Thr Tyr Val Asn Glu Asn Gly Glu Lys Ile Thr Ser Met Thr His 
305                  310                315                  320
Asp Ser Val Trp Asn Phe His Val Trp Thr Asp Ala Trp Met Lys Arg 
                325                  330                335    
Pro Asp Leu Pro Lys Gly Tyr Asp Gly Trp Gln Ala Val Asp Ala Thr 
            340                  345                350        
Pro Gln Glu Arg Ser Gln Gly Val Phe Cys Cys Gly Pro Ser Pro Leu 
        355                  360                365            
Thr Ala Ile Arg Lys Gly Asp Ile Phe Ile Val Tyr Asp Thr Arg Phe 
    370                  375                380                
Val Phe Ser Glu Val Asn Gly Asp Arg Leu Ile Trp Leu Val Lys Met 
385                  390                395                  400
Val Asn Gly Gln Glu Glu Leu His Val Ile Ser Met Glu Thr Thr Ser 
                405                  410                415    
Ile Gly Lys Asn Ile Ser Thr Lys Ala Val Gly Gln Asp Arg Arg Arg 
            420                  425                430        
Asp Ile Thr Tyr Glu Tyr Lys Tyr Pro Glu Gly Ser Ser Glu Glu Arg 
        435                  440                445            
Gln Val Met Asp His Ala Phe Leu Leu Leu Ser Ser Glu Arg Glu His 
    450                  455                460                
Arg Arg Pro Val Lys Glu Asn Phe Leu His Met Ser Val Gln Ser Asp 
465                  470                475                  480
Asp Val Leu Leu Gly Asn Ser Val Asn Phe Thr Val Ile Leu Lys Arg 
                485                  490                495    
Lys Thr Ala Ala Leu Gln Asn Val Asn Ile Leu Gly Ser Phe Glu Leu 
            500                  505                510        
Gln Leu Tyr Thr Gly Lys Lys Met Ala Lys Leu Cys Asp Leu Asn Lys 
        515                  520                525            
Thr Ser Gln Ile Gln Gly Gln Val Ser Glu Val Thr Leu Thr Leu Asp 
    530                  535                540                
Ser Lys Thr Tyr Ile Asn Ser Leu Ala Ile Leu Asp Asp Glu Pro Val 
545                  550                555                  560
Ile Arg Gly Phe Ile Ile Ala Glu Ile Val Glu Ser Lys Glu Ile Met 
                565                  570                575    
Ala Ser Glu Val Phe Thr Ser Phe Gln Tyr Pro Glu Phe Ser Ile Glu 
            580                  585                590        
Leu Pro Asn Thr Gly Arg Ile Gly Gln Leu Leu Val Cys Asn Cys Ile 
        595                  600                605            
Phe Lys Asn Thr Leu Ala Ile Pro Leu Thr Asp Val Lys Phe Ser Leu 
    610                  615                620                
Glu Ser Leu Gly Ile Ser Ser Leu Gln Thr Ser Asp His Gly Thr Val 
625                  630                635                  640
Gln Pro Gly Glu Thr Ile Gln Ser Gln Ile Lys Cys Thr Pro Ile Lys 
                645                  650                655    
Thr Gly Pro Lys Lys Phe Ile Val Lys Leu Ser Ser Lys Gln Val Lys 
            660                  665                670        
Glu Ile Asn Ala Gln Lys Ile Val Leu Ile Thr Lys 
        675                  680                

<210> 13
<211> 2101
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..2101
<223> /mol_type="DNA"
      /note="SNAI2"
      /organism="Homo sapiens"

<400> 13
agttcgtaaa ggagccgggt gacttcagag gcgccggccc gtccgtctgc cgcacctgag      60

cacggcccct gcccgagcct ggcccgccgc gatgctgtag ggaccgccgt gtcctcccgc     120

cggaccgtta tccgcgccgg gcgcccgcca gacccgctgg caagatgccg cgctccttcc     180

tggtcaagaa gcatttcaac gcctccaaaa agccaaacta cagcgaactg gacacacata     240

cagtgattat ttccccgtat ctctatgaga gttactccat gcctgtcata ccacaaccag     300

agatcctcag ctcaggagca tacagcccca tcactgtgtg gactaccgct gctccattcc     360

acgcccagct acccaatggc ctctctcctc tttccggata ctcctcatct ttggggcgag     420

tgagtccccc tcctccatct gacacctcct ccaaggacca cagtggctca gaaagcccca     480

ttagtgatga agaggaaaga ctacagtcca agctttcaga cccccatgcc attgaagctg     540

aaaagtttca gtgcaattta tgcaataaga cctattcaac tttttctggg ctggccaaac     600

ataagcagct gcactgcgat gcccagtcta gaaaatcttt cagctgtaaa tactgtgaca     660

aggaatatgt gagcctgggc gccctgaaga tgcatattcg gacccacaca ttaccttgtg     720

tttgcaagat ctgcggcaag gcgttttcca gaccctggtt gcttcaagga cacattagaa     780

ctcacacggg ggagaagcct ttttcttgcc ctcactgcaa cagagcattt gcagacaggt     840

caaatctgag ggctcatctg cagacccatt ctgatgtaaa gaaataccag tgcaaaaact     900

gctccaaaac cttctccaga atgtctctcc tgcacaaaca tgaggaatct ggctgctgtg     960

tagcacactg agtgacgcaa tcaatgttta ctcgaacaga atgcatttct tcactccgaa    1020

gccaaatgac aaataaagtc caaaggcatt ttctcctgtg ctgaccaacc aaataatatg    1080

tatagacaca cacacatatg cacacacaca cacacacacc cacagagaga gagctgcaag    1140

agcatggaat tcatgtgttt aaagataatc ctttccatgt gaagtttaaa attactatat    1200

atttgctgat ggctagattg agagaataaa agacagtaac ctttctcttc aaagataaaa    1260

tgaaaagcac attgcatctt ttcttcctaa aaaaatgcaa agatttacat tgctgccaaa    1320

tcatttcaac tgaaaagaac agtattgctt tgtaatagag tctgtaatag gatttcccat    1380

aggaagagat ctgccagacg cgaactcagg tgccttaaaa agtattccaa gtttactcca    1440

ttacatgtcg gttgtctggt tgccattgtt gaactaaagc ctttttttga ttacctgtag    1500

tgctttaaag tatattttta aaagggagga aaaaaataac aagaacaaaa cacaggagaa    1560

tgtattaaaa gtatttttgt tttgttttgt ttttgccaat taacagtatg tgccttgggg    1620

gaggagggaa agattagctt tgaacattcc tggcgcatgc tccattgtct tactatttta    1680

aaacatttta ataatttttg aaaattaatt aaagatggga ataagtgcaa aagaggattc    1740

ttacaaattc attaatgtac ttaaactatt tcaaatgcat accacaaatg caataataca    1800

ataccccttc caagtgcctt tttaaattgt atagttgatg agtcaatgta aatttgtgtt    1860

tatttttata tgattgaatg agttctgtat gaaactgaga tgttgtctat agctatgtct    1920

ataaacaacc tgaagacttg tgaaatcaat gtttcttttt taaaaaacaa ttttcaagtt    1980

ttttttacaa taaacagttt tgatttaaaa tctcgtttgt atactatttt cagagacttt    2040

acttgcttca tgattagtac caaaccactg tacaaagaat tgtttgttaa caagaaaaaa    2100

a                                                                    2101


<210> 14
<211> 268
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..268
<223> /mol_type="protein"
      /note="SNAI2"
      /organism="Homo sapiens"

<400> 14
Met Pro Arg Ser Phe Leu Val Lys Lys His Phe Asn Ala Ser Lys Lys 
1               5                   10                   15    
Pro Asn Tyr Ser Glu Leu Asp Thr His Thr Val Ile Ile Ser Pro Tyr 
            20                   25                  30        
Leu Tyr Glu Ser Tyr Ser Met Pro Val Ile Pro Gln Pro Glu Ile Leu 
        35                   40                  45            
Ser Ser Gly Ala Tyr Ser Pro Ile Thr Val Trp Thr Thr Ala Ala Pro 
    50                   55                  60                
Phe His Ala Gln Leu Pro Asn Gly Leu Ser Pro Leu Ser Gly Tyr Ser 
65                   70                  75                  80
Ser Ser Leu Gly Arg Val Ser Pro Pro Pro Pro Ser Asp Thr Ser Ser 
                85                   90                  95    
Lys Asp His Ser Gly Ser Glu Ser Pro Ile Ser Asp Glu Glu Glu Arg 
            100                  105                110        
Leu Gln Ser Lys Leu Ser Asp Pro His Ala Ile Glu Ala Glu Lys Phe 
        115                  120                125            
Gln Cys Asn Leu Cys Asn Lys Thr Tyr Ser Thr Phe Ser Gly Leu Ala 
    130                  135                140                
Lys His Lys Gln Leu His Cys Asp Ala Gln Ser Arg Lys Ser Phe Ser 
145                  150                155                  160
Cys Lys Tyr Cys Asp Lys Glu Tyr Val Ser Leu Gly Ala Leu Lys Met 
                165                  170                175    
His Ile Arg Thr His Thr Leu Pro Cys Val Cys Lys Ile Cys Gly Lys 
            180                  185                190        
Ala Phe Ser Arg Pro Trp Leu Leu Gln Gly His Ile Arg Thr His Thr 
        195                  200                205            
Gly Glu Lys Pro Phe Ser Cys Pro His Cys Asn Arg Ala Phe Ala Asp 
    210                  215                220                
Arg Ser Asn Leu Arg Ala His Leu Gln Thr His Ser Asp Val Lys Lys 
225                  230                235                  240
Tyr Gln Cys Lys Asn Cys Ser Lys Thr Phe Ser Arg Met Ser Leu Leu 
                245                  250                255    
His Lys His Glu Glu Ser Gly Cys Cys Val Ala His 
            260                  265            

<210> 15
<211> 2300
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..2300
<223> /mol_type="DNA"
      /note="HOXC4"
      /organism="Homo sapiens"

<400> 15
ttattgtggt ttgtccgttc cgagcgctcc gcagaacagt cctccctgta agagcctaac      60

cattgccagg gaaacctgcc ctgggcgctc ccttcattag cagtattttt tttaaattaa     120

tctgattaat aattattttt cccccattta attttttttc ctcccaggtg gagttgccga     180

agctgggggc agctggggag ggtggggatg ggaggggaga gacagaagtt gagggcatct     240

ctctcttcct tcccgaccct ctggccccca aggggcagga ggaatgcagg agcaggagtt     300

gagcttggga gctgcagatg cctccgcccc tcctctctcc caggctcttc ctcctgcccc     360

cttcttgcaa ctctccttaa ttttgtttgg cttttggatg attataatta tttttatttt     420

tgaatttata taaagtatat gtgtgtgtgt gtggagctga gacaggctcg gcagcggcac     480

agaatgaggg aagacgagaa agagagtggg agagagagag gcagagaggg agagagggag     540

agtgacagca gcgctcgcgg gggctcaacc cccagacctc cagaaatgac gtcagaatca     600

tttgcatccc gctgcctcta cctgcctggt ccagctggga ccctgcctcg ccggccgcat     660

ggccagaggg ttggaaatta atgatcatga gctcgtattt gatggactct aactacatcg     720

atccgaaatt tcctccatgc gaagaatatt cgcaaaatag ctacatccct gaacacagtc     780

cggaatatta cggccggacc agggaatcgg gattccagca tcaccaccag gagctgtacc     840

caccaccgcc tccgcgccct agctaccctg agcgccagta tagctgcacc agtctccagg     900

ggcccggcaa ttcgcgaggc cacgggccgg cccaggcggg ccaccaccac cccgagaaat     960

cacagtcgct ctgcgagccg gcgcctctct caggcgcctc cgcctccccg tccccagccc    1020

cgccagcctg cagccagcca gcccccgacc atccctccag cgccgccagc aagcaaccca    1080

tagtctaccc atggatgaaa aaaattcacg ttagcacggt gaaccccaat tataacggag    1140

gggaacccaa gcgctcgagg acagcctata cccggcagca agtcctggaa ttagagaaag    1200

agtttcatta caaccgctac ctgacccgaa ggagaaggat cgagatcgcc cactcgctgt    1260

gcctctctga gaggcagatc aaaatctggt tccaaaaccg tcgcatgaaa tggaagaagg    1320

accaccgact ccccaacacc aaagtcaggt cagcaccccc ggccggcgct gcgcccagca    1380

ccctttcggc agctaccccg ggtacttctg aagaccactc ccagagcgcc acgccgccgg    1440

agcagcaacg ggcagaggac attaccaggt tataaaacat aactcacacc cctgccccca    1500

ccccatgccc ccaccctccc ctcacacaca aattgactct tatttataga atttaatata    1560

tatatatata tatatatata taggttcttt tctctcttcc tctcaccttg tcccttgtca    1620

gttccaaaca gacaaaacag ataaacaaac aagccccctg ccctcctctc cctcccactg    1680

ttaaggaccc ttttaagcat gtgatgttgt cttagcatgg tacctgctgg gtgttttttt    1740

ttaaaaggcc attttggggg gttatttatt ttttaagaaa aaaagctgca aaaattatat    1800

attgcaaggt gtgatggtct ggcttgggtg aatttcaggg gaaatgagga aaagaaaaaa    1860

ggaaagaaat tttaaagcca attctcatcc ttctcctcct cctccttccc cccctctttc    1920

cttaggcctt ttgcattgaa aatgcaccag gggaggttag tgagggggaa gtcattttaa    1980

ggagaacaaa gctatgaagt tcttttgtat tattgttggg ggggggtgtg ggaggagagg    2040

gggcgaagac agcagacaaa gctaaatgca tctggagagc ctctcagagc tgttcagttt    2100

gaggagccaa aagaaaatca aaatgaactt tcagttcaga gaggcagtct ataggtagaa    2160

tctctcccca cccctatcgt ggttattgtg tttttggact gaatttactt gattattgta    2220

aaacttgcaa taaagaattt tagtgtcgat gtgaaatgcc ccgtgatcaa taataaacca    2280

gtggatgtga attagtttta                                                2300


<210> 16
<211> 264
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..264
<223> /mol_type="protein"
      /note="HOXC4"
      /organism="Homo sapiens"

<400> 16
Met Ile Met Ser Ser Tyr Leu Met Asp Ser Asn Tyr Ile Asp Pro Lys 
1               5                   10                   15    
Phe Pro Pro Cys Glu Glu Tyr Ser Gln Asn Ser Tyr Ile Pro Glu His 
            20                   25                  30        
Ser Pro Glu Tyr Tyr Gly Arg Thr Arg Glu Ser Gly Phe Gln His His 
        35                   40                  45            
His Gln Glu Leu Tyr Pro Pro Pro Pro Pro Arg Pro Ser Tyr Pro Glu 
    50                   55                  60                
Arg Gln Tyr Ser Cys Thr Ser Leu Gln Gly Pro Gly Asn Ser Arg Gly 
65                   70                  75                  80
His Gly Pro Ala Gln Ala Gly His His His Pro Glu Lys Ser Gln Ser 
                85                   90                  95    
Leu Cys Glu Pro Ala Pro Leu Ser Gly Ala Ser Ala Ser Pro Ser Pro 
            100                  105                110        
Ala Pro Pro Ala Cys Ser Gln Pro Ala Pro Asp His Pro Ser Ser Ala 
        115                  120                125            
Ala Ser Lys Gln Pro Ile Val Tyr Pro Trp Met Lys Lys Ile His Val 
    130                  135                140                
Ser Thr Val Asn Pro Asn Tyr Asn Gly Gly Glu Pro Lys Arg Ser Arg 
145                  150                155                  160
Thr Ala Tyr Thr Arg Gln Gln Val Leu Glu Leu Glu Lys Glu Phe His 
                165                  170                175    
Tyr Asn Arg Tyr Leu Thr Arg Arg Arg Arg Ile Glu Ile Ala His Ser 
            180                  185                190        
Leu Cys Leu Ser Glu Arg Gln Ile Lys Ile Trp Phe Gln Asn Arg Arg 
        195                  200                205            
Met Lys Trp Lys Lys Asp His Arg Leu Pro Asn Thr Lys Val Arg Ser 
    210                  215                220                
Ala Pro Pro Ala Gly Ala Ala Pro Ser Thr Leu Ser Ala Ala Thr Pro 
225                  230                235                  240
Gly Thr Ser Glu Asp His Ser Gln Ser Ala Thr Pro Pro Glu Gln Gln 
                245                  250                255    
Arg Ala Glu Asp Ile Thr Arg Leu 
            260                

