               SEQUENCE LISTING

<110> NovioGendix Research B.V.
      Smit, Franciscus Petrus

<120> Molecular markers in prostate cancer

<130> 4/2MP68/8P

<150> PCT/EP2011/057716
<151> 2011-05-12

<160> 40

<170> BiSSAP 1.0

<210> 1
<211> 2051
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..2051
<223> /mol_type="DNA"
      /note="ACSM1"
      /organism="Homo sapiens"

<400> 1
agccatctct tcccaaggca ggtggtgact tgagaactct gtgcctggtt tctgaggact      60

gtttcaccat gcagtggcta atgaggttcc ggaccctctg gggcatccac aaatccttcc     120

acaacatcca ccctgcccct tcacagctgc gctgccggtc tttatcagaa tttggagccc     180

caagatggaa tgactatgaa gtaccggagg aatttaactt tgcaagttat gtactggact     240

actgggctca aaaggagaag gagggcaaga gaggtccaaa tccagctttt tggtgggtga     300

atggccaagg ggatgaagta aagtggagct tcagagagat gggagaccta acccgccgtg     360

tagccaacgt cttcacacag acctgtggcc tacaacaggg agaccatctg gccttgatgc     420

tgcctcgagt tcctgagtgg tggctggtgg ctgtgggctg catgcgaaca gggatcatct     480

tcattcctgc gaccatcctg ttgaaggcca aagacattct ctatcgacta cagttgtcta     540

aagccaaggg cattgtgacc atagatgccc ttgcctcaga ggtggactcc atagcttctc     600

agtgcccctc tctgaaaacc aagctcctgg tgtctgatca cagccgtgaa gggtggctgg     660

acttccgatc gctggttaaa tcagcatccc cagaacacac ctgtgttaag tcaaagacct     720

tggacccaat ggtcatcttc ttcaccagtg ggaccacagg cttccccaag atggcaaaac     780

actcccatgg gttggcctta caaccctcct tcccaggaag taggaaatta cggagcctga     840

agacatctga tgtctcctgg tgcctgtcgg actcaggatg gattgtggct accatttgga     900

ccctggtaga accatggaca gcgggttgta cagtctttat ccaccatctg ccacagtttg     960

acaccaaggt catcatacag acattgttga aataccccat taaccacttt tggggggtat    1020

catctatata tcgaatgatt ctgcagcagg atttcaccag catcaggttc cctgccctgg    1080

agcactgcta tactggcggg gaggtcgtgt tgcccaagga tcaggaggag tggaaaagac    1140

ggacgggcct tctgctctac gagaactatg ggcagtcgga aacgggacta atttgtgcca    1200

cctactgggg aatgaagatc aagccgggtt tcatggggaa ggccactcca ccctacgacg    1260

tccaggtcat tgatgacaag ggcagcatcc tgccacctaa cacagaagga aacattggca    1320

tcagaatcaa acctgtcagg cctgtgagcc tcttcatgtg ctatgagggt gacccagaga    1380

agacagctaa agtggaatgt ggggacttct acaacactgg ggacagaggt aagatggatg    1440

aagagggcta catttgtttc ctggggagga gtgatgacat cattaatgcc tctgggtatc    1500

gcatcgggcc tgcagaggtt gaaagcgctt tggtggagca cccagcggtg gcggagtcag    1560

ccgtggtggg cagcccagac ccgattcgag gggaggtggt gaaggccttt attgtcctga    1620

ccccacagtt cctgtcccat gacaaggatc agctgaccaa ggaactgcag cagcatgtca    1680

agtcagtgac agccccatac aagtacccaa ggaaggtgga gtttgtctca gagctgccaa    1740

aaaccatcac tggcaagatt gaacggaagg aacttcggaa aaaggagact ggtcagatgt    1800

aatcggcagt gaactcagaa cgcactgcac acctaaggca aatccctggc cactttagtc    1860

tccccactat ggtgaggacg agggtggggc attgagagtg ttgatttggg aaagtatcag    1920

gagtgccatg attccaatgt tttccttctt ttaaattaaa ttcagttgct ctgcttcctc    1980

caagtcctct gtatctttag aatttcccag gtgagcactc ataacgcaag taataaaata    2040

ctgatatcaa c                                                         2051


<210> 2
<211> 577
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..577
<223> /mol_type="protein"
      /note="ACSM1"
      /organism="Homo sapiens"

<400> 2
Met Gln Trp Leu Met Arg Phe Arg Thr Leu Trp Gly Ile His Lys Ser 
1               5                   10                   15    
Phe His Asn Ile His Pro Ala Pro Ser Gln Leu Arg Cys Arg Ser Leu 
            20                   25                  30        
Ser Glu Phe Gly Ala Pro Arg Trp Asn Asp Tyr Glu Val Pro Glu Glu 
        35                   40                  45            
Phe Asn Phe Ala Ser Tyr Val Leu Asp Tyr Trp Ala Gln Lys Glu Lys 
    50                   55                  60                
Glu Gly Lys Arg Gly Pro Asn Pro Ala Phe Trp Trp Val Asn Gly Gln 
65                   70                  75                  80
Gly Asp Glu Val Lys Trp Ser Phe Arg Glu Met Gly Asp Leu Thr Arg 
                85                   90                  95    
Arg Val Ala Asn Val Phe Thr Gln Thr Cys Gly Leu Gln Gln Gly Asp 
            100                  105                110        
His Leu Ala Leu Met Leu Pro Arg Val Pro Glu Trp Trp Leu Val Ala 
        115                  120                125            
Val Gly Cys Met Arg Thr Gly Ile Ile Phe Ile Pro Ala Thr Ile Leu 
    130                  135                140                
Leu Lys Ala Lys Asp Ile Leu Tyr Arg Leu Gln Leu Ser Lys Ala Lys 
145                  150                155                  160
Gly Ile Val Thr Ile Asp Ala Leu Ala Ser Glu Val Asp Ser Ile Ala 
                165                  170                175    
Ser Gln Cys Pro Ser Leu Lys Thr Lys Leu Leu Val Ser Asp His Ser 
            180                  185                190        
Arg Glu Gly Trp Leu Asp Phe Arg Ser Leu Val Lys Ser Ala Ser Pro 
        195                  200                205            
Glu His Thr Cys Val Lys Ser Lys Thr Leu Asp Pro Met Val Ile Phe 
    210                  215                220                
Phe Thr Ser Gly Thr Thr Gly Phe Pro Lys Met Ala Lys His Ser His 
225                  230                235                  240
Gly Leu Ala Leu Gln Pro Ser Phe Pro Gly Ser Arg Lys Leu Arg Ser 
                245                  250                255    
Leu Lys Thr Ser Asp Val Ser Trp Cys Leu Ser Asp Ser Gly Trp Ile 
            260                  265                270        
Val Ala Thr Ile Trp Thr Leu Val Glu Pro Trp Thr Ala Gly Cys Thr 
        275                  280                285            
Val Phe Ile His His Leu Pro Gln Phe Asp Thr Lys Val Ile Ile Gln 
    290                  295                300                
Thr Leu Leu Lys Tyr Pro Ile Asn His Phe Trp Gly Val Ser Ser Ile 
305                  310                315                  320
Tyr Arg Met Ile Leu Gln Gln Asp Phe Thr Ser Ile Arg Phe Pro Ala 
                325                  330                335    
Leu Glu His Cys Tyr Thr Gly Gly Glu Val Val Leu Pro Lys Asp Gln 
            340                  345                350        
Glu Glu Trp Lys Arg Arg Thr Gly Leu Leu Leu Tyr Glu Asn Tyr Gly 
        355                  360                365            
Gln Ser Glu Thr Gly Leu Ile Cys Ala Thr Tyr Trp Gly Met Lys Ile 
    370                  375                380                
Lys Pro Gly Phe Met Gly Lys Ala Thr Pro Pro Tyr Asp Val Gln Val 
385                  390                395                  400
Ile Asp Asp Lys Gly Ser Ile Leu Pro Pro Asn Thr Glu Gly Asn Ile 
                405                  410                415    
Gly Ile Arg Ile Lys Pro Val Arg Pro Val Ser Leu Phe Met Cys Tyr 
            420                  425                430        
Glu Gly Asp Pro Glu Lys Thr Ala Lys Val Glu Cys Gly Asp Phe Tyr 
        435                  440                445            
Asn Thr Gly Asp Arg Gly Lys Met Asp Glu Glu Gly Tyr Ile Cys Phe 
    450                  455                460                
Leu Gly Arg Ser Asp Asp Ile Ile Asn Ala Ser Gly Tyr Arg Ile Gly 
465                  470                475                  480
Pro Ala Glu Val Glu Ser Ala Leu Val Glu His Pro Ala Val Ala Glu 
                485                  490                495    
Ser Ala Val Val Gly Ser Pro Asp Pro Ile Arg Gly Glu Val Val Lys 
            500                  505                510        
Ala Phe Ile Val Leu Thr Pro Gln Phe Leu Ser His Asp Lys Asp Gln 
        515                  520                525            
Leu Thr Lys Glu Leu Gln Gln His Val Lys Ser Val Thr Ala Pro Tyr 
    530                  535                540                
Lys Tyr Pro Arg Lys Val Glu Phe Val Ser Glu Leu Pro Lys Thr Ile 
545                  550                555                  560
Thr Gly Lys Ile Glu Arg Lys Glu Leu Arg Lys Lys Glu Thr Gly Gln 
                565                  570                575    
Met 
    

<210> 3
<211> 2660
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..2660
<223> /mol_type="DNA"
      /note="ALDH3B2"
      /organism="Homo sapiens"

<400> 3
accccattga ttaccccatt gccaggcgtg ggcacgggag ttggtttggg agctgccagt      60

ctcctgggag gatcgcagtc agcagagcag ggctgaggcc tgggggtagg agcagagcct     120

gcgcatctgg aggcagcatg tccaagaaag ggagtggagg tgcagcgaag gacccagggg     180

cagagcccac gctgggatgg accccttcga ggacacgctg cggcggctgc gtgaggcctt     240

caactgaggg cgcacgcggc cggccgagtt ccgggctgcg cagctccagg gcctgggcca     300

cttccttcaa gaaaacaagc agcttctgcg cgacgtgctg gcccaggacc tgcataagcc     360

agctttcgag gcagacatat ctgagctcat cctttgccag aacgaggttg actacgctct     420

caagaacctg caggcctgga tgaaggatga accacggtcc acgaacctgt tcatgaagct     480

ggactcggtc ttcatctgga aggaaccctt tggcctggtc ctcatcatcg caccctggaa     540

ctacccactg aacctgaccc tggtgctcct ggtgggcgcc ctcgccgcag ggagttgcgt     600

ggtgctgaag ccgtcagaaa tcagccaggg cacagagaag gtcctggctg aggtgctgcc     660

ccagtacctg gaccagagct gctttgccgt ggtgctgggc ggaccccagg agacagggca     720

gctgctagag cacaagttgg actacatctt cttcacaggg agccctcgtg tgggcaagat     780

tgtcatgact gctgccacca agcacctgac gcctgtcacc ctggagctgg ggggcaagaa     840

cccctgctac gtggacgaca actgcgaccc ccagaccgtg gccaaccgcg tggcctggtt     900

ctgctacttc aatgccggcc agacctgcgt ggcccctgac tacgtcctgt gcagccccga     960

gatgcaggag aggctgctgc ccgccctgca gagcaccatc acccgtttct atggcgacga    1020

cccccagagc tccccaaacc tgggccgcat catcaaccag aaacagttcc agcggctgcg    1080

ggcattgctg ggctgcggcc gcgtggccat tgggggccag agcaacgaga gcgatcgcta    1140

catcgccccc acggtgctgg tggacgtgca ggagacggag cctgtgatgc aggaggagat    1200

cttcgggccc atcctgccca tcgtgaacgt gcagagcgtg gacgaggcca tcaagttcat    1260

caaccggcag gagaagcccc tggccctgta cgccttctcc aacagcagcc aggttgtgaa    1320

ccagatgctg gagcggacca gcagcggcag ctttggaggc aatgagggct tcacctacat    1380

atctctgctg tccgtgccat tcgggggagt cggccacagt gggatgggcc ggtaccacgg    1440

caagttcacc ttcgacacct tctcccacca ccgcacctgc ctgctcgccc cctccggcct    1500

ggagaaatta aaggagatcc actacccacc ctataccgac tggaaccagc agctgttacg    1560

ctggggcatg ggctcccaga gctgcaccct cctgtgagcg tcccacccgc ctccaacggg    1620

tcacacagag aaacctgagt ctagccatga ggggcttatg ctcccaactc acattgttcc    1680

tccagaccgc aggttccccc agcctcaggt tgctggagct gtcacatgac tgcatcctgc    1740

ctgccagggc tgcaaagcaa ggtcttgctt ctatctgggg gacgctgctc gagagaggcc    1800

aagaggccgc agaacatgcc aggtgtcctc actcacccca ccctccccaa ttccagccct    1860

ttgccctctc ggtcagggtt ggccaggccc agtcacaggg gcagtgtcac cctggaaaat    1920

acagtgccct gccttcttag gggcatcagc cctgaacggt tgagagcgtg gagccctcca    1980

ggcctttgct ctcccctcta ggcacacgcg cacttccatc tctgccccat cccaactgca    2040

ccagcactgc ctcccccagg gatcctctca catcccacac tggtctctgc accacccctc    2100

tggttcacac cgcaccctgc actcacccac agcagctcca tccactggga aaactggggt    2160

ttgcatcact ccactgcaca gtgttagtgg gacctggggg caagtccctt gacttctctg    2220

agcctcagtt tccttatgtg aaagttgctg gaaccaaaat ggagtcactt atgccaaact    2280

ctaataaaat ggagtcgggg ggccacatag aagccctcac acacacatgc ccgtaacagg    2340

atttatcaca agacacgcct gcatgtagac cagacacagg gcgtatggaa agcacgtcct    2400

caagactgta gtattccaga tgagctgcag atgcttacct accacggccg tctccaccag    2460

aaaaccatcg ccaactcctg cgatcagctt gtgacttaca aaccttgttt aaaagctgct    2520

tacatggact tctgtccttt aaaagcttcc ccttggctgt ggccctctgt gtatgcctgg    2580

gatccttcca agcactcata gcccagatag gaatcctctg ctcctcccaa ataaattcat    2640

ctgttctgga aaaaaaaaaa                                                2660


<210> 4
<211> 385
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..385
<223> /mol_type="protein"
      /note="ALDH3B2"
      /organism="Homo sapiens"

<400> 4
Met Lys Asp Glu Pro Arg Ser Thr Asn Leu Phe Met Lys Leu Asp Ser 
1               5                   10                   15    
Val Phe Ile Trp Lys Glu Pro Phe Gly Leu Val Leu Ile Ile Ala Pro 
            20                   25                  30        
Trp Asn Tyr Pro Leu Asn Leu Thr Leu Val Leu Leu Val Gly Ala Leu 
        35                   40                  45            
Ala Ala Gly Ser Cys Val Val Leu Lys Pro Ser Glu Ile Ser Gln Gly 
    50                   55                  60                
Thr Glu Lys Val Leu Ala Glu Val Leu Pro Gln Tyr Leu Asp Gln Ser 
65                   70                  75                  80
Cys Phe Ala Val Val Leu Gly Gly Pro Gln Glu Thr Gly Gln Leu Leu 
                85                   90                  95    
Glu His Lys Leu Asp Tyr Ile Phe Phe Thr Gly Ser Pro Arg Val Gly 
            100                  105                110        
Lys Ile Val Met Thr Ala Ala Thr Lys His Leu Thr Pro Val Thr Leu 
        115                  120                125            
Glu Leu Gly Gly Lys Asn Pro Cys Tyr Val Asp Asp Asn Cys Asp Pro 
    130                  135                140                
Gln Thr Val Ala Asn Arg Val Ala Trp Phe Cys Tyr Phe Asn Ala Gly 
145                  150                155                  160
Gln Thr Cys Val Ala Pro Asp Tyr Val Leu Cys Ser Pro Glu Met Gln 
                165                  170                175    
Glu Arg Leu Leu Pro Ala Leu Gln Ser Thr Ile Thr Arg Phe Tyr Gly 
            180                  185                190        
Asp Asp Pro Gln Ser Ser Pro Asn Leu Gly Arg Ile Ile Asn Gln Lys 
        195                  200                205            
Gln Phe Gln Arg Leu Arg Ala Leu Leu Gly Cys Gly Arg Val Ala Ile 
    210                  215                220                
Gly Gly Gln Ser Asn Glu Ser Asp Arg Tyr Ile Ala Pro Thr Val Leu 
225                  230                235                  240
Val Asp Val Gln Glu Thr Glu Pro Val Met Gln Glu Glu Ile Phe Gly 
                245                  250                255    
Pro Ile Leu Pro Ile Val Asn Val Gln Ser Val Asp Glu Ala Ile Lys 
            260                  265                270        
Phe Ile Asn Arg Gln Glu Lys Pro Leu Ala Leu Tyr Ala Phe Ser Asn 
        275                  280                285            
Ser Ser Gln Val Val Asn Gln Met Leu Glu Arg Thr Ser Ser Gly Ser 
    290                  295                300                
Phe Gly Gly Asn Glu Gly Phe Thr Tyr Ile Ser Leu Leu Ser Val Pro 
305                  310                315                  320
Phe Gly Gly Val Gly His Ser Gly Met Gly Arg Tyr His Gly Lys Phe 
                325                  330                335    
Thr Phe Asp Thr Phe Ser His His Arg Thr Cys Leu Leu Ala Pro Ser 
            340                  345                350        
Gly Leu Glu Lys Leu Lys Glu Ile His Tyr Pro Pro Tyr Thr Asp Trp 
        355                  360                365            
Asn Gln Gln Leu Leu Arg Trp Gly Met Gly Ser Gln Ser Cys Thr Leu 
    370                  375                380                
Leu 
385

<210> 5
<211> 1934
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..1934
<223> /mol_type="DNA"
      /note="CGREF1"
      /organism="Homo sapiens"

<400> 5
cacacgcgca cactcacacg ggcgcgcgca gcccctccgg ccgcgggcgc agcgggggcg      60

ctggtggagc tgcgaagggc caggtccggc gggcggggcg gcggctggca ctggctccgg     120

actctgcccg gccagggcgg cggctccagc cgggagggcg acgtggagcg gccacgtgga     180

gcggcccggg ggaggctggc ggcgggaggc gaggcgcggg cggcgcagca gccaggagcg     240

cccacggagc tggaccccca gagccgcgcg gcgccgcagc agttccagga aggatgttac     300

ctttgacgat gacagtgtta atcctgctgc tgctccccac gggtcaggct gccccaaagg     360

atggagtcac aaggccagac tctgaagtgc agcatcagct cctgcccaac cccttccagc     420

caggccagga gcagctcgga cttctgcaga gctacctaaa gggactagga aggacagaag     480

tgcaactgga gcatctgagc cgggagcagg ttctcctcta cctctttgcc ctccatgact     540

atgaccagag tggacagctg gatggcctgg agctgctgtc catgttgaca gctgctctgg     600

cccctggagc tgccaactct cctaccacca acccggtgat cttgatagtg gacaaagtgc     660

tcgagaccca ggacctgaat ggggatgggc tcatgacccc tgctgagctc atcaacttcc     720

cgggagtagc cctcaggcac gtggagcccg gagagcccct tgctccatct cctcaggagc     780

cacaagctgt tggaaggcag tccctattag ctaaaagccc attaagacaa gaaacacagg     840

aagcccctgg tcccagagaa gaagcaaagg gccaggtaga ggccagaagg gagtctttgg     900

atcctgtcca ggagcctggg ggccaggcag aggctgatgg agatgttcca gggcccagag     960

gggaagctga gggccaggca gaggctaaag gagatgcccc tgggcccaga ggggaagctg    1020

ggggccaggc agaggctgaa ggagatgccc ccgggcccag aggggaagct gggggccagg    1080

cagaggctga aggagatgcc cccgggccca gaggggaagc tgggggccag gcagaggcca    1140

gggagaatgg agaggaggcc aaggaacttc caggggaaac actggagtct aagaacaccc    1200

aaaatgactt tgaggtgcac attgttcaag tggagaatga tgagatctag atcttgaaga    1260

tacaggtacc ccacgaagtc tcagtgccag aacataagcc ctgaagtggg caggggaaat    1320

gtacgctggg acaaggacca tctctgtgcc ccctgcctgg tcccagtagg tatcaggtct    1380

ttctgtgcag ctcagggaga ccctaagtta aggggcagat taccaataaa gaactgaatg    1440

aattcatccc cccggccacc tctctacccg tccagcctgc ccagaccctc tcagaggaac    1500

ggggttgggg accgaaagga cagggatgcc gcctgcccag tgtttctggg cctcacggtg    1560

ctccggcagc agagcgcatg gtgctagcca tggccggctg cagaggaccc agtgaggaaa    1620

gctcagtcta tccctgggcc ccaaaccctc accggttccc cctcacctgg tgttcagaca    1680

ccccatgctc tcctgcagct cagggcaggt gaccccatcc ccagtaatat taatcatcac    1740

tagaactttt tgagagcctt gtacacatca ggcatcatgc tgggcatttt atatatgatt    1800

ttatcctcac aataattctg tagccaagca gaattggttc catttgacag atgaagaaat    1860

tgaggcagat tgcgttaagt gctgtaccct aaggtgatat gcagctaatt aaatggcaga    1920

tttgaatcca aaaa                                                      1934


<210> 6
<211> 318
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..318
<223> /mol_type="protein"
      /note="CGREF1"
      /organism="Homo sapiens"

<400> 6
Met Leu Pro Leu Thr Met Thr Val Leu Ile Leu Leu Leu Leu Pro Thr 
1               5                   10                   15    
Gly Gln Ala Ala Pro Lys Asp Gly Val Thr Arg Pro Asp Ser Glu Val 
            20                   25                  30        
Gln His Gln Leu Leu Pro Asn Pro Phe Gln Pro Gly Gln Glu Gln Leu 
        35                   40                  45            
Gly Leu Leu Gln Ser Tyr Leu Lys Gly Leu Gly Arg Thr Glu Val Gln 
    50                   55                  60                
Leu Glu His Leu Ser Arg Glu Gln Val Leu Leu Tyr Leu Phe Ala Leu 
65                   70                  75                  80
His Asp Tyr Asp Gln Ser Gly Gln Leu Asp Gly Leu Glu Leu Leu Ser 
                85                   90                  95    
Met Leu Thr Ala Ala Leu Ala Pro Gly Ala Ala Asn Ser Pro Thr Thr 
            100                  105                110        
Asn Pro Val Ile Leu Ile Val Asp Lys Val Leu Glu Thr Gln Asp Leu 
        115                  120                125            
Asn Gly Asp Gly Leu Met Thr Pro Ala Glu Leu Ile Asn Phe Pro Gly 
    130                  135                140                
Val Ala Leu Arg His Val Glu Pro Gly Glu Pro Leu Ala Pro Ser Pro 
145                  150                155                  160
Gln Glu Pro Gln Ala Val Gly Arg Gln Ser Leu Leu Ala Lys Ser Pro 
                165                  170                175    
Leu Arg Gln Glu Thr Gln Glu Ala Pro Gly Pro Arg Glu Glu Ala Lys 
            180                  185                190        
Gly Gln Val Glu Ala Arg Arg Glu Ser Leu Asp Pro Val Gln Glu Pro 
        195                  200                205            
Gly Gly Gln Ala Glu Ala Asp Gly Asp Val Pro Gly Pro Arg Gly Glu 
    210                  215                220                
Ala Glu Gly Gln Ala Glu Ala Lys Gly Asp Ala Pro Gly Pro Arg Gly 
225                  230                235                  240
Glu Ala Gly Gly Gln Ala Glu Ala Glu Gly Asp Ala Pro Gly Pro Arg 
                245                  250                255    
Gly Glu Ala Gly Gly Gln Ala Glu Ala Glu Gly Asp Ala Pro Gly Pro 
            260                  265                270        
Arg Gly Glu Ala Gly Gly Gln Ala Glu Ala Arg Glu Asn Gly Glu Glu 
        275                  280                285            
Ala Lys Glu Leu Pro Gly Glu Thr Leu Glu Ser Lys Asn Thr Gln Asn 
    290                  295                300                
Asp Phe Glu Val His Ile Val Gln Val Glu Asn Asp Glu Ile 
305                  310                315            

<210> 7
<211> 2471
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..2471
<223> /mol_type="DNA"
      /note="COMP"
      /organism="Homo sapiens"

<400> 7
agaaagcgag cagccaccca gctccccgcc accgccatgg tccccgacac cgcctgcgtt      60

cttctgctca ccctggctgc cctcggcgcg tccggacagg gccagagccc gttgggctca     120

gacctgggcc cgcagatgct tcgggaactg caggaaacca acgcggcgct gcaggacgtg     180

cgggagctgc tgcggcagca ggtcagggag atcacgttcc tgaaaaacac ggtgatggag     240

tgtgacgcgt gcgggatgca gcagtcagta cgcaccggcc tacccagcgt gcggcccctg     300

ctccactgcg cgcccggctt ctgcttcccc ggcgtggcct gcatccagac ggagagcggc     360

gcgcgctgcg gcccctgccc cgcgggcttc acgggcaacg gctcgcactg caccgacgtc     420

aacgagtgca acgcccaccc ctgcttcccc cgagtccgct gtatcaacac cagcccgggg     480

ttccgctgcg aggcttgccc gccggggtac agcggcccca cccaccaggg cgtggggctg     540

gctttcgcca aggccaacaa gcaggtttgc acggacatca acgagtgtga gaccgggcaa     600

cataactgcg tccccaactc cgtgtgcatc aacacccggg gctccttcca gtgcggcccg     660

tgccagcccg gcttcgtggg cgaccaggcg tccggctgcc agcggcgcgc acagcgcttc     720

tgccccgacg gctcgcccag cgagtgccac gagcatgcag actgcgtcct agagcgcgat     780

ggctcgcggt cgtgcgtgtg tgccgttggc tgggccggca acgggatcct ctgtggtcgc     840

gacactgacc tagacggctt cccggacgag aagctgcgct gcccggagcg ccagtgccgt     900

aaggacaact gcgtgactgt gcccaactca gggcaggagg atgtggaccg cgatggcatc     960

ggagacgcct gcgatccgga tgccgacggg gacggggtcc ccaatgaaaa ggacaactgc    1020

ccgctggtgc ggaacccaga ccagcgcaac acggacgagg acaagtgggg cgatgcgtgc    1080

gacaactgcc ggtcccagaa gaacgacgac caaaaggaca cagaccagga cggccggggc    1140

gatgcgtgcg acgacgacat cgacggcgac cggatccgca accaggccga caactgccct    1200

agggtaccca actcagacca gaaggacagt gatggcgatg gtatagggga tgcctgtgac    1260

aactgtcccc agaagagcaa cccggatcag gcggatgtgg accacgactt tgtgggagat    1320

gcttgtgaca gcgatcaaga ccaggatgga gacggacatc aggactctcg ggacaactgt    1380

cccacggtgc ctaacagtgc ccaggaggac tcagaccacg atggccaggg tgatgcctgc    1440

gacgacgacg acgacaatga cggagtccct gacagtcggg acaactgccg cctggtgcct    1500

aaccccggcc aggaggacgc ggacagggac ggcgtgggcg acgtgtgcca ggacgacttt    1560

gatgcagaca aggtggtaga caagatcgac gtgtgtccgg agaacgctga agtcacgctc    1620

accgacttca gggccttcca gacagtcgtg ctggacccgg agggtgacgc gcagattgac    1680

cccaactggg tggtgctcaa ccagggaagg gagatcgtgc agacaatgaa cagcgaccca    1740

ggcctggctg tgggttacac tgccttcaat ggcgtggact tcgagggcac gttccatgtg    1800

aacacggtca cggatgacga ctatgcgggc ttcatctttg gctaccagga cagctccagc    1860

ttctacgtgg tcatgtggaa gcagatggag caaacgtatt ggcaggcgaa ccccttccgt    1920

gctgtggccg agcctggcat ccaactcaag gctgtgaagt cttccacagg ccccggggaa    1980

cagctgcgga acgctctgtg gcatacagga gacacagagt cccaggtgcg gctgctgtgg    2040

aaggacccgc gaaacgtggg ttggaaggac aagaagtcct atcgttggtt cctgcagcac    2100

cggccccaag tgggctacat cagggtgcga ttctatgagg gccctgagct ggtggccgac    2160

agcaacgtgg tcttggacac aaccatgcgg ggtggccgcc tgggggtctt ctgcttctcc    2220

caggagaaca tcatctgggc caacctgcgt taccgctgca atgacaccat cccagaggac    2280

tatgagaccc atcagctgcg gcaagcctag ggaccagggt gaggacccgc cggatgacag    2340

ccaccctcac cgcggctgga tgggggctct gcacccagcc ccaaggggtg gccgtcctga    2400

gggggaagtg agaagggctc agagaggaca aaataaagtg tgtgtgcagg gaaaaaaaaa    2460

aaaaaaaaaa a                                                         2471


<210> 8
<211> 757
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..757
<223> /mol_type="protein"
      /note="COMP"
      /organism="Homo sapiens"

<400> 8
Met Val Pro Asp Thr Ala Cys Val Leu Leu Leu Thr Leu Ala Ala Leu 
1               5                   10                   15    
Gly Ala Ser Gly Gln Gly Gln Ser Pro Leu Gly Ser Asp Leu Gly Pro 
            20                   25                  30        
Gln Met Leu Arg Glu Leu Gln Glu Thr Asn Ala Ala Leu Gln Asp Val 
        35                   40                  45            
Arg Glu Leu Leu Arg Gln Gln Val Arg Glu Ile Thr Phe Leu Lys Asn 
    50                   55                  60                
Thr Val Met Glu Cys Asp Ala Cys Gly Met Gln Gln Ser Val Arg Thr 
65                   70                  75                  80
Gly Leu Pro Ser Val Arg Pro Leu Leu His Cys Ala Pro Gly Phe Cys 
                85                   90                  95    
Phe Pro Gly Val Ala Cys Ile Gln Thr Glu Ser Gly Ala Arg Cys Gly 
            100                  105                110        
Pro Cys Pro Ala Gly Phe Thr Gly Asn Gly Ser His Cys Thr Asp Val 
        115                  120                125            
Asn Glu Cys Asn Ala His Pro Cys Phe Pro Arg Val Arg Cys Ile Asn 
    130                  135                140                
Thr Ser Pro Gly Phe Arg Cys Glu Ala Cys Pro Pro Gly Tyr Ser Gly 
145                  150                155                  160
Pro Thr His Gln Gly Val Gly Leu Ala Phe Ala Lys Ala Asn Lys Gln 
                165                  170                175    
Val Cys Thr Asp Ile Asn Glu Cys Glu Thr Gly Gln His Asn Cys Val 
            180                  185                190        
Pro Asn Ser Val Cys Ile Asn Thr Arg Gly Ser Phe Gln Cys Gly Pro 
        195                  200                205            
Cys Gln Pro Gly Phe Val Gly Asp Gln Ala Ser Gly Cys Gln Arg Arg 
    210                  215                220                
Ala Gln Arg Phe Cys Pro Asp Gly Ser Pro Ser Glu Cys His Glu His 
225                  230                235                  240
Ala Asp Cys Val Leu Glu Arg Asp Gly Ser Arg Ser Cys Val Cys Ala 
                245                  250                255    
Val Gly Trp Ala Gly Asn Gly Ile Leu Cys Gly Arg Asp Thr Asp Leu 
            260                  265                270        
Asp Gly Phe Pro Asp Glu Lys Leu Arg Cys Pro Glu Arg Gln Cys Arg 
        275                  280                285            
Lys Asp Asn Cys Val Thr Val Pro Asn Ser Gly Gln Glu Asp Val Asp 
    290                  295                300                
Arg Asp Gly Ile Gly Asp Ala Cys Asp Pro Asp Ala Asp Gly Asp Gly 
305                  310                315                  320
Val Pro Asn Glu Lys Asp Asn Cys Pro Leu Val Arg Asn Pro Asp Gln 
                325                  330                335    
Arg Asn Thr Asp Glu Asp Lys Trp Gly Asp Ala Cys Asp Asn Cys Arg 
            340                  345                350        
Ser Gln Lys Asn Asp Asp Gln Lys Asp Thr Asp Gln Asp Gly Arg Gly 
        355                  360                365            
Asp Ala Cys Asp Asp Asp Ile Asp Gly Asp Arg Ile Arg Asn Gln Ala 
    370                  375                380                
Asp Asn Cys Pro Arg Val Pro Asn Ser Asp Gln Lys Asp Ser Asp Gly 
385                  390                395                  400
Asp Gly Ile Gly Asp Ala Cys Asp Asn Cys Pro Gln Lys Ser Asn Pro 
                405                  410                415    
Asp Gln Ala Asp Val Asp His Asp Phe Val Gly Asp Ala Cys Asp Ser 
            420                  425                430        
Asp Gln Asp Gln Asp Gly Asp Gly His Gln Asp Ser Arg Asp Asn Cys 
        435                  440                445            
Pro Thr Val Pro Asn Ser Ala Gln Glu Asp Ser Asp His Asp Gly Gln 
    450                  455                460                
Gly Asp Ala Cys Asp Asp Asp Asp Asp Asn Asp Gly Val Pro Asp Ser 
465                  470                475                  480
Arg Asp Asn Cys Arg Leu Val Pro Asn Pro Gly Gln Glu Asp Ala Asp 
                485                  490                495    
Arg Asp Gly Val Gly Asp Val Cys Gln Asp Asp Phe Asp Ala Asp Lys 
            500                  505                510        
Val Val Asp Lys Ile Asp Val Cys Pro Glu Asn Ala Glu Val Thr Leu 
        515                  520                525            
Thr Asp Phe Arg Ala Phe Gln Thr Val Val Leu Asp Pro Glu Gly Asp 
    530                  535                540                
Ala Gln Ile Asp Pro Asn Trp Val Val Leu Asn Gln Gly Arg Glu Ile 
545                  550                555                  560
Val Gln Thr Met Asn Ser Asp Pro Gly Leu Ala Val Gly Tyr Thr Ala 
                565                  570                575    
Phe Asn Gly Val Asp Phe Glu Gly Thr Phe His Val Asn Thr Val Thr 
            580                  585                590        
Asp Asp Asp Tyr Ala Gly Phe Ile Phe Gly Tyr Gln Asp Ser Ser Ser 
        595                  600                605            
Phe Tyr Val Val Met Trp Lys Gln Met Glu Gln Thr Tyr Trp Gln Ala 
    610                  615                620                
Asn Pro Phe Arg Ala Val Ala Glu Pro Gly Ile Gln Leu Lys Ala Val 
625                  630                635                  640
Lys Ser Ser Thr Gly Pro Gly Glu Gln Leu Arg Asn Ala Leu Trp His 
                645                  650                655    
Thr Gly Asp Thr Glu Ser Gln Val Arg Leu Leu Trp Lys Asp Pro Arg 
            660                  665                670        
Asn Val Gly Trp Lys Asp Lys Lys Ser Tyr Arg Trp Phe Leu Gln His 
        675                  680                685            
Arg Pro Gln Val Gly Tyr Ile Arg Val Arg Phe Tyr Glu Gly Pro Glu 
    690                  695                700                
Leu Val Ala Asp Ser Asn Val Val Leu Asp Thr Thr Met Arg Gly Gly 
705                  710                715                  720
Arg Leu Gly Val Phe Cys Phe Ser Gln Glu Asn Ile Ile Trp Ala Asn 
                725                  730                735    
Leu Arg Tyr Arg Cys Asn Asp Thr Ile Pro Glu Asp Tyr Glu Thr His 
            740                  745                750        
Gln Leu Arg Gln Ala 
        755        

<210> 9
<211> 1692
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..1692
<223> /mol_type="DNA"
      /note="C19orf48"
      /organism="Homo sapiens"

<400> 9
tgaaatgggg tttcccaaac aggcgtgtgt attggacgcc tcgggcggag cgcgggctgg      60

cgccgaggac cggccttgcg agcggcgcgc actataaaat ggcgcgtgct gcaacccgcg     120

cccgcttcgg agagagaaat gctgggagac agggtttcac catattggcc aggctggtct     180

cgaactcctg acttcgtgat ctgcccacct cggcttccca aagtgctgag gttgcaggcg     240

tgagccaccg tgcccggccg cgtttcctac tctttaagct ctgttagctt ggcctctgtc     300

cctgaaggtg cagcttcaag cttaggacca cccaccatgc ctatccaggt gctgaagggc     360

ctgaccatca ctcattaaga acagaggagg ctgcctgtta ctcctggtgt tgcatccctc     420

cagacactct gctgtttcct gcctaggcgt ggctgcagcc atggctagga aagcgctgcc     480

acccacccac ctgggccaga gctggttctg ctcctgctgc agggacactg agctggctat     540

ctcggcgctt cgggcaagaa ctgcaacagg ctctcctggg tcctgcaggt gtacagccgg     600

gcccctgcct tgtgcctcag ctctcgagag ctgctgctgc cgggtgacct gatccaacct     660

gataaggtgc catcttcagc taccactgca aggccctgag ggcaacagca gcacggcact     720

gcccacccgg ctgctgatgg cctggtgcca gctgggagtc ctcccggcac ttcgaggcca     780

ctgagccacc cttccagccc cagcccacca tggacagggg tatccagctt cctcctcaac     840

ctcgtcctct gcccctgagc cagtgacgcc caaggacatg cctgttaccc aggtcctgta     900

ccagcactag ctggtcaagg gcatgacagt gctggaggcc gtcttggaga tccaggccat     960

cactggcagc aggctgctct ccatggtgcc agggcccgcc aggccaccag gctcatgctg    1020

ggacccaacc cagtgcacaa ggacttggct gctgagccac acacccagga gaaggtggat    1080

aagtgggcta ccaagggctt cctgcaggct aggggaggag ccacccccgc ttccctattg    1140

tgaccaggcc tatggggagg agctgtccat acgccaccgt gagacctggg cctggctctc    1200

aaggacagac accgcctggc ctggtgctcc aggggtgaag caggccagaa tcctggggga    1260

gctgctcctg gtttgagctg cattcaggaa gtgcgggaca tggtagggga ggcaaaaagc    1320

cttgggcact accctccctg tggagctgtt cggtgtccgt cgagctagcc acaccctgac    1380

accatgttca agggtaccgg aagagaaggg tgtctgcccc caacctcccc tgtgggtgtc    1440

actggccaga tgtcatgagg gaagcaggcc ttgtgagtgg acactgacca tgagtccctg    1500

gggggagtga tcccccaggc atcgtgtgcc atgttgcact tctgcccagg cagcagggtg    1560

ggtgggtacc atgggtgccc acccctccac cacatggggc cccaaagcac tgcaggccaa    1620

gcagggcaac cccacaccct tgacataaaa gcatcttgaa gcttttaaaa aaaaaaaaaa    1680

aaaaaaaaaa aa                                                        1692


<210> 10
<211> 117
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..117
<223> /mol_type="protein"
      /note="C19orf48"
      /organism="Homo sapiens"

<400> 10
Met Thr Val Leu Glu Ala Val Leu Glu Ile Gln Ala Ile Thr Gly Ser 
1               5                   10                   15    
Arg Leu Leu Ser Met Val Pro Gly Pro Ala Arg Pro Pro Gly Ser Cys 
            20                   25                  30        
Trp Asp Pro Thr Gln Cys Thr Arg Thr Trp Leu Leu Ser His Thr Pro 
        35                   40                  45            
Arg Arg Arg Trp Ile Ser Gly Leu Pro Arg Ala Ser Cys Arg Leu Gly 
    50                   55                  60                
Glu Glu Pro Pro Pro Leu Pro Tyr Cys Asp Gln Ala Tyr Gly Glu Glu 
65                   70                  75                  80
Leu Ser Ile Arg His Arg Glu Thr Trp Ala Trp Leu Ser Arg Thr Asp 
                85                   90                  95    
Thr Ala Trp Pro Gly Ala Pro Gly Val Lys Gln Ala Arg Ile Leu Gly 
            100                  105                110        
Glu Leu Leu Leu Val 
        115        

<210> 11
<211> 2403
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..2403
<223> /mol_type="DNA"
      /note="DLX1"
      /organism="Homo sapiens"

<400> 11
aagctttgaa ccgagtttgg ggagctcagc agcatcatgc ttagactttt caaagagaca      60

aactccattt tcttatgaat ggaaagtgaa aacccctgtt ccgcttaaat tgggttcctt     120

cctgtcctga gaaacataga gacccccaaa agggaagcag aggagagaaa gtcccacacc     180

cagaccccgc gagaagagat gaccatgacc accatgccag aaagtctcaa cagccccgtg     240

tcgggcaagg cggtgtttat ggagtttggg ccgcccaacc agcaaatgtc tccttctccc     300

atgtcccacg ggcactactc catgcactgt ttacactcgg cgggccattc gcagcccgac     360

ggcgcctaca gctcagcctc gtccttctcc cgaccgctgg gctaccccta cgtcaactcg     420

gtcagcagcc acgcatccag cccctacatc agttcggtgc agtcctaccc gggcagcgcc     480

agcctcgccc agagccgcct ggaggaccca ggggcggact cggagaagag cacggtggtg     540

gaaggcggtg aagtgcgctt caatggcaag ggaaaaaaga tccgtaaacc caggacgatt     600

tattccagtt tgcagttgca ggctttgaac cggaggttcc agcaaactca gtacctagct     660

ctgccggaga gggcggagct cgcggcctct ttgggactca cacagactca ggtcaagatc     720

tggttccaaa acaagcgatc caagttcaag aagctgatga agcagggtgg ggcggctctg     780

gagggtagtg cgttggccaa cggtcgggcc ctgtctgctg gctccccacc cgtgccgccc     840

ggctggaacc ctaactcttc atccgggaag ggctcaggag gaaacgcggg ctcctatatc     900

cccagctaca catcgtggta cccttcagcg caccaagaag ctatgcagca accccaactt     960

atgtgaggtt gcccgcccgt ctccttcttg tctccccggc ccaggtccct cccgcctcca    1020

ggtccatcca tcccgtccgg aaaagaagga cccagaggga agaaggaaca gtggaggcgg    1080

gacgccctcc atctcctcgg agccccgcga ggtccggccc agcaacttcc cggcatccgc    1140

gctctagcct gaaccctggc ctgggccgag cagtggcagc agagagtggc ctcggaggga    1200

agccactgcc acctgagaca gcccaagcag caagataaac ccgctccacc cgacccgccg    1260

accttcagct ttgtgggact atcaggaaaa aacaaaacaa aaacaaaatg tagaaaaagc    1320

aaaagctctt ttctgtcctg tcagtctcct gtctcctttt gctctgtctg tgcgctggta    1380

aagtccaggt cctcatccgt ccgctgtcct cattctgcgg cctcagcaaa aagccacaag    1440

gtctgagcgg cccgggtcct gccgggctga ccatctccgg atcctgggac actctgcctg    1500

accatctgtg tagctggtgt gggaatctgg gggcattgga gggagggggt tttatttatt    1560

gagaaatgga cttcgcctga ggctgtttgc caattcaggg ttctgctggg cgcaaggaac    1620

gcactgttca aacgcactgt ttactttaag cgcacgggga gaaacgaata aggaggacgt    1680

ggtgattttt aatttataca gtaacttttg tacttctctg gtatggagag tttggagccg    1740

aatgatttgc attttttaca tgtccgacat tatttaataa ataattttta aaagaaaaga    1800

acgataaatg aagccaacat gattttctca tttcgggagg aactctgttg cttcgcctgg    1860

acaagaagga aaatgctgat ttcctccttg ggtagaaaga gggagcgagg gcaaatgggg    1920

agtagagaga aaacaggcga gaacaagcac tctaattcca gtgggcttta aaataagaca    1980

aaatcagctt tacaacaatc cctagaggct cgaccacaga ataatgccag tcaccaccct    2040

gaacgcacaa tctccagtgc aggatctaat gactgtacat attattgtta ttattattat    2100

tgttattatt gttgttctgt aaacatgttg cacaagctta gcctttttgc gttctgttgt    2160

gtgtggctgt aaaaccccat gctttgtgaa atgagaatct tgacattttt cttgtgaaat    2220

ttggaaaatg tgatcaattg aaatcaactg tgttttgtgt tctctatgtc aaagtttagt    2280

tttatattga gaatgttaac ttattgcttt gtatcttggg aaaaaaactt tgtaaataag    2340

ttataaagtt tctttgagac agtaaaatta tgatttcttg aaaaaaaaaa aaaaaaaaaa    2400

aaa                                                                  2403


<210> 12
<211> 255
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..255
<223> /mol_type="protein"
      /note="DLX1"
      /organism="Homo sapiens"

<400> 12
Met Thr Met Thr Thr Met Pro Glu Ser Leu Asn Ser Pro Val Ser Gly 
1               5                   10                   15    
Lys Ala Val Phe Met Glu Phe Gly Pro Pro Asn Gln Gln Met Ser Pro 
            20                   25                  30        
Ser Pro Met Ser His Gly His Tyr Ser Met His Cys Leu His Ser Ala 
        35                   40                  45            
Gly His Ser Gln Pro Asp Gly Ala Tyr Ser Ser Ala Ser Ser Phe Ser 
    50                   55                  60                
Arg Pro Leu Gly Tyr Pro Tyr Val Asn Ser Val Ser Ser His Ala Ser 
65                   70                  75                  80
Ser Pro Tyr Ile Ser Ser Val Gln Ser Tyr Pro Gly Ser Ala Ser Leu 
                85                   90                  95    
Ala Gln Ser Arg Leu Glu Asp Pro Gly Ala Asp Ser Glu Lys Ser Thr 
            100                  105                110        
Val Val Glu Gly Gly Glu Val Arg Phe Asn Gly Lys Gly Lys Lys Ile 
        115                  120                125            
Arg Lys Pro Arg Thr Ile Tyr Ser Ser Leu Gln Leu Gln Ala Leu Asn 
    130                  135                140                
Arg Arg Phe Gln Gln Thr Gln Tyr Leu Ala Leu Pro Glu Arg Ala Glu 
145                  150                155                  160
Leu Ala Ala Ser Leu Gly Leu Thr Gln Thr Gln Val Lys Ile Trp Phe 
                165                  170                175    
Gln Asn Lys Arg Ser Lys Phe Lys Lys Leu Met Lys Gln Gly Gly Ala 
            180                  185                190        
Ala Leu Glu Gly Ser Ala Leu Ala Asn Gly Arg Ala Leu Ser Ala Gly 
        195                  200                205            
Ser Pro Pro Val Pro Pro Gly Trp Asn Pro Asn Ser Ser Ser Gly Lys 
    210                  215                220                
Gly Ser Gly Gly Asn Ala Gly Ser Tyr Ile Pro Ser Tyr Thr Ser Trp 
225                  230                235                  240
Tyr Pro Ser Ala His Gln Glu Ala Met Gln Gln Pro Gln Leu Met 
                245                  250                255

<210> 13
<211> 2068
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..2068
<223> /mol_type="DNA"
      /note="GLYATL1"
      /organism="Homo sapiens"

<400> 13
agtgttggcc aatcccagca gccatacttc aactactcat agactgctga atgttcaaac      60

tgtgttcaaa taagatggtg tcacaagaag gatctgaagt ggagcttcta gtatccccag     120

gagcgcgaag tgaacacgga aggtacctgc aggatccaat tgtgtccatt gatctctcag     180

agtggctgag gataatagag tttcttcttc aaggtctcaa ggtgtatggc tctgtgtatc     240

acatcaatca cgggaacccc ttcaacatgg aggtgctggt ggattcctgg cctgaatatc     300

agatggttat tatccggcct caaaagcagg agatgactga tgacatggat tcatacacaa     360

acgtatatcg tatgttctcc aaagagcctc aaaaatcaga agaagttttg aaaaattgtg     420

agatcgtaaa ctggaaacag agactccaaa tccaaggtct tcaagaaagt ttaggtgagg     480

ggataagagt ggctacattt tcaaagtcag tgaaagtaga gcattcgaga gcactcctct     540

tggttacgga agatattctg aagctcaatg cctccagtaa aagcaagctt ggaagctggg     600

ctgagacagg ccacccagat gatgaatttg aaagtgaaac tcccaacttt aagtatgccc     660

agctggatgt ctcttattct gggctggtaa atgacaactg gaagcgaggg aagaatgaga     720

ggagcctgca ttacatcaag cgctgcatag aagacctgcc agcagcctgt atgctcggcc     780

cagagggagt cccggtctca tgggtaacca tggacccttc ttgtgaagta ggaatggcct     840

acagcatgga aaaataccga aggacaggca acatggcacg agtgatggtg cgatacatga     900

aatatctgcg tcagaagaat attccatttt acatctctgt gttggaagaa aatgaagact     960

cccgcagatt tgtggggcag tttggtttct ttgaggcctc ctgtgagtgg caccaatgga    1020

cttgctaccc acagaatcta gttccatttt agacaatgaa gctgcttagt aatctctgcc    1080

aagccatctc ttaatattaa agcagacacc acagaataga tttcttcact tacaaatgca    1140

tattgggcac ttataataca gcaggaactc ttctcacctg gagccttgat gttaaaagac    1200

acagccatgc tcttgaggag cttacaatcc tggctggagg caggggaggg tatattcttt    1260

aaatatgctt aagtgttata gggaaagacg gggttaccag taaacatgta actagaaagc    1320

caggctcagt tcttacctct gggaatcaga actctttatg caacttggtt aatagaatct    1380

actatctgga agataaatga aggattttaa taaaattttc aatagaataa acctaatctg    1440

tatggatact ttatcaaaaa tgaatgtccc tgctatttct ggatttatga ggcaatggta    1500

cactaaagaa tggaatcagt tcagtgagta gaaaggtatc caaggtgaag cctgagacga    1560

atggctttcc caggctacct tccatcactg ttgtacagaa aagaaatcca gagaatcaaa    1620

tggactggcc ttgggggtct ctgctatgga aatgccattt tttgtgtctc ctttctccta    1680

ctctttctca catcctcttc atgattgaag catggcacaa ggcaaggtgt tgcctgcgag    1740

tctggttgta agttcagcct ttggtgtttg cactactgct atcataaggg gtcagggaca    1800

ttccggggag aagtgaccac taaggtgagg attagagagt gagtagaagt gagccagaca    1860

aaaaaagcag aaaatgcaga tgatggaaag gacatgtgcc atgcactatc ataagaactt    1920

cctaactgaa cactgatact acaattctga atccctgatc ttaaaaaata attatacttc    1980

accaacaaaa cttggcctct tttggttcca ctctgccacc ctgccattgg aacttggatt    2040

actgtgaaca ttgcagctat agcaaaat                                       2068


<210> 14
<211> 333
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..333
<223> /mol_type="protein"
      /note="GLYATL1"
      /organism="Homo sapiens"

<400> 14
Met Phe Lys Leu Cys Ser Asn Lys Met Val Ser Gln Glu Gly Ser Glu 
1               5                   10                   15    
Val Glu Leu Leu Val Ser Pro Gly Ala Arg Ser Glu His Gly Arg Tyr 
            20                   25                  30        
Leu Gln Asp Pro Ile Val Ser Ile Asp Leu Ser Glu Trp Leu Arg Ile 
        35                   40                  45            
Ile Glu Phe Leu Leu Gln Gly Leu Lys Val Tyr Gly Ser Val Tyr His 
    50                   55                  60                
Ile Asn His Gly Asn Pro Phe Asn Met Glu Val Leu Val Asp Ser Trp 
65                   70                  75                  80
Pro Glu Tyr Gln Met Val Ile Ile Arg Pro Gln Lys Gln Glu Met Thr 
                85                   90                  95    
Asp Asp Met Asp Ser Tyr Thr Asn Val Tyr Arg Met Phe Ser Lys Glu 
            100                  105                110        
Pro Gln Lys Ser Glu Glu Val Leu Lys Asn Cys Glu Ile Val Asn Trp 
        115                  120                125            
Lys Gln Arg Leu Gln Ile Gln Gly Leu Gln Glu Ser Leu Gly Glu Gly 
    130                  135                140                
Ile Arg Val Ala Thr Phe Ser Lys Ser Val Lys Val Glu His Ser Arg 
145                  150                155                  160
Ala Leu Leu Leu Val Thr Glu Asp Ile Leu Lys Leu Asn Ala Ser Ser 
                165                  170                175    
Lys Ser Lys Leu Gly Ser Trp Ala Glu Thr Gly His Pro Asp Asp Glu 
            180                  185                190        
Phe Glu Ser Glu Thr Pro Asn Phe Lys Tyr Ala Gln Leu Asp Val Ser 
        195                  200                205            
Tyr Ser Gly Leu Val Asn Asp Asn Trp Lys Arg Gly Lys Asn Glu Arg 
    210                  215                220                
Ser Leu His Tyr Ile Lys Arg Cys Ile Glu Asp Leu Pro Ala Ala Cys 
225                  230                235                  240
Met Leu Gly Pro Glu Gly Val Pro Val Ser Trp Val Thr Met Asp Pro 
                245                  250                255    
Ser Cys Glu Val Gly Met Ala Tyr Ser Met Glu Lys Tyr Arg Arg Thr 
            260                  265                270        
Gly Asn Met Ala Arg Val Met Val Arg Tyr Met Lys Tyr Leu Arg Gln 
        275                  280                285            
Lys Asn Ile Pro Phe Tyr Ile Ser Val Leu Glu Glu Asn Glu Asp Ser 
    290                  295                300                
Arg Arg Phe Val Gly Gln Phe Gly Phe Phe Glu Ala Ser Cys Glu Trp 
305                  310                315                  320
His Gln Trp Thr Cys Tyr Pro Gln Asn Leu Val Pro Phe 
                325                  330            

<210> 15
<211> 1369
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..1369
<223> /mol_type="DNA"
      /note="MS4A8B"
      /organism="Homo sapiens"

<400> 15
aaacaggaaa taaatacgaa tgaaactgag ctctaagcag catgtaacct ggcctgcatc      60

caggaaatag aggacttcgg atccttctaa ccctaccacc caactggccc cagtacattc     120

attctctcag gaaaaaaaac aaggtcccca cagcaaagaa aaggaatagg atcaagagat     180

acgtggctgc tggcagagca agcatgaatt cgatgacttc agcagttccg gtggccaatt     240

ctgtgttggt ggtggcaccc cacaatggtt atcctgtgac cccaggaatt atgtctcacg     300

tgcccctgta tccaaacagc cagccgcaag tccacctagt tcctgggaac ccacctagtt     360

tggtgtcgaa tgtgaatggg cagcctgtgc agaaagctct gaaagaaggc aaaaccttgg     420

gggccatcca gatcatcatt ggcctggctc acatcggcct cggctccatc atggcgacgg     480

ttctcgtagg ggaatacctg tctatttcat tctacggagg ctttcccttc tggggaggct     540

tgtggtttat catttcagga tctctctccg tggcagcaga aaatcagcca tattcttatt     600

gcctgctgtc tggcagtttg ggcttgaaca tcgtcagtgc aatctgctct gcagttggag     660

tcatactctt catcacagat ctaagtattc cccacccata tgcctacccc gactattatc     720

cttacgcctg gggtgtgaac cctggaatgg cgatttctgg cgtgctgctg gtcttctgcc     780

tcctggagtt tggcatcgca tgcgcatctt cccactttgg ctgccagttg gtctgctgtc     840

aatcaagcaa tgtgagtgtc atctatccaa acatctatgc agcaaaccca gtgatcaccc     900

cagaaccggt gacctcacca ccaagttatt ccagtgagat ccaagcaaat aagtaaggct     960

acagattctg gaagcatctt tcactgggac caaaagaagt cctcctccct ttctgggctt    1020

ccataaccca ggtcgttcct gttctgacag ctgaggaaac gtctctccca ctgtttgtac    1080

tctcaccttc attcttcaat tcagtctagg aaaccatgct gtttctctat caagaagaag    1140

acagagattt taaacagatg ttaaccaaga gggactccct agggcacatg catcagcaca    1200

tatgtgggca tccagcctct ggggccttgg cacacacaca ttcgtgtgct ctgctgcatg    1260

tgagcttgtg ggttagagga acaaatatct agacattcaa tcttcactct ttcaattgtg    1320

cattcattta ataaatagat actgagcatt caaaaaaaaa aaaaaaaaa                1369


<210> 16
<211> 250
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..250
<223> /mol_type="protein"
      /note="MS4A8B"
      /organism="Homo sapiens"

<400> 16
Met Asn Ser Met Thr Ser Ala Val Pro Val Ala Asn Ser Val Leu Val 
1               5                   10                   15    
Val Ala Pro His Asn Gly Tyr Pro Val Thr Pro Gly Ile Met Ser His 
            20                   25                  30        
Val Pro Leu Tyr Pro Asn Ser Gln Pro Gln Val His Leu Val Pro Gly 
        35                   40                  45            
Asn Pro Pro Ser Leu Val Ser Asn Val Asn Gly Gln Pro Val Gln Lys 
    50                   55                  60                
Ala Leu Lys Glu Gly Lys Thr Leu Gly Ala Ile Gln Ile Ile Ile Gly 
65                   70                  75                  80
Leu Ala His Ile Gly Leu Gly Ser Ile Met Ala Thr Val Leu Val Gly 
                85                   90                  95    
Glu Tyr Leu Ser Ile Ser Phe Tyr Gly Gly Phe Pro Phe Trp Gly Gly 
            100                  105                110        
Leu Trp Phe Ile Ile Ser Gly Ser Leu Ser Val Ala Ala Glu Asn Gln 
        115                  120                125            
Pro Tyr Ser Tyr Cys Leu Leu Ser Gly Ser Leu Gly Leu Asn Ile Val 
    130                  135                140                
Ser Ala Ile Cys Ser Ala Val Gly Val Ile Leu Phe Ile Thr Asp Leu 
145                  150                155                  160
Ser Ile Pro His Pro Tyr Ala Tyr Pro Asp Tyr Tyr Pro Tyr Ala Trp 
                165                  170                175    
Gly Val Asn Pro Gly Met Ala Ile Ser Gly Val Leu Leu Val Phe Cys 
            180                  185                190        
Leu Leu Glu Phe Gly Ile Ala Cys Ala Ser Ser His Phe Gly Cys Gln 
        195                  200                205            
Leu Val Cys Cys Gln Ser Ser Asn Val Ser Val Ile Tyr Pro Asn Ile 
    210                  215                220                
Tyr Ala Ala Asn Pro Val Ile Thr Pro Glu Pro Val Thr Ser Pro Pro 
225                  230                235                  240
Ser Tyr Ser Ser Glu Ile Gln Ala Asn Lys 
                245                  250

<210> 17
<211> 2930
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..2930
<223> /mol_type="DNA"
      /note="NKAIN1"
      /organism="Homo sapiens"

<400> 17
agtgctgctc tgcgctgcgc cgcgctcggg gctcgctctc cttgctccgc gctccccgcc      60

agccgccccg gggcaggagg cgcgcctgac ggacggcccg ctagacaaag gaggcgcggc     120

tcggcggggc cagcgcgcgg acggacggac catggactcg gagcgcgggc ggccggcccc     180

agccttgggg accggacact cccgggcccg gccctaggcg cccggccccg ccgcccggcg     240

cgcccagcgg ggaggacgtg gagcccgcgc ggcgcgagca ggcggcggcc gcggagcaag     300

aagggcgccg cggcgtgcgg cccgcgcagc ccccggagcc atgggcaagt gcagcgggcg     360

ctgcacgctg gtcgccttct gctgcctgca gctggtggct gcgctggagc ggcagatctt     420

tgacttcctg ggctaccagt gggctcccat cctagccaac ttcctgcaca tcatggcagt     480

catcctgggc atctttggca ccgtgcagta ccgctcccgg tacctcatcc tgtatgcagc     540

ctggctggtg ctctgggttg gctggaatgc atttatcatc tgcttctact tggaggttgg     600

acagctgtcc caggaccggg acttcatcat gaccttcaac acatccctgc accgctcctg     660

gtggatggag aatgggccag gctgcctggt gacacctgtt ctgaactccc gcctggctct     720

ggaggaccac catgtcatct ctgtcactgg ctgcctgctt gactacccct acattgaagc     780

cctcagcagc gccctgcaga tcttcctggc actgttcggc ttcgtgttcg cctgctacgt     840

gagcaaagtg ttcctggagg aggaggacag ctttgacttc atcggcggct ttgactccta     900

cggataccag gcgccccaga agacgtcgca tttacagctg cagcctctgt acacgtcggg     960

gtagcctctg ccccgcgccc accccggcgc ctcgccctgg gctgaccgca gctgccgcga    1020

gctcgggcca aggcgcaggc gtgtccccct ggtggcccgc gcgctcactg cagcctgtgc    1080

ccaaccccgc gtctgcatct ggagatgcgg acttggacgt ggacttggac ttggacttgg    1140

atttgagctt ggctcttcgc agcccggact tcggaggagt ggggcggggc gggggagggg    1200

caccacgggt tttttgtttt ttgtttgttt gtttttaatc tcagccttgg cgtgagctgg    1260

ggccttcctc tcttctccag cctctccctt tcactcttca cccagcatcc tgcccccctg    1320

tccaaaaaca gcaggacatc agacccatcc catcccacca cactcactca ccagctctgg    1380

ggaaagctac tgtgaactag gagcaggatt cctgggttct aatcgcaggt ccatcactga    1440

ctgtgacgtc tagcaaagcc cttgccctct ctgagcctcg gtttccgcac ctcaagtaat    1500

taatccctta gcaaatggac tcttttagac ttctcattta actcaattcc ctgagctaga    1560

ctgggattaa aattctcatt ttgcagtaca ttaaaactga ggcccagaga tgtgatttgc    1620

ttgaggccac acagctagat ttttggtgga agtgggcctt gaacacagtg tactttctgc    1680

agtttctgac tgtaaaaccc agtgtctgct ctctgagttc catttccaag cccccctcca    1740

tcttggacct atgtggtctc caccatattc acacaccacc accaccactt gccaatgcct    1800

ctcttaaagc aatataccca ttcgttctct tattgggaac tggatggatg aagccccaaa    1860

ttcagcccca cccacagaga agccttccta cactcagcct ctgtccaccc ttggcaaatc    1920

tttcaagctc tctcctccag gaaagtgggg ccccaactca gtcactccac ccccttccag    1980

gtccctgagg ctggttctac tgtatcccca tcacctccac aactccactc acccctgacg    2040

gctccatcca cctcaccagt tggaaggctt gtggtttcag agaggagcaa tgctggtcag    2100

cgctgcccag actccagtgt ttacagatca ccagcattta caaccaatcc aatggccaga    2160

agcctcctct aacaagccca gaaggagttc tgaaggggca gatgggggtg tgagtagtcg    2220

gggagtcggg attgccagca ccctcaccct tccttggggg caagtagagg tgagaacact    2280

ttccccacct ccctccacag acactcctga ggacgctgca tcccacgcac tgcctggtgc    2340

gtccatagag agaggatcag gtctcagcat ttcatctgtg aaagaggcat ggccctgggt    2400

tagaaaggag ggcaggagac atggaggaac tggggggcac ccagatggtg cagatggttt    2460

gcacacctga gcctgtctgt ggtgaccatt ccgctcctct cccactaccc tccaatctat    2520

cattccctac tctctaaggc caaaatatcc tgagcaaggc tggcaacccc accccaccat    2580

cccaaatgca agcagccagg cccaggagtt cctctggccc ccacaggcat ggagctccca    2640

gctggtgggt acagcttgag aggggggcag ctccctcagg ctaagctact gcccttcact    2700

gggccagccc tgcctccagc cctcacctct ctcaccccaa ctctccccca agcccctttc    2760

tactcaacgg gtgtagccac tggtgctttg aagccttttg tttttataag atggtttttg    2820

caaggggacc aggttctctt ttcactggga ccttgcaagg aggggagtgc tctcctggtt    2880

tctgtgcagg cgggttgatt aaagatggtg ttttcttctc taaaaaaaaa               2930


<210> 18
<211> 207
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..207
<223> /mol_type="protein"
      /note="NKAIN1"
      /organism="Homo sapiens"

<400> 18
Met Gly Lys Cys Ser Gly Arg Cys Thr Leu Val Ala Phe Cys Cys Leu 
1               5                   10                   15    
Gln Leu Val Ala Ala Leu Glu Arg Gln Ile Phe Asp Phe Leu Gly Tyr 
            20                   25                  30        
Gln Trp Ala Pro Ile Leu Ala Asn Phe Leu His Ile Met Ala Val Ile 
        35                   40                  45            
Leu Gly Ile Phe Gly Thr Val Gln Tyr Arg Ser Arg Tyr Leu Ile Leu 
    50                   55                  60                
Tyr Ala Ala Trp Leu Val Leu Trp Val Gly Trp Asn Ala Phe Ile Ile 
65                   70                  75                  80
Cys Phe Tyr Leu Glu Val Gly Gln Leu Ser Gln Asp Arg Asp Phe Ile 
                85                   90                  95    
Met Thr Phe Asn Thr Ser Leu His Arg Ser Trp Trp Met Glu Asn Gly 
            100                  105                110        
Pro Gly Cys Leu Val Thr Pro Val Leu Asn Ser Arg Leu Ala Leu Glu 
        115                  120                125            
Asp His His Val Ile Ser Val Thr Gly Cys Leu Leu Asp Tyr Pro Tyr 
    130                  135                140                
Ile Glu Ala Leu Ser Ser Ala Leu Gln Ile Phe Leu Ala Leu Phe Gly 
145                  150                155                  160
Phe Val Phe Ala Cys Tyr Val Ser Lys Val Phe Leu Glu Glu Glu Asp 
                165                  170                175    
Ser Phe Asp Phe Ile Gly Gly Phe Asp Ser Tyr Gly Tyr Gln Ala Pro 
            180                  185                190        
Gln Lys Thr Ser His Leu Gln Leu Gln Pro Leu Tyr Thr Ser Gly 
        195                  200                205        

<210> 19
<211> 4052
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..4052
<223> /mol_type="DNA"
      /note="PPFIA2"
      /organism="Homo sapiens"

<400> 19
gaggcaagtg aggagagaag atgctgtagc gtcctcaccg gctgccagca gggaaatggt       60

ccaggagtgc tgggtgtgag cctcccttct cctcaagccg gagactgcgg ttgtcattga      120

tcaattgaag aagcaaggac ccgaaatcac agacattagc aatgatgtgt gaagtgatgc      180

ccacgattaa tgaggacacc ccaatgagcc aaagggggtc ccaaagcagt ggctcggact      240

cagactccca ttttgagcag ctgatggtga atatgctaga tgaaagggat cgtcttctag      300

acacccttcg ggagacccag gaaagcctct cacttgccca gcaaagactt caggatgtca      360

tctatgaccg agactcactc cagagacagc tcaattcagc cctgccacag gatatcgaat      420

ccctaacagg agggctggct ggttctaagg gggctgatcc accggaattt gctgcactga      480

caaaagaatt aaatgcctgc agggaacaac ttctagaaaa ggaagaagaa atctctgaac      540

ttaaagctga aagaaacaac acaagactat tactggagca tttggagtgc cttgtgtcac      600

gacatgaaag atcactaaga atgacggtgg taaaacggca agcccagtct ccctcaggag      660

tatccagtga agttgaagtt ctcaaggcac tgaaatcttt gtttgagcac cacaaggcct      720

tggatgaaaa ggtaagggag cgactgaggg tttctttaga aagagtctct gcactggaag      780

aagaactagc tgctgctaat caggagattg ttgccttgcg tgaacaaaat gttcatatac      840

aaagaaaaat ggcatcaagc gagggatcca cagagtcaga acatcttgaa gggatggaac      900

ctggacagaa agtccatgag aagcgtttgt ccaatggttc tatagactca accgatgaaa      960

ctagtcaaat agttgaacta caagaattgc ttgaaaagca aaactatgaa atggcccaga     1020

tgaaagaacg tttagcagcc ctttcttccc gagtgggaga ggtggaacag gaagcagaga     1080

cagcaagaaa ggatctcatt aaaacagaag aaatgaacac caagtatcaa agggacatta     1140

gggaggccat ggcacaaaag gaagatatgg aagaaagaat tacaaccctt gaaaagcgtt     1200

acctcagtgc tcagagagaa tctacctcca tacatgacat gaatgataaa ctagaaaatg     1260

agttagcaaa taaagaagct atcctgcggc agatggaaga gaaaaacaga cagttacaag     1320

aacgtcttga gctagctgaa caaaagttgc agcagaccat gagaaaggct gaaaccttgc     1380

ctgaagtaga ggctgaactg gctcagagaa ttgcagccct aaccaaggct gaagagagac     1440

atggaaatat tgaagaacgt atgagacatt tagagggtca acttgaagag aagaatcaag     1500

aacttcaaag agctaggcaa agagagaaaa tgaatgagga gcataacaag agattatcgg     1560

atacggttga tagacttctg actgaatcca atgaacgcct acaactacac ttaaaggaaa     1620

gaatggctgc tctagaagaa aagaatgttt taattcaaga atcagaaact ttcagaaaga     1680

atcttgaaga atctttacat gataaggaaa gattagcaga agaaattgaa aagctgagat     1740

ctgaacttga ccaattgaaa atgagaactg gctctttaat tgaacccaca ataccaagaa     1800

ctcatctaga cacctcagct gagttgcggt actcagtggg atccctagtg gacagccagt     1860

ctgattacag aacaactaaa gtaataagaa gaccaaggag aggccgcatg ggtgtgcgaa     1920

gagatgagcc aaaggtgaaa tctcttgggg atcacgagtg gaatagaact caacagattg     1980

gagtactaag cagccaccct tttgaaagtg acactgaaat gtctgatatt gatgatgatg     2040

acagagaaac aatttttagc tcaatggatc ttctctctcc aagtggtcat tccgatgccc     2100

agacgctagc catgatgctt caggaacaat tggatgccat caacaaagaa atcaggctaa     2160

ttcaggaaga aaaagaatct acagagttgc gtgctgaaga aattgaaaat agagtggcta     2220

gtgtgagcct cgaaggcctg aatttggcaa gggtccaccc aggtacctcc attactgcct     2280

ctgttacagc ttcatcgctg gccagttcat ctccccccag tggacactca actccaaagc     2340

tcacccctcg aagccctgcc agggaaatgg atcggatggg agtcatgaca ctgccaagtg     2400

atctgaggaa acatcggaga aagattgcag ttgtggaaga agatggtcga gaggacaaag     2460

caacaattaa atgtgaaact tctcctcctc ctacccctag agccctcaga atgactcaca     2520

ctctcccttc ttcctaccac aatgatgctc gaagtagttt atctgtctct cttgagccag     2580

aaagcctcgg gcttggtagt gccaacagca gccaagactc tcttcacaaa gcccccaaga     2640

agaaaggaat caagtcttca ataggacgtt tgtttggtaa aaaagaaaaa gctcgacttg     2700

ggcagctccg aggctttatg gagactgaag ctgcagctca ggagtccctg gggttaggca     2760

aactcggaac tcaagctgag aaggatcgaa gactaaagaa aaagcatgaa cttcttgaag     2820

aagctcggag aaagggatta ccttttgccc agtgggatgg gccaactgtg gtcgcatggc     2880

tagagctttg gttgggaatg cctgcgtggt acgtggcagc ctgccgagcc aacgtgaaga     2940

gtggtgccat catgtctgct ttatctgaca ctgagatcca gagagaaatt ggaatcagca     3000

atccactgca tcgcttaaaa cttcgattag caatccagga gatggtttcc ctaacaagtc     3060

cttcagctcc tccaacatct cgaactcctt caggcaacgt ttgggtgact catgaagaaa     3120

tggaaaatct tgcagctcca gcaaaaacga aagaatctga ggaaggaagc tgggcccagt     3180

gtccggtttt tctacagacc ctggcttatg gagatatgaa tcatgagtgg attggaaatg     3240

aatggcttcc cagcttgggg ttacctcagt acagaagtta ctttatggaa tgcttggtag     3300

atgcaagaat gttagatcac ctaacaaaaa aagatctccg tgtccattta aaaatggtgg     3360

atagtttcca tcgaacaagt ttacaatatg gaattatgtg cttaaagagg ttgaattatg     3420

acagaaaaga actagaaaga agacgggaag caagccaaca tgaaataaaa gacgtgttgg     3480

tgtggagcaa tgaccgagtt attcgctgga tacaagcaat tggacttcga gaatatgcaa     3540

ataatatact tgagagcggt gtgcatggct cacttatagc cctggatgaa aactttgact     3600

acagcagctt agctttatta ttacagattc caacacagaa cacccaggca aggcagattc     3660

ttgaaagaga atacaataac ctcttggccc tgggaactga aaggcgactg gatgaaagtg     3720

atgacaagaa cttcagacgt ggatcaacct ggagaaggca gtttcctcct cgtgaagtac     3780

atggaatcag catgatgcct gggtcctcag aaacattacc agctggattt aggttaacca     3840

caacctctgg gcagtcaaga aaaatgacaa cagatgttgc ttcatcaaga ctgcagaggt     3900

tagacaactc cactgttcgc acatactcat gttgaccagc cactcaaagg aggcagcact     3960

gacctgctat ggcgtctttt cagtctactc tacctaaagt gcactaccat ctaagaagac     4020

gagcagtgaa aacctttgtg aaaactgaat tc                                   4052


<210> 20
<211> 1257
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..1257
<223> /mol_type="protein"
      /note="PPFIA2"
      /organism="Homo sapiens"

<400> 20
Met Met Cys Glu Val Met Pro Thr Ile Asn Glu Asp Thr Pro Met Ser 
1               5                   10                   15    
Gln Arg Gly Ser Gln Ser Ser Gly Ser Asp Ser Asp Ser His Phe Glu 
            20                   25                  30        
Gln Leu Met Val Asn Met Leu Asp Glu Arg Asp Arg Leu Leu Asp Thr 
        35                   40                  45            
Leu Arg Glu Thr Gln Glu Ser Leu Ser Leu Ala Gln Gln Arg Leu Gln 
    50                   55                  60                
Asp Val Ile Tyr Asp Arg Asp Ser Leu Gln Arg Gln Leu Asn Ser Ala 
65                   70                  75                  80
Leu Pro Gln Asp Ile Glu Ser Leu Thr Gly Gly Leu Ala Gly Ser Lys 
                85                   90                  95    
Gly Ala Asp Pro Pro Glu Phe Ala Ala Leu Thr Lys Glu Leu Asn Ala 
            100                  105                110        
Cys Arg Glu Gln Leu Leu Glu Lys Glu Glu Glu Ile Ser Glu Leu Lys 
        115                  120                125            
Ala Glu Arg Asn Asn Thr Arg Leu Leu Leu Glu His Leu Glu Cys Leu 
    130                  135                140                
Val Ser Arg His Glu Arg Ser Leu Arg Met Thr Val Val Lys Arg Gln 
145                  150                155                  160
Ala Gln Ser Pro Ser Gly Val Ser Ser Glu Val Glu Val Leu Lys Ala 
                165                  170                175    
Leu Lys Ser Leu Phe Glu His His Lys Ala Leu Asp Glu Lys Val Arg 
            180                  185                190        
Glu Arg Leu Arg Val Ser Leu Glu Arg Val Ser Ala Leu Glu Glu Glu 
        195                  200                205            
Leu Ala Ala Ala Asn Gln Glu Ile Val Ala Leu Arg Glu Gln Asn Val 
    210                  215                220                
His Ile Gln Arg Lys Met Ala Ser Ser Glu Gly Ser Thr Glu Ser Glu 
225                  230                235                  240
His Leu Glu Gly Met Glu Pro Gly Gln Lys Val His Glu Lys Arg Leu 
                245                  250                255    
Ser Asn Gly Ser Ile Asp Ser Thr Asp Glu Thr Ser Gln Ile Val Glu 
            260                  265                270        
Leu Gln Glu Leu Leu Glu Lys Gln Asn Tyr Glu Met Ala Gln Met Lys 
        275                  280                285            
Glu Arg Leu Ala Ala Leu Ser Ser Arg Val Gly Glu Val Glu Gln Glu 
    290                  295                300                
Ala Glu Thr Ala Arg Lys Asp Leu Ile Lys Thr Glu Glu Met Asn Thr 
305                  310                315                  320
Lys Tyr Gln Arg Asp Ile Arg Glu Ala Met Ala Gln Lys Glu Asp Met 
                325                  330                335    
Glu Glu Arg Ile Thr Thr Leu Glu Lys Arg Tyr Leu Ser Ala Gln Arg 
            340                  345                350        
Glu Ser Thr Ser Ile His Asp Met Asn Asp Lys Leu Glu Asn Glu Leu 
        355                  360                365            
Ala Asn Lys Glu Ala Ile Leu Arg Gln Met Glu Glu Lys Asn Arg Gln 
    370                  375                380                
Leu Gln Glu Arg Leu Glu Leu Ala Glu Gln Lys Leu Gln Gln Thr Met 
385                  390                395                  400
Arg Lys Ala Glu Thr Leu Pro Glu Val Glu Ala Glu Leu Ala Gln Arg 
                405                  410                415    
Ile Ala Ala Leu Thr Lys Ala Glu Glu Arg His Gly Asn Ile Glu Glu 
            420                  425                430        
Arg Met Arg His Leu Glu Gly Gln Leu Glu Glu Lys Asn Gln Glu Leu 
        435                  440                445            
Gln Arg Ala Arg Gln Arg Glu Lys Met Asn Glu Glu His Asn Lys Arg 
    450                  455                460                
Leu Ser Asp Thr Val Asp Arg Leu Leu Thr Glu Ser Asn Glu Arg Leu 
465                  470                475                  480
Gln Leu His Leu Lys Glu Arg Met Ala Ala Leu Glu Glu Lys Asn Val 
                485                  490                495    
Leu Ile Gln Glu Ser Glu Thr Phe Arg Lys Asn Leu Glu Glu Ser Leu 
            500                  505                510        
His Asp Lys Glu Arg Leu Ala Glu Glu Ile Glu Lys Leu Arg Ser Glu 
        515                  520                525            
Leu Asp Gln Leu Lys Met Arg Thr Gly Ser Leu Ile Glu Pro Thr Ile 
    530                  535                540                
Pro Arg Thr His Leu Asp Thr Ser Ala Glu Leu Arg Tyr Ser Val Gly 
545                  550                555                  560
Ser Leu Val Asp Ser Gln Ser Asp Tyr Arg Thr Thr Lys Val Ile Arg 
                565                  570                575    
Arg Pro Arg Arg Gly Arg Met Gly Val Arg Arg Asp Glu Pro Lys Val 
            580                  585                590        
Lys Ser Leu Gly Asp His Glu Trp Asn Arg Thr Gln Gln Ile Gly Val 
        595                  600                605            
Leu Ser Ser His Pro Phe Glu Ser Asp Thr Glu Met Ser Asp Ile Asp 
    610                  615                620                
Asp Asp Asp Arg Glu Thr Ile Phe Ser Ser Met Asp Leu Leu Ser Pro 
625                  630                635                  640
Ser Gly His Ser Asp Ala Gln Thr Leu Ala Met Met Leu Gln Glu Gln 
                645                  650                655    
Leu Asp Ala Ile Asn Lys Glu Ile Arg Leu Ile Gln Glu Glu Lys Glu 
            660                  665                670        
Ser Thr Glu Leu Arg Ala Glu Glu Ile Glu Asn Arg Val Ala Ser Val 
        675                  680                685            
Ser Leu Glu Gly Leu Asn Leu Ala Arg Val His Pro Gly Thr Ser Ile 
    690                  695                700                
Thr Ala Ser Val Thr Ala Ser Ser Leu Ala Ser Ser Ser Pro Pro Ser 
705                  710                715                  720
Gly His Ser Thr Pro Lys Leu Thr Pro Arg Ser Pro Ala Arg Glu Met 
                725                  730                735    
Asp Arg Met Gly Val Met Thr Leu Pro Ser Asp Leu Arg Lys His Arg 
            740                  745                750        
Arg Lys Ile Ala Val Val Glu Glu Asp Gly Arg Glu Asp Lys Ala Thr 
        755                  760                765            
Ile Lys Cys Glu Thr Ser Pro Pro Pro Thr Pro Arg Ala Leu Arg Met 
    770                  775                780                
Thr His Thr Leu Pro Ser Ser Tyr His Asn Asp Ala Arg Ser Ser Leu 
785                  790                795                  800
Ser Val Ser Leu Glu Pro Glu Ser Leu Gly Leu Gly Ser Ala Asn Ser 
                805                  810                815    
Ser Gln Asp Ser Leu His Lys Ala Pro Lys Lys Lys Gly Ile Lys Ser 
            820                  825                830        
Ser Ile Gly Arg Leu Phe Gly Lys Lys Glu Lys Ala Arg Leu Gly Gln 
        835                  840                845            
Leu Arg Gly Phe Met Glu Thr Glu Ala Ala Ala Gln Glu Ser Leu Gly 
    850                  855                860                
Leu Gly Lys Leu Gly Thr Gln Ala Glu Lys Asp Arg Arg Leu Lys Lys 
865                  870                875                  880
Lys His Glu Leu Leu Glu Glu Ala Arg Arg Lys Gly Leu Pro Phe Ala 
                885                  890                895    
Gln Trp Asp Gly Pro Thr Val Val Ala Trp Leu Glu Leu Trp Leu Gly 
            900                  905                910        
Met Pro Ala Trp Tyr Val Ala Ala Cys Arg Ala Asn Val Lys Ser Gly 
        915                  920                925            
Ala Ile Met Ser Ala Leu Ser Asp Thr Glu Ile Gln Arg Glu Ile Gly 
    930                  935                940                
Ile Ser Asn Pro Leu His Arg Leu Lys Leu Arg Leu Ala Ile Gln Glu 
945                  950                955                  960
Met Val Ser Leu Thr Ser Pro Ser Ala Pro Pro Thr Ser Arg Thr Pro 
                965                  970                975    
Ser Gly Asn Val Trp Val Thr His Glu Glu Met Glu Asn Leu Ala Ala 
            980                  985                990        
Pro Ala Lys Thr Lys Glu Ser Glu Glu Gly Ser Trp Ala Gln Cys Pro 
        995                  1000                1005            
Val Phe Leu Gln Thr Leu Ala Tyr Gly Asp Met Asn His Glu Trp Ile 
    1010                1015                1020                
Gly Asn Glu Trp Leu Pro Ser Leu Gly Leu Pro Gln Tyr Arg Ser Tyr 
1025                1030                1035                1040
Phe Met Glu Cys Leu Val Asp Ala Arg Met Leu Asp His Leu Thr Lys 
                1045                1050                1055    
Lys Asp Leu Arg Val His Leu Lys Met Val Asp Ser Phe His Arg Thr 
            1060                1065                1070        
Ser Leu Gln Tyr Gly Ile Met Cys Leu Lys Arg Leu Asn Tyr Asp Arg 
        1075                1080                1085            
Lys Glu Leu Glu Arg Arg Arg Glu Ala Ser Gln His Glu Ile Lys Asp 
    1090                1095                1100                
Val Leu Val Trp Ser Asn Asp Arg Val Ile Arg Trp Ile Gln Ala Ile 
1105                1110                1115                1120
Gly Leu Arg Glu Tyr Ala Asn Asn Ile Leu Glu Ser Gly Val His Gly 
                1125                1130                1135    
Ser Leu Ile Ala Leu Asp Glu Asn Phe Asp Tyr Ser Ser Leu Ala Leu 
            1140                1145                1150        
Leu Leu Gln Ile Pro Thr Gln Asn Thr Gln Ala Arg Gln Ile Leu Glu 
        1155                1160                1165            
Arg Glu Tyr Asn Asn Leu Leu Ala Leu Gly Thr Glu Arg Arg Leu Asp 
    1170                1175                1180                
Glu Ser Asp Asp Lys Asn Phe Arg Arg Gly Ser Thr Trp Arg Arg Gln 
1185                1190                1195                1200
Phe Pro Pro Arg Glu Val His Gly Ile Ser Met Met Pro Gly Ser Ser 
                1205                1210                1215    
Glu Thr Leu Pro Ala Gly Phe Arg Leu Thr Thr Thr Ser Gly Gln Ser 
            1220                1225                1230        
Arg Lys Met Thr Thr Asp Val Ala Ser Ser Arg Leu Gln Arg Leu Asp 
        1235                1240                1245            
Asn Ser Thr Val Arg Thr Tyr Ser Cys 
    1250                1255        

<210> 21
<211> 12701
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..12701
<223> /mol_type="DNA"
      /note="PTPRT"
      /organism="Homo sapiens"

<400> 21
cctcccgcct cagttcgcgc cgcgcctcgg cttggaacgc aggagcgccg gctccgggag       60

cccgagcgga gccagccgcg cgcacagcca gcggccgcgc cggcgatgcg gggccacccc      120

gcgcccgccc cagtcccggc cccggccccc gcgggaaggg gctgagctgc ccgccgccgc      180

ccggatggcg agcctcgccg cgctcgccct cagcctgctc ctgaggctgc agctgccgcc      240

actgcccggc gcccgggctc agagcgccgc aggtggctgt tcctttgatg agcactacag      300

caactgtggt tatagtgtgg ctctagggac caatgggttc acctgggagc agattaacac      360

atgggagaaa ccaatgctgg accaggcagt gcccacagga tctttcatga tggtgaacag      420

ctctgggaga gcctctggcc agaaggccca ccttctcctg ccaaccctga aggagaatga      480

cacccactgc atcgacttcc attactactt ctccagccgt gacaggtcca gcccaggggc      540

cttgaacgtc tacgtgaagg tgaatggtgg cccccaaggg aaccctgtgt ggaatgtgtc      600

cggggtcgtc actgagggct gggtgaaggc agagctcgcc atcagcactt tctggccaca      660

tttctatcag gtgatatttg aatccgtctc attgaagggt catcctggct acatcgccgt      720

ggacgaggtc cgggtccttg ctcatccatg cagaaaagca cctcattttc tgcgactcca      780

aaacgtggag gtgaatgtgg ggcagaatgc cacatttcag tgcattgctg gtgggaagtg      840

gtctcagcat gacaagcttt ggctccagca atggaatggc agggacacgg ccctgatggt      900

cacccgtgtg gtcaaccaca ggcgcttctc agccacagtc agtgtggcag acactgccca      960

gcggagcgtc agcaagtacc gctgtgtgat ccgctctgat ggtgggtctg gtgtgtccaa     1020

ctacgcggag ctgatcgtga aagagcctcc cacgcccatt gctcccccag agctgctggc     1080

tgtgggggcc acatacctgt ggatcaagcc aaatgccaac tccatcatcg gggatggccc     1140

catcatcctg aaggaagtgg aatatcgcac caccacaggc acgtgggcag agacccacat     1200

agtcgactct cccaactata agctgtggca tctggacccc gatgttgagt atgagatccg     1260

agtgctcctc acacgaccag gtgagggggg tacgggaccg ccagggcctc ccctcaccac     1320

caggaccaag tgtgcagatc cggtacatgg cccacagaac gtggaaatcg tagacatcag     1380

agcccggcag ctgaccctgc agtgggagcc cttcggctac gcggtgaccc gctgccatag     1440

ctacaacctc accgtgcagt accagtatgt gttcaaccag cagcagtacg aggccgagga     1500

ggtcatccag acctcctccc actacaccct gcgaggcctg cgccccttca tgaccatccg     1560

gctgcgactc ttgctgtcta accccgaggg ccgaatggag agcgaggagc tggtggtgca     1620

gactgaggaa gacgttccag gagctgttcc tctagaatcc atccaagggg ggccctttga     1680

ggagaagatc tacatccagt ggaaacctcc caatgagacc aatggggtca tcacgctcta     1740

cgagatcaac tacaaggctg tcggctcgct ggacccaagt gctgacctct cgagccagag     1800

ggggaaagtg ttcaagctcc ggaatgaaac ccaccacctc tttgtgggtc tgtacccagg     1860

gaccacctat tccttcacca tcaaggccag cacagcaaag ggctttgggc cccctgtcac     1920

cactcggatt gccaccaaaa tttcagctcc atccatgcct gagtacgaca cagacacccc     1980

attgaatgag acagacacga ccatcacagt gatgctgaaa cccgctcagt cccggggagc     2040

tcctgtcagt gtttatcagc tggttgtcaa ggaggagcga cttcagaagt cacggagggc     2100

agctgacatt attgagtgct tttcggtgcc cgtgagctat cggaatgcct ccagcctcga     2160

ttctctacac tactttgctg ctgagttgaa gcctgccaac ctgcctgtca cccagccatt     2220

tacagtgggt gacaataaga catacaatgg ctactggaac cctcctctct ctcccctgaa     2280

aagctacagc atctacttcc aggcactcag caaagccaat ggagagacca aaatcaactg     2340

tgttcgtctg gctacaaaag caccaatggg cagcgcccag gtgaccccgg ggactccact     2400

ctgcctcctc accacaggtg cctccaccca gaattctaac actgtggagc cagagaagca     2460

ggtggacaac accgtgaaga tggctggcgt gatcgctggc ctcctcatgt tcatcatcat     2520

tctcctgggc gtgatgctca ccatcaaaag gagaagaaat gcttattcct actcctatta     2580

cttgaagctg gccaagaagc agaaggagac ccagagtgga gcccagaggg agatggggcc     2640

tgtggcctct gccgacaaac ccaccaccaa gctcagcgcc agccgcaatg atgaaggctt     2700

ctcttctagt tctcaggacg tcaacggatt cacagatggc agccgcgggg agctttccca     2760

gcccaccctc acgatccaga ctcatcccta ccgcacctgt gaccctgtgg agatgagcta     2820

cccccgggac cagttccaac ccgccatccg ggtggctgac ttgctgcagc acatcacgca     2880

gatgaagaga ggccagggct acgggttcaa ggaggaatac gaggccttac cagaggggca     2940

gacagcttcg tgggacacag ccaaggagga tgaaaaccgc aataagaatc gatatgggaa     3000

catcatatcc tacgaccatt cccgggtgag gctgctggtg ctggatggag acccgcactc     3060

tgactacatc aatgccaact acattgacgg ataccatcga cctcggcact acattgcgac     3120

tcaaggtccg atgcaggaga ctgtaaagga cttttggaga atgatctggc aggagaactc     3180

cgccagcatc gtcatggtca caaacctggt ggaagtgggc agggtgaaat gtgtgcgata     3240

ctggccagat gacacggagg tctacggaga cattaaagtc accctgattg aaacagagcc     3300

cctggcagaa tacgtcatac gcaccttcac agtccagaag aaaggctacc atgagatccg     3360

ggagctccgc ctcttccact tcaccagctg gcctgaccac ggcgttccct gctatgccac     3420

tggccttctg ggcttcgtcc gccaggtcaa gttcctcaac cccccggaag ctgggcccat     3480

agtggtccac tgcagtgctg gggctgggcg gactggctgc ttcattgcca ttgacaccat     3540

gcttgacatg gccgagaatg aaggggtggt ggacatcttc aactgcgtgc gtgagctccg     3600

ggcccaaagg gtcaacctgg tacagacaga ggagcaatat gtgtttgtgc acgatgccat     3660

cctggaagcg tgcctctgtg gcaacactgc catccctgtg tgtgagttcc gttctctcta     3720

ctacaatatc agcaggctgg acccccagac aaactccagc caaatcaaag atgaatttca     3780

gaccctcaac attgtgacac cccgtgtgcg gcccgaggac tgcagcattg ggctcctgcc     3840

ccggaaccat gataagaatc gaagtatgga cgtgctgcct ctggaccgct gcctgccctt     3900

ccttatctca gtggacggag aatccagcaa ttacatcaac gcagcactga tggatagcca     3960

caagcagcct gccgccttcg tggtcaccca gcaccctcta cccaacaccg tggcagactt     4020

ctggaggctg gtgttcgatt acaactgctc ctctgtggtg atgctgaatg agatggacac     4080

tgcccagttc tgtatgcagt actggcctga gaagacctcc gggtgctatg ggcccatcca     4140

ggtggagttc gtctccgcag acatcgacga ggacatcatc cacagaatat tccgcatctg     4200

taacatggcc cggccacagg atggttatcg tatagtccag cacctccagt acattggctg     4260

gcctgcctac cgggacacgc ccccctccaa gcgctctctg ctcaaagtgg tccgacgact     4320

ggagaagtgg caggagcagt atgacgggag ggagggacgt actgtggtcc actgcctaaa     4380

tgggggaggc cgtagtggaa ccttctgtgc catctgcagt gtgtgtgaga tgatccagca     4440

gcaaaacatc attgacgtgt tccacatcgt gaaaacactg cgtaacaaca aatccaacat     4500

ggtggagacc ctggaacagt ataaatttgt atacgaggtg gcactggaat atttaagctc     4560

cttttagctc aatgggatgg ggaacctgcc ggagtccaga ggctgctgtg accaagcccc     4620

cttttgtgtg aatggcagta actgggctca ggagctctga ggtggcaccc tgcctgactc     4680

caaggagaag actggtggcc ctgtgttcca cggggggctc tgcaccttct gaggggtctc     4740

ctgttgccgt gggagatgct gctccaaaag gcccaggctt ccttttcaac ctaaccagcc     4800

acagccaagg gcccaagcag aagtacaccc acaagcaagg ccttggattt ctggctccca     4860

gaccacctgc ttttgttctg agtttgtgga tctcttggca agccaactgt gcaggtgctg     4920

gggagtggga ggctcccctg ccctccttct ccttaggagt ggaggagatg tgtgttctgc     4980

tcctctacgt catggaaaag attgaggctc ttgggggtca ctgctctgct gccccctgca     5040

acctccttca ggggcctctg gcaccagaca tttgcagtct ggaccagtgt gaccttacga     5100

tgttccctag gccacaagag aggcccccca tcctcacacc taacctgcat ggggcttcgc     5160

ccacaaccat tctgtacccc ttccccagcc tgggccttga ccgtccagca ttcactggcc     5220

ggccagctgt gtccacagca gtttttgata aaggtgttct ttgctttttt gtgtggtcag     5280

tgggaggggg tggaactgca gggaacttct ctgctcctcc ttgtctttgt aaaaagggac     5340

cacctccctg gggcagggct tgggctgacc tgtaggatgt aacccctgtg tttctttggt     5400

ggtagctttc tttggaagag acaaacaaga taagatttga ttattttcca aagtgtatgt     5460

gaaaagaaac tttcttttgg agggtgtaaa atcttagtct cttatgtcaa aaagaagggg     5520

gcgggggagt ttgagtatgt acctctaaga caaatctctc gggcctttta ttttttcctg     5580

gcaatgtcct taaaagctcc caccctggga cagcatgcca ctgagcaagg agagatgggt     5640

gagcctgaag atggtccctt tggtttctgg ggcaaataga gcaccagctt tgtgcataat     5700

ttggatgtcc aaatttgaac tccttcctaa agaaacccag cagccacctt gaaaaaggcc     5760

attgtggagc ccattatact ttgatttaaa ataggccaag agaatcaggc ctggagatct     5820

agggtcttgt ccaaagtgtg agtgagtcaa tgagagggaa ccaacatttg ctaagtctct     5880

actgtatgcc agggatcatg cttggcactt tccataggac atttcacaca gtccttagaa     5940

cccccaggag agagctactg acttgttatc atctccattt gatcatctcc tccaatgagg     6000

aaacccacgc accttcctta gtaatgaaat cctgggttcc aaaggggcag gtaatggcaa     6060

tgagacttct ccgtgctgtt ttcttcatct tctctaagcc aagcaattat tttatggagg     6120

gaaaataagg ccagaaactt ctgagcagat aactccacaa atggaaattt agtactttct     6180

tcctgatgcc agttcttctg ggaagcgcag aatttcagat atattttagt aacacattcc     6240

cagctcccca ggaaagccag tctcatctaa tttcttagtc agtaaaaaca attccctgtt     6300

ccttcaggct atgaatggac cagccaggga aactctcgac cttgatctct agccagtgct     6360

taggcccaat atctgacagc ctcaggtggg ctgggaccta ggaagctcca tcttgaaggc     6420

tggtctagcc ccagacaggg catgaggggc agagaattca agaaggtaca gctttggccc     6480

tcaagagccc actgtatgct ggggaaatgg aaccatggtg cagtagtgtg gagtggatga     6540

gtgttccatg agcctaggag caagaaagtc tcttcggcct cgggcttcct ggagaagggg     6600

acgtccattc ctgctgggtc ttaacaagca taaaaaggaa aaaaaggaaa ctcaggcaaa     6660

gggatccata tgtgcaatgg caaagaaatg tgaaaaggca ttgggagaag cagtctgggg     6720

gaggccagcc cagtgcgggc acagcacaac acggggagca gcaagagatg agccagggtc     6780

caggagacag atgcccatcg cgagtacaga ctttgtccta ttggcaacaa ggagtccatg     6840

gagctttaga gagatgcact cagcttcgtg ttggccaaga ctccttctgg gccaatgggg     6900

ctgcctcttt tcctttcatc agacactgtg aaaacattcc cttaagcgtg cactttttaa     6960

tatcacatct atttgtctgt ctgctcattg ttttgttgct ggaactaaat atgcaatgga     7020

tcatgagact cagattctat gagaaaccca gggtctctgc tttaccacgg agcagggtca     7080

ccaacccaga tctccaggcc catgaggatg gaacatgaaa ggagccgaca aaagttgctt     7140

ccattggcat gggctctgga gctgtccaga agtccaggga caccagactt gatcaaggaa     7200

gggctgtcac tttagaggtt caaaaggaag tgcctcaaag caaaggcaag caaaggaacc     7260

ccacgatgaa cttgctcttt tcctttgatg agcctctccc caggtgtatt tcagcagacc     7320

ccggggaccc acccccactg ggcctgctgg cctccctcgg ctccagccca atgccccagc     7380

tggccttccc cagcctgcaa ggagcctgta gcatggcaaa tctgcctgct gtatgctatt     7440

ttcttagatc ttggtacatc cagacaggat gagggtggag ggagagctat ttaacacaaa     7500

tcctaagatt tttttctgct caggaagggg tgaaatagct ggcagataca aaagacagtg     7560

gcttttatca ttttaaatgg taggaattta aggtgtgact tcagggagaa acaaacttgc     7620

aaaaaaaaaa aatctcaggc catgttgggg taacccagca agggccagtg atgatttccc     7680

ccagctcatc cccttatttt cccacaaccc aaccattctc taaagcagga cagtgaatag     7740

gtcttaggcc agtgcacaca ggaagaaatt gaggcttatg gatggggatg acttccctaa     7800

gatcccatgg gacaaggatg tggcaaggct tggatgagat ggggcaccag tgcccaggaa     7860

tttgaacatt ttcctttacc caggaaatct ccggagccaa caccaccacc cccagggggt     7920

ctccccaccc caccccattt acagggtgag ctcagcctgt catgagcaga ggaaaatatt     7980

attaatgctc tctgagtctt tacaacagga gctcttacct catagatgtg ggctctgttt     8040

ggggaagatg caaggaagta atgagaagcc caggaaattt ctccacctgt gtttatggcc     8100

taaatagctt caggatgtat cttagctgca ctccaacatt gcatcctttc tggggtgaag     8160

aatctgggcc aaccaggggt ccttgggcct ctagaaggcc acagtaggcc tctctttgtg     8220

ggaatggaag gggacagttt gcttttagtg ctggccctct ctgtgggtgt ggcctgcaaa     8280

ggaaccaaca gaccctatgc tggggactct aacatgtgag ctcattaaat tcttccagca     8340

ttctaaagga gggtttgtga ttgtcaccat ttactgatga ggaaactaag gctcctaggg     8400

gagaaatcac ttgcccacag ttccacagct agtgagtgaa tgaaccagga tttaaaccgg     8460

ttttttctca ctacagagac aatatttttc caccattgta tctcacattt ttcccaggag     8520

gttacccata acagaagaga ctagagtgga acagatacgt cagtggataa agctcaaagc     8580

aaacaacagt aagcttaaaa ttccttcata gtctcatgtt ttacgttcac aattcatgca     8640

aaatttgcat tccactttct gatttagcct tgttggtttt aatatgactc tatgaatatt     8700

tcaaaaaaaa atgtgctctg ttcctcatgt tgttctgttc tgttcacccc gctatgacgg     8760

accctaggtc agctggtctt cagcttgacc ctagaattga ctctaggagc agtgaccctg     8820

ctgcctccca gagccagtta taggctcaag atcaagacca actgaccttc tcctaggcag     8880

ctcctttggt gtgtgggtgc tctgacctca ctgttcatga ggggacctca actaaggcat     8940

cttccagttg ggtgctggaa ggaacccatt aactcacact agaatgatga ggatttgctc     9000

atctggcgtg gagaaggatg agcccacaaa accctaaagg gaaaagagaa gctggacaca     9060

gctgtactca gcagattcct gaatgctagg ctggaaagtg gtgcctgttg tccaagtgga     9120

gtcacatggt tgctaatgtg ggcaagtctg aggacacact tcatgagcag ctggggtctg     9180

gaaggctcct cactttaccc tagccacaca taattactgg gtgcctacag cacctagcac     9240

cttggagggg gcactattag gaaatcgaga ttactatggc acaattaatt cctgggtaag     9300

gcatggggtt gtggtggaca gagctcagtc tttagtttga acgaaaacat acatacatga     9360

aaaacataca tgaaaaaagg accctcatca acattagaag gggtagattt ggagcacttt     9420

aggcaggaaa acaggaacgc aaggccagga aactggaacc cagtgaatac tcagaaccga     9480

ggatgcagat gacttattta gcaaaatggt cacttctgtg acatagctgg agaaaggatg     9540

ggtaacagct tgccagagcc acttggaaca agggcaaatc tcagtgtctg gggcaaaaga     9600

tgatgcattt ccctctgacc catcatgttt attcatcctc cactccccat tgccacacta     9660

gctcttgctg taagtcctca ccaggatcta catttcctcg tcgctggtgg gaacccctta     9720

gagtacatag aggtatcagt ccagtaagac tgctctacac aacagaagtg aggcccaggg     9780

agtagcagcc aggcccttat cctgttacct ctgcaggagt gactgcccaa cccagatcca     9840

gagacattga aggaaatgat aattccttgg tacctcactg ccttgggaca aaatgaagaa     9900

agccaccctt ccttaggctg cagcttgcca ctcctgggct gggtaaacag gtcatcagca     9960

ccaagctcaa ccaggagtaa cactctggaa gacatgggtg agcccaagag gaagcatgaa    10020

caggacgctg ttcctaagtc atgtcaacag gttgtgctgg gccaggatcc ccagggaaaa    10080

aaatggtcaa cccaactgga gggtaggtta gaagaaaaaa aacataaacg tggatagtca    10140

tgtcatctca aatccctgac ttggcttccc cattacttaa cagtctgagc tccttcttag    10200

cctgtgacca gcttcaaatc acagccaagt aaaacaagga aataggaaaa gtaaatccaa    10260

ctagaagaga caagctgaga ttcagatttg tttactcctc ccatgcaaag tttccctgtt    10320

ggaggttttc catgtataca tgtctagaag tgatagaatg caaggccttg gctttgtctt    10380

gcagggatct gcctttgagg tcatagactg aacagcaggg agagaggtta gtggtggagt    10440

gtggggggag ctgttctagc tccagtttct tctgacacat ttttcaggat catggatctg    10500

atcctccgaa gcacagcaga gatatctaag ccatatttgt gcacatgagc agactcttct    10560

agttttttag taaccaggga tgggcttttg catggcactg actatagaga tgtcttgtag    10620

agatcaagcc agtcttttgc atcccacctg cccacctcca gaagagatgg gaaaaggtca    10680

tcaaagggca ttcaccaact gaaatccact catgaatgtt aggtctctaa aaggaggcat    10740

caacactcac aatggtagcc tccaaaccta gcatcccacc tatctaagag ctcaggggtg    10800

gtccactggg gcagatacaa gggaagtgca agggctcagg atgaaagaaa atctattggg    10860

aagagtttta ggggcttgat cattatgggg cttccttcta tatctgagaa ctgctctggg    10920

tggtgagatg tggactctga tccttaattg gaatgttcgg agaatgagtg tctggtggcc    10980

ttgaagtgtt ggacagaaaa gtatcagtat aaaagcctgg agctcagggt aattaatgta    11040

gttcatggtt ccttagtgag caggactctt ggatgtggag gagaaagggt cataggaagt    11100

aaaccaccaa aattacaaaa ttgagtctct gtacaattac ttcagtgcct ttgggcttat    11160

gaatacaaat cagtgggcct tctctatgat ggtccaacaa actctcagtg tccaccctgt    11220

ccctgtatct cccatggaag atgaataatg tcaggtgttc tttgggtcaa aggccccagg    11280

gcagtctgga ggcttagagg gcagagtggt gtcattccat gtaaagttag gcttctgagg    11340

ggtcaggcag aatatggtgt ccatatcttc catagctctg cagattcttg gatgaagtca    11400

agcacagttt gctagaccca ggtcactcct ctgagtataa ctaggaccca tgagtgaaac    11460

ttaatagctg taaggaagaa cctgctgtct gccagagagg ataagctgcc catctcagca    11520

gctgtctaaa agaaggcagg tgtctcttta aagggaagag aagcattggt gaaatggatt    11580

tcaggtcact tccattccag atgggtgaga tcttgtggag ctgggatcat gtttgaactc    11640

attcatacct gtagagcacg aatccaagta gattgtgttt ggtctgtaca ggctgaagcc    11700

ccctgctctc ccacccaagt gcccccactg agcaggccaa catgctgttg tggccacata    11760

tactgggctg atccaggctg gttatcacca aacagcaaac catagggaac agctgctttg    11820

ccatagaccc aatacccatg tagatctctc atgagagcag ccataactca gacccactga    11880

ccaacagggc catgagtgac agccagaacc agtgaaggtc caagtaggac acagagcagg    11940

gcttttctta ccatacacat tatctccaga ggttatttct accccactcc ctattcaagg    12000

cctgttggag cacactgcaa aagcaaaagc acagtaactc aatttacaca tgattataat    12060

catttccagt gcacacattt catcaccagg tggatcctga gctagcccat gtaaatccgg    12120

gttaacccat attggtaatc atactcaaaa gcacttttca ccctacattc tactagccaa    12180

tcaaagacaa agagttgtgg cctctaccat tgccttggct tctggacacc ctcacaagct    12240

atcccaaggt tcccgctcaa ctccagggag gctgacatct tcacatccac tgggcatata    12300

atattgcatg agaccaaagt ctccacactc tttgcagcct cctccatgaa tcccaatggc    12360

ctgcacttgt acagtttggg tgtttgatag ataaagcacg tatgagaaga gaaaacaaaa    12420

taaatcaact ttttaaaaaa gccagcactg tgctgtcaat gttttttttt tcttttcaat    12480

tctagctcag aaaagcagaa ggtaaataat gtcaggtcaa tgaatatcag atatattttt    12540

tgactgtaca ttacagtgaa gtgtaatctt tttacacctg caagtccatc ttatttattc    12600

ttgtaaatgt tccctgacaa tgtttgtaat atggctgtgt taaaaaatct atacaataaa    12660

gctgtgaccc tgagattcat gttttcctaa gataaaaaaa a                        12701


<210> 22
<211> 1460
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..1460
<223> /mol_type="protein"
      /note="PTPRT"
      /organism="Homo sapiens"

<400> 22
Met Ala Ser Leu Ala Ala Leu Ala Leu Ser Leu Leu Leu Arg Leu Gln 
1               5                   10                   15    
Leu Pro Pro Leu Pro Gly Ala Arg Ala Gln Ser Ala Ala Gly Gly Cys 
            20                   25                  30        
Ser Phe Asp Glu His Tyr Ser Asn Cys Gly Tyr Ser Val Ala Leu Gly 
        35                   40                  45            
Thr Asn Gly Phe Thr Trp Glu Gln Ile Asn Thr Trp Glu Lys Pro Met 
    50                   55                  60                
Leu Asp Gln Ala Val Pro Thr Gly Ser Phe Met Met Val Asn Ser Ser 
65                   70                  75                  80
Gly Arg Ala Ser Gly Gln Lys Ala His Leu Leu Leu Pro Thr Leu Lys 
                85                   90                  95    
Glu Asn Asp Thr His Cys Ile Asp Phe His Tyr Tyr Phe Ser Ser Arg 
            100                  105                110        
Asp Arg Ser Ser Pro Gly Ala Leu Asn Val Tyr Val Lys Val Asn Gly 
        115                  120                125            
Gly Pro Gln Gly Asn Pro Val Trp Asn Val Ser Gly Val Val Thr Glu 
    130                  135                140                
Gly Trp Val Lys Ala Glu Leu Ala Ile Ser Thr Phe Trp Pro His Phe 
145                  150                155                  160
Tyr Gln Val Ile Phe Glu Ser Val Ser Leu Lys Gly His Pro Gly Tyr 
                165                  170                175    
Ile Ala Val Asp Glu Val Arg Val Leu Ala His Pro Cys Arg Lys Ala 
            180                  185                190        
Pro His Phe Leu Arg Leu Gln Asn Val Glu Val Asn Val Gly Gln Asn 
        195                  200                205            
Ala Thr Phe Gln Cys Ile Ala Gly Gly Lys Trp Ser Gln His Asp Lys 
    210                  215                220                
Leu Trp Leu Gln Gln Trp Asn Gly Arg Asp Thr Ala Leu Met Val Thr 
225                  230                235                  240
Arg Val Val Asn His Arg Arg Phe Ser Ala Thr Val Ser Val Ala Asp 
                245                  250                255    
Thr Ala Gln Arg Ser Val Ser Lys Tyr Arg Cys Val Ile Arg Ser Asp 
            260                  265                270        
Gly Gly Ser Gly Val Ser Asn Tyr Ala Glu Leu Ile Val Lys Glu Pro 
        275                  280                285            
Pro Thr Pro Ile Ala Pro Pro Glu Leu Leu Ala Val Gly Ala Thr Tyr 
    290                  295                300                
Leu Trp Ile Lys Pro Asn Ala Asn Ser Ile Ile Gly Asp Gly Pro Ile 
305                  310                315                  320
Ile Leu Lys Glu Val Glu Tyr Arg Thr Thr Thr Gly Thr Trp Ala Glu 
                325                  330                335    
Thr His Ile Val Asp Ser Pro Asn Tyr Lys Leu Trp His Leu Asp Pro 
            340                  345                350        
Asp Val Glu Tyr Glu Ile Arg Val Leu Leu Thr Arg Pro Gly Glu Gly 
        355                  360                365            
Gly Thr Gly Pro Pro Gly Pro Pro Leu Thr Thr Arg Thr Lys Cys Ala 
    370                  375                380                
Asp Pro Val His Gly Pro Gln Asn Val Glu Ile Val Asp Ile Arg Ala 
385                  390                395                  400
Arg Gln Leu Thr Leu Gln Trp Glu Pro Phe Gly Tyr Ala Val Thr Arg 
                405                  410                415    
Cys His Ser Tyr Asn Leu Thr Val Gln Tyr Gln Tyr Val Phe Asn Gln 
            420                  425                430        
Gln Gln Tyr Glu Ala Glu Glu Val Ile Gln Thr Ser Ser His Tyr Thr 
        435                  440                445            
Leu Arg Gly Leu Arg Pro Phe Met Thr Ile Arg Leu Arg Leu Leu Leu 
    450                  455                460                
Ser Asn Pro Glu Gly Arg Met Glu Ser Glu Glu Leu Val Val Gln Thr 
465                  470                475                  480
Glu Glu Asp Val Pro Gly Ala Val Pro Leu Glu Ser Ile Gln Gly Gly 
                485                  490                495    
Pro Phe Glu Glu Lys Ile Tyr Ile Gln Trp Lys Pro Pro Asn Glu Thr 
            500                  505                510        
Asn Gly Val Ile Thr Leu Tyr Glu Ile Asn Tyr Lys Ala Val Gly Ser 
        515                  520                525            
Leu Asp Pro Ser Ala Asp Leu Ser Ser Gln Arg Gly Lys Val Phe Lys 
    530                  535                540                
Leu Arg Asn Glu Thr His His Leu Phe Val Gly Leu Tyr Pro Gly Thr 
545                  550                555                  560
Thr Tyr Ser Phe Thr Ile Lys Ala Ser Thr Ala Lys Gly Phe Gly Pro 
                565                  570                575    
Pro Val Thr Thr Arg Ile Ala Thr Lys Ile Ser Ala Pro Ser Met Pro 
            580                  585                590        
Glu Tyr Asp Thr Asp Thr Pro Leu Asn Glu Thr Asp Thr Thr Ile Thr 
        595                  600                605            
Val Met Leu Lys Pro Ala Gln Ser Arg Gly Ala Pro Val Ser Val Tyr 
    610                  615                620                
Gln Leu Val Val Lys Glu Glu Arg Leu Gln Lys Ser Arg Arg Ala Ala 
625                  630                635                  640
Asp Ile Ile Glu Cys Phe Ser Val Pro Val Ser Tyr Arg Asn Ala Ser 
                645                  650                655    
Ser Leu Asp Ser Leu His Tyr Phe Ala Ala Glu Leu Lys Pro Ala Asn 
            660                  665                670        
Leu Pro Val Thr Gln Pro Phe Thr Val Gly Asp Asn Lys Thr Tyr Asn 
        675                  680                685            
Gly Tyr Trp Asn Pro Pro Leu Ser Pro Leu Lys Ser Tyr Ser Ile Tyr 
    690                  695                700                
Phe Gln Ala Leu Ser Lys Ala Asn Gly Glu Thr Lys Ile Asn Cys Val 
705                  710                715                  720
Arg Leu Ala Thr Lys Ala Pro Met Gly Ser Ala Gln Val Thr Pro Gly 
                725                  730                735    
Thr Pro Leu Cys Leu Leu Thr Thr Gly Ala Ser Thr Gln Asn Ser Asn 
            740                  745                750        
Thr Val Glu Pro Glu Lys Gln Val Asp Asn Thr Val Lys Met Ala Gly 
        755                  760                765            
Val Ile Ala Gly Leu Leu Met Phe Ile Ile Ile Leu Leu Gly Val Met 
    770                  775                780                
Leu Thr Ile Lys Arg Arg Arg Asn Ala Tyr Ser Tyr Ser Tyr Tyr Leu 
785                  790                795                  800
Lys Leu Ala Lys Lys Gln Lys Glu Thr Gln Ser Gly Ala Gln Arg Glu 
                805                  810                815    
Met Gly Pro Val Ala Ser Ala Asp Lys Pro Thr Thr Lys Leu Ser Ala 
            820                  825                830        
Ser Arg Asn Asp Glu Gly Phe Ser Ser Ser Ser Gln Asp Val Asn Gly 
        835                  840                845            
Phe Thr Asp Gly Ser Arg Gly Glu Leu Ser Gln Pro Thr Leu Thr Ile 
    850                  855                860                
Gln Thr His Pro Tyr Arg Thr Cys Asp Pro Val Glu Met Ser Tyr Pro 
865                  870                875                  880
Arg Asp Gln Phe Gln Pro Ala Ile Arg Val Ala Asp Leu Leu Gln His 
                885                  890                895    
Ile Thr Gln Met Lys Arg Gly Gln Gly Tyr Gly Phe Lys Glu Glu Tyr 
            900                  905                910        
Glu Ala Leu Pro Glu Gly Gln Thr Ala Ser Trp Asp Thr Ala Lys Glu 
        915                  920                925            
Asp Glu Asn Arg Asn Lys Asn Arg Tyr Gly Asn Ile Ile Ser Tyr Asp 
    930                  935                940                
His Ser Arg Val Arg Leu Leu Val Leu Asp Gly Asp Pro His Ser Asp 
945                  950                955                  960
Tyr Ile Asn Ala Asn Tyr Ile Asp Gly Tyr His Arg Pro Arg His Tyr 
                965                  970                975    
Ile Ala Thr Gln Gly Pro Met Gln Glu Thr Val Lys Asp Phe Trp Arg 
            980                  985                990        
Met Ile Trp Gln Glu Asn Ser Ala Ser Ile Val Met Val Thr Asn Leu 
        995                  1000                1005            
Val Glu Val Gly Arg Val Lys Cys Val Arg Tyr Trp Pro Asp Asp Thr 
    1010                1015                1020                
Glu Val Tyr Gly Asp Ile Lys Val Thr Leu Ile Glu Thr Glu Pro Leu 
1025                1030                1035                1040
Ala Glu Tyr Val Ile Arg Thr Phe Thr Val Gln Lys Lys Gly Tyr His 
                1045                1050                1055    
Glu Ile Arg Glu Leu Arg Leu Phe His Phe Thr Ser Trp Pro Asp His 
            1060                1065                1070        
Gly Val Pro Cys Tyr Ala Thr Gly Leu Leu Gly Phe Val Arg Gln Val 
        1075                1080                1085            
Lys Phe Leu Asn Pro Pro Glu Ala Gly Pro Ile Val Val His Cys Ser 
    1090                1095                1100                
Ala Gly Ala Gly Arg Thr Gly Cys Phe Ile Ala Ile Asp Thr Met Leu 
1105                1110                1115                1120
Asp Met Ala Glu Asn Glu Gly Val Val Asp Ile Phe Asn Cys Val Arg 
                1125                1130                1135    
Glu Leu Arg Ala Gln Arg Val Asn Leu Val Gln Thr Glu Glu Gln Tyr 
            1140                1145                1150        
Val Phe Val His Asp Ala Ile Leu Glu Ala Cys Leu Cys Gly Asn Thr 
        1155                1160                1165            
Ala Ile Pro Val Cys Glu Phe Arg Ser Leu Tyr Tyr Asn Ile Ser Arg 
    1170                1175                1180                
Leu Asp Pro Gln Thr Asn Ser Ser Gln Ile Lys Asp Glu Phe Gln Thr 
1185                1190                1195                1200
Leu Asn Ile Val Thr Pro Arg Val Arg Pro Glu Asp Cys Ser Ile Gly 
                1205                1210                1215    
Leu Leu Pro Arg Asn His Asp Lys Asn Arg Ser Met Asp Val Leu Pro 
            1220                1225                1230        
Leu Asp Arg Cys Leu Pro Phe Leu Ile Ser Val Asp Gly Glu Ser Ser 
        1235                1240                1245            
Asn Tyr Ile Asn Ala Ala Leu Met Asp Ser His Lys Gln Pro Ala Ala 
    1250                1255                1260                
Phe Val Val Thr Gln His Pro Leu Pro Asn Thr Val Ala Asp Phe Trp 
1265                1270                1275                1280
Arg Leu Val Phe Asp Tyr Asn Cys Ser Ser Val Val Met Leu Asn Glu 
                1285                1290                1295    
Met Asp Thr Ala Gln Phe Cys Met Gln Tyr Trp Pro Glu Lys Thr Ser 
            1300                1305                1310        
Gly Cys Tyr Gly Pro Ile Gln Val Glu Phe Val Ser Ala Asp Ile Asp 
        1315                1320                1325            
Glu Asp Ile Ile His Arg Ile Phe Arg Ile Cys Asn Met Ala Arg Pro 
    1330                1335                1340                
Gln Asp Gly Tyr Arg Ile Val Gln His Leu Gln Tyr Ile Gly Trp Pro 
1345                1350                1355                1360
Ala Tyr Arg Asp Thr Pro Pro Ser Lys Arg Ser Leu Leu Lys Val Val 
                1365                1370                1375    
Arg Arg Leu Glu Lys Trp Gln Glu Gln Tyr Asp Gly Arg Glu Gly Arg 
            1380                1385                1390        
Thr Val Val His Cys Leu Asn Gly Gly Gly Arg Ser Gly Thr Phe Cys 
        1395                1400                1405            
Ala Ile Cys Ser Val Cys Glu Met Ile Gln Gln Gln Asn Ile Ile Asp 
    1410                1415                1420                
Val Phe His Ile Val Lys Thr Leu Arg Asn Asn Lys Ser Asn Met Val 
1425                1430                1435                1440
Glu Thr Leu Glu Gln Tyr Lys Phe Val Tyr Glu Val Ala Leu Glu Tyr 
                1445                1450                1455    
Leu Ser Ser Phe 
            1460

<210> 23
<211> 4510
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..4510
<223> /mol_type="DNA"
      /note="TDRD1"
      /organism="Homo sapiens"

<400> 23
gctgaggcca ggagggcgca ctggggattg gaggcgaggg aagtgcaggg cgcatcccag       60

gcggcagggc tcccagcatc ggcagtcgcc atcaccgcca gaccgcagag acaggttcgg      120

atccgcggtc ctcttgcctc tttccaggcc tcgatgagtg ttaaatcgcc atttaatgtg      180

atgtcaagaa ataatttgga agcacctcct tgtaagatga cagagccatt taattttgag      240

aaaaatgaaa acaagcttcc accacatgag tctttaagaa gtcctggaac acttcctaac      300

caccctaatt tcaggctgaa aagctcagag aatggaaata aaaagaacaa ttttttgctt      360

tgtgagcaaa ccaaacaata tttggctagt caggaagaca attcagtttc ttcaaacccg      420

aatggcatca acggagaagt agttggctcc aaaggagaca ggaaaaaatt gccagcagga      480

aactcagtgt caccaccaag tgctgaaagt aattcaccac ccaaagaagt gaatattaag      540

cctggaaata atgtacgtcc tgcaaaatca aaaaaactaa acaagttggt cgagaattcc      600

ttgtccataa gtaatccagg gctcttcacc tccttaggac ctcctcttcg gtccacaact      660

tgccatcgct gtggcctatt tggatcgctg aggtgctctc agtgcaagca gacctactat      720

tgctccacag catgtcaaag aagagactgg tctgcacaca gcatcgtgtg caggcctgtt      780

cagccaaatt tccacaaact tgaaaataaa tcatctattg aaacaaagga tgtggaggta      840

aacaataaga gtgactgtcc acttggagtt actaaggaaa tagccatttg ggctgagaga      900

ataatgtttt ctgatttgag aagtctacaa ctcaagaaaa ccatggaaat aaagggtacg      960

gttaccgaat tcaaacaccc aggggacttc tacgtgcagt tatattcttc agaagtttta     1020

gaatacatga accaactctc tgccagctta aaagaaacat atgcaaatgt gcatgaaaaa     1080

gactatattc ctgttaaggg ggaagtttgt attgccaagt acactgttga tcagacctgg     1140

aacagagcaa tcatacaaaa cgttgatgtg cagcaaaaga aggcacatgt cttatatatt     1200

gattatggaa atgaagaaat aattccatta aacagaattt accacctcaa caggaacatt     1260

gacttgtttc ctccttgtgc cataaagtgc tttgtagcca atgttatccc agcagaaggg     1320

aattggagca gtgattgtat caaagctact aaaccactgt taatggagca gtactgctcc     1380

ataaagattg tcgacatctt ggaagaggaa gtggttacct ttgctgtaga agttgagctg     1440

ccaaattcag gaaaactttt agaccatgtg cttatagaaa tgggatatgg cttgaaaccc     1500

agtggacaag attctaagaa ggaaaatgca gatcaaagtg atcctgaaga tgttggaaaa     1560

atgacaactg aaaacaacat tgtcgtagac aaaagtgacc taatcccaaa agtgttaact     1620

ttgaatgtag gtgatgagtt ttgtggtgtg gttgcccaca ttcaaacacc agaagacttc     1680

ttttgtcaac aactgcaaag tggccgaaag cttgctgaac ttcaggcatc ccttagcaag     1740

tactgtgatc agttgcctcc acgctctgat ttttatccag ccattggtga tatatgttgt     1800

gctcagttct cagaggatga tcagtggtac cgtgcctctg ttttggctta cgcttctgaa     1860

gaatctgtac tggtcggata tgtagattat ggaaactttg aaatccttag tttgatgaga     1920

ctttgtccca taatcccaaa gttgttggaa ttgccaatgc aagctataaa gtgtgtacta     1980

gcaggagtaa agccatcatt aggaatttgg actccagaag ctatttgtct catgaaaaaa     2040

cttgtacaga acaaaataat cacagtgaaa gtggtggaca agttggaaaa cagttccctg     2100

gtggagctta ttgataaatc cgagacgcct catgtcagtg ttagcaaagt tctcctagat     2160

gcaggctttg ctgtgggaga acagagtatg gtgacagata aacccagtga cgtgaaagaa     2220

accagtgttc ccttgggtgt ggaaggaaaa gtaaatccat tggagtggac atgggttgaa     2280

cttggtgttg accaaacagt agatgttgtg gtctgtgtga tatatagtcc tggagaattt     2340

tattgccatg tgcttaaaga ggatgcttta aagaaactca atgatttgaa caagtcatta     2400

gcagaacact gccagcagaa gttacctaat ggtttcaagg cagagatagg acaaccttgt     2460

tgtgcttttt ttgcaggtga tggtagttgg tatcgtgctt tagtcaagga aatcttacca     2520

aatggacatg ttaaagtaca ttttgtggat tatggaaaca tcgaagaagt tactgcagat     2580

gaactccgaa tgatatcatc aacattttta aaccttccct ttcagggaat acggtgccag     2640

ttagcagata tacagtctag aaacaaacat tggtctgaag aagccataac aagattccag     2700

atgtgtgttg ctgggataaa attgcaagcc agagtggttg aagtcactga aaatgggata     2760

ggagttgaac tcaccgatct ctccacttgt tatcccagaa taattagtga tgttctgatt     2820

gatgaacatc tggttttaaa atctgcttca ccacataaag acttaccaaa tgacagactt     2880

gttaataaac atgagcttca agttcatgta cagggacttc aagctacctc ttcagctgag     2940

caatggaaga cgatagaatt gccagtggat aaaactatac aagcaaatgt attagaaatc     3000

ataagcccaa acttgtttta tgctctacca aaagggatgc cagaaaatca ggaaaagctg     3060

tgcatgttga cagctgaatt attagaatac tgcaatgctc cgaaaagtcg accaccctat     3120

agaccaagaa ttggagacgc atgctgtgcc aaatacacaa gtgatgattt ttggtatcgt     3180

gcagttgttc tggggacatc agacactgat gtggaagtgc tctatgcaga ctatggaaac     3240

attgaaaccc tgcctctttg cagagtgcaa ccaatcacct ctagccacct ggcgcttcct     3300

ttccaaatta ttagatgttc acttgaagga ttaatggaat tgaatggaag ctcttctcaa     3360

ttaataataa tgctattaaa aaatttcatg ttgaatcaga atgtaatgct ttctgtgaaa     3420

ggaattacaa agaatgtcca tacagtgtca gttgagaaat gttctgagaa tgggactgtc     3480

gatgtagctg ataagctagt gacatttggt ctggcaaaaa acatcacacc tcaaaggcag     3540

agtgctttaa atacagaaaa gatgtatagg atgaattgct gctgcacaga gttacagaaa     3600

caagttgaaa aacatgaaca tattcttctc ttcctcttaa acaattcaac caatcaaaat     3660

aaatttattg aaatgaaaaa actgttaaaa aaaacagcat ctcttggagg taaaccctta     3720

tgagacagga aacagcaaag gctagcttta ggagagaaag tacagcacct ggtgttttta     3780

tttatgagaa ccttttcttt gtccactttc tctgtaatga ccttctatcc ctccgttttt     3840

gcctgcctgc cattctccta ttaggttggt ggtttttatt ttcctctaag ttccttccac     3900

caaataaata ttacgtaaaa aattcatacc aaatcaatga gaatactggc aaggaataca     3960

tagggacttt ctgctatata tgtaactttt tattacttaa aggtaccgaa ggaaggccag     4020

gtgcagtggc tcacgcccag cactttggga ggctgaggtg ggaggatccc ttgaggccag     4080

gagttcaagg ttacagtgag ctatgatagt gccactgcac tccagcctgg gtgacagatt     4140

ttgtcttaaa aaaaaaaaaa aaaaagttga tatgagtttt attttctgtc cgtttgaaat     4200

attttgtaat attccctgca ttctctgtcg tctgcctctt ccacataatg tcctttgctt     4260

tcatgtttgt tatcttcttt ttctgttcac tcagaggtca tcaatttctt tctctccgtc     4320

cttaattgga ttatttttct tttggccttt gggcacagag tctgacctct ggaccactct     4380

aactggagaa ggaactttat gttccctctc ctgctgtgtc cacaacctta gaaatctgta     4440

gctagatttt tgttgttata gatagaattt actgtttctg aaacccaaat acagttatca     4500

gtttaaggtt                                                            4510


<210> 24
<211> 1189
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..1189
<223> /mol_type="protein"
      /note="TDRD1"
      /organism="Homo sapiens"

<400> 24
Met Ser Val Lys Ser Pro Phe Asn Val Met Ser Arg Asn Asn Leu Glu 
1               5                   10                   15    
Ala Pro Pro Cys Lys Met Thr Glu Pro Phe Asn Phe Glu Lys Asn Glu 
            20                   25                  30        
Asn Lys Leu Pro Pro His Glu Ser Leu Arg Ser Pro Gly Thr Leu Pro 
        35                   40                  45            
Asn His Pro Asn Phe Arg Leu Lys Ser Ser Glu Asn Gly Asn Lys Lys 
    50                   55                  60                
Asn Asn Phe Leu Leu Cys Glu Gln Thr Lys Gln Tyr Leu Ala Ser Gln 
65                   70                  75                  80
Glu Asp Asn Ser Val Ser Ser Asn Pro Asn Gly Ile Asn Gly Glu Val 
                85                   90                  95    
Val Gly Ser Lys Gly Asp Arg Lys Lys Leu Pro Ala Gly Asn Ser Val 
            100                  105                110        
Ser Pro Pro Ser Ala Glu Ser Asn Ser Pro Pro Lys Glu Val Asn Ile 
        115                  120                125            
Lys Pro Gly Asn Asn Val Arg Pro Ala Lys Ser Lys Lys Leu Asn Lys 
    130                  135                140                
Leu Val Glu Asn Ser Leu Ser Ile Ser Asn Pro Gly Leu Phe Thr Ser 
145                  150                155                  160
Leu Gly Pro Pro Leu Arg Ser Thr Thr Cys His Arg Cys Gly Leu Phe 
                165                  170                175    
Gly Ser Leu Arg Cys Ser Gln Cys Lys Gln Thr Tyr Tyr Cys Ser Thr 
            180                  185                190        
Ala Cys Gln Arg Arg Asp Trp Ser Ala His Ser Ile Val Cys Arg Pro 
        195                  200                205            
Val Gln Pro Asn Phe His Lys Leu Glu Asn Lys Ser Ser Ile Glu Thr 
    210                  215                220                
Lys Asp Val Glu Val Asn Asn Lys Ser Asp Cys Pro Leu Gly Val Thr 
225                  230                235                  240
Lys Glu Ile Ala Ile Trp Ala Glu Arg Ile Met Phe Ser Asp Leu Arg 
                245                  250                255    
Ser Leu Gln Leu Lys Lys Thr Met Glu Ile Lys Gly Thr Val Thr Glu 
            260                  265                270        
Phe Lys His Pro Gly Asp Phe Tyr Val Gln Leu Tyr Ser Ser Glu Val 
        275                  280                285            
Leu Glu Tyr Met Asn Gln Leu Ser Ala Ser Leu Lys Glu Thr Tyr Ala 
    290                  295                300                
Asn Val His Glu Lys Asp Tyr Ile Pro Val Lys Gly Glu Val Cys Ile 
305                  310                315                  320
Ala Lys Tyr Thr Val Asp Gln Thr Trp Asn Arg Ala Ile Ile Gln Asn 
                325                  330                335    
Val Asp Val Gln Gln Lys Lys Ala His Val Leu Tyr Ile Asp Tyr Gly 
            340                  345                350        
Asn Glu Glu Ile Ile Pro Leu Asn Arg Ile Tyr His Leu Asn Arg Asn 
        355                  360                365            
Ile Asp Leu Phe Pro Pro Cys Ala Ile Lys Cys Phe Val Ala Asn Val 
    370                  375                380                
Ile Pro Ala Glu Gly Asn Trp Ser Ser Asp Cys Ile Lys Ala Thr Lys 
385                  390                395                  400
Pro Leu Leu Met Glu Gln Tyr Cys Ser Ile Lys Ile Val Asp Ile Leu 
                405                  410                415    
Glu Glu Glu Val Val Thr Phe Ala Val Glu Val Glu Leu Pro Asn Ser 
            420                  425                430        
Gly Lys Leu Leu Asp His Val Leu Ile Glu Met Gly Tyr Gly Leu Lys 
        435                  440                445            
Pro Ser Gly Gln Asp Ser Lys Lys Glu Asn Ala Asp Gln Ser Asp Pro 
    450                  455                460                
Glu Asp Val Gly Lys Met Thr Thr Glu Asn Asn Ile Val Val Asp Lys 
465                  470                475                  480
Ser Asp Leu Ile Pro Lys Val Leu Thr Leu Asn Val Gly Asp Glu Phe 
                485                  490                495    
Cys Gly Val Val Ala His Ile Gln Thr Pro Glu Asp Phe Phe Cys Gln 
            500                  505                510        
Gln Leu Gln Ser Gly Arg Lys Leu Ala Glu Leu Gln Ala Ser Leu Ser 
        515                  520                525            
Lys Tyr Cys Asp Gln Leu Pro Pro Arg Ser Asp Phe Tyr Pro Ala Ile 
    530                  535                540                
Gly Asp Ile Cys Cys Ala Gln Phe Ser Glu Asp Asp Gln Trp Tyr Arg 
545                  550                555                  560
Ala Ser Val Leu Ala Tyr Ala Ser Glu Glu Ser Val Leu Val Gly Tyr 
                565                  570                575    
Val Asp Tyr Gly Asn Phe Glu Ile Leu Ser Leu Met Arg Leu Cys Pro 
            580                  585                590        
Ile Ile Pro Lys Leu Leu Glu Leu Pro Met Gln Ala Ile Lys Cys Val 
        595                  600                605            
Leu Ala Gly Val Lys Pro Ser Leu Gly Ile Trp Thr Pro Glu Ala Ile 
    610                  615                620                
Cys Leu Met Lys Lys Leu Val Gln Asn Lys Ile Ile Thr Val Lys Val 
625                  630                635                  640
Val Asp Lys Leu Glu Asn Ser Ser Leu Val Glu Leu Ile Asp Lys Ser 
                645                  650                655    
Glu Thr Pro His Val Ser Val Ser Lys Val Leu Leu Asp Ala Gly Phe 
            660                  665                670        
Ala Val Gly Glu Gln Ser Met Val Thr Asp Lys Pro Ser Asp Val Lys 
        675                  680                685            
Glu Thr Ser Val Pro Leu Gly Val Glu Gly Lys Val Asn Pro Leu Glu 
    690                  695                700                
Trp Thr Trp Val Glu Leu Gly Val Asp Gln Thr Val Asp Val Val Val 
705                  710                715                  720
Cys Val Ile Tyr Ser Pro Gly Glu Phe Tyr Cys His Val Leu Lys Glu 
                725                  730                735    
Asp Ala Leu Lys Lys Leu Asn Asp Leu Asn Lys Ser Leu Ala Glu His 
            740                  745                750        
Cys Gln Gln Lys Leu Pro Asn Gly Phe Lys Ala Glu Ile Gly Gln Pro 
        755                  760                765            
Cys Cys Ala Phe Phe Ala Gly Asp Gly Ser Trp Tyr Arg Ala Leu Val 
    770                  775                780                
Lys Glu Ile Leu Pro Asn Gly His Val Lys Val His Phe Val Asp Tyr 
785                  790                795                  800
Gly Asn Ile Glu Glu Val Thr Ala Asp Glu Leu Arg Met Ile Ser Ser 
                805                  810                815    
Thr Phe Leu Asn Leu Pro Phe Gln Gly Ile Arg Cys Gln Leu Ala Asp 
            820                  825                830        
Ile Gln Ser Arg Asn Lys His Trp Ser Glu Glu Ala Ile Thr Arg Phe 
        835                  840                845            
Gln Met Cys Val Ala Gly Ile Lys Leu Gln Ala Arg Val Val Glu Val 
    850                  855                860                
Thr Glu Asn Gly Ile Gly Val Glu Leu Thr Asp Leu Ser Thr Cys Tyr 
865                  870                875                  880
Pro Arg Ile Ile Ser Asp Val Leu Ile Asp Glu His Leu Val Leu Lys 
                885                  890                895    
Ser Ala Ser Pro His Lys Asp Leu Pro Asn Asp Arg Leu Val Asn Lys 
            900                  905                910        
His Glu Leu Gln Val His Val Gln Gly Leu Gln Ala Thr Ser Ser Ala 
        915                  920                925            
Glu Gln Trp Lys Thr Ile Glu Leu Pro Val Asp Lys Thr Ile Gln Ala 
    930                  935                940                
Asn Val Leu Glu Ile Ile Ser Pro Asn Leu Phe Tyr Ala Leu Pro Lys 
945                  950                955                  960
Gly Met Pro Glu Asn Gln Glu Lys Leu Cys Met Leu Thr Ala Glu Leu 
                965                  970                975    
Leu Glu Tyr Cys Asn Ala Pro Lys Ser Arg Pro Pro Tyr Arg Pro Arg 
            980                  985                990        
Ile Gly Asp Ala Cys Cys Ala Lys Tyr Thr Ser Asp Asp Phe Trp Tyr 
        995                  1000                1005            
Arg Ala Val Val Leu Gly Thr Ser Asp Thr Asp Val Glu Val Leu Tyr 
    1010                1015                1020                
Ala Asp Tyr Gly Asn Ile Glu Thr Leu Pro Leu Cys Arg Val Gln Pro 
1025                1030                1035                1040
Ile Thr Ser Ser His Leu Ala Leu Pro Phe Gln Ile Ile Arg Cys Ser 
                1045                1050                1055    
Leu Glu Gly Leu Met Glu Leu Asn Gly Ser Ser Ser Gln Leu Ile Ile 
            1060                1065                1070        
Met Leu Leu Lys Asn Phe Met Leu Asn Gln Asn Val Met Leu Ser Val 
        1075                1080                1085            
Lys Gly Ile Thr Lys Asn Val His Thr Val Ser Val Glu Lys Cys Ser 
    1090                1095                1100                
Glu Asn Gly Thr Val Asp Val Ala Asp Lys Leu Val Thr Phe Gly Leu 
1105                1110                1115                1120
Ala Lys Asn Ile Thr Pro Gln Arg Gln Ser Ala Leu Asn Thr Glu Lys 
                1125                1130                1135    
Met Tyr Arg Met Asn Cys Cys Cys Thr Glu Leu Gln Lys Gln Val Glu 
            1140                1145                1150        
Lys His Glu His Ile Leu Leu Phe Leu Leu Asn Asn Ser Thr Asn Gln 
        1155                1160                1165            
Asn Lys Phe Ile Glu Met Lys Lys Leu Leu Lys Lys Thr Ala Ser Leu 
    1170                1175                1180                
Gly Gly Lys Pro Leu 
1185                

<210> 25
<211> 2144
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..2144
<223> /mol_type="DNA"
      /note="UGT2B15"
      /organism="Homo sapiens"

<400> 25
aaacaacaac tggaaaagaa gcattgcata agaccaggat gtctctgaaa tggacgtcag      60

tctttctgct gatacagctc agttgttact ttagctctgg aagctgtgga aaggtgctag     120

tgtggcccac agaatacagc cattggataa atatgaagac aatcctggaa gagcttgttc     180

agaggggtca tgaggtgact gtgttgacat cttcggcttc tactcttgtc aatgccagta     240

aatcatctgc tattaaatta gaagtttatc ctacatcttt aactaaaaat tatttggaag     300

attctcttct gaaaattctc gatagatgga tatatggtgt ttcaaaaaat acattttggt     360

catatttttc acaattacaa gaattgtgtt gggaatatta tgactacagt aacaagctct     420

gtaaagatgc agttttgaat aagaaactta tgatgaaact acaagagtca aagtttgatg     480

tcattctggc agatgccctt aatccctgtg gtgagctact ggctgaacta tttaacatac     540

cctttctgta cagtcttcga ttctctgttg gctacacatt tgagaagaat ggtggaggat     600

ttctgttccc tccttcctat gtacctgttg ttatgtcaga attaagtgat caaatgattt     660

tcatggagag gataaaaaat atgatacata tgctttattt tgacttttgg tttcaaattt     720

atgatctgaa gaagtgggac cagttttata gtgaagttct aggaagaccc actacattat     780

ttgagacaat ggggaaagct gaaatgtggc tcattcgaac ctattgggat tttgaatttc     840

ctcgcccatt cttaccaaat gttgattttg ttggaggact tcactgtaaa ccagccaaac     900

ccctgcctaa ggaaatggaa gagtttgtgc agagctctgg agaaaatggt attgtggtgt     960

tttctctggg gtcgatgatc agtaacatgt cagaagaaag tgccaacatg attgcatcag    1020

cccttgccca gatcccacaa aaggttctat ggagatttga tggcaagaag ccaaatactt    1080

taggttccaa tactcgactg tacaagtggt taccccagaa tgaccttctt ggtcatccca    1140

aaaccaaagc ttttataact catggtggaa ccaatggcat ctatgaggcg atctaccatg    1200

ggatccctat ggtgggcatt cccttgtttg cggatcaaca tgataacatt gctcacatga    1260

aagccaaggg agcagccctc agtgtggaca tcaggaccat gtcaagtaga gatttgctca    1320

atgcattgaa gtcagtcatt aatgaccctg tctataaaga gaatgtcatg aaattatcaa    1380

gaattcatca tgaccaacca atgaagcccc tggatcgagc agtcttctgg attgagtttg    1440

tcatgcgcca caaaggagcc aagcaccttc gagtcgcagc tcacaacctc acctggatcc    1500

agtaccactc tttggatgtg atagcattcc tgctggcctg cgtggcaact gtgatattta    1560

tcatcacaaa attttgcctg ttttgtttcc gaaagcttgc caaaaaagga aagaagaaga    1620

aaagagatta gttatatcaa aagcctgaag tggaatgact gaaagatggg actcctcctt    1680

tatttcagca tggagggttt taaatggagg atttcctttt tcctgtgaca aaacatcttt    1740

tcacaactta ccttgttaag acaaaattta ttttccaggg atttaatacg tactttagct    1800

gaattattct atgtcaatga tttttaagct atgaaaaata caatgggggg aaggatagca    1860

tttggagata tacctaatgt taaatgacga gttactggat gcagcacgcc aacatggcac    1920

atgtatacat atgtagctaa cctgcacgtt gtgcacatgt accctaaaac ttaaagtata    1980

atttaaaaaa agcaaaaaaa aaaaatacaa ctcttttttt taaaccagga aggaaaatgt    2040

gaacatggaa acaacttcta gtattggatc tgaaaataaa gtgtcatcca agccataaaa    2100

aaaaaagaaa agaaaaataa aaataatata aaaccttaaa aaaa                     2144


<210> 26
<211> 530
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..530
<223> /mol_type="protein"
      /note="UGT2B15"
      /organism="Homo sapiens"

<400> 26
Met Ser Leu Lys Trp Thr Ser Val Phe Leu Leu Ile Gln Leu Ser Cys 
1               5                   10                   15    
Tyr Phe Ser Ser Gly Ser Cys Gly Lys Val Leu Val Trp Pro Thr Glu 
            20                   25                  30        
Tyr Ser His Trp Ile Asn Met Lys Thr Ile Leu Glu Glu Leu Val Gln 
        35                   40                  45            
Arg Gly His Glu Val Thr Val Leu Thr Ser Ser Ala Ser Thr Leu Val 
    50                   55                  60                
Asn Ala Ser Lys Ser Ser Ala Ile Lys Leu Glu Val Tyr Pro Thr Ser 
65                   70                  75                  80
Leu Thr Lys Asn Tyr Leu Glu Asp Ser Leu Leu Lys Ile Leu Asp Arg 
                85                   90                  95    
Trp Ile Tyr Gly Val Ser Lys Asn Thr Phe Trp Ser Tyr Phe Ser Gln 
            100                  105                110        
Leu Gln Glu Leu Cys Trp Glu Tyr Tyr Asp Tyr Ser Asn Lys Leu Cys 
        115                  120                125            
Lys Asp Ala Val Leu Asn Lys Lys Leu Met Met Lys Leu Gln Glu Ser 
    130                  135                140                
Lys Phe Asp Val Ile Leu Ala Asp Ala Leu Asn Pro Cys Gly Glu Leu 
145                  150                155                  160
Leu Ala Glu Leu Phe Asn Ile Pro Phe Leu Tyr Ser Leu Arg Phe Ser 
                165                  170                175    
Val Gly Tyr Thr Phe Glu Lys Asn Gly Gly Gly Phe Leu Phe Pro Pro 
            180                  185                190        
Ser Tyr Val Pro Val Val Met Ser Glu Leu Ser Asp Gln Met Ile Phe 
        195                  200                205            
Met Glu Arg Ile Lys Asn Met Ile His Met Leu Tyr Phe Asp Phe Trp 
    210                  215                220                
Phe Gln Ile Tyr Asp Leu Lys Lys Trp Asp Gln Phe Tyr Ser Glu Val 
225                  230                235                  240
Leu Gly Arg Pro Thr Thr Leu Phe Glu Thr Met Gly Lys Ala Glu Met 
                245                  250                255    
Trp Leu Ile Arg Thr Tyr Trp Asp Phe Glu Phe Pro Arg Pro Phe Leu 
            260                  265                270        
Pro Asn Val Asp Phe Val Gly Gly Leu His Cys Lys Pro Ala Lys Pro 
        275                  280                285            
Leu Pro Lys Glu Met Glu Glu Phe Val Gln Ser Ser Gly Glu Asn Gly 
    290                  295                300                
Ile Val Val Phe Ser Leu Gly Ser Met Ile Ser Asn Met Ser Glu Glu 
305                  310                315                  320
Ser Ala Asn Met Ile Ala Ser Ala Leu Ala Gln Ile Pro Gln Lys Val 
                325                  330                335    
Leu Trp Arg Phe Asp Gly Lys Lys Pro Asn Thr Leu Gly Ser Asn Thr 
            340                  345                350        
Arg Leu Tyr Lys Trp Leu Pro Gln Asn Asp Leu Leu Gly His Pro Lys 
        355                  360                365            
Thr Lys Ala Phe Ile Thr His Gly Gly Thr Asn Gly Ile Tyr Glu Ala 
    370                  375                380                
Ile Tyr His Gly Ile Pro Met Val Gly Ile Pro Leu Phe Ala Asp Gln 
385                  390                395                  400
His Asp Asn Ile Ala His Met Lys Ala Lys Gly Ala Ala Leu Ser Val 
                405                  410                415    
Asp Ile Arg Thr Met Ser Ser Arg Asp Leu Leu Asn Ala Leu Lys Ser 
            420                  425                430        
Val Ile Asn Asp Pro Val Tyr Lys Glu Asn Val Met Lys Leu Ser Arg 
        435                  440                445            
Ile His His Asp Gln Pro Met Lys Pro Leu Asp Arg Ala Val Phe Trp 
    450                  455                460                
Ile Glu Phe Val Met Arg His Lys Gly Ala Lys His Leu Arg Val Ala 
465                  470                475                  480
Ala His Asn Leu Thr Trp Ile Gln Tyr His Ser Leu Asp Val Ile Ala 
                485                  490                495    
Phe Leu Leu Ala Cys Val Ala Thr Val Ile Phe Ile Ile Thr Lys Phe 
            500                  505                510        
Cys Leu Phe Cys Phe Arg Lys Leu Ala Lys Lys Gly Lys Lys Lys Lys 
        515                  520                525            
Arg Asp 
    530

<210> 27
<211> 1681
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..1681
<223> /mol_type="DNA"
      /note="HOXC6"
      /organism="Homo sapiens"

<400> 27
ttttgtctgt cctggattgg agccgtccct ataaccatct agttccgagt acaaactgga      60

gacagaaata aatattaaag aaatcataga ccgaccaggt aaaggcaaag ggatgaattc     120

ctacttcact aacccttcct tatcctgcca cctcgccggg ggccaggacg tcctccccaa     180

cgtcgccctc aattccaccg cctatgatcc agtgaggcat ttctcgacct atggagcggc     240

cgttgcccag aaccggatct actcgactcc cttttattcg ccacaggaga atgtcgtgtt     300

cagttccagc cgggggccgt atgactatgg atctaattcc ttttaccagg agaaagacat     360

gctctcaaac tgcagacaaa acaccttagg acataacaca cagacctcaa tcgctcagga     420

ttttagttct gagcagggca ggactgcgcc ccaggaccag aaagccagta tccagattta     480

cccctggatg cagcgaatga attcgcacag tggggtcggc tacggagcgg accggaggcg     540

cggccgccag atctactcgc ggtaccagac cctggaactg gagaaggaat ttcacttcaa     600

tcgctaccta acgcggcgcc ggcgcatcga gatcgccaac gcgctttgcc tgaccgagcg     660

acagatcaaa atctggttcc agaaccgccg gatgaagtgg aaaaaagaat ctaatctcac     720

atccactctc tcggggggcg gcggaggggc caccgccgac agcctgggcg gaaaagagga     780

aaagcgggaa gagacagaag aggagaagca gaaagagtga ccaggactgt ccctgccacc     840

cctctctccc tttctccctc gctccccacc aactctcccc taatcacaca ctctgtattt     900

atcactggca caattgatgt gttttgattc cctaaaacaa aattagggag tcaaacgtgg     960

acctgaaagt cagctctgga ccccctccct caccgcacaa ctctctttca ccacgcgcct    1020

cctcctcctc gctcccttgc tagctcgttc tcggcttgtc tacaggccct tttccccgtc    1080

caggccttgg gggctcggac cctgaactca gactctacag attgccctcc aagtgaggac    1140

ttggctcccc cactccttcg acgcccccac ccccgccccc cgtgcagaga gccggctcct    1200

gggcctgctg gggcctctgc tccagggcct cagggcccgg cctggcagcc ggggagggcc    1260

ggaggcccaa ggagggcgcg ccttggcccc acaccaaccc ccagggcctc cccgcagtcc    1320

ctgcctagcc cctctgcccc agcaaatgcc cagcccaggc aaattgtatt taaagaatcc    1380

tgggggtcat tatggcattt tacaaactgt gaccgtttct gtgtgaagat ttttagctgt    1440

atttgtggtc tctgtattta tatttatgtt tagcaccgtc agtgttccta tccaatttca    1500

aaaaaggaaa aaaaagaggg aaaattacaa aaagagagaa aaaaagtgaa tgacgtttgt    1560

ttagccagta ggagaaaata aataaataaa taaatccctt cgtgttaccc tcctgtataa    1620

atccaacctc tgggtccgtt ctcgaatatt taataaaact gatattattt ttaaaacttt    1680

a                                                                    1681


<210> 28
<211> 235
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..235
<223> /mol_type="protein"
      /note="HOXC6"
      /organism="Homo sapiens"

<400> 28
Met Asn Ser Tyr Phe Thr Asn Pro Ser Leu Ser Cys His Leu Ala Gly 
1               5                   10                   15    
Gly Gln Asp Val Leu Pro Asn Val Ala Leu Asn Ser Thr Ala Tyr Asp 
            20                   25                  30        
Pro Val Arg His Phe Ser Thr Tyr Gly Ala Ala Val Ala Gln Asn Arg 
        35                   40                  45            
Ile Tyr Ser Thr Pro Phe Tyr Ser Pro Gln Glu Asn Val Val Phe Ser 
    50                   55                  60                
Ser Ser Arg Gly Pro Tyr Asp Tyr Gly Ser Asn Ser Phe Tyr Gln Glu 
65                   70                  75                  80
Lys Asp Met Leu Ser Asn Cys Arg Gln Asn Thr Leu Gly His Asn Thr 
                85                   90                  95    
Gln Thr Ser Ile Ala Gln Asp Phe Ser Ser Glu Gln Gly Arg Thr Ala 
            100                  105                110        
Pro Gln Asp Gln Lys Ala Ser Ile Gln Ile Tyr Pro Trp Met Gln Arg 
        115                  120                125            
Met Asn Ser His Ser Gly Val Gly Tyr Gly Ala Asp Arg Arg Arg Gly 
    130                  135                140                
Arg Gln Ile Tyr Ser Arg Tyr Gln Thr Leu Glu Leu Glu Lys Glu Phe 
145                  150                155                  160
His Phe Asn Arg Tyr Leu Thr Arg Arg Arg Arg Ile Glu Ile Ala Asn 
                165                  170                175    
Ala Leu Cys Leu Thr Glu Arg Gln Ile Lys Ile Trp Phe Gln Asn Arg 
            180                  185                190        
Arg Met Lys Trp Lys Lys Glu Ser Asn Leu Thr Ser Thr Leu Ser Gly 
        195                  200                205            
Gly Gly Gly Gly Ala Thr Ala Asp Ser Leu Gly Gly Lys Glu Glu Lys 
    210                  215                220                
Arg Glu Glu Thr Glu Glu Glu Lys Gln Lys Glu 
225                  230                235

<210> 29
<211> 2005
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..2005
<223> /mol_type="DNA"
      /note="SFRP2"
      /organism="Homo sapiens"

<400> 29
caacggctca ttctgctccc ccgggtcgga gccccccgga gctgcgcgcg ggcttgcagc      60

gcctcgcccg cgctgtcctc ccggtgtccc gcttctccgc gccccagccg ccggctgcca     120

gcttttcggg gccccgagtc gcacccagcg aagagagcgg gcccgggaca agctcgaact     180

ccggccgcct cgcccttccc cggctccgct ccctctgccc cctcggggtc gcgcgcccac     240

gatgctgcag ggccctggct cgctgctgct gctcttcctc gcctcgcact gctgcctggg     300

ctcggcgcgc gggctcttcc tctttggcca gcccgacttc tcctacaagc gcagcaattg     360

caagcccatc cctgccaacc tgcagctgtg ccacggcatc gaataccaga acatgcggct     420

gcccaacctg ctgggccacg agaccatgaa ggaggtgctg gagcaggccg gcgcttggat     480

cccgctggtc atgaagcagt gccacccgga caccaagaag ttcctgtgct cgctcttcgc     540

ccccgtctgc ctcgatgacc tagacgagac catccagcca tgccactcgc tctgcgtgca     600

ggtgaaggac cgctgcgccc cggtcatgtc cgccttcggc ttcccctggc ccgacatgct     660

tgagtgcgac cgtttccccc aggacaacga cctttgcatc cccctcgcta gcagcgacca     720

cctcctgcca gccaccgagg aagctccaaa ggtatgtgaa gcctgcaaaa ataaaaatga     780

tgatgacaac gacataatgg aaacgctttg taaaaatgat tttgcactga aaataaaagt     840

gaaggagata acctacatca accgagatac caaaatcatc ctggagacca agagcaagac     900

catttacaag ctgaacggtg tgtccgaaag ggacctgaag aaatcggtgc tgtggctcaa     960

agacagcttg cagtgcacct gtgaggagat gaacgacatc aacgcgccct atctggtcat    1020

gggacagaaa cagggtgggg agctggtgat cacctcggtg aagcggtggc agaaggggca    1080

gagagagttc aagcgcatct cccgcagcat ccgcaagctg cagtgctagt cccggcatcc    1140

tgatggctcc gacaggcctg ctccagagca cggctgacca tttctgctcc gggatctcag    1200

ctcccgttcc ccaagcacac tcctagctgc tccagtctca gcctgggcag cttccccctg    1260

ccttttgcac gtttgcatcc ccagcatttc ctgagttata aggccacagg agtggatagc    1320

tgttttcacc taaaggaaaa gcccacccga atcttgtaga aatattcaaa ctaataaaat    1380

catgaatatt tttatgaagt ttaaaaatag ctcactttaa agctagtttt gaataggtgc    1440

aactgtgact tgggtctggt tggttgttgt ttgttgtttt gagtcagctg attttcactt    1500

cccactgagg ttgtcataac atgcaaattg cttcaatttt ctctgtggcc caaacttgtg    1560

ggtcacaaac cctgttgaga taaagctggc tgttatctca acatcttcat cagctccaga    1620

ctgagactca gtgtctaagt cttacaacaa ttcatcattt tataccttca atgggaactt    1680

aaactgttac atgtatcaca ttccagctac aatacttcca tttattagaa gcacattaac    1740

catttctata gcatgatttc ttcaagtaaa aggcaaaaga tataaatttt ataattgact    1800

tgagtacttt aagccttgtt taaaacattt cttacttaac ttttgcaaat taaacccatt    1860

gtagcttacc tgtaatatac atagtagttt acctttaaaa gttgtaaaaa tattgcttta    1920

accaacactg taaatatttc agataaacat tatattcttg tatataaact ttacatcctg    1980

ttttacctat aaaaaaaaaa aaaaa                                          2005


<210> 30
<211> 295
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..295
<223> /mol_type="protein"
      /note="SFRP2"
      /organism="Homo sapiens"

<400> 30
Met Leu Gln Gly Pro Gly Ser Leu Leu Leu Leu Phe Leu Ala Ser His 
1               5                   10                   15    
Cys Cys Leu Gly Ser Ala Arg Gly Leu Phe Leu Phe Gly Gln Pro Asp 
            20                   25                  30        
Phe Ser Tyr Lys Arg Ser Asn Cys Lys Pro Ile Pro Ala Asn Leu Gln 
        35                   40                  45            
Leu Cys His Gly Ile Glu Tyr Gln Asn Met Arg Leu Pro Asn Leu Leu 
    50                   55                  60                
Gly His Glu Thr Met Lys Glu Val Leu Glu Gln Ala Gly Ala Trp Ile 
65                   70                  75                  80
Pro Leu Val Met Lys Gln Cys His Pro Asp Thr Lys Lys Phe Leu Cys 
                85                   90                  95    
Ser Leu Phe Ala Pro Val Cys Leu Asp Asp Leu Asp Glu Thr Ile Gln 
            100                  105                110        
Pro Cys His Ser Leu Cys Val Gln Val Lys Asp Arg Cys Ala Pro Val 
        115                  120                125            
Met Ser Ala Phe Gly Phe Pro Trp Pro Asp Met Leu Glu Cys Asp Arg 
    130                  135                140                
Phe Pro Gln Asp Asn Asp Leu Cys Ile Pro Leu Ala Ser Ser Asp His 
145                  150                155                  160
Leu Leu Pro Ala Thr Glu Glu Ala Pro Lys Val Cys Glu Ala Cys Lys 
                165                  170                175    
Asn Lys Asn Asp Asp Asp Asn Asp Ile Met Glu Thr Leu Cys Lys Asn 
            180                  185                190        
Asp Phe Ala Leu Lys Ile Lys Val Lys Glu Ile Thr Tyr Ile Asn Arg 
        195                  200                205            
Asp Thr Lys Ile Ile Leu Glu Thr Lys Ser Lys Thr Ile Tyr Lys Leu 
    210                  215                220                
Asn Gly Val Ser Glu Arg Asp Leu Lys Lys Ser Val Leu Trp Leu Lys 
225                  230                235                  240
Asp Ser Leu Gln Cys Thr Cys Glu Glu Met Asn Asp Ile Asn Ala Pro 
                245                  250                255    
Tyr Leu Val Met Gly Gln Lys Gln Gly Gly Glu Leu Val Ile Thr Ser 
            260                  265                270        
Val Lys Arg Trp Gln Lys Gly Gln Arg Glu Phe Lys Arg Ile Ser Arg 
        275                  280                285            
Ser Ile Arg Lys Leu Gln Cys 
    290                  295

<210> 31
<211> 1814
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..1814
<223> /mol_type="DNA"
      /note="HOXD10"
      /organism="Homo sapiens"

<400> 31
cggggaatgt tttcctagag atgtcagcct acaaaggaca caatctctct tcttcaaatt      60

cttccccaaa atgtcctttc ccaacagctc tcctgctgct aatacttttt tagtagattc     120

cttgatcagt gcctgcagga gtgacagttt ttattccagc agcgccagca tgtacatgcc     180

accacctagc gcagacatgg ggacctatgg aatgcaaacc tgtggactgc tcccgtctct     240

ggccaaaaga gaagtgaacc accaaaatat gggtatgaat gtgcatcctt atatacctca     300

agtagacagt tggacagatc cgaacagatc ttgtcgaata gagcaacctg ttacacagca     360

agtccccact tgctccttca ccaccaacat taaggaagaa tccaattgct gcatgtattc     420

tgataagcgc aacaaactca tttcggccga ggtcccttcg taccagaggc tggtccctga     480

gtcttgtccc gttgagaacc ctgaggttcc cgtccctgga tattttagac tgagtcagac     540

ctacgccacc gggaaaaccc aagagtacaa taatagcccc gaaggcagct ccactgtcat     600

gctccagctc aaccctcgtg gcgcggccaa gccgcagctc tccgctgccc agctgcagat     660

ggaaaagaag atgaacgagc ccgtgagcgg ccaggagccc accaaagtct cccaggtgga     720

gagccccgag gccaaaggcg gccttcccga agagaggagc tgcctggctg aggtctccgt     780

gtccagtccc gaagtgcagg agaaggaaag caaagaggaa atcaagtctg atacaccaac     840

cagcaattgg ctcactgcaa agagtggcag aaagaagagg tgcccttaca ctaagcacca     900

aacgctggaa ttagaaaaag agttcttgtt caatatgtac ctcacccgcg agcgccgcct     960

agagatcagt aagagcgtta acctcaccga caggcaggtc aagatttggt ttcaaaaccg    1020

ccgaatgaaa ctcaagaaga tgagccgaga gaaccggatc cgagaactga ccgccaacct    1080

cacgttttct taggtctgag gccggtctga ggccggtcag aggccaggat tggagagggg    1140

gcaccgcgtt ccagggccca gtgctggagg actgggaaag cggaaacaaa accttcaccg    1200

ctctttgttt gttgttttgt tgtattttgt tttcctgcta gaatgtgact ttggggtcat    1260

tatgttcgtg ctgcaagtga tctgtaatcc ctatgagtat atatatatat atatatatat    1320

atatataaaa acttagcacg tgtaatttat tattttttca tcgtaatgca gggtaactat    1380

tattgcgcat tttcatttgg gtcttaactt attggaactg tagagcatcc atccatccat    1440

ccatccagca atgtgacttt ttcatgtctt tcctaacaca aaaggtctat gtgtgtggtt    1500

agtccatgaa ctcatggcat tttgaataca tccagtactt taaaaatgac atatatattt    1560

aaaaaaaaaa gattaagaaa acccacaagt tggagggagg gggacttaaa aagcacatta    1620

caatgtatct tttcacaaat gaatttagca gttgtccttg gtgagatggg atattggcga    1680

tttatgcctt gtagcctttc ccttgtggtg catctgtggt ttggtagaag tacaacagca    1740

acctgtcctt tctgtgcatg ttctggtcgc atgtataatg caataaactc tggaaatgag    1800

ttcaaaaaaa aaaa                                                      1814


<210> 32
<211> 340
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..340
<223> /mol_type="protein"
      /note="HOXD10"
      /organism="Homo sapiens"

<400> 32
Met Ser Phe Pro Asn Ser Ser Pro Ala Ala Asn Thr Phe Leu Val Asp 
1               5                   10                   15    
Ser Leu Ile Ser Ala Cys Arg Ser Asp Ser Phe Tyr Ser Ser Ser Ala 
            20                   25                  30        
Ser Met Tyr Met Pro Pro Pro Ser Ala Asp Met Gly Thr Tyr Gly Met 
        35                   40                  45            
Gln Thr Cys Gly Leu Leu Pro Ser Leu Ala Lys Arg Glu Val Asn His 
    50                   55                  60                
Gln Asn Met Gly Met Asn Val His Pro Tyr Ile Pro Gln Val Asp Ser 
65                   70                  75                  80
Trp Thr Asp Pro Asn Arg Ser Cys Arg Ile Glu Gln Pro Val Thr Gln 
                85                   90                  95    
Gln Val Pro Thr Cys Ser Phe Thr Thr Asn Ile Lys Glu Glu Ser Asn 
            100                  105                110        
Cys Cys Met Tyr Ser Asp Lys Arg Asn Lys Leu Ile Ser Ala Glu Val 
        115                  120                125            
Pro Ser Tyr Gln Arg Leu Val Pro Glu Ser Cys Pro Val Glu Asn Pro 
    130                  135                140                
Glu Val Pro Val Pro Gly Tyr Phe Arg Leu Ser Gln Thr Tyr Ala Thr 
145                  150                155                  160
Gly Lys Thr Gln Glu Tyr Asn Asn Ser Pro Glu Gly Ser Ser Thr Val 
                165                  170                175    
Met Leu Gln Leu Asn Pro Arg Gly Ala Ala Lys Pro Gln Leu Ser Ala 
            180                  185                190        
Ala Gln Leu Gln Met Glu Lys Lys Met Asn Glu Pro Val Ser Gly Gln 
        195                  200                205            
Glu Pro Thr Lys Val Ser Gln Val Glu Ser Pro Glu Ala Lys Gly Gly 
    210                  215                220                
Leu Pro Glu Glu Arg Ser Cys Leu Ala Glu Val Ser Val Ser Ser Pro 
225                  230                235                  240
Glu Val Gln Glu Lys Glu Ser Lys Glu Glu Ile Lys Ser Asp Thr Pro 
                245                  250                255    
Thr Ser Asn Trp Leu Thr Ala Lys Ser Gly Arg Lys Lys Arg Cys Pro 
            260                  265                270        
Tyr Thr Lys His Gln Thr Leu Glu Leu Glu Lys Glu Phe Leu Phe Asn 
        275                  280                285            
Met Tyr Leu Thr Arg Glu Arg Arg Leu Glu Ile Ser Lys Ser Val Asn 
    290                  295                300                
Leu Thr Asp Arg Gln Val Lys Ile Trp Phe Gln Asn Arg Arg Met Lys 
305                  310                315                  320
Leu Lys Lys Met Ser Arg Glu Asn Arg Ile Arg Glu Leu Thr Ala Asn 
                325                  330                335    
Leu Thr Phe Ser 
            340

<210> 33
<211> 3604
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..3604
<223> /mol_type="DNA"
      /note="RORB"
      /organism="Homo sapiens"

<400> 33
tctctcccct ctctttctct ctcgctgctc ccttcctccc tgtaactgaa cagtgaaaat       60

tcacattgtg gatccgctaa caggcacaga tgtcatgtga aaacgcacat gctctgccat      120

ccacaccgcc tttctttctt ttctttctgt ttcctttttt cccccttgtt ccttctccct      180

cttctttgta actaacaaaa ccaccaccaa ctcctcctcc tgctgctgcc cttcctcctc      240

ctcctcagtc caagtgatca caaaagaaat cttctgagcc ggaggcggtg gcatttttta      300

aaaagcaagc acattggaga gaaagaaaaa gaaaaacaaa accaaaacaa aacccaggca      360

ccagacagcc agaacatttt tttttcaccc ttcctgaaaa caaacaaaca aacaaacaat      420

catcaaaaca gtcaccacca acatcaaaac tgttaacata gcggcggcgg cggcaaacgt      480

caccctgcag ccacggcgtc cgcctaaagg gatggttttc tcggcagagc agctcttcgc      540

cgaccacctt cttcactcgt gctgagcggg atttttgggc tctccggggt tcgggctggg      600

agcagcttca tgactacgcg gagcgggaga gcggccacac catgcgagca caaattgaag      660

tgataccatg caaaatttgt ggcgataagt cctctgggat ccactacgga gtcatcacat      720

gtgaaggctg caagggattc tttaggagga gccagcagaa caatgcttct tattcctgcc      780

caaggcagag aaactgttta attgacagaa cgaacagaaa ccgttgccaa cactgccgac      840

tgcagaagtg tcttgcccta ggaatgtcaa gagatgctgt gaagtttggg aggatgtcca      900

agaagcaaag ggacagcctg tatgctgagg tgcagaagca ccagcagcgg ctgcaggaac      960

agcggcagca gcagagtggg gaggcagaag cccttgccag ggtgtacagc agcagcatta     1020

gcaacggcct gagcaacctg aacaacgaga ccagcggcac ttatgccaac gggcacgtca     1080

ttgacctgcc caagtctgag ggttattaca acgtcgattc cggtcagccg tcccctgatc     1140

agtcaggact tgacatgact ggaatcaaac agataaagca agaacctatc tatgacctca     1200

catccgtacc caacttgttt acctatagct ctttcaacaa tgggcagtta gcaccaggga     1260

taaccatgac tgaaatcgac cgaattgcac agaacatcat taagtcccat ttggagacat     1320

gtcaatacac catggaagag ctgcaccagc tggcgtggca gacccacacc tatgaagaaa     1380

ttaaagcata tcaaagcaag tccagggaag cactgtggca acaatgtgcc atccagatca     1440

ctcacgccat ccaatacgtg gtggagtttg caaagcggat aacaggcttc atggagctct     1500

gtcaaaatga tcaaattcta cttctgaagt caggttgctt ggaagtggtt ttagtgagaa     1560

tgtgccgtgc cttcaaccca ttaaacaaca ctgttctgtt tgaaggaaaa tatggaggaa     1620

tgcaaatgtt caaagcctta ggttctgatg acctagtgaa tgaagcattt gactttgcaa     1680

agaatttgtg ttccttgcag ctgaccgagg aggagatcgc tttgttctca tctgctgttc     1740

tgatatctcc agaccgagcc tggcttatag aaccaaggaa agtccagaag cttcaggaaa     1800

aaatttattt tgcacttcaa catgtgattc agaagaatca cctggatgat gagaccttgg     1860

caaagttaat agccaagata ccaaccatca cggcagtttg caacttgcac ggggagaagc     1920

tgcaggtatt taagcaatct catccagaga tagtgaatac actgtttcct ccgttataca     1980

aggagctctt taatcctgac tgtgccaccg gctgcaaatg aaggggacaa gagaactgtc     2040

tcatagtcat ggaatgcatc accattaaga caaaagcaat gtgttcatga agacttaaga     2100

aaaatgtcac tactgcaaca ttaggaatgt cctgcactta atagaattat ttttcaccgc     2160

tacagtttga agaatgtaaa tatgcacctg agtggggctc ttttatttgt ttgtttgttt     2220

ttgaaatgac cataaatata caaatatagg acactgggtg ttatcctttt tttaatttta     2280

ttcgggtatg ttttgggaga caactgttta tagaatttta ttgtagatat atacaagaaa     2340

agagcggtac tttacatgat tacttttcct gttgattgtt caaatataat ttaagaaaat     2400

tccacttaat aggcttacct atttctatgt ttttaggtag ttgatgcatg tgtaaatttg     2460

tagctgtctt ggaaagtact gtgcatgtat gtaataagta tataatatgt gagaatatta     2520

tatatgacta ttacttatac atgcacatgc actgtggctt aaataccata cctactagca     2580

atggaggttc agtcaggctc tcttctatga tttaccttct gtgttatatg ttacctttat     2640

gttagacaat caggattttg ttttcccagc cagagttttc atctatagtc aatggcagga     2700

cggtaccaac tcagagttaa gtctacaaag gaataaacat aatgtgtggc ctctatatac     2760

aaactctatt tctgtcaatg acatcaaagc cttgtcaaga tggttcatat tgggaaggag     2820

acagtatttt aagccatttt cctgtttcaa gaattaggcc acagataaca ttgcaaggtc     2880

caagactttt ttgaccaaac agtagatatt ttctattttt caccagaaca cataaaaaca     2940

ctttttttct tttggatttc tggttgtgaa acaagcttga tttcagtgct tattgtgtct     3000

tcaactgaaa aatacaatct gtggattatg actaccagca atttttttct aggaaagtta     3060

aaagaataaa tcagaaccca gggcaacaat gccatttcat gtaaacattt tctctctcac     3120

catgttttgg caagaaaagg tagaaagaga agacccagag tgaagaagta attctttata     3180

ttcctttctt taatgtattt gttaggaaaa gtggcaataa agggggaggc atattataaa     3240

atgctataat ataaaaatgt agcaaaaact tgacagacta gaaaaaaaaa gatctgtgtt     3300

attctaggga actaatgtac cccaaagcca aaactaattc ctgtgaagtt tacagttaca     3360

tcatccattt accctagaat tattttttta gcaactttta gaaataaaga atacaactgt     3420

gacattagga tcagagattt tagacttcct tgtacaaatt ctcacttctc cacctgctca     3480

ccaatgaaat taatcataag aaaagcatat attccaagaa atttgttctg cctgtgtcct     3540

ggaggcctat acctctgtta ttttctgata caaaataaaa cttaaaaaaa agaaaacaag     3600

ctaa                                                                  3604


<210> 34
<211> 459
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..459
<223> /mol_type="protein"
      /note="RORB"
      /organism="Homo sapiens"

<400> 34
Met Arg Ala Gln Ile Glu Val Ile Pro Cys Lys Ile Cys Gly Asp Lys 
1               5                   10                   15    
Ser Ser Gly Ile His Tyr Gly Val Ile Thr Cys Glu Gly Cys Lys Gly 
            20                   25                  30        
Phe Phe Arg Arg Ser Gln Gln Asn Asn Ala Ser Tyr Ser Cys Pro Arg 
        35                   40                  45            
Gln Arg Asn Cys Leu Ile Asp Arg Thr Asn Arg Asn Arg Cys Gln His 
    50                   55                  60                
Cys Arg Leu Gln Lys Cys Leu Ala Leu Gly Met Ser Arg Asp Ala Val 
65                   70                  75                  80
Lys Phe Gly Arg Met Ser Lys Lys Gln Arg Asp Ser Leu Tyr Ala Glu 
                85                   90                  95    
Val Gln Lys His Gln Gln Arg Leu Gln Glu Gln Arg Gln Gln Gln Ser 
            100                  105                110        
Gly Glu Ala Glu Ala Leu Ala Arg Val Tyr Ser Ser Ser Ile Ser Asn 
        115                  120                125            
Gly Leu Ser Asn Leu Asn Asn Glu Thr Ser Gly Thr Tyr Ala Asn Gly 
    130                  135                140                
His Val Ile Asp Leu Pro Lys Ser Glu Gly Tyr Tyr Asn Val Asp Ser 
145                  150                155                  160
Gly Gln Pro Ser Pro Asp Gln Ser Gly Leu Asp Met Thr Gly Ile Lys 
                165                  170                175    
Gln Ile Lys Gln Glu Pro Ile Tyr Asp Leu Thr Ser Val Pro Asn Leu 
            180                  185                190        
Phe Thr Tyr Ser Ser Phe Asn Asn Gly Gln Leu Ala Pro Gly Ile Thr 
        195                  200                205            
Met Thr Glu Ile Asp Arg Ile Ala Gln Asn Ile Ile Lys Ser His Leu 
    210                  215                220                
Glu Thr Cys Gln Tyr Thr Met Glu Glu Leu His Gln Leu Ala Trp Gln 
225                  230                235                  240
Thr His Thr Tyr Glu Glu Ile Lys Ala Tyr Gln Ser Lys Ser Arg Glu 
                245                  250                255    
Ala Leu Trp Gln Gln Cys Ala Ile Gln Ile Thr His Ala Ile Gln Tyr 
            260                  265                270        
Val Val Glu Phe Ala Lys Arg Ile Thr Gly Phe Met Glu Leu Cys Gln 
        275                  280                285            
Asn Asp Gln Ile Leu Leu Leu Lys Ser Gly Cys Leu Glu Val Val Leu 
    290                  295                300                
Val Arg Met Cys Arg Ala Phe Asn Pro Leu Asn Asn Thr Val Leu Phe 
305                  310                315                  320
Glu Gly Lys Tyr Gly Gly Met Gln Met Phe Lys Ala Leu Gly Ser Asp 
                325                  330                335    
Asp Leu Val Asn Glu Ala Phe Asp Phe Ala Lys Asn Leu Cys Ser Leu 
            340                  345                350        
Gln Leu Thr Glu Glu Glu Ile Ala Leu Phe Ser Ser Ala Val Leu Ile 
        355                  360                365            
Ser Pro Asp Arg Ala Trp Leu Ile Glu Pro Arg Lys Val Gln Lys Leu 
    370                  375                380                
Gln Glu Lys Ile Tyr Phe Ala Leu Gln His Val Ile Gln Lys Asn His 
385                  390                395                  400
Leu Asp Asp Glu Thr Leu Ala Lys Leu Ile Ala Lys Ile Pro Thr Ile 
                405                  410                415    
Thr Ala Val Cys Asn Leu His Gly Glu Lys Leu Gln Val Phe Lys Gln 
            420                  425                430        
Ser His Pro Glu Ile Val Asn Thr Leu Phe Pro Pro Leu Tyr Lys Glu 
        435                  440                445            
Leu Phe Asn Pro Asp Cys Ala Thr Gly Cys Lys 
    450                  455                

<210> 35
<211> 3412
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..3412
<223> /mol_type="DNA"
      /note="RRM2"
      /organism="Homo sapiens"

<400> 35
aggcgcagcc aatgggaagg gtcggaggca tggcacagcc aatgggaagg gccggggcac       60

caaagccaat gggaagggcc gggagcgcgc ggcgcgggag atttaaaggc tgctggagtg      120

aggggtcgcc cgtgcaccct gtcccagccg tcctgtcctg gctgctcgct ctgcttcgct      180

gcgcctccac tatgctctcc ctccgtgtcc cgctcgcgcc catcacggac ccgcagcagc      240

tgcagctctc gccgctgaag gggctcagct tggtcgacaa ggagaacacg ccgccggccc      300

tgagcgggac ccgcgtcctg gccagcaaga ccgcgaggag gatcttccag gagcccacgg      360

agccgaaaac taaagcagct gcccccggcg tggaggatga gccgctgctg agagaaaacc      420

cccgccgctt tgtcatcttc cccatcgagt accatgatat ctggcagatg tataagaagg      480

cagaggcttc cttttggacc gccgaggagg tggacctctc caaggacatt cagcactggg      540

aatccctgaa acccgaggag agatatttta tatcccatgt tctggctttc tttgcagcaa      600

gcgatggcat agtaaatgaa aacttggtgg agcgatttag ccaagaagtt cagattacag      660

aagcccgctg tttctatggc ttccaaattg ccatggaaaa catacattct gaaatgtata      720

gtcttcttat tgacacttac ataaaagatc ccaaagaaag ggaatttctc ttcaatgcca      780

ttgaaacgat gccttgtgtc aagaagaagg cagactgggc cttgcgctgg attggggaca      840

aagaggctac ctatggtgaa cgtgttgtag cctttgctgc agtggaaggc attttctttt      900

ccggttcttt tgcgtcgata ttctggctca agaaacgagg actgatgcct ggcctcacat      960

tttctaatga acttattagc agagatgagg gtttacactg tgattttgct tgcctgatgt     1020

tcaaacacct ggtacacaaa ccatcggagg agagagtaag agaaataatt atcaatgctg     1080

ttcggataga acaggagttc ctcactgagg ccttgcctgt gaagctcatt gggatgaatt     1140

gcactctaat gaagcaatac attgagtttg tggcagacag acttatgctg gaactgggtt     1200

ttagcaaggt tttcagagta gagaacccat ttgactttat ggagaatatt tcactggaag     1260

gaaagactaa cttctttgag aagagagtag gcgagtatca gaggatggga gtgatgtcaa     1320

gtccaacaga gaattctttt accttggatg ctgacttcta aatgaactga agatgtgccc     1380

ttacttggct gatttttttt ttccatctca taagaaaaat cagctgaagt gttaccaact     1440

agccacacca tgaattgtcc gtaatgttca ttaacagcat ctttaaaact gtgtagctac     1500

ctcacaacca gtcctgtctg tttatagtgc tggtagtatc accttttgcc agaaggcctg     1560

gctggctgtg acttaccata gcagtgacaa tggcagtctt ggctttaaag tgaggggtga     1620

ccctttagtg agcttagcac agcgggatta aacagtcctt taaccagcac agccagttaa     1680

aagatgcagc ctcactgctt caacgcagat tttaatgttt acttaaatat aaacctggca     1740

ctttacaaac aaataaacat tgtttgtact cacaaggcga taatagcttg atttatttgg     1800

tttctacacc aaatacattc tcctgaccac taatgggagc caattcacaa ttcactaagt     1860

gactaaagta agttaaactt gtgtagacta agcatgtaat ttttaagttt tattttaatg     1920

aattaaaata tttgttaacc aactttaaag tcagtcctgt gtatacctag atattagtca     1980

gttggtgcca gatagaagac aggttgtgtt tttatcctgt ggcttgtgta gtgtcctggg     2040

attctctgcc ccctctgagt agagtgttgt gggataaagg aatctctcag ggcaaggagc     2100

ttcttaagtt aaatcactag aaatttaggg gtgatctggg ccttcatatg tgtgagaagc     2160

cgtttcattt tatttctcac tgtattttcc tcaacgtctg gttgatgaga aaaaattctt     2220

gaagagtttt catatgtggg agctaaggta gtattgtaaa atttcaagtc atccttaaac     2280

aaaatgatcc acctaagatc ttgcccctgt taagtggtga aatcaactag aggtggttcc     2340

tacaagttgt tcattctagt tttgtttggt gtaagtaggt tgtgtgagtt aattcattta     2400

tatttactat gtctgttaaa tcagaaattt tttattatct atgttcttct agattttacc     2460

tgtagttcat acttcagtca cccagtgtct tattctggca ttgtctaaat ctgagcattg     2520

tctaggggga tcttaaactt tagtaggaaa ccatgagctg ttaatacagt ttccattcaa     2580

atattaattt cagaatgaaa cataattttt tttttttttt ttgagatgga gtctcgctct     2640

gttgcccagg ctggagtgca gtggcgcgat tttggctcac tgtaacctcc atctcctggg     2700

ttcaagcaat tctcctgtct cagcctccct agtagctggg actgcaggta tgtgctacca     2760

cacctggcta atttttgtat ttttagtaga gatggagttt caccatattg gtcaggctgg     2820

tcttgaactc ctgacctcag gtgatccacc cacctcggcc tcccaaagtg ctgggattgc     2880

aggcgtgata aacaaatatt cttaataggg ctactttgaa ttaatctgcc tttatgtttg     2940

ggagaagaaa gctgagacat tgcatgaaag atgatgagag ataaatgttg atcttttggc     3000

cccatttgtt aattgtattc agtatttgaa cgtcgtcctg tttattgtta gttttcttca     3060

tcatttattg tatagacaat ttttaaatct ctgtaatatg atacattttc ctatctttta     3120

agttattgtt acctaaagtt aatccagatt atatggtcct tatatgtgta caacattaaa     3180

atgaaaggct ttgtcttgca ttgtgaggta caggcggaag ttggaatcag gttttaggat     3240

tctgtctctc attagctgaa taatgtgagg attaacttct gccagctcag accatttcct     3300

aatcagttga aagggaaaca agtatttcag tctcaaaatt gaataatgca caagtcttaa     3360

gtgattaaaa taaaactgtt cttatgtcag tttcaaaaaa aaaaaaaaaa aa             3412


<210> 36
<211> 389
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..389
<223> /mol_type="protein"
      /note="RRM2"
      /organism="Homo sapiens"

<400> 36
Met Leu Ser Leu Arg Val Pro Leu Ala Pro Ile Thr Asp Pro Gln Gln 
1               5                   10                   15    
Leu Gln Leu Ser Pro Leu Lys Gly Leu Ser Leu Val Asp Lys Glu Asn 
            20                   25                  30        
Thr Pro Pro Ala Leu Ser Gly Thr Arg Val Leu Ala Ser Lys Thr Ala 
        35                   40                  45            
Arg Arg Ile Phe Gln Glu Pro Thr Glu Pro Lys Thr Lys Ala Ala Ala 
    50                   55                  60                
Pro Gly Val Glu Asp Glu Pro Leu Leu Arg Glu Asn Pro Arg Arg Phe 
65                   70                  75                  80
Val Ile Phe Pro Ile Glu Tyr His Asp Ile Trp Gln Met Tyr Lys Lys 
                85                   90                  95    
Ala Glu Ala Ser Phe Trp Thr Ala Glu Glu Val Asp Leu Ser Lys Asp 
            100                  105                110        
Ile Gln His Trp Glu Ser Leu Lys Pro Glu Glu Arg Tyr Phe Ile Ser 
        115                  120                125            
His Val Leu Ala Phe Phe Ala Ala Ser Asp Gly Ile Val Asn Glu Asn 
    130                  135                140                
Leu Val Glu Arg Phe Ser Gln Glu Val Gln Ile Thr Glu Ala Arg Cys 
145                  150                155                  160
Phe Tyr Gly Phe Gln Ile Ala Met Glu Asn Ile His Ser Glu Met Tyr 
                165                  170                175    
Ser Leu Leu Ile Asp Thr Tyr Ile Lys Asp Pro Lys Glu Arg Glu Phe 
            180                  185                190        
Leu Phe Asn Ala Ile Glu Thr Met Pro Cys Val Lys Lys Lys Ala Asp 
        195                  200                205            
Trp Ala Leu Arg Trp Ile Gly Asp Lys Glu Ala Thr Tyr Gly Glu Arg 
    210                  215                220                
Val Val Ala Phe Ala Ala Val Glu Gly Ile Phe Phe Ser Gly Ser Phe 
225                  230                235                  240
Ala Ser Ile Phe Trp Leu Lys Lys Arg Gly Leu Met Pro Gly Leu Thr 
                245                  250                255    
Phe Ser Asn Glu Leu Ile Ser Arg Asp Glu Gly Leu His Cys Asp Phe 
            260                  265                270        
Ala Cys Leu Met Phe Lys His Leu Val His Lys Pro Ser Glu Glu Arg 
        275                  280                285            
Val Arg Glu Ile Ile Ile Asn Ala Val Arg Ile Glu Gln Glu Phe Leu 
    290                  295                300                
Thr Glu Ala Leu Pro Val Lys Leu Ile Gly Met Asn Cys Thr Leu Met 
305                  310                315                  320
Lys Gln Tyr Ile Glu Phe Val Ala Asp Arg Leu Met Leu Glu Leu Gly 
                325                  330                335    
Phe Ser Lys Val Phe Arg Val Glu Asn Pro Phe Asp Phe Met Glu Asn 
            340                  345                350        
Ile Ser Leu Glu Gly Lys Thr Asn Phe Phe Glu Lys Arg Val Gly Glu 
        355                  360                365            
Tyr Gln Arg Met Gly Val Met Ser Ser Pro Thr Glu Asn Ser Phe Thr 
    370                  375                380                
Leu Asp Ala Asp Phe 
385                

<210> 37
<211> 3027
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..3027
<223> /mol_type="DNA"
      /note="TGM4"
      /organism="Homo sapiens"

<400> 37
ggaccgactg tgtggaagca ccaggcatca gagatagagt cttccctggc attgcaggag      60

agaatctgaa gggatgatgg atgcatcaaa agagctgcaa gttctccaca ttgacttctt     120

gaatcaggac aacgccgttt ctcaccacac atgggagttc caaacgagca gtcctgtgtt     180

ccggcgagga caggtgtttc acctgcggct ggtgctgaac cagcccctac aatcctacca     240

ccaactgaaa ctggaattca gcacagggcc gaatcctagc atcgccaaac acaccctggt     300

ggtgctcgac ccgaggacgc cctcagacca ctacaactgg caggcaaccc ttcaaaatga     360

gtctggcaaa gaggtcacag tggctgtcac cagttccccc aatgccatcc tgggcaagta     420

ccaactaaac gtgaaaactg gaaaccacat ccttaagtct gaagaaaaca tcctatacct     480

tctcttcaac ccatggtgta aagaggacat ggttttcatg cctgatgagg acgagcgcaa     540

agagtacatc ctcaatgaca cgggctgcca ttacgtgggg gctgccagaa gtatcaaatg     600

caaaccctgg aactttggtc agtttgagaa aaatgtcctg gactgctgca tttccctgct     660

gactgagagc tccctcaagc ccacagatag gagggacccc gtgctggtgt gcagggccat     720

gtgtgctatg atgagctttg agaaaggcca gggcgtgctc attgggaatt ggactgggga     780

ctacgaaggt ggcacagccc catacaagtg gacaggcagt gccccgatcc tgcagcagta     840

ctacaacacg aagcaggctg tgtgctttgg ccagtgctgg gtgtttgctg ggatcctgac     900

tacagtgctg agagcgttgg gcatcccagc acgcagtgtg acaggcttcg attcagctca     960

cgacacagaa aggaacctca cggtggacac ctatgtgaat gagaatggcg agaaaatcac    1020

cagtatgacc cacgactctg tctggaattt ccatgtgtgg acggatgcct ggatgaagcg    1080

accggatctg cccaagggct acgacggctg gcaggctgtg gacgcaacgc cgcaggagcg    1140

aagccagggt gtcttctgct gtgggccatc accactgacc gccatccgca aaggtgacat    1200

ctttattgtc tatgacacca gattcgtctt ctcagaagtg aatggtgaca ggctcatctg    1260

gttggtgaag atggtgaatg ggcaggagga gttacacgta atttcaatgg agaccacaag    1320

catcgggaaa aacatcagca ccaaggcagt gggccaagac aggcggagag atatcaccta    1380

tgagtacaag tatccagaag gctcctctga ggagaggcag gtcatggatc atgccttcct    1440

ccttctcagt tctgagaggg agcacagacg acctgtaaaa gagaactttc ttcacatgtc    1500

ggtacaatca gatgatgtgc tgctgggaaa ctctgttaat ttcaccgtga ttcttaaaag    1560

gaagaccgct gccctacaga atgtcaacat cttgggctcc tttgaactac agttgtacac    1620

tggcaagaag atggcaaaac tgtgtgacct caataagacc tcgcagatcc aaggtcaagt    1680

atcagaagtg actctgacct tggactccaa gacctacatc aacagcctgg ctatattaga    1740

tgatgagcca gttatcagag gtttcatcat tgcggaaatt gtggagtcta aggaaatcat    1800

ggcctctgaa gtattcacgt ctttccagta ccctgagttc tctatagagt tgcctaacac    1860

aggcagaatt ggccagctac ttgtctgcaa ttgtatcttc aagaataccc tggccatccc    1920

tttgactgac gtcaagttct ctttggaaag cctgggcatc tcctcactac agacctctga    1980

ccatgggacg gtgcagcctg gtgagaccat ccaatcccaa ataaaatgca ccccaataaa    2040

aactggaccc aagaaattta tcgtcaagtt aagttccaaa caagtgaaag agattaatgc    2100

tcagaagatt gttctcatca ccaagtagcc ttgtctgatg ctgtggagcc ttagttgaga    2160

tttcagcatt tcctaccttg tgcttagctt tcagattatg gatgattaaa tttgatgact    2220

tatatgaggg cagattcaag agccagcagg tcaaaaaggc caacacaacc ataagcagcc    2280

agacccacaa ggccaggtcc tgtgctatca cagggtcacc tcttttacag ttagaaacac    2340

cagccgaggc cacagaatcc catccctttc ctgagtcatg gcctcaaaaa tcagggccac    2400

cattgtctca attcaaatcc atagatttcg aagccacaga gtctctccct ggagcagcag    2460

actatgggca gcccagtgct gccacctgct gacgaccctt gagaagctgc catatcttca    2520

ggccatgggt tcaccagccc tgaaggcacc tgtcaactgg agtgctctct cagcactggg    2580

atgggcctga tagaagtgca ttctcctcct attgcctcca ttctcctctc tctatccctg    2640

aaatccagga agtccctctc ctggtgctcc aagcagtttg aagcccaatc tgcaaggaca    2700

tttctcaagg gccatgtggt tttgcagaca accctgtcct caggcctgaa ctcaccatag    2760

agacccatgt cagcaaacgg tgaccagcaa atcctcttcc cttattctaa agctgcccct    2820

tgggagactc cagggagaag gcattgcttc ctccctggtg tgaactcttt ctttggtatt    2880

ccatccacta tcctggcaac tcaaggctgc ttctgttaac tgaagcctgc tccttcttgt    2940

tctgccctcc agagatttgc tcaaatgatc aataagcttt aaattaaact ctacttcaaa    3000

aaaaaaaaaa aaaaaaaaaa aaaaaaa                                        3027


<210> 38
<211> 684
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..684
<223> /mol_type="protein"
      /note="TGM4"
      /organism="Homo sapiens"

<400> 38
Met Met Asp Ala Ser Lys Glu Leu Gln Val Leu His Ile Asp Phe Leu 
1               5                   10                   15    
Asn Gln Asp Asn Ala Val Ser His His Thr Trp Glu Phe Gln Thr Ser 
            20                   25                  30        
Ser Pro Val Phe Arg Arg Gly Gln Val Phe His Leu Arg Leu Val Leu 
        35                   40                  45            
Asn Gln Pro Leu Gln Ser Tyr His Gln Leu Lys Leu Glu Phe Ser Thr 
    50                   55                  60                
Gly Pro Asn Pro Ser Ile Ala Lys His Thr Leu Val Val Leu Asp Pro 
65                   70                  75                  80
Arg Thr Pro Ser Asp His Tyr Asn Trp Gln Ala Thr Leu Gln Asn Glu 
                85                   90                  95    
Ser Gly Lys Glu Val Thr Val Ala Val Thr Ser Ser Pro Asn Ala Ile 
            100                  105                110        
Leu Gly Lys Tyr Gln Leu Asn Val Lys Thr Gly Asn His Ile Leu Lys 
        115                  120                125            
Ser Glu Glu Asn Ile Leu Tyr Leu Leu Phe Asn Pro Trp Cys Lys Glu 
    130                  135                140                
Asp Met Val Phe Met Pro Asp Glu Asp Glu Arg Lys Glu Tyr Ile Leu 
145                  150                155                  160
Asn Asp Thr Gly Cys His Tyr Val Gly Ala Ala Arg Ser Ile Lys Cys 
                165                  170                175    
Lys Pro Trp Asn Phe Gly Gln Phe Glu Lys Asn Val Leu Asp Cys Cys 
            180                  185                190        
Ile Ser Leu Leu Thr Glu Ser Ser Leu Lys Pro Thr Asp Arg Arg Asp 
        195                  200                205            
Pro Val Leu Val Cys Arg Ala Met Cys Ala Met Met Ser Phe Glu Lys 
    210                  215                220                
Gly Gln Gly Val Leu Ile Gly Asn Trp Thr Gly Asp Tyr Glu Gly Gly 
225                  230                235                  240
Thr Ala Pro Tyr Lys Trp Thr Gly Ser Ala Pro Ile Leu Gln Gln Tyr 
                245                  250                255    
Tyr Asn Thr Lys Gln Ala Val Cys Phe Gly Gln Cys Trp Val Phe Ala 
            260                  265                270        
Gly Ile Leu Thr Thr Val Leu Arg Ala Leu Gly Ile Pro Ala Arg Ser 
        275                  280                285            
Val Thr Gly Phe Asp Ser Ala His Asp Thr Glu Arg Asn Leu Thr Val 
    290                  295                300                
Asp Thr Tyr Val Asn Glu Asn Gly Glu Lys Ile Thr Ser Met Thr His 
305                  310                315                  320
Asp Ser Val Trp Asn Phe His Val Trp Thr Asp Ala Trp Met Lys Arg 
                325                  330                335    
Pro Asp Leu Pro Lys Gly Tyr Asp Gly Trp Gln Ala Val Asp Ala Thr 
            340                  345                350        
Pro Gln Glu Arg Ser Gln Gly Val Phe Cys Cys Gly Pro Ser Pro Leu 
        355                  360                365            
Thr Ala Ile Arg Lys Gly Asp Ile Phe Ile Val Tyr Asp Thr Arg Phe 
    370                  375                380                
Val Phe Ser Glu Val Asn Gly Asp Arg Leu Ile Trp Leu Val Lys Met 
385                  390                395                  400
Val Asn Gly Gln Glu Glu Leu His Val Ile Ser Met Glu Thr Thr Ser 
                405                  410                415    
Ile Gly Lys Asn Ile Ser Thr Lys Ala Val Gly Gln Asp Arg Arg Arg 
            420                  425                430        
Asp Ile Thr Tyr Glu Tyr Lys Tyr Pro Glu Gly Ser Ser Glu Glu Arg 
        435                  440                445            
Gln Val Met Asp His Ala Phe Leu Leu Leu Ser Ser Glu Arg Glu His 
    450                  455                460                
Arg Arg Pro Val Lys Glu Asn Phe Leu His Met Ser Val Gln Ser Asp 
465                  470                475                  480
Asp Val Leu Leu Gly Asn Ser Val Asn Phe Thr Val Ile Leu Lys Arg 
                485                  490                495    
Lys Thr Ala Ala Leu Gln Asn Val Asn Ile Leu Gly Ser Phe Glu Leu 
            500                  505                510        
Gln Leu Tyr Thr Gly Lys Lys Met Ala Lys Leu Cys Asp Leu Asn Lys 
        515                  520                525            
Thr Ser Gln Ile Gln Gly Gln Val Ser Glu Val Thr Leu Thr Leu Asp 
    530                  535                540                
Ser Lys Thr Tyr Ile Asn Ser Leu Ala Ile Leu Asp Asp Glu Pro Val 
545                  550                555                  560
Ile Arg Gly Phe Ile Ile Ala Glu Ile Val Glu Ser Lys Glu Ile Met 
                565                  570                575    
Ala Ser Glu Val Phe Thr Ser Phe Gln Tyr Pro Glu Phe Ser Ile Glu 
            580                  585                590        
Leu Pro Asn Thr Gly Arg Ile Gly Gln Leu Leu Val Cys Asn Cys Ile 
        595                  600                605            
Phe Lys Asn Thr Leu Ala Ile Pro Leu Thr Asp Val Lys Phe Ser Leu 
    610                  615                620                
Glu Ser Leu Gly Ile Ser Ser Leu Gln Thr Ser Asp His Gly Thr Val 
625                  630                635                  640
Gln Pro Gly Glu Thr Ile Gln Ser Gln Ile Lys Cys Thr Pro Ile Lys 
                645                  650                655    
Thr Gly Pro Lys Lys Phe Ile Val Lys Leu Ser Ser Lys Gln Val Lys 
            660                  665                670        
Glu Ile Asn Ala Gln Lys Ile Val Leu Ile Thr Lys 
        675                  680                

<210> 39
<211> 2101
<212> DNA
<213> Homo sapiens

<220> 
<221> source
<222> 1..2101
<223> /mol_type="DNA"
      /note="SNAI2"
      /organism="Homo sapiens"

<400> 39
agttcgtaaa ggagccgggt gacttcagag gcgccggccc gtccgtctgc cgcacctgag      60

cacggcccct gcccgagcct ggcccgccgc gatgctgtag ggaccgccgt gtcctcccgc     120

cggaccgtta tccgcgccgg gcgcccgcca gacccgctgg caagatgccg cgctccttcc     180

tggtcaagaa gcatttcaac gcctccaaaa agccaaacta cagcgaactg gacacacata     240

cagtgattat ttccccgtat ctctatgaga gttactccat gcctgtcata ccacaaccag     300

agatcctcag ctcaggagca tacagcccca tcactgtgtg gactaccgct gctccattcc     360

acgcccagct acccaatggc ctctctcctc tttccggata ctcctcatct ttggggcgag     420

tgagtccccc tcctccatct gacacctcct ccaaggacca cagtggctca gaaagcccca     480

ttagtgatga agaggaaaga ctacagtcca agctttcaga cccccatgcc attgaagctg     540

aaaagtttca gtgcaattta tgcaataaga cctattcaac tttttctggg ctggccaaac     600

ataagcagct gcactgcgat gcccagtcta gaaaatcttt cagctgtaaa tactgtgaca     660

aggaatatgt gagcctgggc gccctgaaga tgcatattcg gacccacaca ttaccttgtg     720

tttgcaagat ctgcggcaag gcgttttcca gaccctggtt gcttcaagga cacattagaa     780

ctcacacggg ggagaagcct ttttcttgcc ctcactgcaa cagagcattt gcagacaggt     840

caaatctgag ggctcatctg cagacccatt ctgatgtaaa gaaataccag tgcaaaaact     900

gctccaaaac cttctccaga atgtctctcc tgcacaaaca tgaggaatct ggctgctgtg     960

tagcacactg agtgacgcaa tcaatgttta ctcgaacaga atgcatttct tcactccgaa    1020

gccaaatgac aaataaagtc caaaggcatt ttctcctgtg ctgaccaacc aaataatatg    1080

tatagacaca cacacatatg cacacacaca cacacacacc cacagagaga gagctgcaag    1140

agcatggaat tcatgtgttt aaagataatc ctttccatgt gaagtttaaa attactatat    1200

atttgctgat ggctagattg agagaataaa agacagtaac ctttctcttc aaagataaaa    1260

tgaaaagcac attgcatctt ttcttcctaa aaaaatgcaa agatttacat tgctgccaaa    1320

tcatttcaac tgaaaagaac agtattgctt tgtaatagag tctgtaatag gatttcccat    1380

aggaagagat ctgccagacg cgaactcagg tgccttaaaa agtattccaa gtttactcca    1440

ttacatgtcg gttgtctggt tgccattgtt gaactaaagc ctttttttga ttacctgtag    1500

tgctttaaag tatattttta aaagggagga aaaaaataac aagaacaaaa cacaggagaa    1560

tgtattaaaa gtatttttgt tttgttttgt ttttgccaat taacagtatg tgccttgggg    1620

gaggagggaa agattagctt tgaacattcc tggcgcatgc tccattgtct tactatttta    1680

aaacatttta ataatttttg aaaattaatt aaagatggga ataagtgcaa aagaggattc    1740

ttacaaattc attaatgtac ttaaactatt tcaaatgcat accacaaatg caataataca    1800

ataccccttc caagtgcctt tttaaattgt atagttgatg agtcaatgta aatttgtgtt    1860

tatttttata tgattgaatg agttctgtat gaaactgaga tgttgtctat agctatgtct    1920

ataaacaacc tgaagacttg tgaaatcaat gtttcttttt taaaaaacaa ttttcaagtt    1980

ttttttacaa taaacagttt tgatttaaaa tctcgtttgt atactatttt cagagacttt    2040

acttgcttca tgattagtac caaaccactg tacaaagaat tgtttgttaa caagaaaaaa    2100

a                                                                    2101


<210> 40
<211> 268
<212> PRT
<213> Homo sapiens

<220> 
<221> SOURCE
<222> 1..268
<223> /mol_type="protein"
      /note="SNAI2"
      /organism="Homo sapiens"

<400> 40
Met Pro Arg Ser Phe Leu Val Lys Lys His Phe Asn Ala Ser Lys Lys 
1               5                   10                   15    
Pro Asn Tyr Ser Glu Leu Asp Thr His Thr Val Ile Ile Ser Pro Tyr 
            20                   25                  30        
Leu Tyr Glu Ser Tyr Ser Met Pro Val Ile Pro Gln Pro Glu Ile Leu 
        35                   40                  45            
Ser Ser Gly Ala Tyr Ser Pro Ile Thr Val Trp Thr Thr Ala Ala Pro 
    50                   55                  60                
Phe His Ala Gln Leu Pro Asn Gly Leu Ser Pro Leu Ser Gly Tyr Ser 
65                   70                  75                  80
Ser Ser Leu Gly Arg Val Ser Pro Pro Pro Pro Ser Asp Thr Ser Ser 
                85                   90                  95    
Lys Asp His Ser Gly Ser Glu Ser Pro Ile Ser Asp Glu Glu Glu Arg 
            100                  105                110        
Leu Gln Ser Lys Leu Ser Asp Pro His Ala Ile Glu Ala Glu Lys Phe 
        115                  120                125            
Gln Cys Asn Leu Cys Asn Lys Thr Tyr Ser Thr Phe Ser Gly Leu Ala 
    130                  135                140                
Lys His Lys Gln Leu His Cys Asp Ala Gln Ser Arg Lys Ser Phe Ser 
145                  150                155                  160
Cys Lys Tyr Cys Asp Lys Glu Tyr Val Ser Leu Gly Ala Leu Lys Met 
                165                  170                175    
His Ile Arg Thr His Thr Leu Pro Cys Val Cys Lys Ile Cys Gly Lys 
            180                  185                190        
Ala Phe Ser Arg Pro Trp Leu Leu Gln Gly His Ile Arg Thr His Thr 
        195                  200                205            
Gly Glu Lys Pro Phe Ser Cys Pro His Cys Asn Arg Ala Phe Ala Asp 
    210                  215                220                
Arg Ser Asn Leu Arg Ala His Leu Gln Thr His Ser Asp Val Lys Lys 
225                  230                235                  240
Tyr Gln Cys Lys Asn Cys Ser Lys Thr Phe Ser Arg Met Ser Leu Leu 
                245                  250                255    
His Lys His Glu Glu Ser Gly Cys Cys Val Ala His 
            260                  265            

