                         SEQUENCE LISTING

<110>  University of South Florida
 
<120>  COMPOSITIONS AND METHODS FOR TREATING ENDOMETRIOSIS

<130>  292103-2530

<150>  US 62/081,464
<151>  2014-11-18

<160>  314   

<170>  PatentIn version 3.5

<210>  1
<211>  2031
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  mRNA sequence for human ATG7

<400>  1
atggcggcag ctacggggga tcctggactc tctaaactgc agtttgcccc ttttagtagt       60

gccttggatg ttgggttttg gcatgagttg acccagaaga agctgaacga gtatcggctg      120

gatgaagctc ccaaggacat taagggttat tactacaatg gtgactctgc tgggctgcca      180

gctcgcttaa cattggagtt cagtgctttt gacatgagtg ctcccacccc agcccgttgc      240

tgcccagcta ttggaacact gtataacacc aacacactcg agtctttcaa gactgcagat      300

aagaagctcc ttttggaaca agcagcaaat gagatatggg aatccataaa atcaggcact      360

gctcttgaaa accctgtact cctcaacaag ttcctcctct tgacatttgc agatctaaag      420

aagtaccact tctactattg gttttgctat cctgccctct gtcttccaga gagtttacct      480

ctcattcagg ggccagtggg tttggatcaa aggttttcac taaaacagat tgaagcacta      540

gagtgtgcat atgataatct ttgtcaaaca gaaggagtca cagctcttcc ttacttctta      600

atcaagtatg atgagaacat ggtgctggtt tccttgctta aacactacag tgatttcttc      660

caaggtcaaa ggacgaagat aacaattggt gtatatgatc cctgtaactt agcccagtac      720

cctggatggc ctttgaggaa ttttttggtc ctagcagccc acagatggag tagcagtttc      780

cagtctgttg aagttgtttg cttccgtgac cgtaccatgc agggggcgag agacgttgcc      840

cacagcatca tcttcgaagt gaagcttcca gaaatggcat ttagcccaga ttgtcctaaa      900

gcagttggat gggaaaagaa ccagaaagga ggcatgggac caaggatggt gaacctcagt      960

gaatgtatgg accctaaaag gttagctgag tcatcagtgg atctaaatct caaactgatg     1020

tgttggagat tggttcctac tttagacttg gacaaggttg tgtctgtcaa atgtctgctg     1080

cttggagccg gcaccttggg ttgcaatgta gctaggacgt tgatgggttg gggcgtgaga     1140

cacatcacat ttgtggacaa tgccaagatc tcctactcca atcctgtgag gcagcctctc     1200

tatgagtttg aagattgcct agggggtggt aagcccaagg ctctggcagc agcggaccgg     1260

ctccagaaaa tattccccgg tgtgaatgcc agaggattca acatgagcat acctatgcct     1320

gggcatccag tgaacttctc cagtgtcact ctggagcaag cccgcagaga tgtggagcaa     1380

ctggagcagc tcatcgaaag ccatgatgtc gtcttcctat tgatggacac cagggagagc     1440

cggtggcttc ctgccgtcat tgctgcaagc aagagaaagc tggtcatcaa tgctgctttg     1500

ggatttgaca catttgttgt catgagacat ggtctgaaga aaccaaagca gcaaggagct     1560

ggggacttgt gtccaaacca ccctgtggca tctgctgacc tcctgggctc atcgcttttt     1620

gccaacatcc ctggttacaa gcttggctgc tacttctgca atgatgtggt ggccccagga     1680

gattcaacca gagaccggac cttggaccag cagtgcactg tgagtcgtcc aggactggcc     1740

gtgattgcag gagccctggc cgtggaattg atggtatctg ttttgcagca tccagaaggg     1800

ggctatgcca ttgccagcag cagtgacgat cggatgaatg agcctccaac ctctcttggg     1860

cttgtgcctc accaggttct tgatcaatat gaacgagaag gatttaactt cctagccaag     1920

gtgtttaatt cttcacattc cttcttagaa gacttgactg gtcttacatt gctgcatcaa     1980

gaaacccaag ctgctgagat ctgggacatg agcgatgatg agaccatctg a              2031


<210>  2
<211>  676
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  polypeptide sequence for human ATG7

<400>  2

Met Ala Ala Ala Thr Gly Asp Pro Gly Leu Ser Lys Leu Gln Phe Ala 
1               5                   10                  15      


Pro Phe Ser Ser Ala Leu Asp Val Gly Phe Trp His Glu Leu Thr Gln 
            20                  25                  30          


Lys Lys Leu Asn Glu Tyr Arg Leu Asp Glu Ala Pro Lys Asp Ile Lys 
        35                  40                  45              


Gly Tyr Tyr Tyr Asn Gly Asp Ser Ala Gly Leu Pro Ala Arg Leu Thr 
    50                  55                  60                  


Leu Glu Phe Ser Ala Phe Asp Met Ser Ala Pro Thr Pro Ala Arg Cys 
65                  70                  75                  80  


Cys Pro Ala Ile Gly Thr Leu Tyr Asn Thr Asn Thr Leu Glu Ser Phe 
                85                  90                  95      


Lys Thr Ala Asp Lys Lys Leu Leu Leu Glu Gln Ala Ala Asn Glu Ile 
            100                 105                 110         


Trp Glu Ser Ile Lys Ser Gly Thr Ala Leu Glu Asn Pro Val Leu Leu 
        115                 120                 125             


Asn Lys Phe Leu Leu Leu Thr Phe Ala Asp Leu Lys Lys Tyr His Phe 
    130                 135                 140                 


Tyr Tyr Trp Phe Cys Tyr Pro Ala Leu Cys Leu Pro Glu Ser Leu Pro 
145                 150                 155                 160 


Leu Ile Gln Gly Pro Val Gly Leu Asp Gln Arg Phe Ser Leu Lys Gln 
                165                 170                 175     


Ile Glu Ala Leu Glu Cys Ala Tyr Asp Asn Leu Cys Gln Thr Glu Gly 
            180                 185                 190         


Val Thr Ala Leu Pro Tyr Phe Leu Ile Lys Tyr Asp Glu Asn Met Val 
        195                 200                 205             


Leu Val Ser Leu Leu Lys His Tyr Ser Asp Phe Phe Gln Gly Gln Arg 
    210                 215                 220                 


Thr Lys Ile Thr Ile Gly Val Tyr Asp Pro Cys Asn Leu Ala Gln Tyr 
225                 230                 235                 240 


Pro Gly Trp Pro Leu Arg Asn Phe Leu Val Leu Ala Ala His Arg Trp 
                245                 250                 255     


Ser Ser Ser Phe Gln Ser Val Glu Val Val Cys Phe Arg Asp Arg Thr 
            260                 265                 270         


Met Gln Gly Ala Arg Asp Val Ala His Ser Ile Ile Phe Glu Val Lys 
        275                 280                 285             


Leu Pro Glu Met Ala Phe Ser Pro Asp Cys Pro Lys Ala Val Gly Trp 
    290                 295                 300                 


Glu Lys Asn Gln Lys Gly Gly Met Gly Pro Arg Met Val Asn Leu Ser 
305                 310                 315                 320 


Glu Cys Met Asp Pro Lys Arg Leu Ala Glu Ser Ser Val Asp Leu Asn 
                325                 330                 335     


Leu Lys Leu Met Cys Trp Arg Leu Val Pro Thr Leu Asp Leu Asp Lys 
            340                 345                 350         


Val Val Ser Val Lys Cys Leu Leu Leu Gly Ala Gly Thr Leu Gly Cys 
        355                 360                 365             


Asn Val Ala Arg Thr Leu Met Gly Trp Gly Val Arg His Ile Thr Phe 
    370                 375                 380                 


Val Asp Asn Ala Lys Ile Ser Tyr Ser Asn Pro Val Arg Gln Pro Leu 
385                 390                 395                 400 


Tyr Glu Phe Glu Asp Cys Leu Gly Gly Gly Lys Pro Lys Ala Leu Ala 
                405                 410                 415     


Ala Ala Asp Arg Leu Gln Lys Ile Phe Pro Gly Val Asn Ala Arg Gly 
            420                 425                 430         


Phe Asn Met Ser Ile Pro Met Pro Gly His Pro Val Asn Phe Ser Ser 
        435                 440                 445             


Val Thr Leu Glu Gln Ala Arg Arg Asp Val Glu Gln Leu Glu Gln Leu 
    450                 455                 460                 


Ile Glu Ser His Asp Val Val Phe Leu Leu Met Asp Thr Arg Glu Ser 
465                 470                 475                 480 


Arg Trp Leu Pro Ala Val Ile Ala Ala Ser Lys Arg Lys Leu Val Ile 
                485                 490                 495     


Asn Ala Ala Leu Gly Phe Asp Thr Phe Val Val Met Arg His Gly Leu 
            500                 505                 510         


Lys Lys Pro Lys Gln Gln Gly Ala Gly Asp Leu Cys Pro Asn His Pro 
        515                 520                 525             


Val Ala Ser Ala Asp Leu Leu Gly Ser Ser Leu Phe Ala Asn Ile Pro 
    530                 535                 540                 


Gly Tyr Lys Leu Gly Cys Tyr Phe Cys Asn Asp Val Val Ala Pro Gly 
545                 550                 555                 560 


Asp Ser Thr Arg Asp Arg Thr Leu Asp Gln Gln Cys Thr Val Ser Arg 
                565                 570                 575     


Pro Gly Leu Ala Val Ile Ala Gly Ala Leu Ala Val Glu Leu Met Val 
            580                 585                 590         


Ser Val Leu Gln His Pro Glu Gly Gly Tyr Ala Ile Ala Ser Ser Ser 
        595                 600                 605             


Asp Asp Arg Met Asn Glu Pro Pro Thr Ser Leu Gly Leu Val Pro His 
    610                 615                 620                 


Gln Val Leu Asp Gln Tyr Glu Arg Glu Gly Phe Asn Phe Leu Ala Lys 
625                 630                 635                 640 


Val Phe Asn Ser Ser His Ser Phe Leu Glu Asp Leu Thr Gly Leu Thr 
                645                 650                 655     


Leu Leu His Gln Glu Thr Gln Ala Ala Glu Ile Trp Asp Met Ser Asp 
            660                 665                 670         


Asp Glu Thr Ile 
        675     


<210>  3
<211>  23
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  ATG7 specific siRNA

<400>  3
caaagugcuu acagugcagg uag                                               23


<210>  4
<211>  828
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  mRNA sequence for human ATG5

<400>  4
atgacagatg acaaagatgt gcttcgagat gtgtggtttg gacgaattcc aacttgtttc       60

acgctatatc aggatgagat aactgaaagg gaagcagaac catactattt gcttttgcca      120

agagtaagtt atttgacgtt ggtaactgac aaagtgaaaa agcactttca gaaggttatg      180

agacaagaag acattagtga gatatggttt gaatatgaag gcacaccact gaaatggcat      240

tatccaattg gtttgctatt tgatcttctt gcatcaagtt cagctcttcc ttggaacatc      300

acagtacatt ttaagagttt tccagaaaaa gaccttctgc actgtccatc taaggatgca      360

attgaagctc attttatgtc atgtatgaaa gaagctgatg ctttaaaaca taaaagtcaa      420

gtaatcaatg aaatgcagaa aaaagatcac aagcaactct ggatgggatt gcaaaatgac      480

agatttgacc agttttgggc catcaatcgg aaactcatgg aatatcctgc agaagaaaat      540

ggatttcgtt atatcccctt tagaatatat cagacaacga ctgaaagacc tttcattcag      600

aagctgtttc gtcctgtggc tgcagatgga cagttgcaca cactaggaga tctcctcaaa      660

gaagtttgtc cttctgctat tgatcctgaa gatggggaaa aaaagaatca agtgatgatt      720

catggaattg agccaatgtt ggaaacacct ctgcagtggc tgagtgaaca tctgagctac      780

ccggataatt ttcttcatat tagtatcatc ccacagccaa cagattga                   828


<210>  5
<211>  275
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  polypeptide sequence for human ATG5

<400>  5

Met Thr Asp Asp Lys Asp Val Leu Arg Asp Val Trp Phe Gly Arg Ile 
1               5                   10                  15      


Pro Thr Cys Phe Thr Leu Tyr Gln Asp Glu Ile Thr Glu Arg Glu Ala 
            20                  25                  30          


Glu Pro Tyr Tyr Leu Leu Leu Pro Arg Val Ser Tyr Leu Thr Leu Val 
        35                  40                  45              


Thr Asp Lys Val Lys Lys His Phe Gln Lys Val Met Arg Gln Glu Asp 
    50                  55                  60                  


Ile Ser Glu Ile Trp Phe Glu Tyr Glu Gly Thr Pro Leu Lys Trp His 
65                  70                  75                  80  


Tyr Pro Ile Gly Leu Leu Phe Asp Leu Leu Ala Ser Ser Ser Ala Leu 
                85                  90                  95      


Pro Trp Asn Ile Thr Val His Phe Lys Ser Phe Pro Glu Lys Asp Leu 
            100                 105                 110         


Leu His Cys Pro Ser Lys Asp Ala Ile Glu Ala His Phe Met Ser Cys 
        115                 120                 125             


Met Lys Glu Ala Asp Ala Leu Lys His Lys Ser Gln Val Ile Asn Glu 
    130                 135                 140                 


Met Gln Lys Lys Asp His Lys Gln Leu Trp Met Gly Leu Gln Asn Asp 
145                 150                 155                 160 


Arg Phe Asp Gln Phe Trp Ala Ile Asn Arg Lys Leu Met Glu Tyr Pro 
                165                 170                 175     


Ala Glu Glu Asn Gly Phe Arg Tyr Ile Pro Phe Arg Ile Tyr Gln Thr 
            180                 185                 190         


Thr Thr Glu Arg Pro Phe Ile Gln Lys Leu Phe Arg Pro Val Ala Ala 
        195                 200                 205             


Asp Gly Gln Leu His Thr Leu Gly Asp Leu Leu Lys Glu Val Cys Pro 
    210                 215                 220                 


Ser Ala Ile Asp Pro Glu Asp Gly Glu Lys Lys Asn Gln Val Met Ile 
225                 230                 235                 240 


His Gly Ile Glu Pro Met Leu Glu Thr Pro Leu Gln Trp Leu Ser Glu 
                245                 250                 255     


His Leu Ser Tyr Pro Asp Asn Phe Leu His Ile Ser Ile Ile Pro Gln 
            260                 265                 270         


Pro Thr Asp 
        275 


<210>  6
<211>  23
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  aacauucaacgcugucggugagu

<400>  6
aacauucaac gcugucggug agu                                               23


<210>  7
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ATG5 specific shRNA 1 coding region

<400>  7
ggcattatcc aattggttta                                                   20


<210>  8
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ATG5 specific shRNA 2 coding region

<400>  8
gcagaaccat actatttgct                                                   20


<210>  9
<211>  2664
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  mRNA sequence for human hVps34

<400>  9
atgggggaag cagagaagtt tcactacatc tatagttgtg acctggatat caacgtccag       60

cttaagatag gaagcttgga agggaagaga gaacaaaaga gttataaagc tgtcctggaa      120

gacccaatgt tgaagttctc aggactatat caagagacat gctctgatct ttatgttact      180

tgtcaagttt ttgcagaagg gaagcctttg gccttgccag tgagaacatc ctacaaagca      240

tttagtacaa gatggaactg gaatgaatgg ctgaaactac cagtaaaata ccctgacctg      300

cccaggaatg cccaagtggc cctcaccata tgggatgtgt atggtcccgg aaaagcagtg      360

cctgtaggag gaacaacggt ttcgctcttt ggaaaatacg gcatgtttcg ccaagggatg      420

catgacttga aagtctggcc taatgtagaa gcagatggat cagaacccac aaaaactcct      480

ggcagaacaa gtagcactct ctcagaagat cagatgagcc gtcttgccaa gctcaccaaa      540

gctcatcgac aaggacacat ggtgaaagta gattggctgg atagattgac atttagagaa      600

atagaaatga taaatgagag tgaaaaacga agttctaatt tcatgtacct gatggttgaa      660

tttcgatgtg tcaagtgtga tgataaggaa tatggtattg tttattatga aaaggacggt      720

gatgaatcat ctccaatttt aacaagtttt gaattagtga aagttcctga cccccagatg      780

tctatggaga atttagttga gagcaaacac cacaagcttg cccggagttt aagaagtgga      840

ccttctgacc acgatctgaa acccaatgct gccacgagag atcagttaaa tattattgtg      900

agttatccac caaccaagca acttacatat gaagaacaag atcttgtttg gaagtttaga      960

tattatctta cgaatcaaga aaaagccttg acaaaattct tgaaatgtgt taattgggat     1020

ctacctcaag aggccaaaca ggccttggaa cttctgggaa aatggaagcc gatggatgta     1080

gaggactcct tggagctgtt atcctctcat tacaccaacc caactgtgag gcgttatgct     1140

gttgcccggt tgcgacaggc cgatgatgag gatttgttga tgtacctatt acaattggtc     1200

caggctctca aatatgaaaa ttttgatgat ataaagaatg gattggaacc taccaagaag     1260

gatagtcaga gttcagtgtc agaaaatgtg tcaaattctg gaataaattc tgcagaaata     1320

gatagctccc aaattataac cagccccctt ccttcagtct cttcacctcc tcctgcatca     1380

aaaacaaaag aagttccaga tggcgaaaat ctggaacaag atctctgtac cttcttgata     1440

tcgagagcct gcaaaaactc aacactggct aattatttat actggtatgt gatagtggaa     1500

tgtgaagatc aagatactca gcagagagat ccaaagaccc atgagatgta cttgaacgta     1560

atgagaagat tcagccaagc attgttgaag ggtgataagt ctgtcagagt tatgcgttct     1620

ttgctggctg cacaacagac atttgtagat cggttggtgc atctaatgaa ggcagtacaa     1680

cgcgaaagtg gaaatcgtaa gaaaaagaat gagagactac aggcattgct tggagataat     1740

gaaaagatga atttgtcaga tgtggaactt atcccgttgc ctttagaacc ccaagtgaaa     1800

attagaggaa taattccgga aacagctaca ctgtttaaaa gtgcccttat gcctgcacag     1860

ttgtttttta agacggaaga tggaggcaaa tatccagtta tatttaagca tggagatgat     1920

ttacgtcaag atcaacttat tcttcaaatc atttcactca tggacaagct gttacggaaa     1980

gaaaatctgg acttgaaatt gacaccttat aaggtgttag ccaccagtac aaaacatggc     2040

ttcatgcagt ttatccagtc agttcctgtg gctgaagttc ttgatacaga gggaagcatt     2100

cagaactttt ttagaaaata tgcaccaagt gagaatgggc caaatgggat tagtgctgag     2160

gtcatggaca cttacgttaa aagctgtgct ggatattgcg tgatcaccta tatacttgga     2220

gttggagaca ggcacctgga taaccttttg ctaacaaaaa caggcaaact cttccacata     2280

gactttggat atattttggg tcgggatcca aagcctcttc ctccaccaat gaagctgaat     2340

aaagaaatgg tagaaggaat ggggggcaca cagagtgagc agtaccaaga gttccgtaaa     2400

cagtgttaca cggctttcct ccacctgcga aggtattcta atctgatttt gaacttgttt     2460

tccttgatgg ttgatgcaaa cattccagat attgcacttg aaccagataa aactgtgaaa     2520

aaggttcagg ataaattccg cttagacctg tcggatgaag aggctgtgca ttacatgcag     2580

agtctgattg atgagagtgt ccatgctctt tttgctgcag tggtggaaca gattcacaag     2640

tttgcccagt actggagaaa atga                                            2664


<210>  10
<211>  887
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  polypeptide sequence for human hVps34

<400>  10

Met Gly Glu Ala Glu Lys Phe His Tyr Ile Tyr Ser Cys Asp Leu Asp 
1               5                   10                  15      


Ile Asn Val Gln Leu Lys Ile Gly Ser Leu Glu Gly Lys Arg Glu Gln 
            20                  25                  30          


Lys Ser Tyr Lys Ala Val Leu Glu Asp Pro Met Leu Lys Phe Ser Gly 
        35                  40                  45              


Leu Tyr Gln Glu Thr Cys Ser Asp Leu Tyr Val Thr Cys Gln Val Phe 
    50                  55                  60                  


Ala Glu Gly Lys Pro Leu Ala Leu Pro Val Arg Thr Ser Tyr Lys Ala 
65                  70                  75                  80  


Phe Ser Thr Arg Trp Asn Trp Asn Glu Trp Leu Lys Leu Pro Val Lys 
                85                  90                  95      


Tyr Pro Asp Leu Pro Arg Asn Ala Gln Val Ala Leu Thr Ile Trp Asp 
            100                 105                 110         


Val Tyr Gly Pro Gly Lys Ala Val Pro Val Gly Gly Thr Thr Val Ser 
        115                 120                 125             


Leu Phe Gly Lys Tyr Gly Met Phe Arg Gln Gly Met His Asp Leu Lys 
    130                 135                 140                 


Val Trp Pro Asn Val Glu Ala Asp Gly Ser Glu Pro Thr Lys Thr Pro 
145                 150                 155                 160 


Gly Arg Thr Ser Ser Thr Leu Ser Glu Asp Gln Met Ser Arg Leu Ala 
                165                 170                 175     


Lys Leu Thr Lys Ala His Arg Gln Gly His Met Val Lys Val Asp Trp 
            180                 185                 190         


Leu Asp Arg Leu Thr Phe Arg Glu Ile Glu Met Ile Asn Glu Ser Glu 
        195                 200                 205             


Lys Arg Ser Ser Asn Phe Met Tyr Leu Met Val Glu Phe Arg Cys Val 
    210                 215                 220                 


Lys Cys Asp Asp Lys Glu Tyr Gly Ile Val Tyr Tyr Glu Lys Asp Gly 
225                 230                 235                 240 


Asp Glu Ser Ser Pro Ile Leu Thr Ser Phe Glu Leu Val Lys Val Pro 
                245                 250                 255     


Asp Pro Gln Met Ser Met Glu Asn Leu Val Glu Ser Lys His His Lys 
            260                 265                 270         


Leu Ala Arg Ser Leu Arg Ser Gly Pro Ser Asp His Asp Leu Lys Pro 
        275                 280                 285             


Asn Ala Ala Thr Arg Asp Gln Leu Asn Ile Ile Val Ser Tyr Pro Pro 
    290                 295                 300                 


Thr Lys Gln Leu Thr Tyr Glu Glu Gln Asp Leu Val Trp Lys Phe Arg 
305                 310                 315                 320 


Tyr Tyr Leu Thr Asn Gln Glu Lys Ala Leu Thr Lys Phe Leu Lys Cys 
                325                 330                 335     


Val Asn Trp Asp Leu Pro Gln Glu Ala Lys Gln Ala Leu Glu Leu Leu 
            340                 345                 350         


Gly Lys Trp Lys Pro Met Asp Val Glu Asp Ser Leu Glu Leu Leu Ser 
        355                 360                 365             


Ser His Tyr Thr Asn Pro Thr Val Arg Arg Tyr Ala Val Ala Arg Leu 
    370                 375                 380                 


Arg Gln Ala Asp Asp Glu Asp Leu Leu Met Tyr Leu Leu Gln Leu Val 
385                 390                 395                 400 


Gln Ala Leu Lys Tyr Glu Asn Phe Asp Asp Ile Lys Asn Gly Leu Glu 
                405                 410                 415     


Pro Thr Lys Lys Asp Ser Gln Ser Ser Val Ser Glu Asn Val Ser Asn 
            420                 425                 430         


Ser Gly Ile Asn Ser Ala Glu Ile Asp Ser Ser Gln Ile Ile Thr Ser 
        435                 440                 445             


Pro Leu Pro Ser Val Ser Ser Pro Pro Pro Ala Ser Lys Thr Lys Glu 
    450                 455                 460                 


Val Pro Asp Gly Glu Asn Leu Glu Gln Asp Leu Cys Thr Phe Leu Ile 
465                 470                 475                 480 


Ser Arg Ala Cys Lys Asn Ser Thr Leu Ala Asn Tyr Leu Tyr Trp Tyr 
                485                 490                 495     


Val Ile Val Glu Cys Glu Asp Gln Asp Thr Gln Gln Arg Asp Pro Lys 
            500                 505                 510         


Thr His Glu Met Tyr Leu Asn Val Met Arg Arg Phe Ser Gln Ala Leu 
        515                 520                 525             


Leu Lys Gly Asp Lys Ser Val Arg Val Met Arg Ser Leu Leu Ala Ala 
    530                 535                 540                 


Gln Gln Thr Phe Val Asp Arg Leu Val His Leu Met Lys Ala Val Gln 
545                 550                 555                 560 


Arg Glu Ser Gly Asn Arg Lys Lys Lys Asn Glu Arg Leu Gln Ala Leu 
                565                 570                 575     


Leu Gly Asp Asn Glu Lys Met Asn Leu Ser Asp Val Glu Leu Ile Pro 
            580                 585                 590         


Leu Pro Leu Glu Pro Gln Val Lys Ile Arg Gly Ile Ile Pro Glu Thr 
        595                 600                 605             


Ala Thr Leu Phe Lys Ser Ala Leu Met Pro Ala Gln Leu Phe Phe Lys 
    610                 615                 620                 


Thr Glu Asp Gly Gly Lys Tyr Pro Val Ile Phe Lys His Gly Asp Asp 
625                 630                 635                 640 


Leu Arg Gln Asp Gln Leu Ile Leu Gln Ile Ile Ser Leu Met Asp Lys 
                645                 650                 655     


Leu Leu Arg Lys Glu Asn Leu Asp Leu Lys Leu Thr Pro Tyr Lys Val 
            660                 665                 670         


Leu Ala Thr Ser Thr Lys His Gly Phe Met Gln Phe Ile Gln Ser Val 
        675                 680                 685             


Pro Val Ala Glu Val Leu Asp Thr Glu Gly Ser Ile Gln Asn Phe Phe 
    690                 695                 700                 


Arg Lys Tyr Ala Pro Ser Glu Asn Gly Pro Asn Gly Ile Ser Ala Glu 
705                 710                 715                 720 


Val Met Asp Thr Tyr Val Lys Ser Cys Ala Gly Tyr Cys Val Ile Thr 
                725                 730                 735     


Tyr Ile Leu Gly Val Gly Asp Arg His Leu Asp Asn Leu Leu Leu Thr 
            740                 745                 750         


Lys Thr Gly Lys Leu Phe His Ile Asp Phe Gly Tyr Ile Leu Gly Arg 
        755                 760                 765             


Asp Pro Lys Pro Leu Pro Pro Pro Met Lys Leu Asn Lys Glu Met Val 
    770                 775                 780                 


Glu Gly Met Gly Gly Thr Gln Ser Glu Gln Tyr Gln Glu Phe Arg Lys 
785                 790                 795                 800 


Gln Cys Tyr Thr Ala Phe Leu His Leu Arg Arg Tyr Ser Asn Leu Ile 
                805                 810                 815     


Leu Asn Leu Phe Ser Leu Met Val Asp Ala Asn Ile Pro Asp Ile Ala 
            820                 825                 830         


Leu Glu Pro Asp Lys Thr Val Lys Lys Val Gln Asp Lys Phe Arg Leu 
        835                 840                 845             


Asp Leu Ser Asp Glu Glu Ala Val His Tyr Met Gln Ser Leu Ile Asp 
    850                 855                 860                 


Glu Ser Val His Ala Leu Phe Ala Ala Val Val Glu Gln Ile His Lys 
865                 870                 875                 880 


Phe Ala Gln Tyr Trp Arg Lys 
                885         


<210>  11
<211>  3153
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  mRNA sequence for human ULK1 (also referred to as ATG1)

<400>  11
atggagcccg gccgcggcgg cacagagacc gtgggcaagt tcgagttctc ccgcaaggac       60

ctgatcggcc acggcgcctt cgcggtggtc ttcaagggcc gccaccgcga gaagcacgat      120

ttggaggtcg ccgtcaagtg cattaacaag aagaacctcg ccaagtctca gacgctgctg      180

gggaaggaaa tcaaaatcct gaaggaactg aaacatgaaa acatcgtggc cctgtacgac      240

ttccaggaaa tggctaattc tgtctacctg gttatggagt actgcaacgg tggggacctg      300

gccgactacc tgcacgccat gcgcacgctg agcgaggaca ccatcaggct cttcctgcag      360

cagatcgcgg gcgccatgcg gcttctgcac agcaaaggca tcatccaccg cgacctgaaa      420

ccgcagaaca tcctgctgtc caaccccgcc ggccgccgcg ccaaccccaa cagcatccgc      480

gtcaagatcg ctgacttcgg cttcgcgcgg tacctccaga gcaacatgat ggcggccaca      540

ctctgcggct cccccatgta catggccccc gaggtcatca tgtcccagca ctacgacggg      600

aaggcggacc tgtggagcat cggcaccatc gtctaccagt gcctgacggg gaaggcgccc      660

ttccaggcca gcagccccca ggacctgcgc ctgttctacg agaagaacaa gacgttggtc      720

cccaccatcc cccgggagac ctcggccccg ctgcggcagc tgctcctggc cctactgcaa      780

cgcaaccaca aggaccgcat ggacttcgat gagttttttc atcacccttt cctcgatgcc      840

agcccctcgg tcaggaaatc cccacccgtg cctgtgccct cgtacccaag ctcggggtcc      900

ggcagcagct ccagcagcag ctccacctcc cacctggcct ccccgccgtc cctgggcgag      960

atgcagcagc tgcagaagac cctggcctcc ccggctgaca ccgctggctt cctgcacagc     1020

tcccgggact ctggtggcag caaggactct tcctgtgaca cagacgactt cgtcatggtc     1080

cccgcgcagt ttccaggtga cctggtggct gaggcgccca gtgccaaacc cccgccagac     1140

agcctgatgt gcagtgggag ctcactggtg gcctctgcgg gcttggagag ccacggccgg     1200

accccatctc catccccacc ctgcagcagc tcccccagtc cctcaggccg ggctggcccg     1260

ttctccagca gcaggtgcgg cgcctctgtc cccatcccag tccccacgca ggtgcagaac     1320

taccagcgca ttgagcgaaa cctgcagtca cccacccagt tccaaacacc tcggtcctct     1380

gccatccgca ggtcaggcag caccagcccc ctgggctttg caagggccag cccctcgccc     1440

cctgcccacg ctgagcatgg aggcgtcctg gccaggaaga tgtctctggg tggaggccgg     1500

ccctacacgc catctcctca agttggaacc atccctgagc ggccaggctg gagcgggacg     1560

ccctccccac agggagctga gatgcggggt ggcaggtccc ctcgtccagg ctcctctgca     1620

cccgagcact ctccccgcac ttccgggctg ggctgccgcc tgcacagcgc ccccaacctg     1680

tctgacttgc acgtcgtccg ccccaagctg cccaaacccc ccacggaccc cctgggagct     1740

gtgttcagcc caccacaggc cagccctccc cagccgtccc acggcctgca gtcctgccgg     1800

aacctgcggg gctcacccaa gctgcccgac ttcctgcagc gaaaccccct gccccccatc     1860

ctgggctccc ccaccaaggc tgtgccctcc tttgacttcc cgaagacccc cagctcccag     1920

aacctgctgg ccctcctagc ccggcagggc gtggtgatga cgccccctcg aaaccggacg     1980

ctgcccgacc tctcggaggt gggacccttc catggtcagc cgttgggccc tggcctgcgg     2040

ccaggcgagg accccaaggg cccctttggc cggtctttca gcaccagccg cctcactgac     2100

ctgctcctta aggcggcgtt tgggacacaa gccccggacc cgggcagcac ggagagcctg     2160

caggagaagc ccatggagat cgcaccctca gctggctttg gagggagcct gcacccagga     2220

gcccgtgctg ggggcaccag cagcccttcc ccggtggtct tcaccgtggg ctctcccccg     2280

agcgggagca cgccccccca gggcccccgc accaggatgt tctcagcggg ccccactggc     2340

tctgccagct cttctgcccg ccacctggtg cctgggccct gcagcgaggc cccagcccct     2400

gagctccctg ctccaggaca cggctgcagc tttgccgacc ccattgctgc gaacctggag     2460

ggggctgtga ccttcgaggc ccccgacctc cctgaggaga ccctcatgga gcaagagcac     2520

acggagatcc tgcgtggcct gcgcttcacg ctgctgttcg tgcagcacgt cctggagatc     2580

gcagccctga agggcagcgc cagtgaggcg gcggggggcc ctgagtacca gctgcaggag     2640

agtgtggtgg ccgaccagat cagcctgctg agccgagaat ggggcttcgc ggaacagctg     2700

gtgctgtacc tgaaggtggc cgagctactg tcctccggcc tgcaaagtgc catcgaccag     2760

atccgggccg gcaagctctg cctgtcgtcc actgtgaagc aggtggtgcg caggctgaat     2820

gagctgtaca aggccagcgt ggtgtcctgc cagggcctga gcctgcggct gcagcgcttc     2880

ttcctggaca agcagcggct cctggaccgc attcacagca tcactgccga gaggctcatc     2940

ttcagccacg ctgtgcagat ggtgcagtcg gctgccctgg acgagatgtt ccagcaccgt     3000

gagggctgcg tcccacgcta ccacaaggcc ctgctgctcc tggaggggct gcagcacatg     3060

ctctcggacc aggccgacat cgagaacgtc accaagtgca agctgtgcat tgagcggaga     3120

ctctcggcgc tgctgactgg catctgtgcc tga                                  3153


<210>  12
<211>  1050
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  polypeptide sequence for human ULK1 (also referred to as ATG1)

<400>  12

Met Glu Pro Gly Arg Gly Gly Thr Glu Thr Val Gly Lys Phe Glu Phe 
1               5                   10                  15      


Ser Arg Lys Asp Leu Ile Gly His Gly Ala Phe Ala Val Val Phe Lys 
            20                  25                  30          


Gly Arg His Arg Glu Lys His Asp Leu Glu Val Ala Val Lys Cys Ile 
        35                  40                  45              


Asn Lys Lys Asn Leu Ala Lys Ser Gln Thr Leu Leu Gly Lys Glu Ile 
    50                  55                  60                  


Lys Ile Leu Lys Glu Leu Lys His Glu Asn Ile Val Ala Leu Tyr Asp 
65                  70                  75                  80  


Phe Gln Glu Met Ala Asn Ser Val Tyr Leu Val Met Glu Tyr Cys Asn 
                85                  90                  95      


Gly Gly Asp Leu Ala Asp Tyr Leu His Ala Met Arg Thr Leu Ser Glu 
            100                 105                 110         


Asp Thr Ile Arg Leu Phe Leu Gln Gln Ile Ala Gly Ala Met Arg Leu 
        115                 120                 125             


Leu His Ser Lys Gly Ile Ile His Arg Asp Leu Lys Pro Gln Asn Ile 
    130                 135                 140                 


Leu Leu Ser Asn Pro Ala Gly Arg Arg Ala Asn Pro Asn Ser Ile Arg 
145                 150                 155                 160 


Val Lys Ile Ala Asp Phe Gly Phe Ala Arg Tyr Leu Gln Ser Asn Met 
                165                 170                 175     


Met Ala Ala Thr Leu Cys Gly Ser Pro Met Tyr Met Ala Pro Glu Val 
            180                 185                 190         


Ile Met Ser Gln His Tyr Asp Gly Lys Ala Asp Leu Trp Ser Ile Gly 
        195                 200                 205             


Thr Ile Val Tyr Gln Cys Leu Thr Gly Lys Ala Pro Phe Gln Ala Ser 
    210                 215                 220                 


Ser Pro Gln Asp Leu Arg Leu Phe Tyr Glu Lys Asn Lys Thr Leu Val 
225                 230                 235                 240 


Pro Thr Ile Pro Arg Glu Thr Ser Ala Pro Leu Arg Gln Leu Leu Leu 
                245                 250                 255     


Ala Leu Leu Gln Arg Asn His Lys Asp Arg Met Asp Phe Asp Glu Phe 
            260                 265                 270         


Phe His His Pro Phe Leu Asp Ala Ser Pro Ser Val Arg Lys Ser Pro 
        275                 280                 285             


Pro Val Pro Val Pro Ser Tyr Pro Ser Ser Gly Ser Gly Ser Ser Ser 
    290                 295                 300                 


Ser Ser Ser Ser Thr Ser His Leu Ala Ser Pro Pro Ser Leu Gly Glu 
305                 310                 315                 320 


Met Gln Gln Leu Gln Lys Thr Leu Ala Ser Pro Ala Asp Thr Ala Gly 
                325                 330                 335     


Phe Leu His Ser Ser Arg Asp Ser Gly Gly Ser Lys Asp Ser Ser Cys 
            340                 345                 350         


Asp Thr Asp Asp Phe Val Met Val Pro Ala Gln Phe Pro Gly Asp Leu 
        355                 360                 365             


Val Ala Glu Ala Pro Ser Ala Lys Pro Pro Pro Asp Ser Leu Met Cys 
    370                 375                 380                 


Ser Gly Ser Ser Leu Val Ala Ser Ala Gly Leu Glu Ser His Gly Arg 
385                 390                 395                 400 


Thr Pro Ser Pro Ser Pro Pro Cys Ser Ser Ser Pro Ser Pro Ser Gly 
                405                 410                 415     


Arg Ala Gly Pro Phe Ser Ser Ser Arg Cys Gly Ala Ser Val Pro Ile 
            420                 425                 430         


Pro Val Pro Thr Gln Val Gln Asn Tyr Gln Arg Ile Glu Arg Asn Leu 
        435                 440                 445             


Gln Ser Pro Thr Gln Phe Gln Thr Pro Arg Ser Ser Ala Ile Arg Arg 
    450                 455                 460                 


Ser Gly Ser Thr Ser Pro Leu Gly Phe Ala Arg Ala Ser Pro Ser Pro 
465                 470                 475                 480 


Pro Ala His Ala Glu His Gly Gly Val Leu Ala Arg Lys Met Ser Leu 
                485                 490                 495     


Gly Gly Gly Arg Pro Tyr Thr Pro Ser Pro Gln Val Gly Thr Ile Pro 
            500                 505                 510         


Glu Arg Pro Gly Trp Ser Gly Thr Pro Ser Pro Gln Gly Ala Glu Met 
        515                 520                 525             


Arg Gly Gly Arg Ser Pro Arg Pro Gly Ser Ser Ala Pro Glu His Ser 
    530                 535                 540                 


Pro Arg Thr Ser Gly Leu Gly Cys Arg Leu His Ser Ala Pro Asn Leu 
545                 550                 555                 560 


Ser Asp Leu His Val Val Arg Pro Lys Leu Pro Lys Pro Pro Thr Asp 
                565                 570                 575     


Pro Leu Gly Ala Val Phe Ser Pro Pro Gln Ala Ser Pro Pro Gln Pro 
            580                 585                 590         


Ser His Gly Leu Gln Ser Cys Arg Asn Leu Arg Gly Ser Pro Lys Leu 
        595                 600                 605             


Pro Asp Phe Leu Gln Arg Asn Pro Leu Pro Pro Ile Leu Gly Ser Pro 
    610                 615                 620                 


Thr Lys Ala Val Pro Ser Phe Asp Phe Pro Lys Thr Pro Ser Ser Gln 
625                 630                 635                 640 


Asn Leu Leu Ala Leu Leu Ala Arg Gln Gly Val Val Met Thr Pro Pro 
                645                 650                 655     


Arg Asn Arg Thr Leu Pro Asp Leu Ser Glu Val Gly Pro Phe His Gly 
            660                 665                 670         


Gln Pro Leu Gly Pro Gly Leu Arg Pro Gly Glu Asp Pro Lys Gly Pro 
        675                 680                 685             


Phe Gly Arg Ser Phe Ser Thr Ser Arg Leu Thr Asp Leu Leu Leu Lys 
    690                 695                 700                 


Ala Ala Phe Gly Thr Gln Ala Pro Asp Pro Gly Ser Thr Glu Ser Leu 
705                 710                 715                 720 


Gln Glu Lys Pro Met Glu Ile Ala Pro Ser Ala Gly Phe Gly Gly Ser 
                725                 730                 735     


Leu His Pro Gly Ala Arg Ala Gly Gly Thr Ser Ser Pro Ser Pro Val 
            740                 745                 750         


Val Phe Thr Val Gly Ser Pro Pro Ser Gly Ser Thr Pro Pro Gln Gly 
        755                 760                 765             


Pro Arg Thr Arg Met Phe Ser Ala Gly Pro Thr Gly Ser Ala Ser Ser 
    770                 775                 780                 


Ser Ala Arg His Leu Val Pro Gly Pro Cys Ser Glu Ala Pro Ala Pro 
785                 790                 795                 800 


Glu Leu Pro Ala Pro Gly His Gly Cys Ser Phe Ala Asp Pro Ile Ala 
                805                 810                 815     


Ala Asn Leu Glu Gly Ala Val Thr Phe Glu Ala Pro Asp Leu Pro Glu 
            820                 825                 830         


Glu Thr Leu Met Glu Gln Glu His Thr Glu Ile Leu Arg Gly Leu Arg 
        835                 840                 845             


Phe Thr Leu Leu Phe Val Gln His Val Leu Glu Ile Ala Ala Leu Lys 
    850                 855                 860                 


Gly Ser Ala Ser Glu Ala Ala Gly Gly Pro Glu Tyr Gln Leu Gln Glu 
865                 870                 875                 880 


Ser Val Val Ala Asp Gln Ile Ser Leu Leu Ser Arg Glu Trp Gly Phe 
                885                 890                 895     


Ala Glu Gln Leu Val Leu Tyr Leu Lys Val Ala Glu Leu Leu Ser Ser 
            900                 905                 910         


Gly Leu Gln Ser Ala Ile Asp Gln Ile Arg Ala Gly Lys Leu Cys Leu 
        915                 920                 925             


Ser Ser Thr Val Lys Gln Val Val Arg Arg Leu Asn Glu Leu Tyr Lys 
    930                 935                 940                 


Ala Ser Val Val Ser Cys Gln Gly Leu Ser Leu Arg Leu Gln Arg Phe 
945                 950                 955                 960 


Phe Leu Asp Lys Gln Arg Leu Leu Asp Arg Ile His Ser Ile Thr Ala 
                965                 970                 975     


Glu Arg Leu Ile Phe Ser His Ala Val Gln Met Val Gln Ser Ala Ala 
            980                 985                 990         


Leu Asp Glu Met Phe Gln His Arg  Glu Gly Cys Val Pro  Arg Tyr His 
        995                 1000                 1005             


Lys Ala  Leu Leu Leu Leu Glu  Gly Leu Gln His Met  Leu Ser Asp 
    1010                 1015                 1020             


Gln Ala  Asp Ile Glu Asn Val  Thr Lys Cys Lys Leu  Cys Ile Glu 
    1025                 1030                 1035             


Arg Arg  Leu Ser Ala Leu Leu  Thr Gly Ile Cys Ala  
    1040                 1045                 1050 


<210>  13
<211>  1353
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  mRNA sequence for human beclin-1

<400>  13
atggaagggt ctaagacgtc caacaacagc accatgcagg tgagcttcgt gtgccagcgc       60

tgcagccagc ccctgaaact ggacacgagt ttcaagatcc tggaccgtgt caccatccag      120

gaactcacag ctccattact taccacagcc caggcgaaac caggagagac ccaggaggaa      180

gagactaact caggagagga gccatttatt gaaactcctc gccaggatgg tgtctctcgc      240

agattcatcc ccccagccag gatgatgtcc acagaaagtg ccaacagctt cactctgatt      300

ggggaggcat ctgatggcgg caccatggag aacctcagcc gaagactgaa ggtcactggg      360

gacctttttg acatcatgtc gggccagaca gatgtggatc acccactctg tgaggaatgc      420

acagatactc ttttagacca gctggacact cagctcaacg tcactgaaaa tgagtgtcag      480

aactacaaac gctgtttgga gatcttagag caaatgaatg aggatgacag tgaacagtta      540

cagatggagc taaaggagct ggcactagag gaggagaggc tgatccagga gctggaagac      600

gtggaaaaga accgcaagat agtggcagaa aatctcgaga aggtccaggc tgaggctgag      660

agactggatc aggaggaagc tcagtatcag agagaataca gtgaatttaa acgacagcag      720

ctggagctgg atgatgagct gaagagtgtt gaaaaccaga tgcgttatgc ccagacgcag      780

ctggataagc tgaagaaaac caacgtcttt aatgcaacct tccacatctg gcacagtgga      840

cagtttggca caatcaataa cttcaggctg ggtcgcctgc ccagtgttcc cgtggaatgg      900

aatgagatta atgctgcttg gggccagact gtgttgctgc tccatgctct ggccaataag      960

atgggtctga aatttcagag ataccgactt gttccttacg gaaaccattc atatctggag     1020

tctctgacag acaaatctaa ggagctgccg ttatactgtt ctggggggtt gcggtttttc     1080

tgggacaaca agtttgacca tgcaatggtg gctttcctgg actgtgtgca gcagttcaaa     1140

gaagaggttg agaaaggcga gacacgtttt tgtcttccct acaggatgga tgtggagaaa     1200

ggcaagattg aagacacagg aggcagtggc ggctcctatt ccatcaaaac ccagtttaac     1260

tctgaggagc agtggacaaa agctctcaag ttcatgctga cgaatcttaa gtggggtctt     1320

gcttgggtgt cctcacaatt ttataacaaa tga                                  1353


<210>  14
<211>  450
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  polypeptide sequence for human beclin-1

<400>  14

Met Glu Gly Ser Lys Thr Ser Asn Asn Ser Thr Met Gln Val Ser Phe 
1               5                   10                  15      


Val Cys Gln Arg Cys Ser Gln Pro Leu Lys Leu Asp Thr Ser Phe Lys 
            20                  25                  30          


Ile Leu Asp Arg Val Thr Ile Gln Glu Leu Thr Ala Pro Leu Leu Thr 
        35                  40                  45              


Thr Ala Gln Ala Lys Pro Gly Glu Thr Gln Glu Glu Glu Thr Asn Ser 
    50                  55                  60                  


Gly Glu Glu Pro Phe Ile Glu Thr Pro Arg Gln Asp Gly Val Ser Arg 
65                  70                  75                  80  


Arg Phe Ile Pro Pro Ala Arg Met Met Ser Thr Glu Ser Ala Asn Ser 
                85                  90                  95      


Phe Thr Leu Ile Gly Glu Ala Ser Asp Gly Gly Thr Met Glu Asn Leu 
            100                 105                 110         


Ser Arg Arg Leu Lys Val Thr Gly Asp Leu Phe Asp Ile Met Ser Gly 
        115                 120                 125             


Gln Thr Asp Val Asp His Pro Leu Cys Glu Glu Cys Thr Asp Thr Leu 
    130                 135                 140                 


Leu Asp Gln Leu Asp Thr Gln Leu Asn Val Thr Glu Asn Glu Cys Gln 
145                 150                 155                 160 


Asn Tyr Lys Arg Cys Leu Glu Ile Leu Glu Gln Met Asn Glu Asp Asp 
                165                 170                 175     


Ser Glu Gln Leu Gln Met Glu Leu Lys Glu Leu Ala Leu Glu Glu Glu 
            180                 185                 190         


Arg Leu Ile Gln Glu Leu Glu Asp Val Glu Lys Asn Arg Lys Ile Val 
        195                 200                 205             


Ala Glu Asn Leu Glu Lys Val Gln Ala Glu Ala Glu Arg Leu Asp Gln 
    210                 215                 220                 


Glu Glu Ala Gln Tyr Gln Arg Glu Tyr Ser Glu Phe Lys Arg Gln Gln 
225                 230                 235                 240 


Leu Glu Leu Asp Asp Glu Leu Lys Ser Val Glu Asn Gln Met Arg Tyr 
                245                 250                 255     


Ala Gln Thr Gln Leu Asp Lys Leu Lys Lys Thr Asn Val Phe Asn Ala 
            260                 265                 270         


Thr Phe His Ile Trp His Ser Gly Gln Phe Gly Thr Ile Asn Asn Phe 
        275                 280                 285             


Arg Leu Gly Arg Leu Pro Ser Val Pro Val Glu Trp Asn Glu Ile Asn 
    290                 295                 300                 


Ala Ala Trp Gly Gln Thr Val Leu Leu Leu His Ala Leu Ala Asn Lys 
305                 310                 315                 320 


Met Gly Leu Lys Phe Gln Arg Tyr Arg Leu Val Pro Tyr Gly Asn His 
                325                 330                 335     


Ser Tyr Leu Glu Ser Leu Thr Asp Lys Ser Lys Glu Leu Pro Leu Tyr 
            340                 345                 350         


Cys Ser Gly Gly Leu Arg Phe Phe Trp Asp Asn Lys Phe Asp His Ala 
        355                 360                 365             


Met Val Ala Phe Leu Asp Cys Val Gln Gln Phe Lys Glu Glu Val Glu 
    370                 375                 380                 


Lys Gly Glu Thr Arg Phe Cys Leu Pro Tyr Arg Met Asp Val Glu Lys 
385                 390                 395                 400 


Gly Lys Ile Glu Asp Thr Gly Gly Ser Gly Gly Ser Tyr Ser Ile Lys 
                405                 410                 415     


Thr Gln Phe Asn Ser Glu Glu Gln Trp Thr Lys Ala Leu Lys Phe Met 
            420                 425                 430         


Leu Thr Asn Leu Lys Trp Gly Leu Ala Trp Val Ser Ser Gln Phe Tyr 
        435                 440                 445             


Asn Lys 
    450 


<210>  15
<211>  1048
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  mRNA sequence for human LC3A (transcript variant 1) GenBank 
       accession no. NM_032514.3 GI: 377652328

<400>  15
atgttgtgac ctgacgtcac cgggcgagtt acctcccgca gccgcagccg ccgtgctcag       60

cgcgagcccc ggagcccttg agcgcgaggc gcggagcccc cggagccccc aaaccgcaga      120

cacatccccg cgccccagag ccccggcctg cgcgcccagc cgggcccgcg cgatgccctc      180

agaccggcct ttcaagcagc ggcggagctt cgccgaccgc tgtaaggagg tacagcagat      240

ccgcgaccag caccccagca aaatcccggt gatcatcgag cgctacaagg gtgagaagca      300

gctgcccgtc ctggacaaga ccaagttttt ggtcccggac catgtcaaca tgagcgagtt      360

ggtcaagatc atccggcgcc gcctgcagct gaaccccacg caggccttct tcctgctggt      420

gaaccagcac agcatggtga gtgtgtccac gcccatcgcg gacatctacg agcaggagaa      480

agacgaggac ggcttcctct atatggtcta cgcctcccag gaaaccttcg gcttctgagc      540

cagcagtagg ggggctcggc ctgggagtcg ggcggccccg gtcaggccct gcccagagag      600

ctcctggttc ctgaactgag ctgcctctac cgtggtgggc tgggcaggca tgtgcccccc      660

tagtcagagg gcaccaaccc acctactctg cccctgggtg gatcctgggc cggtcgtgtt      720

agggttgtcc ctctgggtgc tggctggtgg gatgggggag ggtggggagc agctcccagc      780

acccctgctg tgtggttcat ctttttttta ggcccctgcc tgtctgccca tctgcccctc      840

acccacccga ggctctgccc accgcctgga cctgcccacc cctgaaagac tggcccctgg      900

ctccccgccc ctcggtctcc acgtggtgta tggatctgtg gtcattgtcc ctctgcagaa      960

taaagattgc tcaggcctgc ctggcaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa     1020

aaaaaaaaaa aaaaaaaaaa aaaaaaaa                                        1048


<210>  16
<211>  121
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  polypeptide sequence for human LC3A

<400>  16

Met Pro Ser Asp Arg Pro Phe Lys Gln Arg Arg Ser Phe Ala Asp Arg 
1               5                   10                  15      


Cys Lys Glu Val Gln Gln Ile Arg Asp Gln His Pro Ser Lys Ile Pro 
            20                  25                  30          


Val Ile Ile Glu Arg Tyr Lys Gly Glu Lys Gln Leu Pro Val Leu Asp 
        35                  40                  45              


Lys Thr Lys Phe Leu Val Pro Asp His Val Asn Met Ser Glu Leu Val 
    50                  55                  60                  


Lys Ile Ile Arg Arg Arg Leu Gln Leu Asn Pro Thr Gln Ala Phe Phe 
65                  70                  75                  80  


Leu Leu Val Asn Gln His Ser Met Val Ser Val Ser Thr Pro Ile Ala 
                85                  90                  95      


Asp Ile Tyr Glu Gln Glu Lys Asp Glu Asp Gly Phe Leu Tyr Met Val 
            100                 105                 110         


Tyr Ala Ser Gln Glu Thr Phe Gly Phe 
        115                 120     


<210>  17
<211>  2304
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  LC3B mRNA, GenBank Accession No. NM_022818.4

<400>  17
acgctgcgtg ccgctgctgg gttccgccac gcccgtcatg gcggcggccc cggccggctc       60

tggccccgcc cctcggtgac gcgtcgcgag tcacctgacc aggctgcggg ctgaggagat      120

acaagggaag tggctatcgc cagagtcgga ttcgccgccg cagcagccgc cgcccccggg      180

agccgccggg accctcgcgt cgtcgccgcc gccgccgccc agatccctgc accatgccgt      240

cggagaagac cttcaagcag cgccgcacct tcgaacaaag agtagaagat gtccgactta      300

ttcgagagca gcatccaacc aaaatcccgg tgataataga acgatacaag ggtgagaagc      360

agcttcctgt tctggataaa acaaagttcc ttgtacctga ccatgtcaac atgagtgagc      420

tcatcaagat aattagaagg cgcttacagc tcaatgctaa tcaggccttc ttcctgttgg      480

tgaacggaca cagcatggtc agcgtctcca caccaatctc agaggtgtat gagagtgaga      540

aagatgaaga tggattcctg tacatggtct atgcctccca ggagacgttc gggatgaaat      600

tgtcagtgta aaaccagaaa aaatgcagct cttctagaat tgtttaaacc cttaccaagg      660

aaaaaaaagg gatgttacca actgagatcg atcagttcat ccaatcacag atcatgaaac      720

agtagtgttc ccacctagga gtgttaggaa gttgtgtttg tgtttcaagc agaaaaactg      780

agctccaagt gagcacattc agctttggaa actatattat ttaatgtagg ctagcttgtt      840

ttcaaatttt aaaagtttaa aaataaaata ctttgcattc taagttgcca ataaaataga      900

ccttcaagtt attttaatgc tcttttctca ctaataggaa cttgtaattc cagcagtaat      960

ttaaaggctt tcagagagac cctgagtctt ctcttcaggt tcacagaacc cgccgccttt     1020

ttgggtagaa gttttctact cagctagaga gatctcccta agaggatctt taggcctgag     1080

ttgtgaagcg caacccccgc aaaacgcatt tgccatcaca gttggcacaa acgcagggta     1140

aacgggctgt gtgagaaaac ggccctgact gtaaactgct gaaggtccct gactcctaag     1200

agaaccacac ccaaagtcct cactcttgca ggggtagaca tttctggttt ggtttgttct     1260

ctagatagtt acacacataa agacaccact caaaaggaaa cttgaataat ttataatttt     1320

gatcgagttt cttaaaagac cctggagaaa gagtggcatt tcttctgttt caggttttgt     1380

ctgagttcaa actagtgcct gtgttgttac ggaaagcagc agtgtaccag tgtcactctg     1440

gagtacagcg ggagaaacac aaaatagtat aactgaaaac attaacattc agacacactc     1500

ccttctgcct tccggcttaa agctgtggat gatccacgtt tttgtttttt taatgttaaa     1560

tgtgtaactc agtattactg aaaaggtacc cacattttga atagtagtta tcactcttag     1620

gtcagacagc catcagaatt ctcccacacc aagtgcatgt cagttgtgga gaaaacatag     1680

caaaaagagc cgtacgctct ttacagatac taatgtcaag agttaaacct cctcaggttc     1740

aacctgtgat aaaagactag tgcttcccag tacttgcatg gggttcacta tttatagttt     1800

tcttgggagt atcacaggaa aatcacaatt acaccacttt agaccctatg tgtagcaggt     1860

cacaacttac ccttgtgtgt ttagatgtgt atgaaatacc tgtatacgtt agtgaaagct     1920

gtttactgta acggggaaaa ccagattctt tgcatctggg ccctctactg attgttaaag     1980

gagttcctgt cacctgctcc ccccaccccc gcatgcgtct gtccacttgg ctaactttta     2040

atatgtgtat ttttacatta tgtatattct taactggact gtctcgttta gactgtatac     2100

atcatatctg acattattgt aactaccgtg tgatcagtaa gattcctgta agaaatactg     2160

ctttttaaga aaaaaaataa catgctgagg ggtgacctat atcccatgtg agtggtcact     2220

ttatttatag gatctttaaa acatttttaa tgaactaagt tgaataaagg cacaattaaa     2280

aactgtcaaa aaaaaaaaaa aaaa                                            2304


<210>  18
<211>  125
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human LC3B polypeptide sequence

<400>  18

Met Pro Ser Glu Lys Thr Phe Lys Gln Arg Arg Thr Phe Glu Gln Arg 
1               5                   10                  15      


Val Glu Asp Val Arg Leu Ile Arg Glu Gln His Pro Thr Lys Ile Pro 
            20                  25                  30          


Val Ile Ile Glu Arg Tyr Lys Gly Glu Lys Gln Leu Pro Val Leu Asp 
        35                  40                  45              


Lys Thr Lys Phe Leu Val Pro Asp His Val Asn Met Ser Glu Leu Ile 
    50                  55                  60                  


Lys Ile Ile Arg Arg Arg Leu Gln Leu Asn Ala Asn Gln Ala Phe Phe 
65                  70                  75                  80  


Leu Leu Val Asn Gly His Ser Met Val Ser Val Ser Thr Pro Ile Ser 
                85                  90                  95      


Glu Val Tyr Glu Ser Glu Lys Asp Glu Asp Gly Phe Leu Tyr Met Val 
            100                 105                 110         


Tyr Ala Ser Gln Glu Thr Phe Gly Met Lys Leu Ser Val 
        115                 120                 125 


<210>  19
<211>  125
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens microtubule-associated protein 1 light chain 3 alpha
       (MAP1LC3A), transcript variant 2, polypeptide NCBI Reference 
       Sequence: NM_181509.2

<400>  19

Met Lys Met Arg Phe Phe Ser Ser Pro Cys Gly Lys Ala Ala Val Asp 
1               5                   10                  15      


Pro Ala Asp Arg Cys Lys Glu Val Gln Gln Ile Arg Asp Gln His Pro 
            20                  25                  30          


Ser Lys Ile Pro Val Ile Ile Glu Arg Tyr Lys Gly Glu Lys Gln Leu 
        35                  40                  45              


Pro Val Leu Asp Lys Thr Lys Phe Leu Val Pro Asp His Val Asn Met 
    50                  55                  60                  


Ser Glu Leu Val Lys Ile Ile Arg Arg Arg Leu Gln Leu Asn Pro Thr 
65                  70                  75                  80  


Gln Ala Phe Phe Leu Leu Val Asn Gln His Ser Met Val Ser Val Ser 
                85                  90                  95      


Thr Pro Ile Ala Asp Ile Tyr Glu Gln Glu Lys Asp Glu Asp Gly Phe 
            100                 105                 110         


Leu Tyr Met Val Tyr Ala Ser Gln Glu Thr Phe Gly Phe 
        115                 120                 125 


<210>  20
<211>  994
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens microtubule-associated protein 1 light chain 3 alpha
       (MAP1LC3A), transcript variant 2, mRNA NCBI Reference Sequence: 
       NM_181509.2

<400>  20
gccaggtgca aggagaaagg attttgagga ggggactcca tggcttccga gttgctgact       60

gaccctccac ctcagaggta gttctgacac tgtctcagtt ttgcagatga agatgagatt      120

cttcagttct ccatgtggaa aagcagctgt ggacccagcc gaccgctgta aggaggtaca      180

gcagatccgc gaccagcacc ccagcaaaat cccggtgatc atcgagcgct acaagggtga      240

gaagcagctg cccgtcctgg acaagaccaa gtttttggtc ccggaccatg tcaacatgag      300

cgagttggtc aagatcatcc ggcgccgcct gcagctgaac cccacgcagg ccttcttcct      360

gctggtgaac cagcacagca tggtgagtgt gtccacgccc atcgcggaca tctacgagca      420

ggagaaagac gaggacggct tcctctatat ggtctacgcc tcccaggaaa ccttcggctt      480

ctgagccagc agtagggggg ctcggcctgg gagtcgggcg gccccggtca ggccctgccc      540

agagagctcc tggttcctga actgagctgc ctctaccgtg gtgggctggg caggcatgtg      600

cccccctagt cagagggcac caacccacct actctgcccc tgggtggatc ctgggccggt      660

cgtgttaggg ttgtccctct gggtgctggc tggtgggatg ggggagggtg gggagcagct      720

cccagcaccc ctgctgtgtg gttcatcttt tttttaggcc cctgcctgtc tgcccatctg      780

cccctcaccc acccgaggct ctgcccaccg cctggacctg cccacccctg aaagactggc      840

ccctggctcc ccgcccctcg gtctccacgt ggtgtatgga tctgtggtca ttgtccctct      900

gcagaataaa gattgctcag gcctgcctgg caaaaaaaaa aaaaaaaaaa aaaaaaaaaa      960

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa                                  994


<210>  21
<211>  2304
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens microtubule-associated protein 1 light chain 3 beta 
       (MAP1LC3B), mRNA NCBI Reference Sequence: NM_022818.4

<400>  21
acgctgcgtg ccgctgctgg gttccgccac gcccgtcatg gcggcggccc cggccggctc       60

tggccccgcc cctcggtgac gcgtcgcgag tcacctgacc aggctgcggg ctgaggagat      120

acaagggaag tggctatcgc cagagtcgga ttcgccgccg cagcagccgc cgcccccggg      180

agccgccggg accctcgcgt cgtcgccgcc gccgccgccc agatccctgc accatgccgt      240

cggagaagac cttcaagcag cgccgcacct tcgaacaaag agtagaagat gtccgactta      300

ttcgagagca gcatccaacc aaaatcccgg tgataataga acgatacaag ggtgagaagc      360

agcttcctgt tctggataaa acaaagttcc ttgtacctga ccatgtcaac atgagtgagc      420

tcatcaagat aattagaagg cgcttacagc tcaatgctaa tcaggccttc ttcctgttgg      480

tgaacggaca cagcatggtc agcgtctcca caccaatctc agaggtgtat gagagtgaga      540

aagatgaaga tggattcctg tacatggtct atgcctccca ggagacgttc gggatgaaat      600

tgtcagtgta aaaccagaaa aaatgcagct cttctagaat tgtttaaacc cttaccaagg      660

aaaaaaaagg gatgttacca actgagatcg atcagttcat ccaatcacag atcatgaaac      720

agtagtgttc ccacctagga gtgttaggaa gttgtgtttg tgtttcaagc agaaaaactg      780

agctccaagt gagcacattc agctttggaa actatattat ttaatgtagg ctagcttgtt      840

ttcaaatttt aaaagtttaa aaataaaata ctttgcattc taagttgcca ataaaataga      900

ccttcaagtt attttaatgc tcttttctca ctaataggaa cttgtaattc cagcagtaat      960

ttaaaggctt tcagagagac cctgagtctt ctcttcaggt tcacagaacc cgccgccttt     1020

ttgggtagaa gttttctact cagctagaga gatctcccta agaggatctt taggcctgag     1080

ttgtgaagcg caacccccgc aaaacgcatt tgccatcaca gttggcacaa acgcagggta     1140

aacgggctgt gtgagaaaac ggccctgact gtaaactgct gaaggtccct gactcctaag     1200

agaaccacac ccaaagtcct cactcttgca ggggtagaca tttctggttt ggtttgttct     1260

ctagatagtt acacacataa agacaccact caaaaggaaa cttgaataat ttataatttt     1320

gatcgagttt cttaaaagac cctggagaaa gagtggcatt tcttctgttt caggttttgt     1380

ctgagttcaa actagtgcct gtgttgttac ggaaagcagc agtgtaccag tgtcactctg     1440

gagtacagcg ggagaaacac aaaatagtat aactgaaaac attaacattc agacacactc     1500

ccttctgcct tccggcttaa agctgtggat gatccacgtt tttgtttttt taatgttaaa     1560

tgtgtaactc agtattactg aaaaggtacc cacattttga atagtagtta tcactcttag     1620

gtcagacagc catcagaatt ctcccacacc aagtgcatgt cagttgtgga gaaaacatag     1680

caaaaagagc cgtacgctct ttacagatac taatgtcaag agttaaacct cctcaggttc     1740

aacctgtgat aaaagactag tgcttcccag tacttgcatg gggttcacta tttatagttt     1800

tcttgggagt atcacaggaa aatcacaatt acaccacttt agaccctatg tgtagcaggt     1860

cacaacttac ccttgtgtgt ttagatgtgt atgaaatacc tgtatacgtt agtgaaagct     1920

gtttactgta acggggaaaa ccagattctt tgcatctggg ccctctactg attgttaaag     1980

gagttcctgt cacctgctcc ccccaccccc gcatgcgtct gtccacttgg ctaactttta     2040

atatgtgtat ttttacatta tgtatattct taactggact gtctcgttta gactgtatac     2100

atcatatctg acattattgt aactaccgtg tgatcagtaa gattcctgta agaaatactg     2160

ctttttaaga aaaaaaataa catgctgagg ggtgacctat atcccatgtg agtggtcact     2220

ttatttatag gatctttaaa acatttttaa tgaactaagt tgaataaagg cacaattaaa     2280

aactgtcaaa aaaaaaaaaa aaaa                                            2304


<210>  22
<211>  3910
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human ATG9 mRNA transcript variant 1 GenBank accession no. 
       NM_001077198.2 GI: 544583527

<400>  22
gttgtgtctc cgccccccgt gcttattggc cagcctggct gcgggtgaca gtgagtggca       60

gacacccggc ctagcgccgc gggtcgcgcc gagccgagcc gagccgagcg gagccggcgg      120

agcctctgga atcacccggg tcgctgttcc tgagcagctg cagagcatcg agggctggag      180

aggagcacat actgtccatg gagctggtgg tcaaggtgga cagggggcgg tggtgatggc      240

gcagtttgac actgaatacc agcgcctaga ggcctcctat agtgattcac ccccagggga      300

ggaggacctg ttggtgcacg tcgccgaggg gagcaagtca ccttggcacc atattgaaaa      360

ccttgacctc ttcttctctc gagtttataa tctgcaccag aagaatggct tcacatgtat      420

gctcatcggg gagatctttg agctcatgca gttcctcttt gtggttgcct tcactacctt      480

cctggtcagc tgcgtggact atgacatcct atttgccaac aagatggtga accacagtct      540

tcaccctact gaacccgtca aggtcactct gccagacgcc tttttgcctg ctcaagtctg      600

tagtgccagg attcaggaaa atggctccct tatcaccatc ctggtcattg ctggtgtctt      660

ctggatccac cggcttatca agttcatcta taacatttgc tgctactggg agatccactc      720

cttctacctg cacgctctgc gcatccctat gtctgccctt ccgtattgca cgtggcaaga      780

agtgcaggcc cggatcgtgc agacgcagaa ggagcaccag atctgcatcc acaaacgtga      840

gctgacagaa ctggacatct accaccgcat cctccgtttc cagaactaca tggtggcact      900

ggttaacaaa tccctcctgc ctctgcgctt ccgcctgcct ggcctcgggg aagctgtctt      960

cttcacccgt ggtctcaagt acaactttga gctgatcctc ttctggggac ctggctctct     1020

gtttctcaat gaatggagcc tcaaggccga gtacaaacgt ggggggcaac ggctagagct     1080

ggcccagcgc ctcagcaacc gcatcctgtg gattggcatc gctaacttcc tgctgtgccc     1140

cctcatcctc atatggcaaa tcctctatgc cttcttcagc tatgctgagg tgctgaagcg     1200

ggagccgggg gccctgggag cacgctgctg gtcactctat ggccgctgct acctccgcca     1260

cttcaacgag ctggagcacg agctgcagtc ccgcctcaac cgtggctaca agcccgcctc     1320

caagtacatg aattgcttct tgtcacctct tttgacactg ctggccaaga atggagcctt     1380

cttcgctggc tccatcctgg ctgtgcttat tgccctcacc atttatgacg aagatgtgtt     1440

ggctgtggaa catgtgctga ccaccgtcac actcctgggg gtcaccgtga ccgtgtgcag     1500

gtcctttatc ccggaccagc acatggtgtt ctgccctgag cagctgctcc gcgtgatcct     1560

cgctcacatc cactacatgc ctgaccactg gcagggtaat gcccaccgct cgcagacccg     1620

ggacgagttt gcccagctct tccagtacaa ggcagtgttc attttggaag agttgctgag     1680

ccccattgtc acacccctca tcctcatctt ctgcctgcgc ccacgggccc tggagattat     1740

agacttcttc cgaaacttca ccgtggaggt cgttggtgtg ggagatacct gctcctttgc     1800

tcagatggat gttcgccagc atggtcatcc ccagtggcta tctgctgggc agacagaggc     1860

ctcagtgtac cagcaagctg aggatggaaa gacagagttg tcactcatgc actttgccat     1920

caccaaccct ggctggcagc caccacgtga gagcacagcc ttcctaggct tcctcaagga     1980

gcaggttcag cgggatggag cagctgctag cctcgcccaa gggggtctgc tccctgaaaa     2040

tgccctcttt acgtctatcc agtccttaca atctgagtct gagcccctga gccttatcgc     2100

aaatgtggta gctggctcat cctgccgggg ccctccactg cccagagacc tgcagggctc     2160

caggcacagg gctgaagtcg cctctgccct gcgctccttc tccccgctgc aacccgggca     2220

ggcgcccaca ggccgggctc acagcaccat gacaggctct ggggtggatg ccaggacagc     2280

cagctccggg agcagcgtgt gggaaggaca gctgcagagc ctggtgctgt cagaatatgc     2340

atccacagag atgagcctgc atgccctcta tatgcaccag ctccacaagc agcaggccca     2400

ggctgaacct gagcggcatg tatggcaccg ccgggagagt gatgagagtg gagaaagcgc     2460

ccctgatgaa gggggagagg gcgcccgggc cccccagtct atccctcgct ctgctagcta     2520

tccctgtgca gcaccccggc ctggagctcc tgagaccact gccctgcatg ggggcttcca     2580

gaggcgctac ggtggcatca cagatcctgg cacagtgccc agggttccct ctcatttctc     2640

tcggctgcct cttggagggt gggcagaaga tgggcagtcg gcatcaaggc accctgagcc     2700

cgtgcccgaa gagggctcgg aggatgagct accccctcag gtgcacaagg tatagacaag     2760

gctgagcagg gttcctgtgg cccaggatgg aggccaccgc tgccctgcca tcccgtctgc     2820

ctgccatggg acggctcctc tgagtgttcc ctggccccac gtgtgtggtg tttgtgtgtc     2880

tgtgcctggc caagggaggt gccaacactg ggcttgccac agccccagga gaggaatttg     2940

gggcctagga accgagggca cacgggactc tagcctcatc cccaggaccc ccttggctca     3000

gagtgtggtg ctagaaactg gcccccagcc cagccccagt actgccacct ttacacctac     3060

ccctgcaagt ccccagaggg ctgcccacga tagaagctgc caagcaggga gaacctgtgc     3120

caactgtgga gtggggaggt tgggcctgga ccctcaaccc ctgcaacctt ccctagcccc     3180

ctcaatagat gagcaggtca ggctgtggcc cttacctcac ccgcagttct cgcccagtgc     3240

tgcagccggc tcacctctct ccgcttcttg cacatcactg gcctgtgtgt gctgcttgct     3300

cctgttctgt tcgcttgctc ccgttccgtt cggcttttgc tttgcgttag ggtgaagacc     3360

ctagcgtcca gctcccctca acgctatatt ttgacactaa aaaagaaggt ttctaaattg     3420

taggagcagg atggaaatac tttgctgccc ttgccatctt ttaggatggg cccccaggag     3480

actgaggtct tcctgggccc tcattgctgc ttatcgtacc ccccatcacc tgcacatggg     3540

acagaccggg ctggagggtg accttggctg tgtgcgtccc agcaaaagag ctctggcccg     3600

catctcgctg tgccctgaag ggggatgaag ggcgatgcct cgcccgaggc tttgggctgc     3660

tgcactgcat gctgggactg ctcctactct ctgtcccacc cctcacccag ctgtggtccg     3720

gctttgggag agtggtgaat tgcgctgccc gaactcggag cggagcaggg tagggaccgt     3780

gtacagcttg ataaccctta ataaaaaggg agtttgacca gaaaaaaaaa aaaaaaaaaa     3840

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa     3900

aaaaaaaaaa                                                            3910


<210>  23
<211>  839
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human ATG9 polypeptide encoded by mRNA transcript variant 1, NCBI
       Protein ID NP_001070666.1

<400>  23

Met Ala Gln Phe Asp Thr Glu Tyr Gln Arg Leu Glu Ala Ser Tyr Ser 
1               5                   10                  15      


Asp Ser Pro Pro Gly Glu Glu Asp Leu Leu Val His Val Ala Glu Gly 
            20                  25                  30          


Ser Lys Ser Pro Trp His His Ile Glu Asn Leu Asp Leu Phe Phe Ser 
        35                  40                  45              


Arg Val Tyr Asn Leu His Gln Lys Asn Gly Phe Thr Cys Met Leu Ile 
    50                  55                  60                  


Gly Glu Ile Phe Glu Leu Met Gln Phe Leu Phe Val Val Ala Phe Thr 
65                  70                  75                  80  


Thr Phe Leu Val Ser Cys Val Asp Tyr Asp Ile Leu Phe Ala Asn Lys 
                85                  90                  95      


Met Val Asn His Ser Leu His Pro Thr Glu Pro Val Lys Val Thr Leu 
            100                 105                 110         


Pro Asp Ala Phe Leu Pro Ala Gln Val Cys Ser Ala Arg Ile Gln Glu 
        115                 120                 125             


Asn Gly Ser Leu Ile Thr Ile Leu Val Ile Ala Gly Val Phe Trp Ile 
    130                 135                 140                 


His Arg Leu Ile Lys Phe Ile Tyr Asn Ile Cys Cys Tyr Trp Glu Ile 
145                 150                 155                 160 


His Ser Phe Tyr Leu His Ala Leu Arg Ile Pro Met Ser Ala Leu Pro 
                165                 170                 175     


Tyr Cys Thr Trp Gln Glu Val Gln Ala Arg Ile Val Gln Thr Gln Lys 
            180                 185                 190         


Glu His Gln Ile Cys Ile His Lys Arg Glu Leu Thr Glu Leu Asp Ile 
        195                 200                 205             


Tyr His Arg Ile Leu Arg Phe Gln Asn Tyr Met Val Ala Leu Val Asn 
    210                 215                 220                 


Lys Ser Leu Leu Pro Leu Arg Phe Arg Leu Pro Gly Leu Gly Glu Ala 
225                 230                 235                 240 


Val Phe Phe Thr Arg Gly Leu Lys Tyr Asn Phe Glu Leu Ile Leu Phe 
                245                 250                 255     


Trp Gly Pro Gly Ser Leu Phe Leu Asn Glu Trp Ser Leu Lys Ala Glu 
            260                 265                 270         


Tyr Lys Arg Gly Gly Gln Arg Leu Glu Leu Ala Gln Arg Leu Ser Asn 
        275                 280                 285             


Arg Ile Leu Trp Ile Gly Ile Ala Asn Phe Leu Leu Cys Pro Leu Ile 
    290                 295                 300                 


Leu Ile Trp Gln Ile Leu Tyr Ala Phe Phe Ser Tyr Ala Glu Val Leu 
305                 310                 315                 320 


Lys Arg Glu Pro Gly Ala Leu Gly Ala Arg Cys Trp Ser Leu Tyr Gly 
                325                 330                 335     


Arg Cys Tyr Leu Arg His Phe Asn Glu Leu Glu His Glu Leu Gln Ser 
            340                 345                 350         


Arg Leu Asn Arg Gly Tyr Lys Pro Ala Ser Lys Tyr Met Asn Cys Phe 
        355                 360                 365             


Leu Ser Pro Leu Leu Thr Leu Leu Ala Lys Asn Gly Ala Phe Phe Ala 
    370                 375                 380                 


Gly Ser Ile Leu Ala Val Leu Ile Ala Leu Thr Ile Tyr Asp Glu Asp 
385                 390                 395                 400 


Val Leu Ala Val Glu His Val Leu Thr Thr Val Thr Leu Leu Gly Val 
                405                 410                 415     


Thr Val Thr Val Cys Arg Ser Phe Ile Pro Asp Gln His Met Val Phe 
            420                 425                 430         


Cys Pro Glu Gln Leu Leu Arg Val Ile Leu Ala His Ile His Tyr Met 
        435                 440                 445             


Pro Asp His Trp Gln Gly Asn Ala His Arg Ser Gln Thr Arg Asp Glu 
    450                 455                 460                 


Phe Ala Gln Leu Phe Gln Tyr Lys Ala Val Phe Ile Leu Glu Glu Leu 
465                 470                 475                 480 


Leu Ser Pro Ile Val Thr Pro Leu Ile Leu Ile Phe Cys Leu Arg Pro 
                485                 490                 495     


Arg Ala Leu Glu Ile Ile Asp Phe Phe Arg Asn Phe Thr Val Glu Val 
            500                 505                 510         


Val Gly Val Gly Asp Thr Cys Ser Phe Ala Gln Met Asp Val Arg Gln 
        515                 520                 525             


His Gly His Pro Gln Trp Leu Ser Ala Gly Gln Thr Glu Ala Ser Val 
    530                 535                 540                 


Tyr Gln Gln Ala Glu Asp Gly Lys Thr Glu Leu Ser Leu Met His Phe 
545                 550                 555                 560 


Ala Ile Thr Asn Pro Gly Trp Gln Pro Pro Arg Glu Ser Thr Ala Phe 
                565                 570                 575     


Leu Gly Phe Leu Lys Glu Gln Val Gln Arg Asp Gly Ala Ala Ala Ser 
            580                 585                 590         


Leu Ala Gln Gly Gly Leu Leu Pro Glu Asn Ala Leu Phe Thr Ser Ile 
        595                 600                 605             


Gln Ser Leu Gln Ser Glu Ser Glu Pro Leu Ser Leu Ile Ala Asn Val 
    610                 615                 620                 


Val Ala Gly Ser Ser Cys Arg Gly Pro Pro Leu Pro Arg Asp Leu Gln 
625                 630                 635                 640 


Gly Ser Arg His Arg Ala Glu Val Ala Ser Ala Leu Arg Ser Phe Ser 
                645                 650                 655     


Pro Leu Gln Pro Gly Gln Ala Pro Thr Gly Arg Ala His Ser Thr Met 
            660                 665                 670         


Thr Gly Ser Gly Val Asp Ala Arg Thr Ala Ser Ser Gly Ser Ser Val 
        675                 680                 685             


Trp Glu Gly Gln Leu Gln Ser Leu Val Leu Ser Glu Tyr Ala Ser Thr 
    690                 695                 700                 


Glu Met Ser Leu His Ala Leu Tyr Met His Gln Leu His Lys Gln Gln 
705                 710                 715                 720 


Ala Gln Ala Glu Pro Glu Arg His Val Trp His Arg Arg Glu Ser Asp 
                725                 730                 735     


Glu Ser Gly Glu Ser Ala Pro Asp Glu Gly Gly Glu Gly Ala Arg Ala 
            740                 745                 750         


Pro Gln Ser Ile Pro Arg Ser Ala Ser Tyr Pro Cys Ala Ala Pro Arg 
        755                 760                 765             


Pro Gly Ala Pro Glu Thr Thr Ala Leu His Gly Gly Phe Gln Arg Arg 
    770                 775                 780                 


Tyr Gly Gly Ile Thr Asp Pro Gly Thr Val Pro Arg Val Pro Ser His 
785                 790                 795                 800 


Phe Ser Arg Leu Pro Leu Gly Gly Trp Ala Glu Asp Gly Gln Ser Ala 
                805                 810                 815     


Ser Arg His Pro Glu Pro Val Pro Glu Glu Gly Ser Glu Asp Glu Leu 
            820                 825                 830         


Pro Pro Gln Val His Lys Val 
        835                 


<210>  24
<211>  3910
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 9A (ATG9A), transcript variant 1, 
       mRNA corresponding to NCBI Reference Sequence: NM_001077198.2

<400>  24
gttgtgtctc cgccccccgt gcttattggc cagcctggct gcgggtgaca gtgagtggca       60

gacacccggc ctagcgccgc gggtcgcgcc gagccgagcc gagccgagcg gagccggcgg      120

agcctctgga atcacccggg tcgctgttcc tgagcagctg cagagcatcg agggctggag      180

aggagcacat actgtccatg gagctggtgg tcaaggtgga cagggggcgg tggtgatggc      240

gcagtttgac actgaatacc agcgcctaga ggcctcctat agtgattcac ccccagggga      300

ggaggacctg ttggtgcacg tcgccgaggg gagcaagtca ccttggcacc atattgaaaa      360

ccttgacctc ttcttctctc gagtttataa tctgcaccag aagaatggct tcacatgtat      420

gctcatcggg gagatctttg agctcatgca gttcctcttt gtggttgcct tcactacctt      480

cctggtcagc tgcgtggact atgacatcct atttgccaac aagatggtga accacagtct      540

tcaccctact gaacccgtca aggtcactct gccagacgcc tttttgcctg ctcaagtctg      600

tagtgccagg attcaggaaa atggctccct tatcaccatc ctggtcattg ctggtgtctt      660

ctggatccac cggcttatca agttcatcta taacatttgc tgctactggg agatccactc      720

cttctacctg cacgctctgc gcatccctat gtctgccctt ccgtattgca cgtggcaaga      780

agtgcaggcc cggatcgtgc agacgcagaa ggagcaccag atctgcatcc acaaacgtga      840

gctgacagaa ctggacatct accaccgcat cctccgtttc cagaactaca tggtggcact      900

ggttaacaaa tccctcctgc ctctgcgctt ccgcctgcct ggcctcgggg aagctgtctt      960

cttcacccgt ggtctcaagt acaactttga gctgatcctc ttctggggac ctggctctct     1020

gtttctcaat gaatggagcc tcaaggccga gtacaaacgt ggggggcaac ggctagagct     1080

ggcccagcgc ctcagcaacc gcatcctgtg gattggcatc gctaacttcc tgctgtgccc     1140

cctcatcctc atatggcaaa tcctctatgc cttcttcagc tatgctgagg tgctgaagcg     1200

ggagccgggg gccctgggag cacgctgctg gtcactctat ggccgctgct acctccgcca     1260

cttcaacgag ctggagcacg agctgcagtc ccgcctcaac cgtggctaca agcccgcctc     1320

caagtacatg aattgcttct tgtcacctct tttgacactg ctggccaaga atggagcctt     1380

cttcgctggc tccatcctgg ctgtgcttat tgccctcacc atttatgacg aagatgtgtt     1440

ggctgtggaa catgtgctga ccaccgtcac actcctgggg gtcaccgtga ccgtgtgcag     1500

gtcctttatc ccggaccagc acatggtgtt ctgccctgag cagctgctcc gcgtgatcct     1560

cgctcacatc cactacatgc ctgaccactg gcagggtaat gcccaccgct cgcagacccg     1620

ggacgagttt gcccagctct tccagtacaa ggcagtgttc attttggaag agttgctgag     1680

ccccattgtc acacccctca tcctcatctt ctgcctgcgc ccacgggccc tggagattat     1740

agacttcttc cgaaacttca ccgtggaggt cgttggtgtg ggagatacct gctcctttgc     1800

tcagatggat gttcgccagc atggtcatcc ccagtggcta tctgctgggc agacagaggc     1860

ctcagtgtac cagcaagctg aggatggaaa gacagagttg tcactcatgc actttgccat     1920

caccaaccct ggctggcagc caccacgtga gagcacagcc ttcctaggct tcctcaagga     1980

gcaggttcag cgggatggag cagctgctag cctcgcccaa gggggtctgc tccctgaaaa     2040

tgccctcttt acgtctatcc agtccttaca atctgagtct gagcccctga gccttatcgc     2100

aaatgtggta gctggctcat cctgccgggg ccctccactg cccagagacc tgcagggctc     2160

caggcacagg gctgaagtcg cctctgccct gcgctccttc tccccgctgc aacccgggca     2220

ggcgcccaca ggccgggctc acagcaccat gacaggctct ggggtggatg ccaggacagc     2280

cagctccggg agcagcgtgt gggaaggaca gctgcagagc ctggtgctgt cagaatatgc     2340

atccacagag atgagcctgc atgccctcta tatgcaccag ctccacaagc agcaggccca     2400

ggctgaacct gagcggcatg tatggcaccg ccgggagagt gatgagagtg gagaaagcgc     2460

ccctgatgaa gggggagagg gcgcccgggc cccccagtct atccctcgct ctgctagcta     2520

tccctgtgca gcaccccggc ctggagctcc tgagaccact gccctgcatg ggggcttcca     2580

gaggcgctac ggtggcatca cagatcctgg cacagtgccc agggttccct ctcatttctc     2640

tcggctgcct cttggagggt gggcagaaga tgggcagtcg gcatcaaggc accctgagcc     2700

cgtgcccgaa gagggctcgg aggatgagct accccctcag gtgcacaagg tatagacaag     2760

gctgagcagg gttcctgtgg cccaggatgg aggccaccgc tgccctgcca tcccgtctgc     2820

ctgccatggg acggctcctc tgagtgttcc ctggccccac gtgtgtggtg tttgtgtgtc     2880

tgtgcctggc caagggaggt gccaacactg ggcttgccac agccccagga gaggaatttg     2940

gggcctagga accgagggca cacgggactc tagcctcatc cccaggaccc ccttggctca     3000

gagtgtggtg ctagaaactg gcccccagcc cagccccagt actgccacct ttacacctac     3060

ccctgcaagt ccccagaggg ctgcccacga tagaagctgc caagcaggga gaacctgtgc     3120

caactgtgga gtggggaggt tgggcctgga ccctcaaccc ctgcaacctt ccctagcccc     3180

ctcaatagat gagcaggtca ggctgtggcc cttacctcac ccgcagttct cgcccagtgc     3240

tgcagccggc tcacctctct ccgcttcttg cacatcactg gcctgtgtgt gctgcttgct     3300

cctgttctgt tcgcttgctc ccgttccgtt cggcttttgc tttgcgttag ggtgaagacc     3360

ctagcgtcca gctcccctca acgctatatt ttgacactaa aaaagaaggt ttctaaattg     3420

taggagcagg atggaaatac tttgctgccc ttgccatctt ttaggatggg cccccaggag     3480

actgaggtct tcctgggccc tcattgctgc ttatcgtacc ccccatcacc tgcacatggg     3540

acagaccggg ctggagggtg accttggctg tgtgcgtccc agcaaaagag ctctggcccg     3600

catctcgctg tgccctgaag ggggatgaag ggcgatgcct cgcccgaggc tttgggctgc     3660

tgcactgcat gctgggactg ctcctactct ctgtcccacc cctcacccag ctgtggtccg     3720

gctttgggag agtggtgaat tgcgctgccc gaactcggag cggagcaggg tagggaccgt     3780

gtacagcttg ataaccctta ataaaaaggg agtttgacca gaaaaaaaaa aaaaaaaaaa     3840

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa     3900

aaaaaaaaaa                                                            3910


<210>  25
<211>  839
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 9A (ATG9A), transcript variant 1, 
       polypeptide for NCBI Reference Sequence: NM_001077198.2

<400>  25

Met Ala Gln Phe Asp Thr Glu Tyr Gln Arg Leu Glu Ala Ser Tyr Ser 
1               5                   10                  15      


Asp Ser Pro Pro Gly Glu Glu Asp Leu Leu Val His Val Ala Glu Gly 
            20                  25                  30          


Ser Lys Ser Pro Trp His His Ile Glu Asn Leu Asp Leu Phe Phe Ser 
        35                  40                  45              


Arg Val Tyr Asn Leu His Gln Lys Asn Gly Phe Thr Cys Met Leu Ile 
    50                  55                  60                  


Gly Glu Ile Phe Glu Leu Met Gln Phe Leu Phe Val Val Ala Phe Thr 
65                  70                  75                  80  


Thr Phe Leu Val Ser Cys Val Asp Tyr Asp Ile Leu Phe Ala Asn Lys 
                85                  90                  95      


Met Val Asn His Ser Leu His Pro Thr Glu Pro Val Lys Val Thr Leu 
            100                 105                 110         


Pro Asp Ala Phe Leu Pro Ala Gln Val Cys Ser Ala Arg Ile Gln Glu 
        115                 120                 125             


Asn Gly Ser Leu Ile Thr Ile Leu Val Ile Ala Gly Val Phe Trp Ile 
    130                 135                 140                 


His Arg Leu Ile Lys Phe Ile Tyr Asn Ile Cys Cys Tyr Trp Glu Ile 
145                 150                 155                 160 


His Ser Phe Tyr Leu His Ala Leu Arg Ile Pro Met Ser Ala Leu Pro 
                165                 170                 175     


Tyr Cys Thr Trp Gln Glu Val Gln Ala Arg Ile Val Gln Thr Gln Lys 
            180                 185                 190         


Glu His Gln Ile Cys Ile His Lys Arg Glu Leu Thr Glu Leu Asp Ile 
        195                 200                 205             


Tyr His Arg Ile Leu Arg Phe Gln Asn Tyr Met Val Ala Leu Val Asn 
    210                 215                 220                 


Lys Ser Leu Leu Pro Leu Arg Phe Arg Leu Pro Gly Leu Gly Glu Ala 
225                 230                 235                 240 


Val Phe Phe Thr Arg Gly Leu Lys Tyr Asn Phe Glu Leu Ile Leu Phe 
                245                 250                 255     


Trp Gly Pro Gly Ser Leu Phe Leu Asn Glu Trp Ser Leu Lys Ala Glu 
            260                 265                 270         


Tyr Lys Arg Gly Gly Gln Arg Leu Glu Leu Ala Gln Arg Leu Ser Asn 
        275                 280                 285             


Arg Ile Leu Trp Ile Gly Ile Ala Asn Phe Leu Leu Cys Pro Leu Ile 
    290                 295                 300                 


Leu Ile Trp Gln Ile Leu Tyr Ala Phe Phe Ser Tyr Ala Glu Val Leu 
305                 310                 315                 320 


Lys Arg Glu Pro Gly Ala Leu Gly Ala Arg Cys Trp Ser Leu Tyr Gly 
                325                 330                 335     


Arg Cys Tyr Leu Arg His Phe Asn Glu Leu Glu His Glu Leu Gln Ser 
            340                 345                 350         


Arg Leu Asn Arg Gly Tyr Lys Pro Ala Ser Lys Tyr Met Asn Cys Phe 
        355                 360                 365             


Leu Ser Pro Leu Leu Thr Leu Leu Ala Lys Asn Gly Ala Phe Phe Ala 
    370                 375                 380                 


Gly Ser Ile Leu Ala Val Leu Ile Ala Leu Thr Ile Tyr Asp Glu Asp 
385                 390                 395                 400 


Val Leu Ala Val Glu His Val Leu Thr Thr Val Thr Leu Leu Gly Val 
                405                 410                 415     


Thr Val Thr Val Cys Arg Ser Phe Ile Pro Asp Gln His Met Val Phe 
            420                 425                 430         


Cys Pro Glu Gln Leu Leu Arg Val Ile Leu Ala His Ile His Tyr Met 
        435                 440                 445             


Pro Asp His Trp Gln Gly Asn Ala His Arg Ser Gln Thr Arg Asp Glu 
    450                 455                 460                 


Phe Ala Gln Leu Phe Gln Tyr Lys Ala Val Phe Ile Leu Glu Glu Leu 
465                 470                 475                 480 


Leu Ser Pro Ile Val Thr Pro Leu Ile Leu Ile Phe Cys Leu Arg Pro 
                485                 490                 495     


Arg Ala Leu Glu Ile Ile Asp Phe Phe Arg Asn Phe Thr Val Glu Val 
            500                 505                 510         


Val Gly Val Gly Asp Thr Cys Ser Phe Ala Gln Met Asp Val Arg Gln 
        515                 520                 525             


His Gly His Pro Gln Trp Leu Ser Ala Gly Gln Thr Glu Ala Ser Val 
    530                 535                 540                 


Tyr Gln Gln Ala Glu Asp Gly Lys Thr Glu Leu Ser Leu Met His Phe 
545                 550                 555                 560 


Ala Ile Thr Asn Pro Gly Trp Gln Pro Pro Arg Glu Ser Thr Ala Phe 
                565                 570                 575     


Leu Gly Phe Leu Lys Glu Gln Val Gln Arg Asp Gly Ala Ala Ala Ser 
            580                 585                 590         


Leu Ala Gln Gly Gly Leu Leu Pro Glu Asn Ala Leu Phe Thr Ser Ile 
        595                 600                 605             


Gln Ser Leu Gln Ser Glu Ser Glu Pro Leu Ser Leu Ile Ala Asn Val 
    610                 615                 620                 


Val Ala Gly Ser Ser Cys Arg Gly Pro Pro Leu Pro Arg Asp Leu Gln 
625                 630                 635                 640 


Gly Ser Arg His Arg Ala Glu Val Ala Ser Ala Leu Arg Ser Phe Ser 
                645                 650                 655     


Pro Leu Gln Pro Gly Gln Ala Pro Thr Gly Arg Ala His Ser Thr Met 
            660                 665                 670         


Thr Gly Ser Gly Val Asp Ala Arg Thr Ala Ser Ser Gly Ser Ser Val 
        675                 680                 685             


Trp Glu Gly Gln Leu Gln Ser Leu Val Leu Ser Glu Tyr Ala Ser Thr 
    690                 695                 700                 


Glu Met Ser Leu His Ala Leu Tyr Met His Gln Leu His Lys Gln Gln 
705                 710                 715                 720 


Ala Gln Ala Glu Pro Glu Arg His Val Trp His Arg Arg Glu Ser Asp 
                725                 730                 735     


Glu Ser Gly Glu Ser Ala Pro Asp Glu Gly Gly Glu Gly Ala Arg Ala 
            740                 745                 750         


Pro Gln Ser Ile Pro Arg Ser Ala Ser Tyr Pro Cys Ala Ala Pro Arg 
        755                 760                 765             


Pro Gly Ala Pro Glu Thr Thr Ala Leu His Gly Gly Phe Gln Arg Arg 
    770                 775                 780                 


Tyr Gly Gly Ile Thr Asp Pro Gly Thr Val Pro Arg Val Pro Ser His 
785                 790                 795                 800 


Phe Ser Arg Leu Pro Leu Gly Gly Trp Ala Glu Asp Gly Gln Ser Ala 
                805                 810                 815     


Ser Arg His Pro Glu Pro Val Pro Glu Glu Gly Ser Glu Asp Glu Leu 
            820                 825                 830         


Pro Pro Gln Val His Lys Val 
        835                 


<210>  26
<211>  3858
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 9A (ATG9A), transcript variant 2, 
       mRNA for NCBI Reference Sequence: NM_024085.4

<400>  26
gttgtgtctc cgccccccgt gcttattggc cagcctggct gcgggtgaca gtgagtggca       60

gacacccggc ctagcgccgc gggtcgcgcc gagccgagcc gagccgagcg gagccggcgg      120

agcctctgga atcacccggg tcgctgttcc tgaggtggtc aaggtggaca gggggcggtg      180

gtgatggcgc agtttgacac tgaataccag cgcctagagg cctcctatag tgattcaccc      240

ccaggggagg aggacctgtt ggtgcacgtc gccgagggga gcaagtcacc ttggcaccat      300

attgaaaacc ttgacctctt cttctctcga gtttataatc tgcaccagaa gaatggcttc      360

acatgtatgc tcatcgggga gatctttgag ctcatgcagt tcctctttgt ggttgccttc      420

actaccttcc tggtcagctg cgtggactat gacatcctat ttgccaacaa gatggtgaac      480

cacagtcttc accctactga acccgtcaag gtcactctgc cagacgcctt tttgcctgct      540

caagtctgta gtgccaggat tcaggaaaat ggctccctta tcaccatcct ggtcattgct      600

ggtgtcttct ggatccaccg gcttatcaag ttcatctata acatttgctg ctactgggag      660

atccactcct tctacctgca cgctctgcgc atccctatgt ctgcccttcc gtattgcacg      720

tggcaagaag tgcaggcccg gatcgtgcag acgcagaagg agcaccagat ctgcatccac      780

aaacgtgagc tgacagaact ggacatctac caccgcatcc tccgtttcca gaactacatg      840

gtggcactgg ttaacaaatc cctcctgcct ctgcgcttcc gcctgcctgg cctcggggaa      900

gctgtcttct tcacccgtgg tctcaagtac aactttgagc tgatcctctt ctggggacct      960

ggctctctgt ttctcaatga atggagcctc aaggccgagt acaaacgtgg ggggcaacgg     1020

ctagagctgg cccagcgcct cagcaaccgc atcctgtgga ttggcatcgc taacttcctg     1080

ctgtgccccc tcatcctcat atggcaaatc ctctatgcct tcttcagcta tgctgaggtg     1140

ctgaagcggg agccgggggc cctgggagca cgctgctggt cactctatgg ccgctgctac     1200

ctccgccact tcaacgagct ggagcacgag ctgcagtccc gcctcaaccg tggctacaag     1260

cccgcctcca agtacatgaa ttgcttcttg tcacctcttt tgacactgct ggccaagaat     1320

ggagccttct tcgctggctc catcctggct gtgcttattg ccctcaccat ttatgacgaa     1380

gatgtgttgg ctgtggaaca tgtgctgacc accgtcacac tcctgggggt caccgtgacc     1440

gtgtgcaggt cctttatccc ggaccagcac atggtgttct gccctgagca gctgctccgc     1500

gtgatcctcg ctcacatcca ctacatgcct gaccactggc agggtaatgc ccaccgctcg     1560

cagacccggg acgagtttgc ccagctcttc cagtacaagg cagtgttcat tttggaagag     1620

ttgctgagcc ccattgtcac acccctcatc ctcatcttct gcctgcgccc acgggccctg     1680

gagattatag acttcttccg aaacttcacc gtggaggtcg ttggtgtggg agatacctgc     1740

tcctttgctc agatggatgt tcgccagcat ggtcatcccc agtggctatc tgctgggcag     1800

acagaggcct cagtgtacca gcaagctgag gatggaaaga cagagttgtc actcatgcac     1860

tttgccatca ccaaccctgg ctggcagcca ccacgtgaga gcacagcctt cctaggcttc     1920

ctcaaggagc aggttcagcg ggatggagca gctgctagcc tcgcccaagg gggtctgctc     1980

cctgaaaatg ccctctttac gtctatccag tccttacaat ctgagtctga gcccctgagc     2040

cttatcgcaa atgtggtagc tggctcatcc tgccggggcc ctccactgcc cagagacctg     2100

cagggctcca ggcacagggc tgaagtcgcc tctgccctgc gctccttctc cccgctgcaa     2160

cccgggcagg cgcccacagg ccgggctcac agcaccatga caggctctgg ggtggatgcc     2220

aggacagcca gctccgggag cagcgtgtgg gaaggacagc tgcagagcct ggtgctgtca     2280

gaatatgcat ccacagagat gagcctgcat gccctctata tgcaccagct ccacaagcag     2340

caggcccagg ctgaacctga gcggcatgta tggcaccgcc gggagagtga tgagagtgga     2400

gaaagcgccc ctgatgaagg gggagagggc gcccgggccc cccagtctat ccctcgctct     2460

gctagctatc cctgtgcagc accccggcct ggagctcctg agaccactgc cctgcatggg     2520

ggcttccaga ggcgctacgg tggcatcaca gatcctggca cagtgcccag ggttccctct     2580

catttctctc ggctgcctct tggagggtgg gcagaagatg ggcagtcggc atcaaggcac     2640

cctgagcccg tgcccgaaga gggctcggag gatgagctac cccctcaggt gcacaaggta     2700

tagacaaggc tgagcagggt tcctgtggcc caggatggag gccaccgctg ccctgccatc     2760

ccgtctgcct gccatgggac ggctcctctg agtgttccct ggccccacgt gtgtggtgtt     2820

tgtgtgtctg tgcctggcca agggaggtgc caacactggg cttgccacag ccccaggaga     2880

ggaatttggg gcctaggaac cgagggcaca cgggactcta gcctcatccc caggaccccc     2940

ttggctcaga gtgtggtgct agaaactggc ccccagccca gccccagtac tgccaccttt     3000

acacctaccc ctgcaagtcc ccagagggct gcccacgata gaagctgcca agcagggaga     3060

acctgtgcca actgtggagt ggggaggttg ggcctggacc ctcaacccct gcaaccttcc     3120

ctagccccct caatagatga gcaggtcagg ctgtggccct tacctcaccc gcagttctcg     3180

cccagtgctg cagccggctc acctctctcc gcttcttgca catcactggc ctgtgtgtgc     3240

tgcttgctcc tgttctgttc gcttgctccc gttccgttcg gcttttgctt tgcgttaggg     3300

tgaagaccct agcgtccagc tcccctcaac gctatatttt gacactaaaa aagaaggttt     3360

ctaaattgta ggagcaggat ggaaatactt tgctgccctt gccatctttt aggatgggcc     3420

cccaggagac tgaggtcttc ctgggccctc attgctgctt atcgtacccc ccatcacctg     3480

cacatgggac agaccgggct ggagggtgac cttggctgtg tgcgtcccag caaaagagct     3540

ctggcccgca tctcgctgtg ccctgaaggg ggatgaaggg cgatgcctcg cccgaggctt     3600

tgggctgctg cactgcatgc tgggactgct cctactctct gtcccacccc tcacccagct     3660

gtggtccggc tttgggagag tggtgaattg cgctgcccga actcggagcg gagcagggta     3720

gggaccgtgt acagcttgat aacccttaat aaaaagggag tttgaccaga aaaaaaaaaa     3780

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa     3840

aaaaaaaaaa aaaaaaaa                                                   3858


<210>  27
<211>  839
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 9A (ATG9A), transcript variant 2, 
       polypeptide

<400>  27

Met Ala Gln Phe Asp Thr Glu Tyr Gln Arg Leu Glu Ala Ser Tyr Ser 
1               5                   10                  15      


Asp Ser Pro Pro Gly Glu Glu Asp Leu Leu Val His Val Ala Glu Gly 
            20                  25                  30          


Ser Lys Ser Pro Trp His His Ile Glu Asn Leu Asp Leu Phe Phe Ser 
        35                  40                  45              


Arg Val Tyr Asn Leu His Gln Lys Asn Gly Phe Thr Cys Met Leu Ile 
    50                  55                  60                  


Gly Glu Ile Phe Glu Leu Met Gln Phe Leu Phe Val Val Ala Phe Thr 
65                  70                  75                  80  


Thr Phe Leu Val Ser Cys Val Asp Tyr Asp Ile Leu Phe Ala Asn Lys 
                85                  90                  95      


Met Val Asn His Ser Leu His Pro Thr Glu Pro Val Lys Val Thr Leu 
            100                 105                 110         


Pro Asp Ala Phe Leu Pro Ala Gln Val Cys Ser Ala Arg Ile Gln Glu 
        115                 120                 125             


Asn Gly Ser Leu Ile Thr Ile Leu Val Ile Ala Gly Val Phe Trp Ile 
    130                 135                 140                 


His Arg Leu Ile Lys Phe Ile Tyr Asn Ile Cys Cys Tyr Trp Glu Ile 
145                 150                 155                 160 


His Ser Phe Tyr Leu His Ala Leu Arg Ile Pro Met Ser Ala Leu Pro 
                165                 170                 175     


Tyr Cys Thr Trp Gln Glu Val Gln Ala Arg Ile Val Gln Thr Gln Lys 
            180                 185                 190         


Glu His Gln Ile Cys Ile His Lys Arg Glu Leu Thr Glu Leu Asp Ile 
        195                 200                 205             


Tyr His Arg Ile Leu Arg Phe Gln Asn Tyr Met Val Ala Leu Val Asn 
    210                 215                 220                 


Lys Ser Leu Leu Pro Leu Arg Phe Arg Leu Pro Gly Leu Gly Glu Ala 
225                 230                 235                 240 


Val Phe Phe Thr Arg Gly Leu Lys Tyr Asn Phe Glu Leu Ile Leu Phe 
                245                 250                 255     


Trp Gly Pro Gly Ser Leu Phe Leu Asn Glu Trp Ser Leu Lys Ala Glu 
            260                 265                 270         


Tyr Lys Arg Gly Gly Gln Arg Leu Glu Leu Ala Gln Arg Leu Ser Asn 
        275                 280                 285             


Arg Ile Leu Trp Ile Gly Ile Ala Asn Phe Leu Leu Cys Pro Leu Ile 
    290                 295                 300                 


Leu Ile Trp Gln Ile Leu Tyr Ala Phe Phe Ser Tyr Ala Glu Val Leu 
305                 310                 315                 320 


Lys Arg Glu Pro Gly Ala Leu Gly Ala Arg Cys Trp Ser Leu Tyr Gly 
                325                 330                 335     


Arg Cys Tyr Leu Arg His Phe Asn Glu Leu Glu His Glu Leu Gln Ser 
            340                 345                 350         


Arg Leu Asn Arg Gly Tyr Lys Pro Ala Ser Lys Tyr Met Asn Cys Phe 
        355                 360                 365             


Leu Ser Pro Leu Leu Thr Leu Leu Ala Lys Asn Gly Ala Phe Phe Ala 
    370                 375                 380                 


Gly Ser Ile Leu Ala Val Leu Ile Ala Leu Thr Ile Tyr Asp Glu Asp 
385                 390                 395                 400 


Val Leu Ala Val Glu His Val Leu Thr Thr Val Thr Leu Leu Gly Val 
                405                 410                 415     


Thr Val Thr Val Cys Arg Ser Phe Ile Pro Asp Gln His Met Val Phe 
            420                 425                 430         


Cys Pro Glu Gln Leu Leu Arg Val Ile Leu Ala His Ile His Tyr Met 
        435                 440                 445             


Pro Asp His Trp Gln Gly Asn Ala His Arg Ser Gln Thr Arg Asp Glu 
    450                 455                 460                 


Phe Ala Gln Leu Phe Gln Tyr Lys Ala Val Phe Ile Leu Glu Glu Leu 
465                 470                 475                 480 


Leu Ser Pro Ile Val Thr Pro Leu Ile Leu Ile Phe Cys Leu Arg Pro 
                485                 490                 495     


Arg Ala Leu Glu Ile Ile Asp Phe Phe Arg Asn Phe Thr Val Glu Val 
            500                 505                 510         


Val Gly Val Gly Asp Thr Cys Ser Phe Ala Gln Met Asp Val Arg Gln 
        515                 520                 525             


His Gly His Pro Gln Trp Leu Ser Ala Gly Gln Thr Glu Ala Ser Val 
    530                 535                 540                 


Tyr Gln Gln Ala Glu Asp Gly Lys Thr Glu Leu Ser Leu Met His Phe 
545                 550                 555                 560 


Ala Ile Thr Asn Pro Gly Trp Gln Pro Pro Arg Glu Ser Thr Ala Phe 
                565                 570                 575     


Leu Gly Phe Leu Lys Glu Gln Val Gln Arg Asp Gly Ala Ala Ala Ser 
            580                 585                 590         


Leu Ala Gln Gly Gly Leu Leu Pro Glu Asn Ala Leu Phe Thr Ser Ile 
        595                 600                 605             


Gln Ser Leu Gln Ser Glu Ser Glu Pro Leu Ser Leu Ile Ala Asn Val 
    610                 615                 620                 


Val Ala Gly Ser Ser Cys Arg Gly Pro Pro Leu Pro Arg Asp Leu Gln 
625                 630                 635                 640 


Gly Ser Arg His Arg Ala Glu Val Ala Ser Ala Leu Arg Ser Phe Ser 
                645                 650                 655     


Pro Leu Gln Pro Gly Gln Ala Pro Thr Gly Arg Ala His Ser Thr Met 
            660                 665                 670         


Thr Gly Ser Gly Val Asp Ala Arg Thr Ala Ser Ser Gly Ser Ser Val 
        675                 680                 685             


Trp Glu Gly Gln Leu Gln Ser Leu Val Leu Ser Glu Tyr Ala Ser Thr 
    690                 695                 700                 


Glu Met Ser Leu His Ala Leu Tyr Met His Gln Leu His Lys Gln Gln 
705                 710                 715                 720 


Ala Gln Ala Glu Pro Glu Arg His Val Trp His Arg Arg Glu Ser Asp 
                725                 730                 735     


Glu Ser Gly Glu Ser Ala Pro Asp Glu Gly Gly Glu Gly Ala Arg Ala 
            740                 745                 750         


Pro Gln Ser Ile Pro Arg Ser Ala Ser Tyr Pro Cys Ala Ala Pro Arg 
        755                 760                 765             


Pro Gly Ala Pro Glu Thr Thr Ala Leu His Gly Gly Phe Gln Arg Arg 
    770                 775                 780                 


Tyr Gly Gly Ile Thr Asp Pro Gly Thr Val Pro Arg Val Pro Ser His 
785                 790                 795                 800 


Phe Ser Arg Leu Pro Leu Gly Gly Trp Ala Glu Asp Gly Gln Ser Ala 
                805                 810                 815     


Ser Arg His Pro Glu Pro Val Pro Glu Glu Gly Ser Glu Asp Glu Leu 
            820                 825                 830         


Pro Pro Gln Val His Lys Val 
        835                 


<210>  28
<211>  570
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human DJ-1 (Park7) mRNA GenBank accession no. AB073864.1 
       GI:16751470

<400>  28
atggcttcca aaagagctct ggtcatcctg gctaaaggag cagaggaaat ggagacggtc       60

atccctgtag atgtcatgag gcgagctggg attaaggtca ccgttgcagg cctggctgga      120

aaagacccag tacagtgtag ccgtgatgtg gtcatttgtc ctgatgccag ccttgaagat      180

gcaaaaaaag agggaccata tgatgtggtg gttctaccag gaggtaatct gggcgcacag      240

aatttatctg agtctgctgc tgtgaaggag atactgaagg agcaggaaaa ccggaagggc      300

ctgatagccg ccatctgtgc aggtcctact gctctgttgg ctcatgaaat aggctgtgga      360

agtaaagtta caacacaccc tcttgctaaa gacaaaatga tgaatggagg tcattacacc      420

tactctgaga atcgtgtgga aaaagacggc ctgattctta caagccgggg gcctgggacc      480

agcttcgagt ttgcgcttgc aattgttgaa gccctgaatg gcaaggaggt ggcggctcaa      540

gtgaaggctc cacttgttct taaagactag                                       570


<210>  29
<211>  189
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human DJ-1 (Park7) polypeptide GenBank Protein ID/Accession No. 
       BAB71782.1  GI:16751471

<400>  29

Met Ala Ser Lys Arg Ala Leu Val Ile Leu Ala Lys Gly Ala Glu Glu 
1               5                   10                  15      


Met Glu Thr Val Ile Pro Val Asp Val Met Arg Arg Ala Gly Ile Lys 
            20                  25                  30          


Val Thr Val Ala Gly Leu Ala Gly Lys Asp Pro Val Gln Cys Ser Arg 
        35                  40                  45              


Asp Val Val Ile Cys Pro Asp Ala Ser Leu Glu Asp Ala Lys Lys Glu 
    50                  55                  60                  


Gly Pro Tyr Asp Val Val Val Leu Pro Gly Gly Asn Leu Gly Ala Gln 
65                  70                  75                  80  


Asn Leu Ser Glu Ser Ala Ala Val Lys Glu Ile Leu Lys Glu Gln Glu 
                85                  90                  95      


Asn Arg Lys Gly Leu Ile Ala Ala Ile Cys Ala Gly Pro Thr Ala Leu 
            100                 105                 110         


Leu Ala His Glu Ile Gly Cys Gly Ser Lys Val Thr Thr His Pro Leu 
        115                 120                 125             


Ala Lys Asp Lys Met Met Asn Gly Gly His Tyr Thr Tyr Ser Glu Asn 
    130                 135                 140                 


Arg Val Glu Lys Asp Gly Leu Ile Leu Thr Ser Arg Gly Pro Gly Thr 
145                 150                 155                 160 


Ser Phe Glu Phe Ala Leu Ala Ile Val Glu Ala Leu Asn Gly Lys Glu 
                165                 170                 175     


Val Ala Ala Gln Val Lys Ala Pro Leu Val Leu Lys Asp 
            180                 185                 


<210>  30
<211>  504
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens DJ-1 (PARK7) gene, partial cds

<400>  30
cacatagccc attaggatgt caccttttct gtttctactt tgcaggtcat tacacctact       60

ctgagaatcg tgtggaaaaa gacagcctga ttcttacaag ccgggggcct gggaccagct      120

tcgagtttgc gcttgcaatt gttgaagccc tgaatggcaa ggaggtggcg gctcaagtga      180

aggctccact tgttcttaaa gactagagca gcgaactgcg acgatcactt agagaaacag      240

gccgttagga atccattctc actgtgttcg ctctaaacaa aacagtggta ggttaatgtg      300

ttcagaagtc gctgtcctta ctacttttgc ggaagtatgg aagtcacaac tacacagaga      360

tttctcagcc tacaaattgt gtctatacat ttctaagcct tgtttgcaga ataaacaggg      420

catttagcaa actactgatt gtttcttgtt ttgtctctca tttcttttgt gaaattaaat      480

tccgtatcac cttcatttgc agct                                             504


<210>  31
<211>  52
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens DJ-1 (PARK7) gene, partial cds

<400>  31

His Tyr Thr Tyr Ser Glu Asn Arg Val Glu Lys Asp Ser Leu Ile Leu 
1               5                   10                  15      


Thr Ser Arg Gly Pro Gly Thr Ser Phe Glu Phe Ala Leu Ala Ile Val 
            20                  25                  30          


Glu Ala Leu Asn Gly Lys Glu Val Ala Ala Gln Val Lys Ala Pro Leu 
        35                  40                  45              


Val Leu Lys Asp 
    50          


<210>  32
<211>  8733
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human mTOR mRNA Genbank accession no.: NM_004958.3

<400>  32
gctcccggct tagaggacag cggggaaggc gggcggtggg gcagggggcc tgaagcggcg       60

gtaccggtgc tggcggcggc agctgaggcc ttggccgaag ccgcgcgaac ctcagggcaa      120

gatgcttgga accggacctg ccgccgccac caccgctgcc accacatcta gcaatgtgag      180

cgtcctgcag cagtttgcca gtggcctaaa gagccggaat gaggaaacca gggccaaagc      240

cgccaaggag ctccagcact atgtcaccat ggaactccga gagatgagtc aagaggagtc      300

tactcgcttc tatgaccaac tgaaccatca catttttgaa ttggtttcca gctcagatgc      360

caatgagagg aaaggtggca tcttggccat agctagcctc ataggagtgg aaggtgggaa      420

tgccacccga attggcagat ttgccaacta tcttcggaac ctcctcccct ccaatgaccc      480

agttgtcatg gaaatggcat ccaaggccat tggccgtctt gccatggcag gggacacttt      540

taccgctgag tacgtggaat ttgaggtgaa gcgagccctg gaatggctgg gtgctgaccg      600

caatgagggc cggagacatg cagctgtcct ggttctccgt gagctggcca tcagcgtccc      660

taccttcttc ttccagcaag tgcaaccctt ctttgacaac atttttgtgg ccgtgtggga      720

ccccaaacag gccatccgtg agggagctgt agccgccctt cgtgcctgtc tgattctcac      780

aacccagcgt gagccgaagg agatgcagaa gcctcagtgg tacaggcaca catttgaaga      840

agcagagaag ggatttgatg agaccttggc caaagagaag ggcatgaatc gggatgatcg      900

gatccatgga gccttgttga tccttaacga gctggtccga atcagcagca tggagggaga      960

gcgtctgaga gaagaaatgg aagaaatcac acagcagcag ctggtacacg acaagtactg     1020

caaagatctc atgggcttcg gaacaaaacc tcgtcacatt acccccttca ccagtttcca     1080

ggctgtacag ccccagcagt caaatgcctt ggtggggctg ctggggtaca gctctcacca     1140

aggcctcatg ggatttggga cctcccccag tccagctaag tccaccctgg tggagagccg     1200

gtgttgcaga gacttgatgg aggagaaatt tgatcaggtg tgccagtggg tgctgaaatg     1260

caggaatagc aagaactcgc tgatccaaat gacaatcctt aatttgttgc cccgcttggc     1320

tgcattccga ccttctgcct tcacagatac ccagtatctc caagatacca tgaaccatgt     1380

cctaagctgt gtcaagaagg agaaggaacg tacagcggcc ttccaagccc tggggctact     1440

ttctgtggct gtgaggtctg agtttaaggt ctatttgcct cgcgtgctgg acatcatccg     1500

agcggccctg cccccaaagg acttcgccca taagaggcag aaggcaatgc aggtggatgc     1560

cacagtcttc acttgcatca gcatgctggc tcgagcaatg gggccaggca tccagcagga     1620

tatcaaggag ctgctggagc ccatgctggc agtgggacta agccctgccc tcactgcagt     1680

gctctacgac ctgagccgtc agattccaca gctaaagaag gacattcaag atgggctact     1740

gaaaatgctg tccctggtcc ttatgcacaa accccttcgc cacccaggca tgcccaaggg     1800

cctggcccat cagctggcct ctcctggcct cacgaccctc cctgaggcca gcgatgtggg     1860

cagcatcact cttgccctcc gaacgcttgg cagctttgaa tttgaaggcc actctctgac     1920

ccaatttgtt cgccactgtg cggatcattt cctgaacagt gagcacaagg agatccgcat     1980

ggaggctgcc cgcacctgct cccgcctgct cacaccctcc atccacctca tcagtggcca     2040

tgctcatgtg gttagccaga ccgcagtgca agtggtggca gatgtgctta gcaaactgct     2100

cgtagttggg ataacagatc ctgaccctga cattcgctac tgtgtcttgg cgtccctgga     2160

cgagcgcttt gatgcacacc tggcccaggc ggagaacttg caggccttgt ttgtggctct     2220

gaatgaccag gtgtttgaga tccgggagct ggccatctgc actgtgggcc gactcagtag     2280

catgaaccct gcctttgtca tgcctttcct gcgcaagatg ctcatccaga ttttgacaga     2340

gttggagcac agtgggattg gaagaatcaa agagcagagt gcccgcatgc tggggcacct     2400

ggtctccaat gccccccgac tcatccgccc ctacatggag cctattctga aggcattaat     2460

tttgaaactg aaagatccag accctgatcc aaacccaggt gtgatcaata atgtcctggc     2520

aacaatagga gaattggcac aggttagtgg cctggaaatg aggaaatggg ttgatgaact     2580

ttttattatc atcatggaca tgctccagga ttcctctttg ttggccaaaa ggcaggtggc     2640

tctgtggacc ctgggacagt tggtggccag cactggctat gtagtagagc cctacaggaa     2700

gtaccctact ttgcttgagg tgctactgaa ttttctgaag actgagcaga accagggtac     2760

acgcagagag gccatccgtg tgttagggct tttaggggct ttggatcctt acaagcacaa     2820

agtgaacatt ggcatgatag accagtcccg ggatgcctct gctgtcagcc tgtcagaatc     2880

caagtcaagt caggattcct ctgactatag cactagtgaa atgctggtca acatgggaaa     2940

cttgcctctg gatgagttct acccagctgt gtccatggtg gccctgatgc ggatcttccg     3000

agaccagtca ctctctcatc atcacaccat ggttgtccag gccatcacct tcatcttcaa     3060

gtccctggga ctcaaatgtg tgcagttcct gccccaggtc atgcccacgt tccttaacgt     3120

cattcgagtc tgtgatgggg ccatccggga atttttgttc cagcagctgg gaatgttggt     3180

gtcctttgtg aagagccaca tcagacctta tatggatgaa atagtcaccc tcatgagaga     3240

attctgggtc atgaacacct caattcagag cacgatcatt cttctcattg agcaaattgt     3300

ggtagctctt gggggtgaat ttaagctcta cctgccccag ctgatcccac acatgctgcg     3360

tgtcttcatg catgacaaca gcccaggccg cattgtctct atcaagttac tggctgcaat     3420

ccagctgttt ggcgccaacc tggatgacta cctgcattta ctgctgcctc ctattgttaa     3480

gttgtttgat gcccctgaag ctccactgcc atctcgaaag gcagcgctag agactgtgga     3540

ccgcctgacg gagtccctgg atttcactga ctatgcctcc cggatcattc accctattgt     3600

tcgaacactg gaccagagcc cagaactgcg ctccacagcc atggacacgc tgtcttcact     3660

tgtttttcag ctggggaaga agtaccaaat tttcattcca atggtgaata aagttctggt     3720

gcgacaccga atcaatcatc agcgctatga tgtgctcatc tgcagaattg tcaagggata     3780

cacacttgct gatgaagagg aggatccttt gatttaccag catcggatgc ttaggagtgg     3840

ccaaggggat gcattggcta gtggaccagt ggaaacagga cccatgaaga aactgcacgt     3900

cagcaccatc aacctccaaa aggcctgggg cgctgccagg agggtctcca aagatgactg     3960

gctggaatgg ctgagacggc tgagcctgga gctgctgaag gactcatcat cgccctccct     4020

gcgctcctgc tgggccctgg cacaggccta caacccgatg gccagggatc tcttcaatgc     4080

tgcatttgtg tcctgctggt ctgaactgaa tgaagatcaa caggatgagc tcatcagaag     4140

catcgagttg gccctcacct cacaagacat cgctgaagtc acacagaccc tcttaaactt     4200

ggctgaattc atggaacaca gtgacaaggg ccccctgcca ctgagagatg acaatggcat     4260

tgttctgctg ggtgagagag ctgccaagtg ccgagcatat gccaaagcac tacactacaa     4320

agaactggag ttccagaaag gccccacccc tgccattcta gaatctctca tcagcattaa     4380

taataagcta cagcagccgg aggcagcggc cggagtgtta gaatatgcca tgaaacactt     4440

tggagagctg gagatccagg ctacctggta tgagaaactg cacgagtggg aggatgccct     4500

tgtggcctat gacaagaaaa tggacaccaa caaggacgac ccagagctga tgctgggccg     4560

catgcgctgc ctcgaggcct tgggggaatg gggtcaactc caccagcagt gctgtgaaaa     4620

gtggaccctg gttaatgatg agacccaagc caagatggcc cggatggctg ctgcagctgc     4680

atggggttta ggtcagtggg acagcatgga agaatacacc tgtatgatcc ctcgggacac     4740

ccatgatggg gcattttata gagctgtgct ggcactgcat caggacctct tctccttggc     4800

acaacagtgc attgacaagg ccagggacct gctggatgct gaattaactg cgatggcagg     4860

agagagttac agtcgggcat atggggccat ggtttcttgc cacatgctgt ccgagctgga     4920

ggaggttatc cagtacaaac ttgtccccga gcgacgagag atcatccgcc agatctggtg     4980

ggagagactg cagggctgcc agcgtatcgt agaggactgg cagaaaatcc ttatggtgcg     5040

gtcccttgtg gtcagccctc atgaagacat gagaacctgg ctcaagtatg caagcctgtg     5100

cggcaagagt ggcaggctgg ctcttgctca taaaacttta gtgttgctcc tgggagttga     5160

tccgtctcgg caacttgacc atcctctgcc aacagttcac cctcaggtga cctatgccta     5220

catgaaaaac atgtggaaga gtgcccgcaa gatcgatgcc ttccagcaca tgcagcattt     5280

tgtccagacc atgcagcaac aggcccagca tgccatcgct actgaggacc agcagcataa     5340

gcaggaactg cacaagctca tggcccgatg cttcctgaaa cttggagagt ggcagctgaa     5400

tctacagggc atcaatgaga gcacaatccc caaagtgctg cagtactaca gcgccgccac     5460

agagcacgac cgcagctggt acaaggcctg gcatgcgtgg gcagtgatga acttcgaagc     5520

tgtgctacac tacaaacatc agaaccaagc ccgcgatgag aagaagaaac tgcgtcatgc     5580

cagcggggcc aacatcacca acgccaccac tgccgccacc acggccgcca ctgccaccac     5640

cactgccagc accgagggca gcaacagtga gagcgaggcc gagagcaccg agaacagccc     5700

caccccatcg ccgctgcaga agaaggtcac tgaggatctg tccaaaaccc tcctgatgta     5760

cacggtgcct gccgtccagg gcttcttccg ttccatctcc ttgtcacgag gcaacaacct     5820

ccaggataca ctcagagttc tcaccttatg gtttgattat ggtcactggc cagatgtcaa     5880

tgaggcctta gtggaggggg tgaaagccat ccagattgat acctggctac aggttatacc     5940

tcagctcatt gcaagaattg atacgcccag acccttggtg ggacgtctca ttcaccagct     6000

tctcacagac attggtcggt accaccccca ggccctcatc tacccactga cagtggcttc     6060

taagtctacc acgacagccc ggcacaatgc agccaacaag attctgaaga acatgtgtga     6120

gcacagcaac accctggtcc agcaggccat gatggtgagc gaggagctga tccgagtggc     6180

catcctctgg catgagatgt ggcatgaagg cctggaagag gcatctcgtt tgtactttgg     6240

ggaaaggaac gtgaaaggca tgtttgaggt gctggagccc ttgcatgcta tgatggaacg     6300

gggcccccag actctgaagg aaacatcctt taatcaggcc tatggtcgag atttaatgga     6360

ggcccaagag tggtgcagga agtacatgaa atcagggaat gtcaaggacc tcacccaagc     6420

ctgggacctc tattatcatg tgttccgacg aatctcaaag cagctgcctc agctcacatc     6480

cttagagctg caatatgttt ccccaaaact tctgatgtgc cgggaccttg aattggctgt     6540

gccaggaaca tatgacccca accagccaat cattcgcatt cagtccatag caccgtcttt     6600

gcaagtcatc acatccaagc agaggccccg gaaattgaca cttatgggca gcaacggaca     6660

tgagtttgtt ttccttctaa aaggccatga agatctgcgc caggatgagc gtgtgatgca     6720

gctcttcggc ctggttaaca cccttctggc caatgaccca acatctcttc ggaaaaacct     6780

cagcatccag agatacgctg tcatcccttt atcgaccaac tcgggcctca ttggctgggt     6840

tccccactgt gacacactgc acgccctcat ccgggactac agggagaaga agaagatcct     6900

tctcaacatc gagcatcgca tcatgttgcg gatggctccg gactatgacc acttgactct     6960

gatgcagaag gtggaggtgt ttgagcatgc cgtcaataat acagctgggg acgacctggc     7020

caagctgctg tggctgaaaa gccccagctc cgaggtgtgg tttgaccgaa gaaccaatta     7080

tacccgttct ttagcggtca tgtcaatggt tgggtatatt ttaggcctgg gagatagaca     7140

cccatccaac ctgatgctgg accgtctgag tgggaagatc ctgcacattg actttgggga     7200

ctgctttgag gttgctatga cccgagagaa gtttccagag aagattccat ttagactaac     7260

aagaatgttg accaatgcta tggaggttac aggcctggat ggcaactaca gaatcacatg     7320

ccacacagtg atggaggtgc tgcgagagca caaggacagt gtcatggccg tgctggaagc     7380

ctttgtctat gaccccttgc tgaactggag gctgatggac acaaatacca aaggcaacaa     7440

gcgatcccga acgaggacgg attcctactc tgctggccag tcagtcgaaa ttttggacgg     7500

tgtggaactt ggagagccag cccataagaa aacggggacc acagtgccag aatctattca     7560

ttctttcatt ggagacggtt tggtgaaacc agaggcccta aataagaaag ctatccagat     7620

tattaacagg gttcgagata agctcactgg tcgggacttc tctcatgatg acactttgga     7680

tgttccaacg caagttgagc tgctcatcaa acaagcgaca tcccatgaaa acctctgcca     7740

gtgctatatt ggctggtgcc ctttctggta actggaggcc cagatgtgcc catcacgttt     7800

tttctgaggc ttttgtactt tagtaaatgc ttccactaaa ctgaaaccat ggtgagaaag     7860

tttgactttg ttaaatattt tgaaatgtaa atgaaaagaa ctactgtata ttaaaagttg     7920

gtttgaacca actttctagc tgctgttgaa gaatatattg tcagaaacac aaggcttgat     7980

ttggttccca ggacagtgaa acatagtaat accacgtaaa tcaagccatt cattttgggg     8040

aacagaagat ccataacttt agaaatacgg gttttgactt aactcacaag agaactcatc     8100

ataagtactt gctgatggaa gaatgaccta gttgctcctc tcaacatggg tacagcaaac     8160

tcagcacagc caagaagcct caggtcgtgg agaacatgga ttaggatcct agactgtaaa     8220

gacacagaag atgctgacct cacccctgcc acctatccca agacctcact ggtctgtgga     8280

cagcagcaga aatgtttgca agataggcca aaatgagtac aaaaggtctg tcttccatca     8340

gacccagtga tgctgcgact cacacgcttc aattcaagac ctgaccgcta gtagggaggt     8400

ttattcagat cgctggcagc ctcggctgag cagatgcaca gaggggatca ctgtgcagtg     8460

ggaccaccct cactggcctt ctgcagcagg gttctgggat gttttcagtg gtcaaaatac     8520

tctgtttaga gcaagggctc agaaaacaga aatactgtca tggaggtgct gaacacaggg     8580

aaggtctggt acatattgga aattatgagc agaacaaata ctcaactaaa tgcacaaagt     8640

ataaagtgta gccatgtcta gacaccatgt tgtatcagaa taatttttgt gccaataaat     8700

gacatcagaa ttttaaacat atgtaaaaaa aaa                                  8733


<210>  33
<211>  2549
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human mTOR polypeptide Genbank Protein ID/accession no.: 
       NP_004949.1 GI: 4826730

<400>  33

Met Leu Gly Thr Gly Pro Ala Ala Ala Thr Thr Ala Ala Thr Thr Ser 
1               5                   10                  15      


Ser Asn Val Ser Val Leu Gln Gln Phe Ala Ser Gly Leu Lys Ser Arg 
            20                  25                  30          


Asn Glu Glu Thr Arg Ala Lys Ala Ala Lys Glu Leu Gln His Tyr Val 
        35                  40                  45              


Thr Met Glu Leu Arg Glu Met Ser Gln Glu Glu Ser Thr Arg Phe Tyr 
    50                  55                  60                  


Asp Gln Leu Asn His His Ile Phe Glu Leu Val Ser Ser Ser Asp Ala 
65                  70                  75                  80  


Asn Glu Arg Lys Gly Gly Ile Leu Ala Ile Ala Ser Leu Ile Gly Val 
                85                  90                  95      


Glu Gly Gly Asn Ala Thr Arg Ile Gly Arg Phe Ala Asn Tyr Leu Arg 
            100                 105                 110         


Asn Leu Leu Pro Ser Asn Asp Pro Val Val Met Glu Met Ala Ser Lys 
        115                 120                 125             


Ala Ile Gly Arg Leu Ala Met Ala Gly Asp Thr Phe Thr Ala Glu Tyr 
    130                 135                 140                 


Val Glu Phe Glu Val Lys Arg Ala Leu Glu Trp Leu Gly Ala Asp Arg 
145                 150                 155                 160 


Asn Glu Gly Arg Arg His Ala Ala Val Leu Val Leu Arg Glu Leu Ala 
                165                 170                 175     


Ile Ser Val Pro Thr Phe Phe Phe Gln Gln Val Gln Pro Phe Phe Asp 
            180                 185                 190         


Asn Ile Phe Val Ala Val Trp Asp Pro Lys Gln Ala Ile Arg Glu Gly 
        195                 200                 205             


Ala Val Ala Ala Leu Arg Ala Cys Leu Ile Leu Thr Thr Gln Arg Glu 
    210                 215                 220                 


Pro Lys Glu Met Gln Lys Pro Gln Trp Tyr Arg His Thr Phe Glu Glu 
225                 230                 235                 240 


Ala Glu Lys Gly Phe Asp Glu Thr Leu Ala Lys Glu Lys Gly Met Asn 
                245                 250                 255     


Arg Asp Asp Arg Ile His Gly Ala Leu Leu Ile Leu Asn Glu Leu Val 
            260                 265                 270         


Arg Ile Ser Ser Met Glu Gly Glu Arg Leu Arg Glu Glu Met Glu Glu 
        275                 280                 285             


Ile Thr Gln Gln Gln Leu Val His Asp Lys Tyr Cys Lys Asp Leu Met 
    290                 295                 300                 


Gly Phe Gly Thr Lys Pro Arg His Ile Thr Pro Phe Thr Ser Phe Gln 
305                 310                 315                 320 


Ala Val Gln Pro Gln Gln Ser Asn Ala Leu Val Gly Leu Leu Gly Tyr 
                325                 330                 335     


Ser Ser His Gln Gly Leu Met Gly Phe Gly Thr Ser Pro Ser Pro Ala 
            340                 345                 350         


Lys Ser Thr Leu Val Glu Ser Arg Cys Cys Arg Asp Leu Met Glu Glu 
        355                 360                 365             


Lys Phe Asp Gln Val Cys Gln Trp Val Leu Lys Cys Arg Asn Ser Lys 
    370                 375                 380                 


Asn Ser Leu Ile Gln Met Thr Ile Leu Asn Leu Leu Pro Arg Leu Ala 
385                 390                 395                 400 


Ala Phe Arg Pro Ser Ala Phe Thr Asp Thr Gln Tyr Leu Gln Asp Thr 
                405                 410                 415     


Met Asn His Val Leu Ser Cys Val Lys Lys Glu Lys Glu Arg Thr Ala 
            420                 425                 430         


Ala Phe Gln Ala Leu Gly Leu Leu Ser Val Ala Val Arg Ser Glu Phe 
        435                 440                 445             


Lys Val Tyr Leu Pro Arg Val Leu Asp Ile Ile Arg Ala Ala Leu Pro 
    450                 455                 460                 


Pro Lys Asp Phe Ala His Lys Arg Gln Lys Ala Met Gln Val Asp Ala 
465                 470                 475                 480 


Thr Val Phe Thr Cys Ile Ser Met Leu Ala Arg Ala Met Gly Pro Gly 
                485                 490                 495     


Ile Gln Gln Asp Ile Lys Glu Leu Leu Glu Pro Met Leu Ala Val Gly 
            500                 505                 510         


Leu Ser Pro Ala Leu Thr Ala Val Leu Tyr Asp Leu Ser Arg Gln Ile 
        515                 520                 525             


Pro Gln Leu Lys Lys Asp Ile Gln Asp Gly Leu Leu Lys Met Leu Ser 
    530                 535                 540                 


Leu Val Leu Met His Lys Pro Leu Arg His Pro Gly Met Pro Lys Gly 
545                 550                 555                 560 


Leu Ala His Gln Leu Ala Ser Pro Gly Leu Thr Thr Leu Pro Glu Ala 
                565                 570                 575     


Ser Asp Val Gly Ser Ile Thr Leu Ala Leu Arg Thr Leu Gly Ser Phe 
            580                 585                 590         


Glu Phe Glu Gly His Ser Leu Thr Gln Phe Val Arg His Cys Ala Asp 
        595                 600                 605             


His Phe Leu Asn Ser Glu His Lys Glu Ile Arg Met Glu Ala Ala Arg 
    610                 615                 620                 


Thr Cys Ser Arg Leu Leu Thr Pro Ser Ile His Leu Ile Ser Gly His 
625                 630                 635                 640 


Ala His Val Val Ser Gln Thr Ala Val Gln Val Val Ala Asp Val Leu 
                645                 650                 655     


Ser Lys Leu Leu Val Val Gly Ile Thr Asp Pro Asp Pro Asp Ile Arg 
            660                 665                 670         


Tyr Cys Val Leu Ala Ser Leu Asp Glu Arg Phe Asp Ala His Leu Ala 
        675                 680                 685             


Gln Ala Glu Asn Leu Gln Ala Leu Phe Val Ala Leu Asn Asp Gln Val 
    690                 695                 700                 


Phe Glu Ile Arg Glu Leu Ala Ile Cys Thr Val Gly Arg Leu Ser Ser 
705                 710                 715                 720 


Met Asn Pro Ala Phe Val Met Pro Phe Leu Arg Lys Met Leu Ile Gln 
                725                 730                 735     


Ile Leu Thr Glu Leu Glu His Ser Gly Ile Gly Arg Ile Lys Glu Gln 
            740                 745                 750         


Ser Ala Arg Met Leu Gly His Leu Val Ser Asn Ala Pro Arg Leu Ile 
        755                 760                 765             


Arg Pro Tyr Met Glu Pro Ile Leu Lys Ala Leu Ile Leu Lys Leu Lys 
    770                 775                 780                 


Asp Pro Asp Pro Asp Pro Asn Pro Gly Val Ile Asn Asn Val Leu Ala 
785                 790                 795                 800 


Thr Ile Gly Glu Leu Ala Gln Val Ser Gly Leu Glu Met Arg Lys Trp 
                805                 810                 815     


Val Asp Glu Leu Phe Ile Ile Ile Met Asp Met Leu Gln Asp Ser Ser 
            820                 825                 830         


Leu Leu Ala Lys Arg Gln Val Ala Leu Trp Thr Leu Gly Gln Leu Val 
        835                 840                 845             


Ala Ser Thr Gly Tyr Val Val Glu Pro Tyr Arg Lys Tyr Pro Thr Leu 
    850                 855                 860                 


Leu Glu Val Leu Leu Asn Phe Leu Lys Thr Glu Gln Asn Gln Gly Thr 
865                 870                 875                 880 


Arg Arg Glu Ala Ile Arg Val Leu Gly Leu Leu Gly Ala Leu Asp Pro 
                885                 890                 895     


Tyr Lys His Lys Val Asn Ile Gly Met Ile Asp Gln Ser Arg Asp Ala 
            900                 905                 910         


Ser Ala Val Ser Leu Ser Glu Ser Lys Ser Ser Gln Asp Ser Ser Asp 
        915                 920                 925             


Tyr Ser Thr Ser Glu Met Leu Val Asn Met Gly Asn Leu Pro Leu Asp 
    930                 935                 940                 


Glu Phe Tyr Pro Ala Val Ser Met Val Ala Leu Met Arg Ile Phe Arg 
945                 950                 955                 960 


Asp Gln Ser Leu Ser His His His Thr Met Val Val Gln Ala Ile Thr 
                965                 970                 975     


Phe Ile Phe Lys Ser Leu Gly Leu Lys Cys Val Gln Phe Leu Pro Gln 
            980                 985                 990         


Val Met Pro Thr Phe Leu Asn Val  Ile Arg Val Cys Asp  Gly Ala Ile 
        995                 1000                 1005             


Arg Glu  Phe Leu Phe Gln Gln  Leu Gly Met Leu Val  Ser Phe Val 
    1010                 1015                 1020             


Lys Ser  His Ile Arg Pro Tyr  Met Asp Glu Ile Val  Thr Leu Met 
    1025                 1030                 1035             


Arg Glu  Phe Trp Val Met Asn  Thr Ser Ile Gln Ser  Thr Ile Ile 
    1040                 1045                 1050             


Leu Leu  Ile Glu Gln Ile Val  Val Ala Leu Gly Gly  Glu Phe Lys 
    1055                 1060                 1065             


Leu Tyr  Leu Pro Gln Leu Ile  Pro His Met Leu Arg  Val Phe Met 
    1070                 1075                 1080             


His Asp  Asn Ser Pro Gly Arg  Ile Val Ser Ile Lys  Leu Leu Ala 
    1085                 1090                 1095             


Ala Ile  Gln Leu Phe Gly Ala  Asn Leu Asp Asp Tyr  Leu His Leu 
    1100                 1105                 1110             


Leu Leu  Pro Pro Ile Val Lys  Leu Phe Asp Ala Pro  Glu Ala Pro 
    1115                 1120                 1125             


Leu Pro  Ser Arg Lys Ala Ala  Leu Glu Thr Val Asp  Arg Leu Thr 
    1130                 1135                 1140             


Glu Ser  Leu Asp Phe Thr Asp  Tyr Ala Ser Arg Ile  Ile His Pro 
    1145                 1150                 1155             


Ile Val  Arg Thr Leu Asp Gln  Ser Pro Glu Leu Arg  Ser Thr Ala 
    1160                 1165                 1170             


Met Asp  Thr Leu Ser Ser Leu  Val Phe Gln Leu Gly  Lys Lys Tyr 
    1175                 1180                 1185             


Gln Ile  Phe Ile Pro Met Val  Asn Lys Val Leu Val  Arg His Arg 
    1190                 1195                 1200             


Ile Asn  His Gln Arg Tyr Asp  Val Leu Ile Cys Arg  Ile Val Lys 
    1205                 1210                 1215             


Gly Tyr  Thr Leu Ala Asp Glu  Glu Glu Asp Pro Leu  Ile Tyr Gln 
    1220                 1225                 1230             


His Arg  Met Leu Arg Ser Gly  Gln Gly Asp Ala Leu  Ala Ser Gly 
    1235                 1240                 1245             


Pro Val  Glu Thr Gly Pro Met  Lys Lys Leu His Val  Ser Thr Ile 
    1250                 1255                 1260             


Asn Leu  Gln Lys Ala Trp Gly  Ala Ala Arg Arg Val  Ser Lys Asp 
    1265                 1270                 1275             


Asp Trp  Leu Glu Trp Leu Arg  Arg Leu Ser Leu Glu  Leu Leu Lys 
    1280                 1285                 1290             


Asp Ser  Ser Ser Pro Ser Leu  Arg Ser Cys Trp Ala  Leu Ala Gln 
    1295                 1300                 1305             


Ala Tyr  Asn Pro Met Ala Arg  Asp Leu Phe Asn Ala  Ala Phe Val 
    1310                 1315                 1320             


Ser Cys  Trp Ser Glu Leu Asn  Glu Asp Gln Gln Asp  Glu Leu Ile 
    1325                 1330                 1335             


Arg Ser  Ile Glu Leu Ala Leu  Thr Ser Gln Asp Ile  Ala Glu Val 
    1340                 1345                 1350             


Thr Gln  Thr Leu Leu Asn Leu  Ala Glu Phe Met Glu  His Ser Asp 
    1355                 1360                 1365             


Lys Gly  Pro Leu Pro Leu Arg  Asp Asp Asn Gly Ile  Val Leu Leu 
    1370                 1375                 1380             


Gly Glu  Arg Ala Ala Lys Cys  Arg Ala Tyr Ala Lys  Ala Leu His 
    1385                 1390                 1395             


Tyr Lys  Glu Leu Glu Phe Gln  Lys Gly Pro Thr Pro  Ala Ile Leu 
    1400                 1405                 1410             


Glu Ser  Leu Ile Ser Ile Asn  Asn Lys Leu Gln Gln  Pro Glu Ala 
    1415                 1420                 1425             


Ala Ala  Gly Val Leu Glu Tyr  Ala Met Lys His Phe  Gly Glu Leu 
    1430                 1435                 1440             


Glu Ile  Gln Ala Thr Trp Tyr  Glu Lys Leu His Glu  Trp Glu Asp 
    1445                 1450                 1455             


Ala Leu  Val Ala Tyr Asp Lys  Lys Met Asp Thr Asn  Lys Asp Asp 
    1460                 1465                 1470             


Pro Glu  Leu Met Leu Gly Arg  Met Arg Cys Leu Glu  Ala Leu Gly 
    1475                 1480                 1485             


Glu Trp  Gly Gln Leu His Gln  Gln Cys Cys Glu Lys  Trp Thr Leu 
    1490                 1495                 1500             


Val Asn  Asp Glu Thr Gln Ala  Lys Met Ala Arg Met  Ala Ala Ala 
    1505                 1510                 1515             


Ala Ala  Trp Gly Leu Gly Gln  Trp Asp Ser Met Glu  Glu Tyr Thr 
    1520                 1525                 1530             


Cys Met  Ile Pro Arg Asp Thr  His Asp Gly Ala Phe  Tyr Arg Ala 
    1535                 1540                 1545             


Val Leu  Ala Leu His Gln Asp  Leu Phe Ser Leu Ala  Gln Gln Cys 
    1550                 1555                 1560             


Ile Asp  Lys Ala Arg Asp Leu  Leu Asp Ala Glu Leu  Thr Ala Met 
    1565                 1570                 1575             


Ala Gly  Glu Ser Tyr Ser Arg  Ala Tyr Gly Ala Met  Val Ser Cys 
    1580                 1585                 1590             


His Met  Leu Ser Glu Leu Glu  Glu Val Ile Gln Tyr  Lys Leu Val 
    1595                 1600                 1605             


Pro Glu  Arg Arg Glu Ile Ile  Arg Gln Ile Trp Trp  Glu Arg Leu 
    1610                 1615                 1620             


Gln Gly  Cys Gln Arg Ile Val  Glu Asp Trp Gln Lys  Ile Leu Met 
    1625                 1630                 1635             


Val Arg  Ser Leu Val Val Ser  Pro His Glu Asp Met  Arg Thr Trp 
    1640                 1645                 1650             


Leu Lys  Tyr Ala Ser Leu Cys  Gly Lys Ser Gly Arg  Leu Ala Leu 
    1655                 1660                 1665             


Ala His  Lys Thr Leu Val Leu  Leu Leu Gly Val Asp  Pro Ser Arg 
    1670                 1675                 1680             


Gln Leu  Asp His Pro Leu Pro  Thr Val His Pro Gln  Val Thr Tyr 
    1685                 1690                 1695             


Ala Tyr  Met Lys Asn Met Trp  Lys Ser Ala Arg Lys  Ile Asp Ala 
    1700                 1705                 1710             


Phe Gln  His Met Gln His Phe  Val Gln Thr Met Gln  Gln Gln Ala 
    1715                 1720                 1725             


Gln His  Ala Ile Ala Thr Glu  Asp Gln Gln His Lys  Gln Glu Leu 
    1730                 1735                 1740             


His Lys  Leu Met Ala Arg Cys  Phe Leu Lys Leu Gly  Glu Trp Gln 
    1745                 1750                 1755             


Leu Asn  Leu Gln Gly Ile Asn  Glu Ser Thr Ile Pro  Lys Val Leu 
    1760                 1765                 1770             


Gln Tyr  Tyr Ser Ala Ala Thr  Glu His Asp Arg Ser  Trp Tyr Lys 
    1775                 1780                 1785             


Ala Trp  His Ala Trp Ala Val  Met Asn Phe Glu Ala  Val Leu His 
    1790                 1795                 1800             


Tyr Lys  His Gln Asn Gln Ala  Arg Asp Glu Lys Lys  Lys Leu Arg 
    1805                 1810                 1815             


His Ala  Ser Gly Ala Asn Ile  Thr Asn Ala Thr Thr  Ala Ala Thr 
    1820                 1825                 1830             


Thr Ala  Ala Thr Ala Thr Thr  Thr Ala Ser Thr Glu  Gly Ser Asn 
    1835                 1840                 1845             


Ser Glu  Ser Glu Ala Glu Ser  Thr Glu Asn Ser Pro  Thr Pro Ser 
    1850                 1855                 1860             


Pro Leu  Gln Lys Lys Val Thr  Glu Asp Leu Ser Lys  Thr Leu Leu 
    1865                 1870                 1875             


Met Tyr  Thr Val Pro Ala Val  Gln Gly Phe Phe Arg  Ser Ile Ser 
    1880                 1885                 1890             


Leu Ser  Arg Gly Asn Asn Leu  Gln Asp Thr Leu Arg  Val Leu Thr 
    1895                 1900                 1905             


Leu Trp  Phe Asp Tyr Gly His  Trp Pro Asp Val Asn  Glu Ala Leu 
    1910                 1915                 1920             


Val Glu  Gly Val Lys Ala Ile  Gln Ile Asp Thr Trp  Leu Gln Val 
    1925                 1930                 1935             


Ile Pro  Gln Leu Ile Ala Arg  Ile Asp Thr Pro Arg  Pro Leu Val 
    1940                 1945                 1950             


Gly Arg  Leu Ile His Gln Leu  Leu Thr Asp Ile Gly  Arg Tyr His 
    1955                 1960                 1965             


Pro Gln  Ala Leu Ile Tyr Pro  Leu Thr Val Ala Ser  Lys Ser Thr 
    1970                 1975                 1980             


Thr Thr  Ala Arg His Asn Ala  Ala Asn Lys Ile Leu  Lys Asn Met 
    1985                 1990                 1995             


Cys Glu  His Ser Asn Thr Leu  Val Gln Gln Ala Met  Met Val Ser 
    2000                 2005                 2010             


Glu Glu  Leu Ile Arg Val Ala  Ile Leu Trp His Glu  Met Trp His 
    2015                 2020                 2025             


Glu Gly  Leu Glu Glu Ala Ser  Arg Leu Tyr Phe Gly  Glu Arg Asn 
    2030                 2035                 2040             


Val Lys  Gly Met Phe Glu Val  Leu Glu Pro Leu His  Ala Met Met 
    2045                 2050                 2055             


Glu Arg  Gly Pro Gln Thr Leu  Lys Glu Thr Ser Phe  Asn Gln Ala 
    2060                 2065                 2070             


Tyr Gly  Arg Asp Leu Met Glu  Ala Gln Glu Trp Cys  Arg Lys Tyr 
    2075                 2080                 2085             


Met Lys  Ser Gly Asn Val Lys  Asp Leu Thr Gln Ala  Trp Asp Leu 
    2090                 2095                 2100             


Tyr Tyr  His Val Phe Arg Arg  Ile Ser Lys Gln Leu  Pro Gln Leu 
    2105                 2110                 2115             


Thr Ser  Leu Glu Leu Gln Tyr  Val Ser Pro Lys Leu  Leu Met Cys 
    2120                 2125                 2130             


Arg Asp  Leu Glu Leu Ala Val  Pro Gly Thr Tyr Asp  Pro Asn Gln 
    2135                 2140                 2145             


Pro Ile  Ile Arg Ile Gln Ser  Ile Ala Pro Ser Leu  Gln Val Ile 
    2150                 2155                 2160             


Thr Ser  Lys Gln Arg Pro Arg  Lys Leu Thr Leu Met  Gly Ser Asn 
    2165                 2170                 2175             


Gly His  Glu Phe Val Phe Leu  Leu Lys Gly His Glu  Asp Leu Arg 
    2180                 2185                 2190             


Gln Asp  Glu Arg Val Met Gln  Leu Phe Gly Leu Val  Asn Thr Leu 
    2195                 2200                 2205             


Leu Ala  Asn Asp Pro Thr Ser  Leu Arg Lys Asn Leu  Ser Ile Gln 
    2210                 2215                 2220             


Arg Tyr  Ala Val Ile Pro Leu  Ser Thr Asn Ser Gly  Leu Ile Gly 
    2225                 2230                 2235             


Trp Val  Pro His Cys Asp Thr  Leu His Ala Leu Ile  Arg Asp Tyr 
    2240                 2245                 2250             


Arg Glu  Lys Lys Lys Ile Leu  Leu Asn Ile Glu His  Arg Ile Met 
    2255                 2260                 2265             


Leu Arg  Met Ala Pro Asp Tyr  Asp His Leu Thr Leu  Met Gln Lys 
    2270                 2275                 2280             


Val Glu  Val Phe Glu His Ala  Val Asn Asn Thr Ala  Gly Asp Asp 
    2285                 2290                 2295             


Leu Ala  Lys Leu Leu Trp Leu  Lys Ser Pro Ser Ser  Glu Val Trp 
    2300                 2305                 2310             


Phe Asp  Arg Arg Thr Asn Tyr  Thr Arg Ser Leu Ala  Val Met Ser 
    2315                 2320                 2325             


Met Val  Gly Tyr Ile Leu Gly  Leu Gly Asp Arg His  Pro Ser Asn 
    2330                 2335                 2340             


Leu Met  Leu Asp Arg Leu Ser  Gly Lys Ile Leu His  Ile Asp Phe 
    2345                 2350                 2355             


Gly Asp  Cys Phe Glu Val Ala  Met Thr Arg Glu Lys  Phe Pro Glu 
    2360                 2365                 2370             


Lys Ile  Pro Phe Arg Leu Thr  Arg Met Leu Thr Asn  Ala Met Glu 
    2375                 2380                 2385             


Val Thr  Gly Leu Asp Gly Asn  Tyr Arg Ile Thr Cys  His Thr Val 
    2390                 2395                 2400             


Met Glu  Val Leu Arg Glu His  Lys Asp Ser Val Met  Ala Val Leu 
    2405                 2410                 2415             


Glu Ala  Phe Val Tyr Asp Pro  Leu Leu Asn Trp Arg  Leu Met Asp 
    2420                 2425                 2430             


Thr Asn  Thr Lys Gly Asn Lys  Arg Ser Arg Thr Arg  Thr Asp Ser 
    2435                 2440                 2445             


Tyr Ser  Ala Gly Gln Ser Val  Glu Ile Leu Asp Gly  Val Glu Leu 
    2450                 2455                 2460             


Gly Glu  Pro Ala His Lys Lys  Thr Gly Thr Thr Val  Pro Glu Ser 
    2465                 2470                 2475             


Ile His  Ser Phe Ile Gly Asp  Gly Leu Val Lys Pro  Glu Ala Leu 
    2480                 2485                 2490             


Asn Lys  Lys Ala Ile Gln Ile  Ile Asn Arg Val Arg  Asp Lys Leu 
    2495                 2500                 2505             


Thr Gly  Arg Asp Phe Ser His  Asp Asp Thr Leu Asp  Val Pro Thr 
    2510                 2515                 2520             


Gln Val  Glu Leu Leu Ile Lys  Gln Ala Thr Ser His  Glu Asn Leu 
    2525                 2530                 2535             


Cys Gln  Cys Tyr Ile Gly Trp  Cys Pro Phe Trp 
    2540                 2545                 


<210>  34
<211>  4131
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human SRC proto-oncogene, non-receptor tyrosine kinase (SRC), 
       transcript variant 1 mRNA, GenBank Accession No.: NM_005417.4, 
       GI: 520262038

<400>  34
caaacaagtg cggccatttc accagcccag gctggcttct gctgttgact ggctgtggca       60

cctcaagcag cccctttccc ctctagcctc agtttatcac cgcaagagct accattcatc      120

tagcacaacc tgaccatcct cacactggtc agttccaacc ttcccaggaa tcttctgtgg      180

ccatgttcac tccggtttta cagaacagag aacagaagct cagagaagtg aagcaacttg      240

cccagctatg agagacagag ccaggatttg aaaccagatg aggacgctga ggcccagaga      300

gggaaagcca cttgcctagg gacacacagc ggggagaggt ggagcagggc ctctatttcg      360

agacccctga ctccacacct ggtgtttgtg ccaagacccc aggctgcctc ccaggtcctc      420

tgggacagcc cctgccttct accaggacca tgggtagcaa caagagcaag cccaaggatg      480

ccagccagcg gcgccgcagc ctggagcccg ccgagaacgt gcacggcgct ggcgggggcg      540

ctttccccgc ctcgcagacc cccagcaagc cagcctcggc cgacggccac cgcggcccca      600

gcgcggcctt cgcccccgcg gccgccgagc ccaagctgtt cggaggcttc aactcctcgg      660

acaccgtcac ctccccgcag agggcgggcc cgctggccgg tggagtgacc acctttgtgg      720

ccctctatga ctatgagtct aggacggaga cagacctgtc cttcaagaaa ggcgagcggc      780

tccagattgt caacaacaca gagggagact ggtggctggc ccactcgctc agcacaggac      840

agacaggcta catccccagc aactacgtgg cgccctccga ctccatccag gctgaggagt      900

ggtattttgg caagatcacc agacgggagt cagagcggtt actgctcaat gcagagaacc      960

cgagagggac cttcctcgtg cgagaaagtg agaccacgaa aggtgcctac tgcctctcag     1020

tgtctgactt cgacaacgcc aagggcctca acgtgaagca ctacaagatc cgcaagctgg     1080

acagcggcgg cttctacatc acctcccgca cccagttcaa cagcctgcag cagctggtgg     1140

cctactactc caaacacgcc gatggcctgt gccaccgcct caccaccgtg tgccccacgt     1200

ccaagccgca gactcagggc ctggccaagg atgcctggga gatccctcgg gagtcgctgc     1260

ggctggaggt caagctgggc cagggctgct ttggcgaggt gtggatgggg acctggaacg     1320

gtaccaccag ggtggccatc aaaaccctga agcctggcac gatgtctcca gaggccttcc     1380

tgcaggaggc ccaggtcatg aagaagctga ggcatgagaa gctggtgcag ttgtatgctg     1440

tggtttcaga ggagcccatt tacatcgtca cggagtacat gagcaagggg agtttgctgg     1500

actttctcaa gggggagaca ggcaagtacc tgcggctgcc tcagctggtg gacatggctg     1560

ctcagatcgc ctcaggcatg gcgtacgtgg agcggatgaa ctacgtccac cgggaccttc     1620

gtgcagccaa catcctggtg ggagagaacc tggtgtgcaa agtggccgac tttgggctgg     1680

ctcggctcat tgaagacaat gagtacacgg cgcggcaagg tgccaaattc cccatcaagt     1740

ggacggctcc agaagctgcc ctctatggcc gcttcaccat caagtcggac gtgtggtcct     1800

tcgggatcct gctgactgag ctcaccacaa agggacgggt gccctaccct gggatggtga     1860

accgcgaggt gctggaccag gtggagcggg gctaccggat gccctgcccg ccggagtgtc     1920

ccgagtccct gcacgacctc atgtgccagt gctggcggaa ggagcctgag gagcggccca     1980

ccttcgagta cctgcaggcc ttcctggagg actacttcac gtccaccgag ccccagtacc     2040

agcccgggga gaacctctag gcacaggcgg gcccagaccg gcttctcggc ttggatcctg     2100

ggctgggtgg cccctgtctc ggggcttgcc ccactctgcc tgcctgctgt tggtcctctc     2160

tctgtggggc tgaattgcca ggggcgaggc ccttcctctt tggtggcatg gaaggggctt     2220

ctggacctag ggtggcctga gagggcggtg ggtatgcgag accagcacgg tgactctgtc     2280

cagctcccgc tgtggccgca cgcctctccc tgcactccct cctggagctc tgtgggtctc     2340

tggaagagga accaggagaa gggctggggc cggggctgag ggtgcccttt tccagcctca     2400

gcctactccg ctcactgaac tccttcccca cttctgtgcc acccccggtc tatgtcgaga     2460

gctggccaaa gagcctttcc aaagaggagc gatgggcccc tggccccgcc tgcctgccac     2520

cctgcccctt gccatccatt ctggaaacac ctgtaggcag aggctgccga gacagaccct     2580

ctgccgctgc ttccaggctg ggcagcacaa ggccttgcct ggcctgatga tggtgggtgg     2640

gtgggatgag taccccctca aaccctgccc tccttagacc tgagggaccc ttcgagatca     2700

tcacttcctt gcccccattt cacccatggg gagacagttg agagcgggga tgtgacatgc     2760

ccaaggccac ggagcagttc agagtggagg cgggcttgga acccggtgct ccctctgtca     2820

tcctcaggaa ccaacaattc gtcggaggca tcatggaaag actgggacag cccaggaaac     2880

aaggggtctg aggatgcatt cgagatggca gattcccact gccgctgccc gctcagccca     2940

gctgttggga acagcatgga ggcagatgtg gggctgagct ggggaatcag ggtaaaaggt     3000

gcaggtgtgg agagagaggc ttcaatcggc ttgtgggtga tgtttgacct tcagagccag     3060

ccggctatga aagggagcga gcccctcggc tctggaggca atcaagcaga catagaagag     3120

ccaagagtcc aggaggccct ggtcctggcc tccttccccg tactttgtcc cgtggcattt     3180

caattcctgg ccctgttctc ctccccaagt cggcaccctt taactcatga ggagggaaaa     3240

gagtgcctaa gcgggggtga aagaggacgt gttacccact gccatgcacc aggactggct     3300

gtgtaacctt gggtggcccc tgctgtctct ctgggctgca gagtctgccc cacatgtggc     3360

catggcctct gcaactgctc agctctggtc caggccctgt ggcaggacac acatggtgag     3420

cctagccctg ggacatcagg agactgggct ctggctctgt tcggcctttg ggtgtgtggt     3480

ggattctccc tgggcctcag tgtgcccatc tgtaaagggg cagctgacag tttgtggcat     3540

cttgccaagg gtccctgtgt gtgtgtatgt gtgtgcatgt gtgcgtgtct ccatgtgcgt     3600

ccatatttaa catgtaaaaa tgtccccccc gctccgtccc ccaaacatgt tgtacatttc     3660

accatggccc cctcatcata gcaataacat tcccactgcc aggggttctt gagccagcca     3720

ggccctgcca gtggggaagg aggccaagca gtgcctgcct atgaaatttc aacttttcct     3780

ttcatacgtc tttattaccc aagtcttctc ccgtccattc cagtcaaatc tgggctcact     3840

caccccagcg agctctcaaa tccctctcca actgcctaag gccctttgtg taaggtgtct     3900

taatactgtc cttttttttt ttttaacagt gttttgtaga tttcagatga ctatgcagag     3960

gcctggggga cccctggctc tgggccgggc ctggggctcc gaaattccaa ggcccagact     4020

tgcggggggt gggggggtat ccagaattgg ttgtaaatac tttgcatatt gtctgattaa     4080

acacaaacag acctcagaat ctgatcaaca gttaaaaaaa aaaaaaaaaa a              4131


<210>  35
<211>  536
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human SRC proto-oncogene, non-receptor tyrosine kinase (SRC), 
       polypeptide encoded by transcript variant 1, GenBank Accession 
       No.: NP_005408.1, GI: 4885609

<400>  35

Met Gly Ser Asn Lys Ser Lys Pro Lys Asp Ala Ser Gln Arg Arg Arg 
1               5                   10                  15      


Ser Leu Glu Pro Ala Glu Asn Val His Gly Ala Gly Gly Gly Ala Phe 
            20                  25                  30          


Pro Ala Ser Gln Thr Pro Ser Lys Pro Ala Ser Ala Asp Gly His Arg 
        35                  40                  45              


Gly Pro Ser Ala Ala Phe Ala Pro Ala Ala Ala Glu Pro Lys Leu Phe 
    50                  55                  60                  


Gly Gly Phe Asn Ser Ser Asp Thr Val Thr Ser Pro Gln Arg Ala Gly 
65                  70                  75                  80  


Pro Leu Ala Gly Gly Val Thr Thr Phe Val Ala Leu Tyr Asp Tyr Glu 
                85                  90                  95      


Ser Arg Thr Glu Thr Asp Leu Ser Phe Lys Lys Gly Glu Arg Leu Gln 
            100                 105                 110         


Ile Val Asn Asn Thr Glu Gly Asp Trp Trp Leu Ala His Ser Leu Ser 
        115                 120                 125             


Thr Gly Gln Thr Gly Tyr Ile Pro Ser Asn Tyr Val Ala Pro Ser Asp 
    130                 135                 140                 


Ser Ile Gln Ala Glu Glu Trp Tyr Phe Gly Lys Ile Thr Arg Arg Glu 
145                 150                 155                 160 


Ser Glu Arg Leu Leu Leu Asn Ala Glu Asn Pro Arg Gly Thr Phe Leu 
                165                 170                 175     


Val Arg Glu Ser Glu Thr Thr Lys Gly Ala Tyr Cys Leu Ser Val Ser 
            180                 185                 190         


Asp Phe Asp Asn Ala Lys Gly Leu Asn Val Lys His Tyr Lys Ile Arg 
        195                 200                 205             


Lys Leu Asp Ser Gly Gly Phe Tyr Ile Thr Ser Arg Thr Gln Phe Asn 
    210                 215                 220                 


Ser Leu Gln Gln Leu Val Ala Tyr Tyr Ser Lys His Ala Asp Gly Leu 
225                 230                 235                 240 


Cys His Arg Leu Thr Thr Val Cys Pro Thr Ser Lys Pro Gln Thr Gln 
                245                 250                 255     


Gly Leu Ala Lys Asp Ala Trp Glu Ile Pro Arg Glu Ser Leu Arg Leu 
            260                 265                 270         


Glu Val Lys Leu Gly Gln Gly Cys Phe Gly Glu Val Trp Met Gly Thr 
        275                 280                 285             


Trp Asn Gly Thr Thr Arg Val Ala Ile Lys Thr Leu Lys Pro Gly Thr 
    290                 295                 300                 


Met Ser Pro Glu Ala Phe Leu Gln Glu Ala Gln Val Met Lys Lys Leu 
305                 310                 315                 320 


Arg His Glu Lys Leu Val Gln Leu Tyr Ala Val Val Ser Glu Glu Pro 
                325                 330                 335     


Ile Tyr Ile Val Thr Glu Tyr Met Ser Lys Gly Ser Leu Leu Asp Phe 
            340                 345                 350         


Leu Lys Gly Glu Thr Gly Lys Tyr Leu Arg Leu Pro Gln Leu Val Asp 
        355                 360                 365             


Met Ala Ala Gln Ile Ala Ser Gly Met Ala Tyr Val Glu Arg Met Asn 
    370                 375                 380                 


Tyr Val His Arg Asp Leu Arg Ala Ala Asn Ile Leu Val Gly Glu Asn 
385                 390                 395                 400 


Leu Val Cys Lys Val Ala Asp Phe Gly Leu Ala Arg Leu Ile Glu Asp 
                405                 410                 415     


Asn Glu Tyr Thr Ala Arg Gln Gly Ala Lys Phe Pro Ile Lys Trp Thr 
            420                 425                 430         


Ala Pro Glu Ala Ala Leu Tyr Gly Arg Phe Thr Ile Lys Ser Asp Val 
        435                 440                 445             


Trp Ser Phe Gly Ile Leu Leu Thr Glu Leu Thr Thr Lys Gly Arg Val 
    450                 455                 460                 


Pro Tyr Pro Gly Met Val Asn Arg Glu Val Leu Asp Gln Val Glu Arg 
465                 470                 475                 480 


Gly Tyr Arg Met Pro Cys Pro Pro Glu Cys Pro Glu Ser Leu His Asp 
                485                 490                 495     


Leu Met Cys Gln Cys Trp Arg Lys Glu Pro Glu Glu Arg Pro Thr Phe 
            500                 505                 510         


Glu Tyr Leu Gln Ala Phe Leu Glu Asp Tyr Phe Thr Ser Thr Glu Pro 
        515                 520                 525             


Gln Tyr Gln Pro Gly Glu Asn Leu 
    530                 535     


<210>  36
<211>  4056
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human SRC proto-oncogene, non-receptor tyrosine kinase (SRC), 
       transcript variant 2 mRNA, GenBank Accession No.: NM_198291.2, 
       GI: 520262048

<400>  36
ggcccggaat ccattccggc ctgggagccg gagcggccag gccgccgtct gcccgtcccg       60

ctggacgtcc cgcggtccgc cctcccgtgc gtccgtctgc cggtgagccc gcccgcccgc      120

cggcccagaa cagagaacag aagctcagag aagtgaagca acttgcccag ctatgagaga      180

cagagccagg atttgaaacc agatgaggac gctgaggccc agagagggaa agccacttgc      240

ctagggacac acagcgggga gaggtggagc agggcctcta tttcgagacc cctgactcca      300

cacctggtgt ttgtgccaag accccaggct gcctcccagg tcctctggga cagcccctgc      360

cttctaccag gaccatgggt agcaacaaga gcaagcccaa ggatgccagc cagcggcgcc      420

gcagcctgga gcccgccgag aacgtgcacg gcgctggcgg gggcgctttc cccgcctcgc      480

agacccccag caagccagcc tcggccgacg gccaccgcgg ccccagcgcg gccttcgccc      540

ccgcggccgc cgagcccaag ctgttcggag gcttcaactc ctcggacacc gtcacctccc      600

cgcagagggc gggcccgctg gccggtggag tgaccacctt tgtggccctc tatgactatg      660

agtctaggac ggagacagac ctgtccttca agaaaggcga gcggctccag attgtcaaca      720

acacagaggg agactggtgg ctggcccact cgctcagcac aggacagaca ggctacatcc      780

ccagcaacta cgtggcgccc tccgactcca tccaggctga ggagtggtat tttggcaaga      840

tcaccagacg ggagtcagag cggttactgc tcaatgcaga gaacccgaga gggaccttcc      900

tcgtgcgaga aagtgagacc acgaaaggtg cctactgcct ctcagtgtct gacttcgaca      960

acgccaaggg cctcaacgtg aagcactaca agatccgcaa gctggacagc ggcggcttct     1020

acatcacctc ccgcacccag ttcaacagcc tgcagcagct ggtggcctac tactccaaac     1080

acgccgatgg cctgtgccac cgcctcacca ccgtgtgccc cacgtccaag ccgcagactc     1140

agggcctggc caaggatgcc tgggagatcc ctcgggagtc gctgcggctg gaggtcaagc     1200

tgggccaggg ctgctttggc gaggtgtgga tggggacctg gaacggtacc accagggtgg     1260

ccatcaaaac cctgaagcct ggcacgatgt ctccagaggc cttcctgcag gaggcccagg     1320

tcatgaagaa gctgaggcat gagaagctgg tgcagttgta tgctgtggtt tcagaggagc     1380

ccatttacat cgtcacggag tacatgagca aggggagttt gctggacttt ctcaaggggg     1440

agacaggcaa gtacctgcgg ctgcctcagc tggtggacat ggctgctcag atcgcctcag     1500

gcatggcgta cgtggagcgg atgaactacg tccaccggga ccttcgtgca gccaacatcc     1560

tggtgggaga gaacctggtg tgcaaagtgg ccgactttgg gctggctcgg ctcattgaag     1620

acaatgagta cacggcgcgg caaggtgcca aattccccat caagtggacg gctccagaag     1680

ctgccctcta tggccgcttc accatcaagt cggacgtgtg gtccttcggg atcctgctga     1740

ctgagctcac cacaaaggga cgggtgccct accctgggat ggtgaaccgc gaggtgctgg     1800

accaggtgga gcggggctac cggatgccct gcccgccgga gtgtcccgag tccctgcacg     1860

acctcatgtg ccagtgctgg cggaaggagc ctgaggagcg gcccaccttc gagtacctgc     1920

aggccttcct ggaggactac ttcacgtcca ccgagcccca gtaccagccc ggggagaacc     1980

tctaggcaca ggcgggccca gaccggcttc tcggcttgga tcctgggctg ggtggcccct     2040

gtctcggggc ttgccccact ctgcctgcct gctgttggtc ctctctctgt ggggctgaat     2100

tgccaggggc gaggcccttc ctctttggtg gcatggaagg ggcttctgga cctagggtgg     2160

cctgagaggg cggtgggtat gcgagaccag cacggtgact ctgtccagct cccgctgtgg     2220

ccgcacgcct ctccctgcac tccctcctgg agctctgtgg gtctctggaa gaggaaccag     2280

gagaagggct ggggccgggg ctgagggtgc ccttttccag cctcagccta ctccgctcac     2340

tgaactcctt ccccacttct gtgccacccc cggtctatgt cgagagctgg ccaaagagcc     2400

tttccaaaga ggagcgatgg gcccctggcc ccgcctgcct gccaccctgc cccttgccat     2460

ccattctgga aacacctgta ggcagaggct gccgagacag accctctgcc gctgcttcca     2520

ggctgggcag cacaaggcct tgcctggcct gatgatggtg ggtgggtggg atgagtaccc     2580

cctcaaaccc tgccctcctt agacctgagg gacccttcga gatcatcact tccttgcccc     2640

catttcaccc atggggagac agttgagagc ggggatgtga catgcccaag gccacggagc     2700

agttcagagt ggaggcgggc ttggaacccg gtgctccctc tgtcatcctc aggaaccaac     2760

aattcgtcgg aggcatcatg gaaagactgg gacagcccag gaaacaaggg gtctgaggat     2820

gcattcgaga tggcagattc ccactgccgc tgcccgctca gcccagctgt tgggaacagc     2880

atggaggcag atgtggggct gagctgggga atcagggtaa aaggtgcagg tgtggagaga     2940

gaggcttcaa tcggcttgtg ggtgatgttt gaccttcaga gccagccggc tatgaaaggg     3000

agcgagcccc tcggctctgg aggcaatcaa gcagacatag aagagccaag agtccaggag     3060

gccctggtcc tggcctcctt ccccgtactt tgtcccgtgg catttcaatt cctggccctg     3120

ttctcctccc caagtcggca ccctttaact catgaggagg gaaaagagtg cctaagcggg     3180

ggtgaaagag gacgtgttac ccactgccat gcaccaggac tggctgtgta accttgggtg     3240

gcccctgctg tctctctggg ctgcagagtc tgccccacat gtggccatgg cctctgcaac     3300

tgctcagctc tggtccaggc cctgtggcag gacacacatg gtgagcctag ccctgggaca     3360

tcaggagact gggctctggc tctgttcggc ctttgggtgt gtggtggatt ctccctgggc     3420

ctcagtgtgc ccatctgtaa aggggcagct gacagtttgt ggcatcttgc caagggtccc     3480

tgtgtgtgtg tatgtgtgtg catgtgtgcg tgtctccatg tgcgtccata tttaacatgt     3540

aaaaatgtcc cccccgctcc gtcccccaaa catgttgtac atttcaccat ggccccctca     3600

tcatagcaat aacattccca ctgccagggg ttcttgagcc agccaggccc tgccagtggg     3660

gaaggaggcc aagcagtgcc tgcctatgaa atttcaactt ttcctttcat acgtctttat     3720

tacccaagtc ttctcccgtc cattccagtc aaatctgggc tcactcaccc cagcgagctc     3780

tcaaatccct ctccaactgc ctaaggccct ttgtgtaagg tgtcttaata ctgtcctttt     3840

ttttttttta acagtgtttt gtagatttca gatgactatg cagaggcctg ggggacccct     3900

ggctctgggc cgggcctggg gctccgaaat tccaaggccc agacttgcgg ggggtggggg     3960

ggtatccaga attggttgta aatactttgc atattgtctg attaaacaca aacagacctc     4020

agaatctgat caacagttaa aaaaaaaaaa aaaaaa                               4056


<210>  37
<211>  536
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human SRC proto-oncogene, non-receptor tyrosine kinase (SRC), 
       polypeptide encoded by transcript variant 2, GenBank Accession 
       No.: NP_938033.1, GI: 38202217

<400>  37

Met Gly Ser Asn Lys Ser Lys Pro Lys Asp Ala Ser Gln Arg Arg Arg 
1               5                   10                  15      


Ser Leu Glu Pro Ala Glu Asn Val His Gly Ala Gly Gly Gly Ala Phe 
            20                  25                  30          


Pro Ala Ser Gln Thr Pro Ser Lys Pro Ala Ser Ala Asp Gly His Arg 
        35                  40                  45              


Gly Pro Ser Ala Ala Phe Ala Pro Ala Ala Ala Glu Pro Lys Leu Phe 
    50                  55                  60                  


Gly Gly Phe Asn Ser Ser Asp Thr Val Thr Ser Pro Gln Arg Ala Gly 
65                  70                  75                  80  


Pro Leu Ala Gly Gly Val Thr Thr Phe Val Ala Leu Tyr Asp Tyr Glu 
                85                  90                  95      


Ser Arg Thr Glu Thr Asp Leu Ser Phe Lys Lys Gly Glu Arg Leu Gln 
            100                 105                 110         


Ile Val Asn Asn Thr Glu Gly Asp Trp Trp Leu Ala His Ser Leu Ser 
        115                 120                 125             


Thr Gly Gln Thr Gly Tyr Ile Pro Ser Asn Tyr Val Ala Pro Ser Asp 
    130                 135                 140                 


Ser Ile Gln Ala Glu Glu Trp Tyr Phe Gly Lys Ile Thr Arg Arg Glu 
145                 150                 155                 160 


Ser Glu Arg Leu Leu Leu Asn Ala Glu Asn Pro Arg Gly Thr Phe Leu 
                165                 170                 175     


Val Arg Glu Ser Glu Thr Thr Lys Gly Ala Tyr Cys Leu Ser Val Ser 
            180                 185                 190         


Asp Phe Asp Asn Ala Lys Gly Leu Asn Val Lys His Tyr Lys Ile Arg 
        195                 200                 205             


Lys Leu Asp Ser Gly Gly Phe Tyr Ile Thr Ser Arg Thr Gln Phe Asn 
    210                 215                 220                 


Ser Leu Gln Gln Leu Val Ala Tyr Tyr Ser Lys His Ala Asp Gly Leu 
225                 230                 235                 240 


Cys His Arg Leu Thr Thr Val Cys Pro Thr Ser Lys Pro Gln Thr Gln 
                245                 250                 255     


Gly Leu Ala Lys Asp Ala Trp Glu Ile Pro Arg Glu Ser Leu Arg Leu 
            260                 265                 270         


Glu Val Lys Leu Gly Gln Gly Cys Phe Gly Glu Val Trp Met Gly Thr 
        275                 280                 285             


Trp Asn Gly Thr Thr Arg Val Ala Ile Lys Thr Leu Lys Pro Gly Thr 
    290                 295                 300                 


Met Ser Pro Glu Ala Phe Leu Gln Glu Ala Gln Val Met Lys Lys Leu 
305                 310                 315                 320 


Arg His Glu Lys Leu Val Gln Leu Tyr Ala Val Val Ser Glu Glu Pro 
                325                 330                 335     


Ile Tyr Ile Val Thr Glu Tyr Met Ser Lys Gly Ser Leu Leu Asp Phe 
            340                 345                 350         


Leu Lys Gly Glu Thr Gly Lys Tyr Leu Arg Leu Pro Gln Leu Val Asp 
        355                 360                 365             


Met Ala Ala Gln Ile Ala Ser Gly Met Ala Tyr Val Glu Arg Met Asn 
    370                 375                 380                 


Tyr Val His Arg Asp Leu Arg Ala Ala Asn Ile Leu Val Gly Glu Asn 
385                 390                 395                 400 


Leu Val Cys Lys Val Ala Asp Phe Gly Leu Ala Arg Leu Ile Glu Asp 
                405                 410                 415     


Asn Glu Tyr Thr Ala Arg Gln Gly Ala Lys Phe Pro Ile Lys Trp Thr 
            420                 425                 430         


Ala Pro Glu Ala Ala Leu Tyr Gly Arg Phe Thr Ile Lys Ser Asp Val 
        435                 440                 445             


Trp Ser Phe Gly Ile Leu Leu Thr Glu Leu Thr Thr Lys Gly Arg Val 
    450                 455                 460                 


Pro Tyr Pro Gly Met Val Asn Arg Glu Val Leu Asp Gln Val Glu Arg 
465                 470                 475                 480 


Gly Tyr Arg Met Pro Cys Pro Pro Glu Cys Pro Glu Ser Leu His Asp 
                485                 490                 495     


Leu Met Cys Gln Cys Trp Arg Lys Glu Pro Glu Glu Arg Pro Thr Phe 
            500                 505                 510         


Glu Tyr Leu Gln Ala Phe Leu Glu Asp Tyr Phe Thr Ser Thr Glu Pro 
        515                 520                 525             


Gln Tyr Gln Pro Gly Glu Asn Leu 
    530                 535     


<210>  38
<211>  3791
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human focal adhesion kinase (FAK) mRNA, GenBank Accession No.: 
       L13616.1, GI: 439874

<400>  38
cgaccactgt gagcccgcgg cgtgaggcgt cgggaggaag cgcggctgct gtcgcccagc       60

gccgccccgt cgtcgtctgc cttcgcttca cggcgccgag ccgcggtccg agcagaactg      120

gggctccctt gcatcttcca gttacaaatt cagtgccttc tgcagtttcc ccagagctcc      180

tcaagaataa cggaagggag aatatgacag atacctagca tctagcaaaa taatggcagc      240

tgcttacctt gaccccaact tgaatcacac accaaattcg agtactaaga ctcacctggg      300

tactggtatg gaacgttctc ctggtgcaat ggagcgagta ttaaaggtct ttcattattt      360

tgaaagcaat agtgagccaa ccacctgggc cagtattatc aggcatggag atgctactga      420

tgtcaggggc atcattcaga agatagtgga cagtcacaaa gtaaagcatg tggcctgcta      480

tggattccgc ctcagtcacc tgcggtcaga ggaggttcac tggcttcacg tggatatggg      540

cgtctccagt gtgagggaga agtatgagct tgctcaccca ccagaggagt ggaaatatga      600

attgagaatt cgttatttgc caaaaggatt tctaaaccag tttactgaag ataagccaac      660

tttgaatttc ttctatcaac aggtgaagag cgattatatg ttagagatag ctgatcaagt      720

ggaccaggaa attgctttga agttgggttg tctagaaata cggcgatcat actgggagat      780

gcggggcaat gcactagaaa agaagtctaa ctatgaagta ttagaaaaag atgttggttt      840

aaagcgattt tttcctaaga gtttactgga ttctgtcaag gccaaaacac taagaaaact      900

gatccaacaa acatttagac aatttgccaa ccttaataga gaagaaagta ttctgaaatt      960

ctttgagatc ctgtctccag tctacagatt tgataaggaa tgcttcaagt gtgctcttgg     1020

ttcaagctgg attatttcag tggaactggc aatcggccca gaagaaggaa tcagttacct     1080

aacggacaag ggctgcaatc ccacacatct tgctgacttc actcaagtgc aaaccattca     1140

gtattcaaac agtgaagaca aggacagaaa aggaatgcta caactaaaaa tagcaggtgc     1200

acccgagcct ctgacagtga cggcaccatc cctaaccatt gcggagaata tggctgacct     1260

aatagatggg tactgccggc tggtgaatgg aacctcgcag tcatttatca tcagacctca     1320

gaaagaaggt gaacgggctt tgccatcaat accaaagttg gccaacagcg aaaagcaagg     1380

catgcggaca cacgccgtct ctgtgtcaga aacagatgat tatgctgaga ttatagatga     1440

agaagatact tacaccatgc cctcaaccag ggattatgag attcaaagag aaagaataga     1500

acttggacga tgtattggag aaggccaatt tggagatgta catcaaggca tttatatgag     1560

tccagagaat ccagctttgg cggttgcaat taaaacatgt aaaaactgta cttcggacag     1620

cgtgagagag aaatttcttc aagaagcctt aacaatgcgt cagtttgacc atcctcatat     1680

tgtgaagctg attggagtca tcacagagaa tcctgtctgg ataatcatgg agctgtgcac     1740

acttggagag ctgaggtcat ttttgcaagt aaggaaatac agtttggatc tagcatcttt     1800

gatcctgtat gcctatcagc ttagtacagc tcttgcatat ctagagagca aaagatttgt     1860

acacagggac attgctgctc ggaatgttct ggtgtcctca aatgattgtg taaaattagg     1920

agactttgga ttatcccgat atatggaaga tagtacttac tacaaagctt ccaaaggaaa     1980

attgcctatt aaatggatgg ctccagagtc aatcaatttt cgacgtttta cctcagctag     2040

tgacgtatgg atgtttggtg tgtgtatgtg ggagatactg atgcatggtg tgaagccttt     2100

tcaaggagtg aagaacaatg atgtaatcgg tcgaattgaa aatggggaaa gattaccaat     2160

gcctccaaat tgtcctccta ccctctacag ccttatgacg aaatgctggg cctatgaccc     2220

cagcaggcgg cccaggttta ctgaacttaa agctcagctc agcacaatcc tggaggaaga     2280

gaaggctcag caagaagagc gcatgaggat ggagtccaga agacaggcca cagtgtcctg     2340

ggactccgga gggtctgatg aagcaccgcc caagcccagc agaccgggtt atcccagtcc     2400

gaggtccagc gaaggatttt atcccagccc acagcacatg gtacaaacca atcattacca     2460

ggtttctggc taccctggtt cacatggaat cacagccatg gctggcagca tctatccagg     2520

tcaggcatct cttttggacc aaacagattc atggaatcat agacctcagg agatagcaat     2580

gtggcagccc aatgtggagg actctacagt attggacctg cgagggattg ggcaagtgtt     2640

gccaacccat ctgatggaag agcgtctaat ccgacagcaa caggaaatgg aagaagatca     2700

gcgctggctg gaaaaagagg aaagatttct gaaacctgat gtgagactct ctcgaggcag     2760

tattgacagg gaggatggaa gtcttcaggg tccgattgga aaccaacata tatatcagcc     2820

tgtgggtaaa ccagatcctg cagctccacc aaagaaaccg cctcgccctg gagctcccgg     2880

tcatctggga agccttgcca gcctcagcag ccctgctgac agctacaacg agggtgtcaa     2940

gcttcagccc caggaaatca gcccccctcc tactgccaac ctggaccggt cgaatgataa     3000

ggtgtacgag aatgtgacgg gcctggtgaa agctgtcatc gagatgtcca gtaaaatcca     3060

gccagcccca ccagaggagt atgtccctat ggtgaaggaa gtcggcttgg ccctgaggac     3120

attattggcc actgtggatg agaccattcc cctcctacca gccagcaccc accgagagat     3180

tgagatggca cagaagctat tgaactctga cctgggtgag ctcatcaaca agatgaaact     3240

ggcccagcag tatgtcatga ccagcctcca gcaagagtac aaaaagcaaa tgctgactgc     3300

tgctcacgcc ctggctgtgg atgccaaaaa cttactcgat gtcattgacc aagcaagact     3360

gaaaatgctt gggcagacga gaccacactg agcctcccct aggagcacgt cttgctaccc     3420

tcttttgaag atgttctcta gccttccacc agcagcgagg aattaaccct gtgtcctcag     3480

tcgccagcac ttacagctcc aacttttttg aatgaccatc tggttgaaaa atctttctca     3540

tataagttta accacacttt gatttgggtt cattttttgt tttgtttttt tcaatcatga     3600

tattcagaaa aatccaggat ccaaaatgtg gcgtttttct aagaatgaaa attatatgta     3660

agcttttaag catcatgaag aacaatttat gttcacatta agatacgttc taaaggggga     3720

tggccaaggg gtgacatctt aattcctaaa ctaccttagc tgcatagtgg aagaggagag     3780

ctagaagcaa a                                                          3791


<210>  39
<211>  1052
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human focal adhesion kinase (FAK) polypeptide, GenBank Accession 
       No.: AAA58469.1, GI: 439875

<400>  39

Met Ala Ala Ala Tyr Leu Asp Pro Asn Leu Asn His Thr Pro Asn Ser 
1               5                   10                  15      


Ser Thr Lys Thr His Leu Gly Thr Gly Met Glu Arg Ser Pro Gly Ala 
            20                  25                  30          


Met Glu Arg Val Leu Lys Val Phe His Tyr Phe Glu Ser Asn Ser Glu 
        35                  40                  45              


Pro Thr Thr Trp Ala Ser Ile Ile Arg His Gly Asp Ala Thr Asp Val 
    50                  55                  60                  


Arg Gly Ile Ile Gln Lys Ile Val Asp Ser His Lys Val Lys His Val 
65                  70                  75                  80  


Ala Cys Tyr Gly Phe Arg Leu Ser His Leu Arg Ser Glu Glu Val His 
                85                  90                  95      


Trp Leu His Val Asp Met Gly Val Ser Ser Val Arg Glu Lys Tyr Glu 
            100                 105                 110         


Leu Ala His Pro Pro Glu Glu Trp Lys Tyr Glu Leu Arg Ile Arg Tyr 
        115                 120                 125             


Leu Pro Lys Gly Phe Leu Asn Gln Phe Thr Glu Asp Lys Pro Thr Leu 
    130                 135                 140                 


Asn Phe Phe Tyr Gln Gln Val Lys Ser Asp Tyr Met Leu Glu Ile Ala 
145                 150                 155                 160 


Asp Gln Val Asp Gln Glu Ile Ala Leu Lys Leu Gly Cys Leu Glu Ile 
                165                 170                 175     


Arg Arg Ser Tyr Trp Glu Met Arg Gly Asn Ala Leu Glu Lys Lys Ser 
            180                 185                 190         


Asn Tyr Glu Val Leu Glu Lys Asp Val Gly Leu Lys Arg Phe Phe Pro 
        195                 200                 205             


Lys Ser Leu Leu Asp Ser Val Lys Ala Lys Thr Leu Arg Lys Leu Ile 
    210                 215                 220                 


Gln Gln Thr Phe Arg Gln Phe Ala Asn Leu Asn Arg Glu Glu Ser Ile 
225                 230                 235                 240 


Leu Lys Phe Phe Glu Ile Leu Ser Pro Val Tyr Arg Phe Asp Lys Glu 
                245                 250                 255     


Cys Phe Lys Cys Ala Leu Gly Ser Ser Trp Ile Ile Ser Val Glu Leu 
            260                 265                 270         


Ala Ile Gly Pro Glu Glu Gly Ile Ser Tyr Leu Thr Asp Lys Gly Cys 
        275                 280                 285             


Asn Pro Thr His Leu Ala Asp Phe Thr Gln Val Gln Thr Ile Gln Tyr 
    290                 295                 300                 


Ser Asn Ser Glu Asp Lys Asp Arg Lys Gly Met Leu Gln Leu Lys Ile 
305                 310                 315                 320 


Ala Gly Ala Pro Glu Pro Leu Thr Val Thr Ala Pro Ser Leu Thr Ile 
                325                 330                 335     


Ala Glu Asn Met Ala Asp Leu Ile Asp Gly Tyr Cys Arg Leu Val Asn 
            340                 345                 350         


Gly Thr Ser Gln Ser Phe Ile Ile Arg Pro Gln Lys Glu Gly Glu Arg 
        355                 360                 365             


Ala Leu Pro Ser Ile Pro Lys Leu Ala Asn Ser Glu Lys Gln Gly Met 
    370                 375                 380                 


Arg Thr His Ala Val Ser Val Ser Glu Thr Asp Asp Tyr Ala Glu Ile 
385                 390                 395                 400 


Ile Asp Glu Glu Asp Thr Tyr Thr Met Pro Ser Thr Arg Asp Tyr Glu 
                405                 410                 415     


Ile Gln Arg Glu Arg Ile Glu Leu Gly Arg Cys Ile Gly Glu Gly Gln 
            420                 425                 430         


Phe Gly Asp Val His Gln Gly Ile Tyr Met Ser Pro Glu Asn Pro Ala 
        435                 440                 445             


Leu Ala Val Ala Ile Lys Thr Cys Lys Asn Cys Thr Ser Asp Ser Val 
    450                 455                 460                 


Arg Glu Lys Phe Leu Gln Glu Ala Leu Thr Met Arg Gln Phe Asp His 
465                 470                 475                 480 


Pro His Ile Val Lys Leu Ile Gly Val Ile Thr Glu Asn Pro Val Trp 
                485                 490                 495     


Ile Ile Met Glu Leu Cys Thr Leu Gly Glu Leu Arg Ser Phe Leu Gln 
            500                 505                 510         


Val Arg Lys Tyr Ser Leu Asp Leu Ala Ser Leu Ile Leu Tyr Ala Tyr 
        515                 520                 525             


Gln Leu Ser Thr Ala Leu Ala Tyr Leu Glu Ser Lys Arg Phe Val His 
    530                 535                 540                 


Arg Asp Ile Ala Ala Arg Asn Val Leu Val Ser Ser Asn Asp Cys Val 
545                 550                 555                 560 


Lys Leu Gly Asp Phe Gly Leu Ser Arg Tyr Met Glu Asp Ser Thr Tyr 
                565                 570                 575     


Tyr Lys Ala Ser Lys Gly Lys Leu Pro Ile Lys Trp Met Ala Pro Glu 
            580                 585                 590         


Ser Ile Asn Phe Arg Arg Phe Thr Ser Ala Ser Asp Val Trp Met Phe 
        595                 600                 605             


Gly Val Cys Met Trp Glu Ile Leu Met His Gly Val Lys Pro Phe Gln 
    610                 615                 620                 


Gly Val Lys Asn Asn Asp Val Ile Gly Arg Ile Glu Asn Gly Glu Arg 
625                 630                 635                 640 


Leu Pro Met Pro Pro Asn Cys Pro Pro Thr Leu Tyr Ser Leu Met Thr 
                645                 650                 655     


Lys Cys Trp Ala Tyr Asp Pro Ser Arg Arg Pro Arg Phe Thr Glu Leu 
            660                 665                 670         


Lys Ala Gln Leu Ser Thr Ile Leu Glu Glu Glu Lys Ala Gln Gln Glu 
        675                 680                 685             


Glu Arg Met Arg Met Glu Ser Arg Arg Gln Ala Thr Val Ser Trp Asp 
    690                 695                 700                 


Ser Gly Gly Ser Asp Glu Ala Pro Pro Lys Pro Ser Arg Pro Gly Tyr 
705                 710                 715                 720 


Pro Ser Pro Arg Ser Ser Glu Gly Phe Tyr Pro Ser Pro Gln His Met 
                725                 730                 735     


Val Gln Thr Asn His Tyr Gln Val Ser Gly Tyr Pro Gly Ser His Gly 
            740                 745                 750         


Ile Thr Ala Met Ala Gly Ser Ile Tyr Pro Gly Gln Ala Ser Leu Leu 
        755                 760                 765             


Asp Gln Thr Asp Ser Trp Asn His Arg Pro Gln Glu Ile Ala Met Trp 
    770                 775                 780                 


Gln Pro Asn Val Glu Asp Ser Thr Val Leu Asp Leu Arg Gly Ile Gly 
785                 790                 795                 800 


Gln Val Leu Pro Thr His Leu Met Glu Glu Arg Leu Ile Arg Gln Gln 
                805                 810                 815     


Gln Glu Met Glu Glu Asp Gln Arg Trp Leu Glu Lys Glu Glu Arg Phe 
            820                 825                 830         


Leu Lys Pro Asp Val Arg Leu Ser Arg Gly Ser Ile Asp Arg Glu Asp 
        835                 840                 845             


Gly Ser Leu Gln Gly Pro Ile Gly Asn Gln His Ile Tyr Gln Pro Val 
    850                 855                 860                 


Gly Lys Pro Asp Pro Ala Ala Pro Pro Lys Lys Pro Pro Arg Pro Gly 
865                 870                 875                 880 


Ala Pro Gly His Leu Gly Ser Leu Ala Ser Leu Ser Ser Pro Ala Asp 
                885                 890                 895     


Ser Tyr Asn Glu Gly Val Lys Leu Gln Pro Gln Glu Ile Ser Pro Pro 
            900                 905                 910         


Pro Thr Ala Asn Leu Asp Arg Ser Asn Asp Lys Val Tyr Glu Asn Val 
        915                 920                 925             


Thr Gly Leu Val Lys Ala Val Ile Glu Met Ser Ser Lys Ile Gln Pro 
    930                 935                 940                 


Ala Pro Pro Glu Glu Tyr Val Pro Met Val Lys Glu Val Gly Leu Ala 
945                 950                 955                 960 


Leu Arg Thr Leu Leu Ala Thr Val Asp Glu Thr Ile Pro Leu Leu Pro 
                965                 970                 975     


Ala Ser Thr His Arg Glu Ile Glu Met Ala Gln Lys Leu Leu Asn Ser 
            980                 985                 990         


Asp Leu Gly Glu Leu Ile Asn Lys  Met Lys Leu Ala Gln  Gln Tyr Val 
        995                 1000                 1005             


Met Thr  Ser Leu Gln Gln Glu  Tyr Lys Lys Gln Met  Leu Thr Ala 
    1010                 1015                 1020             


Ala His  Ala Leu Ala Val Asp  Ala Lys Asn Leu Leu  Asp Val Ile 
    1025                 1030                 1035             


Asp Gln  Ala Arg Leu Lys Met  Leu Gly Gln Thr Arg  Pro His 
    1040                 1045                 1050         


<210>  40
<211>  1789
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human integrin-linked kinase (ILK) mRNA GenBank Accession No.: 
       U40282.1 GI:3150001

<400>  40
gaattcatct gtcgactgct accacgggag ttccccggag aaggatcctg cagcccgagt       60

cccgaggata aagcttgggg ttcatcctcc ttccctggat cactccacag tcctcaggct      120

tccccaatcc aggggactcg gcgccgggac gctgctatgg acgacatttt cactcagtgc      180

cgggagggca acgcagtcgc cgttcgcctg tggctggaca acacggagaa cgacctcaac      240

cagggggacg atcatggctt ctcccccttg cactgggcct gccgagaggg ccgctctgct      300

gtggttgaga tgttgatcat gcggggggca cggatcaatg taatgaaccg tggggatgac      360

acccccctgc atctggcagc cagtcatgga caccgtgata ttgtacagaa gctattgcag      420

tacaaggcag acatcaatgc agtgaatgaa cacgggaatg tgcccctgca ctatgcctgt      480

ttttggggcc aagatcaagt ggcagaggac ctggtggcaa atggggccct tgtcagcatc      540

tgtaacaagt atggagagat gcctgtggac aaagccaagg cacccctgag agagcttctc      600

cgagagcggg cagagaagat gggccagaat ctcaaccgta ttccatacaa ggacacattc      660

tggaagggga ccacccgcac tcggccccga aatggaaccc tgaacaaaca ctctggcatt      720

gacttcaaac agcttaactt cctgacgaag ctcaacgaga atcactctgg agagctatgg      780

aagggccgct ggcagggcaa tgacattgtc gtgaaggtgc tgaaggttcg agactggagt      840

acaaggaaga gcagggactt caatgaagag tgtccccggc tcaggatttt ctcgcatcca      900

aatgtgctcc cagtgctagg tgcctgccag tctccacctg ctcctcatcc tactctcatc      960

acacactgga tgccgtatgg atccctctac aatgtactac atgaaggcac caatttcgtc     1020

gtggaccaga gccaggctgt gaagtttgct ttggacatgg caaggggcat ggccttccta     1080

cacacactag agcccctcat cccacgacat gcactcaata gccgtagtgt aatgattgat     1140

gaggacatga ctgcccgaat tagcatggct gatgtcaagt tctctttcca atgtcctggt     1200

cgcatgtatg cacctgcctg ggtagccccc gaagctctgc agaagaagcc tgaagacaca     1260

aacagacgct cagcagacat gtggagtttt gcagtgcttc tgtgggaact ggtgacacgg     1320

gaggtaccct ttgctgacct ctccaatatg gagattggaa tgaaggtggc attggaaggc     1380

cttcggccta ccatcccacc aggtatttcc cctcatgtgt gtaagctcat gaagatctgc     1440

atgaatgaag accctgcaaa gcgacccaaa tttgacatga ttgtgcctat ccttgagaag     1500

atgcaggaca agtaggactg gaaggtcctt gcctgaactc cagaggtgtc gggacatggt     1560

tgggggaatg cacctcccca aagcagcagg cctctggttg cctcccccgc ctccagtcat     1620

ggtactaccc cagcctgggg tccatcccct tcccccatcc ctaccactgt gcgcaagagg     1680

ggcgggctca gagctttgtc acttgccaca tggtgtcttc caacatggga gggatcagcc     1740

ccgcctgtca caataaagtt tattatgaaa aaaaaaaaaa aaaaaaaaa                 1789


<210>  41
<211>  3052
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens focal adhesion kinase mRNA, complete cds

<400>  41
ccggtgtgaa ggccatgagt gattactggg ttgttggaaa gaagtctaac tatgaagtat       60

tagaaaaaga tgttggttta aagcgatttt ttcctaagag tttactggat tctgtcaagg      120

ccaaaacact aagaaaactg atccaacaaa catttagaca atttgccaac cttaatagag      180

aagaaagtat tctgaaattc tttgagatcc tgtctccagt ctacagattt gataaggaat      240

gcttcaagtg tgctcttggt tcaagctgga ttatttcagt ggaactggca atcggcccag      300

aagaaggaat cagttaccta acggacaagg gctgcaatcc cacacatctt gctgacttca      360

ctcaagtgca aaccattcag tattcaaaca gtgaagacaa ggacagaaaa ggaatgctac      420

aactaaaaat agcaggtgca cccgagcctc tgacagtgac ggcaccatcc ctaaccattg      480

cggagaatat ggctgaccta atagatgggt actgccggct ggtgaatgga acctcgcagt      540

catttatcat cagacctcag aaagaaggtg aacgggcttt gccatcaata ccaaagttgg      600

ccaacagcga aaagcaaggc atgcggacac acgccgtctc tgtgtcagaa acagatgatt      660

atgctgagat tatagatgaa gaagatactt acaccatgcc ctcaaccagg gattatgaga      720

ttcaaagaga aagaatagaa cttggacgat gtattggaga aggccaattt ggagatgtac      780

atcaaggcat ttatatgagt ccagagaatc cagctttggc ggttgcaatt aaaacatgta      840

aaaactgtac ttcggacagc gtgagagaga aatttcttca agaagcctgc cattacacat      900

ctttgcactg gaattggtgc agatatataa gtgatcctaa tgttgatgcc tgcccagacc      960

ccaggaatgc agagttaaca atgcgtcagt ttgaccatcc tcatattgtg aagctgattg     1020

gagtcatcac agagaatcct gtctggataa tcatggagct gtgcacactt ggagagctga     1080

ggtcattttt gcaagtaagg aaatacagtt tggatctagc atctttgatc ctgtatgcct     1140

atcagcttag tacagctctt gcatatctag agagcaaaag atttgtacac agggacattg     1200

ctgctcggaa tgttctggtg tcctcaaatg attgtgtaaa attaggagac tttggattat     1260

cccgatatat ggaagatagt acttactaca aagcttccaa aggaaaattg cctattaaat     1320

ggatggctcc agagtcaatc aattttcgac gttttacctc agctagtgac gtatggatgt     1380

ttggtgtgtg tatgtgggag atactgatgc atggtgtgaa gccttttcaa ggagtgaaga     1440

acaatgatgt aatcggtcga attgaaaatg gggaaagatt accaatgcct ccaaattgtc     1500

ctcctaccct ctacagcctt atgacgaaat gctgggccta tgaccccagc aggcggccca     1560

ggtttactga acttaaagct cagctcagca caatcctgga ggaagagaag gctcagcaag     1620

aagagcgcat gaggatggag tccagaagac aggccacagt gtcctgggac tccggagggt     1680

ctgatgaagc accgcccaag cccagcagac cgggttatcc cagtccgagg tccagcgaag     1740

gattttatcc cagcccacag cacatggtac aaaccaatca ttaccaggtt tctggctacc     1800

ctggttcaca tggaatcaca gccatggctg gcagcatcta tccaggtcag gcatctcttt     1860

tggaccaaac agattcatgg aatcatagat ctcaggagat agcaatgtgg cagcccaatg     1920

tggaggactc tacagtattg gacctgcgag ggattgggca agtgttgcca acccatctga     1980

tggaagagcg tctaatccga cagcaacagg aaatggaaga agatcagcgc tggctggaaa     2040

aagaggaaag atttctgatt ggaaaccaac atatatatca gcctgtgggt aaaccagatc     2100

ctgcagctcc accaaagaaa ccgcctcgcc ctggagctcc cggtcatctg ggaagccttg     2160

ccagcctcag cagccctgct gacagctaca acgagggtgt caagcttcag ccccaggaaa     2220

tcagcccccc tcctactgcc aacctggacc ggtcgaatga taaggtgtac gagaatgtga     2280

cgggcctggt gaaagctgtc atcgagatgt ccagtaaaat ccagccagcc ccaccagagg     2340

agtatgtccc tatggtgaag gaagtcggct tggccctgag gacattattg gccactgtgg     2400

atgagaccat tcccctccta ccagccagca cccaccgaga gattgagatg gcacagaagc     2460

tattgaactc tgacctgggt gagctcatca acaagatgaa actggcccag cagtatgtca     2520

tgaccagcct ccagcaagag tacaaaaagc aaatgctgac tgccgctcac gccctggctg     2580

tggatgccaa aaacttactc gatgtcattg accaagcaag actgaaaatg cttgggcaga     2640

cgagaccaca ctgagcctcc cctaggagca cgtcttgcta ccctcttttg aagatgttct     2700

ctagccttcc accagcagcg aggaattaac cctgtgtcct cagtcgccag cactcacagc     2760

tccaactttt ttgaatgacc atctggttga aaaatctttc tcatataagt ttaaccacac     2820

tttgatttgg gttcattttt tgttttgttt ttttcaatca tgatattcag aaaaatccag     2880

gatccaaaat gtggcgtttt tctaagaatg aaaattatat gtaagctttt aagcatcatg     2940

aagaacaatt tatgttcaca ttaagatacg ttctaaaggg ggatggccaa ggggtgacat     3000

cttaattcct aaactacctt agctgcatag tggaagagga gagccggaat tc             3052


<210>  42
<211>  879
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens focal adhesion kinase polypeptide

<400>  42

Met Ser Asp Tyr Trp Val Val Gly Lys Lys Ser Asn Tyr Glu Val Leu 
1               5                   10                  15      


Glu Lys Asp Val Gly Leu Lys Arg Phe Phe Pro Lys Ser Leu Leu Asp 
            20                  25                  30          


Ser Val Lys Ala Lys Thr Leu Arg Lys Leu Ile Gln Gln Thr Phe Arg 
        35                  40                  45              


Gln Phe Ala Asn Leu Asn Arg Glu Glu Ser Ile Leu Lys Phe Phe Glu 
    50                  55                  60                  


Ile Leu Ser Pro Val Tyr Arg Phe Asp Lys Glu Cys Phe Lys Cys Ala 
65                  70                  75                  80  


Leu Gly Ser Ser Trp Ile Ile Ser Val Glu Leu Ala Ile Gly Pro Glu 
                85                  90                  95      


Glu Gly Ile Ser Tyr Leu Thr Asp Lys Gly Cys Asn Pro Thr His Leu 
            100                 105                 110         


Ala Asp Phe Thr Gln Val Gln Thr Ile Gln Tyr Ser Asn Ser Glu Asp 
        115                 120                 125             


Lys Asp Arg Lys Gly Met Leu Gln Leu Lys Ile Ala Gly Ala Pro Glu 
    130                 135                 140                 


Pro Leu Thr Val Thr Ala Pro Ser Leu Thr Ile Ala Glu Asn Met Ala 
145                 150                 155                 160 


Asp Leu Ile Asp Gly Tyr Cys Arg Leu Val Asn Gly Thr Ser Gln Ser 
                165                 170                 175     


Phe Ile Ile Arg Pro Gln Lys Glu Gly Glu Arg Ala Leu Pro Ser Ile 
            180                 185                 190         


Pro Lys Leu Ala Asn Ser Glu Lys Gln Gly Met Arg Thr His Ala Val 
        195                 200                 205             


Ser Val Ser Glu Thr Asp Asp Tyr Ala Glu Ile Ile Asp Glu Glu Asp 
    210                 215                 220                 


Thr Tyr Thr Met Pro Ser Thr Arg Asp Tyr Glu Ile Gln Arg Glu Arg 
225                 230                 235                 240 


Ile Glu Leu Gly Arg Cys Ile Gly Glu Gly Gln Phe Gly Asp Val His 
                245                 250                 255     


Gln Gly Ile Tyr Met Ser Pro Glu Asn Pro Ala Leu Ala Val Ala Ile 
            260                 265                 270         


Lys Thr Cys Lys Asn Cys Thr Ser Asp Ser Val Arg Glu Lys Phe Leu 
        275                 280                 285             


Gln Glu Ala Cys His Tyr Thr Ser Leu His Trp Asn Trp Cys Arg Tyr 
    290                 295                 300                 


Ile Ser Asp Pro Asn Val Asp Ala Cys Pro Asp Pro Arg Asn Ala Glu 
305                 310                 315                 320 


Leu Thr Met Arg Gln Phe Asp His Pro His Ile Val Lys Leu Ile Gly 
                325                 330                 335     


Val Ile Thr Glu Asn Pro Val Trp Ile Ile Met Glu Leu Cys Thr Leu 
            340                 345                 350         


Gly Glu Leu Arg Ser Phe Leu Gln Val Arg Lys Tyr Ser Leu Asp Leu 
        355                 360                 365             


Ala Ser Leu Ile Leu Tyr Ala Tyr Gln Leu Ser Thr Ala Leu Ala Tyr 
    370                 375                 380                 


Leu Glu Ser Lys Arg Phe Val His Arg Asp Ile Ala Ala Arg Asn Val 
385                 390                 395                 400 


Leu Val Ser Ser Asn Asp Cys Val Lys Leu Gly Asp Phe Gly Leu Ser 
                405                 410                 415     


Arg Tyr Met Glu Asp Ser Thr Tyr Tyr Lys Ala Ser Lys Gly Lys Leu 
            420                 425                 430         


Pro Ile Lys Trp Met Ala Pro Glu Ser Ile Asn Phe Arg Arg Phe Thr 
        435                 440                 445             


Ser Ala Ser Asp Val Trp Met Phe Gly Val Cys Met Trp Glu Ile Leu 
    450                 455                 460                 


Met His Gly Val Lys Pro Phe Gln Gly Val Lys Asn Asn Asp Val Ile 
465                 470                 475                 480 


Gly Arg Ile Glu Asn Gly Glu Arg Leu Pro Met Pro Pro Asn Cys Pro 
                485                 490                 495     


Pro Thr Leu Tyr Ser Leu Met Thr Lys Cys Trp Ala Tyr Asp Pro Ser 
            500                 505                 510         


Arg Arg Pro Arg Phe Thr Glu Leu Lys Ala Gln Leu Ser Thr Ile Leu 
        515                 520                 525             


Glu Glu Glu Lys Ala Gln Gln Glu Glu Arg Met Arg Met Glu Ser Arg 
    530                 535                 540                 


Arg Gln Ala Thr Val Ser Trp Asp Ser Gly Gly Ser Asp Glu Ala Pro 
545                 550                 555                 560 


Pro Lys Pro Ser Arg Pro Gly Tyr Pro Ser Pro Arg Ser Ser Glu Gly 
                565                 570                 575     


Phe Tyr Pro Ser Pro Gln His Met Val Gln Thr Asn His Tyr Gln Val 
            580                 585                 590         


Ser Gly Tyr Pro Gly Ser His Gly Ile Thr Ala Met Ala Gly Ser Ile 
        595                 600                 605             


Tyr Pro Gly Gln Ala Ser Leu Leu Asp Gln Thr Asp Ser Trp Asn His 
    610                 615                 620                 


Arg Ser Gln Glu Ile Ala Met Trp Gln Pro Asn Val Glu Asp Ser Thr 
625                 630                 635                 640 


Val Leu Asp Leu Arg Gly Ile Gly Gln Val Leu Pro Thr His Leu Met 
                645                 650                 655     


Glu Glu Arg Leu Ile Arg Gln Gln Gln Glu Met Glu Glu Asp Gln Arg 
            660                 665                 670         


Trp Leu Glu Lys Glu Glu Arg Phe Leu Ile Gly Asn Gln His Ile Tyr 
        675                 680                 685             


Gln Pro Val Gly Lys Pro Asp Pro Ala Ala Pro Pro Lys Lys Pro Pro 
    690                 695                 700                 


Arg Pro Gly Ala Pro Gly His Leu Gly Ser Leu Ala Ser Leu Ser Ser 
705                 710                 715                 720 


Pro Ala Asp Ser Tyr Asn Glu Gly Val Lys Leu Gln Pro Gln Glu Ile 
                725                 730                 735     


Ser Pro Pro Pro Thr Ala Asn Leu Asp Arg Ser Asn Asp Lys Val Tyr 
            740                 745                 750         


Glu Asn Val Thr Gly Leu Val Lys Ala Val Ile Glu Met Ser Ser Lys 
        755                 760                 765             


Ile Gln Pro Ala Pro Pro Glu Glu Tyr Val Pro Met Val Lys Glu Val 
    770                 775                 780                 


Gly Leu Ala Leu Arg Thr Leu Leu Ala Thr Val Asp Glu Thr Ile Pro 
785                 790                 795                 800 


Leu Leu Pro Ala Ser Thr His Arg Glu Ile Glu Met Ala Gln Lys Leu 
                805                 810                 815     


Leu Asn Ser Asp Leu Gly Glu Leu Ile Asn Lys Met Lys Leu Ala Gln 
            820                 825                 830         


Gln Tyr Val Met Thr Ser Leu Gln Gln Glu Tyr Lys Lys Gln Met Leu 
        835                 840                 845             


Thr Ala Ala His Ala Leu Ala Val Asp Ala Lys Asn Leu Leu Asp Val 
    850                 855                 860                 


Ile Asp Gln Ala Arg Leu Lys Met Leu Gly Gln Thr Arg Pro His 
865                 870                 875                 


<210>  43
<211>  452
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human integrin-linked kinase (ILK) polypeptide GenBank Accession 
       No.: AAC16892.1 GI: 3150002

<400>  43

Met Asp Asp Ile Phe Thr Gln Cys Arg Glu Gly Asn Ala Val Ala Val 
1               5                   10                  15      


Arg Leu Trp Leu Asp Asn Thr Glu Asn Asp Leu Asn Gln Gly Asp Asp 
            20                  25                  30          


His Gly Phe Ser Pro Leu His Trp Ala Cys Arg Glu Gly Arg Ser Ala 
        35                  40                  45              


Val Val Glu Met Leu Ile Met Arg Gly Ala Arg Ile Asn Val Met Asn 
    50                  55                  60                  


Arg Gly Asp Asp Thr Pro Leu His Leu Ala Ala Ser His Gly His Arg 
65                  70                  75                  80  


Asp Ile Val Gln Lys Leu Leu Gln Tyr Lys Ala Asp Ile Asn Ala Val 
                85                  90                  95      


Asn Glu His Gly Asn Val Pro Leu His Tyr Ala Cys Phe Trp Gly Gln 
            100                 105                 110         


Asp Gln Val Ala Glu Asp Leu Val Ala Asn Gly Ala Leu Val Ser Ile 
        115                 120                 125             


Cys Asn Lys Tyr Gly Glu Met Pro Val Asp Lys Ala Lys Ala Pro Leu 
    130                 135                 140                 


Arg Glu Leu Leu Arg Glu Arg Ala Glu Lys Met Gly Gln Asn Leu Asn 
145                 150                 155                 160 


Arg Ile Pro Tyr Lys Asp Thr Phe Trp Lys Gly Thr Thr Arg Thr Arg 
                165                 170                 175     


Pro Arg Asn Gly Thr Leu Asn Lys His Ser Gly Ile Asp Phe Lys Gln 
            180                 185                 190         


Leu Asn Phe Leu Thr Lys Leu Asn Glu Asn His Ser Gly Glu Leu Trp 
        195                 200                 205             


Lys Gly Arg Trp Gln Gly Asn Asp Ile Val Val Lys Val Leu Lys Val 
    210                 215                 220                 


Arg Asp Trp Ser Thr Arg Lys Ser Arg Asp Phe Asn Glu Glu Cys Pro 
225                 230                 235                 240 


Arg Leu Arg Ile Phe Ser His Pro Asn Val Leu Pro Val Leu Gly Ala 
                245                 250                 255     


Cys Gln Ser Pro Pro Ala Pro His Pro Thr Leu Ile Thr His Trp Met 
            260                 265                 270         


Pro Tyr Gly Ser Leu Tyr Asn Val Leu His Glu Gly Thr Asn Phe Val 
        275                 280                 285             


Val Asp Gln Ser Gln Ala Val Lys Phe Ala Leu Asp Met Ala Arg Gly 
    290                 295                 300                 


Met Ala Phe Leu His Thr Leu Glu Pro Leu Ile Pro Arg His Ala Leu 
305                 310                 315                 320 


Asn Ser Arg Ser Val Met Ile Asp Glu Asp Met Thr Ala Arg Ile Ser 
                325                 330                 335     


Met Ala Asp Val Lys Phe Ser Phe Gln Cys Pro Gly Arg Met Tyr Ala 
            340                 345                 350         


Pro Ala Trp Val Ala Pro Glu Ala Leu Gln Lys Lys Pro Glu Asp Thr 
        355                 360                 365             


Asn Arg Arg Ser Ala Asp Met Trp Ser Phe Ala Val Leu Leu Trp Glu 
    370                 375                 380                 


Leu Val Thr Arg Glu Val Pro Phe Ala Asp Leu Ser Asn Met Glu Ile 
385                 390                 395                 400 


Gly Met Lys Val Ala Leu Glu Gly Leu Arg Pro Thr Ile Pro Pro Gly 
                405                 410                 415     


Ile Ser Pro His Val Cys Lys Leu Met Lys Ile Cys Met Asn Glu Asp 
            420                 425                 430         


Pro Ala Lys Arg Pro Lys Phe Asp Met Ile Val Pro Ile Leu Glu Lys 
        435                 440                 445             


Met Gln Asp Lys 
    450         


<210>  44
<211>  1843
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human integrin-linked kinase (ILK) mRNA transcript variant 1 
       GenBank Accession No.: NM_004517.3 GI:510785736

<400>  44
agggacggag gcccgcgctg cccaagagcg ccacgggcgg ggcggggccg gcggcgggct       60

gcgggcgcgg ccggacggga gttccccgga gaaggatcct gcagcccgag tcccgaggat      120

aaagcttggg gttcatcctc cttccctgga tcactccaca gtcctcaggc ttccccaatc      180

caggggactc ggcgccggga cgctgctatg gacgacattt tcactcagtg ccgggagggc      240

aacgcagtcg ccgttcgcct gtggctggac aacacggaga acgacctcaa ccagggggac      300

gatcatggct tctccccctt gcactgggcc tgccgagagg gccgctctgc tgtggttgag      360

atgttgatca tgcggggggc acggatcaat gtaatgaacc gtggggatga cacccccctg      420

catctggcag ccagtcatgg acaccgtgat attgtacaga agctattgca gtacaaggca      480

gacatcaatg cagtgaatga acacgggaat gtgcccctgc actatgcctg tttttggggc      540

caagatcaag tggcagagga cctggtggca aatggggccc ttgtcagcat ctgtaacaag      600

tatggagaga tgcctgtgga caaagccaag gcacccctga gagagcttct ccgagagcgg      660

gcagagaaga tgggccagaa tctcaaccgt attccataca aggacacatt ctggaagggg      720

accacccgca ctcggccccg aaatggaacc ctgaacaaac actctggcat tgacttcaaa      780

cagcttaact tcctgacgaa gctcaacgag aatcactctg gagagctatg gaagggccgc      840

tggcagggca atgacattgt cgtgaaggtg ctgaaggttc gagactggag tacaaggaag      900

agcagggact tcaatgaaga gtgtccccgg ctcaggattt tctcgcatcc aaatgtgctc      960

ccagtgctag gtgcctgcca gtctccacct gctcctcatc ctactctcat cacacactgg     1020

atgccgtatg gatccctcta caatgtacta catgaaggca ccaatttcgt cgtggaccag     1080

agccaggctg tgaagtttgc tttggacatg gcaaggggca tggccttcct acacacacta     1140

gagcccctca tcccacgaca tgcactcaat agccgtagtg taatgattga tgaggacatg     1200

actgcccgaa ttagcatggc tgatgtcaag ttctctttcc aatgtcctgg tcgcatgtat     1260

gcacctgcct gggtagcccc cgaagctctg cagaagaagc ctgaagacac aaacagacgc     1320

tcagcagaca tgtggagttt tgcagtgctt ctgtgggaac tggtgacacg ggaggtaccc     1380

tttgctgacc tctccaatat ggagattgga atgaaggtgg cattggaagg ccttcggcct     1440

accatcccac caggtatttc ccctcatgtg tgtaagctca tgaagatctg catgaatgaa     1500

gaccctgcaa agcgacccaa atttgacatg attgtgccta tccttgagaa gatgcaggac     1560

aagtaggact ggaaggtcct tgcctgaact ccagaggtgt cgggacatgg ttgggggaat     1620

gcacctcccc aaagcagcag gcctctggtt gcctcccccg cctccagtca tggtactacc     1680

ccagccatgg ggtccatccc cttcccccat ccctaccact gtggccccaa gaggggcggg     1740

ctcagagctt tgtcacttgc cacatggtgt ctcccaacat gggagggatc agccccgcct     1800

gtcacaataa agtttattat gaaaacagga aaaaaaaaaa aaa                       1843


<210>  45
<211>  452
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human integrin-linked kinase (ILK) polypeptide encoded by 
       transcript variant 1 GenBank Accession No.: NP_004508.1 
       GI:4758606

<400>  45

Met Asp Asp Ile Phe Thr Gln Cys Arg Glu Gly Asn Ala Val Ala Val 
1               5                   10                  15      


Arg Leu Trp Leu Asp Asn Thr Glu Asn Asp Leu Asn Gln Gly Asp Asp 
            20                  25                  30          


His Gly Phe Ser Pro Leu His Trp Ala Cys Arg Glu Gly Arg Ser Ala 
        35                  40                  45              


Val Val Glu Met Leu Ile Met Arg Gly Ala Arg Ile Asn Val Met Asn 
    50                  55                  60                  


Arg Gly Asp Asp Thr Pro Leu His Leu Ala Ala Ser His Gly His Arg 
65                  70                  75                  80  


Asp Ile Val Gln Lys Leu Leu Gln Tyr Lys Ala Asp Ile Asn Ala Val 
                85                  90                  95      


Asn Glu His Gly Asn Val Pro Leu His Tyr Ala Cys Phe Trp Gly Gln 
            100                 105                 110         


Asp Gln Val Ala Glu Asp Leu Val Ala Asn Gly Ala Leu Val Ser Ile 
        115                 120                 125             


Cys Asn Lys Tyr Gly Glu Met Pro Val Asp Lys Ala Lys Ala Pro Leu 
    130                 135                 140                 


Arg Glu Leu Leu Arg Glu Arg Ala Glu Lys Met Gly Gln Asn Leu Asn 
145                 150                 155                 160 


Arg Ile Pro Tyr Lys Asp Thr Phe Trp Lys Gly Thr Thr Arg Thr Arg 
                165                 170                 175     


Pro Arg Asn Gly Thr Leu Asn Lys His Ser Gly Ile Asp Phe Lys Gln 
            180                 185                 190         


Leu Asn Phe Leu Thr Lys Leu Asn Glu Asn His Ser Gly Glu Leu Trp 
        195                 200                 205             


Lys Gly Arg Trp Gln Gly Asn Asp Ile Val Val Lys Val Leu Lys Val 
    210                 215                 220                 


Arg Asp Trp Ser Thr Arg Lys Ser Arg Asp Phe Asn Glu Glu Cys Pro 
225                 230                 235                 240 


Arg Leu Arg Ile Phe Ser His Pro Asn Val Leu Pro Val Leu Gly Ala 
                245                 250                 255     


Cys Gln Ser Pro Pro Ala Pro His Pro Thr Leu Ile Thr His Trp Met 
            260                 265                 270         


Pro Tyr Gly Ser Leu Tyr Asn Val Leu His Glu Gly Thr Asn Phe Val 
        275                 280                 285             


Val Asp Gln Ser Gln Ala Val Lys Phe Ala Leu Asp Met Ala Arg Gly 
    290                 295                 300                 


Met Ala Phe Leu His Thr Leu Glu Pro Leu Ile Pro Arg His Ala Leu 
305                 310                 315                 320 


Asn Ser Arg Ser Val Met Ile Asp Glu Asp Met Thr Ala Arg Ile Ser 
                325                 330                 335     


Met Ala Asp Val Lys Phe Ser Phe Gln Cys Pro Gly Arg Met Tyr Ala 
            340                 345                 350         


Pro Ala Trp Val Ala Pro Glu Ala Leu Gln Lys Lys Pro Glu Asp Thr 
        355                 360                 365             


Asn Arg Arg Ser Ala Asp Met Trp Ser Phe Ala Val Leu Leu Trp Glu 
    370                 375                 380                 


Leu Val Thr Arg Glu Val Pro Phe Ala Asp Leu Ser Asn Met Glu Ile 
385                 390                 395                 400 


Gly Met Lys Val Ala Leu Glu Gly Leu Arg Pro Thr Ile Pro Pro Gly 
                405                 410                 415     


Ile Ser Pro His Val Cys Lys Leu Met Lys Ile Cys Met Asn Glu Asp 
            420                 425                 430         


Pro Ala Lys Arg Pro Lys Phe Asp Met Ile Val Pro Ile Leu Glu Lys 
        435                 440                 445             


Met Gln Asp Lys 
    450         


<210>  46
<211>  1797
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human integrin-linked kinase (ILK) mRNA transcript variant 2 
       GenBank Accession No.: NM_001014794.2 GI:510785737

<400>  46
agggacggag gcccgcgctg cccaagagcg ccacgggcgg ggcggggccg gcggcgggct       60

gcgggcgcgg ccggacggga gttccccgga gaaggatcct gcagcccgag tcccgtcctc      120

aggcttcccc aatccagggg actcggcgcc gggacgctgc tatggacgac attttcactc      180

agtgccggga gggcaacgca gtcgccgttc gcctgtggct ggacaacacg gagaacgacc      240

tcaaccaggg ggacgatcat ggcttctccc ccttgcactg ggcctgccga gagggccgct      300

ctgctgtggt tgagatgttg atcatgcggg gggcacggat caatgtaatg aaccgtgggg      360

atgacacccc cctgcatctg gcagccagtc atggacaccg tgatattgta cagaagctat      420

tgcagtacaa ggcagacatc aatgcagtga atgaacacgg gaatgtgccc ctgcactatg      480

cctgtttttg gggccaagat caagtggcag aggacctggt ggcaaatggg gcccttgtca      540

gcatctgtaa caagtatgga gagatgcctg tggacaaagc caaggcaccc ctgagagagc      600

ttctccgaga gcgggcagag aagatgggcc agaatctcaa ccgtattcca tacaaggaca      660

cattctggaa ggggaccacc cgcactcggc cccgaaatgg aaccctgaac aaacactctg      720

gcattgactt caaacagctt aacttcctga cgaagctcaa cgagaatcac tctggagagc      780

tatggaaggg ccgctggcag ggcaatgaca ttgtcgtgaa ggtgctgaag gttcgagact      840

ggagtacaag gaagagcagg gacttcaatg aagagtgtcc ccggctcagg attttctcgc      900

atccaaatgt gctcccagtg ctaggtgcct gccagtctcc acctgctcct catcctactc      960

tcatcacaca ctggatgccg tatggatccc tctacaatgt actacatgaa ggcaccaatt     1020

tcgtcgtgga ccagagccag gctgtgaagt ttgctttgga catggcaagg ggcatggcct     1080

tcctacacac actagagccc ctcatcccac gacatgcact caatagccgt agtgtaatga     1140

ttgatgagga catgactgcc cgaattagca tggctgatgt caagttctct ttccaatgtc     1200

ctggtcgcat gtatgcacct gcctgggtag cccccgaagc tctgcagaag aagcctgaag     1260

acacaaacag acgctcagca gacatgtgga gttttgcagt gcttctgtgg gaactggtga     1320

cacgggaggt accctttgct gacctctcca atatggagat tggaatgaag gtggcattgg     1380

aaggccttcg gcctaccatc ccaccaggta tttcccctca tgtgtgtaag ctcatgaaga     1440

tctgcatgaa tgaagaccct gcaaagcgac ccaaatttga catgattgtg cctatccttg     1500

agaagatgca ggacaagtag gactggaagg tccttgcctg aactccagag gtgtcgggac     1560

atggttgggg gaatgcacct ccccaaagca gcaggcctct ggttgcctcc cccgcctcca     1620

gtcatggtac taccccagcc atggggtcca tccccttccc ccatccctac cactgtggcc     1680

ccaagagggg cgggctcaga gctttgtcac ttgccacatg gtgtctccca acatgggagg     1740

gatcagcccc gcctgtcaca ataaagttta ttatgaaaac aggaaaaaaa aaaaaaa        1797


<210>  47
<211>  452
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human integrin-linked kinase (ILK) polypeptide encoded by 
       transcript variant 2 GenBank Accession No.: NP_001014794.1
       GI:62420873

<400>  47

Met Asp Asp Ile Phe Thr Gln Cys Arg Glu Gly Asn Ala Val Ala Val 
1               5                   10                  15      


Arg Leu Trp Leu Asp Asn Thr Glu Asn Asp Leu Asn Gln Gly Asp Asp 
            20                  25                  30          


His Gly Phe Ser Pro Leu His Trp Ala Cys Arg Glu Gly Arg Ser Ala 
        35                  40                  45              


Val Val Glu Met Leu Ile Met Arg Gly Ala Arg Ile Asn Val Met Asn 
    50                  55                  60                  


Arg Gly Asp Asp Thr Pro Leu His Leu Ala Ala Ser His Gly His Arg 
65                  70                  75                  80  


Asp Ile Val Gln Lys Leu Leu Gln Tyr Lys Ala Asp Ile Asn Ala Val 
                85                  90                  95      


Asn Glu His Gly Asn Val Pro Leu His Tyr Ala Cys Phe Trp Gly Gln 
            100                 105                 110         


Asp Gln Val Ala Glu Asp Leu Val Ala Asn Gly Ala Leu Val Ser Ile 
        115                 120                 125             


Cys Asn Lys Tyr Gly Glu Met Pro Val Asp Lys Ala Lys Ala Pro Leu 
    130                 135                 140                 


Arg Glu Leu Leu Arg Glu Arg Ala Glu Lys Met Gly Gln Asn Leu Asn 
145                 150                 155                 160 


Arg Ile Pro Tyr Lys Asp Thr Phe Trp Lys Gly Thr Thr Arg Thr Arg 
                165                 170                 175     


Pro Arg Asn Gly Thr Leu Asn Lys His Ser Gly Ile Asp Phe Lys Gln 
            180                 185                 190         


Leu Asn Phe Leu Thr Lys Leu Asn Glu Asn His Ser Gly Glu Leu Trp 
        195                 200                 205             


Lys Gly Arg Trp Gln Gly Asn Asp Ile Val Val Lys Val Leu Lys Val 
    210                 215                 220                 


Arg Asp Trp Ser Thr Arg Lys Ser Arg Asp Phe Asn Glu Glu Cys Pro 
225                 230                 235                 240 


Arg Leu Arg Ile Phe Ser His Pro Asn Val Leu Pro Val Leu Gly Ala 
                245                 250                 255     


Cys Gln Ser Pro Pro Ala Pro His Pro Thr Leu Ile Thr His Trp Met 
            260                 265                 270         


Pro Tyr Gly Ser Leu Tyr Asn Val Leu His Glu Gly Thr Asn Phe Val 
        275                 280                 285             


Val Asp Gln Ser Gln Ala Val Lys Phe Ala Leu Asp Met Ala Arg Gly 
    290                 295                 300                 


Met Ala Phe Leu His Thr Leu Glu Pro Leu Ile Pro Arg His Ala Leu 
305                 310                 315                 320 


Asn Ser Arg Ser Val Met Ile Asp Glu Asp Met Thr Ala Arg Ile Ser 
                325                 330                 335     


Met Ala Asp Val Lys Phe Ser Phe Gln Cys Pro Gly Arg Met Tyr Ala 
            340                 345                 350         


Pro Ala Trp Val Ala Pro Glu Ala Leu Gln Lys Lys Pro Glu Asp Thr 
        355                 360                 365             


Asn Arg Arg Ser Ala Asp Met Trp Ser Phe Ala Val Leu Leu Trp Glu 
    370                 375                 380                 


Leu Val Thr Arg Glu Val Pro Phe Ala Asp Leu Ser Asn Met Glu Ile 
385                 390                 395                 400 


Gly Met Lys Val Ala Leu Glu Gly Leu Arg Pro Thr Ile Pro Pro Gly 
                405                 410                 415     


Ile Ser Pro His Val Cys Lys Leu Met Lys Ile Cys Met Asn Glu Asp 
            420                 425                 430         


Pro Ala Lys Arg Pro Lys Phe Asp Met Ile Val Pro Ile Leu Glu Lys 
        435                 440                 445             


Met Gln Asp Lys 
    450         


<210>  48
<211>  2098
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human integrin-linked kinase (ILK) mRNA transcript variant 3 
       GenBank Accession No.: NM_001014795.2  GI:510785738

<400>  48
agcccgagtc ccggtgagtg cgagaggacc cgcgccccct gtcacccctc acttctcctg       60

agtcccaagt aaggaacgga gccaaccctg gggaggaccc ccggtcccct ctcccagagt      120

atacggagcc tgaacctccc cacttccccc aactctgttc gcggataggg tctagttgcc      180

tgctctcgga catccgttca gcagacacta cctcttcgtc accccctgcc caccctgacc      240

cgcctttacc tcgcgtctag aggacacagc cagggatcat cccgcagccc cgaactcctt      300

cacagacccc cactagccgg ggacgcagct caggccccct acccccaaca caaacacttc      360

tctcctgtag aggataaagc ttggggttca tcctccttcc ctggatcact ccacagtcct      420

caggcttccc caatccaggg gactcggcgc cgggacgctg ctatggacga cattttcact      480

cagtgccggg agggcaacgc agtcgccgtt cgcctgtggc tggacaacac ggagaacgac      540

ctcaaccagg gggacgatca tggcttctcc cccttgcact gggcctgccg agagggccgc      600

tctgctgtgg ttgagatgtt gatcatgcgg ggggcacgga tcaatgtaat gaaccgtggg      660

gatgacaccc ccctgcatct ggcagccagt catggacacc gtgatattgt acagaagcta      720

ttgcagtaca aggcagacat caatgcagtg aatgaacacg ggaatgtgcc cctgcactat      780

gcctgttttt ggggccaaga tcaagtggca gaggacctgg tggcaaatgg ggcccttgtc      840

agcatctgta acaagtatgg agagatgcct gtggacaaag ccaaggcacc cctgagagag      900

cttctccgag agcgggcaga gaagatgggc cagaatctca accgtattcc atacaaggac      960

acattctgga aggggaccac ccgcactcgg ccccgaaatg gaaccctgaa caaacactct     1020

ggcattgact tcaaacagct taacttcctg acgaagctca acgagaatca ctctggagag     1080

ctatggaagg gccgctggca gggcaatgac attgtcgtga aggtgctgaa ggttcgagac     1140

tggagtacaa ggaagagcag ggacttcaat gaagagtgtc cccggctcag gattttctcg     1200

catccaaatg tgctcccagt gctaggtgcc tgccagtctc cacctgctcc tcatcctact     1260

ctcatcacac actggatgcc gtatggatcc ctctacaatg tactacatga aggcaccaat     1320

ttcgtcgtgg accagagcca ggctgtgaag tttgctttgg acatggcaag gggcatggcc     1380

ttcctacaca cactagagcc cctcatccca cgacatgcac tcaatagccg tagtgtaatg     1440

attgatgagg acatgactgc ccgaattagc atggctgatg tcaagttctc tttccaatgt     1500

cctggtcgca tgtatgcacc tgcctgggta gcccccgaag ctctgcagaa gaagcctgaa     1560

gacacaaaca gacgctcagc agacatgtgg agttttgcag tgcttctgtg ggaactggtg     1620

acacgggagg taccctttgc tgacctctcc aatatggaga ttggaatgaa ggtggcattg     1680

gaaggccttc ggcctaccat cccaccaggt atttcccctc atgtgtgtaa gctcatgaag     1740

atctgcatga atgaagaccc tgcaaagcga cccaaatttg acatgattgt gcctatcctt     1800

gagaagatgc aggacaagta ggactggaag gtccttgcct gaactccaga ggtgtcggga     1860

catggttggg ggaatgcacc tccccaaagc agcaggcctc tggttgcctc ccccgcctcc     1920

agtcatggta ctaccccagc catggggtcc atccccttcc cccatcccta ccactgtggc     1980

cccaagaggg gcgggctcag agctttgtca cttgccacat ggtgtctccc aacatgggag     2040

ggatcagccc cgcctgtcac aataaagttt attatgaaaa caggaaaaaa aaaaaaaa       2098


<210>  49
<211>  452
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human integrin-linked kinase (ILK) polypeptide encoded by 
       transcript variant 3 GenBank Accession No.: NP_001014795.1  
       GI:62420875

<400>  49

Met Asp Asp Ile Phe Thr Gln Cys Arg Glu Gly Asn Ala Val Ala Val 
1               5                   10                  15      


Arg Leu Trp Leu Asp Asn Thr Glu Asn Asp Leu Asn Gln Gly Asp Asp 
            20                  25                  30          


His Gly Phe Ser Pro Leu His Trp Ala Cys Arg Glu Gly Arg Ser Ala 
        35                  40                  45              


Val Val Glu Met Leu Ile Met Arg Gly Ala Arg Ile Asn Val Met Asn 
    50                  55                  60                  


Arg Gly Asp Asp Thr Pro Leu His Leu Ala Ala Ser His Gly His Arg 
65                  70                  75                  80  


Asp Ile Val Gln Lys Leu Leu Gln Tyr Lys Ala Asp Ile Asn Ala Val 
                85                  90                  95      


Asn Glu His Gly Asn Val Pro Leu His Tyr Ala Cys Phe Trp Gly Gln 
            100                 105                 110         


Asp Gln Val Ala Glu Asp Leu Val Ala Asn Gly Ala Leu Val Ser Ile 
        115                 120                 125             


Cys Asn Lys Tyr Gly Glu Met Pro Val Asp Lys Ala Lys Ala Pro Leu 
    130                 135                 140                 


Arg Glu Leu Leu Arg Glu Arg Ala Glu Lys Met Gly Gln Asn Leu Asn 
145                 150                 155                 160 


Arg Ile Pro Tyr Lys Asp Thr Phe Trp Lys Gly Thr Thr Arg Thr Arg 
                165                 170                 175     


Pro Arg Asn Gly Thr Leu Asn Lys His Ser Gly Ile Asp Phe Lys Gln 
            180                 185                 190         


Leu Asn Phe Leu Thr Lys Leu Asn Glu Asn His Ser Gly Glu Leu Trp 
        195                 200                 205             


Lys Gly Arg Trp Gln Gly Asn Asp Ile Val Val Lys Val Leu Lys Val 
    210                 215                 220                 


Arg Asp Trp Ser Thr Arg Lys Ser Arg Asp Phe Asn Glu Glu Cys Pro 
225                 230                 235                 240 


Arg Leu Arg Ile Phe Ser His Pro Asn Val Leu Pro Val Leu Gly Ala 
                245                 250                 255     


Cys Gln Ser Pro Pro Ala Pro His Pro Thr Leu Ile Thr His Trp Met 
            260                 265                 270         


Pro Tyr Gly Ser Leu Tyr Asn Val Leu His Glu Gly Thr Asn Phe Val 
        275                 280                 285             


Val Asp Gln Ser Gln Ala Val Lys Phe Ala Leu Asp Met Ala Arg Gly 
    290                 295                 300                 


Met Ala Phe Leu His Thr Leu Glu Pro Leu Ile Pro Arg His Ala Leu 
305                 310                 315                 320 


Asn Ser Arg Ser Val Met Ile Asp Glu Asp Met Thr Ala Arg Ile Ser 
                325                 330                 335     


Met Ala Asp Val Lys Phe Ser Phe Gln Cys Pro Gly Arg Met Tyr Ala 
            340                 345                 350         


Pro Ala Trp Val Ala Pro Glu Ala Leu Gln Lys Lys Pro Glu Asp Thr 
        355                 360                 365             


Asn Arg Arg Ser Ala Asp Met Trp Ser Phe Ala Val Leu Leu Trp Glu 
    370                 375                 380                 


Leu Val Thr Arg Glu Val Pro Phe Ala Asp Leu Ser Asn Met Glu Ile 
385                 390                 395                 400 


Gly Met Lys Val Ala Leu Glu Gly Leu Arg Pro Thr Ile Pro Pro Gly 
                405                 410                 415     


Ile Ser Pro His Val Cys Lys Leu Met Lys Ile Cys Met Asn Glu Asp 
            420                 425                 430         


Pro Ala Lys Arg Pro Lys Phe Asp Met Ile Val Pro Ile Leu Glu Lys 
        435                 440                 445             


Met Gln Asp Lys 
    450         


<210>  50
<211>  1660
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human integrin-linked kinase (ILK) mRNA transcript variant 4 
       GenBank Accession No.: NM_001278441.1  GI:510785739

<400>  50
agggacggag gcccgcgctg cccaagagcg ccacgggcgg ggcggggccg gcggcgggct       60

gcgggcgcgg ccggacggga gttccccgga gaaggatcct gcagcccgag tcccgaggat      120

aaagcttggg gttcatcctc cttccctgga tcactccaca gtcctcaggc ttccccaatc      180

caggggactc ggcgccggga cgctgctatg gacgacattt tcactcagtg ccgggagggc      240

aacgcagtcg ccgttcgcct gtggctggac aacacggaga acgacctcaa ccagggggac      300

gatcatggct tctccccctt gcactgggcc tgccgagagg gccgctctgc tgtggttgag      360

atgttgatca tgcggggggc acggatcaat gtaatgaacc gtggggatga cacccccctg      420

catctggcag ccagtcatgg acaccgtgat attgtacaga agctattgca gtacaaggca      480

gacatcaatg cagtgaatga acacgggaat gtgcccctgc actatgcctg tttttggggc      540

caagatcaag tggcagagag cgggcagaga agatgggcca gaatctcaac cgtattccat      600

acaaggacac attctggaag gggaccaccc gcactcggcc ccctatggaa gggccgctgg      660

cagggcaatg acattgtcgt gaaggtgctg aaggttcgag actggagtac aaggaagagc      720

agggacttca atgaagagtg tccccggctc aggattttct cgcatccaaa tgtgctccca      780

gtgctaggtg cctgccagtc tccacctgct cctcatccta ctctcatcac acactggatg      840

ccgtatggat ccctctacaa tgtactacat gaaggcacca atttcgtcgt ggaccagagc      900

caggctgtga agtttgcttt ggacatggca aggggcatgg ccttcctaca cacactagag      960

cccctcatcc cacgacatgc actcaatagc cgtagtgtaa tgattgatga ggacatgact     1020

gcccgaatta gcatggctga tgtcaagttc tctttccaat gtcctggtcg catgtatgca     1080

cctgcctggg tagcccccga agctctgcag aagaagcctg aagacacaaa cagacgctca     1140

gcagacatgt ggagttttgc agtgcttctg tgggaactgg tgacacggga ggtacccttt     1200

gctgacctct ccaatatgga gattggaatg aaggtggcat tggaaggcct tcggcctacc     1260

atcccaccag gtatttcccc tcatgtgtgt aagctcatga agatctgcat gaatgaagac     1320

cctgcaaagc gacccaaatt tgacatgatt gtgcctatcc ttgagaagat gcaggacaag     1380

taggactgga aggtccttgc ctgaactcca gaggtgtcgg gacatggttg ggggaatgca     1440

cctccccaaa gcagcaggcc tctggttgcc tcccccgcct ccagtcatgg tactacccca     1500

gccatggggt ccatcccctt cccccatccc taccactgtg gccccaagag gggcgggctc     1560

agagctttgt cacttgccac atggtgtctc ccaacatggg agggatcagc cccgcctgtc     1620

acaataaagt ttattatgaa aacaggaaaa aaaaaaaaaa                           1660


<210>  51
<211>  391
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human integrin-linked kinase (ILK) polypeptide encoded by 
       transcript variant 4 GenBank Accession No.: NP_001265370.1  
       GI:510785740

<400>  51

Met Asp Asp Ile Phe Thr Gln Cys Arg Glu Gly Asn Ala Val Ala Val 
1               5                   10                  15      


Arg Leu Trp Leu Asp Asn Thr Glu Asn Asp Leu Asn Gln Gly Asp Asp 
            20                  25                  30          


His Gly Phe Ser Pro Leu His Trp Ala Cys Arg Glu Gly Arg Ser Ala 
        35                  40                  45              


Val Val Glu Met Leu Ile Met Arg Gly Ala Arg Ile Asn Val Met Asn 
    50                  55                  60                  


Arg Gly Asp Asp Thr Pro Leu His Leu Ala Ala Ser His Gly His Arg 
65                  70                  75                  80  


Asp Ile Val Gln Lys Leu Leu Gln Tyr Lys Ala Asp Ile Asn Ala Val 
                85                  90                  95      


Asn Glu His Gly Asn Val Pro Leu His Tyr Ala Cys Phe Trp Gly Gln 
            100                 105                 110         


Asp Gln Val Ala Glu Ser Gly Gln Arg Arg Trp Ala Arg Ile Ser Thr 
        115                 120                 125             


Val Phe His Thr Arg Thr His Ser Gly Arg Gly Pro Pro Ala Leu Gly 
    130                 135                 140                 


Pro Leu Trp Lys Gly Arg Trp Gln Gly Asn Asp Ile Val Val Lys Val 
145                 150                 155                 160 


Leu Lys Val Arg Asp Trp Ser Thr Arg Lys Ser Arg Asp Phe Asn Glu 
                165                 170                 175     


Glu Cys Pro Arg Leu Arg Ile Phe Ser His Pro Asn Val Leu Pro Val 
            180                 185                 190         


Leu Gly Ala Cys Gln Ser Pro Pro Ala Pro His Pro Thr Leu Ile Thr 
        195                 200                 205             


His Trp Met Pro Tyr Gly Ser Leu Tyr Asn Val Leu His Glu Gly Thr 
    210                 215                 220                 


Asn Phe Val Val Asp Gln Ser Gln Ala Val Lys Phe Ala Leu Asp Met 
225                 230                 235                 240 


Ala Arg Gly Met Ala Phe Leu His Thr Leu Glu Pro Leu Ile Pro Arg 
                245                 250                 255     


His Ala Leu Asn Ser Arg Ser Val Met Ile Asp Glu Asp Met Thr Ala 
            260                 265                 270         


Arg Ile Ser Met Ala Asp Val Lys Phe Ser Phe Gln Cys Pro Gly Arg 
        275                 280                 285             


Met Tyr Ala Pro Ala Trp Val Ala Pro Glu Ala Leu Gln Lys Lys Pro 
    290                 295                 300                 


Glu Asp Thr Asn Arg Arg Ser Ala Asp Met Trp Ser Phe Ala Val Leu 
305                 310                 315                 320 


Leu Trp Glu Leu Val Thr Arg Glu Val Pro Phe Ala Asp Leu Ser Asn 
                325                 330                 335     


Met Glu Ile Gly Met Lys Val Ala Leu Glu Gly Leu Arg Pro Thr Ile 
            340                 345                 350         


Pro Pro Gly Ile Ser Pro His Val Cys Lys Leu Met Lys Ile Cys Met 
        355                 360                 365             


Asn Glu Asp Pro Ala Lys Arg Pro Lys Phe Asp Met Ile Val Pro Ile 
    370                 375                 380                 


Leu Glu Lys Met Gln Asp Lys 
385                 390     


<210>  52
<211>  1631
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human integrin-linked kinase (ILK) mRNA transcript variant 5 
       GenBank Accession No.: NM_001278442.1  GI:510785741

<400>  52
agggacggag gcccgcgctg cccaagagcg ccacgggcgg ggcggggccg gcggcgggct       60

gcgggcgcgg ccggacggga gttccccgga gaaggatcct gcagcccgag tcccgtcctc      120

aggcttcccc aatccagggg actcggcgcc gggacgctgc tatggacgac attttcactc      180

agtgccggga gggcaacgca gtcgccgttc gcctgtggct ggacaacacg gagaacgacc      240

tcaaccaggg ctattgcagt acaaggcaga catcaatgca gtgaatgaac acgggaatgt      300

gcccctgcac tatgcctgtt tttggggcca agatcaagtg gcagaggacc tggtggcaaa      360

tggggccctt gtcagcatct gtaacaagta tggagagatg cctgtggaca aagccaaggc      420

acccctgaga gagcttctcc gagagcgggc agagaagatg ggccagaatc tcaaccgtat      480

tccatacaag gacacattct ggaaggggac cacccgcact cggccccgaa atggaaccct      540

gaacaaacac tctggcattg acttcaaaca gcttaacttc ctgacgaagc tcaacgagaa      600

tcactctgga gagctatgga agggccgctg gcagggcaat gacattgtcg tgaaggtgct      660

gaaggttcga gactggagta caaggaagag cagggacttc aatgaagagt gtccccggct      720

caggattttc tcgcatccaa atgtgctccc agtgctaggt gcctgccagt ctccacctgc      780

tcctcatcct actctcatca cacactggat gccgtatgga tccctctaca atgtactaca      840

tgaaggcacc aatttcgtcg tggaccagag ccaggctgtg aagtttgctt tggacatggc      900

aaggggcatg gccttcctac acacactaga gcccctcatc ccacgacatg cactcaatag      960

ccgtagtgta atgattgatg aggacatgac tgcccgaatt agcatggctg atgtcaagtt     1020

ctctttccaa tgtcctggtc gcatgtatgc acctgcctgg gtagcccccg aagctctgca     1080

gaagaagcct gaagacacaa acagacgctc agcagacatg tggagttttg cagtgcttct     1140

gtgggaactg gtgacacggg aggtaccctt tgctgacctc tccaatatgg agattggaat     1200

gaaggtggca ttggaaggcc ttcggcctac catcccacca ggtatttccc ctcatgtgtg     1260

taagctcatg aagatctgca tgaatgaaga ccctgcaaag cgacccaaat ttgacatgat     1320

tgtgcctatc cttgagaaga tgcaggacaa gtaggactgg aaggtccttg cctgaactcc     1380

agaggtgtcg ggacatggtt gggggaatgc acctccccaa agcagcaggc ctctggttgc     1440

ctcccccgcc tccagtcatg gtactacccc agccatgggg tccatcccct tcccccatcc     1500

ctaccactgt ggccccaaga ggggcgggct cagagctttg tcacttgcca catggtgtct     1560

cccaacatgg gagggatcag ccccgcctgt cacaataaag tttattatga aaacaggaaa     1620

aaaaaaaaaa a                                                          1631


<210>  53
<211>  318
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human integrin-linked kinase (ILK) polypeptide encoded by 
       transcript variant 5 GenBank Accession No.: NP_001265371.1  
       GI:510785742

<400>  53

Met Pro Val Asp Lys Ala Lys Ala Pro Leu Arg Glu Leu Leu Arg Glu 
1               5                   10                  15      


Arg Ala Glu Lys Met Gly Gln Asn Leu Asn Arg Ile Pro Tyr Lys Asp 
            20                  25                  30          


Thr Phe Trp Lys Gly Thr Thr Arg Thr Arg Pro Arg Asn Gly Thr Leu 
        35                  40                  45              


Asn Lys His Ser Gly Ile Asp Phe Lys Gln Leu Asn Phe Leu Thr Lys 
    50                  55                  60                  


Leu Asn Glu Asn His Ser Gly Glu Leu Trp Lys Gly Arg Trp Gln Gly 
65                  70                  75                  80  


Asn Asp Ile Val Val Lys Val Leu Lys Val Arg Asp Trp Ser Thr Arg 
                85                  90                  95      


Lys Ser Arg Asp Phe Asn Glu Glu Cys Pro Arg Leu Arg Ile Phe Ser 
            100                 105                 110         


His Pro Asn Val Leu Pro Val Leu Gly Ala Cys Gln Ser Pro Pro Ala 
        115                 120                 125             


Pro His Pro Thr Leu Ile Thr His Trp Met Pro Tyr Gly Ser Leu Tyr 
    130                 135                 140                 


Asn Val Leu His Glu Gly Thr Asn Phe Val Val Asp Gln Ser Gln Ala 
145                 150                 155                 160 


Val Lys Phe Ala Leu Asp Met Ala Arg Gly Met Ala Phe Leu His Thr 
                165                 170                 175     


Leu Glu Pro Leu Ile Pro Arg His Ala Leu Asn Ser Arg Ser Val Met 
            180                 185                 190         


Ile Asp Glu Asp Met Thr Ala Arg Ile Ser Met Ala Asp Val Lys Phe 
        195                 200                 205             


Ser Phe Gln Cys Pro Gly Arg Met Tyr Ala Pro Ala Trp Val Ala Pro 
    210                 215                 220                 


Glu Ala Leu Gln Lys Lys Pro Glu Asp Thr Asn Arg Arg Ser Ala Asp 
225                 230                 235                 240 


Met Trp Ser Phe Ala Val Leu Leu Trp Glu Leu Val Thr Arg Glu Val 
                245                 250                 255     


Pro Phe Ala Asp Leu Ser Asn Met Glu Ile Gly Met Lys Val Ala Leu 
            260                 265                 270         


Glu Gly Leu Arg Pro Thr Ile Pro Pro Gly Ile Ser Pro His Val Cys 
        275                 280                 285             


Lys Leu Met Lys Ile Cys Met Asn Glu Asp Pro Ala Lys Arg Pro Lys 
    290                 295                 300                 


Phe Asp Met Ile Val Pro Ile Leu Glu Lys Met Gln Asp Lys 
305                 310                 315             


<210>  54
<211>  3724
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human PIK3CA mRNA GenBank Accession No.: NM_006218.2  GI:54792081

<400>  54
tctccctcgg cgccgccgcc gccgcccgcg gggctgggac ccgatgcggt tagagccgcg       60

gagcctggaa gagccccgag cgtttctgct ttgggacaac catacatcta attccttaaa      120

gtagttttat atgtaaaact tgcaaagaat cagaacaatg cctccacgac catcatcagg      180

tgaactgtgg ggcatccact tgatgccccc aagaatccta gtagaatgtt tactaccaaa      240

tggaatgata gtgactttag aatgcctccg tgaggctaca ttaataacca taaagcatga      300

actatttaaa gaagcaagaa aataccccct ccatcaactt cttcaagatg aatcttctta      360

cattttcgta agtgttactc aagaagcaga aagggaagaa ttttttgatg aaacaagacg      420

actttgtgac cttcggcttt ttcaaccctt tttaaaagta attgaaccag taggcaaccg      480

tgaagaaaag atcctcaatc gagaaattgg ttttgctatc ggcatgccag tgtgtgaatt      540

tgatatggtt aaagatccag aagtacagga cttccgaaga aatattctga acgtttgtaa      600

agaagctgtg gatcttaggg acctcaattc acctcatagt agagcaatgt atgtctatcc      660

tccaaatgta gaatcttcac cagaattgcc aaagcacata tataataaat tagataaagg      720

gcaaataata gtggtgatct gggtaatagt ttctccaaat aatgacaagc agaagtatac      780

tctgaaaatc aaccatgact gtgtaccaga acaagtaatt gctgaagcaa tcaggaaaaa      840

aactcgaagt atgttgctat cctctgaaca actaaaactc tgtgttttag aatatcaggg      900

caagtatatt ttaaaagtgt gtggatgtga tgaatacttc ctagaaaaat atcctctgag      960

tcagtataag tatataagaa gctgtataat gcttgggagg atgcccaatt tgatgttgat     1020

ggctaaagaa agcctttatt ctcaactgcc aatggactgt tttacaatgc catcttattc     1080

cagacgcatt tccacagcta caccatatat gaatggagaa acatctacaa aatccctttg     1140

ggttataaat agtgcactca gaataaaaat tctttgtgca acctacgtga atgtaaatat     1200

tcgagacatt gataagatct atgttcgaac aggtatctac catggaggag aacccttatg     1260

tgacaatgtg aacactcaaa gagtaccttg ttccaatccc aggtggaatg aatggctgaa     1320

ttatgatata tacattcctg atcttcctcg tgctgctcga ctttgccttt ccatttgctc     1380

tgttaaaggc cgaaagggtg ctaaagagga acactgtcca ttggcatggg gaaatataaa     1440

cttgtttgat tacacagaca ctctagtatc tggaaaaatg gctttgaatc tttggccagt     1500

acctcatgga ttagaagatt tgctgaaccc tattggtgtt actggatcaa atccaaataa     1560

agaaactcca tgcttagagt tggagtttga ctggttcagc agtgtggtaa agttcccaga     1620

tatgtcagtg attgaagagc atgccaattg gtctgtatcc cgagaagcag gatttagcta     1680

ttcccacgca ggactgagta acagactagc tagagacaat gaattaaggg aaaatgacaa     1740

agaacagctc aaagcaattt ctacacgaga tcctctctct gaaatcactg agcaggagaa     1800

agattttcta tggagtcaca gacactattg tgtaactatc cccgaaattc tacccaaatt     1860

gcttctgtct gttaaatgga attctagaga tgaagtagcc cagatgtatt gcttggtaaa     1920

agattggcct ccaatcaaac ctgaacaggc tatggaactt ctggactgta attacccaga     1980

tcctatggtt cgaggttttg ctgttcggtg cttggaaaaa tatttaacag atgacaaact     2040

ttctcagtat ttaattcagc tagtacaggt cctaaaatat gaacaatatt tggataactt     2100

gcttgtgaga tttttactga agaaagcatt gactaatcaa aggattgggc actttttctt     2160

ttggcattta aaatctgaga tgcacaataa aacagttagc cagaggtttg gcctgctttt     2220

ggagtcctat tgtcgtgcat gtgggatgta tttgaagcac ctgaataggc aagtcgaggc     2280

aatggaaaag ctcattaact taactgacat tctcaaacag gagaagaagg atgaaacaca     2340

aaaggtacag atgaagtttt tagttgagca aatgaggcga ccagatttca tggatgctct     2400

acagggcttt ctgtctcctc taaaccctgc tcatcaacta ggaaacctca ggcttgaaga     2460

gtgtcgaatt atgtcctctg caaaaaggcc actgtggttg aattgggaga acccagacat     2520

catgtcagag ttactgtttc agaacaatga gatcatcttt aaaaatgggg atgatttacg     2580

gcaagatatg ctaacacttc aaattattcg tattatggaa aatatctggc aaaatcaagg     2640

tcttgatctt cgaatgttac cttatggttg tctgtcaatc ggtgactgtg tgggacttat     2700

tgaggtggtg cgaaattctc acactattat gcaaattcag tgcaaaggcg gcttgaaagg     2760

tgcactgcag ttcaacagcc acacactaca tcagtggctc aaagacaaga acaaaggaga     2820

aatatatgat gcagccattg acctgtttac acgttcatgt gctggatact gtgtagctac     2880

cttcattttg ggaattggag atcgtcacaa tagtaacatc atggtgaaag acgatggaca     2940

actgtttcat atagattttg gacacttttt ggatcacaag aagaaaaaat ttggttataa     3000

acgagaacgt gtgccatttg ttttgacaca ggatttctta atagtgatta gtaaaggagc     3060

ccaagaatgc acaaagacaa gagaatttga gaggtttcag gagatgtgtt acaaggctta     3120

tctagctatt cgacagcatg ccaatctctt cataaatctt ttctcaatga tgcttggctc     3180

tggaatgcca gaactacaat cttttgatga cattgcatac attcgaaaga ccctagcctt     3240

agataaaact gagcaagagg ctttggagta tttcatgaaa caaatgaatg atgcacatca     3300

tggtggctgg acaacaaaaa tggattggat cttccacaca attaaacagc atgcattgaa     3360

ctgaaaagat aactgagaaa atgaaagctc actctggatt ccacactgca ctgttaataa     3420

ctctcagcag gcaaagaccg attgcatagg aattgcacaa tccatgaaca gcattagaat     3480

ttacagcaag aacagaaata aaatactata taatttaaat aatgtaaacg caaacagggt     3540

ttgatagcac ttaaactagt tcatttcaaa attaagcttt agaataatgc gcaatttcat     3600

gttatgcctt aagtccaaaa aggtaaactt tgaagattgt ttgtatcttt ttttaaaaaa     3660

caaaacaaaa caaaaatccc caaaatatat agaaatgatg gagaaggaaa aaaaaaaaaa     3720

aaaa                                                                  3724


<210>  55
<211>  1068
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human PIK3CA polypeptide GenBank Accession No.: NP_006209.2  
       GI:54792082

<400>  55

Met Pro Pro Arg Pro Ser Ser Gly Glu Leu Trp Gly Ile His Leu Met 
1               5                   10                  15      


Pro Pro Arg Ile Leu Val Glu Cys Leu Leu Pro Asn Gly Met Ile Val 
            20                  25                  30          


Thr Leu Glu Cys Leu Arg Glu Ala Thr Leu Ile Thr Ile Lys His Glu 
        35                  40                  45              


Leu Phe Lys Glu Ala Arg Lys Tyr Pro Leu His Gln Leu Leu Gln Asp 
    50                  55                  60                  


Glu Ser Ser Tyr Ile Phe Val Ser Val Thr Gln Glu Ala Glu Arg Glu 
65                  70                  75                  80  


Glu Phe Phe Asp Glu Thr Arg Arg Leu Cys Asp Leu Arg Leu Phe Gln 
                85                  90                  95      


Pro Phe Leu Lys Val Ile Glu Pro Val Gly Asn Arg Glu Glu Lys Ile 
            100                 105                 110         


Leu Asn Arg Glu Ile Gly Phe Ala Ile Gly Met Pro Val Cys Glu Phe 
        115                 120                 125             


Asp Met Val Lys Asp Pro Glu Val Gln Asp Phe Arg Arg Asn Ile Leu 
    130                 135                 140                 


Asn Val Cys Lys Glu Ala Val Asp Leu Arg Asp Leu Asn Ser Pro His 
145                 150                 155                 160 


Ser Arg Ala Met Tyr Val Tyr Pro Pro Asn Val Glu Ser Ser Pro Glu 
                165                 170                 175     


Leu Pro Lys His Ile Tyr Asn Lys Leu Asp Lys Gly Gln Ile Ile Val 
            180                 185                 190         


Val Ile Trp Val Ile Val Ser Pro Asn Asn Asp Lys Gln Lys Tyr Thr 
        195                 200                 205             


Leu Lys Ile Asn His Asp Cys Val Pro Glu Gln Val Ile Ala Glu Ala 
    210                 215                 220                 


Ile Arg Lys Lys Thr Arg Ser Met Leu Leu Ser Ser Glu Gln Leu Lys 
225                 230                 235                 240 


Leu Cys Val Leu Glu Tyr Gln Gly Lys Tyr Ile Leu Lys Val Cys Gly 
                245                 250                 255     


Cys Asp Glu Tyr Phe Leu Glu Lys Tyr Pro Leu Ser Gln Tyr Lys Tyr 
            260                 265                 270         


Ile Arg Ser Cys Ile Met Leu Gly Arg Met Pro Asn Leu Met Leu Met 
        275                 280                 285             


Ala Lys Glu Ser Leu Tyr Ser Gln Leu Pro Met Asp Cys Phe Thr Met 
    290                 295                 300                 


Pro Ser Tyr Ser Arg Arg Ile Ser Thr Ala Thr Pro Tyr Met Asn Gly 
305                 310                 315                 320 


Glu Thr Ser Thr Lys Ser Leu Trp Val Ile Asn Ser Ala Leu Arg Ile 
                325                 330                 335     


Lys Ile Leu Cys Ala Thr Tyr Val Asn Val Asn Ile Arg Asp Ile Asp 
            340                 345                 350         


Lys Ile Tyr Val Arg Thr Gly Ile Tyr His Gly Gly Glu Pro Leu Cys 
        355                 360                 365             


Asp Asn Val Asn Thr Gln Arg Val Pro Cys Ser Asn Pro Arg Trp Asn 
    370                 375                 380                 


Glu Trp Leu Asn Tyr Asp Ile Tyr Ile Pro Asp Leu Pro Arg Ala Ala 
385                 390                 395                 400 


Arg Leu Cys Leu Ser Ile Cys Ser Val Lys Gly Arg Lys Gly Ala Lys 
                405                 410                 415     


Glu Glu His Cys Pro Leu Ala Trp Gly Asn Ile Asn Leu Phe Asp Tyr 
            420                 425                 430         


Thr Asp Thr Leu Val Ser Gly Lys Met Ala Leu Asn Leu Trp Pro Val 
        435                 440                 445             


Pro His Gly Leu Glu Asp Leu Leu Asn Pro Ile Gly Val Thr Gly Ser 
    450                 455                 460                 


Asn Pro Asn Lys Glu Thr Pro Cys Leu Glu Leu Glu Phe Asp Trp Phe 
465                 470                 475                 480 


Ser Ser Val Val Lys Phe Pro Asp Met Ser Val Ile Glu Glu His Ala 
                485                 490                 495     


Asn Trp Ser Val Ser Arg Glu Ala Gly Phe Ser Tyr Ser His Ala Gly 
            500                 505                 510         


Leu Ser Asn Arg Leu Ala Arg Asp Asn Glu Leu Arg Glu Asn Asp Lys 
        515                 520                 525             


Glu Gln Leu Lys Ala Ile Ser Thr Arg Asp Pro Leu Ser Glu Ile Thr 
    530                 535                 540                 


Glu Gln Glu Lys Asp Phe Leu Trp Ser His Arg His Tyr Cys Val Thr 
545                 550                 555                 560 


Ile Pro Glu Ile Leu Pro Lys Leu Leu Leu Ser Val Lys Trp Asn Ser 
                565                 570                 575     


Arg Asp Glu Val Ala Gln Met Tyr Cys Leu Val Lys Asp Trp Pro Pro 
            580                 585                 590         


Ile Lys Pro Glu Gln Ala Met Glu Leu Leu Asp Cys Asn Tyr Pro Asp 
        595                 600                 605             


Pro Met Val Arg Gly Phe Ala Val Arg Cys Leu Glu Lys Tyr Leu Thr 
    610                 615                 620                 


Asp Asp Lys Leu Ser Gln Tyr Leu Ile Gln Leu Val Gln Val Leu Lys 
625                 630                 635                 640 


Tyr Glu Gln Tyr Leu Asp Asn Leu Leu Val Arg Phe Leu Leu Lys Lys 
                645                 650                 655     


Ala Leu Thr Asn Gln Arg Ile Gly His Phe Phe Phe Trp His Leu Lys 
            660                 665                 670         


Ser Glu Met His Asn Lys Thr Val Ser Gln Arg Phe Gly Leu Leu Leu 
        675                 680                 685             


Glu Ser Tyr Cys Arg Ala Cys Gly Met Tyr Leu Lys His Leu Asn Arg 
    690                 695                 700                 


Gln Val Glu Ala Met Glu Lys Leu Ile Asn Leu Thr Asp Ile Leu Lys 
705                 710                 715                 720 


Gln Glu Lys Lys Asp Glu Thr Gln Lys Val Gln Met Lys Phe Leu Val 
                725                 730                 735     


Glu Gln Met Arg Arg Pro Asp Phe Met Asp Ala Leu Gln Gly Phe Leu 
            740                 745                 750         


Ser Pro Leu Asn Pro Ala His Gln Leu Gly Asn Leu Arg Leu Glu Glu 
        755                 760                 765             


Cys Arg Ile Met Ser Ser Ala Lys Arg Pro Leu Trp Leu Asn Trp Glu 
    770                 775                 780                 


Asn Pro Asp Ile Met Ser Glu Leu Leu Phe Gln Asn Asn Glu Ile Ile 
785                 790                 795                 800 


Phe Lys Asn Gly Asp Asp Leu Arg Gln Asp Met Leu Thr Leu Gln Ile 
                805                 810                 815     


Ile Arg Ile Met Glu Asn Ile Trp Gln Asn Gln Gly Leu Asp Leu Arg 
            820                 825                 830         


Met Leu Pro Tyr Gly Cys Leu Ser Ile Gly Asp Cys Val Gly Leu Ile 
        835                 840                 845             


Glu Val Val Arg Asn Ser His Thr Ile Met Gln Ile Gln Cys Lys Gly 
    850                 855                 860                 


Gly Leu Lys Gly Ala Leu Gln Phe Asn Ser His Thr Leu His Gln Trp 
865                 870                 875                 880 


Leu Lys Asp Lys Asn Lys Gly Glu Ile Tyr Asp Ala Ala Ile Asp Leu 
                885                 890                 895     


Phe Thr Arg Ser Cys Ala Gly Tyr Cys Val Ala Thr Phe Ile Leu Gly 
            900                 905                 910         


Ile Gly Asp Arg His Asn Ser Asn Ile Met Val Lys Asp Asp Gly Gln 
        915                 920                 925             


Leu Phe His Ile Asp Phe Gly His Phe Leu Asp His Lys Lys Lys Lys 
    930                 935                 940                 


Phe Gly Tyr Lys Arg Glu Arg Val Pro Phe Val Leu Thr Gln Asp Phe 
945                 950                 955                 960 


Leu Ile Val Ile Ser Lys Gly Ala Gln Glu Cys Thr Lys Thr Arg Glu 
                965                 970                 975     


Phe Glu Arg Phe Gln Glu Met Cys Tyr Lys Ala Tyr Leu Ala Ile Arg 
            980                 985                 990         


Gln His Ala Asn Leu Phe Ile Asn  Leu Phe Ser Met Met  Leu Gly Ser 
        995                 1000                 1005             


Gly Met  Pro Glu Leu Gln Ser  Phe Asp Asp Ile Ala  Tyr Ile Arg 
    1010                 1015                 1020             


Lys Thr  Leu Ala Leu Asp Lys  Thr Glu Gln Glu Ala  Leu Glu Tyr 
    1025                 1030                 1035             


Phe Met  Lys Gln Met Asn Asp  Ala His His Gly Gly  Trp Thr Thr 
    1040                 1045                 1050             


Lys Met  Asp Trp Ile Phe His  Thr Ile Lys Gln His  Ala Leu Asn 
    1055                 1060                 1065             


<210>  56
<211>  3008
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human AKT1 mRNA transcript variant 1 GenBank Accession no.: 
       NM_005163.2  GI:62241010

<400>  56
taattatggg tctgtaacca ccctggactg ggtgctcctc actgacggac ttgtctgaac       60

ctctctttgt ctccagcgcc cagcactggg cctggcaaaa cctgagacgc ccggtacatg      120

ttggccaaat gaatgaacca gattcagacc ggcaggggcg ctgtggttta ggaggggcct      180

ggggtttctc ccaggaggtt tttgggcttg cgctggaggg ctctggactc ccgtttgcgc      240

cagtggcctg catcctggtc ctgtcttcct catgtttgaa tttctttgct ttcctagtct      300

ggggagcagg gaggagccct gtgccctgtc ccaggatcca tgggtaggaa caccatggac      360

agggagagca aacggggcca tctgtcacca ggggcttagg gaaggccgag ccagcctggg      420

tcaaagaagt caaaggggct gcctggagga ggcagcctgt cagctggtgc atcagaggct      480

gtggccaggc cagctgggct cggggagcgc cagcctgaga ggagcgcgtg agcgtcgcgg      540

gagcctcggg caccatgagc gacgtggcta ttgtgaagga gggttggctg cacaaacgag      600

gggagtacat caagacctgg cggccacgct acttcctcct caagaatgat ggcaccttca      660

ttggctacaa ggagcggccg caggatgtgg accaacgtga ggctcccctc aacaacttct      720

ctgtggcgca gtgccagctg atgaagacgg agcggccccg gcccaacacc ttcatcatcc      780

gctgcctgca gtggaccact gtcatcgaac gcaccttcca tgtggagact cctgaggagc      840

gggaggagtg gacaaccgcc atccagactg tggctgacgg cctcaagaag caggaggagg      900

aggagatgga cttccggtcg ggctcaccca gtgacaactc aggggctgaa gagatggagg      960

tgtccctggc caagcccaag caccgcgtga ccatgaacga gtttgagtac ctgaagctgc     1020

tgggcaaggg cactttcggc aaggtgatcc tggtgaagga gaaggccaca ggccgctact     1080

acgccatgaa gatcctcaag aaggaagtca tcgtggccaa ggacgaggtg gcccacacac     1140

tcaccgagaa ccgcgtcctg cagaactcca ggcacccctt cctcacagcc ctgaagtact     1200

ctttccagac ccacgaccgc ctctgctttg tcatggagta cgccaacggg ggcgagctgt     1260

tcttccacct gtcccgggag cgtgtgttct ccgaggaccg ggcccgcttc tatggcgctg     1320

agattgtgtc agccctggac tacctgcact cggagaagaa cgtggtgtac cgggacctca     1380

agctggagaa cctcatgctg gacaaggacg ggcacattaa gatcacagac ttcgggctgt     1440

gcaaggaggg gatcaaggac ggtgccacca tgaagacctt ttgcggcaca cctgagtacc     1500

tggcccccga ggtgctggag gacaatgact acggccgtgc agtggactgg tgggggctgg     1560

gcgtggtcat gtacgagatg atgtgcggtc gcctgccctt ctacaaccag gaccatgaga     1620

agctttttga gctcatcctc atggaggaga tccgcttccc gcgcacgctt ggtcccgagg     1680

ccaagtcctt gctttcaggg ctgctcaaga aggaccccaa gcagaggctt ggcgggggct     1740

ccgaggacgc caaggagatc atgcagcatc gcttctttgc cggtatcgtg tggcagcacg     1800

tgtacgagaa gaagctcagc ccacccttca agccccaggt cacgtcggag actgacacca     1860

ggtattttga tgaggagttc acggcccaga tgatcaccat cacaccacct gaccaagatg     1920

acagcatgga gtgtgtggac agcgagcgca ggccccactt cccccagttc tcctactcgg     1980

ccagcggcac ggcctgaggc ggcggtggac tgcgctggac gatagcttgg agggatggag     2040

aggcggcctc gtgccatgat ctgtatttaa tggtttttat ttctcgggtg catttgagag     2100

aagccacgct gtcctctcga gcccagatgg aaagacgttt ttgtgctgtg ggcagcaccc     2160

tcccccgcag cggggtaggg aagaaaacta tcctgcgggt tttaatttat ttcatccagt     2220

ttgttctccg ggtgtggcct cagccctcag aacaatccga ttcacgtagg gaaatgttaa     2280

ggacttctgc agctatgcgc aatgtggcat tggggggccg ggcaggtcct gcccatgtgt     2340

cccctcactc tgtcagccag ccgccctggg ctgtctgtca ccagctatct gtcatctctc     2400

tggggccctg ggcctcagtt caacctggtg gcaccagatg caacctcact atggtatgct     2460

ggccagcacc ctctcctggg ggtggcaggc acacagcagc cccccagcac taaggccgtg     2520

tctctgagga cgtcatcgga ggctgggccc ctgggatggg accagggatg ggggatgggc     2580

cagggtttac ccagtgggac agaggagcaa ggtttaaatt tgttattgtg tattatgttg     2640

ttcaaatgca ttttgggggt ttttaatctt tgtgacagga aagccctccc ccttcccctt     2700

ctgtgtcaca gttcttggtg actgtcccac cgggagcctc cccctcagat gatctctcca     2760

cggtagcact tgaccttttc gacgcttaac ctttccgctg tcgccccagg ccctccctga     2820

ctccctgtgg gggtggccat ccctgggccc ctccacgcct cctggccaga cgctgccgct     2880

gccgctgcac cacggcgttt ttttacaaca ttcaacttta gtatttttac tattataata     2940

taatatggaa ccttccctcc aaattcttca ataaaagttg cttttcaaaa aaaaaaaaaa     3000

aaaaaaaa                                                              3008


<210>  57
<211>  480
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human AKT1 polypeptide encoded by AKT1 mRNA transcript variant 1 
       GenBank Accession no.: NP_005154.2  GI:62241011

<400>  57

Met Ser Asp Val Ala Ile Val Lys Glu Gly Trp Leu His Lys Arg Gly 
1               5                   10                  15      


Glu Tyr Ile Lys Thr Trp Arg Pro Arg Tyr Phe Leu Leu Lys Asn Asp 
            20                  25                  30          


Gly Thr Phe Ile Gly Tyr Lys Glu Arg Pro Gln Asp Val Asp Gln Arg 
        35                  40                  45              


Glu Ala Pro Leu Asn Asn Phe Ser Val Ala Gln Cys Gln Leu Met Lys 
    50                  55                  60                  


Thr Glu Arg Pro Arg Pro Asn Thr Phe Ile Ile Arg Cys Leu Gln Trp 
65                  70                  75                  80  


Thr Thr Val Ile Glu Arg Thr Phe His Val Glu Thr Pro Glu Glu Arg 
                85                  90                  95      


Glu Glu Trp Thr Thr Ala Ile Gln Thr Val Ala Asp Gly Leu Lys Lys 
            100                 105                 110         


Gln Glu Glu Glu Glu Met Asp Phe Arg Ser Gly Ser Pro Ser Asp Asn 
        115                 120                 125             


Ser Gly Ala Glu Glu Met Glu Val Ser Leu Ala Lys Pro Lys His Arg 
    130                 135                 140                 


Val Thr Met Asn Glu Phe Glu Tyr Leu Lys Leu Leu Gly Lys Gly Thr 
145                 150                 155                 160 


Phe Gly Lys Val Ile Leu Val Lys Glu Lys Ala Thr Gly Arg Tyr Tyr 
                165                 170                 175     


Ala Met Lys Ile Leu Lys Lys Glu Val Ile Val Ala Lys Asp Glu Val 
            180                 185                 190         


Ala His Thr Leu Thr Glu Asn Arg Val Leu Gln Asn Ser Arg His Pro 
        195                 200                 205             


Phe Leu Thr Ala Leu Lys Tyr Ser Phe Gln Thr His Asp Arg Leu Cys 
    210                 215                 220                 


Phe Val Met Glu Tyr Ala Asn Gly Gly Glu Leu Phe Phe His Leu Ser 
225                 230                 235                 240 


Arg Glu Arg Val Phe Ser Glu Asp Arg Ala Arg Phe Tyr Gly Ala Glu 
                245                 250                 255     


Ile Val Ser Ala Leu Asp Tyr Leu His Ser Glu Lys Asn Val Val Tyr 
            260                 265                 270         


Arg Asp Leu Lys Leu Glu Asn Leu Met Leu Asp Lys Asp Gly His Ile 
        275                 280                 285             


Lys Ile Thr Asp Phe Gly Leu Cys Lys Glu Gly Ile Lys Asp Gly Ala 
    290                 295                 300                 


Thr Met Lys Thr Phe Cys Gly Thr Pro Glu Tyr Leu Ala Pro Glu Val 
305                 310                 315                 320 


Leu Glu Asp Asn Asp Tyr Gly Arg Ala Val Asp Trp Trp Gly Leu Gly 
                325                 330                 335     


Val Val Met Tyr Glu Met Met Cys Gly Arg Leu Pro Phe Tyr Asn Gln 
            340                 345                 350         


Asp His Glu Lys Leu Phe Glu Leu Ile Leu Met Glu Glu Ile Arg Phe 
        355                 360                 365             


Pro Arg Thr Leu Gly Pro Glu Ala Lys Ser Leu Leu Ser Gly Leu Leu 
    370                 375                 380                 


Lys Lys Asp Pro Lys Gln Arg Leu Gly Gly Gly Ser Glu Asp Ala Lys 
385                 390                 395                 400 


Glu Ile Met Gln His Arg Phe Phe Ala Gly Ile Val Trp Gln His Val 
                405                 410                 415     


Tyr Glu Lys Lys Leu Ser Pro Pro Phe Lys Pro Gln Val Thr Ser Glu 
            420                 425                 430         


Thr Asp Thr Arg Tyr Phe Asp Glu Glu Phe Thr Ala Gln Met Ile Thr 
        435                 440                 445             


Ile Thr Pro Pro Asp Gln Asp Asp Ser Met Glu Cys Val Asp Ser Glu 
    450                 455                 460                 


Arg Arg Pro His Phe Pro Gln Phe Ser Tyr Ser Ala Ser Gly Thr Ala 
465                 470                 475                 480 


<210>  58
<211>  2878
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human AKT1 mRNA transcript variant 2 GenBank Accession no.: 
       NM_001014432.1  GI:62241014

<400>  58
cggcaggacc gagcgcggca ggcggctggc ccagcgcagc cagcgcggcc cgaaggacgg       60

gagcaggcgg ccgagcaccg agcgctgggc accgggcacc gagcggcggc ggcacgcgag      120

gcccggcccc gagcagcgcc cccgcccgcc gcggcctcca gcccggcccc gcccagcgcc      180

ggcccgcggg gatgcggagc ggcgggcgcc ggaggccgcg gcccggctag gcccgcgctc      240

gcgcccggac gcggcggccc ggggcttagg gaaggccgag ccagcctggg tcaaagaagt      300

caaaggggct gcctggagga ggcagcctgt cagctggtgc atcagaggct gtggccaggc      360

cagctgggct cggggagcgc cagcctgaga ggagcgcgtg agcgtcgcgg gagcctcggg      420

caccatgagc gacgtggcta ttgtgaagga gggttggctg cacaaacgag gggagtacat      480

caagacctgg cggccacgct acttcctcct caagaatgat ggcaccttca ttggctacaa      540

ggagcggccg caggatgtgg accaacgtga ggctcccctc aacaacttct ctgtggcgca      600

gtgccagctg atgaagacgg agcggccccg gcccaacacc ttcatcatcc gctgcctgca      660

gtggaccact gtcatcgaac gcaccttcca tgtggagact cctgaggagc gggaggagtg      720

gacaaccgcc atccagactg tggctgacgg cctcaagaag caggaggagg aggagatgga      780

cttccggtcg ggctcaccca gtgacaactc aggggctgaa gagatggagg tgtccctggc      840

caagcccaag caccgcgtga ccatgaacga gtttgagtac ctgaagctgc tgggcaaggg      900

cactttcggc aaggtgatcc tggtgaagga gaaggccaca ggccgctact acgccatgaa      960

gatcctcaag aaggaagtca tcgtggccaa ggacgaggtg gcccacacac tcaccgagaa     1020

ccgcgtcctg cagaactcca ggcacccctt cctcacagcc ctgaagtact ctttccagac     1080

ccacgaccgc ctctgctttg tcatggagta cgccaacggg ggcgagctgt tcttccacct     1140

gtcccgggag cgtgtgttct ccgaggaccg ggcccgcttc tatggcgctg agattgtgtc     1200

agccctggac tacctgcact cggagaagaa cgtggtgtac cgggacctca agctggagaa     1260

cctcatgctg gacaaggacg ggcacattaa gatcacagac ttcgggctgt gcaaggaggg     1320

gatcaaggac ggtgccacca tgaagacctt ttgcggcaca cctgagtacc tggcccccga     1380

ggtgctggag gacaatgact acggccgtgc agtggactgg tgggggctgg gcgtggtcat     1440

gtacgagatg atgtgcggtc gcctgccctt ctacaaccag gaccatgaga agctttttga     1500

gctcatcctc atggaggaga tccgcttccc gcgcacgctt ggtcccgagg ccaagtcctt     1560

gctttcaggg ctgctcaaga aggaccccaa gcagaggctt ggcgggggct ccgaggacgc     1620

caaggagatc atgcagcatc gcttctttgc cggtatcgtg tggcagcacg tgtacgagaa     1680

gaagctcagc ccacccttca agccccaggt cacgtcggag actgacacca ggtattttga     1740

tgaggagttc acggcccaga tgatcaccat cacaccacct gaccaagatg acagcatgga     1800

gtgtgtggac agcgagcgca ggccccactt cccccagttc tcctactcgg ccagcggcac     1860

ggcctgaggc ggcggtggac tgcgctggac gatagcttgg agggatggag aggcggcctc     1920

gtgccatgat ctgtatttaa tggtttttat ttctcgggtg catttgagag aagccacgct     1980

gtcctctcga gcccagatgg aaagacgttt ttgtgctgtg ggcagcaccc tcccccgcag     2040

cggggtaggg aagaaaacta tcctgcgggt tttaatttat ttcatccagt ttgttctccg     2100

ggtgtggcct cagccctcag aacaatccga ttcacgtagg gaaatgttaa ggacttctgc     2160

agctatgcgc aatgtggcat tggggggccg ggcaggtcct gcccatgtgt cccctcactc     2220

tgtcagccag ccgccctggg ctgtctgtca ccagctatct gtcatctctc tggggccctg     2280

ggcctcagtt caacctggtg gcaccagatg caacctcact atggtatgct ggccagcacc     2340

ctctcctggg ggtggcaggc acacagcagc cccccagcac taaggccgtg tctctgagga     2400

cgtcatcgga ggctgggccc ctgggatggg accagggatg ggggatgggc cagggtttac     2460

ccagtgggac agaggagcaa ggtttaaatt tgttattgtg tattatgttg ttcaaatgca     2520

ttttgggggt ttttaatctt tgtgacagga aagccctccc ccttcccctt ctgtgtcaca     2580

gttcttggtg actgtcccac cgggagcctc cccctcagat gatctctcca cggtagcact     2640

tgaccttttc gacgcttaac ctttccgctg tcgccccagg ccctccctga ctccctgtgg     2700

gggtggccat ccctgggccc ctccacgcct cctggccaga cgctgccgct gccgctgcac     2760

cacggcgttt ttttacaaca ttcaacttta gtatttttac tattataata taatatggaa     2820

ccttccctcc aaattcttca ataaaagttg cttttcaaaa aaaaaaaaaa aaaaaaaa       2878


<210>  59
<211>  480
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human AKT1 polypeptide encoded by AKT1 mRNA transcript variant 2 
       GenBank Accession no.: NP_001014432.1  GI:62241015

<400>  59

Met Ser Asp Val Ala Ile Val Lys Glu Gly Trp Leu His Lys Arg Gly 
1               5                   10                  15      


Glu Tyr Ile Lys Thr Trp Arg Pro Arg Tyr Phe Leu Leu Lys Asn Asp 
            20                  25                  30          


Gly Thr Phe Ile Gly Tyr Lys Glu Arg Pro Gln Asp Val Asp Gln Arg 
        35                  40                  45              


Glu Ala Pro Leu Asn Asn Phe Ser Val Ala Gln Cys Gln Leu Met Lys 
    50                  55                  60                  


Thr Glu Arg Pro Arg Pro Asn Thr Phe Ile Ile Arg Cys Leu Gln Trp 
65                  70                  75                  80  


Thr Thr Val Ile Glu Arg Thr Phe His Val Glu Thr Pro Glu Glu Arg 
                85                  90                  95      


Glu Glu Trp Thr Thr Ala Ile Gln Thr Val Ala Asp Gly Leu Lys Lys 
            100                 105                 110         


Gln Glu Glu Glu Glu Met Asp Phe Arg Ser Gly Ser Pro Ser Asp Asn 
        115                 120                 125             


Ser Gly Ala Glu Glu Met Glu Val Ser Leu Ala Lys Pro Lys His Arg 
    130                 135                 140                 


Val Thr Met Asn Glu Phe Glu Tyr Leu Lys Leu Leu Gly Lys Gly Thr 
145                 150                 155                 160 


Phe Gly Lys Val Ile Leu Val Lys Glu Lys Ala Thr Gly Arg Tyr Tyr 
                165                 170                 175     


Ala Met Lys Ile Leu Lys Lys Glu Val Ile Val Ala Lys Asp Glu Val 
            180                 185                 190         


Ala His Thr Leu Thr Glu Asn Arg Val Leu Gln Asn Ser Arg His Pro 
        195                 200                 205             


Phe Leu Thr Ala Leu Lys Tyr Ser Phe Gln Thr His Asp Arg Leu Cys 
    210                 215                 220                 


Phe Val Met Glu Tyr Ala Asn Gly Gly Glu Leu Phe Phe His Leu Ser 
225                 230                 235                 240 


Arg Glu Arg Val Phe Ser Glu Asp Arg Ala Arg Phe Tyr Gly Ala Glu 
                245                 250                 255     


Ile Val Ser Ala Leu Asp Tyr Leu His Ser Glu Lys Asn Val Val Tyr 
            260                 265                 270         


Arg Asp Leu Lys Leu Glu Asn Leu Met Leu Asp Lys Asp Gly His Ile 
        275                 280                 285             


Lys Ile Thr Asp Phe Gly Leu Cys Lys Glu Gly Ile Lys Asp Gly Ala 
    290                 295                 300                 


Thr Met Lys Thr Phe Cys Gly Thr Pro Glu Tyr Leu Ala Pro Glu Val 
305                 310                 315                 320 


Leu Glu Asp Asn Asp Tyr Gly Arg Ala Val Asp Trp Trp Gly Leu Gly 
                325                 330                 335     


Val Val Met Tyr Glu Met Met Cys Gly Arg Leu Pro Phe Tyr Asn Gln 
            340                 345                 350         


Asp His Glu Lys Leu Phe Glu Leu Ile Leu Met Glu Glu Ile Arg Phe 
        355                 360                 365             


Pro Arg Thr Leu Gly Pro Glu Ala Lys Ser Leu Leu Ser Gly Leu Leu 
    370                 375                 380                 


Lys Lys Asp Pro Lys Gln Arg Leu Gly Gly Gly Ser Glu Asp Ala Lys 
385                 390                 395                 400 


Glu Ile Met Gln His Arg Phe Phe Ala Gly Ile Val Trp Gln His Val 
                405                 410                 415     


Tyr Glu Lys Lys Leu Ser Pro Pro Phe Lys Pro Gln Val Thr Ser Glu 
            420                 425                 430         


Thr Asp Thr Arg Tyr Phe Asp Glu Glu Phe Thr Ala Gln Met Ile Thr 
        435                 440                 445             


Ile Thr Pro Pro Asp Gln Asp Asp Ser Met Glu Cys Val Asp Ser Glu 
    450                 455                 460                 


Arg Arg Pro His Phe Pro Gln Phe Ser Tyr Ser Ala Ser Gly Thr Ala 
465                 470                 475                 480 


<210>  60
<211>  2794
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human AKT1 mRNA transcript variant 3 GenBank Accession no.: 
       NM_001014431.1  GI:62241012

<400>  60
cggcaggacc gagcgcggca ggcggctggc ccagcgcagc cagcgcggcc cgaaggacgg       60

gagcaggcgg ccgagcaccg agcgctgggc accgggcacc gagcggcggc ggcacgcgag      120

gcccggcccc gagcagcgcc cccgcccgcc gcggcctcca gcccggcccc gcccagcgcc      180

ggcccgcggg gatgcggagc ggcgggcgcc ggaggccgcg gcccggctag gcccgcgctc      240

gcgcccggac gcggcggccc gaggctgtgg ccaggccagc tgggctcggg gagcgccagc      300

ctgagaggag cgcgtgagcg tcgcgggagc ctcgggcacc atgagcgacg tggctattgt      360

gaaggagggt tggctgcaca aacgagggga gtacatcaag acctggcggc cacgctactt      420

cctcctcaag aatgatggca ccttcattgg ctacaaggag cggccgcagg atgtggacca      480

acgtgaggct cccctcaaca acttctctgt ggcgcagtgc cagctgatga agacggagcg      540

gccccggccc aacaccttca tcatccgctg cctgcagtgg accactgtca tcgaacgcac      600

cttccatgtg gagactcctg aggagcggga ggagtggaca accgccatcc agactgtggc      660

tgacggcctc aagaagcagg aggaggagga gatggacttc cggtcgggct cacccagtga      720

caactcaggg gctgaagaga tggaggtgtc cctggccaag cccaagcacc gcgtgaccat      780

gaacgagttt gagtacctga agctgctggg caagggcact ttcggcaagg tgatcctggt      840

gaaggagaag gccacaggcc gctactacgc catgaagatc ctcaagaagg aagtcatcgt      900

ggccaaggac gaggtggccc acacactcac cgagaaccgc gtcctgcaga actccaggca      960

ccccttcctc acagccctga agtactcttt ccagacccac gaccgcctct gctttgtcat     1020

ggagtacgcc aacgggggcg agctgttctt ccacctgtcc cgggagcgtg tgttctccga     1080

ggaccgggcc cgcttctatg gcgctgagat tgtgtcagcc ctggactacc tgcactcgga     1140

gaagaacgtg gtgtaccggg acctcaagct ggagaacctc atgctggaca aggacgggca     1200

cattaagatc acagacttcg ggctgtgcaa ggaggggatc aaggacggtg ccaccatgaa     1260

gaccttttgc ggcacacctg agtacctggc ccccgaggtg ctggaggaca atgactacgg     1320

ccgtgcagtg gactggtggg ggctgggcgt ggtcatgtac gagatgatgt gcggtcgcct     1380

gcccttctac aaccaggacc atgagaagct ttttgagctc atcctcatgg aggagatccg     1440

cttcccgcgc acgcttggtc ccgaggccaa gtccttgctt tcagggctgc tcaagaagga     1500

ccccaagcag aggcttggcg ggggctccga ggacgccaag gagatcatgc agcatcgctt     1560

ctttgccggt atcgtgtggc agcacgtgta cgagaagaag ctcagcccac ccttcaagcc     1620

ccaggtcacg tcggagactg acaccaggta ttttgatgag gagttcacgg cccagatgat     1680

caccatcaca ccacctgacc aagatgacag catggagtgt gtggacagcg agcgcaggcc     1740

ccacttcccc cagttctcct actcggccag cggcacggcc tgaggcggcg gtggactgcg     1800

ctggacgata gcttggaggg atggagaggc ggcctcgtgc catgatctgt atttaatggt     1860

ttttatttct cgggtgcatt tgagagaagc cacgctgtcc tctcgagccc agatggaaag     1920

acgtttttgt gctgtgggca gcaccctccc ccgcagcggg gtagggaaga aaactatcct     1980

gcgggtttta atttatttca tccagtttgt tctccgggtg tggcctcagc cctcagaaca     2040

atccgattca cgtagggaaa tgttaaggac ttctgcagct atgcgcaatg tggcattggg     2100

gggccgggca ggtcctgccc atgtgtcccc tcactctgtc agccagccgc cctgggctgt     2160

ctgtcaccag ctatctgtca tctctctggg gccctgggcc tcagttcaac ctggtggcac     2220

cagatgcaac ctcactatgg tatgctggcc agcaccctct cctgggggtg gcaggcacac     2280

agcagccccc cagcactaag gccgtgtctc tgaggacgtc atcggaggct gggcccctgg     2340

gatgggacca gggatggggg atgggccagg gtttacccag tgggacagag gagcaaggtt     2400

taaatttgtt attgtgtatt atgttgttca aatgcatttt gggggttttt aatctttgtg     2460

acaggaaagc cctccccctt ccccttctgt gtcacagttc ttggtgactg tcccaccggg     2520

agcctccccc tcagatgatc tctccacggt agcacttgac cttttcgacg cttaaccttt     2580

ccgctgtcgc cccaggccct ccctgactcc ctgtgggggt ggccatccct gggcccctcc     2640

acgcctcctg gccagacgct gccgctgccg ctgcaccacg gcgttttttt acaacattca     2700

actttagtat ttttactatt ataatataat atggaacctt ccctccaaat tcttcaataa     2760

aagttgcttt tcaaaaaaaa aaaaaaaaaa aaaa                                 2794


<210>  61
<211>  480
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human AKT1 polypeptide encoded by AKT1 mRNA transcript variant 3 
       GenBank Accession no.: NP_001014431.1  GI:62241013

<400>  61

Met Ser Asp Val Ala Ile Val Lys Glu Gly Trp Leu His Lys Arg Gly 
1               5                   10                  15      


Glu Tyr Ile Lys Thr Trp Arg Pro Arg Tyr Phe Leu Leu Lys Asn Asp 
            20                  25                  30          


Gly Thr Phe Ile Gly Tyr Lys Glu Arg Pro Gln Asp Val Asp Gln Arg 
        35                  40                  45              


Glu Ala Pro Leu Asn Asn Phe Ser Val Ala Gln Cys Gln Leu Met Lys 
    50                  55                  60                  


Thr Glu Arg Pro Arg Pro Asn Thr Phe Ile Ile Arg Cys Leu Gln Trp 
65                  70                  75                  80  


Thr Thr Val Ile Glu Arg Thr Phe His Val Glu Thr Pro Glu Glu Arg 
                85                  90                  95      


Glu Glu Trp Thr Thr Ala Ile Gln Thr Val Ala Asp Gly Leu Lys Lys 
            100                 105                 110         


Gln Glu Glu Glu Glu Met Asp Phe Arg Ser Gly Ser Pro Ser Asp Asn 
        115                 120                 125             


Ser Gly Ala Glu Glu Met Glu Val Ser Leu Ala Lys Pro Lys His Arg 
    130                 135                 140                 


Val Thr Met Asn Glu Phe Glu Tyr Leu Lys Leu Leu Gly Lys Gly Thr 
145                 150                 155                 160 


Phe Gly Lys Val Ile Leu Val Lys Glu Lys Ala Thr Gly Arg Tyr Tyr 
                165                 170                 175     


Ala Met Lys Ile Leu Lys Lys Glu Val Ile Val Ala Lys Asp Glu Val 
            180                 185                 190         


Ala His Thr Leu Thr Glu Asn Arg Val Leu Gln Asn Ser Arg His Pro 
        195                 200                 205             


Phe Leu Thr Ala Leu Lys Tyr Ser Phe Gln Thr His Asp Arg Leu Cys 
    210                 215                 220                 


Phe Val Met Glu Tyr Ala Asn Gly Gly Glu Leu Phe Phe His Leu Ser 
225                 230                 235                 240 


Arg Glu Arg Val Phe Ser Glu Asp Arg Ala Arg Phe Tyr Gly Ala Glu 
                245                 250                 255     


Ile Val Ser Ala Leu Asp Tyr Leu His Ser Glu Lys Asn Val Val Tyr 
            260                 265                 270         


Arg Asp Leu Lys Leu Glu Asn Leu Met Leu Asp Lys Asp Gly His Ile 
        275                 280                 285             


Lys Ile Thr Asp Phe Gly Leu Cys Lys Glu Gly Ile Lys Asp Gly Ala 
    290                 295                 300                 


Thr Met Lys Thr Phe Cys Gly Thr Pro Glu Tyr Leu Ala Pro Glu Val 
305                 310                 315                 320 


Leu Glu Asp Asn Asp Tyr Gly Arg Ala Val Asp Trp Trp Gly Leu Gly 
                325                 330                 335     


Val Val Met Tyr Glu Met Met Cys Gly Arg Leu Pro Phe Tyr Asn Gln 
            340                 345                 350         


Asp His Glu Lys Leu Phe Glu Leu Ile Leu Met Glu Glu Ile Arg Phe 
        355                 360                 365             


Pro Arg Thr Leu Gly Pro Glu Ala Lys Ser Leu Leu Ser Gly Leu Leu 
    370                 375                 380                 


Lys Lys Asp Pro Lys Gln Arg Leu Gly Gly Gly Ser Glu Asp Ala Lys 
385                 390                 395                 400 


Glu Ile Met Gln His Arg Phe Phe Ala Gly Ile Val Trp Gln His Val 
                405                 410                 415     


Tyr Glu Lys Lys Leu Ser Pro Pro Phe Lys Pro Gln Val Thr Ser Glu 
            420                 425                 430         


Thr Asp Thr Arg Tyr Phe Asp Glu Glu Phe Thr Ala Gln Met Ile Thr 
        435                 440                 445             


Ile Thr Pro Pro Asp Gln Asp Asp Ser Met Glu Cys Val Asp Ser Glu 
    450                 455                 460                 


Arg Arg Pro His Phe Pro Gln Phe Ser Tyr Ser Ala Ser Gly Thr Ala 
465                 470                 475                 480 


<210>  62
<211>  8626
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human TSC1 (hamartin) mRNA transcript variant 1 NM_000368.4  
       GI:241666460

<400>  62
acgacggggg aggtgctgta cgtccaagat ggcggcgccc tgtaggctgg agggactgtg       60

aggtaaacag ctgaggggga ggagacggtg gtgaccatga aagacaccag gttgacagca      120

ctggaaactg aagtaccagt tgtcgctaga acagtttggt agtggcccca atgaagaacc      180

ttcagaacct gtagcacacg tcctggagcc agcacagcgc cttcgagcga gagaatggcc      240

caacaagcaa atgtcgggga gcttcttgcc atgctggact cccccatgct gggtgtgcgg      300

gacgacgtga cagctgtctt taaagagaac ctcaattctg accgtggccc tatgcttgta      360

aacaccttgg tggattatta cctggaaacc agctctcagc cggcattgca catcctgacc      420

accttgcaag agccacatga caagcacctc ttggacagga ttaacgaata tgtgggcaaa      480

gccgccactc gtttatccat cctctcgtta ctgggtcatg tcataagact gcagccatct      540

tggaagcata agctctctca agcacctctt ttgccttctt tactaaaatg tctcaagatg      600

gacactgacg tcgttgtcct cacaacaggc gtcttggtgt tgataaccat gctaccaatg      660

attccacagt ctgggaaaca gcatcttctt gatttctttg acatttttgg ccgtctgtca      720

tcatggtgcc tgaagaaacc aggccacgtg gcggaagtct atctcgtcca tctccatgcc      780

agtgtgtacg cactctttca tcgcctttat ggaatgtacc cttgcaactt cgtctccttt      840

ttgcgttctc attacagtat gaaagaaaac ctggagactt ttgaagaagt ggtcaagcca      900

atgatggagc atgtgcgaat tcatccggaa ttagtgactg gatccaagga ccatgaactg      960

gaccctcgaa ggtggaagag attagaaact catgatgttg tgatcgagtg tgccaaaatc     1020

tctctggatc ccacagaagc ctcatatgaa gatggctatt ctgtgtctca ccaaatctca     1080

gcccgctttc ctcatcgttc agccgatgtc accaccagcc cttatgctga cacacagaat     1140

agctatgggt gtgctacttc taccccttac tccacgtctc ggctgatgtt gttaaatatg     1200

ccagggcagc tacctcagac tctgagttcc ccatcgacac ggctgataac tgaaccacca     1260

caagctactc tttggagccc atctatggtt tgtggtatga ccactcctcc aacttctcct     1320

ggaaatgtcc cacctgatct gtcacaccct tacagtaaag tctttggtac aactgcaggt     1380

ggaaaaggaa ctcctctggg aaccccagca acctctcctc ctccagcccc actctgtcat     1440

tcggatgact acgtgcacat ttcactcccc caggccacag tcacaccccc caggaaggaa     1500

gagagaatgg attctgcaag accatgtcta cacagacaac accatcttct gaatgacaga     1560

ggatcagaag agccacctgg cagcaaaggt tctgtcactc taagtgatct tccagggttt     1620

ttaggtgatc tggcctctga agaagatagt attgaaaaag ataaagaaga agctgcaata     1680

tctagagaac tttctgagat caccacagca gaggcagagc ctgtggttcc tcgaggaggc     1740

tttgactctc ccttttaccg agacagtctc ccaggttctc agcggaagac ccactcggca     1800

gcctccagtt ctcagggcgc cagcgtgaac cctgagcctt tacactcctc cctggacaag     1860

cttgggcctg acacaccaaa gcaagccttt actcccatag acctgccctg cggcagtgct     1920

gatgaaagcc ctgcgggaga cagggaatgc cagacttctt tggagaccag tatcttcact     1980

cccagtcctt gtaaaattcc acctccgacg agagtgggct ttggaagcgg gcagcctccc     2040

ccgtatgatc atctttttga ggtggcattg ccaaagacag cccatcattt tgtcatcagg     2100

aagactgagg agctgttaaa gaaagcaaaa ggaaacacag aggaagatgg tgtgccctct     2160

acctccccaa tggaagtgct ggacagactg atacagcagg gagcagacgc gcacagcaag     2220

gagctgaaca agttgccttt acccagcaag tctgtcgact ggacccactt tggaggctct     2280

cctccttcag atgagatccg caccctccga gaccagttgc ttttactgca caaccagtta     2340

ctctatgagc gttttaagag gcagcagcat gccctccgga acaggcggct cctccgcaag     2400

gtgatcaaag cagcagctct ggaggaacat aatgctgcca tgaaagatca gttgaagtta     2460

caagagaagg acatccagat gtggaaggtt agtctgcaga aagaacaagc tagatacaat     2520

cagctccagg agcagcgtga cactatggta accaagctcc acagccagat cagacagctg     2580

cagcatgacc gagaggaatt ctacaaccag agccaggaat tacagacgaa gctggaggac     2640

tgcaggaaca tgattgcgga gctgcggata gaactgaaga aggccaacaa caaggtgtgt     2700

cacactgagc tgctgctcag tcaggtttcc caaaagctct caaacagtga gtcggtccag     2760

cagcagatgg agttcttgaa caggcagctg ttggttcttg gggaggtcaa cgagctctat     2820

ttggaacaac tgcagaacaa gcactcagat accacaaagg aagtagaaat gatgaaagcc     2880

gcctatcgga aagagctaga aaaaaacaga agccatgttc tccagcagac tcagaggctt     2940

gatacctccc aaaaacggat tttggaactg gaatctcacc tggccaagaa agaccacctt     3000

cttttggaac agaagaaata tctagaggat gtcaaactcc aggcaagagg acagctgcag     3060

gccgcagaga gcaggtatga ggctcagaaa aggataaccc aggtgtttga attggagatc     3120

ttagatttat atggcaggtt ggagaaagat ggcctcctga aaaaacttga agaagaaaaa     3180

gcagaagcag ctgaagcagc agaagaaagg cttgactgtt gtaatgacgg gtgctcagat     3240

tccatggtag ggcacaatga agaggcatct ggccacaacg gtgagaccaa gacccccagg     3300

cccagcagcg cccggggcag tagtggaagc agaggtggtg gaggcagcag cagcagcagc     3360

agcgagcttt ctaccccaga gaaaccccca caccagaggg caggcccatt cagcagtcgg     3420

tgggagacga ctatgggaga agcgtctgcc agcatcccca ccactgtggg ctcacttccc     3480

agttcaaaaa gcttcctggg tatgaaggct cgagagttat ttcgtaataa gagcgagagc     3540

cagtgtgatg aggacggcat gaccagtagc ctttctgaga gcctaaagac agaactgggc     3600

aaagacttgg gtgtggaagc caagattccc ctgaacctag atggccctca cccgtctccc     3660

ccgaccccgg acagtgttgg acagctacat atcatggact acaatgagac tcatcatgaa     3720

cacagctaag gaatgatggt caatcagtgt taacttgcat attgttggca cagaacagga     3780

ggtgtgaatg cacgtttcaa agctttcctg tttccagggt ctgagtgcaa gttcatgtgt     3840

ggaaatggga cggaggtcct ttggacagct gactgaatgc agaacggttt ttggatctgg     3900

cattgaaatg cctcttgacc ttcccctcca cccgccctaa ccccctctca tttacctcgc     3960

agtgtgttct aatccaaggg ccagttggtg ttcctcagta gctttacttt cttcctttcc     4020

cccccaaatg gttgcgtcct ttgaacctgt gcaatatgag gccaaattta atctttgagt     4080

ctaacacacc actttctgct ttcccgaagt tcagataact gggttggctc tcaattagac     4140

caggtagttt gttgcattgc aggtaagtct ggttttgtcc cttccaggag gacatagcct     4200

gcaaagctgg ttgtctttac atgaaagcgt ttacatgaga ctttccgact gcttttttga     4260

ttctgaagtt cagcatctaa agcagcaggt ctagaagaac aacggtttat tcatacttgc     4320

attcttttgg cagttctgat aagcttccta gaaagttctg tgtaaacaga agcctgtttc     4380

agaaatctgg agctggcact gtggagacca cacacccttt gggaaagctc ttgtctcttc     4440

ttcccccact acctcttatt tatttggtgt ttgcttgaat gctggtacta ttgtgaccac     4500

aggctggtgt gtaggtggta aaacctgttc tccataggag ggaaggagca gtcactggga     4560

gaggttaccc gagaagcact tgagcatgag gaactgcacc tttaggccat ctcagcttgc     4620

tgggcctttt gttaaaccct tctgtctact ggcctccctt tgtgtgcata cgcctcttgt     4680

tcatgtcagc ttatatgtga cactgcagca gaaaggctct gaaggtccaa agagtttctg     4740

caaagtgtat gtgaccatca tttcccaggc cattagggtt gcctcactgt agcaggttct     4800

aggctaccag aagaggggca gctttttcat accaattcca actttcaggg gctgactctc     4860

cagggagctg atgtcatcac actctccatg ttagtaatgg cagagcagtc taaacagagt     4920

ccgggagaat gctggcaaag gctggctgtg tatacccact aggctgcccc acgtgctccc     4980

gagagatgac actagtcaga aaattggcag tggcagagaa tccaaactca acaagtgctc     5040

ctgaaagaaa cgctagaagc ctaagaactg tggtctggtg ttccagctga ggcaggggga     5100

tttggtagga aggagccagt gaacttggct ttcctgtttc tatctttcat taaaaagaat     5160

agaaggattc agtcataaag aggtaaaaaa ctgtcacggt acgaaatctt agtgcccacg     5220

gaggcctcga gcagagagaa tgaaagtctt tttttttttt tttttttttt agcatggcaa     5280

taaatattct agcatcccta actaaagggg actagacagt tagagactct gtcaccctag     5340

ctataccagc agaaaacctg ttcaggcagg ctttctgggt gtgactgatt cccagcctgt     5400

ggcagggcgt ggtcccaact actcagccta gcacaggctg gcagttggta ctgaattgtc     5460

agatgtggag tattagtgac accacacatt taattcagct ttgtccaaag gaaagcttaa     5520

aacccaatac agtctagttt cctggttccg ttttagaaaa ggaaaacgtg aacaaactta     5580

gaaagggaag gaaatcccat cagtgaatcc tgaaactggt tttaagtgct ttccttctcc     5640

tcatgcccaa gagatctgtg ccatagaaca agataccagg cacttaaagc cttttcctga     5700

attggaaagg aaaagaggcc caagtgcaaa agaaaaaaca ttttagaaac ggacagctta     5760

taaaaataaa gggaagaaag gaggcagcat ggagagaggc ctgtgctaga agctccatgg     5820

acgtgtctgc acagggtcct cagctcatcc atgcggcctg ggtgtccttt tactcagctt     5880

tataacaaat gtggctccaa gctcaggtgc ctttgagttc taggaggctg tgggttttat     5940

tcaactacgg ttgggagaat gagacctgga gtcatgttga aggtgcccaa cctaaaaatg     6000

taggctttca tgttgcaaag aactccagag tcagtagtta ggtttggttt ggttttggac     6060

atgataaacc tgccaagagt caacaggtca cttgatcatg ctgcagtggg tagttctaag     6120

gatggaaagg tgacagtatt actctcgaga ggcaattcag tcctgggcaa aggtattagt     6180

acaataagcg ttaagggcag agtctacctt gaaaccaatt aagcagcttg gtattcataa     6240

atattgggat tggatggcct ccatccagaa atcactatgg gtgagcatac ctgtctcagc     6300

tgtttggcca atgtgcataa cctactcgga tccccacctg acactaacca gagtcagcac     6360

aggccccgag gagcccgaag tctgctgctg tgcagcatgg aattccttta aaaaggtgca     6420

ctacagtttt agcggggagg gggataggaa gacgcagagc aaatgagctc cggagtccct     6480

gcaggtgaat aaacacacag atctgcatct gatagaactt tgatggattt tcaaaaagcc     6540

gttgacaagg ctctgctata cagtctataa aaattgttat tatgggattg gaagaaacac     6600

gtggtcatga atagaaaaaa aacaaaccca aaggtaggaa ggtcaaggtc atttcttaga     6660

tggagaagtt gtgaaagatg tccttggaga tgagttttag gaccagcatt actaaggcag     6720

gtgggcagac agtgacctct ctaggtgtgt ccacagagtt tttcaggaga gaaaactgcc     6780

tgacctttgg gactaagctg cggaatcttc ttactaagct tgaagagtgg agaggcgaga     6840

ggtgagctac tttgtgagcc aaagcttatg tgacatggtt ggggaaacag tccaaactgt     6900

tctgagaagg tgaactgtta cgacccagga caattagaaa aattcaccca ccatgccgca     6960

cattactggg taaaagcagg gcagcaggga acaaaactcc agactcttgg gccgtcccca     7020

tttgcaacag cacacatagt ttctggtata tttgttggga aagataaaac tctagcagtt     7080

gttgagggga ggatgtataa aatggtcatg gggatgaaag gatctctgag accacagagg     7140

ctcagactca ctgttaagaa tagaaaactg ggtatgcgtt tcatgtagcc agcagaactg     7200

aagtgtgctg tgacaagcca atgtgaattt ctaccaaata gtagagcata ccacttgaag     7260

aaggaaagaa ccgaagagca aacaaaagtt ctgcgtaatg agactcacct tttctcgctg     7320

aaagcactaa gaggtgggag gaggcctgca caggctggag gagggtttgg gcagagcgaa     7380

gacccggcca ggaccttggt gagatggggt gccgcccacc tcctgcggat actcttggag     7440

agttgttccc ccagggggct ctgccccacc tggagaagga agctgcctgg tgtggagtga     7500

ctcaaatcag tatacctatc tgctgcacct tcactctcca gggtacatgc tttaaaaccg     7560

acccgcaaca agtattggaa aaatgtatcc agtctgaaga tgtttgtgta tctgtttaca     7620

tccagagttc tgtgacacat gccccccaga ttgctgcaaa gatcccaagg cattgattgc     7680

acttgattaa gcttttgtct gtaggtgaaa gaacaagttt aggtcgagga ctggccccta     7740

ggctgctgct gtgacccttg tcccatgtgg cttgtttgcc tgtccgggac tcttcgatgt     7800

gcccagggga gcgtgttcct gtctcttcca tgccgtcctg cagtccttat ctgctcgcct     7860

gagggaagag tagctgtagc tacaagggaa gcctgcctgg aagagccgag cacctgtgcc     7920

catggcttct ggtcatgaaa cgagttaatg atggcagagg agcttcctcc ccacttcgca     7980

gcgccacatt atccatcctc tgagataagt aggctggttt aaccattgga atggaccttt     8040

cagtggaaac cctgagagtc tgagaacccc cagaccaacc cttccctccc tttccccacc     8100

tcttacagtg tttggacagg agggtatggt gctgctctgt gtagcaagta ctttggctta     8160

tgaaagaggc agccacgcat tttgcactag gaagaatcag taatcacttt tcagaagact     8220

tctatggacc acaaatatat tacggaggaa cagattttgc taagacataa tctagtttta     8280

taactcaatc atgaatgaac catgtgtggc aaacttgcag tttaaagggg tcccatcagt     8340

gaaagaaact gatttttttt aacggactgc ttttagttaa attgaagaaa gtcagctctt     8400

gtcaaaaggt ctaaactttc ccgcctcaat cctaaaagca tgtcaacaat ccacatcaga     8460

tgccataaat atgaactgca ggataaaatg gtacaatctt agtgaatggg aattggaatc     8520

aaaagagttt gctgtccttc ttagaatgtt ctaaaatgtc aaggcagttg cttgtgttta     8580

actgtgaaca aataaaaatt tattgttttg cactacaaaa aaaaaa                    8626


<210>  63
<211>  1164
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human TSC1 (hamartin) polypeptide encoded by mRNA transcript 
       variant 1 NP_000359.1 GI:4507693

<400>  63

Met Ala Gln Gln Ala Asn Val Gly Glu Leu Leu Ala Met Leu Asp Ser 
1               5                   10                  15      


Pro Met Leu Gly Val Arg Asp Asp Val Thr Ala Val Phe Lys Glu Asn 
            20                  25                  30          


Leu Asn Ser Asp Arg Gly Pro Met Leu Val Asn Thr Leu Val Asp Tyr 
        35                  40                  45              


Tyr Leu Glu Thr Ser Ser Gln Pro Ala Leu His Ile Leu Thr Thr Leu 
    50                  55                  60                  


Gln Glu Pro His Asp Lys His Leu Leu Asp Arg Ile Asn Glu Tyr Val 
65                  70                  75                  80  


Gly Lys Ala Ala Thr Arg Leu Ser Ile Leu Ser Leu Leu Gly His Val 
                85                  90                  95      


Ile Arg Leu Gln Pro Ser Trp Lys His Lys Leu Ser Gln Ala Pro Leu 
            100                 105                 110         


Leu Pro Ser Leu Leu Lys Cys Leu Lys Met Asp Thr Asp Val Val Val 
        115                 120                 125             


Leu Thr Thr Gly Val Leu Val Leu Ile Thr Met Leu Pro Met Ile Pro 
    130                 135                 140                 


Gln Ser Gly Lys Gln His Leu Leu Asp Phe Phe Asp Ile Phe Gly Arg 
145                 150                 155                 160 


Leu Ser Ser Trp Cys Leu Lys Lys Pro Gly His Val Ala Glu Val Tyr 
                165                 170                 175     


Leu Val His Leu His Ala Ser Val Tyr Ala Leu Phe His Arg Leu Tyr 
            180                 185                 190         


Gly Met Tyr Pro Cys Asn Phe Val Ser Phe Leu Arg Ser His Tyr Ser 
        195                 200                 205             


Met Lys Glu Asn Leu Glu Thr Phe Glu Glu Val Val Lys Pro Met Met 
    210                 215                 220                 


Glu His Val Arg Ile His Pro Glu Leu Val Thr Gly Ser Lys Asp His 
225                 230                 235                 240 


Glu Leu Asp Pro Arg Arg Trp Lys Arg Leu Glu Thr His Asp Val Val 
                245                 250                 255     


Ile Glu Cys Ala Lys Ile Ser Leu Asp Pro Thr Glu Ala Ser Tyr Glu 
            260                 265                 270         


Asp Gly Tyr Ser Val Ser His Gln Ile Ser Ala Arg Phe Pro His Arg 
        275                 280                 285             


Ser Ala Asp Val Thr Thr Ser Pro Tyr Ala Asp Thr Gln Asn Ser Tyr 
    290                 295                 300                 


Gly Cys Ala Thr Ser Thr Pro Tyr Ser Thr Ser Arg Leu Met Leu Leu 
305                 310                 315                 320 


Asn Met Pro Gly Gln Leu Pro Gln Thr Leu Ser Ser Pro Ser Thr Arg 
                325                 330                 335     


Leu Ile Thr Glu Pro Pro Gln Ala Thr Leu Trp Ser Pro Ser Met Val 
            340                 345                 350         


Cys Gly Met Thr Thr Pro Pro Thr Ser Pro Gly Asn Val Pro Pro Asp 
        355                 360                 365             


Leu Ser His Pro Tyr Ser Lys Val Phe Gly Thr Thr Ala Gly Gly Lys 
    370                 375                 380                 


Gly Thr Pro Leu Gly Thr Pro Ala Thr Ser Pro Pro Pro Ala Pro Leu 
385                 390                 395                 400 


Cys His Ser Asp Asp Tyr Val His Ile Ser Leu Pro Gln Ala Thr Val 
                405                 410                 415     


Thr Pro Pro Arg Lys Glu Glu Arg Met Asp Ser Ala Arg Pro Cys Leu 
            420                 425                 430         


His Arg Gln His His Leu Leu Asn Asp Arg Gly Ser Glu Glu Pro Pro 
        435                 440                 445             


Gly Ser Lys Gly Ser Val Thr Leu Ser Asp Leu Pro Gly Phe Leu Gly 
    450                 455                 460                 


Asp Leu Ala Ser Glu Glu Asp Ser Ile Glu Lys Asp Lys Glu Glu Ala 
465                 470                 475                 480 


Ala Ile Ser Arg Glu Leu Ser Glu Ile Thr Thr Ala Glu Ala Glu Pro 
                485                 490                 495     


Val Val Pro Arg Gly Gly Phe Asp Ser Pro Phe Tyr Arg Asp Ser Leu 
            500                 505                 510         


Pro Gly Ser Gln Arg Lys Thr His Ser Ala Ala Ser Ser Ser Gln Gly 
        515                 520                 525             


Ala Ser Val Asn Pro Glu Pro Leu His Ser Ser Leu Asp Lys Leu Gly 
    530                 535                 540                 


Pro Asp Thr Pro Lys Gln Ala Phe Thr Pro Ile Asp Leu Pro Cys Gly 
545                 550                 555                 560 


Ser Ala Asp Glu Ser Pro Ala Gly Asp Arg Glu Cys Gln Thr Ser Leu 
                565                 570                 575     


Glu Thr Ser Ile Phe Thr Pro Ser Pro Cys Lys Ile Pro Pro Pro Thr 
            580                 585                 590         


Arg Val Gly Phe Gly Ser Gly Gln Pro Pro Pro Tyr Asp His Leu Phe 
        595                 600                 605             


Glu Val Ala Leu Pro Lys Thr Ala His His Phe Val Ile Arg Lys Thr 
    610                 615                 620                 


Glu Glu Leu Leu Lys Lys Ala Lys Gly Asn Thr Glu Glu Asp Gly Val 
625                 630                 635                 640 


Pro Ser Thr Ser Pro Met Glu Val Leu Asp Arg Leu Ile Gln Gln Gly 
                645                 650                 655     


Ala Asp Ala His Ser Lys Glu Leu Asn Lys Leu Pro Leu Pro Ser Lys 
            660                 665                 670         


Ser Val Asp Trp Thr His Phe Gly Gly Ser Pro Pro Ser Asp Glu Ile 
        675                 680                 685             


Arg Thr Leu Arg Asp Gln Leu Leu Leu Leu His Asn Gln Leu Leu Tyr 
    690                 695                 700                 


Glu Arg Phe Lys Arg Gln Gln His Ala Leu Arg Asn Arg Arg Leu Leu 
705                 710                 715                 720 


Arg Lys Val Ile Lys Ala Ala Ala Leu Glu Glu His Asn Ala Ala Met 
                725                 730                 735     


Lys Asp Gln Leu Lys Leu Gln Glu Lys Asp Ile Gln Met Trp Lys Val 
            740                 745                 750         


Ser Leu Gln Lys Glu Gln Ala Arg Tyr Asn Gln Leu Gln Glu Gln Arg 
        755                 760                 765             


Asp Thr Met Val Thr Lys Leu His Ser Gln Ile Arg Gln Leu Gln His 
    770                 775                 780                 


Asp Arg Glu Glu Phe Tyr Asn Gln Ser Gln Glu Leu Gln Thr Lys Leu 
785                 790                 795                 800 


Glu Asp Cys Arg Asn Met Ile Ala Glu Leu Arg Ile Glu Leu Lys Lys 
                805                 810                 815     


Ala Asn Asn Lys Val Cys His Thr Glu Leu Leu Leu Ser Gln Val Ser 
            820                 825                 830         


Gln Lys Leu Ser Asn Ser Glu Ser Val Gln Gln Gln Met Glu Phe Leu 
        835                 840                 845             


Asn Arg Gln Leu Leu Val Leu Gly Glu Val Asn Glu Leu Tyr Leu Glu 
    850                 855                 860                 


Gln Leu Gln Asn Lys His Ser Asp Thr Thr Lys Glu Val Glu Met Met 
865                 870                 875                 880 


Lys Ala Ala Tyr Arg Lys Glu Leu Glu Lys Asn Arg Ser His Val Leu 
                885                 890                 895     


Gln Gln Thr Gln Arg Leu Asp Thr Ser Gln Lys Arg Ile Leu Glu Leu 
            900                 905                 910         


Glu Ser His Leu Ala Lys Lys Asp His Leu Leu Leu Glu Gln Lys Lys 
        915                 920                 925             


Tyr Leu Glu Asp Val Lys Leu Gln Ala Arg Gly Gln Leu Gln Ala Ala 
    930                 935                 940                 


Glu Ser Arg Tyr Glu Ala Gln Lys Arg Ile Thr Gln Val Phe Glu Leu 
945                 950                 955                 960 


Glu Ile Leu Asp Leu Tyr Gly Arg Leu Glu Lys Asp Gly Leu Leu Lys 
                965                 970                 975     


Lys Leu Glu Glu Glu Lys Ala Glu Ala Ala Glu Ala Ala Glu Glu Arg 
            980                 985                 990         


Leu Asp Cys Cys Asn Asp Gly Cys  Ser Asp Ser Met Val  Gly His Asn 
        995                 1000                 1005             


Glu Glu  Ala Ser Gly His Asn  Gly Glu Thr Lys Thr  Pro Arg Pro 
    1010                 1015                 1020             


Ser Ser  Ala Arg Gly Ser Ser  Gly Ser Arg Gly Gly  Gly Gly Ser 
    1025                 1030                 1035             


Ser Ser  Ser Ser Ser Glu Leu  Ser Thr Pro Glu Lys  Pro Pro His 
    1040                 1045                 1050             


Gln Arg  Ala Gly Pro Phe Ser  Ser Arg Trp Glu Thr  Thr Met Gly 
    1055                 1060                 1065             


Glu Ala  Ser Ala Ser Ile Pro  Thr Thr Val Gly Ser  Leu Pro Ser 
    1070                 1075                 1080             


Ser Lys  Ser Phe Leu Gly Met  Lys Ala Arg Glu Leu  Phe Arg Asn 
    1085                 1090                 1095             


Lys Ser  Glu Ser Gln Cys Asp  Glu Asp Gly Met Thr  Ser Ser Leu 
    1100                 1105                 1110             


Ser Glu  Ser Leu Lys Thr Glu  Leu Gly Lys Asp Leu  Gly Val Glu 
    1115                 1120                 1125             


Ala Lys  Ile Pro Leu Asn Leu  Asp Gly Pro His Pro  Ser Pro Pro 
    1130                 1135                 1140             


Thr Pro  Asp Ser Val Gly Gln  Leu His Ile Met Asp  Tyr Asn Glu 
    1145                 1150                 1155             


Thr His  His Glu His Ser 
    1160                 


<210>  64
<211>  8600
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens hamartin (TSC1) mRNA, complete cds GenBank: 
       AF013168.1


<220>
<221>  misc_feature
<222>  (7444)..(7444)
<223>  n is a, c, g, t or u

<400>  64
gtgctgtacg tccaagatgg cggcgcctgt aggctggagg gactgtgagg taaacagctg       60

agggggagga gacggtggtg accatgaaag acaccaggtt gacagcactg gaaactgaag      120

taccagttgt cgctagaaca gtttggtagt ggccccaatg aagaaccttc agaacctgta      180

gcacacgtcc tggagccagc acagcgcctt cgagcgagag aatggcccaa caagcaaatg      240

tcggggagct tcttgccatg ctggactccc ccatgctggg tgtgcgggac gacgtgacag      300

ctgtctttaa agagaacctc aattctgacc gtggccctat gcttgtaaac accttggtgg      360

attattacct ggaaaccagc tctcagccgg cattgcacat cctgaccacc ttgcaagagc      420

cacatgacaa gcacctcttg gacaggatta acgaatatgt gggcaaagcc gccactcgtt      480

tatccatcct ctcgttactg ggtcatgtca taagactgca gccatcttgg aagcataagc      540

tctctcaagc acctcttttg ccttctttac taaaatgtct caagatggac actgacgtcg      600

ttgtcctcac aacaggcgtc ttggtgttga taaccatgct accaatgatt ccacagtctg      660

ggaaacagca tcttcttgat ttctttgaca tttttggccg tctgtcatca tggtgcctga      720

agaaaccagg ccacgtggcg gaagtctatc tcgtccatct ccatgccagt gtgtacgcac      780

tctttcatcg cctttatgga atgtaccctt gcaacttcgt ctcctttttg cgttctcatt      840

acagtatgaa agaaaacctg gagacttttg aagaagtggt caagccaatg atggagcatg      900

tgcgaattca tccggaatta gtgactggat ccaaggacca tgaactggac cctcgaaggt      960

ggaagagatt agaaactcat gatgttgtga tcgagtgtgc caaaatctct ctggatccca     1020

cagaagcctc atatgaagat ggctattctg tgtctcacca aatctcagcc cgctttcctc     1080

atcgttcagc cgatgtcacc accagccctt atgctgacac acagaatagc tatgggtgtg     1140

ctacttctac cccttactcc acgtctcggc tgatgttgtt aaatatgcca gggcagctac     1200

ctcagactct gagttcccca tcgacacggc tgataactga accaccacaa gctactcttt     1260

ggagcccatc tatggtttgt ggtatgacca ctcctccaac ttctcctgga aatgtcccac     1320

ctgatctgtc acacccttac agtaaagtct ttggtacaac tgcaggtgga aaaggaactc     1380

ctctgggaac cccagcaacc tctcctcctc cagccccact ctgtcattcg gatgactacg     1440

tgcacatttc actcccccag gccacagtca caccccccag gaaggaagag agaatggatt     1500

ctgcaagacc atgtctacac agacaacacc atcttctgaa tgacagagga tcagaagagc     1560

cacctggcag caaaggttct gtcactctaa gtgatcttcc agggttttta ggtgatctgg     1620

cctctgaaga agatagtatt gaaaaagata aagaagaagc tgcaatatct agagaacttt     1680

ctgagatcac cacagcagag gcagagcctg tggttcctcg aggaggcttt gactctccct     1740

tttaccgaga cagtctccca ggttctcagc ggaagaccca ctcggcagcc tccagttctc     1800

agggcgccag cgtgaaccct gagcctttac actcctccct ggacaagctt gggcctgaca     1860

caccaaagca agcctttact cccatagacc tgccctgcgg cagtgctgat gaaagccctg     1920

cgggagacag ggaatgccag acttctttgg agaccagtat cttcactccc agtccttgta     1980

aaattccacc tccgacgaga gtgggctttg gaagcgggca gcctcccccg tatgatcatc     2040

tttttgaggt ggcattgcca aagacagccc atcattttgt catcaggaag actgaggagc     2100

tgttaaagaa agcaaaagga aacacagagg aagatggtgt gccctctacc tccccaatgg     2160

aagtgctgga cagactgata cagcagggag cagacgcgca cagcaaggag ctgaacaagt     2220

tgcctttacc cagcaagtct gtcgactgga cccactttgg aggctctcct ccttcagatg     2280

agatccgcac cctccgagac cagttgcttt tactgcacaa ccagttactc tatgagcgtt     2340

ttaagaggca gcagcatgcc ctccggaaca ggcggctcct ccgcaaggtg atcaaagcag     2400

cagctctgga ggaacataat gctgccatga aagatcagtt gaagttacaa gagaaggaca     2460

tccagatgtg gaaggttagt ctgcagaaag aacaagctag atacaatcag ctccaggagc     2520

agcgtgacac tatggtaacc aagctccaca gccagatcag acagctgcag catgaccgag     2580

aggaattcta caaccagagc caggaattac agacgaagct ggaggactgc aggaacatga     2640

ttgcggagct gcggatagaa ctgaagaagg ccaacaacaa ggtgtgtcac actgagctgc     2700

tgctcagtca ggtttcccaa aagctctcaa acagtgagtc ggtccagcag cagatggagt     2760

tcttgaacag gcagctgttg gttcttgggg aggtcaacga gctctatttg gaacaactgc     2820

agaacaagca ctcagatacc acaaaggaag tagaaatgat gaaagccgcc tatcggaaag     2880

agctagaaaa aaacagaagc catgttctcc agcagactca gaggcttgat acctcccaaa     2940

aacggatttt ggaactggaa tctcacctgg ccaagaaaga ccaccttctt ttggaacaga     3000

agaaatatct agaggatgtc aaactccagg caagaggaca gctgcaggcc gcagagagca     3060

ggtatgaggc tcagaaaagg ataacccagg tgtttgaatt ggagatctta gatttatatg     3120

gcaggttgga gaaagatggc ctcctgaaaa aacttgaaga agaaaaagca gaagcagctg     3180

aagcagcaga agaaaggctt gactgttgta atgacgggtg ctcagattcc atggtagggc     3240

acaatgaaga ggcatctggc cacaacggtg agaccaagac ccccaggccc agcagcgccc     3300

ggggcagtag tggaagcaga ggtggtggag gcagcagcag cagcagcagc gagctttcta     3360

ccccagagaa acccccacac cagagggcag gcccattcag cagtcggtgg gagacgacta     3420

tgggagaagc gtctgccagc atccccacca ctgtgggctc acttcccagt tcaaaaagct     3480

tcctgggtat gaaggctcga gagttatttc gtaataagag cgagagccag tgtgatgagg     3540

acggcatgac cagtagcctt tctgagagcc taaagacaga actgggcaaa gacttgggtg     3600

tggaagccaa gattcccctg aacctagatg gccctcaccc gtctcccccg accccggaca     3660

gtgttggaca gctacatatc atggactaca atgagactca tcatgaacac agctaaggaa     3720

tgatggtcaa tcagtgttaa cttgcatatt gttggcacag aacaggaggt gtgaatgcac     3780

gtttcaaagc tttcctgttt ccagggtctg agtgcaagtt catgtgtgga aatgggacgg     3840

aggtcctttg gacagctgac tgaatgcaga acggtttttg gatctggcat tgaaatgcct     3900

cttgaccttc ccctccaccc gccctaaccc cctctcattt acctcgcagt gtgttctaat     3960

ccaagggcca gttggtgttc ctcagtagct ttactttctt ccttcccccc caaatggttg     4020

cgtcctttga acctgtgcaa tatgaggcca aatttaatct ttgagtctaa cacaccactt     4080

tctgctttcc cgaagttcag ataactgggt tggctctcaa ttagaccagg tagtttgttg     4140

cattgcaggt aagtctggtt ttgtcccttc caggaggaca tagcctgcaa agctggttgt     4200

ctttacatga aagcgtttac atgagacttt ccgactgctt ttttgattct gaagttcagc     4260

atctaaagca gcaggtctag aagaacaacg gtttattcat acttgcattc ttttggcagt     4320

tctgataagc ttcctagaaa gttctgtgta aacagaagcc tgtttcagaa atctggagct     4380

ggcactgtgg agaccacaca ccctttggga aagctcttgt ctcttcttcc cccactacct     4440

cttatttatt tggtgtttgc ttgaatgctg gtactattgt gaccacaggc tggtgtgtag     4500

gtggtaaaac ctgttctcca taggagggaa ggagcagtca ctgggagagg ttacccgaga     4560

agcacttgag catgaggaac tgcaccttta ggccatctca gcttgctggg ccttttgtta     4620

aacccttctg tctactggcc tccctttgtg tgcatacgcc tcttgttcat gtcagcttat     4680

atgtgacact gcagcagaaa ggctctgaag gtccaaagag tttctgcaaa gtgtatgtga     4740

ccatcatttc ccaggccatt agggttgcct cactgtagca ggttctaggc taccagaaga     4800

ggggcagctt tttcatacca attccaactt tcaggggctg actctccagg gagctgatgt     4860

catcacactc tccatgttag taatggcaga gcagtctaaa cagagtccgg gagaatgctg     4920

gcaaaggctg gctgtgtata cccactaggc tgccccacgt gctcccgaga gatgacacta     4980

gtcagaaaag tggcagtggc agagaatcca aactcaacaa gtgctcctga aagaaatgct     5040

agaagcctaa gaactgtggt ctggtgttcc agctgaggca gggggatttg gtaggaagga     5100

gccagtgaac ttggctttcc tgtttctatc tttcattaaa aagaatagaa ggattcagtc     5160

ataaagaggt aaaaaactgt cacggtacga aatcttagtg cctacggagg cctcgagcag     5220

aaagaatgaa agtctttttt tttttttttt ttttttagca tggcaataaa tattctagca     5280

tccctaacta aaggggacta gacagttaga gactctgtca ccctagctat accagcagaa     5340

aacctgttca ggcaggcttt ctgggtgtga ctgattccca gcctgtggca gggcgtggtc     5400

ccaactactc agcctagcac aggctggcag ttggtactga attgtcagat gtggagtatt     5460

agtgacacca cacatttaat tcagctttgt ccaaaggaaa gcttaaaacc caatacagtc     5520

tagtttcctg gttccgtttt agaaaaggaa aacgtgaaca aacttagaaa gggaaggaaa     5580

tcccatcagt gaatcctgaa actggtttta agtgctttcc ttctcctcat gcccaagaga     5640

tctgtgccat agaacaagat accaggcact taaagccttt tcctgaattg gaaaggaaaa     5700

gaggcccaag tgcaaaagaa aaaacatttt agaaacggac agcttataaa aataaaggga     5760

agaaaggagg cagcatggag agaggcctgt gctagaagct ccatggacgt gtctgcacag     5820

ggtcctcagc tcatccatgc ggcctgggtg tccttttact cagctttata acaaatgtgg     5880

ctccaagctc aggtgccttt gagttctagg aggctgtggg ttttattcaa ctacggttgg     5940

gagaatgaga cctggagtca tgttgaaggt gcccaaccta aaaatgtagg ctttcatgtt     6000

gcaaagaact ccagagtcag tagttaggtt tggtttggtt ttggacatga taaacctgcc     6060

aagagtcaac aggtcacttg atcatgctgc agtgggtagt tctaaggatg gaaaggtgac     6120

agtattactc tcgagaggca attcagtcct gggcaaaggt attagtacaa taagcgttaa     6180

gggcagagtc taccttgaaa ccaattaagc agcttggtat tcataaatat tgggattgga     6240

tggcctccat ccagaaatca ctatgggtga gcatacctgt ctcagctgtt tggccaatgt     6300

gcataaccta ctcggatccc cacctgacac taaccagagt cagcacaggc cccgaggagc     6360

ccgaagttct ctgctgtgca gcatggaatt cctttaaaaa ggtgcactac agttttagcg     6420

gggaggggga taggaagacg cagagcaaat gagctccgga gtccctgcag gtgaataaac     6480

acacagatct gcatctgata gaactttgat ggattttcaa aaagccgttg acaaggctct     6540

gctatacagt ctataaaaat tgttattatg ggattggaag aaacacatgg tcatgaatag     6600

aaaaaaaaca aacccaaagg taggaaggtc aaggtcattt cttagatgga gaagttgtga     6660

aagatgtcct tggagatgag ttttaggacc agcattacta aggcaggtgg gcagacagtg     6720

acctctctag gtgtgtccac agagtttttc aggagagaaa actgcctgac ctttgggact     6780

aagctgcgga atcttcttac taagcttgaa gagtggagag gcgagaggtg agctactttg     6840

tgagccaaag cttatgtgac atggttgggg aaacagtcca aactgttctg agaaggtgaa     6900

ctgttacgac ccaggacaat tagaaaaatt cacccaccat gccgcacatt actgggtaaa     6960

agcagggcag cagggaacaa aactccagac tcttgggccg tccccatttg caacagcaca     7020

catagtttct ggtatatttg ttgggaaaga taaaactcta gcagttgttg aggggaggat     7080

gtataaaatg gtcatgggga tgaaaggatc tctgagacca cagaggctca gactcactgt     7140

taagaataga aaactgggta tgcgtttcat gtagccagca gaactgaagt gtgctgtgac     7200

aagccaatgt gaatttctac caaatagtag agcataccac ttgaagaagg aaagaaccga     7260

agagcaaaca aaagttctgc gtaatgagac tcaccttttc tcgctgaaag cactaagagg     7320

tgggaggagg cctgcacagg ctggaggagg gtttgggcag agcgaagacc cggccaggac     7380

cttggtgaga tggagtgccg cccacctcct gcggatactc ttggagagtt gttcccccag     7440

gggnctctgc cccacctgga gaaggaagct gcctggtgtg gagtgactca aatcagtata     7500

cctatctgct gcaccttcac tctccagggt acatgcttta aaaccgaccc gcaacaagta     7560

ttggaaaaat gtatccagtc tgaagatgtt tgtgtatctg tttacatcca gagttctgtg     7620

acacatgccc cccagattgc tgcaaagatc ccaaggcatt gattgcactt gattaagctt     7680

ttgtctgtag gtgaaagaac aagtttaggt cgaggactgg cccctaggct gctgctgtga     7740

cccttgtccc atgtggcttg tttgcctgtc cgggactctt cgatgtgccc aggggagcgt     7800

gttcctgtct cttccatgcc gtcctgcagt ccttatctgc tcgcctgagg gaagagtagc     7860

tgtagctaca agggaagcct gcctggaaga gccgagcacc tgtgcccatg gcttctggtc     7920

atgaaacgag ttaatgatgg cagaggagct tcctccccac ttcgcagcgc cacattatcc     7980

atcctctgag ataagtaggc tggtttaacc attggaatgg acctttcagt ggaaaccctg     8040

agagtctgag aacccccaga ccaacccttc cctccctttc cccacctctt acagtgtttg     8100

gacaggaggg tatggtgctg ctctgtgtag caagtacttt ggcttatgaa agaggcagcc     8160

acgcattttg cactaggaag aatcagtaat cacttttcag aagacttcta tggaccacaa     8220

atatattacg gaggaacaga ttttgctaag acataatcta gttttataac tcaatcatga     8280

atgaaccatg tgtggcaaac ttgcagttta aaggggtccc atcagtgaaa gaaactgatt     8340

ttttttaacg gactgctttt agttaaattg aagaaagtca gctcttgtca aaaggtctaa     8400

actttcccgc ctcaatccta aaagcatgtc aacaatccac atcagatgcc ataaatatga     8460

actgcaggat aaaatggtac aatcttagtg aatgggaatt ggaatcaaaa gagtttgctg     8520

tccttcttag aatgttctaa aatgtcaagg cagttgcttg tgtttaactg tgaacaaata     8580

aaaatttatt gttttgcact                                                 8600


<210>  65
<211>  1164
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens hamartin (TSC1) polypeptide, complete cds GenBank: 
       AF013168.1

<400>  65

Met Ala Gln Gln Ala Asn Val Gly Glu Leu Leu Ala Met Leu Asp Ser 
1               5                   10                  15      


Pro Met Leu Gly Val Arg Asp Asp Val Thr Ala Val Phe Lys Glu Asn 
            20                  25                  30          


Leu Asn Ser Asp Arg Gly Pro Met Leu Val Asn Thr Leu Val Asp Tyr 
        35                  40                  45              


Tyr Leu Glu Thr Ser Ser Gln Pro Ala Leu His Ile Leu Thr Thr Leu 
    50                  55                  60                  


Gln Glu Pro His Asp Lys His Leu Leu Asp Arg Ile Asn Glu Tyr Val 
65                  70                  75                  80  


Gly Lys Ala Ala Thr Arg Leu Ser Ile Leu Ser Leu Leu Gly His Val 
                85                  90                  95      


Ile Arg Leu Gln Pro Ser Trp Lys His Lys Leu Ser Gln Ala Pro Leu 
            100                 105                 110         


Leu Pro Ser Leu Leu Lys Cys Leu Lys Met Asp Thr Asp Val Val Val 
        115                 120                 125             


Leu Thr Thr Gly Val Leu Val Leu Ile Thr Met Leu Pro Met Ile Pro 
    130                 135                 140                 


Gln Ser Gly Lys Gln His Leu Leu Asp Phe Phe Asp Ile Phe Gly Arg 
145                 150                 155                 160 


Leu Ser Ser Trp Cys Leu Lys Lys Pro Gly His Val Ala Glu Val Tyr 
                165                 170                 175     


Leu Val His Leu His Ala Ser Val Tyr Ala Leu Phe His Arg Leu Tyr 
            180                 185                 190         


Gly Met Tyr Pro Cys Asn Phe Val Ser Phe Leu Arg Ser His Tyr Ser 
        195                 200                 205             


Met Lys Glu Asn Leu Glu Thr Phe Glu Glu Val Val Lys Pro Met Met 
    210                 215                 220                 


Glu His Val Arg Ile His Pro Glu Leu Val Thr Gly Ser Lys Asp His 
225                 230                 235                 240 


Glu Leu Asp Pro Arg Arg Trp Lys Arg Leu Glu Thr His Asp Val Val 
                245                 250                 255     


Ile Glu Cys Ala Lys Ile Ser Leu Asp Pro Thr Glu Ala Ser Tyr Glu 
            260                 265                 270         


Asp Gly Tyr Ser Val Ser His Gln Ile Ser Ala Arg Phe Pro His Arg 
        275                 280                 285             


Ser Ala Asp Val Thr Thr Ser Pro Tyr Ala Asp Thr Gln Asn Ser Tyr 
    290                 295                 300                 


Gly Cys Ala Thr Ser Thr Pro Tyr Ser Thr Ser Arg Leu Met Leu Leu 
305                 310                 315                 320 


Asn Met Pro Gly Gln Leu Pro Gln Thr Leu Ser Ser Pro Ser Thr Arg 
                325                 330                 335     


Leu Ile Thr Glu Pro Pro Gln Ala Thr Leu Trp Ser Pro Ser Met Val 
            340                 345                 350         


Cys Gly Met Thr Thr Pro Pro Thr Ser Pro Gly Asn Val Pro Pro Asp 
        355                 360                 365             


Leu Ser His Pro Tyr Ser Lys Val Phe Gly Thr Thr Ala Gly Gly Lys 
    370                 375                 380                 


Gly Thr Pro Leu Gly Thr Pro Ala Thr Ser Pro Pro Pro Ala Pro Leu 
385                 390                 395                 400 


Cys His Ser Asp Asp Tyr Val His Ile Ser Leu Pro Gln Ala Thr Val 
                405                 410                 415     


Thr Pro Pro Arg Lys Glu Glu Arg Met Asp Ser Ala Arg Pro Cys Leu 
            420                 425                 430         


His Arg Gln His His Leu Leu Asn Asp Arg Gly Ser Glu Glu Pro Pro 
        435                 440                 445             


Gly Ser Lys Gly Ser Val Thr Leu Ser Asp Leu Pro Gly Phe Leu Gly 
    450                 455                 460                 


Asp Leu Ala Ser Glu Glu Asp Ser Ile Glu Lys Asp Lys Glu Glu Ala 
465                 470                 475                 480 


Ala Ile Ser Arg Glu Leu Ser Glu Ile Thr Thr Ala Glu Ala Glu Pro 
                485                 490                 495     


Val Val Pro Arg Gly Gly Phe Asp Ser Pro Phe Tyr Arg Asp Ser Leu 
            500                 505                 510         


Pro Gly Ser Gln Arg Lys Thr His Ser Ala Ala Ser Ser Ser Gln Gly 
        515                 520                 525             


Ala Ser Val Asn Pro Glu Pro Leu His Ser Ser Leu Asp Lys Leu Gly 
    530                 535                 540                 


Pro Asp Thr Pro Lys Gln Ala Phe Thr Pro Ile Asp Leu Pro Cys Gly 
545                 550                 555                 560 


Ser Ala Asp Glu Ser Pro Ala Gly Asp Arg Glu Cys Gln Thr Ser Leu 
                565                 570                 575     


Glu Thr Ser Ile Phe Thr Pro Ser Pro Cys Lys Ile Pro Pro Pro Thr 
            580                 585                 590         


Arg Val Gly Phe Gly Ser Gly Gln Pro Pro Pro Tyr Asp His Leu Phe 
        595                 600                 605             


Glu Val Ala Leu Pro Lys Thr Ala His His Phe Val Ile Arg Lys Thr 
    610                 615                 620                 


Glu Glu Leu Leu Lys Lys Ala Lys Gly Asn Thr Glu Glu Asp Gly Val 
625                 630                 635                 640 


Pro Ser Thr Ser Pro Met Glu Val Leu Asp Arg Leu Ile Gln Gln Gly 
                645                 650                 655     


Ala Asp Ala His Ser Lys Glu Leu Asn Lys Leu Pro Leu Pro Ser Lys 
            660                 665                 670         


Ser Val Asp Trp Thr His Phe Gly Gly Ser Pro Pro Ser Asp Glu Ile 
        675                 680                 685             


Arg Thr Leu Arg Asp Gln Leu Leu Leu Leu His Asn Gln Leu Leu Tyr 
    690                 695                 700                 


Glu Arg Phe Lys Arg Gln Gln His Ala Leu Arg Asn Arg Arg Leu Leu 
705                 710                 715                 720 


Arg Lys Val Ile Lys Ala Ala Ala Leu Glu Glu His Asn Ala Ala Met 
                725                 730                 735     


Lys Asp Gln Leu Lys Leu Gln Glu Lys Asp Ile Gln Met Trp Lys Val 
            740                 745                 750         


Ser Leu Gln Lys Glu Gln Ala Arg Tyr Asn Gln Leu Gln Glu Gln Arg 
        755                 760                 765             


Asp Thr Met Val Thr Lys Leu His Ser Gln Ile Arg Gln Leu Gln His 
    770                 775                 780                 


Asp Arg Glu Glu Phe Tyr Asn Gln Ser Gln Glu Leu Gln Thr Lys Leu 
785                 790                 795                 800 


Glu Asp Cys Arg Asn Met Ile Ala Glu Leu Arg Ile Glu Leu Lys Lys 
                805                 810                 815     


Ala Asn Asn Lys Val Cys His Thr Glu Leu Leu Leu Ser Gln Val Ser 
            820                 825                 830         


Gln Lys Leu Ser Asn Ser Glu Ser Val Gln Gln Gln Met Glu Phe Leu 
        835                 840                 845             


Asn Arg Gln Leu Leu Val Leu Gly Glu Val Asn Glu Leu Tyr Leu Glu 
    850                 855                 860                 


Gln Leu Gln Asn Lys His Ser Asp Thr Thr Lys Glu Val Glu Met Met 
865                 870                 875                 880 


Lys Ala Ala Tyr Arg Lys Glu Leu Glu Lys Asn Arg Ser His Val Leu 
                885                 890                 895     


Gln Gln Thr Gln Arg Leu Asp Thr Ser Gln Lys Arg Ile Leu Glu Leu 
            900                 905                 910         


Glu Ser His Leu Ala Lys Lys Asp His Leu Leu Leu Glu Gln Lys Lys 
        915                 920                 925             


Tyr Leu Glu Asp Val Lys Leu Gln Ala Arg Gly Gln Leu Gln Ala Ala 
    930                 935                 940                 


Glu Ser Arg Tyr Glu Ala Gln Lys Arg Ile Thr Gln Val Phe Glu Leu 
945                 950                 955                 960 


Glu Ile Leu Asp Leu Tyr Gly Arg Leu Glu Lys Asp Gly Leu Leu Lys 
                965                 970                 975     


Lys Leu Glu Glu Glu Lys Ala Glu Ala Ala Glu Ala Ala Glu Glu Arg 
            980                 985                 990         


Leu Asp Cys Cys Asn Asp Gly Cys  Ser Asp Ser Met Val  Gly His Asn 
        995                 1000                 1005             


Glu Glu  Ala Ser Gly His Asn  Gly Glu Thr Lys Thr  Pro Arg Pro 
    1010                 1015                 1020             


Ser Ser  Ala Arg Gly Ser Ser  Gly Ser Arg Gly Gly  Gly Gly Ser 
    1025                 1030                 1035             


Ser Ser  Ser Ser Ser Glu Leu  Ser Thr Pro Glu Lys  Pro Pro His 
    1040                 1045                 1050             


Gln Arg  Ala Gly Pro Phe Ser  Ser Arg Trp Glu Thr  Thr Met Gly 
    1055                 1060                 1065             


Glu Ala  Ser Ala Ser Ile Pro  Thr Thr Val Gly Ser  Leu Pro Ser 
    1070                 1075                 1080             


Ser Lys  Ser Phe Leu Gly Met  Lys Ala Arg Glu Leu  Phe Arg Asn 
    1085                 1090                 1095             


Lys Ser  Glu Ser Gln Cys Asp  Glu Asp Gly Met Thr  Ser Ser Leu 
    1100                 1105                 1110             


Ser Glu  Ser Leu Lys Thr Glu  Leu Gly Lys Asp Leu  Gly Val Glu 
    1115                 1120                 1125             


Ala Lys  Ile Pro Leu Asn Leu  Asp Gly Pro His Pro  Ser Pro Pro 
    1130                 1135                 1140             


Thr Pro  Asp Ser Val Gly Gln  Leu His Ile Met Asp  Tyr Asn Glu 
    1145                 1150                 1155             


Thr His  His Glu His Ser 
    1160                 


<210>  66
<211>  2219
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens TSC1 mRNA for tumor suppressor, complete cds

<400>  66
cctgtaggct ggagggactg tgaggtaaac agctgagggg gaggagacgg tggtgaccat       60

gaaagacacc aggttgacag cactggaaac tgaagtacca gttgtcgcta gaacagtttg      120

gtagtggccc caatgaagaa ccttcagaac ctgtagcaca cgtcctggag ccagcacagc      180

gccttcgagc gagagaatgg cccaacaagc aaatgtcggg gagcttcttg ccatgctgga      240

ctcccccatg ctgggtgtgc gggacgacgt gacagctgtc tttaaagaga acctcaattc      300

tgaccgtggc cctatgcttg taaacacctt ggtggattat tacctggaaa ccagctctca      360

gccggcattg cacatcctga ccaccttgca agagccacat gacaagcacc tcttggacag      420

gattaacgaa tatgtgggca aagccgccac tcgtttatcc atcctctcgt tactgggtca      480

tgtcataaga ctgcagccat cttggaagca taagctctct caagcacctc ttttgccttc      540

tttactaaaa tgtctcaaga tggacactga cgtcgttgtc ctcacaacag gcgtcttggt      600

gttgataacc atgctaccaa tgattccaca gtctgggaaa cagcatcttc ttgatttctt      660

tgacattttt ggccgtctgt catcatggtg cctgaagaaa ccaggccacg tggcggaagt      720

ctatctcgtc catctccatg ccagtgtgta cgcactcttt catcgccttt atggaatgta      780

cccttgcaac ttcgtctcct ttttgcgttc tcattacagt atgaaagaaa acctggagac      840

ttttgaagaa gtggtcaagc caatgatgga gcatgtgcga attcatccgg aattagtgac      900

tggatccaag gaccatgaac tggaccctcg aaggtggaag agattagaaa ctcatgatgt      960

tgtgatcgag tgtgccaaaa tctctctgga tcccacagaa gcctcatatg aagatggcta     1020

ttctgtgtct caccaaatct cagcccgctt tcctcatcgt tcagccgatg tcaccaccag     1080

cccttatgct gacacacaga atagctatgg gtgtgctact tctacccctt actccacgtc     1140

tcggctgatg ttgttaaata tgccagggca gctacctcag actctgagtt ccccatcgac     1200

acggctgata actgaaccac cacaagctac tctttggagc ccatctatgg tttgtggtat     1260

gaccactcct ccaacttctc ctggaaatgt cccacctgat ctgtcacacc cttacagtaa     1320

agtctttggt acaactgcag gtggaaaagg aactcctctg ggaaccccag caacctctcc     1380

tcctccagcc ccactctgtc attcggatga ctacgtgcac atttcactcc cccaggccac     1440

agtcacaccc cccaggaagg tgcgatccag ctcgtctgct atccctctgc ccaggcacag     1500

tgactcactt gcaagcctca ctttgagaag cctaattatg ccagatagaa ttctgaccta     1560

aaatgcaatg ggtttgaatc agaataattg aaaatggact aacaagctgc tctcaaggtg     1620

tgatctgccg acccctgggt ggttccaaaa ctctgctttg gcagaactct tgtggggaac     1680

ccagcaggtc aaacctactc tcttgatttt tttttttttt ttttttttgg agacagggtc     1740

ttgctctgtc gcccagggtg gagtgcagtg gcacgatcat ggctcactgc agccttgacc     1800

tcccagactc atgcaatcct cctgcatcag cctcgctagt agctgggact acaggcatgt     1860

gccatgatgc ctggctaagt tttttttttt tttggtggag atgaggtctc actatgttgc     1920

ccaggctagt cttaaactcc tggactcaaa cgatcctcct acctgggcct cccaaaatcc     1980

tgggattaca ggtttgagcc accacacctg accaaaccca ccttcatagt aacactaagc     2040

tgtcatttgc ccttcccact gcattggcat ctgcaatgat ggcacaaata cgacgggggg     2100

tgaaatggtg gcaacttagc ctgaatcaag gcaggcattg aagtgtccta tcccaagagg     2160

ccttgcattc ttccctgaca tcttctctcc attaaaaaca ataaaaacta aaaaatttt      2219


<210>  67
<211>  454
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens TSC1 polypeptide for tumor suppressor, complete cds

<400>  67

Met Ala Gln Gln Ala Asn Val Gly Glu Leu Leu Ala Met Leu Asp Ser 
1               5                   10                  15      


Pro Met Leu Gly Val Arg Asp Asp Val Thr Ala Val Phe Lys Glu Asn 
            20                  25                  30          


Leu Asn Ser Asp Arg Gly Pro Met Leu Val Asn Thr Leu Val Asp Tyr 
        35                  40                  45              


Tyr Leu Glu Thr Ser Ser Gln Pro Ala Leu His Ile Leu Thr Thr Leu 
    50                  55                  60                  


Gln Glu Pro His Asp Lys His Leu Leu Asp Arg Ile Asn Glu Tyr Val 
65                  70                  75                  80  


Gly Lys Ala Ala Thr Arg Leu Ser Ile Leu Ser Leu Leu Gly His Val 
                85                  90                  95      


Ile Arg Leu Gln Pro Ser Trp Lys His Lys Leu Ser Gln Ala Pro Leu 
            100                 105                 110         


Leu Pro Ser Leu Leu Lys Cys Leu Lys Met Asp Thr Asp Val Val Val 
        115                 120                 125             


Leu Thr Thr Gly Val Leu Val Leu Ile Thr Met Leu Pro Met Ile Pro 
    130                 135                 140                 


Gln Ser Gly Lys Gln His Leu Leu Asp Phe Phe Asp Ile Phe Gly Arg 
145                 150                 155                 160 


Leu Ser Ser Trp Cys Leu Lys Lys Pro Gly His Val Ala Glu Val Tyr 
                165                 170                 175     


Leu Val His Leu His Ala Ser Val Tyr Ala Leu Phe His Arg Leu Tyr 
            180                 185                 190         


Gly Met Tyr Pro Cys Asn Phe Val Ser Phe Leu Arg Ser His Tyr Ser 
        195                 200                 205             


Met Lys Glu Asn Leu Glu Thr Phe Glu Glu Val Val Lys Pro Met Met 
    210                 215                 220                 


Glu His Val Arg Ile His Pro Glu Leu Val Thr Gly Ser Lys Asp His 
225                 230                 235                 240 


Glu Leu Asp Pro Arg Arg Trp Lys Arg Leu Glu Thr His Asp Val Val 
                245                 250                 255     


Ile Glu Cys Ala Lys Ile Ser Leu Asp Pro Thr Glu Ala Ser Tyr Glu 
            260                 265                 270         


Asp Gly Tyr Ser Val Ser His Gln Ile Ser Ala Arg Phe Pro His Arg 
        275                 280                 285             


Ser Ala Asp Val Thr Thr Ser Pro Tyr Ala Asp Thr Gln Asn Ser Tyr 
    290                 295                 300                 


Gly Cys Ala Thr Ser Thr Pro Tyr Ser Thr Ser Arg Leu Met Leu Leu 
305                 310                 315                 320 


Asn Met Pro Gly Gln Leu Pro Gln Thr Leu Ser Ser Pro Ser Thr Arg 
                325                 330                 335     


Leu Ile Thr Glu Pro Pro Gln Ala Thr Leu Trp Ser Pro Ser Met Val 
            340                 345                 350         


Cys Gly Met Thr Thr Pro Pro Thr Ser Pro Gly Asn Val Pro Pro Asp 
        355                 360                 365             


Leu Ser His Pro Tyr Ser Lys Val Phe Gly Thr Thr Ala Gly Gly Lys 
    370                 375                 380                 


Gly Thr Pro Leu Gly Thr Pro Ala Thr Ser Pro Pro Pro Ala Pro Leu 
385                 390                 395                 400 


Cys His Ser Asp Asp Tyr Val His Ile Ser Leu Pro Gln Ala Thr Val 
                405                 410                 415     


Thr Pro Pro Arg Lys Val Arg Ser Ser Ser Ser Ala Ile Pro Leu Pro 
            420                 425                 430         


Arg His Ser Asp Ser Leu Ala Ser Leu Thr Leu Arg Ser Leu Ile Met 
        435                 440                 445             


Pro Asp Arg Ile Leu Thr 
    450                 


<210>  68
<211>  8623
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tuberous sclerosis 1 (TSC1), transcript variant 3, 
       mRNA

<400>  68
acgacggggg aggtgctgta cgtccaagat ggcggcgccc tgtaggctgg agggactgtg       60

aggtaaacag ctgaggggga ggagacggtg gtgaccatga aagacaccag gttgacagca      120

ctggaaactg aagtaccagt tgtcgctaga acagtttggt agtggcccca atgaagaacc      180

ttcagaacct gtagcacacg tcctggagcc agcacagcgc cttcgagcga gagaatggcc      240

caacaagcaa atgtcgggga gcttcttgcc atgctggact cccccatgct gggtgtgcgg      300

gacgacgtga cagctgtctt taaagagaac ctcaattctg accgtggccc tatgcttgta      360

aacaccttgg tggattatta cctggaaacc agctctcagc cggcattgca catcctgacc      420

accttgcaag agccacatga caagcacctc ttggacagga ttaacgaata tgtgggcaaa      480

gccgccactc gtttatccat cctctcgtta ctgggtcatg tcataagact gcagccatct      540

tggaagcata agctctctca agcacctctt ttgccttctt tactaaaatg tctcaagatg      600

gacactgacg tcgttgtcct cacaacaggc gtcttggtgt tgataaccat gctaccaatg      660

attccacagt ctgggaaaca gcatcttctt gatttctttg acatttttgg ccgtctgtca      720

tcatggtgcc tgaagaaacc aggccacgtg gcggaagtct atctcgtcca tctccatgcc      780

agtgtgtacg cactctttca tcgcctttat ggaatgtacc cttgcaactt cgtctccttt      840

ttgcgttctc attacagtat gaaagaaaac ctggagactt ttgaagaagt ggtcaagcca      900

atgatggagc atgtgcgaat tcatccggaa ttagtgactg gatccaagga ccatgaactg      960

gaccctcgaa ggtggaagag attagaaact catgatgttg tgatcgagtg tgccaaaatc     1020

tctctggatc ccacagaagc ctcatatgaa gatggctatt ctgtgtctca ccaaatctca     1080

gcccgctttc ctcatcgttc agccgatgtc accaccagcc cttatgctga cacacagaat     1140

agctatgggt gtgctacttc taccccttac tccacgtctc ggctgatgtt gttaaatatg     1200

ccagggcagc tacctcagac tctgagttcc ccatcgacac ggctgataac tgaaccacca     1260

caagctactc tttggagccc atctatggtt tgtggtatga ccactcctcc aacttctcct     1320

ggaaatgtcc cacctgatct gtcacaccct tacagtaaag tctttggtac aactggtgga     1380

aaaggaactc ctctgggaac cccagcaacc tctcctcctc cagccccact ctgtcattcg     1440

gatgactacg tgcacatttc actcccccag gccacagtca caccccccag gaaggaagag     1500

agaatggatt ctgcaagacc atgtctacac agacaacacc atcttctgaa tgacagagga     1560

tcagaagagc cacctggcag caaaggttct gtcactctaa gtgatcttcc agggttttta     1620

ggtgatctgg cctctgaaga agatagtatt gaaaaagata aagaagaagc tgcaatatct     1680

agagaacttt ctgagatcac cacagcagag gcagagcctg tggttcctcg aggaggcttt     1740

gactctccct tttaccgaga cagtctccca ggttctcagc ggaagaccca ctcggcagcc     1800

tccagttctc agggcgccag cgtgaaccct gagcctttac actcctccct ggacaagctt     1860

gggcctgaca caccaaagca agcctttact cccatagacc tgccctgcgg cagtgctgat     1920

gaaagccctg cgggagacag ggaatgccag acttctttgg agaccagtat cttcactccc     1980

agtccttgta aaattccacc tccgacgaga gtgggctttg gaagcgggca gcctcccccg     2040

tatgatcatc tttttgaggt ggcattgcca aagacagccc atcattttgt catcaggaag     2100

actgaggagc tgttaaagaa agcaaaagga aacacagagg aagatggtgt gccctctacc     2160

tccccaatgg aagtgctgga cagactgata cagcagggag cagacgcgca cagcaaggag     2220

ctgaacaagt tgcctttacc cagcaagtct gtcgactgga cccactttgg aggctctcct     2280

ccttcagatg agatccgcac cctccgagac cagttgcttt tactgcacaa ccagttactc     2340

tatgagcgtt ttaagaggca gcagcatgcc ctccggaaca ggcggctcct ccgcaaggtg     2400

atcaaagcag cagctctgga ggaacataat gctgccatga aagatcagtt gaagttacaa     2460

gagaaggaca tccagatgtg gaaggttagt ctgcagaaag aacaagctag atacaatcag     2520

ctccaggagc agcgtgacac tatggtaacc aagctccaca gccagatcag acagctgcag     2580

catgaccgag aggaattcta caaccagagc caggaattac agacgaagct ggaggactgc     2640

aggaacatga ttgcggagct gcggatagaa ctgaagaagg ccaacaacaa ggtgtgtcac     2700

actgagctgc tgctcagtca ggtttcccaa aagctctcaa acagtgagtc ggtccagcag     2760

cagatggagt tcttgaacag gcagctgttg gttcttgggg aggtcaacga gctctatttg     2820

gaacaactgc agaacaagca ctcagatacc acaaaggaag tagaaatgat gaaagccgcc     2880

tatcggaaag agctagaaaa aaacagaagc catgttctcc agcagactca gaggcttgat     2940

acctcccaaa aacggatttt ggaactggaa tctcacctgg ccaagaaaga ccaccttctt     3000

ttggaacaga agaaatatct agaggatgtc aaactccagg caagaggaca gctgcaggcc     3060

gcagagagca ggtatgaggc tcagaaaagg ataacccagg tgtttgaatt ggagatctta     3120

gatttatatg gcaggttgga gaaagatggc ctcctgaaaa aacttgaaga agaaaaagca     3180

gaagcagctg aagcagcaga agaaaggctt gactgttgta atgacgggtg ctcagattcc     3240

atggtagggc acaatgaaga ggcatctggc cacaacggtg agaccaagac ccccaggccc     3300

agcagcgccc ggggcagtag tggaagcaga ggtggtggag gcagcagcag cagcagcagc     3360

gagctttcta ccccagagaa acccccacac cagagggcag gcccattcag cagtcggtgg     3420

gagacgacta tgggagaagc gtctgccagc atccccacca ctgtgggctc acttcccagt     3480

tcaaaaagct tcctgggtat gaaggctcga gagttatttc gtaataagag cgagagccag     3540

tgtgatgagg acggcatgac cagtagcctt tctgagagcc taaagacaga actgggcaaa     3600

gacttgggtg tggaagccaa gattcccctg aacctagatg gccctcaccc gtctcccccg     3660

accccggaca gtgttggaca gctacatatc atggactaca atgagactca tcatgaacac     3720

agctaaggaa tgatggtcaa tcagtgttaa cttgcatatt gttggcacag aacaggaggt     3780

gtgaatgcac gtttcaaagc tttcctgttt ccagggtctg agtgcaagtt catgtgtgga     3840

aatgggacgg aggtcctttg gacagctgac tgaatgcaga acggtttttg gatctggcat     3900

tgaaatgcct cttgaccttc ccctccaccc gccctaaccc cctctcattt acctcgcagt     3960

gtgttctaat ccaagggcca gttggtgttc ctcagtagct ttactttctt cctttccccc     4020

ccaaatggtt gcgtcctttg aacctgtgca atatgaggcc aaatttaatc tttgagtcta     4080

acacaccact ttctgctttc ccgaagttca gataactggg ttggctctca attagaccag     4140

gtagtttgtt gcattgcagg taagtctggt tttgtccctt ccaggaggac atagcctgca     4200

aagctggttg tctttacatg aaagcgttta catgagactt tccgactgct tttttgattc     4260

tgaagttcag catctaaagc agcaggtcta gaagaacaac ggtttattca tacttgcatt     4320

cttttggcag ttctgataag cttcctagaa agttctgtgt aaacagaagc ctgtttcaga     4380

aatctggagc tggcactgtg gagaccacac accctttggg aaagctcttg tctcttcttc     4440

ccccactacc tcttatttat ttggtgtttg cttgaatgct ggtactattg tgaccacagg     4500

ctggtgtgta ggtggtaaaa cctgttctcc ataggaggga aggagcagtc actgggagag     4560

gttacccgag aagcacttga gcatgaggaa ctgcaccttt aggccatctc agcttgctgg     4620

gccttttgtt aaacccttct gtctactggc ctccctttgt gtgcatacgc ctcttgttca     4680

tgtcagctta tatgtgacac tgcagcagaa aggctctgaa ggtccaaaga gtttctgcaa     4740

agtgtatgtg accatcattt cccaggccat tagggttgcc tcactgtagc aggttctagg     4800

ctaccagaag aggggcagct ttttcatacc aattccaact ttcaggggct gactctccag     4860

ggagctgatg tcatcacact ctccatgtta gtaatggcag agcagtctaa acagagtccg     4920

ggagaatgct ggcaaaggct ggctgtgtat acccactagg ctgccccacg tgctcccgag     4980

agatgacact agtcagaaaa ttggcagtgg cagagaatcc aaactcaaca agtgctcctg     5040

aaagaaacgc tagaagccta agaactgtgg tctggtgttc cagctgaggc agggggattt     5100

ggtaggaagg agccagtgaa cttggctttc ctgtttctat ctttcattaa aaagaataga     5160

aggattcagt cataaagagg taaaaaactg tcacggtacg aaatcttagt gcccacggag     5220

gcctcgagca gagagaatga aagtcttttt tttttttttt tttttttagc atggcaataa     5280

atattctagc atccctaact aaaggggact agacagttag agactctgtc accctagcta     5340

taccagcaga aaacctgttc aggcaggctt tctgggtgtg actgattccc agcctgtggc     5400

agggcgtggt cccaactact cagcctagca caggctggca gttggtactg aattgtcaga     5460

tgtggagtat tagtgacacc acacatttaa ttcagctttg tccaaaggaa agcttaaaac     5520

ccaatacagt ctagtttcct ggttccgttt tagaaaagga aaacgtgaac aaacttagaa     5580

agggaaggaa atcccatcag tgaatcctga aactggtttt aagtgctttc cttctcctca     5640

tgcccaagag atctgtgcca tagaacaaga taccaggcac ttaaagcctt ttcctgaatt     5700

ggaaaggaaa agaggcccaa gtgcaaaaga aaaaacattt tagaaacgga cagcttataa     5760

aaataaaggg aagaaaggag gcagcatgga gagaggcctg tgctagaagc tccatggacg     5820

tgtctgcaca gggtcctcag ctcatccatg cggcctgggt gtccttttac tcagctttat     5880

aacaaatgtg gctccaagct caggtgcctt tgagttctag gaggctgtgg gttttattca     5940

actacggttg ggagaatgag acctggagtc atgttgaagg tgcccaacct aaaaatgtag     6000

gctttcatgt tgcaaagaac tccagagtca gtagttaggt ttggtttggt tttggacatg     6060

ataaacctgc caagagtcaa caggtcactt gatcatgctg cagtgggtag ttctaaggat     6120

ggaaaggtga cagtattact ctcgagaggc aattcagtcc tgggcaaagg tattagtaca     6180

ataagcgtta agggcagagt ctaccttgaa accaattaag cagcttggta ttcataaata     6240

ttgggattgg atggcctcca tccagaaatc actatgggtg agcatacctg tctcagctgt     6300

ttggccaatg tgcataacct actcggatcc ccacctgaca ctaaccagag tcagcacagg     6360

ccccgaggag cccgaagtct gctgctgtgc agcatggaat tcctttaaaa aggtgcacta     6420

cagttttagc ggggaggggg ataggaagac gcagagcaaa tgagctccgg agtccctgca     6480

ggtgaataaa cacacagatc tgcatctgat agaactttga tggattttca aaaagccgtt     6540

gacaaggctc tgctatacag tctataaaaa ttgttattat gggattggaa gaaacacgtg     6600

gtcatgaata gaaaaaaaac aaacccaaag gtaggaaggt caaggtcatt tcttagatgg     6660

agaagttgtg aaagatgtcc ttggagatga gttttaggac cagcattact aaggcaggtg     6720

ggcagacagt gacctctcta ggtgtgtcca cagagttttt caggagagaa aactgcctga     6780

cctttgggac taagctgcgg aatcttctta ctaagcttga agagtggaga ggcgagaggt     6840

gagctacttt gtgagccaaa gcttatgtga catggttggg gaaacagtcc aaactgttct     6900

gagaaggtga actgttacga cccaggacaa ttagaaaaat tcacccacca tgccgcacat     6960

tactgggtaa aagcagggca gcagggaaca aaactccaga ctcttgggcc gtccccattt     7020

gcaacagcac acatagtttc tggtatattt gttgggaaag ataaaactct agcagttgtt     7080

gaggggagga tgtataaaat ggtcatgggg atgaaaggat ctctgagacc acagaggctc     7140

agactcactg ttaagaatag aaaactgggt atgcgtttca tgtagccagc agaactgaag     7200

tgtgctgtga caagccaatg tgaatttcta ccaaatagta gagcatacca cttgaagaag     7260

gaaagaaccg aagagcaaac aaaagttctg cgtaatgaga ctcacctttt ctcgctgaaa     7320

gcactaagag gtgggaggag gcctgcacag gctggaggag ggtttgggca gagcgaagac     7380

ccggccagga ccttggtgag atggggtgcc gcccacctcc tgcggatact cttggagagt     7440

tgttccccca gggggctctg ccccacctgg agaaggaagc tgcctggtgt ggagtgactc     7500

aaatcagtat acctatctgc tgcaccttca ctctccaggg tacatgcttt aaaaccgacc     7560

cgcaacaagt attggaaaaa tgtatccagt ctgaagatgt ttgtgtatct gtttacatcc     7620

agagttctgt gacacatgcc ccccagattg ctgcaaagat cccaaggcat tgattgcact     7680

tgattaagct tttgtctgta ggtgaaagaa caagtttagg tcgaggactg gcccctaggc     7740

tgctgctgtg acccttgtcc catgtggctt gtttgcctgt ccgggactct tcgatgtgcc     7800

caggggagcg tgttcctgtc tcttccatgc cgtcctgcag tccttatctg ctcgcctgag     7860

ggaagagtag ctgtagctac aagggaagcc tgcctggaag agccgagcac ctgtgcccat     7920

ggcttctggt catgaaacga gttaatgatg gcagaggagc ttcctcccca cttcgcagcg     7980

ccacattatc catcctctga gataagtagg ctggtttaac cattggaatg gacctttcag     8040

tggaaaccct gagagtctga gaacccccag accaaccctt ccctcccttt ccccacctct     8100

tacagtgttt ggacaggagg gtatggtgct gctctgtgta gcaagtactt tggcttatga     8160

aagaggcagc cacgcatttt gcactaggaa gaatcagtaa tcacttttca gaagacttct     8220

atggaccaca aatatattac ggaggaacag attttgctaa gacataatct agttttataa     8280

ctcaatcatg aatgaaccat gtgtggcaaa cttgcagttt aaaggggtcc catcagtgaa     8340

agaaactgat tttttttaac ggactgcttt tagttaaatt gaagaaagtc agctcttgtc     8400

aaaaggtcta aactttcccg cctcaatcct aaaagcatgt caacaatcca catcagatgc     8460

cataaatatg aactgcagga taaaatggta caatcttagt gaatgggaat tggaatcaaa     8520

agagtttgct gtccttctta gaatgttcta aaatgtcaag gcagttgctt gtgtttaact     8580

gtgaacaaat aaaaatttat tgttttgcac tacaaaaaaa aaa                       8623


<210>  69
<211>  1163
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tuberous sclerosis 1 (TSC1), transcript variant 3, 
       mRNA
       NCBI Reference Sequence: NM_001162426.1

<400>  69

Met Ala Gln Gln Ala Asn Val Gly Glu Leu Leu Ala Met Leu Asp Ser 
1               5                   10                  15      


Pro Met Leu Gly Val Arg Asp Asp Val Thr Ala Val Phe Lys Glu Asn 
            20                  25                  30          


Leu Asn Ser Asp Arg Gly Pro Met Leu Val Asn Thr Leu Val Asp Tyr 
        35                  40                  45              


Tyr Leu Glu Thr Ser Ser Gln Pro Ala Leu His Ile Leu Thr Thr Leu 
    50                  55                  60                  


Gln Glu Pro His Asp Lys His Leu Leu Asp Arg Ile Asn Glu Tyr Val 
65                  70                  75                  80  


Gly Lys Ala Ala Thr Arg Leu Ser Ile Leu Ser Leu Leu Gly His Val 
                85                  90                  95      


Ile Arg Leu Gln Pro Ser Trp Lys His Lys Leu Ser Gln Ala Pro Leu 
            100                 105                 110         


Leu Pro Ser Leu Leu Lys Cys Leu Lys Met Asp Thr Asp Val Val Val 
        115                 120                 125             


Leu Thr Thr Gly Val Leu Val Leu Ile Thr Met Leu Pro Met Ile Pro 
    130                 135                 140                 


Gln Ser Gly Lys Gln His Leu Leu Asp Phe Phe Asp Ile Phe Gly Arg 
145                 150                 155                 160 


Leu Ser Ser Trp Cys Leu Lys Lys Pro Gly His Val Ala Glu Val Tyr 
                165                 170                 175     


Leu Val His Leu His Ala Ser Val Tyr Ala Leu Phe His Arg Leu Tyr 
            180                 185                 190         


Gly Met Tyr Pro Cys Asn Phe Val Ser Phe Leu Arg Ser His Tyr Ser 
        195                 200                 205             


Met Lys Glu Asn Leu Glu Thr Phe Glu Glu Val Val Lys Pro Met Met 
    210                 215                 220                 


Glu His Val Arg Ile His Pro Glu Leu Val Thr Gly Ser Lys Asp His 
225                 230                 235                 240 


Glu Leu Asp Pro Arg Arg Trp Lys Arg Leu Glu Thr His Asp Val Val 
                245                 250                 255     


Ile Glu Cys Ala Lys Ile Ser Leu Asp Pro Thr Glu Ala Ser Tyr Glu 
            260                 265                 270         


Asp Gly Tyr Ser Val Ser His Gln Ile Ser Ala Arg Phe Pro His Arg 
        275                 280                 285             


Ser Ala Asp Val Thr Thr Ser Pro Tyr Ala Asp Thr Gln Asn Ser Tyr 
    290                 295                 300                 


Gly Cys Ala Thr Ser Thr Pro Tyr Ser Thr Ser Arg Leu Met Leu Leu 
305                 310                 315                 320 


Asn Met Pro Gly Gln Leu Pro Gln Thr Leu Ser Ser Pro Ser Thr Arg 
                325                 330                 335     


Leu Ile Thr Glu Pro Pro Gln Ala Thr Leu Trp Ser Pro Ser Met Val 
            340                 345                 350         


Cys Gly Met Thr Thr Pro Pro Thr Ser Pro Gly Asn Val Pro Pro Asp 
        355                 360                 365             


Leu Ser His Pro Tyr Ser Lys Val Phe Gly Thr Thr Gly Gly Lys Gly 
    370                 375                 380                 


Thr Pro Leu Gly Thr Pro Ala Thr Ser Pro Pro Pro Ala Pro Leu Cys 
385                 390                 395                 400 


His Ser Asp Asp Tyr Val His Ile Ser Leu Pro Gln Ala Thr Val Thr 
                405                 410                 415     


Pro Pro Arg Lys Glu Glu Arg Met Asp Ser Ala Arg Pro Cys Leu His 
            420                 425                 430         


Arg Gln His His Leu Leu Asn Asp Arg Gly Ser Glu Glu Pro Pro Gly 
        435                 440                 445             


Ser Lys Gly Ser Val Thr Leu Ser Asp Leu Pro Gly Phe Leu Gly Asp 
    450                 455                 460                 


Leu Ala Ser Glu Glu Asp Ser Ile Glu Lys Asp Lys Glu Glu Ala Ala 
465                 470                 475                 480 


Ile Ser Arg Glu Leu Ser Glu Ile Thr Thr Ala Glu Ala Glu Pro Val 
                485                 490                 495     


Val Pro Arg Gly Gly Phe Asp Ser Pro Phe Tyr Arg Asp Ser Leu Pro 
            500                 505                 510         


Gly Ser Gln Arg Lys Thr His Ser Ala Ala Ser Ser Ser Gln Gly Ala 
        515                 520                 525             


Ser Val Asn Pro Glu Pro Leu His Ser Ser Leu Asp Lys Leu Gly Pro 
    530                 535                 540                 


Asp Thr Pro Lys Gln Ala Phe Thr Pro Ile Asp Leu Pro Cys Gly Ser 
545                 550                 555                 560 


Ala Asp Glu Ser Pro Ala Gly Asp Arg Glu Cys Gln Thr Ser Leu Glu 
                565                 570                 575     


Thr Ser Ile Phe Thr Pro Ser Pro Cys Lys Ile Pro Pro Pro Thr Arg 
            580                 585                 590         


Val Gly Phe Gly Ser Gly Gln Pro Pro Pro Tyr Asp His Leu Phe Glu 
        595                 600                 605             


Val Ala Leu Pro Lys Thr Ala His His Phe Val Ile Arg Lys Thr Glu 
    610                 615                 620                 


Glu Leu Leu Lys Lys Ala Lys Gly Asn Thr Glu Glu Asp Gly Val Pro 
625                 630                 635                 640 


Ser Thr Ser Pro Met Glu Val Leu Asp Arg Leu Ile Gln Gln Gly Ala 
                645                 650                 655     


Asp Ala His Ser Lys Glu Leu Asn Lys Leu Pro Leu Pro Ser Lys Ser 
            660                 665                 670         


Val Asp Trp Thr His Phe Gly Gly Ser Pro Pro Ser Asp Glu Ile Arg 
        675                 680                 685             


Thr Leu Arg Asp Gln Leu Leu Leu Leu His Asn Gln Leu Leu Tyr Glu 
    690                 695                 700                 


Arg Phe Lys Arg Gln Gln His Ala Leu Arg Asn Arg Arg Leu Leu Arg 
705                 710                 715                 720 


Lys Val Ile Lys Ala Ala Ala Leu Glu Glu His Asn Ala Ala Met Lys 
                725                 730                 735     


Asp Gln Leu Lys Leu Gln Glu Lys Asp Ile Gln Met Trp Lys Val Ser 
            740                 745                 750         


Leu Gln Lys Glu Gln Ala Arg Tyr Asn Gln Leu Gln Glu Gln Arg Asp 
        755                 760                 765             


Thr Met Val Thr Lys Leu His Ser Gln Ile Arg Gln Leu Gln His Asp 
    770                 775                 780                 


Arg Glu Glu Phe Tyr Asn Gln Ser Gln Glu Leu Gln Thr Lys Leu Glu 
785                 790                 795                 800 


Asp Cys Arg Asn Met Ile Ala Glu Leu Arg Ile Glu Leu Lys Lys Ala 
                805                 810                 815     


Asn Asn Lys Val Cys His Thr Glu Leu Leu Leu Ser Gln Val Ser Gln 
            820                 825                 830         


Lys Leu Ser Asn Ser Glu Ser Val Gln Gln Gln Met Glu Phe Leu Asn 
        835                 840                 845             


Arg Gln Leu Leu Val Leu Gly Glu Val Asn Glu Leu Tyr Leu Glu Gln 
    850                 855                 860                 


Leu Gln Asn Lys His Ser Asp Thr Thr Lys Glu Val Glu Met Met Lys 
865                 870                 875                 880 


Ala Ala Tyr Arg Lys Glu Leu Glu Lys Asn Arg Ser His Val Leu Gln 
                885                 890                 895     


Gln Thr Gln Arg Leu Asp Thr Ser Gln Lys Arg Ile Leu Glu Leu Glu 
            900                 905                 910         


Ser His Leu Ala Lys Lys Asp His Leu Leu Leu Glu Gln Lys Lys Tyr 
        915                 920                 925             


Leu Glu Asp Val Lys Leu Gln Ala Arg Gly Gln Leu Gln Ala Ala Glu 
    930                 935                 940                 


Ser Arg Tyr Glu Ala Gln Lys Arg Ile Thr Gln Val Phe Glu Leu Glu 
945                 950                 955                 960 


Ile Leu Asp Leu Tyr Gly Arg Leu Glu Lys Asp Gly Leu Leu Lys Lys 
                965                 970                 975     


Leu Glu Glu Glu Lys Ala Glu Ala Ala Glu Ala Ala Glu Glu Arg Leu 
            980                 985                 990         


Asp Cys Cys Asn Asp Gly Cys Ser  Asp Ser Met Val Gly  His Asn Glu 
        995                 1000                 1005             


Glu Ala  Ser Gly His Asn Gly  Glu Thr Lys Thr Pro  Arg Pro Ser 
    1010                 1015                 1020             


Ser Ala  Arg Gly Ser Ser Gly  Ser Arg Gly Gly Gly  Gly Ser Ser 
    1025                 1030                 1035             


Ser Ser  Ser Ser Glu Leu Ser  Thr Pro Glu Lys Pro  Pro His Gln 
    1040                 1045                 1050             


Arg Ala  Gly Pro Phe Ser Ser  Arg Trp Glu Thr Thr  Met Gly Glu 
    1055                 1060                 1065             


Ala Ser  Ala Ser Ile Pro Thr  Thr Val Gly Ser Leu  Pro Ser Ser 
    1070                 1075                 1080             


Lys Ser  Phe Leu Gly Met Lys  Ala Arg Glu Leu Phe  Arg Asn Lys 
    1085                 1090                 1095             


Ser Glu  Ser Gln Cys Asp Glu  Asp Gly Met Thr Ser  Ser Leu Ser 
    1100                 1105                 1110             


Glu Ser  Leu Lys Thr Glu Leu  Gly Lys Asp Leu Gly  Val Glu Ala 
    1115                 1120                 1125             


Lys Ile  Pro Leu Asn Leu Asp  Gly Pro His Pro Ser  Pro Pro Thr 
    1130                 1135                 1140             


Pro Asp  Ser Val Gly Gln Leu  His Ile Met Asp Tyr  Asn Glu Thr 
    1145                 1150                 1155             


His His  Glu His Ser 
    1160             


<210>  70
<211>  8473
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tuberous sclerosis 1 (TSC1), transcript variant 4, 
       mRNA

<400>  70
acgacggggg aggtgctgta cgtccaagat ggcggcgccc tgtaggctgg agggactgtg       60

aggtaaacag ctgaggggga ggagacggtg gtgaccatga aagacaccag gttgacagca      120

ctggaaactg aagtaccagt tgtcgctaga acagtttggt agtggcccca atgaagaacc      180

ttcagaacct gtagcacacg tcctggagcc agcacagcgc cttcgagcga gagaatggcc      240

caacaagcaa atgtcgggga gcttcttgcc atgctggact cccccatgct gggtgtgcgg      300

gacgacgtga cagctgtctt taaagagaac ctcaattctg accgtggccc tatgcttgta      360

aacaccttgg tggattatta cctggaaacc agctctcagc cggcattgca catcctgacc      420

accttgcaag agccacatga caagatggac actgacgtcg ttgtcctcac aacaggcgtc      480

ttggtgttga taaccatgct accaatgatt ccacagtctg ggaaacagca tcttcttgat      540

ttctttgaca tttttggccg tctgtcatca tggtgcctga agaaaccagg ccacgtggcg      600

gaagtctatc tcgtccatct ccatgccagt gtgtacgcac tctttcatcg cctttatgga      660

atgtaccctt gcaacttcgt ctcctttttg cgttctcatt acagtatgaa agaaaacctg      720

gagacttttg aagaagtggt caagccaatg atggagcatg tgcgaattca tccggaatta      780

gtgactggat ccaaggacca tgaactggac cctcgaaggt ggaagagatt agaaactcat      840

gatgttgtga tcgagtgtgc caaaatctct ctggatccca cagaagcctc atatgaagat      900

ggctattctg tgtctcacca aatctcagcc cgctttcctc atcgttcagc cgatgtcacc      960

accagccctt atgctgacac acagaatagc tatgggtgtg ctacttctac cccttactcc     1020

acgtctcggc tgatgttgtt aaatatgcca gggcagctac ctcagactct gagttcccca     1080

tcgacacggc tgataactga accaccacaa gctactcttt ggagcccatc tatggtttgt     1140

ggtatgacca ctcctccaac ttctcctgga aatgtcccac ctgatctgtc acacccttac     1200

agtaaagtct ttggtacaac tgcaggtgga aaaggaactc ctctgggaac cccagcaacc     1260

tctcctcctc cagccccact ctgtcattcg gatgactacg tgcacatttc actcccccag     1320

gccacagtca caccccccag gaaggaagag agaatggatt ctgcaagacc atgtctacac     1380

agacaacacc atcttctgaa tgacagagga tcagaagagc cacctggcag caaaggttct     1440

gtcactctaa gtgatcttcc agggttttta ggtgatctgg cctctgaaga agatagtatt     1500

gaaaaagata aagaagaagc tgcaatatct agagaacttt ctgagatcac cacagcagag     1560

gcagagcctg tggttcctcg aggaggcttt gactctccct tttaccgaga cagtctccca     1620

ggttctcagc ggaagaccca ctcggcagcc tccagttctc agggcgccag cgtgaaccct     1680

gagcctttac actcctccct ggacaagctt gggcctgaca caccaaagca agcctttact     1740

cccatagacc tgccctgcgg cagtgctgat gaaagccctg cgggagacag ggaatgccag     1800

acttctttgg agaccagtat cttcactccc agtccttgta aaattccacc tccgacgaga     1860

gtgggctttg gaagcgggca gcctcccccg tatgatcatc tttttgaggt ggcattgcca     1920

aagacagccc atcattttgt catcaggaag actgaggagc tgttaaagaa agcaaaagga     1980

aacacagagg aagatggtgt gccctctacc tccccaatgg aagtgctgga cagactgata     2040

cagcagggag cagacgcgca cagcaaggag ctgaacaagt tgcctttacc cagcaagtct     2100

gtcgactgga cccactttgg aggctctcct ccttcagatg agatccgcac cctccgagac     2160

cagttgcttt tactgcacaa ccagttactc tatgagcgtt ttaagaggca gcagcatgcc     2220

ctccggaaca ggcggctcct ccgcaaggtg atcaaagcag cagctctgga ggaacataat     2280

gctgccatga aagatcagtt gaagttacaa gagaaggaca tccagatgtg gaaggttagt     2340

ctgcagaaag aacaagctag atacaatcag ctccaggagc agcgtgacac tatggtaacc     2400

aagctccaca gccagatcag acagctgcag catgaccgag aggaattcta caaccagagc     2460

caggaattac agacgaagct ggaggactgc aggaacatga ttgcggagct gcggatagaa     2520

ctgaagaagg ccaacaacaa ggtgtgtcac actgagctgc tgctcagtca ggtttcccaa     2580

aagctctcaa acagtgagtc ggtccagcag cagatggagt tcttgaacag gcagctgttg     2640

gttcttgggg aggtcaacga gctctatttg gaacaactgc agaacaagca ctcagatacc     2700

acaaaggaag tagaaatgat gaaagccgcc tatcggaaag agctagaaaa aaacagaagc     2760

catgttctcc agcagactca gaggcttgat acctcccaaa aacggatttt ggaactggaa     2820

tctcacctgg ccaagaaaga ccaccttctt ttggaacaga agaaatatct agaggatgtc     2880

aaactccagg caagaggaca gctgcaggcc gcagagagca ggtatgaggc tcagaaaagg     2940

ataacccagg tgtttgaatt ggagatctta gatttatatg gcaggttgga gaaagatggc     3000

ctcctgaaaa aacttgaaga agaaaaagca gaagcagctg aagcagcaga agaaaggctt     3060

gactgttgta atgacgggtg ctcagattcc atggtagggc acaatgaaga ggcatctggc     3120

cacaacggtg agaccaagac ccccaggccc agcagcgccc ggggcagtag tggaagcaga     3180

ggtggtggag gcagcagcag cagcagcagc gagctttcta ccccagagaa acccccacac     3240

cagagggcag gcccattcag cagtcggtgg gagacgacta tgggagaagc gtctgccagc     3300

atccccacca ctgtgggctc acttcccagt tcaaaaagct tcctgggtat gaaggctcga     3360

gagttatttc gtaataagag cgagagccag tgtgatgagg acggcatgac cagtagcctt     3420

tctgagagcc taaagacaga actgggcaaa gacttgggtg tggaagccaa gattcccctg     3480

aacctagatg gccctcaccc gtctcccccg accccggaca gtgttggaca gctacatatc     3540

atggactaca atgagactca tcatgaacac agctaaggaa tgatggtcaa tcagtgttaa     3600

cttgcatatt gttggcacag aacaggaggt gtgaatgcac gtttcaaagc tttcctgttt     3660

ccagggtctg agtgcaagtt catgtgtgga aatgggacgg aggtcctttg gacagctgac     3720

tgaatgcaga acggtttttg gatctggcat tgaaatgcct cttgaccttc ccctccaccc     3780

gccctaaccc cctctcattt acctcgcagt gtgttctaat ccaagggcca gttggtgttc     3840

ctcagtagct ttactttctt cctttccccc ccaaatggtt gcgtcctttg aacctgtgca     3900

atatgaggcc aaatttaatc tttgagtcta acacaccact ttctgctttc ccgaagttca     3960

gataactggg ttggctctca attagaccag gtagtttgtt gcattgcagg taagtctggt     4020

tttgtccctt ccaggaggac atagcctgca aagctggttg tctttacatg aaagcgttta     4080

catgagactt tccgactgct tttttgattc tgaagttcag catctaaagc agcaggtcta     4140

gaagaacaac ggtttattca tacttgcatt cttttggcag ttctgataag cttcctagaa     4200

agttctgtgt aaacagaagc ctgtttcaga aatctggagc tggcactgtg gagaccacac     4260

accctttggg aaagctcttg tctcttcttc ccccactacc tcttatttat ttggtgtttg     4320

cttgaatgct ggtactattg tgaccacagg ctggtgtgta ggtggtaaaa cctgttctcc     4380

ataggaggga aggagcagtc actgggagag gttacccgag aagcacttga gcatgaggaa     4440

ctgcaccttt aggccatctc agcttgctgg gccttttgtt aaacccttct gtctactggc     4500

ctccctttgt gtgcatacgc ctcttgttca tgtcagctta tatgtgacac tgcagcagaa     4560

aggctctgaa ggtccaaaga gtttctgcaa agtgtatgtg accatcattt cccaggccat     4620

tagggttgcc tcactgtagc aggttctagg ctaccagaag aggggcagct ttttcatacc     4680

aattccaact ttcaggggct gactctccag ggagctgatg tcatcacact ctccatgtta     4740

gtaatggcag agcagtctaa acagagtccg ggagaatgct ggcaaaggct ggctgtgtat     4800

acccactagg ctgccccacg tgctcccgag agatgacact agtcagaaaa ttggcagtgg     4860

cagagaatcc aaactcaaca agtgctcctg aaagaaacgc tagaagccta agaactgtgg     4920

tctggtgttc cagctgaggc agggggattt ggtaggaagg agccagtgaa cttggctttc     4980

ctgtttctat ctttcattaa aaagaataga aggattcagt cataaagagg taaaaaactg     5040

tcacggtacg aaatcttagt gcccacggag gcctcgagca gagagaatga aagtcttttt     5100

tttttttttt tttttttagc atggcaataa atattctagc atccctaact aaaggggact     5160

agacagttag agactctgtc accctagcta taccagcaga aaacctgttc aggcaggctt     5220

tctgggtgtg actgattccc agcctgtggc agggcgtggt cccaactact cagcctagca     5280

caggctggca gttggtactg aattgtcaga tgtggagtat tagtgacacc acacatttaa     5340

ttcagctttg tccaaaggaa agcttaaaac ccaatacagt ctagtttcct ggttccgttt     5400

tagaaaagga aaacgtgaac aaacttagaa agggaaggaa atcccatcag tgaatcctga     5460

aactggtttt aagtgctttc cttctcctca tgcccaagag atctgtgcca tagaacaaga     5520

taccaggcac ttaaagcctt ttcctgaatt ggaaaggaaa agaggcccaa gtgcaaaaga     5580

aaaaacattt tagaaacgga cagcttataa aaataaaggg aagaaaggag gcagcatgga     5640

gagaggcctg tgctagaagc tccatggacg tgtctgcaca gggtcctcag ctcatccatg     5700

cggcctgggt gtccttttac tcagctttat aacaaatgtg gctccaagct caggtgcctt     5760

tgagttctag gaggctgtgg gttttattca actacggttg ggagaatgag acctggagtc     5820

atgttgaagg tgcccaacct aaaaatgtag gctttcatgt tgcaaagaac tccagagtca     5880

gtagttaggt ttggtttggt tttggacatg ataaacctgc caagagtcaa caggtcactt     5940

gatcatgctg cagtgggtag ttctaaggat ggaaaggtga cagtattact ctcgagaggc     6000

aattcagtcc tgggcaaagg tattagtaca ataagcgtta agggcagagt ctaccttgaa     6060

accaattaag cagcttggta ttcataaata ttgggattgg atggcctcca tccagaaatc     6120

actatgggtg agcatacctg tctcagctgt ttggccaatg tgcataacct actcggatcc     6180

ccacctgaca ctaaccagag tcagcacagg ccccgaggag cccgaagtct gctgctgtgc     6240

agcatggaat tcctttaaaa aggtgcacta cagttttagc ggggaggggg ataggaagac     6300

gcagagcaaa tgagctccgg agtccctgca ggtgaataaa cacacagatc tgcatctgat     6360

agaactttga tggattttca aaaagccgtt gacaaggctc tgctatacag tctataaaaa     6420

ttgttattat gggattggaa gaaacacgtg gtcatgaata gaaaaaaaac aaacccaaag     6480

gtaggaaggt caaggtcatt tcttagatgg agaagttgtg aaagatgtcc ttggagatga     6540

gttttaggac cagcattact aaggcaggtg ggcagacagt gacctctcta ggtgtgtcca     6600

cagagttttt caggagagaa aactgcctga cctttgggac taagctgcgg aatcttctta     6660

ctaagcttga agagtggaga ggcgagaggt gagctacttt gtgagccaaa gcttatgtga     6720

catggttggg gaaacagtcc aaactgttct gagaaggtga actgttacga cccaggacaa     6780

ttagaaaaat tcacccacca tgccgcacat tactgggtaa aagcagggca gcagggaaca     6840

aaactccaga ctcttgggcc gtccccattt gcaacagcac acatagtttc tggtatattt     6900

gttgggaaag ataaaactct agcagttgtt gaggggagga tgtataaaat ggtcatgggg     6960

atgaaaggat ctctgagacc acagaggctc agactcactg ttaagaatag aaaactgggt     7020

atgcgtttca tgtagccagc agaactgaag tgtgctgtga caagccaatg tgaatttcta     7080

ccaaatagta gagcatacca cttgaagaag gaaagaaccg aagagcaaac aaaagttctg     7140

cgtaatgaga ctcacctttt ctcgctgaaa gcactaagag gtgggaggag gcctgcacag     7200

gctggaggag ggtttgggca gagcgaagac ccggccagga ccttggtgag atggggtgcc     7260

gcccacctcc tgcggatact cttggagagt tgttccccca gggggctctg ccccacctgg     7320

agaaggaagc tgcctggtgt ggagtgactc aaatcagtat acctatctgc tgcaccttca     7380

ctctccaggg tacatgcttt aaaaccgacc cgcaacaagt attggaaaaa tgtatccagt     7440

ctgaagatgt ttgtgtatct gtttacatcc agagttctgt gacacatgcc ccccagattg     7500

ctgcaaagat cccaaggcat tgattgcact tgattaagct tttgtctgta ggtgaaagaa     7560

caagtttagg tcgaggactg gcccctaggc tgctgctgtg acccttgtcc catgtggctt     7620

gtttgcctgt ccgggactct tcgatgtgcc caggggagcg tgttcctgtc tcttccatgc     7680

cgtcctgcag tccttatctg ctcgcctgag ggaagagtag ctgtagctac aagggaagcc     7740

tgcctggaag agccgagcac ctgtgcccat ggcttctggt catgaaacga gttaatgatg     7800

gcagaggagc ttcctcccca cttcgcagcg ccacattatc catcctctga gataagtagg     7860

ctggtttaac cattggaatg gacctttcag tggaaaccct gagagtctga gaacccccag     7920

accaaccctt ccctcccttt ccccacctct tacagtgttt ggacaggagg gtatggtgct     7980

gctctgtgta gcaagtactt tggcttatga aagaggcagc cacgcatttt gcactaggaa     8040

gaatcagtaa tcacttttca gaagacttct atggaccaca aatatattac ggaggaacag     8100

attttgctaa gacataatct agttttataa ctcaatcatg aatgaaccat gtgtggcaaa     8160

cttgcagttt aaaggggtcc catcagtgaa agaaactgat tttttttaac ggactgcttt     8220

tagttaaatt gaagaaagtc agctcttgtc aaaaggtcta aactttcccg cctcaatcct     8280

aaaagcatgt caacaatcca catcagatgc cataaatatg aactgcagga taaaatggta     8340

caatcttagt gaatgggaat tggaatcaaa agagtttgct gtccttctta gaatgttcta     8400

aaatgtcaag gcagttgctt gtgtttaact gtgaacaaat aaaaatttat tgttttgcac     8460

tacaaaaaaa aaa                                                        8473


<210>  71
<211>  1113
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tuberous sclerosis 1 (TSC1), transcript variant 4, 
       polypeptide

<400>  71

Met Ala Gln Gln Ala Asn Val Gly Glu Leu Leu Ala Met Leu Asp Ser 
1               5                   10                  15      


Pro Met Leu Gly Val Arg Asp Asp Val Thr Ala Val Phe Lys Glu Asn 
            20                  25                  30          


Leu Asn Ser Asp Arg Gly Pro Met Leu Val Asn Thr Leu Val Asp Tyr 
        35                  40                  45              


Tyr Leu Glu Thr Ser Ser Gln Pro Ala Leu His Ile Leu Thr Thr Leu 
    50                  55                  60                  


Gln Glu Pro His Asp Lys Met Asp Thr Asp Val Val Val Leu Thr Thr 
65                  70                  75                  80  


Gly Val Leu Val Leu Ile Thr Met Leu Pro Met Ile Pro Gln Ser Gly 
                85                  90                  95      


Lys Gln His Leu Leu Asp Phe Phe Asp Ile Phe Gly Arg Leu Ser Ser 
            100                 105                 110         


Trp Cys Leu Lys Lys Pro Gly His Val Ala Glu Val Tyr Leu Val His 
        115                 120                 125             


Leu His Ala Ser Val Tyr Ala Leu Phe His Arg Leu Tyr Gly Met Tyr 
    130                 135                 140                 


Pro Cys Asn Phe Val Ser Phe Leu Arg Ser His Tyr Ser Met Lys Glu 
145                 150                 155                 160 


Asn Leu Glu Thr Phe Glu Glu Val Val Lys Pro Met Met Glu His Val 
                165                 170                 175     


Arg Ile His Pro Glu Leu Val Thr Gly Ser Lys Asp His Glu Leu Asp 
            180                 185                 190         


Pro Arg Arg Trp Lys Arg Leu Glu Thr His Asp Val Val Ile Glu Cys 
        195                 200                 205             


Ala Lys Ile Ser Leu Asp Pro Thr Glu Ala Ser Tyr Glu Asp Gly Tyr 
    210                 215                 220                 


Ser Val Ser His Gln Ile Ser Ala Arg Phe Pro His Arg Ser Ala Asp 
225                 230                 235                 240 


Val Thr Thr Ser Pro Tyr Ala Asp Thr Gln Asn Ser Tyr Gly Cys Ala 
                245                 250                 255     


Thr Ser Thr Pro Tyr Ser Thr Ser Arg Leu Met Leu Leu Asn Met Pro 
            260                 265                 270         


Gly Gln Leu Pro Gln Thr Leu Ser Ser Pro Ser Thr Arg Leu Ile Thr 
        275                 280                 285             


Glu Pro Pro Gln Ala Thr Leu Trp Ser Pro Ser Met Val Cys Gly Met 
    290                 295                 300                 


Thr Thr Pro Pro Thr Ser Pro Gly Asn Val Pro Pro Asp Leu Ser His 
305                 310                 315                 320 


Pro Tyr Ser Lys Val Phe Gly Thr Thr Ala Gly Gly Lys Gly Thr Pro 
                325                 330                 335     


Leu Gly Thr Pro Ala Thr Ser Pro Pro Pro Ala Pro Leu Cys His Ser 
            340                 345                 350         


Asp Asp Tyr Val His Ile Ser Leu Pro Gln Ala Thr Val Thr Pro Pro 
        355                 360                 365             


Arg Lys Glu Glu Arg Met Asp Ser Ala Arg Pro Cys Leu His Arg Gln 
    370                 375                 380                 


His His Leu Leu Asn Asp Arg Gly Ser Glu Glu Pro Pro Gly Ser Lys 
385                 390                 395                 400 


Gly Ser Val Thr Leu Ser Asp Leu Pro Gly Phe Leu Gly Asp Leu Ala 
                405                 410                 415     


Ser Glu Glu Asp Ser Ile Glu Lys Asp Lys Glu Glu Ala Ala Ile Ser 
            420                 425                 430         


Arg Glu Leu Ser Glu Ile Thr Thr Ala Glu Ala Glu Pro Val Val Pro 
        435                 440                 445             


Arg Gly Gly Phe Asp Ser Pro Phe Tyr Arg Asp Ser Leu Pro Gly Ser 
    450                 455                 460                 


Gln Arg Lys Thr His Ser Ala Ala Ser Ser Ser Gln Gly Ala Ser Val 
465                 470                 475                 480 


Asn Pro Glu Pro Leu His Ser Ser Leu Asp Lys Leu Gly Pro Asp Thr 
                485                 490                 495     


Pro Lys Gln Ala Phe Thr Pro Ile Asp Leu Pro Cys Gly Ser Ala Asp 
            500                 505                 510         


Glu Ser Pro Ala Gly Asp Arg Glu Cys Gln Thr Ser Leu Glu Thr Ser 
        515                 520                 525             


Ile Phe Thr Pro Ser Pro Cys Lys Ile Pro Pro Pro Thr Arg Val Gly 
    530                 535                 540                 


Phe Gly Ser Gly Gln Pro Pro Pro Tyr Asp His Leu Phe Glu Val Ala 
545                 550                 555                 560 


Leu Pro Lys Thr Ala His His Phe Val Ile Arg Lys Thr Glu Glu Leu 
                565                 570                 575     


Leu Lys Lys Ala Lys Gly Asn Thr Glu Glu Asp Gly Val Pro Ser Thr 
            580                 585                 590         


Ser Pro Met Glu Val Leu Asp Arg Leu Ile Gln Gln Gly Ala Asp Ala 
        595                 600                 605             


His Ser Lys Glu Leu Asn Lys Leu Pro Leu Pro Ser Lys Ser Val Asp 
    610                 615                 620                 


Trp Thr His Phe Gly Gly Ser Pro Pro Ser Asp Glu Ile Arg Thr Leu 
625                 630                 635                 640 


Arg Asp Gln Leu Leu Leu Leu His Asn Gln Leu Leu Tyr Glu Arg Phe 
                645                 650                 655     


Lys Arg Gln Gln His Ala Leu Arg Asn Arg Arg Leu Leu Arg Lys Val 
            660                 665                 670         


Ile Lys Ala Ala Ala Leu Glu Glu His Asn Ala Ala Met Lys Asp Gln 
        675                 680                 685             


Leu Lys Leu Gln Glu Lys Asp Ile Gln Met Trp Lys Val Ser Leu Gln 
    690                 695                 700                 


Lys Glu Gln Ala Arg Tyr Asn Gln Leu Gln Glu Gln Arg Asp Thr Met 
705                 710                 715                 720 


Val Thr Lys Leu His Ser Gln Ile Arg Gln Leu Gln His Asp Arg Glu 
                725                 730                 735     


Glu Phe Tyr Asn Gln Ser Gln Glu Leu Gln Thr Lys Leu Glu Asp Cys 
            740                 745                 750         


Arg Asn Met Ile Ala Glu Leu Arg Ile Glu Leu Lys Lys Ala Asn Asn 
        755                 760                 765             


Lys Val Cys His Thr Glu Leu Leu Leu Ser Gln Val Ser Gln Lys Leu 
    770                 775                 780                 


Ser Asn Ser Glu Ser Val Gln Gln Gln Met Glu Phe Leu Asn Arg Gln 
785                 790                 795                 800 


Leu Leu Val Leu Gly Glu Val Asn Glu Leu Tyr Leu Glu Gln Leu Gln 
                805                 810                 815     


Asn Lys His Ser Asp Thr Thr Lys Glu Val Glu Met Met Lys Ala Ala 
            820                 825                 830         


Tyr Arg Lys Glu Leu Glu Lys Asn Arg Ser His Val Leu Gln Gln Thr 
        835                 840                 845             


Gln Arg Leu Asp Thr Ser Gln Lys Arg Ile Leu Glu Leu Glu Ser His 
    850                 855                 860                 


Leu Ala Lys Lys Asp His Leu Leu Leu Glu Gln Lys Lys Tyr Leu Glu 
865                 870                 875                 880 


Asp Val Lys Leu Gln Ala Arg Gly Gln Leu Gln Ala Ala Glu Ser Arg 
                885                 890                 895     


Tyr Glu Ala Gln Lys Arg Ile Thr Gln Val Phe Glu Leu Glu Ile Leu 
            900                 905                 910         


Asp Leu Tyr Gly Arg Leu Glu Lys Asp Gly Leu Leu Lys Lys Leu Glu 
        915                 920                 925             


Glu Glu Lys Ala Glu Ala Ala Glu Ala Ala Glu Glu Arg Leu Asp Cys 
    930                 935                 940                 


Cys Asn Asp Gly Cys Ser Asp Ser Met Val Gly His Asn Glu Glu Ala 
945                 950                 955                 960 


Ser Gly His Asn Gly Glu Thr Lys Thr Pro Arg Pro Ser Ser Ala Arg 
                965                 970                 975     


Gly Ser Ser Gly Ser Arg Gly Gly Gly Gly Ser Ser Ser Ser Ser Ser 
            980                 985                 990         


Glu Leu Ser Thr Pro Glu Lys Pro  Pro His Gln Arg Ala  Gly Pro Phe 
        995                 1000                 1005             


Ser Ser  Arg Trp Glu Thr Thr  Met Gly Glu Ala Ser  Ala Ser Ile 
    1010                 1015                 1020             


Pro Thr  Thr Val Gly Ser Leu  Pro Ser Ser Lys Ser  Phe Leu Gly 
    1025                 1030                 1035             


Met Lys  Ala Arg Glu Leu Phe  Arg Asn Lys Ser Glu  Ser Gln Cys 
    1040                 1045                 1050             


Asp Glu  Asp Gly Met Thr Ser  Ser Leu Ser Glu Ser  Leu Lys Thr 
    1055                 1060                 1065             


Glu Leu  Gly Lys Asp Leu Gly  Val Glu Ala Lys Ile  Pro Leu Asn 
    1070                 1075                 1080             


Leu Asp  Gly Pro His Pro Ser  Pro Pro Thr Pro Asp  Ser Val Gly 
    1085                 1090                 1095             


Gln Leu  His Ile Met Asp Tyr  Asn Glu Thr His His  Glu His Ser 
    1100                 1105                 1110             


<210>  72
<211>  5675
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human TSC2 mRNA transcript variant 1 GenBank Accession No.: 
       NM_000548.3  GI:116256351

<400>  72
ccggcggcgt cccggggcca ggggggtgcg cctttctccg cgtcggggcg gcccggagcg       60

cggtggcgcg gcgcgggagg ggttttctgg tgcgtcctgg tccaccatgg ccaaaccaac      120

aagcaaagat tcaggcttga aggagaagtt taagattctg ttgggactgg gaacaccgag      180

gccaaatccc aggtctgcag agggtaaaca gacggagttt atcatcaccg cggaaatact      240

gagagaactg agcatggaat gtggcctcaa caatcgcatc cggatgatag ggcagatttg      300

tgaagtcgca aaaaccaaga aatttgaaga gcacgcagtg gaagcactct ggaaggcggt      360

cgcggatctg ttgcagccgg agcggccgct ggaggcccgg cacgcggtgc tggctctgct      420

gaaggccatc gtgcaggggc agggcgagcg tttgggggtc ctcagagccc tcttctttaa      480

ggtcatcaag gattaccctt ccaacgaaga ccttcacgaa aggctggagg ttttcaaggc      540

cctcacagac aatgggagac acatcaccta cttggaggaa gagctggctg actttgtcct      600

gcagtggatg gatgttggct tgtcctcgga attccttctg gtgctggtga acttggtcaa      660

attcaatagc tgttacctcg acgagtacat cgcaaggatg gttcagatga tctgtctgct      720

gtgcgtccgg accgcgtcct ctgtggacat agaggtctcc ctgcaggtgc tggacgccgt      780

ggtctgctac aactgcctgc cggctgagag cctcccgctg ttcatcgtta ccctctgtcg      840

caccatcaac gtcaaggagc tctgcgagcc ttgctggaag ctgatgcgga acctccttgg      900

cacccacctg ggccacagcg ccatctacaa catgtgccac ctcatggagg acagagccta      960

catggaggac gcgcccctgc tgagaggagc cgtgtttttt gtgggcatgg ctctctgggg     1020

agcccaccgg ctctattctc tcaggaactc gccgacatct gtgttgccat cattttacca     1080

ggccatggca tgtccgaacg aggtggtgtc ctatgagatc gtcctgtcca tcaccaggct     1140

catcaagaag tataggaagg agctccaggt ggtggcgtgg gacattctgc tgaacatcat     1200

cgaacggctc cttcagcagc tccagacctt ggacagcccg gagctcagga ccatcgtcca     1260

tgacctgttg accacggtgg aggagctgtg tgaccagaac gagttccacg ggtctcagga     1320

gagatacttt gaactggtgg agagatgtgc ggaccagagg cctgagtcct ccctcctgaa     1380

cctgatctcc tatagagcgc agtccatcca cccggccaag gacggctgga ttcagaacct     1440

gcaggcgctg atggagagat tcttcaggag cgagtcccga ggcgccgtgc gcatcaaggt     1500

gctggacgtg ctgtcctttg tgctgctcat caacaggcag ttctatgagg aggagctgat     1560

taactcagtg gtcatctcgc agctctccca catccccgag gataaagacc accaggtccg     1620

aaagctggcc acccagttgc tggtggacct ggcagagggc tgccacacac accacttcaa     1680

cagcctgctg gacatcatcg agaaggtgat ggcccgctcc ctctccccac ccccggagct     1740

ggaagaaagg gatgtggccg catactcggc ctccttggag gatgtgaaga cagccgtcct     1800

ggggcttctg gtcatccttc agaccaagct gtacaccctg cctgcaagcc acgccacgcg     1860

tgtgtatgag atgctggtca gccacattca gctccactac aagcacagct acaccctgcc     1920

aatcgcgagc agcatccggc tgcaggcctt tgacttcctg ttgctgctgc gggccgactc     1980

actgcaccgc ctgggcctgc ccaacaagga tggagtcgtg cggttcagcc cctactgcgt     2040

ctgcgactac atggagccag agagaggctc tgagaagaag accagcggcc ccctttctcc     2100

tcccacaggg cctcctggcc cggcgcctgc aggccccgcc gtgcggctgg ggtccgtgcc     2160

ctactccctg ctcttccgcg tcctgctgca gtgcttgaag caggagtctg actggaaggt     2220

gctgaagctg gttctgggca ggctgcctga gtccctgcgc tataaagtgc tcatctttac     2280

ttccccttgc agtgtggacc agctgtgctc tgctctctgc tccatgcttt caggcccaaa     2340

gacactggag cggctccgag gcgccccaga aggcttctcc agaactgact tgcacctggc     2400

cgtggttcca gtgctgacag cattaatctc ttaccataac tacctggaca aaaccaaaca     2460

gcgcgagatg gtctactgcc tggagcaggg cctcatccac cgctgtgcca gccagtgcgt     2520

cgtggccttg tccatctgca gcgtggagat gcctgacatc atcatcaagg cgctgcctgt     2580

tctggtggtg aagctcacgc acatctcagc cacagccagc atggccgtcc cactgctgga     2640

gttcctgtcc actctggcca ggctgccgca cctctacagg aactttgccg cggagcagta     2700

tgccagtgtg ttcgccatct ccctgccgta caccaacccc tccaagttta atcagtacat     2760

cgtgtgtctg gcccatcacg tcatagccat gtggttcatc aggtgccgcc tgcccttccg     2820

gaaggatttt gtccctttca tcactaaggg cctgcggtcc aatgtcctct tgtcttttga     2880

tgacaccccc gagaaggaca gcttcagggc ccggagtact agtctcaacg agagacccaa     2940

gagtctgagg atagccagac cccccaaaca aggcttgaat aactctccac ccgtgaaaga     3000

attcaaggag agctctgcag ccgaggcctt ccggtgccgc agcatcagtg tgtctgaaca     3060

tgtggtccgc agcaggatac agacgtccct caccagtgcc agcttggggt ctgcagatga     3120

gaactccgtg gcccaggctg acgatagcct gaaaaacctc cacctggagc tcacggaaac     3180

ctgtctggac atgatggctc gatacgtctt ctccaacttc acggctgtcc cgaagaggtc     3240

tcctgtgggc gagttcctcc tagcgggtgg caggaccaaa acctggctgg ttgggaacaa     3300

gcttgtcact gtgacgacaa gcgtgggaac cgggacccgg tcgttactag gcctggactc     3360

gggggagctg cagtccggcc cggagtcgag ctccagcccc ggggtgcatg tgagacagac     3420

caaggaggcg ccggccaagc tggagtccca ggctgggcag caggtgtccc gtggggcccg     3480

ggatcgggtc cgttccatgt cggggggcca tggtcttcga gttggcgccc tggacgtgcc     3540

ggcctcccag ttcctgggca gtgccacttc tccaggacca cggactgcac cagccgcgaa     3600

acctgagaag gcctcagctg gcacccgggt tcctgtgcag gagaagacga acctggcggc     3660

ctatgtgccc ctgctgaccc agggctgggc ggagatcctg gtccggaggc ccacagggaa     3720

caccagctgg ctgatgagcc tggagaaccc gctcagccct ttctcctcgg acatcaacaa     3780

catgcccctg caggagctgt ctaacgccct catggcggct gagcgcttca aggagcaccg     3840

ggacacagcc ctgtacaagt cactgtcggt gccggcagcc agcacggcca aaccccctcc     3900

tctgcctcgc tccaacacag tggcctcttt ctcctccctg taccagtcca gctgccaagg     3960

acagctgcac aggagcgttt cctgggcaga ctccgccgtg gtcatggagg agggaagtcc     4020

gggcgaggtt cctgtgctgg tggagccccc agggttggag gacgttgagg cagcgctagg     4080

catggacagg cgcacggatg cctacagcag gtcgtcctca gtctccagcc aggaggagaa     4140

gtcgctccac gcggaggagc tggttggcag gggcatcccc atcgagcgag tcgtctcctc     4200

ggagggtggc cggccctctg tggacctctc cttccagccc tcgcagcccc tgagcaagtc     4260

cagctcctct cccgagctgc agactctgca ggacatcctc ggggaccctg gggacaaggc     4320

cgacgtgggc cggctgagcc ctgaggttaa ggcccggtca cagtcaggga ccctggacgg     4380

ggaaagtgct gcctggtcgg cctcgggcga agacagtcgg ggccagcccg agggtccctt     4440

gccttccagc tccccccgct cgcccagtgg cctccggccc cgaggttaca ccatctccga     4500

ctcggcccca tcacgcaggg gcaagagagt agagagggac gccttaaaga gcagagccac     4560

agcctccaat gcagagaaag tgccaggcat caaccccagt ttcgtgttcc tgcagctcta     4620

ccattccccc ttctttggcg acgagtcaaa caagccaatc ctgctgccca atgagtcaca     4680

gtcctttgag cggtcggtgc agctcctcga ccagatccca tcatacgaca cccacaagat     4740

cgccgtcctg tatgttggag aaggccagag caacagcgag ctcgccatcc tgtccaatga     4800

gcatggctcc tacaggtaca cggagttcct gacgggcctg ggccggctca tcgagctgaa     4860

ggactgccag ccggacaagg tgtacctggg aggcctggac gtgtgtggtg aggacggcca     4920

gttcacctac tgctggcacg atgacatcat gcaagccgtc ttccacatcg ccaccctgat     4980

gcccaccaag gacgtggaca agcaccgctg cgacaagaag cgccacctgg gcaacgactt     5040

tgtgtccatt gtctacaatg actccggtga ggacttcaag cttggcacca tcaagggcca     5100

gttcaacttt gtccacgtga tcgtcacccc gctggactac gagtgcaacc tggtgtccct     5160

gcagtgcagg aaagacatgg agggccttgt ggacaccagc gtggccaaga tcgtgtctga     5220

ccgcaacctg cccttcgtgg cccgccagat ggccctgcac gcaaatatgg cctcacaggt     5280

gcatcatagc cgctccaacc ccaccgatat ctacccctcc aagtggattg cccggctccg     5340

ccacatcaag cggctccgcc agcggatctg cgaggaagcc gcctactcca accccagcct     5400

acctctggtg caccctccgt cccatagcaa agcccctgca cagactccag ccgagcccac     5460

acctggctat gaggtgggcc agcggaagcg cctcatctcc tcggtggagg acttcaccga     5520

gtttgtgtga ggccggggcc ctccctcctg cactggcctt ggacggtatt gcctgtcagt     5580

gaaataaata aagtcctgac cccagtgcac agacatagag gcacagattg caaaaaaaaa     5640

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaa                                5675


<210>  73
<211>  1807
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human TSC2 polypeptide encoded by mRNA transcript variant 1 
       GenBank Accession No.: NP_000539.2  GI:116256352

<400>  73

Met Ala Lys Pro Thr Ser Lys Asp Ser Gly Leu Lys Glu Lys Phe Lys 
1               5                   10                  15      


Ile Leu Leu Gly Leu Gly Thr Pro Arg Pro Asn Pro Arg Ser Ala Glu 
            20                  25                  30          


Gly Lys Gln Thr Glu Phe Ile Ile Thr Ala Glu Ile Leu Arg Glu Leu 
        35                  40                  45              


Ser Met Glu Cys Gly Leu Asn Asn Arg Ile Arg Met Ile Gly Gln Ile 
    50                  55                  60                  


Cys Glu Val Ala Lys Thr Lys Lys Phe Glu Glu His Ala Val Glu Ala 
65                  70                  75                  80  


Leu Trp Lys Ala Val Ala Asp Leu Leu Gln Pro Glu Arg Pro Leu Glu 
                85                  90                  95      


Ala Arg His Ala Val Leu Ala Leu Leu Lys Ala Ile Val Gln Gly Gln 
            100                 105                 110         


Gly Glu Arg Leu Gly Val Leu Arg Ala Leu Phe Phe Lys Val Ile Lys 
        115                 120                 125             


Asp Tyr Pro Ser Asn Glu Asp Leu His Glu Arg Leu Glu Val Phe Lys 
    130                 135                 140                 


Ala Leu Thr Asp Asn Gly Arg His Ile Thr Tyr Leu Glu Glu Glu Leu 
145                 150                 155                 160 


Ala Asp Phe Val Leu Gln Trp Met Asp Val Gly Leu Ser Ser Glu Phe 
                165                 170                 175     


Leu Leu Val Leu Val Asn Leu Val Lys Phe Asn Ser Cys Tyr Leu Asp 
            180                 185                 190         


Glu Tyr Ile Ala Arg Met Val Gln Met Ile Cys Leu Leu Cys Val Arg 
        195                 200                 205             


Thr Ala Ser Ser Val Asp Ile Glu Val Ser Leu Gln Val Leu Asp Ala 
    210                 215                 220                 


Val Val Cys Tyr Asn Cys Leu Pro Ala Glu Ser Leu Pro Leu Phe Ile 
225                 230                 235                 240 


Val Thr Leu Cys Arg Thr Ile Asn Val Lys Glu Leu Cys Glu Pro Cys 
                245                 250                 255     


Trp Lys Leu Met Arg Asn Leu Leu Gly Thr His Leu Gly His Ser Ala 
            260                 265                 270         


Ile Tyr Asn Met Cys His Leu Met Glu Asp Arg Ala Tyr Met Glu Asp 
        275                 280                 285             


Ala Pro Leu Leu Arg Gly Ala Val Phe Phe Val Gly Met Ala Leu Trp 
    290                 295                 300                 


Gly Ala His Arg Leu Tyr Ser Leu Arg Asn Ser Pro Thr Ser Val Leu 
305                 310                 315                 320 


Pro Ser Phe Tyr Gln Ala Met Ala Cys Pro Asn Glu Val Val Ser Tyr 
                325                 330                 335     


Glu Ile Val Leu Ser Ile Thr Arg Leu Ile Lys Lys Tyr Arg Lys Glu 
            340                 345                 350         


Leu Gln Val Val Ala Trp Asp Ile Leu Leu Asn Ile Ile Glu Arg Leu 
        355                 360                 365             


Leu Gln Gln Leu Gln Thr Leu Asp Ser Pro Glu Leu Arg Thr Ile Val 
    370                 375                 380                 


His Asp Leu Leu Thr Thr Val Glu Glu Leu Cys Asp Gln Asn Glu Phe 
385                 390                 395                 400 


His Gly Ser Gln Glu Arg Tyr Phe Glu Leu Val Glu Arg Cys Ala Asp 
                405                 410                 415     


Gln Arg Pro Glu Ser Ser Leu Leu Asn Leu Ile Ser Tyr Arg Ala Gln 
            420                 425                 430         


Ser Ile His Pro Ala Lys Asp Gly Trp Ile Gln Asn Leu Gln Ala Leu 
        435                 440                 445             


Met Glu Arg Phe Phe Arg Ser Glu Ser Arg Gly Ala Val Arg Ile Lys 
    450                 455                 460                 


Val Leu Asp Val Leu Ser Phe Val Leu Leu Ile Asn Arg Gln Phe Tyr 
465                 470                 475                 480 


Glu Glu Glu Leu Ile Asn Ser Val Val Ile Ser Gln Leu Ser His Ile 
                485                 490                 495     


Pro Glu Asp Lys Asp His Gln Val Arg Lys Leu Ala Thr Gln Leu Leu 
            500                 505                 510         


Val Asp Leu Ala Glu Gly Cys His Thr His His Phe Asn Ser Leu Leu 
        515                 520                 525             


Asp Ile Ile Glu Lys Val Met Ala Arg Ser Leu Ser Pro Pro Pro Glu 
    530                 535                 540                 


Leu Glu Glu Arg Asp Val Ala Ala Tyr Ser Ala Ser Leu Glu Asp Val 
545                 550                 555                 560 


Lys Thr Ala Val Leu Gly Leu Leu Val Ile Leu Gln Thr Lys Leu Tyr 
                565                 570                 575     


Thr Leu Pro Ala Ser His Ala Thr Arg Val Tyr Glu Met Leu Val Ser 
            580                 585                 590         


His Ile Gln Leu His Tyr Lys His Ser Tyr Thr Leu Pro Ile Ala Ser 
        595                 600                 605             


Ser Ile Arg Leu Gln Ala Phe Asp Phe Leu Leu Leu Leu Arg Ala Asp 
    610                 615                 620                 


Ser Leu His Arg Leu Gly Leu Pro Asn Lys Asp Gly Val Val Arg Phe 
625                 630                 635                 640 


Ser Pro Tyr Cys Val Cys Asp Tyr Met Glu Pro Glu Arg Gly Ser Glu 
                645                 650                 655     


Lys Lys Thr Ser Gly Pro Leu Ser Pro Pro Thr Gly Pro Pro Gly Pro 
            660                 665                 670         


Ala Pro Ala Gly Pro Ala Val Arg Leu Gly Ser Val Pro Tyr Ser Leu 
        675                 680                 685             


Leu Phe Arg Val Leu Leu Gln Cys Leu Lys Gln Glu Ser Asp Trp Lys 
    690                 695                 700                 


Val Leu Lys Leu Val Leu Gly Arg Leu Pro Glu Ser Leu Arg Tyr Lys 
705                 710                 715                 720 


Val Leu Ile Phe Thr Ser Pro Cys Ser Val Asp Gln Leu Cys Ser Ala 
                725                 730                 735     


Leu Cys Ser Met Leu Ser Gly Pro Lys Thr Leu Glu Arg Leu Arg Gly 
            740                 745                 750         


Ala Pro Glu Gly Phe Ser Arg Thr Asp Leu His Leu Ala Val Val Pro 
        755                 760                 765             


Val Leu Thr Ala Leu Ile Ser Tyr His Asn Tyr Leu Asp Lys Thr Lys 
    770                 775                 780                 


Gln Arg Glu Met Val Tyr Cys Leu Glu Gln Gly Leu Ile His Arg Cys 
785                 790                 795                 800 


Ala Ser Gln Cys Val Val Ala Leu Ser Ile Cys Ser Val Glu Met Pro 
                805                 810                 815     


Asp Ile Ile Ile Lys Ala Leu Pro Val Leu Val Val Lys Leu Thr His 
            820                 825                 830         


Ile Ser Ala Thr Ala Ser Met Ala Val Pro Leu Leu Glu Phe Leu Ser 
        835                 840                 845             


Thr Leu Ala Arg Leu Pro His Leu Tyr Arg Asn Phe Ala Ala Glu Gln 
    850                 855                 860                 


Tyr Ala Ser Val Phe Ala Ile Ser Leu Pro Tyr Thr Asn Pro Ser Lys 
865                 870                 875                 880 


Phe Asn Gln Tyr Ile Val Cys Leu Ala His His Val Ile Ala Met Trp 
                885                 890                 895     


Phe Ile Arg Cys Arg Leu Pro Phe Arg Lys Asp Phe Val Pro Phe Ile 
            900                 905                 910         


Thr Lys Gly Leu Arg Ser Asn Val Leu Leu Ser Phe Asp Asp Thr Pro 
        915                 920                 925             


Glu Lys Asp Ser Phe Arg Ala Arg Ser Thr Ser Leu Asn Glu Arg Pro 
    930                 935                 940                 


Lys Ser Leu Arg Ile Ala Arg Pro Pro Lys Gln Gly Leu Asn Asn Ser 
945                 950                 955                 960 


Pro Pro Val Lys Glu Phe Lys Glu Ser Ser Ala Ala Glu Ala Phe Arg 
                965                 970                 975     


Cys Arg Ser Ile Ser Val Ser Glu His Val Val Arg Ser Arg Ile Gln 
            980                 985                 990         


Thr Ser Leu Thr Ser Ala Ser Leu  Gly Ser Ala Asp Glu  Asn Ser Val 
        995                 1000                 1005             


Ala Gln  Ala Asp Asp Ser Leu  Lys Asn Leu His Leu  Glu Leu Thr 
    1010                 1015                 1020             


Glu Thr  Cys Leu Asp Met Met  Ala Arg Tyr Val Phe  Ser Asn Phe 
    1025                 1030                 1035             


Thr Ala  Val Pro Lys Arg Ser  Pro Val Gly Glu Phe  Leu Leu Ala 
    1040                 1045                 1050             


Gly Gly  Arg Thr Lys Thr Trp  Leu Val Gly Asn Lys  Leu Val Thr 
    1055                 1060                 1065             


Val Thr  Thr Ser Val Gly Thr  Gly Thr Arg Ser Leu  Leu Gly Leu 
    1070                 1075                 1080             


Asp Ser  Gly Glu Leu Gln Ser  Gly Pro Glu Ser Ser  Ser Ser Pro 
    1085                 1090                 1095             


Gly Val  His Val Arg Gln Thr  Lys Glu Ala Pro Ala  Lys Leu Glu 
    1100                 1105                 1110             


Ser Gln  Ala Gly Gln Gln Val  Ser Arg Gly Ala Arg  Asp Arg Val 
    1115                 1120                 1125             


Arg Ser  Met Ser Gly Gly His  Gly Leu Arg Val Gly  Ala Leu Asp 
    1130                 1135                 1140             


Val Pro  Ala Ser Gln Phe Leu  Gly Ser Ala Thr Ser  Pro Gly Pro 
    1145                 1150                 1155             


Arg Thr  Ala Pro Ala Ala Lys  Pro Glu Lys Ala Ser  Ala Gly Thr 
    1160                 1165                 1170             


Arg Val  Pro Val Gln Glu Lys  Thr Asn Leu Ala Ala  Tyr Val Pro 
    1175                 1180                 1185             


Leu Leu  Thr Gln Gly Trp Ala  Glu Ile Leu Val Arg  Arg Pro Thr 
    1190                 1195                 1200             


Gly Asn  Thr Ser Trp Leu Met  Ser Leu Glu Asn Pro  Leu Ser Pro 
    1205                 1210                 1215             


Phe Ser  Ser Asp Ile Asn Asn  Met Pro Leu Gln Glu  Leu Ser Asn 
    1220                 1225                 1230             


Ala Leu  Met Ala Ala Glu Arg  Phe Lys Glu His Arg  Asp Thr Ala 
    1235                 1240                 1245             


Leu Tyr  Lys Ser Leu Ser Val  Pro Ala Ala Ser Thr  Ala Lys Pro 
    1250                 1255                 1260             


Pro Pro  Leu Pro Arg Ser Asn  Thr Val Ala Ser Phe  Ser Ser Leu 
    1265                 1270                 1275             


Tyr Gln  Ser Ser Cys Gln Gly  Gln Leu His Arg Ser  Val Ser Trp 
    1280                 1285                 1290             


Ala Asp  Ser Ala Val Val Met  Glu Glu Gly Ser Pro  Gly Glu Val 
    1295                 1300                 1305             


Pro Val  Leu Val Glu Pro Pro  Gly Leu Glu Asp Val  Glu Ala Ala 
    1310                 1315                 1320             


Leu Gly  Met Asp Arg Arg Thr  Asp Ala Tyr Ser Arg  Ser Ser Ser 
    1325                 1330                 1335             


Val Ser  Ser Gln Glu Glu Lys  Ser Leu His Ala Glu  Glu Leu Val 
    1340                 1345                 1350             


Gly Arg  Gly Ile Pro Ile Glu  Arg Val Val Ser Ser  Glu Gly Gly 
    1355                 1360                 1365             


Arg Pro  Ser Val Asp Leu Ser  Phe Gln Pro Ser Gln  Pro Leu Ser 
    1370                 1375                 1380             


Lys Ser  Ser Ser Ser Pro Glu  Leu Gln Thr Leu Gln  Asp Ile Leu 
    1385                 1390                 1395             


Gly Asp  Pro Gly Asp Lys Ala  Asp Val Gly Arg Leu  Ser Pro Glu 
    1400                 1405                 1410             


Val Lys  Ala Arg Ser Gln Ser  Gly Thr Leu Asp Gly  Glu Ser Ala 
    1415                 1420                 1425             


Ala Trp  Ser Ala Ser Gly Glu  Asp Ser Arg Gly Gln  Pro Glu Gly 
    1430                 1435                 1440             


Pro Leu  Pro Ser Ser Ser Pro  Arg Ser Pro Ser Gly  Leu Arg Pro 
    1445                 1450                 1455             


Arg Gly  Tyr Thr Ile Ser Asp  Ser Ala Pro Ser Arg  Arg Gly Lys 
    1460                 1465                 1470             


Arg Val  Glu Arg Asp Ala Leu  Lys Ser Arg Ala Thr  Ala Ser Asn 
    1475                 1480                 1485             


Ala Glu  Lys Val Pro Gly Ile  Asn Pro Ser Phe Val  Phe Leu Gln 
    1490                 1495                 1500             


Leu Tyr  His Ser Pro Phe Phe  Gly Asp Glu Ser Asn  Lys Pro Ile 
    1505                 1510                 1515             


Leu Leu  Pro Asn Glu Ser Gln  Ser Phe Glu Arg Ser  Val Gln Leu 
    1520                 1525                 1530             


Leu Asp  Gln Ile Pro Ser Tyr  Asp Thr His Lys Ile  Ala Val Leu 
    1535                 1540                 1545             


Tyr Val  Gly Glu Gly Gln Ser  Asn Ser Glu Leu Ala  Ile Leu Ser 
    1550                 1555                 1560             


Asn Glu  His Gly Ser Tyr Arg  Tyr Thr Glu Phe Leu  Thr Gly Leu 
    1565                 1570                 1575             


Gly Arg  Leu Ile Glu Leu Lys  Asp Cys Gln Pro Asp  Lys Val Tyr 
    1580                 1585                 1590             


Leu Gly  Gly Leu Asp Val Cys  Gly Glu Asp Gly Gln  Phe Thr Tyr 
    1595                 1600                 1605             


Cys Trp  His Asp Asp Ile Met  Gln Ala Val Phe His  Ile Ala Thr 
    1610                 1615                 1620             


Leu Met  Pro Thr Lys Asp Val  Asp Lys His Arg Cys  Asp Lys Lys 
    1625                 1630                 1635             


Arg His  Leu Gly Asn Asp Phe  Val Ser Ile Val Tyr  Asn Asp Ser 
    1640                 1645                 1650             


Gly Glu  Asp Phe Lys Leu Gly  Thr Ile Lys Gly Gln  Phe Asn Phe 
    1655                 1660                 1665             


Val His  Val Ile Val Thr Pro  Leu Asp Tyr Glu Cys  Asn Leu Val 
    1670                 1675                 1680             


Ser Leu  Gln Cys Arg Lys Asp  Met Glu Gly Leu Val  Asp Thr Ser 
    1685                 1690                 1695             


Val Ala  Lys Ile Val Ser Asp  Arg Asn Leu Pro Phe  Val Ala Arg 
    1700                 1705                 1710             


Gln Met  Ala Leu His Ala Asn  Met Ala Ser Gln Val  His His Ser 
    1715                 1720                 1725             


Arg Ser  Asn Pro Thr Asp Ile  Tyr Pro Ser Lys Trp  Ile Ala Arg 
    1730                 1735                 1740             


Leu Arg  His Ile Lys Arg Leu  Arg Gln Arg Ile Cys  Glu Glu Ala 
    1745                 1750                 1755             


Ala Tyr  Ser Asn Pro Ser Leu  Pro Leu Val His Pro  Pro Ser His 
    1760                 1765                 1770             


Ser Lys  Ala Pro Ala Gln Thr  Pro Ala Glu Pro Thr  Pro Gly Tyr 
    1775                 1780                 1785             


Glu Val  Gly Gln Arg Lys Arg  Leu Ile Ser Ser Val  Glu Asp Phe 
    1790                 1795                 1800             


Thr Glu  Phe Val 
    1805         


<210>  74
<211>  5577
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tuberous sclerosis 2 (TSC2), transcript variant 5, 
       mRNA NCBI Reference Sequence: NM_001114382.1

<400>  74
ccggcggcgt cccggggcca ggggggtgcg cctttctccg cgtcggggcg gcccggagcg       60

cggtggcgcg gcgcgggagg ggttttctgg tgcgtcctgg tccaccatgg ccaaaccaac      120

aagcaaagat tcaggcttga aggagaagtt taagattctg ttgggactgg gaacaccgag      180

gccaaatccc aggtctgcag agggtaaaca gacggagttt atcatcaccg cggaaatact      240

gagagaactg agcatggaat gtggcctcaa caatcgcatc cggatgatag ggcagatttg      300

tgaagtcgca aaaaccaaga aatttgaaga gcacgcagtg gaagcactct ggaaggcggt      360

cgcggatctg ttgcagccgg agcggccgct ggaggcccgg cacgcggtgc tggctctgct      420

gaaggccatc gtgcaggggc agggcgagcg tttgggggtc ctcagagccc tcttctttaa      480

ggtcatcaag gattaccctt ccaacgaaga ccttcacgaa aggctggagg ttttcaaggc      540

cctcacagac aatgggagac acatcaccta cttggaggaa gagctggctg actttgtcct      600

gcagtggatg gatgttggct tgtcctcgga attccttctg gtgctggtga acttggtcaa      660

attcaatagc tgttacctcg acgagtacat cgcaaggatg gttcagatga tctgtctgct      720

gtgcgtccgg accgcgtcct ctgtggacat agaggtctcc ctgcaggtgc tggacgccgt      780

ggtctgctac aactgcctgc cggctgagag cctcccgctg ttcatcgtta ccctctgtcg      840

caccatcaac gtcaaggagc tctgcgagcc ttgctggaag ctgatgcgga acctccttgg      900

cacccacctg ggccacagcg ccatctacaa catgtgccac ctcatggagg acagagccta      960

catggaggac gcgcccctgc tgagaggagc cgtgtttttt gtgggcatgg ctctctgggg     1020

agcccaccgg ctctattctc tcaggaactc gccgacatct gtgttgccat cattttacca     1080

ggccatggca tgtccgaacg aggtggtgtc ctatgagatc gtcctgtcca tcaccaggct     1140

catcaagaag tataggaagg agctccaggt ggtggcgtgg gacattctgc tgaacatcat     1200

cgaacggctc cttcagcagc tccagacctt ggacagcccg gagctcagga ccatcgtcca     1260

tgacctgttg accacggtgg aggagctgtg tgaccagaac gagttccacg ggtctcagga     1320

gagatacttt gaactggtgg agagatgtgc ggaccagagg cctgagtcct ccctcctgaa     1380

cctgatctcc tatagagcgc agtccatcca cccggccaag gacggctgga ttcagaacct     1440

gcaggcgctg atggagagat tcttcaggag cgagtcccga ggcgccgtgc gcatcaaggt     1500

gctggacgtg ctgtcctttg tgctgctcat caacaggcag ttctatgagg aggagctgat     1560

taactcagtg gtcatctcgc agctctccca catccccgag gataaagacc accaggtccg     1620

aaagctggcc acccagttgc tggtggacct ggcagagggc tgccacacac accacttcaa     1680

cagcctgctg gacatcatcg agaaggtgat ggcccgctcc ctctccccac ccccggagct     1740

ggaagaaagg gatgtggccg catactcggc ctccttggag gatgtgaaga cagccgtcct     1800

ggggcttctg gtcatccttc agaccaagct gtacaccctg cctgcaagcc acgccacgcg     1860

tgtgtatgag atgctggtca gccacattca gctccactac aagcacagct acaccctgcc     1920

aatcgcgagc agcatccggc tgcaggcctt tgacttcctg ttgctgctgc gggccgactc     1980

actgcaccgc ctgggcctgc ccaacaagga tggagtcgtg cggttcagcc cctactgcgt     2040

ctgcgactac atggagccag agagaggctc tgagaagaag accagcggcc ccctttctcc     2100

tcccacaggg cctcctggcc cggcgcctgc aggccccgcc gtgcggctgg ggtccgtgcc     2160

ctactccctg ctcttccgcg tcctgctgca gtgcttgaag caggagtctg actggaaggt     2220

gctgaagctg gttctgggca ggctgcctga gtccctgcgc tataaagtgc tcatctttac     2280

ttccccttgc agtgtggacc agctgtgctc tgctctctgc tccatgcttt caggcccaaa     2340

gacactggag cggctccgag gcgccccaga aggcttctcc agaactgact tgcacctggc     2400

cgtggttcca gtgctgacag cattaatctc ttaccataac tacctggaca aaaccaaaca     2460

gcgcgagatg gtctactgcc tggagcaggg cctcatccac cgctgtgcca gccagtgcgt     2520

cgtggccttg tccatctgca gcgtggagat gcctgacatc atcatcaagg cgctgcctgt     2580

tctggtggtg aagctcacgc acatctcagc cacagccagc atggccgtcc cactgctgga     2640

gttcctgtcc actctggcca ggctgccgca cctctacagg aactttgccg cggagcagta     2700

tgccagtgtg ttcgccatct ccctgccgta caccaacccc tccaagttta atcagtacat     2760

cgtgtgtctg gcccatcacg tcatagccat gtggttcatc aggtgccgcc tgcccttccg     2820

gaaggatttt gtccctttca tcactaaggg cctgcggtcc aatgtcctct tgtcttttga     2880

tgacaccccc gagaaggaca gcttcagggc ccggagtact agtctcaacg agagacccaa     2940

gagtctgagg atagccagac cccccaaaca aggcttgaat aactctccac ccgtgaaaga     3000

attcaaggag agctctgcag ccgaggcctt ccggtgccgc agcatcagtg tgtctgaaca     3060

tgtggtccgc agcaggatac agacgtccct caccagtgcc agcttggggt ctgcagatga     3120

gaactccgtg gcccaggctg acgatagcct gaaaaacctc cacctggagc tcacggaaac     3180

ctgtctggac atgatggctc gatacgtctt ctccaacttc acggctgtcc cgaagaggtc     3240

tcctgtgggc gagttcctcc tagcgggtgg caggaccaaa acctggctgg ttgggaacaa     3300

gcttgtcact gtgacgacaa gcgtgggaac cgggacccgg tcgttactag gcctggactc     3360

gggggagctg cagtccggcc cggagtcgag ctccagcccc ggggtgcatg tgagacagac     3420

caaggaggcg ccggccaagc tggagtccca ggctgggcag caggtgtccc gtggggcccg     3480

ggatcgggtc cgttccatgt cggggggcca tggtcttcga gttggcgccc tggacgtgcc     3540

ggcctcccag ttcctgggca gtgccacttc tccaggacca cggactgcac cagccgcgaa     3600

acctgagaag gcctcagctg gcacccgggt tcctgtgcag gagaagacga acctggcggc     3660

ctatgtgccc ctgctgaccc agggctgggc ggagatcctg gtccggaggc ccacagggaa     3720

caccagctgg ctgatgagcc tggagaaccc gctcagccct ttctcctcgg acatcaacaa     3780

catgcccctg caggagctgt ctaacgccct catggcggct gagcgcttca aggagcaccg     3840

ggacacagcc ctgtacaagt cactgtcggt gccggcagcc agcacggcca aaccccctcc     3900

tctgcctcgc tccaacacag actccgccgt ggtcatggag gagggaagtc cgggcgaggt     3960

tcctgtgctg gtggagcccc cagggttgga ggacgttgag gcagcgctag gcatggacag     4020

gcgcacggat gcctacagca ggtcgtcctc agtctccagc caggaggaga agtcgctcca     4080

cgcggaggag ctggttggca ggggcatccc catcgagcga gtcgtctcct cggagggtgg     4140

ccggccctct gtggacctct ccttccagcc ctcgcagccc ctgagcaagt ccagctcctc     4200

tcccgagctg cagactctgc aggacatcct cggggaccct ggggacaagg ccgacgtggg     4260

ccggctgagc cctgaggtta aggcccggtc acagtcaggg accctggacg gggaaagtgc     4320

tgcctggtcg gcctcgggcg aagacagtcg gggccagccc gagggtccct tgccttccag     4380

ctccccccgc tcgcccagtg gcctccggcc ccgaggttac accatctccg actcggcccc     4440

atcacgcagg ggcaagagag tagagaggga cgccttaaag agcagagcca cagcctccaa     4500

tgcagagaaa gtgccaggca tcaaccccag tttcgtgttc ctgcagctct accattcccc     4560

cttctttggc gacgagtcaa acaagccaat cctgctgccc aatgagtcac agtcctttga     4620

gcggtcggtg cagctcctcg accagatccc atcatacgac acccacaaga tcgccgtcct     4680

gtatgttgga gaaggccaga gcaacagcga gctcgccatc ctgtccaatg agcatggctc     4740

ctacaggtac acggagttcc tgacgggcct gggccggctc atcgagctga aggactgcca     4800

gccggacaag gtgtacctgg gaggcctgga cgtgtgtggt gaggacggcc agttcaccta     4860

ctgctggcac gatgacatca tgcaagccgt cttccacatc gccaccctga tgcccaccaa     4920

ggacgtggac aagcaccgct gcgacaagaa gcgccacctg ggcaacgact ttgtgtccat     4980

tgtctacaat gactccggtg aggacttcaa gcttggcacc atcaagggcc agttcaactt     5040

tgtccacgtg atcgtcaccc cgctggacta cgagtgcaac ctggtgtccc tgcagtgcag     5100

gaaagacatg gagggccttg tggacaccag cgtggccaag atcgtgtctg accgcaacct     5160

gcccttcgtg gcccgccaga tggccctgca cgcaaatatg gcctcacagg tgcatcatag     5220

ccgctccaac cccaccgata tctacccctc caagtggatt gcccggctcc gccacatcaa     5280

gcggctccgc cagcggatct gcgaggaagc cgcctactcc aaccccagcc tacctctggt     5340

gcaccctccg tcccatagca aagcccctgc acagactcca gccgagccca cacctggcta     5400

tgaggtgggc cagcggaagc gcctcatctc ctcggtggag gacttcaccg agtttgtgtg     5460

aggccggggc cctccctcct gcactggcct tggacggtat tgcctgtcag tgaaataaat     5520

aaagtcctga ccccagtgca cagacataga ggcacagatt gcaaaaaaaa aaaaaaa        5577


<210>  75
<211>  1784
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tuberous sclerosis 2 (TSC2), transcript variant 5, 
       polypeptide

<400>  75

Met Ala Lys Pro Thr Ser Lys Asp Ser Gly Leu Lys Glu Lys Phe Lys 
1               5                   10                  15      


Ile Leu Leu Gly Leu Gly Thr Pro Arg Pro Asn Pro Arg Ser Ala Glu 
            20                  25                  30          


Gly Lys Gln Thr Glu Phe Ile Ile Thr Ala Glu Ile Leu Arg Glu Leu 
        35                  40                  45              


Ser Met Glu Cys Gly Leu Asn Asn Arg Ile Arg Met Ile Gly Gln Ile 
    50                  55                  60                  


Cys Glu Val Ala Lys Thr Lys Lys Phe Glu Glu His Ala Val Glu Ala 
65                  70                  75                  80  


Leu Trp Lys Ala Val Ala Asp Leu Leu Gln Pro Glu Arg Pro Leu Glu 
                85                  90                  95      


Ala Arg His Ala Val Leu Ala Leu Leu Lys Ala Ile Val Gln Gly Gln 
            100                 105                 110         


Gly Glu Arg Leu Gly Val Leu Arg Ala Leu Phe Phe Lys Val Ile Lys 
        115                 120                 125             


Asp Tyr Pro Ser Asn Glu Asp Leu His Glu Arg Leu Glu Val Phe Lys 
    130                 135                 140                 


Ala Leu Thr Asp Asn Gly Arg His Ile Thr Tyr Leu Glu Glu Glu Leu 
145                 150                 155                 160 


Ala Asp Phe Val Leu Gln Trp Met Asp Val Gly Leu Ser Ser Glu Phe 
                165                 170                 175     


Leu Leu Val Leu Val Asn Leu Val Lys Phe Asn Ser Cys Tyr Leu Asp 
            180                 185                 190         


Glu Tyr Ile Ala Arg Met Val Gln Met Ile Cys Leu Leu Cys Val Arg 
        195                 200                 205             


Thr Ala Ser Ser Val Asp Ile Glu Val Ser Leu Gln Val Leu Asp Ala 
    210                 215                 220                 


Val Val Cys Tyr Asn Cys Leu Pro Ala Glu Ser Leu Pro Leu Phe Ile 
225                 230                 235                 240 


Val Thr Leu Cys Arg Thr Ile Asn Val Lys Glu Leu Cys Glu Pro Cys 
                245                 250                 255     


Trp Lys Leu Met Arg Asn Leu Leu Gly Thr His Leu Gly His Ser Ala 
            260                 265                 270         


Ile Tyr Asn Met Cys His Leu Met Glu Asp Arg Ala Tyr Met Glu Asp 
        275                 280                 285             


Ala Pro Leu Leu Arg Gly Ala Val Phe Phe Val Gly Met Ala Leu Trp 
    290                 295                 300                 


Gly Ala His Arg Leu Tyr Ser Leu Arg Asn Ser Pro Thr Ser Val Leu 
305                 310                 315                 320 


Pro Ser Phe Tyr Gln Ala Met Ala Cys Pro Asn Glu Val Val Ser Tyr 
                325                 330                 335     


Glu Ile Val Leu Ser Ile Thr Arg Leu Ile Lys Lys Tyr Arg Lys Glu 
            340                 345                 350         


Leu Gln Val Val Ala Trp Asp Ile Leu Leu Asn Ile Ile Glu Arg Leu 
        355                 360                 365             


Leu Gln Gln Leu Gln Thr Leu Asp Ser Pro Glu Leu Arg Thr Ile Val 
    370                 375                 380                 


His Asp Leu Leu Thr Thr Val Glu Glu Leu Cys Asp Gln Asn Glu Phe 
385                 390                 395                 400 


His Gly Ser Gln Glu Arg Tyr Phe Glu Leu Val Glu Arg Cys Ala Asp 
                405                 410                 415     


Gln Arg Pro Glu Ser Ser Leu Leu Asn Leu Ile Ser Tyr Arg Ala Gln 
            420                 425                 430         


Ser Ile His Pro Ala Lys Asp Gly Trp Ile Gln Asn Leu Gln Ala Leu 
        435                 440                 445             


Met Glu Arg Phe Phe Arg Ser Glu Ser Arg Gly Ala Val Arg Ile Lys 
    450                 455                 460                 


Val Leu Asp Val Leu Ser Phe Val Leu Leu Ile Asn Arg Gln Phe Tyr 
465                 470                 475                 480 


Glu Glu Glu Leu Ile Asn Ser Val Val Ile Ser Gln Leu Ser His Ile 
                485                 490                 495     


Pro Glu Asp Lys Asp His Gln Val Arg Lys Leu Ala Thr Gln Leu Leu 
            500                 505                 510         


Val Asp Leu Ala Glu Gly Cys His Thr His His Phe Asn Ser Leu Leu 
        515                 520                 525             


Asp Ile Ile Glu Lys Val Met Ala Arg Ser Leu Ser Pro Pro Pro Glu 
    530                 535                 540                 


Leu Glu Glu Arg Asp Val Ala Ala Tyr Ser Ala Ser Leu Glu Asp Val 
545                 550                 555                 560 


Lys Thr Ala Val Leu Gly Leu Leu Val Ile Leu Gln Thr Lys Leu Tyr 
                565                 570                 575     


Thr Leu Pro Ala Ser His Ala Thr Arg Val Tyr Glu Met Leu Val Ser 
            580                 585                 590         


His Ile Gln Leu His Tyr Lys His Ser Tyr Thr Leu Pro Ile Ala Ser 
        595                 600                 605             


Ser Ile Arg Leu Gln Ala Phe Asp Phe Leu Leu Leu Leu Arg Ala Asp 
    610                 615                 620                 


Ser Leu His Arg Leu Gly Leu Pro Asn Lys Asp Gly Val Val Arg Phe 
625                 630                 635                 640 


Ser Pro Tyr Cys Val Cys Asp Tyr Met Glu Pro Glu Arg Gly Ser Glu 
                645                 650                 655     


Lys Lys Thr Ser Gly Pro Leu Ser Pro Pro Thr Gly Pro Pro Gly Pro 
            660                 665                 670         


Ala Pro Ala Gly Pro Ala Val Arg Leu Gly Ser Val Pro Tyr Ser Leu 
        675                 680                 685             


Leu Phe Arg Val Leu Leu Gln Cys Leu Lys Gln Glu Ser Asp Trp Lys 
    690                 695                 700                 


Val Leu Lys Leu Val Leu Gly Arg Leu Pro Glu Ser Leu Arg Tyr Lys 
705                 710                 715                 720 


Val Leu Ile Phe Thr Ser Pro Cys Ser Val Asp Gln Leu Cys Ser Ala 
                725                 730                 735     


Leu Cys Ser Met Leu Ser Gly Pro Lys Thr Leu Glu Arg Leu Arg Gly 
            740                 745                 750         


Ala Pro Glu Gly Phe Ser Arg Thr Asp Leu His Leu Ala Val Val Pro 
        755                 760                 765             


Val Leu Thr Ala Leu Ile Ser Tyr His Asn Tyr Leu Asp Lys Thr Lys 
    770                 775                 780                 


Gln Arg Glu Met Val Tyr Cys Leu Glu Gln Gly Leu Ile His Arg Cys 
785                 790                 795                 800 


Ala Ser Gln Cys Val Val Ala Leu Ser Ile Cys Ser Val Glu Met Pro 
                805                 810                 815     


Asp Ile Ile Ile Lys Ala Leu Pro Val Leu Val Val Lys Leu Thr His 
            820                 825                 830         


Ile Ser Ala Thr Ala Ser Met Ala Val Pro Leu Leu Glu Phe Leu Ser 
        835                 840                 845             


Thr Leu Ala Arg Leu Pro His Leu Tyr Arg Asn Phe Ala Ala Glu Gln 
    850                 855                 860                 


Tyr Ala Ser Val Phe Ala Ile Ser Leu Pro Tyr Thr Asn Pro Ser Lys 
865                 870                 875                 880 


Phe Asn Gln Tyr Ile Val Cys Leu Ala His His Val Ile Ala Met Trp 
                885                 890                 895     


Phe Ile Arg Cys Arg Leu Pro Phe Arg Lys Asp Phe Val Pro Phe Ile 
            900                 905                 910         


Thr Lys Gly Leu Arg Ser Asn Val Leu Leu Ser Phe Asp Asp Thr Pro 
        915                 920                 925             


Glu Lys Asp Ser Phe Arg Ala Arg Ser Thr Ser Leu Asn Glu Arg Pro 
    930                 935                 940                 


Lys Ser Leu Arg Ile Ala Arg Pro Pro Lys Gln Gly Leu Asn Asn Ser 
945                 950                 955                 960 


Pro Pro Val Lys Glu Phe Lys Glu Ser Ser Ala Ala Glu Ala Phe Arg 
                965                 970                 975     


Cys Arg Ser Ile Ser Val Ser Glu His Val Val Arg Ser Arg Ile Gln 
            980                 985                 990         


Thr Ser Leu Thr Ser Ala Ser Leu  Gly Ser Ala Asp Glu  Asn Ser Val 
        995                 1000                 1005             


Ala Gln  Ala Asp Asp Ser Leu  Lys Asn Leu His Leu  Glu Leu Thr 
    1010                 1015                 1020             


Glu Thr  Cys Leu Asp Met Met  Ala Arg Tyr Val Phe  Ser Asn Phe 
    1025                 1030                 1035             


Thr Ala  Val Pro Lys Arg Ser  Pro Val Gly Glu Phe  Leu Leu Ala 
    1040                 1045                 1050             


Gly Gly  Arg Thr Lys Thr Trp  Leu Val Gly Asn Lys  Leu Val Thr 
    1055                 1060                 1065             


Val Thr  Thr Ser Val Gly Thr  Gly Thr Arg Ser Leu  Leu Gly Leu 
    1070                 1075                 1080             


Asp Ser  Gly Glu Leu Gln Ser  Gly Pro Glu Ser Ser  Ser Ser Pro 
    1085                 1090                 1095             


Gly Val  His Val Arg Gln Thr  Lys Glu Ala Pro Ala  Lys Leu Glu 
    1100                 1105                 1110             


Ser Gln  Ala Gly Gln Gln Val  Ser Arg Gly Ala Arg  Asp Arg Val 
    1115                 1120                 1125             


Arg Ser  Met Ser Gly Gly His  Gly Leu Arg Val Gly  Ala Leu Asp 
    1130                 1135                 1140             


Val Pro  Ala Ser Gln Phe Leu  Gly Ser Ala Thr Ser  Pro Gly Pro 
    1145                 1150                 1155             


Arg Thr  Ala Pro Ala Ala Lys  Pro Glu Lys Ala Ser  Ala Gly Thr 
    1160                 1165                 1170             


Arg Val  Pro Val Gln Glu Lys  Thr Asn Leu Ala Ala  Tyr Val Pro 
    1175                 1180                 1185             


Leu Leu  Thr Gln Gly Trp Ala  Glu Ile Leu Val Arg  Arg Pro Thr 
    1190                 1195                 1200             


Gly Asn  Thr Ser Trp Leu Met  Ser Leu Glu Asn Pro  Leu Ser Pro 
    1205                 1210                 1215             


Phe Ser  Ser Asp Ile Asn Asn  Met Pro Leu Gln Glu  Leu Ser Asn 
    1220                 1225                 1230             


Ala Leu  Met Ala Ala Glu Arg  Phe Lys Glu His Arg  Asp Thr Ala 
    1235                 1240                 1245             


Leu Tyr  Lys Ser Leu Ser Val  Pro Ala Ala Ser Thr  Ala Lys Pro 
    1250                 1255                 1260             


Pro Pro  Leu Pro Arg Ser Asn  Thr Asp Ser Ala Val  Val Met Glu 
    1265                 1270                 1275             


Glu Gly  Ser Pro Gly Glu Val  Pro Val Leu Val Glu  Pro Pro Gly 
    1280                 1285                 1290             


Leu Glu  Asp Val Glu Ala Ala  Leu Gly Met Asp Arg  Arg Thr Asp 
    1295                 1300                 1305             


Ala Tyr  Ser Arg Ser Ser Ser  Val Ser Ser Gln Glu  Glu Lys Ser 
    1310                 1315                 1320             


Leu His  Ala Glu Glu Leu Val  Gly Arg Gly Ile Pro  Ile Glu Arg 
    1325                 1330                 1335             


Val Val  Ser Ser Glu Gly Gly  Arg Pro Ser Val Asp  Leu Ser Phe 
    1340                 1345                 1350             


Gln Pro  Ser Gln Pro Leu Ser  Lys Ser Ser Ser Ser  Pro Glu Leu 
    1355                 1360                 1365             


Gln Thr  Leu Gln Asp Ile Leu  Gly Asp Pro Gly Asp  Lys Ala Asp 
    1370                 1375                 1380             


Val Gly  Arg Leu Ser Pro Glu  Val Lys Ala Arg Ser  Gln Ser Gly 
    1385                 1390                 1395             


Thr Leu  Asp Gly Glu Ser Ala  Ala Trp Ser Ala Ser  Gly Glu Asp 
    1400                 1405                 1410             


Ser Arg  Gly Gln Pro Glu Gly  Pro Leu Pro Ser Ser  Ser Pro Arg 
    1415                 1420                 1425             


Ser Pro  Ser Gly Leu Arg Pro  Arg Gly Tyr Thr Ile  Ser Asp Ser 
    1430                 1435                 1440             


Ala Pro  Ser Arg Arg Gly Lys  Arg Val Glu Arg Asp  Ala Leu Lys 
    1445                 1450                 1455             


Ser Arg  Ala Thr Ala Ser Asn  Ala Glu Lys Val Pro  Gly Ile Asn 
    1460                 1465                 1470             


Pro Ser  Phe Val Phe Leu Gln  Leu Tyr His Ser Pro  Phe Phe Gly 
    1475                 1480                 1485             


Asp Glu  Ser Asn Lys Pro Ile  Leu Leu Pro Asn Glu  Ser Gln Ser 
    1490                 1495                 1500             


Phe Glu  Arg Ser Val Gln Leu  Leu Asp Gln Ile Pro  Ser Tyr Asp 
    1505                 1510                 1515             


Thr His  Lys Ile Ala Val Leu  Tyr Val Gly Glu Gly  Gln Ser Asn 
    1520                 1525                 1530             


Ser Glu  Leu Ala Ile Leu Ser  Asn Glu His Gly Ser  Tyr Arg Tyr 
    1535                 1540                 1545             


Thr Glu  Phe Leu Thr Gly Leu  Gly Arg Leu Ile Glu  Leu Lys Asp 
    1550                 1555                 1560             


Cys Gln  Pro Asp Lys Val Tyr  Leu Gly Gly Leu Asp  Val Cys Gly 
    1565                 1570                 1575             


Glu Asp  Gly Gln Phe Thr Tyr  Cys Trp His Asp Asp  Ile Met Gln 
    1580                 1585                 1590             


Ala Val  Phe His Ile Ala Thr  Leu Met Pro Thr Lys  Asp Val Asp 
    1595                 1600                 1605             


Lys His  Arg Cys Asp Lys Lys  Arg His Leu Gly Asn  Asp Phe Val 
    1610                 1615                 1620             


Ser Ile  Val Tyr Asn Asp Ser  Gly Glu Asp Phe Lys  Leu Gly Thr 
    1625                 1630                 1635             


Ile Lys  Gly Gln Phe Asn Phe  Val His Val Ile Val  Thr Pro Leu 
    1640                 1645                 1650             


Asp Tyr  Glu Cys Asn Leu Val  Ser Leu Gln Cys Arg  Lys Asp Met 
    1655                 1660                 1665             


Glu Gly  Leu Val Asp Thr Ser  Val Ala Lys Ile Val  Ser Asp Arg 
    1670                 1675                 1680             


Asn Leu  Pro Phe Val Ala Arg  Gln Met Ala Leu His  Ala Asn Met 
    1685                 1690                 1695             


Ala Ser  Gln Val His His Ser  Arg Ser Asn Pro Thr  Asp Ile Tyr 
    1700                 1705                 1710             


Pro Ser  Lys Trp Ile Ala Arg  Leu Arg His Ile Lys  Arg Leu Arg 
    1715                 1720                 1725             


Gln Arg  Ile Cys Glu Glu Ala  Ala Tyr Ser Asn Pro  Ser Leu Pro 
    1730                 1735                 1740             


Leu Val  His Pro Pro Ser His  Ser Lys Ala Pro Ala  Gln Thr Pro 
    1745                 1750                 1755             


Ala Glu  Pro Thr Pro Gly Tyr  Glu Val Gly Gln Arg  Lys Arg Leu 
    1760                 1765                 1770             


Ile Ser  Ser Val Glu Asp Phe  Thr Glu Phe Val 
    1775                 1780                 


<210>  76
<211>  5577
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tuberous sclerosis 2 (TSC2), transcript variant 5, 
       mRNA NCBI Reference Sequence: NM_001114382.1

<400>  76
ccggcggcgt cccggggcca ggggggtgcg cctttctccg cgtcggggcg gcccggagcg       60

cggtggcgcg gcgcgggagg ggttttctgg tgcgtcctgg tccaccatgg ccaaaccaac      120

aagcaaagat tcaggcttga aggagaagtt taagattctg ttgggactgg gaacaccgag      180

gccaaatccc aggtctgcag agggtaaaca gacggagttt atcatcaccg cggaaatact      240

gagagaactg agcatggaat gtggcctcaa caatcgcatc cggatgatag ggcagatttg      300

tgaagtcgca aaaaccaaga aatttgaaga gcacgcagtg gaagcactct ggaaggcggt      360

cgcggatctg ttgcagccgg agcggccgct ggaggcccgg cacgcggtgc tggctctgct      420

gaaggccatc gtgcaggggc agggcgagcg tttgggggtc ctcagagccc tcttctttaa      480

ggtcatcaag gattaccctt ccaacgaaga ccttcacgaa aggctggagg ttttcaaggc      540

cctcacagac aatgggagac acatcaccta cttggaggaa gagctggctg actttgtcct      600

gcagtggatg gatgttggct tgtcctcgga attccttctg gtgctggtga acttggtcaa      660

attcaatagc tgttacctcg acgagtacat cgcaaggatg gttcagatga tctgtctgct      720

gtgcgtccgg accgcgtcct ctgtggacat agaggtctcc ctgcaggtgc tggacgccgt      780

ggtctgctac aactgcctgc cggctgagag cctcccgctg ttcatcgtta ccctctgtcg      840

caccatcaac gtcaaggagc tctgcgagcc ttgctggaag ctgatgcgga acctccttgg      900

cacccacctg ggccacagcg ccatctacaa catgtgccac ctcatggagg acagagccta      960

catggaggac gcgcccctgc tgagaggagc cgtgtttttt gtgggcatgg ctctctgggg     1020

agcccaccgg ctctattctc tcaggaactc gccgacatct gtgttgccat cattttacca     1080

ggccatggca tgtccgaacg aggtggtgtc ctatgagatc gtcctgtcca tcaccaggct     1140

catcaagaag tataggaagg agctccaggt ggtggcgtgg gacattctgc tgaacatcat     1200

cgaacggctc cttcagcagc tccagacctt ggacagcccg gagctcagga ccatcgtcca     1260

tgacctgttg accacggtgg aggagctgtg tgaccagaac gagttccacg ggtctcagga     1320

gagatacttt gaactggtgg agagatgtgc ggaccagagg cctgagtcct ccctcctgaa     1380

cctgatctcc tatagagcgc agtccatcca cccggccaag gacggctgga ttcagaacct     1440

gcaggcgctg atggagagat tcttcaggag cgagtcccga ggcgccgtgc gcatcaaggt     1500

gctggacgtg ctgtcctttg tgctgctcat caacaggcag ttctatgagg aggagctgat     1560

taactcagtg gtcatctcgc agctctccca catccccgag gataaagacc accaggtccg     1620

aaagctggcc acccagttgc tggtggacct ggcagagggc tgccacacac accacttcaa     1680

cagcctgctg gacatcatcg agaaggtgat ggcccgctcc ctctccccac ccccggagct     1740

ggaagaaagg gatgtggccg catactcggc ctccttggag gatgtgaaga cagccgtcct     1800

ggggcttctg gtcatccttc agaccaagct gtacaccctg cctgcaagcc acgccacgcg     1860

tgtgtatgag atgctggtca gccacattca gctccactac aagcacagct acaccctgcc     1920

aatcgcgagc agcatccggc tgcaggcctt tgacttcctg ttgctgctgc gggccgactc     1980

actgcaccgc ctgggcctgc ccaacaagga tggagtcgtg cggttcagcc cctactgcgt     2040

ctgcgactac atggagccag agagaggctc tgagaagaag accagcggcc ccctttctcc     2100

tcccacaggg cctcctggcc cggcgcctgc aggccccgcc gtgcggctgg ggtccgtgcc     2160

ctactccctg ctcttccgcg tcctgctgca gtgcttgaag caggagtctg actggaaggt     2220

gctgaagctg gttctgggca ggctgcctga gtccctgcgc tataaagtgc tcatctttac     2280

ttccccttgc agtgtggacc agctgtgctc tgctctctgc tccatgcttt caggcccaaa     2340

gacactggag cggctccgag gcgccccaga aggcttctcc agaactgact tgcacctggc     2400

cgtggttcca gtgctgacag cattaatctc ttaccataac tacctggaca aaaccaaaca     2460

gcgcgagatg gtctactgcc tggagcaggg cctcatccac cgctgtgcca gccagtgcgt     2520

cgtggccttg tccatctgca gcgtggagat gcctgacatc atcatcaagg cgctgcctgt     2580

tctggtggtg aagctcacgc acatctcagc cacagccagc atggccgtcc cactgctgga     2640

gttcctgtcc actctggcca ggctgccgca cctctacagg aactttgccg cggagcagta     2700

tgccagtgtg ttcgccatct ccctgccgta caccaacccc tccaagttta atcagtacat     2760

cgtgtgtctg gcccatcacg tcatagccat gtggttcatc aggtgccgcc tgcccttccg     2820

gaaggatttt gtccctttca tcactaaggg cctgcggtcc aatgtcctct tgtcttttga     2880

tgacaccccc gagaaggaca gcttcagggc ccggagtact agtctcaacg agagacccaa     2940

gagtctgagg atagccagac cccccaaaca aggcttgaat aactctccac ccgtgaaaga     3000

attcaaggag agctctgcag ccgaggcctt ccggtgccgc agcatcagtg tgtctgaaca     3060

tgtggtccgc agcaggatac agacgtccct caccagtgcc agcttggggt ctgcagatga     3120

gaactccgtg gcccaggctg acgatagcct gaaaaacctc cacctggagc tcacggaaac     3180

ctgtctggac atgatggctc gatacgtctt ctccaacttc acggctgtcc cgaagaggtc     3240

tcctgtgggc gagttcctcc tagcgggtgg caggaccaaa acctggctgg ttgggaacaa     3300

gcttgtcact gtgacgacaa gcgtgggaac cgggacccgg tcgttactag gcctggactc     3360

gggggagctg cagtccggcc cggagtcgag ctccagcccc ggggtgcatg tgagacagac     3420

caaggaggcg ccggccaagc tggagtccca ggctgggcag caggtgtccc gtggggcccg     3480

ggatcgggtc cgttccatgt cggggggcca tggtcttcga gttggcgccc tggacgtgcc     3540

ggcctcccag ttcctgggca gtgccacttc tccaggacca cggactgcac cagccgcgaa     3600

acctgagaag gcctcagctg gcacccgggt tcctgtgcag gagaagacga acctggcggc     3660

ctatgtgccc ctgctgaccc agggctgggc ggagatcctg gtccggaggc ccacagggaa     3720

caccagctgg ctgatgagcc tggagaaccc gctcagccct ttctcctcgg acatcaacaa     3780

catgcccctg caggagctgt ctaacgccct catggcggct gagcgcttca aggagcaccg     3840

ggacacagcc ctgtacaagt cactgtcggt gccggcagcc agcacggcca aaccccctcc     3900

tctgcctcgc tccaacacag actccgccgt ggtcatggag gagggaagtc cgggcgaggt     3960

tcctgtgctg gtggagcccc cagggttgga ggacgttgag gcagcgctag gcatggacag     4020

gcgcacggat gcctacagca ggtcgtcctc agtctccagc caggaggaga agtcgctcca     4080

cgcggaggag ctggttggca ggggcatccc catcgagcga gtcgtctcct cggagggtgg     4140

ccggccctct gtggacctct ccttccagcc ctcgcagccc ctgagcaagt ccagctcctc     4200

tcccgagctg cagactctgc aggacatcct cggggaccct ggggacaagg ccgacgtggg     4260

ccggctgagc cctgaggtta aggcccggtc acagtcaggg accctggacg gggaaagtgc     4320

tgcctggtcg gcctcgggcg aagacagtcg gggccagccc gagggtccct tgccttccag     4380

ctccccccgc tcgcccagtg gcctccggcc ccgaggttac accatctccg actcggcccc     4440

atcacgcagg ggcaagagag tagagaggga cgccttaaag agcagagcca cagcctccaa     4500

tgcagagaaa gtgccaggca tcaaccccag tttcgtgttc ctgcagctct accattcccc     4560

cttctttggc gacgagtcaa acaagccaat cctgctgccc aatgagtcac agtcctttga     4620

gcggtcggtg cagctcctcg accagatccc atcatacgac acccacaaga tcgccgtcct     4680

gtatgttgga gaaggccaga gcaacagcga gctcgccatc ctgtccaatg agcatggctc     4740

ctacaggtac acggagttcc tgacgggcct gggccggctc atcgagctga aggactgcca     4800

gccggacaag gtgtacctgg gaggcctgga cgtgtgtggt gaggacggcc agttcaccta     4860

ctgctggcac gatgacatca tgcaagccgt cttccacatc gccaccctga tgcccaccaa     4920

ggacgtggac aagcaccgct gcgacaagaa gcgccacctg ggcaacgact ttgtgtccat     4980

tgtctacaat gactccggtg aggacttcaa gcttggcacc atcaagggcc agttcaactt     5040

tgtccacgtg atcgtcaccc cgctggacta cgagtgcaac ctggtgtccc tgcagtgcag     5100

gaaagacatg gagggccttg tggacaccag cgtggccaag atcgtgtctg accgcaacct     5160

gcccttcgtg gcccgccaga tggccctgca cgcaaatatg gcctcacagg tgcatcatag     5220

ccgctccaac cccaccgata tctacccctc caagtggatt gcccggctcc gccacatcaa     5280

gcggctccgc cagcggatct gcgaggaagc cgcctactcc aaccccagcc tacctctggt     5340

gcaccctccg tcccatagca aagcccctgc acagactcca gccgagccca cacctggcta     5400

tgaggtgggc cagcggaagc gcctcatctc ctcggtggag gacttcaccg agtttgtgtg     5460

aggccggggc cctccctcct gcactggcct tggacggtat tgcctgtcag tgaaataaat     5520

aaagtcctga ccccagtgca cagacataga ggcacagatt gcaaaaaaaa aaaaaaa        5577


<210>  77
<211>  1784
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tuberous sclerosis 2 (TSC2), transcript variant 5, 
       polypeptide

<400>  77

Met Ala Lys Pro Thr Ser Lys Asp Ser Gly Leu Lys Glu Lys Phe Lys 
1               5                   10                  15      


Ile Leu Leu Gly Leu Gly Thr Pro Arg Pro Asn Pro Arg Ser Ala Glu 
            20                  25                  30          


Gly Lys Gln Thr Glu Phe Ile Ile Thr Ala Glu Ile Leu Arg Glu Leu 
        35                  40                  45              


Ser Met Glu Cys Gly Leu Asn Asn Arg Ile Arg Met Ile Gly Gln Ile 
    50                  55                  60                  


Cys Glu Val Ala Lys Thr Lys Lys Phe Glu Glu His Ala Val Glu Ala 
65                  70                  75                  80  


Leu Trp Lys Ala Val Ala Asp Leu Leu Gln Pro Glu Arg Pro Leu Glu 
                85                  90                  95      


Ala Arg His Ala Val Leu Ala Leu Leu Lys Ala Ile Val Gln Gly Gln 
            100                 105                 110         


Gly Glu Arg Leu Gly Val Leu Arg Ala Leu Phe Phe Lys Val Ile Lys 
        115                 120                 125             


Asp Tyr Pro Ser Asn Glu Asp Leu His Glu Arg Leu Glu Val Phe Lys 
    130                 135                 140                 


Ala Leu Thr Asp Asn Gly Arg His Ile Thr Tyr Leu Glu Glu Glu Leu 
145                 150                 155                 160 


Ala Asp Phe Val Leu Gln Trp Met Asp Val Gly Leu Ser Ser Glu Phe 
                165                 170                 175     


Leu Leu Val Leu Val Asn Leu Val Lys Phe Asn Ser Cys Tyr Leu Asp 
            180                 185                 190         


Glu Tyr Ile Ala Arg Met Val Gln Met Ile Cys Leu Leu Cys Val Arg 
        195                 200                 205             


Thr Ala Ser Ser Val Asp Ile Glu Val Ser Leu Gln Val Leu Asp Ala 
    210                 215                 220                 


Val Val Cys Tyr Asn Cys Leu Pro Ala Glu Ser Leu Pro Leu Phe Ile 
225                 230                 235                 240 


Val Thr Leu Cys Arg Thr Ile Asn Val Lys Glu Leu Cys Glu Pro Cys 
                245                 250                 255     


Trp Lys Leu Met Arg Asn Leu Leu Gly Thr His Leu Gly His Ser Ala 
            260                 265                 270         


Ile Tyr Asn Met Cys His Leu Met Glu Asp Arg Ala Tyr Met Glu Asp 
        275                 280                 285             


Ala Pro Leu Leu Arg Gly Ala Val Phe Phe Val Gly Met Ala Leu Trp 
    290                 295                 300                 


Gly Ala His Arg Leu Tyr Ser Leu Arg Asn Ser Pro Thr Ser Val Leu 
305                 310                 315                 320 


Pro Ser Phe Tyr Gln Ala Met Ala Cys Pro Asn Glu Val Val Ser Tyr 
                325                 330                 335     


Glu Ile Val Leu Ser Ile Thr Arg Leu Ile Lys Lys Tyr Arg Lys Glu 
            340                 345                 350         


Leu Gln Val Val Ala Trp Asp Ile Leu Leu Asn Ile Ile Glu Arg Leu 
        355                 360                 365             


Leu Gln Gln Leu Gln Thr Leu Asp Ser Pro Glu Leu Arg Thr Ile Val 
    370                 375                 380                 


His Asp Leu Leu Thr Thr Val Glu Glu Leu Cys Asp Gln Asn Glu Phe 
385                 390                 395                 400 


His Gly Ser Gln Glu Arg Tyr Phe Glu Leu Val Glu Arg Cys Ala Asp 
                405                 410                 415     


Gln Arg Pro Glu Ser Ser Leu Leu Asn Leu Ile Ser Tyr Arg Ala Gln 
            420                 425                 430         


Ser Ile His Pro Ala Lys Asp Gly Trp Ile Gln Asn Leu Gln Ala Leu 
        435                 440                 445             


Met Glu Arg Phe Phe Arg Ser Glu Ser Arg Gly Ala Val Arg Ile Lys 
    450                 455                 460                 


Val Leu Asp Val Leu Ser Phe Val Leu Leu Ile Asn Arg Gln Phe Tyr 
465                 470                 475                 480 


Glu Glu Glu Leu Ile Asn Ser Val Val Ile Ser Gln Leu Ser His Ile 
                485                 490                 495     


Pro Glu Asp Lys Asp His Gln Val Arg Lys Leu Ala Thr Gln Leu Leu 
            500                 505                 510         


Val Asp Leu Ala Glu Gly Cys His Thr His His Phe Asn Ser Leu Leu 
        515                 520                 525             


Asp Ile Ile Glu Lys Val Met Ala Arg Ser Leu Ser Pro Pro Pro Glu 
    530                 535                 540                 


Leu Glu Glu Arg Asp Val Ala Ala Tyr Ser Ala Ser Leu Glu Asp Val 
545                 550                 555                 560 


Lys Thr Ala Val Leu Gly Leu Leu Val Ile Leu Gln Thr Lys Leu Tyr 
                565                 570                 575     


Thr Leu Pro Ala Ser His Ala Thr Arg Val Tyr Glu Met Leu Val Ser 
            580                 585                 590         


His Ile Gln Leu His Tyr Lys His Ser Tyr Thr Leu Pro Ile Ala Ser 
        595                 600                 605             


Ser Ile Arg Leu Gln Ala Phe Asp Phe Leu Leu Leu Leu Arg Ala Asp 
    610                 615                 620                 


Ser Leu His Arg Leu Gly Leu Pro Asn Lys Asp Gly Val Val Arg Phe 
625                 630                 635                 640 


Ser Pro Tyr Cys Val Cys Asp Tyr Met Glu Pro Glu Arg Gly Ser Glu 
                645                 650                 655     


Lys Lys Thr Ser Gly Pro Leu Ser Pro Pro Thr Gly Pro Pro Gly Pro 
            660                 665                 670         


Ala Pro Ala Gly Pro Ala Val Arg Leu Gly Ser Val Pro Tyr Ser Leu 
        675                 680                 685             


Leu Phe Arg Val Leu Leu Gln Cys Leu Lys Gln Glu Ser Asp Trp Lys 
    690                 695                 700                 


Val Leu Lys Leu Val Leu Gly Arg Leu Pro Glu Ser Leu Arg Tyr Lys 
705                 710                 715                 720 


Val Leu Ile Phe Thr Ser Pro Cys Ser Val Asp Gln Leu Cys Ser Ala 
                725                 730                 735     


Leu Cys Ser Met Leu Ser Gly Pro Lys Thr Leu Glu Arg Leu Arg Gly 
            740                 745                 750         


Ala Pro Glu Gly Phe Ser Arg Thr Asp Leu His Leu Ala Val Val Pro 
        755                 760                 765             


Val Leu Thr Ala Leu Ile Ser Tyr His Asn Tyr Leu Asp Lys Thr Lys 
    770                 775                 780                 


Gln Arg Glu Met Val Tyr Cys Leu Glu Gln Gly Leu Ile His Arg Cys 
785                 790                 795                 800 


Ala Ser Gln Cys Val Val Ala Leu Ser Ile Cys Ser Val Glu Met Pro 
                805                 810                 815     


Asp Ile Ile Ile Lys Ala Leu Pro Val Leu Val Val Lys Leu Thr His 
            820                 825                 830         


Ile Ser Ala Thr Ala Ser Met Ala Val Pro Leu Leu Glu Phe Leu Ser 
        835                 840                 845             


Thr Leu Ala Arg Leu Pro His Leu Tyr Arg Asn Phe Ala Ala Glu Gln 
    850                 855                 860                 


Tyr Ala Ser Val Phe Ala Ile Ser Leu Pro Tyr Thr Asn Pro Ser Lys 
865                 870                 875                 880 


Phe Asn Gln Tyr Ile Val Cys Leu Ala His His Val Ile Ala Met Trp 
                885                 890                 895     


Phe Ile Arg Cys Arg Leu Pro Phe Arg Lys Asp Phe Val Pro Phe Ile 
            900                 905                 910         


Thr Lys Gly Leu Arg Ser Asn Val Leu Leu Ser Phe Asp Asp Thr Pro 
        915                 920                 925             


Glu Lys Asp Ser Phe Arg Ala Arg Ser Thr Ser Leu Asn Glu Arg Pro 
    930                 935                 940                 


Lys Ser Leu Arg Ile Ala Arg Pro Pro Lys Gln Gly Leu Asn Asn Ser 
945                 950                 955                 960 


Pro Pro Val Lys Glu Phe Lys Glu Ser Ser Ala Ala Glu Ala Phe Arg 
                965                 970                 975     


Cys Arg Ser Ile Ser Val Ser Glu His Val Val Arg Ser Arg Ile Gln 
            980                 985                 990         


Thr Ser Leu Thr Ser Ala Ser Leu  Gly Ser Ala Asp Glu  Asn Ser Val 
        995                 1000                 1005             


Ala Gln  Ala Asp Asp Ser Leu  Lys Asn Leu His Leu  Glu Leu Thr 
    1010                 1015                 1020             


Glu Thr  Cys Leu Asp Met Met  Ala Arg Tyr Val Phe  Ser Asn Phe 
    1025                 1030                 1035             


Thr Ala  Val Pro Lys Arg Ser  Pro Val Gly Glu Phe  Leu Leu Ala 
    1040                 1045                 1050             


Gly Gly  Arg Thr Lys Thr Trp  Leu Val Gly Asn Lys  Leu Val Thr 
    1055                 1060                 1065             


Val Thr  Thr Ser Val Gly Thr  Gly Thr Arg Ser Leu  Leu Gly Leu 
    1070                 1075                 1080             


Asp Ser  Gly Glu Leu Gln Ser  Gly Pro Glu Ser Ser  Ser Ser Pro 
    1085                 1090                 1095             


Gly Val  His Val Arg Gln Thr  Lys Glu Ala Pro Ala  Lys Leu Glu 
    1100                 1105                 1110             


Ser Gln  Ala Gly Gln Gln Val  Ser Arg Gly Ala Arg  Asp Arg Val 
    1115                 1120                 1125             


Arg Ser  Met Ser Gly Gly His  Gly Leu Arg Val Gly  Ala Leu Asp 
    1130                 1135                 1140             


Val Pro  Ala Ser Gln Phe Leu  Gly Ser Ala Thr Ser  Pro Gly Pro 
    1145                 1150                 1155             


Arg Thr  Ala Pro Ala Ala Lys  Pro Glu Lys Ala Ser  Ala Gly Thr 
    1160                 1165                 1170             


Arg Val  Pro Val Gln Glu Lys  Thr Asn Leu Ala Ala  Tyr Val Pro 
    1175                 1180                 1185             


Leu Leu  Thr Gln Gly Trp Ala  Glu Ile Leu Val Arg  Arg Pro Thr 
    1190                 1195                 1200             


Gly Asn  Thr Ser Trp Leu Met  Ser Leu Glu Asn Pro  Leu Ser Pro 
    1205                 1210                 1215             


Phe Ser  Ser Asp Ile Asn Asn  Met Pro Leu Gln Glu  Leu Ser Asn 
    1220                 1225                 1230             


Ala Leu  Met Ala Ala Glu Arg  Phe Lys Glu His Arg  Asp Thr Ala 
    1235                 1240                 1245             


Leu Tyr  Lys Ser Leu Ser Val  Pro Ala Ala Ser Thr  Ala Lys Pro 
    1250                 1255                 1260             


Pro Pro  Leu Pro Arg Ser Asn  Thr Asp Ser Ala Val  Val Met Glu 
    1265                 1270                 1275             


Glu Gly  Ser Pro Gly Glu Val  Pro Val Leu Val Glu  Pro Pro Gly 
    1280                 1285                 1290             


Leu Glu  Asp Val Glu Ala Ala  Leu Gly Met Asp Arg  Arg Thr Asp 
    1295                 1300                 1305             


Ala Tyr  Ser Arg Ser Ser Ser  Val Ser Ser Gln Glu  Glu Lys Ser 
    1310                 1315                 1320             


Leu His  Ala Glu Glu Leu Val  Gly Arg Gly Ile Pro  Ile Glu Arg 
    1325                 1330                 1335             


Val Val  Ser Ser Glu Gly Gly  Arg Pro Ser Val Asp  Leu Ser Phe 
    1340                 1345                 1350             


Gln Pro  Ser Gln Pro Leu Ser  Lys Ser Ser Ser Ser  Pro Glu Leu 
    1355                 1360                 1365             


Gln Thr  Leu Gln Asp Ile Leu  Gly Asp Pro Gly Asp  Lys Ala Asp 
    1370                 1375                 1380             


Val Gly  Arg Leu Ser Pro Glu  Val Lys Ala Arg Ser  Gln Ser Gly 
    1385                 1390                 1395             


Thr Leu  Asp Gly Glu Ser Ala  Ala Trp Ser Ala Ser  Gly Glu Asp 
    1400                 1405                 1410             


Ser Arg  Gly Gln Pro Glu Gly  Pro Leu Pro Ser Ser  Ser Pro Arg 
    1415                 1420                 1425             


Ser Pro  Ser Gly Leu Arg Pro  Arg Gly Tyr Thr Ile  Ser Asp Ser 
    1430                 1435                 1440             


Ala Pro  Ser Arg Arg Gly Lys  Arg Val Glu Arg Asp  Ala Leu Lys 
    1445                 1450                 1455             


Ser Arg  Ala Thr Ala Ser Asn  Ala Glu Lys Val Pro  Gly Ile Asn 
    1460                 1465                 1470             


Pro Ser  Phe Val Phe Leu Gln  Leu Tyr His Ser Pro  Phe Phe Gly 
    1475                 1480                 1485             


Asp Glu  Ser Asn Lys Pro Ile  Leu Leu Pro Asn Glu  Ser Gln Ser 
    1490                 1495                 1500             


Phe Glu  Arg Ser Val Gln Leu  Leu Asp Gln Ile Pro  Ser Tyr Asp 
    1505                 1510                 1515             


Thr His  Lys Ile Ala Val Leu  Tyr Val Gly Glu Gly  Gln Ser Asn 
    1520                 1525                 1530             


Ser Glu  Leu Ala Ile Leu Ser  Asn Glu His Gly Ser  Tyr Arg Tyr 
    1535                 1540                 1545             


Thr Glu  Phe Leu Thr Gly Leu  Gly Arg Leu Ile Glu  Leu Lys Asp 
    1550                 1555                 1560             


Cys Gln  Pro Asp Lys Val Tyr  Leu Gly Gly Leu Asp  Val Cys Gly 
    1565                 1570                 1575             


Glu Asp  Gly Gln Phe Thr Tyr  Cys Trp His Asp Asp  Ile Met Gln 
    1580                 1585                 1590             


Ala Val  Phe His Ile Ala Thr  Leu Met Pro Thr Lys  Asp Val Asp 
    1595                 1600                 1605             


Lys His  Arg Cys Asp Lys Lys  Arg His Leu Gly Asn  Asp Phe Val 
    1610                 1615                 1620             


Ser Ile  Val Tyr Asn Asp Ser  Gly Glu Asp Phe Lys  Leu Gly Thr 
    1625                 1630                 1635             


Ile Lys  Gly Gln Phe Asn Phe  Val His Val Ile Val  Thr Pro Leu 
    1640                 1645                 1650             


Asp Tyr  Glu Cys Asn Leu Val  Ser Leu Gln Cys Arg  Lys Asp Met 
    1655                 1660                 1665             


Glu Gly  Leu Val Asp Thr Ser  Val Ala Lys Ile Val  Ser Asp Arg 
    1670                 1675                 1680             


Asn Leu  Pro Phe Val Ala Arg  Gln Met Ala Leu His  Ala Asn Met 
    1685                 1690                 1695             


Ala Ser  Gln Val His His Ser  Arg Ser Asn Pro Thr  Asp Ile Tyr 
    1700                 1705                 1710             


Pro Ser  Lys Trp Ile Ala Arg  Leu Arg His Ile Lys  Arg Leu Arg 
    1715                 1720                 1725             


Gln Arg  Ile Cys Glu Glu Ala  Ala Tyr Ser Asn Pro  Ser Leu Pro 
    1730                 1735                 1740             


Leu Val  His Pro Pro Ser His  Ser Lys Ala Pro Ala  Gln Thr Pro 
    1745                 1750                 1755             


Ala Glu  Pro Thr Pro Gly Tyr  Glu Val Gly Gln Arg  Lys Arg Leu 
    1760                 1765                 1770             


Ile Ser  Ser Val Glu Asp Phe  Thr Glu Phe Val 
    1775                 1780                 


<210>  78
<211>  1606
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human Heme Oxygenase-1 mRNA GenBank Accession No.: NM_002133.2
       GI:298676487

<400>  78
aaatgtgacc ggccgcggct ccggcagtca acgcctgcct cctctcgagc gtcctcagcg       60

cagccgccgc ccgcggagcc agcacgaacg agcccagcac cggccggatg gagcgtccgc      120

aacccgacag catgccccag gatttgtcag aggccctgaa ggaggccacc aaggaggtgc      180

acacccaggc agagaatgct gagttcatga ggaactttca gaagggccag gtgacccgag      240

acggcttcaa gctggtgatg gcctccctgt accacatcta tgtggccctg gaggaggaga      300

ttgagcgcaa caaggagagc ccagtcttcg cccctgtcta cttcccagaa gagctgcacc      360

gcaaggctgc cctggagcag gacctggcct tctggtacgg gccccgctgg caggaggtca      420

tcccctacac accagccatg cagcgctatg tgaagcggct ccacgaggtg gggcgcacag      480

agcccgagct gctggtggcc cacgcctaca cccgctacct gggtgacctg tctgggggcc      540

aggtgctcaa aaagattgcc cagaaagccc tggacctgcc cagctctggc gagggcctgg      600

ccttcttcac cttccccaac attgccagtg ccaccaagtt caagcagctc taccgctccc      660

gcatgaactc cctggagatg actcccgcag tcaggcagag ggtgatagaa gaggccaaga      720

ctgcgttcct gctcaacatc cagctctttg aggagttgca ggagctgctg acccatgaca      780

ccaaggacca gagcccctca cgggcaccag ggcttcgcca gcgggccagc aacaaagtgc      840

aagattctgc ccccgtggag actcccagag ggaagccccc actcaacacc cgctcccagg      900

ctccgcttct ccgatgggtc cttacactca gctttctggt ggcgacagtt gctgtagggc      960

tttatgccat gtgaatgcag gcatgctggc tcccagggcc atgaactttg tccggtggaa     1020

ggccttcttt ctagagaggg aattctcttg gctggcttcc ttaccgtggg cactgaaggc     1080

tttcagggcc tccagccctc tcactgtgtc cctctctctg gaaaggagga aggagcctat     1140

ggcatcttcc ccaacgaaaa gcacatccag gcaatggcct aaacttcaga gggggcgaag     1200

ggatcagccc tgcccttcag catcctcagt tcctgcagca gagcctggaa gacaccctaa     1260

tgtggcagct gtctcaaacc tccaaaagcc ctgagtttca agtatccttg ttgacacggc     1320

catgaccact ttccccgtgg gccatggcaa tttttacaca aacctgaaaa gatgttgtgt     1380

cttgtgtttt tgtcttattt ttgttggagc cactctgttc ctggctcagc ctcaaatgca     1440

gtatttttgt tgtgttctgt tgtttttata gcagggttgg ggtggttttt gagccatgcg     1500

tgggtgggga gggaggtgtt taacggcact gtggccttgg tctaactttt gtgtgaaata     1560

ataaacaaca ttgtctgata gtagcttgaa aaaaaaaaaa aaaaaa                    1606


<210>  79
<211>  288
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human Heme Oxygenase-1 polypeptide GenBank Accession No.:
       NP_002124.1  GI:4504437

<400>  79

Met Glu Arg Pro Gln Pro Asp Ser Met Pro Gln Asp Leu Ser Glu Ala 
1               5                   10                  15      


Leu Lys Glu Ala Thr Lys Glu Val His Thr Gln Ala Glu Asn Ala Glu 
            20                  25                  30          


Phe Met Arg Asn Phe Gln Lys Gly Gln Val Thr Arg Asp Gly Phe Lys 
        35                  40                  45              


Leu Val Met Ala Ser Leu Tyr His Ile Tyr Val Ala Leu Glu Glu Glu 
    50                  55                  60                  


Ile Glu Arg Asn Lys Glu Ser Pro Val Phe Ala Pro Val Tyr Phe Pro 
65                  70                  75                  80  


Glu Glu Leu His Arg Lys Ala Ala Leu Glu Gln Asp Leu Ala Phe Trp 
                85                  90                  95      


Tyr Gly Pro Arg Trp Gln Glu Val Ile Pro Tyr Thr Pro Ala Met Gln 
            100                 105                 110         


Arg Tyr Val Lys Arg Leu His Glu Val Gly Arg Thr Glu Pro Glu Leu 
        115                 120                 125             


Leu Val Ala His Ala Tyr Thr Arg Tyr Leu Gly Asp Leu Ser Gly Gly 
    130                 135                 140                 


Gln Val Leu Lys Lys Ile Ala Gln Lys Ala Leu Asp Leu Pro Ser Ser 
145                 150                 155                 160 


Gly Glu Gly Leu Ala Phe Phe Thr Phe Pro Asn Ile Ala Ser Ala Thr 
                165                 170                 175     


Lys Phe Lys Gln Leu Tyr Arg Ser Arg Met Asn Ser Leu Glu Met Thr 
            180                 185                 190         


Pro Ala Val Arg Gln Arg Val Ile Glu Glu Ala Lys Thr Ala Phe Leu 
        195                 200                 205             


Leu Asn Ile Gln Leu Phe Glu Glu Leu Gln Glu Leu Leu Thr His Asp 
    210                 215                 220                 


Thr Lys Asp Gln Ser Pro Ser Arg Ala Pro Gly Leu Arg Gln Arg Ala 
225                 230                 235                 240 


Ser Asn Lys Val Gln Asp Ser Ala Pro Val Glu Thr Pro Arg Gly Lys 
                245                 250                 255     


Pro Pro Leu Asn Thr Arg Ser Gln Ala Pro Leu Leu Arg Trp Val Leu 
            260                 265                 270         


Thr Leu Ser Phe Leu Val Ala Thr Val Ala Val Gly Leu Tyr Ala Met 
        275                 280                 285             


<210>  80
<211>  5572
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human PTEN mRNA GenBank Accession No.: NM_000314.4  GI:110224474

<400>  80
cctcccctcg cccggcgcgg tcccgtccgc ctctcgctcg cctcccgcct cccctcggtc       60

ttccgaggcg cccgggctcc cggcgcggcg gcggaggggg cgggcaggcc ggcgggcggt      120

gatgtggcgg gactctttat gcgctgcggc aggatacgcg ctcggcgctg ggacgcgact      180

gcgctcagtt ctctcctctc ggaagctgca gccatgatgg aagtttgaga gttgagccgc      240

tgtgaggcga ggccgggctc aggcgaggga gatgagagac ggcggcggcc gcggcccgga      300

gcccctctca gcgcctgtga gcagccgcgg gggcagcgcc ctcggggagc cggccggcct      360

gcggcggcgg cagcggcggc gtttctcgcc tcctcttcgt cttttctaac cgtgcagcct      420

cttcctcggc ttctcctgaa agggaaggtg gaagccgtgg gctcgggcgg gagccggctg      480

aggcgcggcg gcggcggcgg cacctcccgc tcctggagcg ggggggagaa gcggcggcgg      540

cggcggccgc ggcggctgca gctccaggga gggggtctga gtcgcctgtc accatttcca      600

gggctgggaa cgccggagag ttggtctctc cccttctact gcctccaaca cggcggcggc      660

ggcggcggca catccaggga cccgggccgg ttttaaacct cccgtccgcc gccgccgcac      720

cccccgtggc ccgggctccg gaggccgccg gcggaggcag ccgttcggag gattattcgt      780

cttctcccca ttccgctgcc gccgctgcca ggcctctggc tgctgaggag aagcaggccc      840

agtcgctgca accatccagc agccgccgca gcagccatta cccggctgcg gtccagagcc      900

aagcggcggc agagcgaggg gcatcagcta ccgccaagtc cagagccatt tccatcctgc      960

agaagaagcc ccgccaccag cagcttctgc catctctctc ctcctttttc ttcagccaca     1020

ggctcccaga catgacagcc atcatcaaag agatcgttag cagaaacaaa aggagatatc     1080

aagaggatgg attcgactta gacttgacct atatttatcc aaacattatt gctatgggat     1140

ttcctgcaga aagacttgaa ggcgtataca ggaacaatat tgatgatgta gtaaggtttt     1200

tggattcaaa gcataaaaac cattacaaga tatacaatct ttgtgctgaa agacattatg     1260

acaccgccaa atttaattgc agagttgcac aatatccttt tgaagaccat aacccaccac     1320

agctagaact tatcaaaccc ttttgtgaag atcttgacca atggctaagt gaagatgaca     1380

atcatgttgc agcaattcac tgtaaagctg gaaagggacg aactggtgta atgatatgtg     1440

catatttatt acatcggggc aaatttttaa aggcacaaga ggccctagat ttctatgggg     1500

aagtaaggac cagagacaaa aagggagtaa ctattcccag tcagaggcgc tatgtgtatt     1560

attatagcta cctgttaaag aatcatctgg attatagacc agtggcactg ttgtttcaca     1620

agatgatgtt tgaaactatt ccaatgttca gtggcggaac ttgcaatcct cagtttgtgg     1680

tctgccagct aaaggtgaag atatattcct ccaattcagg acccacacga cgggaagaca     1740

agttcatgta ctttgagttc cctcagccgt tacctgtgtg tggtgatatc aaagtagagt     1800

tcttccacaa acagaacaag atgctaaaaa aggacaaaat gtttcacttt tgggtaaata     1860

cattcttcat accaggacca gaggaaacct cagaaaaagt agaaaatgga agtctatgtg     1920

atcaagaaat cgatagcatt tgcagtatag agcgtgcaga taatgacaag gaatatctag     1980

tacttacttt aacaaaaaat gatcttgaca aagcaaataa agacaaagcc aaccgatact     2040

tttctccaaa ttttaaggtg aagctgtact tcacaaaaac agtagaggag ccgtcaaatc     2100

cagaggctag cagttcaact tctgtaacac cagatgttag tgacaatgaa cctgatcatt     2160

atagatattc tgacaccact gactctgatc cagagaatga accttttgat gaagatcagc     2220

atacacaaat tacaaaagtc tgaatttttt tttatcaaga gggataaaac accatgaaaa     2280

taaacttgaa taaactgaaa atggaccttt ttttttttaa tggcaatagg acattgtgtc     2340

agattaccag ttataggaac aattctcttt tcctgaccaa tcttgtttta ccctatacat     2400

ccacagggtt ttgacacttg ttgtccagtt gaaaaaaggt tgtgtagctg tgtcatgtat     2460

ataccttttt gtgtcaaaag gacatttaaa attcaattag gattaataaa gatggcactt     2520

tcccgtttta ttccagtttt ataaaaagtg gagacagact gatgtgtata cgtaggaatt     2580

ttttcctttt gtgttctgtc accaactgaa gtggctaaag agctttgtga tatactggtt     2640

cacatcctac ccctttgcac ttgtggcaac agataagttt gcagttggct aagagaggtt     2700

tccgaagggt tttgctacat tctaatgcat gtattcgggt taggggaatg gagggaatgc     2760

tcagaaagga aataatttta tgctggactc tggaccatat accatctcca gctatttaca     2820

cacacctttc tttagcatgc tacagttatt aatctggaca ttcgaggaat tggccgctgt     2880

cactgcttgt tgtttgcgca ttttttttta aagcatattg gtgctagaaa aggcagctaa     2940

aggaagtgaa tctgtattgg ggtacaggaa tgaaccttct gcaacatctt aagatccaca     3000

aatgaaggga tataaaaata atgtcatagg taagaaacac agcaacaatg acttaaccat     3060

ataaatgtgg aggctatcaa caaagaatgg gcttgaaaca ttataaaaat tgacaatgat     3120

ttattaaata tgttttctca attgtaacga cttctccatc tcctgtgtaa tcaaggccag     3180

tgctaaaatt cagatgctgt tagtacctac atcagtcaac aacttacact tattttacta     3240

gttttcaatc ataatacctg ctgtggatgc ttcatgtgct gcctgcaagc ttcttttttc     3300

tcattaaata taaaatattt tgtaatgctg cacagaaatt ttcaatttga gattctacag     3360

taagcgtttt ttttctttga agatttatga tgcacttatt caatagctgt cagccgttcc     3420

acccttttga ccttacacat tctattacaa tgaattttgc agttttgcac attttttaaa     3480

tgtcattaac tgttagggaa ttttacttga atactgaata catataatgt ttatattaaa     3540

aaggacattt gtgttaaaaa ggaaattaga gttgcagtaa actttcaatg ctgcacacaa     3600

aaaaaagaca tttgattttt cagtagaaat tgtcctacat gtgctttatt gatttgctat     3660

tgaaagaata gggttttttt tttttttttt tttttttttt ttaaatgtgc agtgttgaat     3720

catttcttca tagtgctccc ccgagttggg actagggctt caatttcact tcttaaaaaa     3780

aatcatcata tatttgatat gcccagactg catacgattt taagcggagt acaactacta     3840

ttgtaaagct aatgtgaaga tattattaaa aaggtttttt tttccagaaa tttggtgtct     3900

tcaaattata ccttcacctt gacatttgaa tatccagcca ttttgtttct taatggtata     3960

aaattccatt ttcaataact tattggtgct gaaattgttc actagctgtg gtctgaccta     4020

gttaatttac aaatacagat tgaataggac ctactagagc agcatttata gagtttgatg     4080

gcaaatagat taggcagaac ttcatctaaa atattcttag taaataatgt tgacacgttt     4140

tccatacctt gtcagtttca ttcaacaatt tttaaatttt taacaaagct cttaggattt     4200

acacatttat atttaaacat tgatatatag agtattgatt gattgctcat aagttaaatt     4260

ggtaaagtta gagacaacta ttctaacacc tcaccattga aatttatatg ccaccttgtc     4320

tttcataaaa gctgaaaatt gttacctaaa atgaaaatca acttcatgtt ttgaagatag     4380

ttataaatat tgttctttgt tacaatttcg ggcaccgcat attaaaacgt aactttattg     4440

ttccaatatg taacatggag ggccaggtca taaataatga cattataatg ggcttttgca     4500

ctgttattat ttttcctttg gaatgtgaag gtctgaatga gggttttgat tttgaatgtt     4560

tcaatgtttt tgagaagcct tgcttacatt ttatggtgta gtcattggaa atggaaaaat     4620

ggcattatat atattatata tataaatata tattatacat actctcctta ctttatttca     4680

gttaccatcc ccatagaatt tgacaagaat tgctatgact gaaaggtttt cgagtcctaa     4740

ttaaaacttt atttatggca gtattcataa ttagcctgaa atgcattctg taggtaatct     4800

ctgagtttct ggaatatttt cttagacttt ttggatgtgc agcagcttac atgtctgaag     4860

ttacttgaag gcatcacttt taagaaagct tacagttggg ccctgtacca tcccaagtcc     4920

tttgtagctc ctcttgaaca tgtttgccat acttttaaaa gggtagttga ataaatagca     4980

tcaccattct ttgctgtggc acaggttata aacttaagtg gagtttaccg gcagcatcaa     5040

atgtttcagc tttaaaaaat aaaagtaggg tacaagttta atgtttagtt ctagaaattt     5100

tgtgcaatat gttcataacg atggctgtgg ttgccacaaa gtgcctcgtt tacctttaaa     5160

tactgttaat gtgtcatgca tgcagatgga aggggtggaa ctgtgcacta aagtgggggc     5220

tttaactgta gtatttggca gagttgcctt ctacctgcca gttcaaaagt tcaacctgtt     5280

ttcatataga atatatatac taaaaaattt cagtctgtta aacagcctta ctctgattca     5340

gcctcttcag atactcttgt gctgtgcagc agtggctctg tgtgtaaatg ctatgcactg     5400

aggatacaca aaaataccaa tatgatgtgt acaggataat gcctcatccc aatcagatgt     5460

ccatttgtta ttgtgtttgt taacaaccct ttatctctta gtgttataaa ctccacttaa     5520

aactgattaa agtctcattc ttgtcaaaaa aaaaaaaaaa aaaaaaaaaa aa             5572


<210>  81
<211>  403
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human PTEN polypeptide GenBank Accession No.: NP_000305.3  
       GI:73765544

<400>  81

Met Thr Ala Ile Ile Lys Glu Ile Val Ser Arg Asn Lys Arg Arg Tyr 
1               5                   10                  15      


Gln Glu Asp Gly Phe Asp Leu Asp Leu Thr Tyr Ile Tyr Pro Asn Ile 
            20                  25                  30          


Ile Ala Met Gly Phe Pro Ala Glu Arg Leu Glu Gly Val Tyr Arg Asn 
        35                  40                  45              


Asn Ile Asp Asp Val Val Arg Phe Leu Asp Ser Lys His Lys Asn His 
    50                  55                  60                  


Tyr Lys Ile Tyr Asn Leu Cys Ala Glu Arg His Tyr Asp Thr Ala Lys 
65                  70                  75                  80  


Phe Asn Cys Arg Val Ala Gln Tyr Pro Phe Glu Asp His Asn Pro Pro 
                85                  90                  95      


Gln Leu Glu Leu Ile Lys Pro Phe Cys Glu Asp Leu Asp Gln Trp Leu 
            100                 105                 110         


Ser Glu Asp Asp Asn His Val Ala Ala Ile His Cys Lys Ala Gly Lys 
        115                 120                 125             


Gly Arg Thr Gly Val Met Ile Cys Ala Tyr Leu Leu His Arg Gly Lys 
    130                 135                 140                 


Phe Leu Lys Ala Gln Glu Ala Leu Asp Phe Tyr Gly Glu Val Arg Thr 
145                 150                 155                 160 


Arg Asp Lys Lys Gly Val Thr Ile Pro Ser Gln Arg Arg Tyr Val Tyr 
                165                 170                 175     


Tyr Tyr Ser Tyr Leu Leu Lys Asn His Leu Asp Tyr Arg Pro Val Ala 
            180                 185                 190         


Leu Leu Phe His Lys Met Met Phe Glu Thr Ile Pro Met Phe Ser Gly 
        195                 200                 205             


Gly Thr Cys Asn Pro Gln Phe Val Val Cys Gln Leu Lys Val Lys Ile 
    210                 215                 220                 


Tyr Ser Ser Asn Ser Gly Pro Thr Arg Arg Glu Asp Lys Phe Met Tyr 
225                 230                 235                 240 


Phe Glu Phe Pro Gln Pro Leu Pro Val Cys Gly Asp Ile Lys Val Glu 
                245                 250                 255     


Phe Phe His Lys Gln Asn Lys Met Leu Lys Lys Asp Lys Met Phe His 
            260                 265                 270         


Phe Trp Val Asn Thr Phe Phe Ile Pro Gly Pro Glu Glu Thr Ser Glu 
        275                 280                 285             


Lys Val Glu Asn Gly Ser Leu Cys Asp Gln Glu Ile Asp Ser Ile Cys 
    290                 295                 300                 


Ser Ile Glu Arg Ala Asp Asn Asp Lys Glu Tyr Leu Val Leu Thr Leu 
305                 310                 315                 320 


Thr Lys Asn Asp Leu Asp Lys Ala Asn Lys Asp Lys Ala Asn Arg Tyr 
                325                 330                 335     


Phe Ser Pro Asn Phe Lys Val Lys Leu Tyr Phe Thr Lys Thr Val Glu 
            340                 345                 350         


Glu Pro Ser Asn Pro Glu Ala Ser Ser Ser Thr Ser Val Thr Pro Asp 
        355                 360                 365             


Val Ser Asp Asn Glu Pro Asp His Tyr Arg Tyr Ser Asp Thr Thr Asp 
    370                 375                 380                 


Ser Asp Pro Glu Asn Glu Pro Phe Asp Glu Asp Gln His Thr Gln Ile 
385                 390                 395                 400 


Thr Lys Val 
            


<210>  82
<211>  8585
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human ARID1A mRNA GenBank Accession No.: NM_006015.4  
       GI:117968607 Homo sapiens AT rich interactive domain 1A 
       (SWI-like) (ARID1A), transcript variant 1, mRNA

<400>  82
cagaaagcgg agagtcacag cggggccagg ccctggggag cggagcctcc accgcccccc       60

tcattcccag gcaagggctt ggggggaatg agccgggaga gccgggtccc gagcctacag      120

agccgggagc agctgagccg ccggcgcctc ggccgccgcc gccgcctcct cctcctccgc      180

cgccgccagc ccggagcctg agccggcggg gcggggggga gaggagcgag cgcagcgcag      240

cagcggagcc ccgcgaggcc cgcccgggcg ggtggggagg gcagcccggg ggactgggcc      300

ccggggcggg gtgggagggg gggagaagac gaagacaggg ccgggtctct ccgcggacga      360

gacagcgggg atcatggccg cgcaggtcgc ccccgccgcc gccagcagcc tgggcaaccc      420

gccgccgccg ccgccctcgg agctgaagaa agccgagcag cagcagcggg aggaggcggg      480

gggcgaggcg gcggcggcgg cagcggccga gcgcggggaa atgaaggcag ccgccgggca      540

ggaaagcgag ggccccgccg tggggccgcc gcagccgctg ggaaaggagc tgcaggacgg      600

ggccgagagc aatgggggtg gcggcggcgg cggagccggc agcggcggcg ggcccggcgc      660

ggagccggac ctgaagaact cgaacgggaa cgcgggccct aggcccgccc tgaacaataa      720

cctcacggag ccgcccggcg gcggcggtgg cggcagcagc gatggggtgg gggcgcctcc      780

tcactcagcc gcggccgcct tgccgccccc agcctacggc ttcgggcaac cctacggccg      840

gagcccgtct gccgtcgccg ccgccgcggc cgccgtcttc caccaacaac atggcggaca      900

acaaagccct ggcctggcag cgctgcagag cggcggcggc gggggcctgg agccctacgc      960

ggggccccag cagaactctc acgaccacgg cttccccaac caccagtaca actcctacta     1020

ccccaaccgc agcgcctacc ccccgcccgc cccggcctac gcgctgagct ccccgagagg     1080

tggcactccg ggctccggcg cggcggcggc tgccggctcc aagccgcctc cctcctccag     1140

cgcctccgcc tcctcgtcgt cttcgtcctt cgctcagcag cgcttcgggg ccatgggggg     1200

aggcggcccc tccgcggccg gcgggggaac tccccagccc accgccaccc ccaccctcaa     1260

ccaactgctc acgtcgccca gctcggcccg gggctaccag ggctaccccg ggggcgacta     1320

cagtggcggg ccccaggacg ggggcgccgg caagggcccg gcggacatgg cctcgcagtg     1380

ttggggggct gcggcggcgg cagctgcggc ggcggccgcc tcgggagggg cccaacaaag     1440

gagccaccac gcgcccatga gccccgggag cagcggcggc ggggggcagc cgctcgcccg     1500

gacccctcag ccatccagtc caatggatca gatgggcaag atgagacctc agccatatgg     1560

cgggactaac ccatactcgc agcaacaggg acctccgtca ggaccgcagc aaggacatgg     1620

gtacccaggg cagccatacg ggtcccagac cccgcagcgg tacccgatga ccatgcaggg     1680

ccgggcgcag agtgccatgg gcggcctctc ttatacacag cagattcctc cttatggaca     1740

acaaggcccc agcgggtatg gtcaacaggg ccagactcca tattacaacc agcaaagtcc     1800

tcaccctcag cagcagcagc caccctactc ccagcaacca ccgtcccaga cccctcatgc     1860

ccaaccttcg tatcagcagc agccacagtc tcaaccacca cagctccagt cctctcagcc     1920

tccatactcc cagcagccat cccagcctcc acatcagcag tccccggctc catacccctc     1980

ccagcagtcg acgacacagc agcaccccca gagccagccc ccctactcac agccacaggc     2040

tcagtctcct taccagcagc agcaacctca gcagccagca ccctcgacgc tctcccagca     2100

ggctgcgtat cctcagcccc agtctcagca gtcccagcaa actgcctatt cccagcagcg     2160

cttccctcca ccgcaggagc tatctcaaga ttcatttggg tctcaggcat cctcagcccc     2220

ctcaatgacc tccagtaagg gagggcaaga agatatgaac ctgagccttc agtcaagacc     2280

ctccagcttg cctgatctat ctggttcaat agatgacctc cccatgggga cagaaggagc     2340

tctgagtcct ggagtgagca catcagggat ttccagcagc caaggagagc agagtaatcc     2400

agctcagtct cctttctctc ctcatacctc ccctcacctg cctggcatcc gaggcccttc     2460

cccgtcccct gttggctctc ccgccagtgt tgctcagtct cgctcaggac cactctcgcc     2520

tgctgcagtg ccaggcaacc agatgccacc tcggccaccc agtggccagt cggacagcat     2580

catgcatcct tccatgaacc aatcaagcat tgcccaagat cgaggttata tgcagaggaa     2640

cccccagatg ccccagtaca gttcccccca gcccggctca gccttatctc cgcgtcagcc     2700

ttccggagga cagatacaca caggcatggg ctcctaccag cagaactcca tggggagcta     2760

tggtccccag gggggtcagt atggcccaca aggtggctac cccaggcagc caaactataa     2820

tgccttgccc aatgccaact accccagtgc aggcatggct ggaggcataa accccatggg     2880

tgccggaggt caaatgcatg gacagcctgg catcccacct tatggcacac tccctccagg     2940

gaggatgagt cacgcctcca tgggcaaccg gccttatggc cctaacatgg ccaatatgcc     3000

acctcaggtt gggtcaggga tgtgtccccc accagggggc atgaaccgga aaacccaaga     3060

aactgctgtc gccatgcatg ttgctgccaa ctctatccaa aacaggccgc caggctaccc     3120

caatatgaat caagggggca tgatgggaac tggacctcct tatggacaag ggattaatag     3180

tatggctggc atgatcaacc ctcagggacc cccatattcc atgggtggaa ccatggccaa     3240

caattctgca gggatggcag ccagcccaga gatgatgggc cttggggatg taaagttaac     3300

tccagccacc aaaatgaaca acaaggcaga tgggacaccc aagacagaat ccaaatccaa     3360

gaaatccagt tcttctacta caaccaatga gaagatcacc aagttgtatg agctgggtgg     3420

tgagcctgag aggaagatgt gggtggaccg ttatctggcc ttcactgagg agaaggccat     3480

gggcatgaca aatctgcctg ctgtgggtag gaaacctctg gacctctatc gcctctatgt     3540

gtctgtgaag gagattggtg gattgactca ggtcaacaag aacaaaaaat ggcgggaact     3600

tgcaaccaac ctcaatgtgg gcacatcaag cagtgctgcc agctccttga aaaagcagta     3660

tatccagtgt ctctatgcct ttgaatgcaa gattgaacgg ggagaagacc ctcccccaga     3720

catctttgca gctgctgatt ccaagaagtc ccagcccaag atccagcctc cctctcctgc     3780

gggatcagga tctatgcagg ggccccagac tccccagtca accagcagtt ccatggcaga     3840

aggaggagac ttaaagccac caactccagc atccacacca cacagtcaga tccccccatt     3900

gccaggcatg agcaggagca attcagttgg gatccaggat gcctttaatg atggaagtga     3960

ctccacattc cagaagcgga attccatgac tccaaaccct gggtatcagc ccagtatgaa     4020

tacctctgac atgatggggc gcatgtccta tgagccaaat aaggatcctt atggcagcat     4080

gaggaaagct ccagggagtg atcccttcat gtcctcaggg cagggcccca acggcgggat     4140

gggtgacccc tacagtcgtg ctgccggccc tgggctagga aatgtggcga tgggaccacg     4200

acagcactat ccctatggag gtccttatga cagagtgagg acggagcctg gaatagggcc     4260

tgagggaaac atgagcactg gggccccaca gccgaatctc atgccttcca acccagactc     4320

ggggatgtat tctcctagcc gctacccccc gcagcagcag cagcagcagc agcaacgaca     4380

tgattcctat ggcaatcagt tctccaccca aggcacccct tctggcagcc ccttccccag     4440

ccagcagact acaatgtatc aacagcaaca gcagaattac aagcggccaa tggatggcac     4500

atatggccct cctgccaagc ggcacgaagg ggagatgtac agcgtgccat acagcactgg     4560

gcaggggcag cctcagcagc agcagttgcc cccagcccag ccccagcctg ccagccagca     4620

acaagctgcc cagccttccc ctcagcaaga tgtatacaac cagtatggca atgcctatcc     4680

tgccactgcc acagctgcta ctgagcgccg accagcaggc ggcccccaga accaatttcc     4740

attccagttt ggccgagacc gtgtctctgc accccctggc accaatgccc agcaaaacat     4800

gccaccacaa atgatgggcg gccccataca ggcatcagct gaggttgctc agcaaggcac     4860

catgtggcag gggcgtaatg acatgaccta taattatgcc aacaggcaga gcacgggctc     4920

tgccccccag ggccccgcct atcatggcgt gaaccgaaca gatgaaatgc tgcacacaga     4980

tcagagggcc aaccacgaag gctcgtggcc ttcccatggc acacgccagc ccccatatgg     5040

tccctctgcc cctgtgcccc ccatgacaag gccccctcca tctaactacc agcccccacc     5100

aagcatgcag aatcacattc ctcaggtatc cagccctgct cccctgcccc ggccaatgga     5160

gaaccgcacc tctcctagca agtctccatt cctgcactct gggatgaaaa tgcagaaggc     5220

aggtccccca gtacctgcct cgcacatagc acctgcccct gtgcagcccc ccatgattcg     5280

gcgggatatc accttcccac ctggctctgt tgaagccaca cagcctgtgt tgaagcagag     5340

gaggcggctc acaatgaaag acattggaac cccggaggca tggcgggtaa tgatgtccct     5400

caagtctggt ctcctggcag agagcacatg ggcattagat accatcaaca tcctgctgta     5460

tgatgacaac agcatcatga ccttcaacct cagtcagctc ccagggttgc tagagctcct     5520

tgtagaatat ttccgacgat gcctgattga gatctttggc attttaaagg agtatgaggt     5580

gggtgaccca ggacagagaa cgctactgga tcctgggagg ttcagcaagg tgtctagtcc     5640

agctcccatg gagggtgggg aagaagaaga agaacttcta ggtcctaaac tagaagagga     5700

agaagaagag gaagtagttg aaaatgatga ggagatagcc ttttcaggca aggacaagcc     5760

agcttcagag aatagtgagg agaagctgat cagtaagttt gacaagcttc cagtaaagat     5820

cgtacagaag aatgatccat ttgtggtgga ctgctcagat aagcttgggc gtgtgcagga     5880

gtttgacagt ggcctgctgc actggcggat tggtgggggg gacaccactg agcatatcca     5940

gacccacttc gagagcaaga cagagctgct gccttcccgg cctcacgcac cctgcccacc     6000

agcccctcgg aagcatgtga caacagcaga gggtacacca gggacaacag accaggaggg     6060

gcccccacct gatggacctc cagaaaaacg gatcacagcc actatggatg acatgttgtc     6120

tactcggtct agcaccttga ccgaggatgg agctaagagt tcagaggcca tcaaggagag     6180

cagcaagttt ccatttggca ttagcccagc acagagccac cggaacatca agatcctaga     6240

ggacgaaccc cacagtaagg atgagacccc actgtgtacc cttctggact ggcaggattc     6300

tcttgccaag cgctgcgtct gtgtgtccaa taccattcga agcctgtcat ttgtgccagg     6360

caatgacttt gagatgtcca aacacccagg gctgctgctc atcctgggca agctgatcct     6420

gctgcaccac aagcacccag aacggaagca ggcaccacta acttatgaaa aggaggagga     6480

acaggaccaa ggggtgagct gcaacaaagt ggagtggtgg tgggactgct tggagatgct     6540

ccgggaaaac accttggtta cactcgccaa catctcgggg cagttggacc tatctccata     6600

ccccgagagc atttgcctgc ctgtcctgga cggactccta cactgggcag tttgcccttc     6660

agctgaagcc caggacccct tttccaccct gggccccaat gccgtccttt ccccgcagag     6720

actggtcttg gaaaccctca gcaaactcag catccaggac aacaatgtgg acctgattct     6780

ggccacaccc cccttcagcc gcctggagaa gttgtatagc actatggtgc gcttcctcag     6840

tgaccgaaag aacccggtgt gccgggagat ggctgtggta ctgctggcca acctggctca     6900

gggggacagc ctggcagctc gtgccattgc agtgcagaag ggcagtatcg gcaacctcct     6960

gggcttccta gaggacagcc ttgccgccac acagttccag cagagccagg ccagcctcct     7020

ccacatgcag aacccaccct ttgagccaac tagtgtggac atgatgcggc gggctgcccg     7080

cgcgctgctt gccttggcca aggtggacga gaaccactca gagtttactc tgtacgaatc     7140

acggctgttg gacatctcgg tatcaccgtt gatgaactca ttggtttcac aagtcatttg     7200

tgatgtactg tttttgattg gccagtcatg acagccgtgg gacacctccc ccccccgtgt     7260

gtgtgtgcgt gtgtggagaa cttagaaact gactgttgcc ctttatttat gcaaaaccac     7320

ctcagaatcc agtttaccct gtgctgtcca gcttctccct tgggaaaaag tctctcctgt     7380

ttctctctcc tccttccacc tcccctccct ccatcacctc acgcctttct gttccttgtc     7440

ctcaccttac tcccctcagg accctacccc accctctttg aaaagacaaa gctctgccta     7500

catagaagac tttttttatt ttaaccaaag ttactgttgt ttacagtgag tttggggaaa     7560

aaaaataaaa taaaaatggc tttcccagtc cttgcatcaa cgggatgcca catttcataa     7620

ctgtttttaa tggtaaaaaa aaaaaaaaaa aatacaaaaa aaaattctga aggacaaaaa     7680

aggtgactgc tgaactgtgt gtggtttatt gttgtacatt cacaatcttg caggagccaa     7740

gaagttcgca gttgtgaaca gaccctgttc actggagagg cctgtgcagt agagtgtaga     7800

ccctttcatg tactgtactg tacacctgat actgtaaaca tactgtaata ataatgtctc     7860

acatggaaac agaaaacgct gggtcagcag caagctgtag tttttaaaaa tgtttttagt     7920

taaacgttga ggagaaaaaa aaaaaaggct tttcccccaa agtatcatgt gtgaacctac     7980

aacaccctga cctctttctc tcctccttga ttgtatgaat aaccctgaga tcacctctta     8040

gaactggttt taacctttag ctgcagcggc tacgctgcca cgtgtgtata tatatgacgt     8100

tgtacattgc acataccctt ggatccccac agtttggtcc tcctcccagc taccccttta     8160

tagtatgacg agttaacaag ttggtgacct gcacaaagcg agacacagct atttaatctc     8220

ttgccagata tcgcccctct tggtgcgatg ctgtacaggt ctctgtaaaa agtccttgct     8280

gtctcagcag ccaatcaact tatagtttat ttttttctgg gtttttgttt tgttttgttt     8340

tctttctaat cgaggtgtga aaaagttcta ggttcagttg aagttctgat gaagaaacac     8400

aattgagatt ttttcagtga taaaatctgc atatttgtat ttcaacaatg tagctaaaac     8460

ttgatgtaaa ttcctccttt ttttcctttt ttggcttaat gaatatcatt tattcagtat     8520

gaaatcttta tactatatgt tccacgtgtt aagaataaat gtacattaaa tcttggtaag     8580

acttt                                                                 8585


<210>  83
<211>  2285
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human ARID1A polypeptide GenBank Accession No.: NP_006006.3  
       GI:21264565

<400>  83

Met Ala Ala Gln Val Ala Pro Ala Ala Ala Ser Ser Leu Gly Asn Pro 
1               5                   10                  15      


Pro Pro Pro Pro Pro Ser Glu Leu Lys Lys Ala Glu Gln Gln Gln Arg 
            20                  25                  30          


Glu Glu Ala Gly Gly Glu Ala Ala Ala Ala Ala Ala Ala Glu Arg Gly 
        35                  40                  45              


Glu Met Lys Ala Ala Ala Gly Gln Glu Ser Glu Gly Pro Ala Val Gly 
    50                  55                  60                  


Pro Pro Gln Pro Leu Gly Lys Glu Leu Gln Asp Gly Ala Glu Ser Asn 
65                  70                  75                  80  


Gly Gly Gly Gly Gly Gly Gly Ala Gly Ser Gly Gly Gly Pro Gly Ala 
                85                  90                  95      


Glu Pro Asp Leu Lys Asn Ser Asn Gly Asn Ala Gly Pro Arg Pro Ala 
            100                 105                 110         


Leu Asn Asn Asn Leu Thr Glu Pro Pro Gly Gly Gly Gly Gly Gly Ser 
        115                 120                 125             


Ser Asp Gly Val Gly Ala Pro Pro His Ser Ala Ala Ala Ala Leu Pro 
    130                 135                 140                 


Pro Pro Ala Tyr Gly Phe Gly Gln Pro Tyr Gly Arg Ser Pro Ser Ala 
145                 150                 155                 160 


Val Ala Ala Ala Ala Ala Ala Val Phe His Gln Gln His Gly Gly Gln 
                165                 170                 175     


Gln Ser Pro Gly Leu Ala Ala Leu Gln Ser Gly Gly Gly Gly Gly Leu 
            180                 185                 190         


Glu Pro Tyr Ala Gly Pro Gln Gln Asn Ser His Asp His Gly Phe Pro 
        195                 200                 205             


Asn His Gln Tyr Asn Ser Tyr Tyr Pro Asn Arg Ser Ala Tyr Pro Pro 
    210                 215                 220                 


Pro Ala Pro Ala Tyr Ala Leu Ser Ser Pro Arg Gly Gly Thr Pro Gly 
225                 230                 235                 240 


Ser Gly Ala Ala Ala Ala Ala Gly Ser Lys Pro Pro Pro Ser Ser Ser 
                245                 250                 255     


Ala Ser Ala Ser Ser Ser Ser Ser Ser Phe Ala Gln Gln Arg Phe Gly 
            260                 265                 270         


Ala Met Gly Gly Gly Gly Pro Ser Ala Ala Gly Gly Gly Thr Pro Gln 
        275                 280                 285             


Pro Thr Ala Thr Pro Thr Leu Asn Gln Leu Leu Thr Ser Pro Ser Ser 
    290                 295                 300                 


Ala Arg Gly Tyr Gln Gly Tyr Pro Gly Gly Asp Tyr Ser Gly Gly Pro 
305                 310                 315                 320 


Gln Asp Gly Gly Ala Gly Lys Gly Pro Ala Asp Met Ala Ser Gln Cys 
                325                 330                 335     


Trp Gly Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ser Gly Gly 
            340                 345                 350         


Ala Gln Gln Arg Ser His His Ala Pro Met Ser Pro Gly Ser Ser Gly 
        355                 360                 365             


Gly Gly Gly Gln Pro Leu Ala Arg Thr Pro Gln Pro Ser Ser Pro Met 
    370                 375                 380                 


Asp Gln Met Gly Lys Met Arg Pro Gln Pro Tyr Gly Gly Thr Asn Pro 
385                 390                 395                 400 


Tyr Ser Gln Gln Gln Gly Pro Pro Ser Gly Pro Gln Gln Gly His Gly 
                405                 410                 415     


Tyr Pro Gly Gln Pro Tyr Gly Ser Gln Thr Pro Gln Arg Tyr Pro Met 
            420                 425                 430         


Thr Met Gln Gly Arg Ala Gln Ser Ala Met Gly Gly Leu Ser Tyr Thr 
        435                 440                 445             


Gln Gln Ile Pro Pro Tyr Gly Gln Gln Gly Pro Ser Gly Tyr Gly Gln 
    450                 455                 460                 


Gln Gly Gln Thr Pro Tyr Tyr Asn Gln Gln Ser Pro His Pro Gln Gln 
465                 470                 475                 480 


Gln Gln Pro Pro Tyr Ser Gln Gln Pro Pro Ser Gln Thr Pro His Ala 
                485                 490                 495     


Gln Pro Ser Tyr Gln Gln Gln Pro Gln Ser Gln Pro Pro Gln Leu Gln 
            500                 505                 510         


Ser Ser Gln Pro Pro Tyr Ser Gln Gln Pro Ser Gln Pro Pro His Gln 
        515                 520                 525             


Gln Ser Pro Ala Pro Tyr Pro Ser Gln Gln Ser Thr Thr Gln Gln His 
    530                 535                 540                 


Pro Gln Ser Gln Pro Pro Tyr Ser Gln Pro Gln Ala Gln Ser Pro Tyr 
545                 550                 555                 560 


Gln Gln Gln Gln Pro Gln Gln Pro Ala Pro Ser Thr Leu Ser Gln Gln 
                565                 570                 575     


Ala Ala Tyr Pro Gln Pro Gln Ser Gln Gln Ser Gln Gln Thr Ala Tyr 
            580                 585                 590         


Ser Gln Gln Arg Phe Pro Pro Pro Gln Glu Leu Ser Gln Asp Ser Phe 
        595                 600                 605             


Gly Ser Gln Ala Ser Ser Ala Pro Ser Met Thr Ser Ser Lys Gly Gly 
    610                 615                 620                 


Gln Glu Asp Met Asn Leu Ser Leu Gln Ser Arg Pro Ser Ser Leu Pro 
625                 630                 635                 640 


Asp Leu Ser Gly Ser Ile Asp Asp Leu Pro Met Gly Thr Glu Gly Ala 
                645                 650                 655     


Leu Ser Pro Gly Val Ser Thr Ser Gly Ile Ser Ser Ser Gln Gly Glu 
            660                 665                 670         


Gln Ser Asn Pro Ala Gln Ser Pro Phe Ser Pro His Thr Ser Pro His 
        675                 680                 685             


Leu Pro Gly Ile Arg Gly Pro Ser Pro Ser Pro Val Gly Ser Pro Ala 
    690                 695                 700                 


Ser Val Ala Gln Ser Arg Ser Gly Pro Leu Ser Pro Ala Ala Val Pro 
705                 710                 715                 720 


Gly Asn Gln Met Pro Pro Arg Pro Pro Ser Gly Gln Ser Asp Ser Ile 
                725                 730                 735     


Met His Pro Ser Met Asn Gln Ser Ser Ile Ala Gln Asp Arg Gly Tyr 
            740                 745                 750         


Met Gln Arg Asn Pro Gln Met Pro Gln Tyr Ser Ser Pro Gln Pro Gly 
        755                 760                 765             


Ser Ala Leu Ser Pro Arg Gln Pro Ser Gly Gly Gln Ile His Thr Gly 
    770                 775                 780                 


Met Gly Ser Tyr Gln Gln Asn Ser Met Gly Ser Tyr Gly Pro Gln Gly 
785                 790                 795                 800 


Gly Gln Tyr Gly Pro Gln Gly Gly Tyr Pro Arg Gln Pro Asn Tyr Asn 
                805                 810                 815     


Ala Leu Pro Asn Ala Asn Tyr Pro Ser Ala Gly Met Ala Gly Gly Ile 
            820                 825                 830         


Asn Pro Met Gly Ala Gly Gly Gln Met His Gly Gln Pro Gly Ile Pro 
        835                 840                 845             


Pro Tyr Gly Thr Leu Pro Pro Gly Arg Met Ser His Ala Ser Met Gly 
    850                 855                 860                 


Asn Arg Pro Tyr Gly Pro Asn Met Ala Asn Met Pro Pro Gln Val Gly 
865                 870                 875                 880 


Ser Gly Met Cys Pro Pro Pro Gly Gly Met Asn Arg Lys Thr Gln Glu 
                885                 890                 895     


Thr Ala Val Ala Met His Val Ala Ala Asn Ser Ile Gln Asn Arg Pro 
            900                 905                 910         


Pro Gly Tyr Pro Asn Met Asn Gln Gly Gly Met Met Gly Thr Gly Pro 
        915                 920                 925             


Pro Tyr Gly Gln Gly Ile Asn Ser Met Ala Gly Met Ile Asn Pro Gln 
    930                 935                 940                 


Gly Pro Pro Tyr Ser Met Gly Gly Thr Met Ala Asn Asn Ser Ala Gly 
945                 950                 955                 960 


Met Ala Ala Ser Pro Glu Met Met Gly Leu Gly Asp Val Lys Leu Thr 
                965                 970                 975     


Pro Ala Thr Lys Met Asn Asn Lys Ala Asp Gly Thr Pro Lys Thr Glu 
            980                 985                 990         


Ser Lys Ser Lys Lys Ser Ser Ser  Ser Thr Thr Thr Asn  Glu Lys Ile 
        995                 1000                 1005             


Thr Lys  Leu Tyr Glu Leu Gly  Gly Glu Pro Glu Arg  Lys Met Trp 
    1010                 1015                 1020             


Val Asp  Arg Tyr Leu Ala Phe  Thr Glu Glu Lys Ala  Met Gly Met 
    1025                 1030                 1035             


Thr Asn  Leu Pro Ala Val Gly  Arg Lys Pro Leu Asp  Leu Tyr Arg 
    1040                 1045                 1050             


Leu Tyr  Val Ser Val Lys Glu  Ile Gly Gly Leu Thr  Gln Val Asn 
    1055                 1060                 1065             


Lys Asn  Lys Lys Trp Arg Glu  Leu Ala Thr Asn Leu  Asn Val Gly 
    1070                 1075                 1080             


Thr Ser  Ser Ser Ala Ala Ser  Ser Leu Lys Lys Gln  Tyr Ile Gln 
    1085                 1090                 1095             


Cys Leu  Tyr Ala Phe Glu Cys  Lys Ile Glu Arg Gly  Glu Asp Pro 
    1100                 1105                 1110             


Pro Pro  Asp Ile Phe Ala Ala  Ala Asp Ser Lys Lys  Ser Gln Pro 
    1115                 1120                 1125             


Lys Ile  Gln Pro Pro Ser Pro  Ala Gly Ser Gly Ser  Met Gln Gly 
    1130                 1135                 1140             


Pro Gln  Thr Pro Gln Ser Thr  Ser Ser Ser Met Ala  Glu Gly Gly 
    1145                 1150                 1155             


Asp Leu  Lys Pro Pro Thr Pro  Ala Ser Thr Pro His  Ser Gln Ile 
    1160                 1165                 1170             


Pro Pro  Leu Pro Gly Met Ser  Arg Ser Asn Ser Val  Gly Ile Gln 
    1175                 1180                 1185             


Asp Ala  Phe Asn Asp Gly Ser  Asp Ser Thr Phe Gln  Lys Arg Asn 
    1190                 1195                 1200             


Ser Met  Thr Pro Asn Pro Gly  Tyr Gln Pro Ser Met  Asn Thr Ser 
    1205                 1210                 1215             


Asp Met  Met Gly Arg Met Ser  Tyr Glu Pro Asn Lys  Asp Pro Tyr 
    1220                 1225                 1230             


Gly Ser  Met Arg Lys Ala Pro  Gly Ser Asp Pro Phe  Met Ser Ser 
    1235                 1240                 1245             


Gly Gln  Gly Pro Asn Gly Gly  Met Gly Asp Pro Tyr  Ser Arg Ala 
    1250                 1255                 1260             


Ala Gly  Pro Gly Leu Gly Asn  Val Ala Met Gly Pro  Arg Gln His 
    1265                 1270                 1275             


Tyr Pro  Tyr Gly Gly Pro Tyr  Asp Arg Val Arg Thr  Glu Pro Gly 
    1280                 1285                 1290             


Ile Gly  Pro Glu Gly Asn Met  Ser Thr Gly Ala Pro  Gln Pro Asn 
    1295                 1300                 1305             


Leu Met  Pro Ser Asn Pro Asp  Ser Gly Met Tyr Ser  Pro Ser Arg 
    1310                 1315                 1320             


Tyr Pro  Pro Gln Gln Gln Gln  Gln Gln Gln Gln Arg  His Asp Ser 
    1325                 1330                 1335             


Tyr Gly  Asn Gln Phe Ser Thr  Gln Gly Thr Pro Ser  Gly Ser Pro 
    1340                 1345                 1350             


Phe Pro  Ser Gln Gln Thr Thr  Met Tyr Gln Gln Gln  Gln Gln Asn 
    1355                 1360                 1365             


Tyr Lys  Arg Pro Met Asp Gly  Thr Tyr Gly Pro Pro  Ala Lys Arg 
    1370                 1375                 1380             


His Glu  Gly Glu Met Tyr Ser  Val Pro Tyr Ser Thr  Gly Gln Gly 
    1385                 1390                 1395             


Gln Pro  Gln Gln Gln Gln Leu  Pro Pro Ala Gln Pro  Gln Pro Ala 
    1400                 1405                 1410             


Ser Gln  Gln Gln Ala Ala Gln  Pro Ser Pro Gln Gln  Asp Val Tyr 
    1415                 1420                 1425             


Asn Gln  Tyr Gly Asn Ala Tyr  Pro Ala Thr Ala Thr  Ala Ala Thr 
    1430                 1435                 1440             


Glu Arg  Arg Pro Ala Gly Gly  Pro Gln Asn Gln Phe  Pro Phe Gln 
    1445                 1450                 1455             


Phe Gly  Arg Asp Arg Val Ser  Ala Pro Pro Gly Thr  Asn Ala Gln 
    1460                 1465                 1470             


Gln Asn  Met Pro Pro Gln Met  Met Gly Gly Pro Ile  Gln Ala Ser 
    1475                 1480                 1485             


Ala Glu  Val Ala Gln Gln Gly  Thr Met Trp Gln Gly  Arg Asn Asp 
    1490                 1495                 1500             


Met Thr  Tyr Asn Tyr Ala Asn  Arg Gln Ser Thr Gly  Ser Ala Pro 
    1505                 1510                 1515             


Gln Gly  Pro Ala Tyr His Gly  Val Asn Arg Thr Asp  Glu Met Leu 
    1520                 1525                 1530             


His Thr  Asp Gln Arg Ala Asn  His Glu Gly Ser Trp  Pro Ser His 
    1535                 1540                 1545             


Gly Thr  Arg Gln Pro Pro Tyr  Gly Pro Ser Ala Pro  Val Pro Pro 
    1550                 1555                 1560             


Met Thr  Arg Pro Pro Pro Ser  Asn Tyr Gln Pro Pro  Pro Ser Met 
    1565                 1570                 1575             


Gln Asn  His Ile Pro Gln Val  Ser Ser Pro Ala Pro  Leu Pro Arg 
    1580                 1585                 1590             


Pro Met  Glu Asn Arg Thr Ser  Pro Ser Lys Ser Pro  Phe Leu His 
    1595                 1600                 1605             


Ser Gly  Met Lys Met Gln Lys  Ala Gly Pro Pro Val  Pro Ala Ser 
    1610                 1615                 1620             


His Ile  Ala Pro Ala Pro Val  Gln Pro Pro Met Ile  Arg Arg Asp 
    1625                 1630                 1635             


Ile Thr  Phe Pro Pro Gly Ser  Val Glu Ala Thr Gln  Pro Val Leu 
    1640                 1645                 1650             


Lys Gln  Arg Arg Arg Leu Thr  Met Lys Asp Ile Gly  Thr Pro Glu 
    1655                 1660                 1665             


Ala Trp  Arg Val Met Met Ser  Leu Lys Ser Gly Leu  Leu Ala Glu 
    1670                 1675                 1680             


Ser Thr  Trp Ala Leu Asp Thr  Ile Asn Ile Leu Leu  Tyr Asp Asp 
    1685                 1690                 1695             


Asn Ser  Ile Met Thr Phe Asn  Leu Ser Gln Leu Pro  Gly Leu Leu 
    1700                 1705                 1710             


Glu Leu  Leu Val Glu Tyr Phe  Arg Arg Cys Leu Ile  Glu Ile Phe 
    1715                 1720                 1725             


Gly Ile  Leu Lys Glu Tyr Glu  Val Gly Asp Pro Gly  Gln Arg Thr 
    1730                 1735                 1740             


Leu Leu  Asp Pro Gly Arg Phe  Ser Lys Val Ser Ser  Pro Ala Pro 
    1745                 1750                 1755             


Met Glu  Gly Gly Glu Glu Glu  Glu Glu Leu Leu Gly  Pro Lys Leu 
    1760                 1765                 1770             


Glu Glu  Glu Glu Glu Glu Glu  Val Val Glu Asn Asp  Glu Glu Ile 
    1775                 1780                 1785             


Ala Phe  Ser Gly Lys Asp Lys  Pro Ala Ser Glu Asn  Ser Glu Glu 
    1790                 1795                 1800             


Lys Leu  Ile Ser Lys Phe Asp  Lys Leu Pro Val Lys  Ile Val Gln 
    1805                 1810                 1815             


Lys Asn  Asp Pro Phe Val Val  Asp Cys Ser Asp Lys  Leu Gly Arg 
    1820                 1825                 1830             


Val Gln  Glu Phe Asp Ser Gly  Leu Leu His Trp Arg  Ile Gly Gly 
    1835                 1840                 1845             


Gly Asp  Thr Thr Glu His Ile  Gln Thr His Phe Glu  Ser Lys Thr 
    1850                 1855                 1860             


Glu Leu  Leu Pro Ser Arg Pro  His Ala Pro Cys Pro  Pro Ala Pro 
    1865                 1870                 1875             


Arg Lys  His Val Thr Thr Ala  Glu Gly Thr Pro Gly  Thr Thr Asp 
    1880                 1885                 1890             


Gln Glu  Gly Pro Pro Pro Asp  Gly Pro Pro Glu Lys  Arg Ile Thr 
    1895                 1900                 1905             


Ala Thr  Met Asp Asp Met Leu  Ser Thr Arg Ser Ser  Thr Leu Thr 
    1910                 1915                 1920             


Glu Asp  Gly Ala Lys Ser Ser  Glu Ala Ile Lys Glu  Ser Ser Lys 
    1925                 1930                 1935             


Phe Pro  Phe Gly Ile Ser Pro  Ala Gln Ser His Arg  Asn Ile Lys 
    1940                 1945                 1950             


Ile Leu  Glu Asp Glu Pro His  Ser Lys Asp Glu Thr  Pro Leu Cys 
    1955                 1960                 1965             


Thr Leu  Leu Asp Trp Gln Asp  Ser Leu Ala Lys Arg  Cys Val Cys 
    1970                 1975                 1980             


Val Ser  Asn Thr Ile Arg Ser  Leu Ser Phe Val Pro  Gly Asn Asp 
    1985                 1990                 1995             


Phe Glu  Met Ser Lys His Pro  Gly Leu Leu Leu Ile  Leu Gly Lys 
    2000                 2005                 2010             


Leu Ile  Leu Leu His His Lys  His Pro Glu Arg Lys  Gln Ala Pro 
    2015                 2020                 2025             


Leu Thr  Tyr Glu Lys Glu Glu  Glu Gln Asp Gln Gly  Val Ser Cys 
    2030                 2035                 2040             


Asn Lys  Val Glu Trp Trp Trp  Asp Cys Leu Glu Met  Leu Arg Glu 
    2045                 2050                 2055             


Asn Thr  Leu Val Thr Leu Ala  Asn Ile Ser Gly Gln  Leu Asp Leu 
    2060                 2065                 2070             


Ser Pro  Tyr Pro Glu Ser Ile  Cys Leu Pro Val Leu  Asp Gly Leu 
    2075                 2080                 2085             


Leu His  Trp Ala Val Cys Pro  Ser Ala Glu Ala Gln  Asp Pro Phe 
    2090                 2095                 2100             


Ser Thr  Leu Gly Pro Asn Ala  Val Leu Ser Pro Gln  Arg Leu Val 
    2105                 2110                 2115             


Leu Glu  Thr Leu Ser Lys Leu  Ser Ile Gln Asp Asn  Asn Val Asp 
    2120                 2125                 2130             


Leu Ile  Leu Ala Thr Pro Pro  Phe Ser Arg Leu Glu  Lys Leu Tyr 
    2135                 2140                 2145             


Ser Thr  Met Val Arg Phe Leu  Ser Asp Arg Lys Asn  Pro Val Cys 
    2150                 2155                 2160             


Arg Glu  Met Ala Val Val Leu  Leu Ala Asn Leu Ala  Gln Gly Asp 
    2165                 2170                 2175             


Ser Leu  Ala Ala Arg Ala Ile  Ala Val Gln Lys Gly  Ser Ile Gly 
    2180                 2185                 2190             


Asn Leu  Leu Gly Phe Leu Glu  Asp Ser Leu Ala Ala  Thr Gln Phe 
    2195                 2200                 2205             


Gln Gln  Ser Gln Ala Ser Leu  Leu His Met Gln Asn  Pro Pro Phe 
    2210                 2215                 2220             


Glu Pro  Thr Ser Val Asp Met  Met Arg Arg Ala Ala  Arg Ala Leu 
    2225                 2230                 2235             


Leu Ala  Leu Ala Lys Val Asp  Glu Asn His Ser Glu  Phe Thr Leu 
    2240                 2245                 2250             


Tyr Glu  Ser Arg Leu Leu Asp  Ile Ser Val Ser Pro  Leu Met Asn 
    2255                 2260                 2265             


Ser Leu  Val Ser Gln Val Ile  Cys Asp Val Leu Phe  Leu Ile Gly 
    2270                 2275                 2280             


Gln Ser  
    2285 


<210>  84
<211>  7934
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens AT rich interactive domain 1A (SWI-like) (ARID1A), 
       transcript variant 2, mRNA NCBI Reference Sequence: NM_139135.2

<400>  84
cagaaagcgg agagtcacag cggggccagg ccctggggag cggagcctcc accgcccccc       60

tcattcccag gcaagggctt ggggggaatg agccgggaga gccgggtccc gagcctacag      120

agccgggagc agctgagccg ccggcgcctc ggccgccgcc gccgcctcct cctcctccgc      180

cgccgccagc ccggagcctg agccggcggg gcggggggga gaggagcgag cgcagcgcag      240

cagcggagcc ccgcgaggcc cgcccgggcg ggtggggagg gcagcccggg ggactgggcc      300

ccggggcggg gtgggagggg gggagaagac gaagacaggg ccgggtctct ccgcggacga      360

gacagcgggg atcatggccg cgcaggtcgc ccccgccgcc gccagcagcc tgggcaaccc      420

gccgccgccg ccgccctcgg agctgaagaa agccgagcag cagcagcggg aggaggcggg      480

gggcgaggcg gcggcggcgg cagcggccga gcgcggggaa atgaaggcag ccgccgggca      540

ggaaagcgag ggccccgccg tggggccgcc gcagccgctg ggaaaggagc tgcaggacgg      600

ggccgagagc aatgggggtg gcggcggcgg cggagccggc agcggcggcg ggcccggcgc      660

ggagccggac ctgaagaact cgaacgggaa cgcgggccct aggcccgccc tgaacaataa      720

cctcacggag ccgcccggcg gcggcggtgg cggcagcagc gatggggtgg gggcgcctcc      780

tcactcagcc gcggccgcct tgccgccccc agcctacggc ttcgggcaac cctacggccg      840

gagcccgtct gccgtcgccg ccgccgcggc cgccgtcttc caccaacaac atggcggaca      900

acaaagccct ggcctggcag cgctgcagag cggcggcggc gggggcctgg agccctacgc      960

ggggccccag cagaactctc acgaccacgg cttccccaac caccagtaca actcctacta     1020

ccccaaccgc agcgcctacc ccccgcccgc cccggcctac gcgctgagct ccccgagagg     1080

tggcactccg ggctccggcg cggcggcggc tgccggctcc aagccgcctc cctcctccag     1140

cgcctccgcc tcctcgtcgt cttcgtcctt cgctcagcag cgcttcgggg ccatgggggg     1200

aggcggcccc tccgcggccg gcgggggaac tccccagccc accgccaccc ccaccctcaa     1260

ccaactgctc acgtcgccca gctcggcccg gggctaccag ggctaccccg ggggcgacta     1320

cagtggcggg ccccaggacg ggggcgccgg caagggcccg gcggacatgg cctcgcagtg     1380

ttggggggct gcggcggcgg cagctgcggc ggcggccgcc tcgggagggg cccaacaaag     1440

gagccaccac gcgcccatga gccccgggag cagcggcggc ggggggcagc cgctcgcccg     1500

gacccctcag ccatccagtc caatggatca gatgggcaag atgagacctc agccatatgg     1560

cgggactaac ccatactcgc agcaacaggg acctccgtca ggaccgcagc aaggacatgg     1620

gtacccaggg cagccatacg ggtcccagac cccgcagcgg tacccgatga ccatgcaggg     1680

ccgggcgcag agtgccatgg gcggcctctc ttatacacag cagattcctc cttatggaca     1740

acaaggcccc agcgggtatg gtcaacaggg ccagactcca tattacaacc agcaaagtcc     1800

tcaccctcag cagcagcagc caccctactc ccagcaacca ccgtcccaga cccctcatgc     1860

ccaaccttcg tatcagcagc agccacagtc tcaaccacca cagctccagt cctctcagcc     1920

tccatactcc cagcagccat cccagcctcc acatcagcag tccccggctc catacccctc     1980

ccagcagtcg acgacacagc agcaccccca gagccagccc ccctactcac agccacaggc     2040

tcagtctcct taccagcagc agcaacctca gcagccagca ccctcgacgc tctcccagca     2100

ggctgcgtat cctcagcccc agtctcagca gtcccagcaa actgcctatt cccagcagcg     2160

cttccctcca ccgcaggagc tatctcaaga ttcatttggg tctcaggcat cctcagcccc     2220

ctcaatgacc tccagtaagg gagggcaaga agatatgaac ctgagccttc agtcaagacc     2280

ctccagcttg cctgatctat ctggttcaat agatgacctc cccatgggga cagaaggagc     2340

tctgagtcct ggagtgagca catcagggat ttccagcagc caaggagagc agagtaatcc     2400

agctcagtct cctttctctc ctcatacctc ccctcacctg cctggcatcc gaggcccttc     2460

cccgtcccct gttggctctc ccgccagtgt tgctcagtct cgctcaggac cactctcgcc     2520

tgctgcagtg ccaggcaacc agatgccacc tcggccaccc agtggccagt cggacagcat     2580

catgcatcct tccatgaacc aatcaagcat tgcccaagat cgaggttata tgcagaggaa     2640

cccccagatg ccccagtaca gttcccccca gcccggctca gccttatctc cgcgtcagcc     2700

ttccggagga cagatacaca caggcatggg ctcctaccag cagaactcca tggggagcta     2760

tggtccccag gggggtcagt atggcccaca aggtggctac cccaggcagc caaactataa     2820

tgccttgccc aatgccaact accccagtgc aggcatggct ggaggcataa accccatggg     2880

tgccggaggt caaatgcatg gacagcctgg catcccacct tatggcacac tccctccagg     2940

gaggatgagt cacgcctcca tgggcaaccg gccttatggc cctaacatgg ccaatatgcc     3000

acctcaggtt gggtcaggga tgtgtccccc accagggggc atgaaccgga aaacccaaga     3060

aactgctgtc gccatgcatg ttgctgccaa ctctatccaa aacaggccgc caggctaccc     3120

caatatgaat caagggggca tgatgggaac tggacctcct tatggacaag ggattaatag     3180

tatggctggc atgatcaacc ctcagggacc cccatattcc atgggtggaa ccatggccaa     3240

caattctgca gggatggcag ccagcccaga gatgatgggc cttggggatg taaagttaac     3300

tccagccacc aaaatgaaca acaaggcaga tgggacaccc aagacagaat ccaaatccaa     3360

gaaatccagt tcttctacta caaccaatga gaagatcacc aagttgtatg agctgggtgg     3420

tgagcctgag aggaagatgt gggtggaccg ttatctggcc ttcactgagg agaaggccat     3480

gggcatgaca aatctgcctg ctgtgggtag gaaacctctg gacctctatc gcctctatgt     3540

gtctgtgaag gagattggtg gattgactca ggtcaacaag aacaaaaaat ggcgggaact     3600

tgcaaccaac ctcaatgtgg gcacatcaag cagtgctgcc agctccttga aaaagcagta     3660

tatccagtgt ctctatgcct ttgaatgcaa gattgaacgg ggagaagacc ctcccccaga     3720

catctttgca gctgctgatt ccaagaagtc ccagcccaag atccagcctc cctctcctgc     3780

gggatcagga tctatgcagg ggccccagac tccccagtca accagcagtt ccatggcaga     3840

aggaggagac ttaaagccac caactccagc atccacacca cacagtcaga tccccccatt     3900

gccaggcatg agcaggagca attcagttgg gatccaggat gcctttaatg atggaagtga     3960

ctccacattc cagaagcgga attccatgac tccaaaccct gggtatcagc ccagtatgaa     4020

tacctctgac atgatggggc gcatgtccta tgagccaaat aaggatcctt atggcagcat     4080

gaggaaagct ccagggagtg atcccttcat gtcctcaggg cagggcccca acggcgggat     4140

gggtgacccc tacagtcgtg ctgccggccc tgggctagga aatgtggcga tgggaccacg     4200

acagcactat ccctatggag gtccttatga cagagtgagg acggagcctg gaatagggcc     4260

tgagggaaac atgagcactg gggccccaca gccgaatctc atgccttcca acccagactc     4320

ggggatgtat tctcctagcc gctacccccc gcagcagcag cagcagcagc agcaacgaca     4380

tgattcctat ggcaatcagt tctccaccca aggcacccct tctggcagcc ccttccccag     4440

ccagcagact acaatgtatc aacagcaaca gcaggtatcc agccctgctc ccctgccccg     4500

gccaatggag aaccgcacct ctcctagcaa gtctccattc ctgcactctg ggatgaaaat     4560

gcagaaggca ggtcccccag tacctgcctc gcacatagca cctgcccctg tgcagccccc     4620

catgattcgg cgggatatca ccttcccacc tggctctgtt gaagccacac agcctgtgtt     4680

gaagcagagg aggcggctca caatgaaaga cattggaacc ccggaggcat ggcgggtaat     4740

gatgtccctc aagtctggtc tcctggcaga gagcacatgg gcattagata ccatcaacat     4800

cctgctgtat gatgacaaca gcatcatgac cttcaacctc agtcagctcc cagggttgct     4860

agagctcctt gtagaatatt tccgacgatg cctgattgag atctttggca ttttaaagga     4920

gtatgaggtg ggtgacccag gacagagaac gctactggat cctgggaggt tcagcaaggt     4980

gtctagtcca gctcccatgg agggtgggga agaagaagaa gaacttctag gtcctaaact     5040

agaagaggaa gaagaagagg aagtagttga aaatgatgag gagatagcct tttcaggcaa     5100

ggacaagcca gcttcagaga atagtgagga gaagctgatc agtaagtttg acaagcttcc     5160

agtaaagatc gtacagaaga atgatccatt tgtggtggac tgctcagata agcttgggcg     5220

tgtgcaggag tttgacagtg gcctgctgca ctggcggatt ggtggggggg acaccactga     5280

gcatatccag acccacttcg agagcaagac agagctgctg ccttcccggc ctcacgcacc     5340

ctgcccacca gcccctcgga agcatgtgac aacagcagag ggtacaccag ggacaacaga     5400

ccaggagggg cccccacctg atggacctcc agaaaaacgg atcacagcca ctatggatga     5460

catgttgtct actcggtcta gcaccttgac cgaggatgga gctaagagtt cagaggccat     5520

caaggagagc agcaagtttc catttggcat tagcccagca cagagccacc ggaacatcaa     5580

gatcctagag gacgaacccc acagtaagga tgagacccca ctgtgtaccc ttctggactg     5640

gcaggattct cttgccaagc gctgcgtctg tgtgtccaat accattcgaa gcctgtcatt     5700

tgtgccaggc aatgactttg agatgtccaa acacccaggg ctgctgctca tcctgggcaa     5760

gctgatcctg ctgcaccaca agcacccaga acggaagcag gcaccactaa cttatgaaaa     5820

ggaggaggaa caggaccaag gggtgagctg caacaaagtg gagtggtggt gggactgctt     5880

ggagatgctc cgggaaaaca ccttggttac actcgccaac atctcggggc agttggacct     5940

atctccatac cccgagagca tttgcctgcc tgtcctggac ggactcctac actgggcagt     6000

ttgcccttca gctgaagccc aggacccctt ttccaccctg ggccccaatg ccgtcctttc     6060

cccgcagaga ctggtcttgg aaaccctcag caaactcagc atccaggaca acaatgtgga     6120

cctgattctg gccacacccc ccttcagccg cctggagaag ttgtatagca ctatggtgcg     6180

cttcctcagt gaccgaaaga acccggtgtg ccgggagatg gctgtggtac tgctggccaa     6240

cctggctcag ggggacagcc tggcagctcg tgccattgca gtgcagaagg gcagtatcgg     6300

caacctcctg ggcttcctag aggacagcct tgccgccaca cagttccagc agagccaggc     6360

cagcctcctc cacatgcaga acccaccctt tgagccaact agtgtggaca tgatgcggcg     6420

ggctgcccgc gcgctgcttg ccttggccaa ggtggacgag aaccactcag agtttactct     6480

gtacgaatca cggctgttgg acatctcggt atcaccgttg atgaactcat tggtttcaca     6540

agtcatttgt gatgtactgt ttttgattgg ccagtcatga cagccgtggg acacctcccc     6600

cccccgtgtg tgtgtgcgtg tgtggagaac ttagaaactg actgttgccc tttatttatg     6660

caaaaccacc tcagaatcca gtttaccctg tgctgtccag cttctccctt gggaaaaagt     6720

ctctcctgtt tctctctcct ccttccacct cccctccctc catcacctca cgcctttctg     6780

ttccttgtcc tcaccttact cccctcagga ccctacccca ccctctttga aaagacaaag     6840

ctctgcctac atagaagact ttttttattt taaccaaagt tactgttgtt tacagtgagt     6900

ttggggaaaa aaaataaaat aaaaatggct ttcccagtcc ttgcatcaac gggatgccac     6960

atttcataac tgtttttaat ggtaaaaaaa aaaaaaaaaa atacaaaaaa aaattctgaa     7020

ggacaaaaaa ggtgactgct gaactgtgtg tggtttattg ttgtacattc acaatcttgc     7080

aggagccaag aagttcgcag ttgtgaacag accctgttca ctggagaggc ctgtgcagta     7140

gagtgtagac cctttcatgt actgtactgt acacctgata ctgtaaacat actgtaataa     7200

taatgtctca catggaaaca gaaaacgctg ggtcagcagc aagctgtagt ttttaaaaat     7260

gtttttagtt aaacgttgag gagaaaaaaa aaaaaggctt ttcccccaaa gtatcatgtg     7320

tgaacctaca acaccctgac ctctttctct cctccttgat tgtatgaata accctgagat     7380

cacctcttag aactggtttt aacctttagc tgcagcggct acgctgccac gtgtgtatat     7440

atatgacgtt gtacattgca catacccttg gatccccaca gtttggtcct cctcccagct     7500

acccctttat agtatgacga gttaacaagt tggtgacctg cacaaagcga gacacagcta     7560

tttaatctct tgccagatat cgcccctctt ggtgcgatgc tgtacaggtc tctgtaaaaa     7620

gtccttgctg tctcagcagc caatcaactt atagtttatt tttttctggg tttttgtttt     7680

gttttgtttt ctttctaatc gaggtgtgaa aaagttctag gttcagttga agttctgatg     7740

aagaaacaca attgagattt tttcagtgat aaaatctgca tatttgtatt tcaacaatgt     7800

agctaaaact tgatgtaaat tcctcctttt tttccttttt tggcttaatg aatatcattt     7860

attcagtatg aaatctttat actatatgtt ccacgtgtta agaataaatg tacattaaat     7920

cttggtaaga cttt                                                       7934


<210>  85
<211>  2068
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens AT rich interactive domain 1A (SWI-like) (ARID1A), 
       transcript variant 2, mRNA NCBI Reference Sequence: NM_139135.2

<400>  85

Met Ala Ala Gln Val Ala Pro Ala Ala Ala Ser Ser Leu Gly Asn Pro 
1               5                   10                  15      


Pro Pro Pro Pro Pro Ser Glu Leu Lys Lys Ala Glu Gln Gln Gln Arg 
            20                  25                  30          


Glu Glu Ala Gly Gly Glu Ala Ala Ala Ala Ala Ala Ala Glu Arg Gly 
        35                  40                  45              


Glu Met Lys Ala Ala Ala Gly Gln Glu Ser Glu Gly Pro Ala Val Gly 
    50                  55                  60                  


Pro Pro Gln Pro Leu Gly Lys Glu Leu Gln Asp Gly Ala Glu Ser Asn 
65                  70                  75                  80  


Gly Gly Gly Gly Gly Gly Gly Ala Gly Ser Gly Gly Gly Pro Gly Ala 
                85                  90                  95      


Glu Pro Asp Leu Lys Asn Ser Asn Gly Asn Ala Gly Pro Arg Pro Ala 
            100                 105                 110         


Leu Asn Asn Asn Leu Thr Glu Pro Pro Gly Gly Gly Gly Gly Gly Ser 
        115                 120                 125             


Ser Asp Gly Val Gly Ala Pro Pro His Ser Ala Ala Ala Ala Leu Pro 
    130                 135                 140                 


Pro Pro Ala Tyr Gly Phe Gly Gln Pro Tyr Gly Arg Ser Pro Ser Ala 
145                 150                 155                 160 


Val Ala Ala Ala Ala Ala Ala Val Phe His Gln Gln His Gly Gly Gln 
                165                 170                 175     


Gln Ser Pro Gly Leu Ala Ala Leu Gln Ser Gly Gly Gly Gly Gly Leu 
            180                 185                 190         


Glu Pro Tyr Ala Gly Pro Gln Gln Asn Ser His Asp His Gly Phe Pro 
        195                 200                 205             


Asn His Gln Tyr Asn Ser Tyr Tyr Pro Asn Arg Ser Ala Tyr Pro Pro 
    210                 215                 220                 


Pro Ala Pro Ala Tyr Ala Leu Ser Ser Pro Arg Gly Gly Thr Pro Gly 
225                 230                 235                 240 


Ser Gly Ala Ala Ala Ala Ala Gly Ser Lys Pro Pro Pro Ser Ser Ser 
                245                 250                 255     


Ala Ser Ala Ser Ser Ser Ser Ser Ser Phe Ala Gln Gln Arg Phe Gly 
            260                 265                 270         


Ala Met Gly Gly Gly Gly Pro Ser Ala Ala Gly Gly Gly Thr Pro Gln 
        275                 280                 285             


Pro Thr Ala Thr Pro Thr Leu Asn Gln Leu Leu Thr Ser Pro Ser Ser 
    290                 295                 300                 


Ala Arg Gly Tyr Gln Gly Tyr Pro Gly Gly Asp Tyr Ser Gly Gly Pro 
305                 310                 315                 320 


Gln Asp Gly Gly Ala Gly Lys Gly Pro Ala Asp Met Ala Ser Gln Cys 
                325                 330                 335     


Trp Gly Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ser Gly Gly 
            340                 345                 350         


Ala Gln Gln Arg Ser His His Ala Pro Met Ser Pro Gly Ser Ser Gly 
        355                 360                 365             


Gly Gly Gly Gln Pro Leu Ala Arg Thr Pro Gln Pro Ser Ser Pro Met 
    370                 375                 380                 


Asp Gln Met Gly Lys Met Arg Pro Gln Pro Tyr Gly Gly Thr Asn Pro 
385                 390                 395                 400 


Tyr Ser Gln Gln Gln Gly Pro Pro Ser Gly Pro Gln Gln Gly His Gly 
                405                 410                 415     


Tyr Pro Gly Gln Pro Tyr Gly Ser Gln Thr Pro Gln Arg Tyr Pro Met 
            420                 425                 430         


Thr Met Gln Gly Arg Ala Gln Ser Ala Met Gly Gly Leu Ser Tyr Thr 
        435                 440                 445             


Gln Gln Ile Pro Pro Tyr Gly Gln Gln Gly Pro Ser Gly Tyr Gly Gln 
    450                 455                 460                 


Gln Gly Gln Thr Pro Tyr Tyr Asn Gln Gln Ser Pro His Pro Gln Gln 
465                 470                 475                 480 


Gln Gln Pro Pro Tyr Ser Gln Gln Pro Pro Ser Gln Thr Pro His Ala 
                485                 490                 495     


Gln Pro Ser Tyr Gln Gln Gln Pro Gln Ser Gln Pro Pro Gln Leu Gln 
            500                 505                 510         


Ser Ser Gln Pro Pro Tyr Ser Gln Gln Pro Ser Gln Pro Pro His Gln 
        515                 520                 525             


Gln Ser Pro Ala Pro Tyr Pro Ser Gln Gln Ser Thr Thr Gln Gln His 
    530                 535                 540                 


Pro Gln Ser Gln Pro Pro Tyr Ser Gln Pro Gln Ala Gln Ser Pro Tyr 
545                 550                 555                 560 


Gln Gln Gln Gln Pro Gln Gln Pro Ala Pro Ser Thr Leu Ser Gln Gln 
                565                 570                 575     


Ala Ala Tyr Pro Gln Pro Gln Ser Gln Gln Ser Gln Gln Thr Ala Tyr 
            580                 585                 590         


Ser Gln Gln Arg Phe Pro Pro Pro Gln Glu Leu Ser Gln Asp Ser Phe 
        595                 600                 605             


Gly Ser Gln Ala Ser Ser Ala Pro Ser Met Thr Ser Ser Lys Gly Gly 
    610                 615                 620                 


Gln Glu Asp Met Asn Leu Ser Leu Gln Ser Arg Pro Ser Ser Leu Pro 
625                 630                 635                 640 


Asp Leu Ser Gly Ser Ile Asp Asp Leu Pro Met Gly Thr Glu Gly Ala 
                645                 650                 655     


Leu Ser Pro Gly Val Ser Thr Ser Gly Ile Ser Ser Ser Gln Gly Glu 
            660                 665                 670         


Gln Ser Asn Pro Ala Gln Ser Pro Phe Ser Pro His Thr Ser Pro His 
        675                 680                 685             


Leu Pro Gly Ile Arg Gly Pro Ser Pro Ser Pro Val Gly Ser Pro Ala 
    690                 695                 700                 


Ser Val Ala Gln Ser Arg Ser Gly Pro Leu Ser Pro Ala Ala Val Pro 
705                 710                 715                 720 


Gly Asn Gln Met Pro Pro Arg Pro Pro Ser Gly Gln Ser Asp Ser Ile 
                725                 730                 735     


Met His Pro Ser Met Asn Gln Ser Ser Ile Ala Gln Asp Arg Gly Tyr 
            740                 745                 750         


Met Gln Arg Asn Pro Gln Met Pro Gln Tyr Ser Ser Pro Gln Pro Gly 
        755                 760                 765             


Ser Ala Leu Ser Pro Arg Gln Pro Ser Gly Gly Gln Ile His Thr Gly 
    770                 775                 780                 


Met Gly Ser Tyr Gln Gln Asn Ser Met Gly Ser Tyr Gly Pro Gln Gly 
785                 790                 795                 800 


Gly Gln Tyr Gly Pro Gln Gly Gly Tyr Pro Arg Gln Pro Asn Tyr Asn 
                805                 810                 815     


Ala Leu Pro Asn Ala Asn Tyr Pro Ser Ala Gly Met Ala Gly Gly Ile 
            820                 825                 830         


Asn Pro Met Gly Ala Gly Gly Gln Met His Gly Gln Pro Gly Ile Pro 
        835                 840                 845             


Pro Tyr Gly Thr Leu Pro Pro Gly Arg Met Ser His Ala Ser Met Gly 
    850                 855                 860                 


Asn Arg Pro Tyr Gly Pro Asn Met Ala Asn Met Pro Pro Gln Val Gly 
865                 870                 875                 880 


Ser Gly Met Cys Pro Pro Pro Gly Gly Met Asn Arg Lys Thr Gln Glu 
                885                 890                 895     


Thr Ala Val Ala Met His Val Ala Ala Asn Ser Ile Gln Asn Arg Pro 
            900                 905                 910         


Pro Gly Tyr Pro Asn Met Asn Gln Gly Gly Met Met Gly Thr Gly Pro 
        915                 920                 925             


Pro Tyr Gly Gln Gly Ile Asn Ser Met Ala Gly Met Ile Asn Pro Gln 
    930                 935                 940                 


Gly Pro Pro Tyr Ser Met Gly Gly Thr Met Ala Asn Asn Ser Ala Gly 
945                 950                 955                 960 


Met Ala Ala Ser Pro Glu Met Met Gly Leu Gly Asp Val Lys Leu Thr 
                965                 970                 975     


Pro Ala Thr Lys Met Asn Asn Lys Ala Asp Gly Thr Pro Lys Thr Glu 
            980                 985                 990         


Ser Lys Ser Lys Lys Ser Ser Ser  Ser Thr Thr Thr Asn  Glu Lys Ile 
        995                 1000                 1005             


Thr Lys  Leu Tyr Glu Leu Gly  Gly Glu Pro Glu Arg  Lys Met Trp 
    1010                 1015                 1020             


Val Asp  Arg Tyr Leu Ala Phe  Thr Glu Glu Lys Ala  Met Gly Met 
    1025                 1030                 1035             


Thr Asn  Leu Pro Ala Val Gly  Arg Lys Pro Leu Asp  Leu Tyr Arg 
    1040                 1045                 1050             


Leu Tyr  Val Ser Val Lys Glu  Ile Gly Gly Leu Thr  Gln Val Asn 
    1055                 1060                 1065             


Lys Asn  Lys Lys Trp Arg Glu  Leu Ala Thr Asn Leu  Asn Val Gly 
    1070                 1075                 1080             


Thr Ser  Ser Ser Ala Ala Ser  Ser Leu Lys Lys Gln  Tyr Ile Gln 
    1085                 1090                 1095             


Cys Leu  Tyr Ala Phe Glu Cys  Lys Ile Glu Arg Gly  Glu Asp Pro 
    1100                 1105                 1110             


Pro Pro  Asp Ile Phe Ala Ala  Ala Asp Ser Lys Lys  Ser Gln Pro 
    1115                 1120                 1125             


Lys Ile  Gln Pro Pro Ser Pro  Ala Gly Ser Gly Ser  Met Gln Gly 
    1130                 1135                 1140             


Pro Gln  Thr Pro Gln Ser Thr  Ser Ser Ser Met Ala  Glu Gly Gly 
    1145                 1150                 1155             


Asp Leu  Lys Pro Pro Thr Pro  Ala Ser Thr Pro His  Ser Gln Ile 
    1160                 1165                 1170             


Pro Pro  Leu Pro Gly Met Ser  Arg Ser Asn Ser Val  Gly Ile Gln 
    1175                 1180                 1185             


Asp Ala  Phe Asn Asp Gly Ser  Asp Ser Thr Phe Gln  Lys Arg Asn 
    1190                 1195                 1200             


Ser Met  Thr Pro Asn Pro Gly  Tyr Gln Pro Ser Met  Asn Thr Ser 
    1205                 1210                 1215             


Asp Met  Met Gly Arg Met Ser  Tyr Glu Pro Asn Lys  Asp Pro Tyr 
    1220                 1225                 1230             


Gly Ser  Met Arg Lys Ala Pro  Gly Ser Asp Pro Phe  Met Ser Ser 
    1235                 1240                 1245             


Gly Gln  Gly Pro Asn Gly Gly  Met Gly Asp Pro Tyr  Ser Arg Ala 
    1250                 1255                 1260             


Ala Gly  Pro Gly Leu Gly Asn  Val Ala Met Gly Pro  Arg Gln His 
    1265                 1270                 1275             


Tyr Pro  Tyr Gly Gly Pro Tyr  Asp Arg Val Arg Thr  Glu Pro Gly 
    1280                 1285                 1290             


Ile Gly  Pro Glu Gly Asn Met  Ser Thr Gly Ala Pro  Gln Pro Asn 
    1295                 1300                 1305             


Leu Met  Pro Ser Asn Pro Asp  Ser Gly Met Tyr Ser  Pro Ser Arg 
    1310                 1315                 1320             


Tyr Pro  Pro Gln Gln Gln Gln  Gln Gln Gln Gln Arg  His Asp Ser 
    1325                 1330                 1335             


Tyr Gly  Asn Gln Phe Ser Thr  Gln Gly Thr Pro Ser  Gly Ser Pro 
    1340                 1345                 1350             


Phe Pro  Ser Gln Gln Thr Thr  Met Tyr Gln Gln Gln  Gln Gln Val 
    1355                 1360                 1365             


Ser Ser  Pro Ala Pro Leu Pro  Arg Pro Met Glu Asn  Arg Thr Ser 
    1370                 1375                 1380             


Pro Ser  Lys Ser Pro Phe Leu  His Ser Gly Met Lys  Met Gln Lys 
    1385                 1390                 1395             


Ala Gly  Pro Pro Val Pro Ala  Ser His Ile Ala Pro  Ala Pro Val 
    1400                 1405                 1410             


Gln Pro  Pro Met Ile Arg Arg  Asp Ile Thr Phe Pro  Pro Gly Ser 
    1415                 1420                 1425             


Val Glu  Ala Thr Gln Pro Val  Leu Lys Gln Arg Arg  Arg Leu Thr 
    1430                 1435                 1440             


Met Lys  Asp Ile Gly Thr Pro  Glu Ala Trp Arg Val  Met Met Ser 
    1445                 1450                 1455             


Leu Lys  Ser Gly Leu Leu Ala  Glu Ser Thr Trp Ala  Leu Asp Thr 
    1460                 1465                 1470             


Ile Asn  Ile Leu Leu Tyr Asp  Asp Asn Ser Ile Met  Thr Phe Asn 
    1475                 1480                 1485             


Leu Ser  Gln Leu Pro Gly Leu  Leu Glu Leu Leu Val  Glu Tyr Phe 
    1490                 1495                 1500             


Arg Arg  Cys Leu Ile Glu Ile  Phe Gly Ile Leu Lys  Glu Tyr Glu 
    1505                 1510                 1515             


Val Gly  Asp Pro Gly Gln Arg  Thr Leu Leu Asp Pro  Gly Arg Phe 
    1520                 1525                 1530             


Ser Lys  Val Ser Ser Pro Ala  Pro Met Glu Gly Gly  Glu Glu Glu 
    1535                 1540                 1545             


Glu Glu  Leu Leu Gly Pro Lys  Leu Glu Glu Glu Glu  Glu Glu Glu 
    1550                 1555                 1560             


Val Val  Glu Asn Asp Glu Glu  Ile Ala Phe Ser Gly  Lys Asp Lys 
    1565                 1570                 1575             


Pro Ala  Ser Glu Asn Ser Glu  Glu Lys Leu Ile Ser  Lys Phe Asp 
    1580                 1585                 1590             


Lys Leu  Pro Val Lys Ile Val  Gln Lys Asn Asp Pro  Phe Val Val 
    1595                 1600                 1605             


Asp Cys  Ser Asp Lys Leu Gly  Arg Val Gln Glu Phe  Asp Ser Gly 
    1610                 1615                 1620             


Leu Leu  His Trp Arg Ile Gly  Gly Gly Asp Thr Thr  Glu His Ile 
    1625                 1630                 1635             


Gln Thr  His Phe Glu Ser Lys  Thr Glu Leu Leu Pro  Ser Arg Pro 
    1640                 1645                 1650             


His Ala  Pro Cys Pro Pro Ala  Pro Arg Lys His Val  Thr Thr Ala 
    1655                 1660                 1665             


Glu Gly  Thr Pro Gly Thr Thr  Asp Gln Glu Gly Pro  Pro Pro Asp 
    1670                 1675                 1680             


Gly Pro  Pro Glu Lys Arg Ile  Thr Ala Thr Met Asp  Asp Met Leu 
    1685                 1690                 1695             


Ser Thr  Arg Ser Ser Thr Leu  Thr Glu Asp Gly Ala  Lys Ser Ser 
    1700                 1705                 1710             


Glu Ala  Ile Lys Glu Ser Ser  Lys Phe Pro Phe Gly  Ile Ser Pro 
    1715                 1720                 1725             


Ala Gln  Ser His Arg Asn Ile  Lys Ile Leu Glu Asp  Glu Pro His 
    1730                 1735                 1740             


Ser Lys  Asp Glu Thr Pro Leu  Cys Thr Leu Leu Asp  Trp Gln Asp 
    1745                 1750                 1755             


Ser Leu  Ala Lys Arg Cys Val  Cys Val Ser Asn Thr  Ile Arg Ser 
    1760                 1765                 1770             


Leu Ser  Phe Val Pro Gly Asn  Asp Phe Glu Met Ser  Lys His Pro 
    1775                 1780                 1785             


Gly Leu  Leu Leu Ile Leu Gly  Lys Leu Ile Leu Leu  His His Lys 
    1790                 1795                 1800             


His Pro  Glu Arg Lys Gln Ala  Pro Leu Thr Tyr Glu  Lys Glu Glu 
    1805                 1810                 1815             


Glu Gln  Asp Gln Gly Val Ser  Cys Asn Lys Val Glu  Trp Trp Trp 
    1820                 1825                 1830             


Asp Cys  Leu Glu Met Leu Arg  Glu Asn Thr Leu Val  Thr Leu Ala 
    1835                 1840                 1845             


Asn Ile  Ser Gly Gln Leu Asp  Leu Ser Pro Tyr Pro  Glu Ser Ile 
    1850                 1855                 1860             


Cys Leu  Pro Val Leu Asp Gly  Leu Leu His Trp Ala  Val Cys Pro 
    1865                 1870                 1875             


Ser Ala  Glu Ala Gln Asp Pro  Phe Ser Thr Leu Gly  Pro Asn Ala 
    1880                 1885                 1890             


Val Leu  Ser Pro Gln Arg Leu  Val Leu Glu Thr Leu  Ser Lys Leu 
    1895                 1900                 1905             


Ser Ile  Gln Asp Asn Asn Val  Asp Leu Ile Leu Ala  Thr Pro Pro 
    1910                 1915                 1920             


Phe Ser  Arg Leu Glu Lys Leu  Tyr Ser Thr Met Val  Arg Phe Leu 
    1925                 1930                 1935             


Ser Asp  Arg Lys Asn Pro Val  Cys Arg Glu Met Ala  Val Val Leu 
    1940                 1945                 1950             


Leu Ala  Asn Leu Ala Gln Gly  Asp Ser Leu Ala Ala  Arg Ala Ile 
    1955                 1960                 1965             


Ala Val  Gln Lys Gly Ser Ile  Gly Asn Leu Leu Gly  Phe Leu Glu 
    1970                 1975                 1980             


Asp Ser  Leu Ala Ala Thr Gln  Phe Gln Gln Ser Gln  Ala Ser Leu 
    1985                 1990                 1995             


Leu His  Met Gln Asn Pro Pro  Phe Glu Pro Thr Ser  Val Asp Met 
    2000                 2005                 2010             


Met Arg  Arg Ala Ala Arg Ala  Leu Leu Ala Leu Ala  Lys Val Asp 
    2015                 2020                 2025             


Glu Asn  His Ser Glu Phe Thr  Leu Tyr Glu Ser Arg  Leu Leu Asp 
    2030                 2035                 2040             


Ile Ser  Val Ser Pro Leu Met  Asn Ser Leu Val Ser  Gln Val Ile 
    2045                 2050                 2055             


Cys Asp  Val Leu Phe Leu Ile  Gly Gln Ser 
    2060                 2065             


<210>  86
<211>  5765
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human KRAS mRNA transcript variant b GenBank Accession No.: 
       NM_004985.4  GI:575403057

<400>  86
tcctaggcgg cggccgcggc ggcggaggca gcagcggcgg cggcagtggc ggcggcgaag       60

gtggcggcgg ctcggccagt actcccggcc cccgccattt cggactggga gcgagcgcgg      120

cgcaggcact gaaggcggcg gcggggccag aggctcagcg gctcccaggt gcgggagaga      180

ggcctgctga aaatgactga atataaactt gtggtagttg gagctggtgg cgtaggcaag      240

agtgccttga cgatacagct aattcagaat cattttgtgg acgaatatga tccaacaata      300

gaggattcct acaggaagca agtagtaatt gatggagaaa cctgtctctt ggatattctc      360

gacacagcag gtcaagagga gtacagtgca atgagggacc agtacatgag gactggggag      420

ggctttcttt gtgtatttgc cataaataat actaaatcat ttgaagatat tcaccattat      480

agagaacaaa ttaaaagagt taaggactct gaagatgtac ctatggtcct agtaggaaat      540

aaatgtgatt tgccttctag aacagtagac acaaaacagg ctcaggactt agcaagaagt      600

tatggaattc cttttattga aacatcagca aagacaagac agggtgttga tgatgccttc      660

tatacattag ttcgagaaat tcgaaaacat aaagaaaaga tgagcaaaga tggtaaaaag      720

aagaaaaaga agtcaaagac aaagtgtgta attatgtaaa tacaatttgt acttttttct      780

taaggcatac tagtacaagt ggtaattttt gtacattaca ctaaattatt agcatttgtt      840

ttagcattac ctaatttttt tcctgctcca tgcagactgt tagcttttac cttaaatgct      900

tattttaaaa tgacagtgga agtttttttt tcctctaagt gccagtattc ccagagtttt      960

ggtttttgaa ctagcaatgc ctgtgaaaaa gaaactgaat acctaagatt tctgtcttgg     1020

ggtttttggt gcatgcagtt gattacttct tatttttctt accaattgtg aatgttggtg     1080

tgaaacaaat taatgaagct tttgaatcat ccctattctg tgttttatct agtcacataa     1140

atggattaat tactaatttc agttgagacc ttctaattgg tttttactga aacattgagg     1200

gaacacaaat ttatgggctt cctgatgatg attcttctag gcatcatgtc ctatagtttg     1260

tcatccctga tgaatgtaaa gttacactgt tcacaaaggt tttgtctcct ttccactgct     1320

attagtcatg gtcactctcc ccaaaatatt atattttttc tataaaaaga aaaaaatgga     1380

aaaaaattac aaggcaatgg aaactattat aaggccattt ccttttcaca ttagataaat     1440

tactataaag actcctaata gcttttcctg ttaaggcaga cccagtatga aatggggatt     1500

attatagcaa ccattttggg gctatattta catgctacta aatttttata ataattgaaa     1560

agattttaac aagtataaaa aattctcata ggaattaaat gtagtctccc tgtgtcagac     1620

tgctctttca tagtataact ttaaatcttt tcttcaactt gagtctttga agatagtttt     1680

aattctgctt gtgacattaa aagattattt gggccagtta tagcttatta ggtgttgaag     1740

agaccaaggt tgcaaggcca ggccctgtgt gaacctttga gctttcatag agagtttcac     1800

agcatggact gtgtccccac ggtcatccag tgttgtcatg cattggttag tcaaaatggg     1860

gagggactag ggcagtttgg atagctcaac aagatacaat ctcactctgt ggtggtcctg     1920

ctgacaaatc aagagcattg cttttgtttc ttaagaaaac aaactctttt ttaaaaatta     1980

cttttaaata ttaactcaaa agttgagatt ttggggtggt ggtgtgccaa gacattaatt     2040

ttttttttaa acaatgaagt gaaaaagttt tacaatctct aggtttggct agttctctta     2100

acactggtta aattaacatt gcataaacac ttttcaagtc tgatccatat ttaataatgc     2160

tttaaaataa aaataaaaac aatccttttg ataaatttaa aatgttactt attttaaaat     2220

aaatgaagtg agatggcatg gtgaggtgaa agtatcactg gactaggaag aaggtgactt     2280

aggttctaga taggtgtctt ttaggactct gattttgagg acatcactta ctatccattt     2340

cttcatgtta aaagaagtca tctcaaactc ttagtttttt ttttttacaa ctatgtaatt     2400

tatattccat ttacataagg atacacttat ttgtcaagct cagcacaatc tgtaaatttt     2460

taacctatgt tacaccatct tcagtgccag tcttgggcaa aattgtgcaa gaggtgaagt     2520

ttatatttga atatccattc tcgttttagg actcttcttc catattagtg tcatcttgcc     2580

tccctacctt ccacatgccc catgacttga tgcagtttta atacttgtaa ttcccctaac     2640

cataagattt actgctgctg tggatatctc catgaagttt tcccactgag tcacatcaga     2700

aatgccctac atcttatttc ctcagggctc aagagaatct gacagatacc ataaagggat     2760

ttgacctaat cactaatttt caggtggtgg ctgatgcttt gaacatctct ttgctgccca     2820

atccattagc gacagtagga tttttcaaac ctggtatgaa tagacagaac cctatccagt     2880

ggaaggagaa tttaataaag atagtgctga aagaattcct taggtaatct ataactagga     2940

ctactcctgg taacagtaat acattccatt gttttagtaa ccagaaatct tcatgcaatg     3000

aaaaatactt taattcatga agcttacttt ttttttttgg tgtcagagtc tcgctcttgt     3060

cacccaggct ggaatgcagt ggcgccatct cagctcactg caacctccat ctcccaggtt     3120

caagcgattc tcgtgcctcg gcctcctgag tagctgggat tacaggcgtg tgccactaca     3180

ctcaactaat ttttgtattt ttaggagaga cggggtttca ccctgttggc caggctggtc     3240

tcgaactcct gacctcaagt gattcaccca ccttggcctc ataaacctgt tttgcagaac     3300

tcatttattc agcaaatatt tattgagtgc ctaccagatg ccagtcaccg cacaaggcac     3360

tgggtatatg gtatccccaa acaagagaca taatcccggt ccttaggtag tgctagtgtg     3420

gtctgtaata tcttactaag gcctttggta tacgacccag agataacacg atgcgtattt     3480

tagttttgca aagaaggggt ttggtctctg tgccagctct ataattgttt tgctacgatt     3540

ccactgaaac tcttcgatca agctacttta tgtaaatcac ttcattgttt taaaggaata     3600

aacttgatta tattgttttt ttatttggca taactgtgat tcttttagga caattactgt     3660

acacattaag gtgtatgtca gatattcata ttgacccaaa tgtgtaatat tccagttttc     3720

tctgcataag taattaaaat atacttaaaa attaatagtt ttatctgggt acaaataaac     3780

aggtgcctga actagttcac agacaaggaa acttctatgt aaaaatcact atgatttctg     3840

aattgctatg tgaaactaca gatctttgga acactgttta ggtagggtgt taagacttac     3900

acagtacctc gtttctacac agagaaagaa atggccatac ttcaggaact gcagtgctta     3960

tgaggggata tttaggcctc ttgaattttt gatgtagatg ggcatttttt taaggtagtg     4020

gttaattacc tttatgtgaa ctttgaatgg tttaacaaaa gatttgtttt tgtagagatt     4080

ttaaaggggg agaattctag aaataaatgt tacctaatta ttacagcctt aaagacaaaa     4140

atccttgttg aagttttttt aaaaaaagct aaattacata gacttaggca ttaacatgtt     4200

tgtggaagaa tatagcagac gtatattgta tcatttgagt gaatgttccc aagtaggcat     4260

tctaggctct atttaactga gtcacactgc ataggaattt agaacctaac ttttataggt     4320

tatcaaaact gttgtcacca ttgcacaatt ttgtcctaat atatacatag aaactttgtg     4380

gggcatgtta agttacagtt tgcacaagtt catctcattt gtattccatt gatttttttt     4440

ttcttctaaa cattttttct tcaaacagta tataactttt tttaggggat ttttttttag     4500

acagcaaaaa ctatctgaag atttccattt gtcaaaaagt aatgatttct tgataattgt     4560

gtagtaatgt tttttagaac ccagcagtta ccttaaagct gaatttatat ttagtaactt     4620

ctgtgttaat actggatagc atgaattctg cattgagaaa ctgaatagct gtcataaaat     4680

gaaactttct ttctaaagaa agatactcac atgagttctt gaagaatagt cataactaga     4740

ttaagatctg tgttttagtt taatagtttg aagtgcctgt ttgggataat gataggtaat     4800

ttagatgaat ttaggggaaa aaaaagttat ctgcagatat gttgagggcc catctctccc     4860

cccacacccc cacagagcta actgggttac agtgttttat ccgaaagttt ccaattccac     4920

tgtcttgtgt tttcatgttg aaaatacttt tgcatttttc ctttgagtgc caatttctta     4980

ctagtactat ttcttaatgt aacatgttta cctggaatgt attttaacta tttttgtata     5040

gtgtaaactg aaacatgcac attttgtaca ttgtgctttc ttttgtggga catatgcagt     5100

gtgatccagt tgttttccat catttggttg cgctgaccta ggaatgttgg tcatatcaaa     5160

cattaaaaat gaccactctt ttaattgaaa ttaactttta aatgtttata ggagtatgtg     5220

ctgtgaagtg atctaaaatt tgtaatattt ttgtcatgaa ctgtactact cctaattatt     5280

gtaatgtaat aaaaatagtt acagtgacta tgagtgtgta tttattcatg aaatttgaac     5340

tgtttgcccc gaaatggata tggaatactt tataagccat agacactata gtataccagt     5400

gaatctttta tgcagcttgt tagaagtatc ctttatttct aaaaggtgct gtggatatta     5460

tgtaaaggcg tgtttgctta aacttaaaac catatttaga agtagatgca aaacaaatct     5520

gcctttatga caaaaaaata ggataacatt atttatttat ttccttttat caaagaaggt     5580

aattgataca caacaggtga cttggtttta ggcccaaagg tagcagcagc aacattaata     5640

atggaaataa ttgaatagtt agttatgtat gttaatgcca gtcaccagca ggctatttca     5700

aggtcagaag taatgactcc atacatatta tttatttcta taactacatt taaatcatta     5760

ccagg                                                                 5765


<210>  87
<211>  188
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human KRAS Isoform B polypeptide GenBank Accession No.: 
       NP_004976.2  GI:15718761

<400>  87

Met Thr Glu Tyr Lys Leu Val Val Val Gly Ala Gly Gly Val Gly Lys 
1               5                   10                  15      


Ser Ala Leu Thr Ile Gln Leu Ile Gln Asn His Phe Val Asp Glu Tyr 
            20                  25                  30          


Asp Pro Thr Ile Glu Asp Ser Tyr Arg Lys Gln Val Val Ile Asp Gly 
        35                  40                  45              


Glu Thr Cys Leu Leu Asp Ile Leu Asp Thr Ala Gly Gln Glu Glu Tyr 
    50                  55                  60                  


Ser Ala Met Arg Asp Gln Tyr Met Arg Thr Gly Glu Gly Phe Leu Cys 
65                  70                  75                  80  


Val Phe Ala Ile Asn Asn Thr Lys Ser Phe Glu Asp Ile His His Tyr 
                85                  90                  95      


Arg Glu Gln Ile Lys Arg Val Lys Asp Ser Glu Asp Val Pro Met Val 
            100                 105                 110         


Leu Val Gly Asn Lys Cys Asp Leu Pro Ser Arg Thr Val Asp Thr Lys 
        115                 120                 125             


Gln Ala Gln Asp Leu Ala Arg Ser Tyr Gly Ile Pro Phe Ile Glu Thr 
    130                 135                 140                 


Ser Ala Lys Thr Arg Gln Gly Val Asp Asp Ala Phe Tyr Thr Leu Val 
145                 150                 155                 160 


Arg Glu Ile Arg Lys His Lys Glu Lys Met Ser Lys Asp Gly Lys Lys 
                165                 170                 175     


Lys Lys Lys Lys Ser Lys Thr Lys Cys Val Ile Met 
            180                 185             


<210>  88
<211>  5775
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens K-ras oncogene protein (KRAS) mRNA, complete 
       cds, GenBank: M54968.1

<400>  88
tcctaggcgg cggccgcggc ggcggaggca gcagcggcgg cggcagtggc ggcggcgaag       60

gtggcggcgg ctcggccagt actcccggcc cccgccattt cggactggga gcgagcgcgg      120

cgcaggcact gaaggcggcg gcggggccag aggctcagcg gctcccaggt gcgggagaga      180

ggcctgctga aaatgactga atataaactt gtggtagttg gagcttgtgg cgtaggcaag      240

agtgccttga cgatacagct aattcagaat cattttgtgg acgaatatga tccaacaata      300

gaggattcct acaggaagca agtagtaatt gatggagaaa cctgtctctt ggatattctc      360

gacacagcag gtcaagagga gtacagtgca atgagggacc agtacatgag gactggggag      420

ggctttcttt gtgtatttgc cataaataat actaaatcat ttgaagatat tcaccattat      480

agagaacaaa ttaaaagagt taaggactct gaagatgtac ctatggtcct agtaggaaat      540

aaatgtgatt tgccttctag aacagtagac acaaaacagg ctcaggactt agcaagaagt      600

tatggaattc cttttattga aacatcagca aagacaagac agggtgttga tgatgccttc      660

tatacattag ttcgagaaat tcgaaaacat aaagaaaaga tgagcaaaga tggtaaaaag      720

aagaaaaaga agtcaaagac aaagtgtgta attatgtaaa tacaatttgt acttttttct      780

taaggcatac tagtacaagt ggtaattttt gtacattaca ctaaattatt agcatttgtt      840

ttagcattac ctaatttttt tcctgctcca tgcagactgt tagcttttac cttaaatgct      900

tattttaaaa tgacagtgga agtttttttt tcctcgaagt gccagtattc ccagagtttt      960

ggtttttgaa ctagcaatgc ctgtgaaaaa gaaactgaat acctaagatt tctgtcttgg     1020

ggtttttggt gcatgcagtt gattacttct tatttttctt accaagtgtg aatgttggtg     1080

tgaaacaaat taatgaagct tttgaatcat ccctattctg tgttttatct agtcacataa     1140

atggattaat tactaatttc agttgagacc ttctaattgg tttttactga aacattgagg     1200

gacacaaatt tatgggcttc ctgatgatga ttcttctagg catcatgtcc tatagtttgt     1260

catccctgat gaatgtaaag ttacactgtt cacaaaggtt ttgtctcctt tccactgcta     1320

ttagtcatgg tcactctccc caaaatatta tattttttct ataaaaagaa aaaaatggaa     1380

aaaaattaca aggcaatgga aactattata aggccatttc cttttcacat tagataaatt     1440

actataaaga ctcctaatag ctttttcctg ttaaggcaga cccagtatga atgggattat     1500

tatagcaacc attttggggc tatatttaca tgctactaaa tttttataat aattgaaaag     1560

attttaacaa gtataaaaaa attctcatag gaattaaatg tagtctccct gtgtcagact     1620

gctctttcat agtataactt taaatctttt cttcaacttg agtctttgaa gatagtttta     1680

attctgcttg tgacattaaa agattatttg ggccagttat agcttattag gtgttgaaga     1740

gaccaaggtt gcaagccagg ccctgtgtga accttgagct ttcatagaga gtttcacagc     1800

atggactgtg tgccccacgg tcatccgagt ggttgtacga tgcattggtt agtcaaaaat     1860

ggggagggac tagggcagtt tggatagctc aacaagatac aatctcactc tgtggtggtc     1920

ctgctgacaa atcaagagca ttgcttttgt ttcttaagaa aacaaactct tttttaaaaa     1980

ttacttttaa atattaactc aaaagttgag attttggggt ggtggtgtgc caagacatta     2040

attttttttt taaacaatga agtgaaaaag ttttacaatc tctaggtttg gctagttctc     2100

ttaacactgg ttaaattaac attgcataaa cacttttcaa gtctgatcca tatttaataa     2160

tgctttaaaa taaaaataaa aacaatcctt ttgataaatt taaaatgtta cttattttaa     2220

aataaatgaa gtgagatggc atggtgaggt gaaagtatca ctggactagg ttgttggtga     2280

cttaggttct agataggtgt cttttaggac tctgattttg aggacatcac ttactatcca     2340

tttcttcatg ttaaaagaag tcatctcaaa ctcttagttt ttttttttta cactatgtga     2400

tttatattcc atttacataa ggatacactt atttgtcaag ctcagcacaa tctgtaaatt     2460

tttaacctat gttacaccat cttcagtgcc agtcttgggc aaaattgtgc aagaggtgaa     2520

gtttatattt gaatatccat tctcgtttta ggactcttct tccatattag tgtcatcttg     2580

cctccctacc ttccacatgc cccatgactt gatgcagttt taatacttgt aattccccta     2640

accataagat ttactgctgc tgtggatatc tccatgaagt tttcccactg agtcacatca     2700

gaaatgccct acatcttatt ttcctcaggg ctcaagagaa tctgacagat accataaagg     2760

gatttgacct aatcactaat tttcaggtgg tggctgatgc tttgaacatc tctttgctgc     2820

ccaatccatt agcgacagta ggatttttca accctggtat gaatagacag aaccctatcc     2880

agtggaagga gaatttaata aagatagtgc agaaagaatt ccttaggtaa tctataacta     2940

ggactactcc tggtaacagt aatacattcc attgttttag taaccagaaa tcttcatgca     3000

atgaaaaata ctttaattca tgaagcttac tttttttttt ttggtgtcag agtctcgctc     3060

ttgtcaccca ggctggaatg cagtggcgcc atctcagctc actgcaacct tccatcttcc     3120

caggttcaag cgattctcgt gcctcggcct cctgagtagc tgggattaca ggcgtgtgca     3180

ctacactcaa ctaatttttg tatttttagg agagacgggg tttcacctgt tggccaggct     3240

ggtctcgaac tcctgacctc aagtgattca cccaccttgg cctcataaac ctgttttgca     3300

gaactcattt attcagcaaa tatttattga gtgcctacca gatgccagtc accgcacaag     3360

gcactgggta tatggtatcc ccaaacaaga gacataatcc cggtccttag gtactgctag     3420

tgtggtctgt aatatcttac taaggccttt ggtatacgac ccagagataa cacgatgcgt     3480

attttagttt tgcaaagaag gggtttggtc tctgtgccag ctctataatt gttttgctac     3540

gattccactg aaactcttcg atcaagctac tttatgtaaa tcacttcatt gttttaaagg     3600

aataaacttg attatattgt ttttttattt ggcataactg tgattctttt aggacaatta     3660

ctgtacacat taaggtgtat gtcagatatt catattgacc caaatgtgta atattccagt     3720

tttctctgca taagtaatta aaatatactt aaaaattaat agttttatct gggtacaaat     3780

aaacagtgcc tgaactagtt cacagacaag ggaaacttct atgtaaaaat cactatgatt     3840

tctgaattgc tatgtgaaac tacagatctt tggaacactg tttaggtagg gtgttaagac     3900

ttgacacagt acctcgtttc tacacagaga aagaaatggc catacttcag gaactgcagt     3960

gcttatgagg ggatatttag gcctcttgaa tttttgatgt agatgggcat ttttttaagg     4020

tagtggttaa ttacctttat gtgaactttg aatggtttaa caaaagattt gtttttgtag     4080

agattttaaa gggggagaat tctagaaata aatgttacct aattattaca gccttaaaga     4140

caaaaatcct tgttgaagtt tttttaaaaa aagactaaat tacatagact taggcattaa     4200

catgtttgtg gaagaatata gcagacgtat attgtatcat ttgagtgaat gttcccaagt     4260

aggcattcta ggctctattt aactgagtca cactgcatag gaatttagaa cctaactttt     4320

ataggttatc aaaactgttg tcaccattgc acaattttgt cctaatatat acatagaaac     4380

tttgtggggc atgttaagtt acagtttgca caagttcatc tcatttgtat tccattgatt     4440

tttttttttc ttctaaacat tttttcttca aaacagtata tataactttt tttaggggat     4500

tttttttaga cagcaaaaaa ctatctgaag atttccattt gtcaaaaagt aatgatttct     4560

tgataattgt gtagtgaatg ttttttagaa cccagcagtt accttgaaag ctgaatttat     4620

atttagtaac ttctgtgtta atactggata gcatgaattc tgcattgaga aactgaatag     4680

ctgtcataaa atgctttctt tcctaaagaa agatactcac atgagttctt gaagaatagt     4740

cataactaga ttaagatctg tgttttagtt taatagtttg aagtgcctgt ttgggataat     4800

gataggtaat ttagatgaat ttaggggaaa aaaaagttat ctgcagttat gttgagggcc     4860

catctctccc cccacacccc cacagagcta actgggttac agtgttttat ccgaaagttt     4920

ccaattccac tgtcttgtgt tttcatgttg aaaatacttt tgcatttttc ctttgagtgc     4980

caatttctta ctagtactat ttcttaatgt aacatgttta cctggcctgt cttttaacta     5040

tttttgtata gtgtaaactg aaacatgcac attttgtaca ttgtgctttc ttttgtgggt     5100

catatgcagt gtgatccagt tgttttccat catttggttg cgctgaccta ggaatgttgg     5160

tcatatcaaa cattaaaaat gaccactctt ttaatgaaat taacttttaa atgtttatag     5220

gagtatgtgc tgtgaagtga tctaaaattt gtaatatttt tgtcatgaac tgtactactc     5280

ctaattattg taatgtaata aaaatagtta cagtgactat gagtgtgtat ttattcatgc     5340

aaatttgaac tgtttgcccc gaaatggata tggatacttt ataagccata gacactatag     5400

tataccagtg aatcttttat gcagcttgtt agaagtatcc ttttattttc taaaaggtgc     5460

tgtggatatt atgtaaaggc gtgtttgctt aaacaatttt ccatatttag aagtagatgc     5520

aaaacaaatc tgcctttatg acaaaaaaat aggataacat tatttattta tttcctttta     5580

tcaataaggt aattgataca caacaggtga cttggtttta ggcccaaagg tagcagcagc     5640

aacattaata atggaaataa ttgaatagtt agttatgtat gttaatgcca gtcaccagca     5700

ggctatttca aggtcagaag taatgactcc atacatatta tttatttcta taactacatt     5760

taaatcatta ccagg                                                      5775


<210>  89
<211>  188
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens K-ras oncogene protein (KRAS) polypeptide, GenBank: 
       M54968.1

<400>  89

Met Thr Glu Tyr Lys Leu Val Val Val Gly Ala Cys Gly Val Gly Lys 
1               5                   10                  15      


Ser Ala Leu Thr Ile Gln Leu Ile Gln Asn His Phe Val Asp Glu Tyr 
            20                  25                  30          


Asp Pro Thr Ile Glu Asp Ser Tyr Arg Lys Gln Val Val Ile Asp Gly 
        35                  40                  45              


Glu Thr Cys Leu Leu Asp Ile Leu Asp Thr Ala Gly Gln Glu Glu Tyr 
    50                  55                  60                  


Ser Ala Met Arg Asp Gln Tyr Met Arg Thr Gly Glu Gly Phe Leu Cys 
65                  70                  75                  80  


Val Phe Ala Ile Asn Asn Thr Lys Ser Phe Glu Asp Ile His His Tyr 
                85                  90                  95      


Arg Glu Gln Ile Lys Arg Val Lys Asp Ser Glu Asp Val Pro Met Val 
            100                 105                 110         


Leu Val Gly Asn Lys Cys Asp Leu Pro Ser Arg Thr Val Asp Thr Lys 
        115                 120                 125             


Gln Ala Gln Asp Leu Ala Arg Ser Tyr Gly Ile Pro Phe Ile Glu Thr 
    130                 135                 140                 


Ser Ala Lys Thr Arg Gln Gly Val Asp Asp Ala Phe Tyr Thr Leu Val 
145                 150                 155                 160 


Arg Glu Ile Arg Lys His Lys Glu Lys Met Ser Lys Asp Gly Lys Lys 
                165                 170                 175     


Lys Lys Lys Lys Ser Lys Thr Lys Cys Val Ile Met 
            180                 185             


<210>  90
<211>  6492
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human BCL-2 mRNA GenBank Accession No.: NM_000633.2  GI:72198188

<400>  90
tttctgtgaa gcagaagtct gggaatcgat ctggaaatcc tcctaatttt tactccctct       60

ccccgcgact cctgattcat tgggaagttt caaatcagct ataactggag agtgctgaag      120

attgatggga tcgttgcctt atgcatttgt tttggtttta caaaaaggaa acttgacaga      180

ggatcatgct gtacttaaaa aatacaacat cacagaggaa gtagactgat attaacaata      240

cttactaata ataacgtgcc tcatgaaata aagatccgaa aggaattgga ataaaaattt      300

cctgcatctc atgccaaggg ggaaacacca gaatcaagtg ttccgcgtga ttgaagacac      360

cccctcgtcc aagaatgcaa agcacatcca ataaaatagc tggattataa ctcctcttct      420

ttctctgggg gccgtggggt gggagctggg gcgagaggtg ccgttggccc ccgttgcttt      480

tcctctggga aggatggcgc acgctgggag aacagggtac gataaccggg agatagtgat      540

gaagtacatc cattataagc tgtcgcagag gggctacgag tgggatgcgg gagatgtggg      600

cgccgcgccc ccgggggccg cccccgcacc gggcatcttc tcctcccagc ccgggcacac      660

gccccatcca gccgcatccc gggacccggt cgccaggacc tcgccgctgc agaccccggc      720

tgcccccggc gccgccgcgg ggcctgcgct cagcccggtg ccacctgtgg tccacctgac      780

cctccgccag gccggcgacg acttctcccg ccgctaccgc cgcgacttcg ccgagatgtc      840

cagccagctg cacctgacgc ccttcaccgc gcggggacgc tttgccacgg tggtggagga      900

gctcttcagg gacggggtga actgggggag gattgtggcc ttctttgagt tcggtggggt      960

catgtgtgtg gagagcgtca accgggagat gtcgcccctg gtggacaaca tcgccctgtg     1020

gatgactgag tacctgaacc ggcacctgca cacctggatc caggataacg gaggctggga     1080

tgcctttgtg gaactgtacg gccccagcat gcggcctctg tttgatttct cctggctgtc     1140

tctgaagact ctgctcagtt tggccctggt gggagcttgc atcaccctgg gtgcctatct     1200

gggccacaag tgaagtcaac atgcctgccc caaacaaata tgcaaaaggt tcactaaagc     1260

agtagaaata atatgcattg tcagtgatgt accatgaaac aaagctgcag gctgtttaag     1320

aaaaaataac acacatataa acatcacaca cacagacaga cacacacaca cacaacaatt     1380

aacagtcttc aggcaaaacg tcgaatcagc tatttactgc caaagggaaa tatcatttat     1440

tttttacatt attaagaaaa aaagatttat ttatttaaga cagtcccatc aaaactcctg     1500

tctttggaaa tccgaccact aattgccaag caccgcttcg tgtggctcca cctggatgtt     1560

ctgtgcctgt aaacatagat tcgctttcca tgttgttggc cggatcacca tctgaagagc     1620

agacggatgg aaaaaggacc tgatcattgg ggaagctggc tttctggctg ctggaggctg     1680

gggagaaggt gttcattcac ttgcatttct ttgccctggg ggctgtgata ttaacagagg     1740

gagggttcct gtggggggaa gtccatgcct ccctggcctg aagaagagac tctttgcata     1800

tgactcacat gatgcatacc tggtgggagg aaaagagttg ggaacttcag atggacctag     1860

tacccactga gatttccacg ccgaaggaca gcgatgggaa aaatgccctt aaatcatagg     1920

aaagtatttt tttaagctac caattgtgcc gagaaaagca ttttagcaat ttatacaata     1980

tcatccagta ccttaagccc tgattgtgta tattcatata ttttggatac gcacccccca     2040

actcccaata ctggctctgt ctgagtaaga aacagaatcc tctggaactt gaggaagtga     2100

acatttcggt gacttccgca tcaggaaggc tagagttacc cagagcatca ggccgccaca     2160

agtgcctgct tttaggagac cgaagtccgc agaacctgcc tgtgtcccag cttggaggcc     2220

tggtcctgga actgagccgg ggccctcact ggcctcctcc agggatgatc aacagggcag     2280

tgtggtctcc gaatgtctgg aagctgatgg agctcagaat tccactgtca agaaagagca     2340

gtagaggggt gtggctgggc ctgtcaccct ggggccctcc aggtaggccc gttttcacgt     2400

ggagcatggg agccacgacc cttcttaaga catgtatcac tgtagaggga aggaacagag     2460

gccctgggcc cttcctatca gaaggacatg gtgaaggctg ggaacgtgag gagaggcaat     2520

ggccacggcc cattttggct gtagcacatg gcacgttggc tgtgtggcct tggcccacct     2580

gtgagtttaa agcaaggctt taaatgactt tggagagggt cacaaatcct aaaagaagca     2640

ttgaagtgag gtgtcatgga ttaattgacc cctgtctatg gaattacatg taaaacatta     2700

tcttgtcact gtagtttggt tttatttgaa aacctgacaa aaaaaaagtt ccaggtgtgg     2760

aatatggggg ttatctgtac atcctggggc attaaaaaaa aaatcaatgg tggggaacta     2820

taaagaagta acaaaagaag tgacatcttc agcaaataaa ctaggaaatt tttttttctt     2880

ccagtttaga atcagccttg aaacattgat ggaataactc tgtggcatta ttgcattata     2940

taccatttat ctgtattaac tttggaatgt actctgttca atgtttaatg ctgtggttga     3000

tatttcgaaa gctgctttaa aaaaatacat gcatctcagc gtttttttgt ttttaattgt     3060

atttagttat ggcctataca ctatttgtga gcaaaggtga tcgttttctg tttgagattt     3120

ttatctcttg attcttcaaa agcattctga gaaggtgaga taagccctga gtctcagcta     3180

cctaagaaaa acctggatgt cactggccac tgaggagctt tgtttcaacc aagtcatgtg     3240

catttccacg tcaacagaat tgtttattgt gacagttata tctgttgtcc ctttgacctt     3300

gtttcttgaa ggtttcctcg tccctgggca attccgcatt taattcatgg tattcaggat     3360

tacatgcatg tttggttaaa cccatgagat tcattcagtt aaaaatccag atggcaaatg     3420

accagcagat tcaaatctat ggtggtttga cctttagaga gttgctttac gtggcctgtt     3480

tcaacacaga cccacccaga gccctcctgc cctccttccg cgggggcttt ctcatggctg     3540

tccttcaggg tcttcctgaa atgcagtggt gcttacgctc caccaagaaa gcaggaaacc     3600

tgtggtatga agccagacct ccccggcggg cctcagggaa cagaatgatc agacctttga     3660

atgattctaa tttttaagca aaatattatt ttatgaaagg tttacattgt caaagtgatg     3720

aatatggaat atccaatcct gtgctgctat cctgccaaaa tcattttaat ggagtcagtt     3780

tgcagtatgc tccacgtggt aagatcctcc aagctgcttt agaagtaaca atgaagaacg     3840

tggacgtttt taatataaag cctgttttgt cttttgttgt tgttcaaacg ggattcacag     3900

agtatttgaa aaatgtatat atattaagag gtcacggggg ctaattgctg gctggctgcc     3960

ttttgctgtg gggttttgtt acctggtttt aataacagta aatgtgccca gcctcttggc     4020

cccagaactg tacagtattg tggctgcact tgctctaaga gtagttgatg ttgcattttc     4080

cttattgtta aaaacatgtt agaagcaatg aatgtatata aaagcctcaa ctagtcattt     4140

ttttctcctc ttcttttttt tcattatatc taattatttt gcagttgggc aacagagaac     4200

catccctatt ttgtattgaa gagggattca catctgcatc ttaactgctc tttatgaatg     4260

aaaaaacagt cctctgtatg tactcctctt tacactggcc agggtcagag ttaaatagag     4320

tatatgcact ttccaaattg gggacaaggg ctctaaaaaa agccccaaaa ggagaagaac     4380

atctgagaac ctcctcggcc ctcccagtcc ctcgctgcac aaatactccg caagagaggc     4440

cagaatgaca gctgacaggg tctatggcca tcgggtcgtc tccgaagatt tggcaggggc     4500

agaaaactct ggcaggctta agatttggaa taaagtcaca gaattaagga agcacctcaa     4560

tttagttcaa acaagacgcc aacattctct ccacagctca cttacctctc tgtgttcaga     4620

tgtggccttc catttatatg tgatctttgt tttattagta aatgcttatc atctaaagat     4680

gtagctctgg cccagtggga aaaattagga agtgattata aatcgagagg agttataata     4740

atcaagatta aatgtaaata atcagggcaa tcccaacaca tgtctagctt tcacctccag     4800

gatctattga gtgaacagaa ttgcaaatag tctctatttg taattgaact tatcctaaaa     4860

caaatagttt ataaatgtga acttaaactc taattaattc caactgtact tttaaggcag     4920

tggctgtttt tagactttct tatcacttat agttagtaat gtacacctac tctatcagag     4980

aaaaacagga aaggctcgaa atacaagcca ttctaaggaa attagggagt cagttgaaat     5040

tctattctga tcttattctg tggtgtcttt tgcagcccag acaaatgtgg ttacacactt     5100

tttaagaaat acaattctac attgtcaagc ttatgaaggt tccaatcaga tctttattgt     5160

tattcaattt ggatctttca gggatttttt ttttaaatta ttatgggaca aaggacattt     5220

gttggagggg tgggagggag gaagaatttt taaatgtaaa acattcccaa gtttggatca     5280

gggagttgga agttttcaga ataaccagaa ctaagggtat gaaggacctg tattggggtc     5340

gatgtgatgc ctctgcgaag aaccttgtgt gacaaatgag aaacattttg aagtttgtgg     5400

tacgaccttt agattccaga gacatcagca tggctcaaag tgcagctccg tttggcagtg     5460

caatggtata aatttcaagc tggatatgtc taatgggtat ttaaacaata aatgtgcagt     5520

tttaactaac aggatattta atgacaacct tctggttggt agggacatct gtttctaaat     5580

gtttattatg tacaatacag aaaaaaattt tataaaatta agcaatgtga aactgaattg     5640

gagagtgata atacaagtcc tttagtctta cccagtgaat cattctgttc catgtctttg     5700

gacaaccatg accttggaca atcatgaaat atgcatctca ctggatgcaa agaaaatcag     5760

atggagcatg aatggtactg taccggttca tctggactgc cccagaaaaa taacttcaag     5820

caaacatcct atcaacaaca aggttgttct gcataccaag ctgagcacag aagatgggaa     5880

cactggtgga ggatggaaag gctcgctcaa tcaagaaaat tctgagacta ttaataaata     5940

agactgtagt gtagatactg agtaaatcca tgcacctaaa ccttttggaa aatctgccgt     6000

gggccctcca gatagctcat ttcattaagt ttttccctcc aaggtagaat ttgcaagagt     6060

gacagtggat tgcatttctt ttggggaagc tttcttttgg tggttttgtt tattatacct     6120

tcttaagttt tcaaccaagg tttgcttttg ttttgagtta ctggggttat ttttgtttta     6180

aataaaaata agtgtacaat aagtgttttt gtattgaaag cttttgttat caagattttc     6240

atacttttac cttccatggc tctttttaag attgatactt ttaagaggtg gctgatattc     6300

tgcaacactg tacacataaa aaatacggta aggatacttt acatggttaa ggtaaagtaa     6360

gtctccagtt ggccaccatt agctataatg gcactttgtt tgtgttgttg gaaaaagtca     6420

cattgccatt aaactttcct tgtctgtcta gttaatattg tgaagaaaaa taaagtacag     6480

tgtgagatac tg                                                         6492


<210>  91
<211>  239
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human BCL-2 polypeptide GenBank Accession No.: NP_000624.2  
       GI:72198189

<400>  91

Met Ala His Ala Gly Arg Thr Gly Tyr Asp Asn Arg Glu Ile Val Met 
1               5                   10                  15      


Lys Tyr Ile His Tyr Lys Leu Ser Gln Arg Gly Tyr Glu Trp Asp Ala 
            20                  25                  30          


Gly Asp Val Gly Ala Ala Pro Pro Gly Ala Ala Pro Ala Pro Gly Ile 
        35                  40                  45              


Phe Ser Ser Gln Pro Gly His Thr Pro His Pro Ala Ala Ser Arg Asp 
    50                  55                  60                  


Pro Val Ala Arg Thr Ser Pro Leu Gln Thr Pro Ala Ala Pro Gly Ala 
65                  70                  75                  80  


Ala Ala Gly Pro Ala Leu Ser Pro Val Pro Pro Val Val His Leu Thr 
                85                  90                  95      


Leu Arg Gln Ala Gly Asp Asp Phe Ser Arg Arg Tyr Arg Arg Asp Phe 
            100                 105                 110         


Ala Glu Met Ser Ser Gln Leu His Leu Thr Pro Phe Thr Ala Arg Gly 
        115                 120                 125             


Arg Phe Ala Thr Val Val Glu Glu Leu Phe Arg Asp Gly Val Asn Trp 
    130                 135                 140                 


Gly Arg Ile Val Ala Phe Phe Glu Phe Gly Gly Val Met Cys Val Glu 
145                 150                 155                 160 


Ser Val Asn Arg Glu Met Ser Pro Leu Val Asp Asn Ile Ala Leu Trp 
                165                 170                 175     


Met Thr Glu Tyr Leu Asn Arg His Leu His Thr Trp Ile Gln Asp Asn 
            180                 185                 190         


Gly Gly Trp Asp Ala Phe Val Glu Leu Tyr Gly Pro Ser Met Arg Pro 
        195                 200                 205             


Leu Phe Asp Phe Ser Trp Leu Ser Leu Lys Thr Leu Leu Ser Leu Ala 
    210                 215                 220                 


Leu Val Gly Ala Cys Ile Thr Leu Gly Ala Tyr Leu Gly His Lys 
225                 230                 235                 


<210>  92
<211>  1207
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens B-cell CLL/lymphoma 2 (BCL2), transcript variant 
       beta, mRNA NCBI Reference Sequence: NM_000657.2

<400>  92
tttctgtgaa gcagaagtct gggaatcgat ctggaaatcc tcctaatttt tactccctct       60

ccccgcgact cctgattcat tgggaagttt caaatcagct ataactggag agtgctgaag      120

attgatggga tcgttgcctt atgcatttgt tttggtttta caaaaaggaa acttgacaga      180

ggatcatgct gtacttaaaa aatacaacat cacagaggaa gtagactgat attaacaata      240

cttactaata ataacgtgcc tcatgaaata aagatccgaa aggaattgga ataaaaattt      300

cctgcatctc atgccaaggg ggaaacacca gaatcaagtg ttccgcgtga ttgaagacac      360

cccctcgtcc aagaatgcaa agcacatcca ataaaatagc tggattataa ctcctcttct      420

ttctctgggg gccgtggggt gggagctggg gcgagaggtg ccgttggccc ccgttgcttt      480

tcctctggga aggatggcgc acgctgggag aacagggtac gataaccggg agatagtgat      540

gaagtacatc cattataagc tgtcgcagag gggctacgag tgggatgcgg gagatgtggg      600

cgccgcgccc ccgggggccg cccccgcacc gggcatcttc tcctcccagc ccgggcacac      660

gccccatcca gccgcatccc gggacccggt cgccaggacc tcgccgctgc agaccccggc      720

tgcccccggc gccgccgcgg ggcctgcgct cagcccggtg ccacctgtgg tccacctgac      780

cctccgccag gccggcgacg acttctcccg ccgctaccgc cgcgacttcg ccgagatgtc      840

cagccagctg cacctgacgc ccttcaccgc gcggggacgc tttgccacgg tggtggagga      900

gctcttcagg gacggggtga actgggggag gattgtggcc ttctttgagt tcggtggggt      960

catgtgtgtg gagagcgtca accgggagat gtcgcccctg gtggacaaca tcgccctgtg     1020

gatgactgag tacctgaacc ggcacctgca cacctggatc caggataacg gaggctgggt     1080

aggtgcactt ggtgatgtga gtctgggctg aggccacagg tccgagatgc gggggttgga     1140

gtgcgggtgg gctcctgggg caatgggagg ctgtggagcc ggcgaaataa aatcagagtt     1200

gttgcta                                                               1207


<210>  93
<211>  205
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens B-cell CLL/lymphoma 2 (BCL2), transcript variant 
       beta, polypeptide NCBI Reference Sequence: NM_000657.2

<400>  93

Met Ala His Ala Gly Arg Thr Gly Tyr Asp Asn Arg Glu Ile Val Met 
1               5                   10                  15      


Lys Tyr Ile His Tyr Lys Leu Ser Gln Arg Gly Tyr Glu Trp Asp Ala 
            20                  25                  30          


Gly Asp Val Gly Ala Ala Pro Pro Gly Ala Ala Pro Ala Pro Gly Ile 
        35                  40                  45              


Phe Ser Ser Gln Pro Gly His Thr Pro His Pro Ala Ala Ser Arg Asp 
    50                  55                  60                  


Pro Val Ala Arg Thr Ser Pro Leu Gln Thr Pro Ala Ala Pro Gly Ala 
65                  70                  75                  80  


Ala Ala Gly Pro Ala Leu Ser Pro Val Pro Pro Val Val His Leu Thr 
                85                  90                  95      


Leu Arg Gln Ala Gly Asp Asp Phe Ser Arg Arg Tyr Arg Arg Asp Phe 
            100                 105                 110         


Ala Glu Met Ser Ser Gln Leu His Leu Thr Pro Phe Thr Ala Arg Gly 
        115                 120                 125             


Arg Phe Ala Thr Val Val Glu Glu Leu Phe Arg Asp Gly Val Asn Trp 
    130                 135                 140                 


Gly Arg Ile Val Ala Phe Phe Glu Phe Gly Gly Val Met Cys Val Glu 
145                 150                 155                 160 


Ser Val Asn Arg Glu Met Ser Pro Leu Val Asp Asn Ile Ala Leu Trp 
                165                 170                 175     


Met Thr Glu Tyr Leu Asn Arg His Leu His Thr Trp Ile Gln Asp Asn 
            180                 185                 190         


Gly Gly Trp Val Gly Ala Leu Gly Asp Val Ser Leu Gly 
        195                 200                 205 


<210>  94
<211>  3411
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human ATG 16 mRNA variant 1 GenBank Accession No.: NM_030803.6  
       GI:124256479

<400>  94
actagcgagc gccctgcgta ggcaccggct cctgagcccg tgcttcgggt gagggggcgg       60

gtcttccggc cctctcgaaa atcatttccg gcatgagccg gaagaccgtc ccggatggcc      120

tcggggactg ccagtgtgtg gaggtgagct ccgggattgc cggcattccc gcttctgctg      180

gttgcttcat gctgcaggct gcggccgtca gccctcgctc gcattggtgg cgctgaggtg      240

ccggggcagc aagtgacatg tcgtcgggcc tccgcgccgc tgacttcccc cgctggaagc      300

gccacatctc ggagcaactg aggcgccggg accggctgca gagacaggcg ttcgaggaga      360

tcatcctgca gtataacaaa ttgctggaaa agtcagatct tcattcagtg ttggcccaga      420

aactacaggc tgaaaagcat gacgtaccaa acaggcacga gataagtccc ggacatgatg      480

gcacatggaa tgacaatcag ctacaagaaa tggcccaact gaggattaag caccaagagg      540

aactgactga attacacaag aaacgtgggg agttagctca actggtgatt gacctgaata      600

accaaatgca gcggaaggac agggagatgc agatgaatga agcaaaaatt gcagaatgtt      660

tgcagactat ctctgacctg gagacggagt gcctagacct gcgcactaag ctttgtgacc      720

ttgaaagagc caaccagacc ctgaaggatg aatatgatgc cctgcagatc acttttactg      780

ccttggaggg aaaactgagg aaaactacgg aagagaacca ggagctggtc accagatgga      840

tggctgagaa agcccaggaa gccaatcggc ttaatgcaga gaatgaaaaa gactccagga      900

ggcggcaagc ccggctgcag aaagagcttg cagaagcagc aaaggaacct ctaccagtcg      960

aacaggatga tgacattgag gtcattgtgg atgaaacttc tgatcacaca gaagagacct     1020

ctcctgtgcg agccatcagc agagcagcca ctaagcgact ctcgcagcct gctggaggcc     1080

ttctggattc tatcactaat atctttggga gacgctctgt ctcttccttc ccagtccccc     1140

aggacaatgt ggatactcat cctggttctg gtaaagaagt gagggtacca gctactgcct     1200

tgtgtgtctt cgatgcacat gatggggaag tcaacgctgt gcagttcagt ccaggttccc     1260

ggttactggc cactggaggc atggaccgca gggttaagct ttgggaagta tttggagaaa     1320

aatgtgagtt caagggttcc ctatctggca gtaatgcagg aattacaagc attgaatttg     1380

atagtgctgg atcttacctc ttagcagctt caaatgattt tgcaagccga atctggactg     1440

tggatgatta tcgattacgg cacacactca cgggacacag tgggaaagtg ctgtctgcta     1500

agttcctgct ggacaatgcg cggattgtct caggaagtca cgaccggact ctcaaactct     1560

gggatctacg cagcaaagtc tgcataaaga cagtgtttgc aggatccagt tgcaatgata     1620

ttgtctgcac agagcaatgt gtaatgagtg gacattttga caagaaaatt cgtttctggg     1680

acattcgatc agagagcata gttcgagaga tggagctgtt gggaaagatt actgccctgg     1740

acttaaaccc agaaaggact gagctcctga gctgctcccg tgatgacttg ctaaaagtta     1800

ttgatctccg aacaaatgct atcaagcaga cattcagtgc acctgggttc aagtgcggct     1860

ctgactggac cagagttgtc ttcagccctg atggcagtta cgtggcggca ggctctgctg     1920

agggctctct gtatatctgg agtgtgctca cagggaaagt ggaaaaggtt ctttcaaagc     1980

agcacagctc atccatcaat gcggtggcgt ggtcgccctc tggctcgcac gttgtcagtg     2040

tggacaaagg atgcaaagct gtgctgtggg cacagtactg acggggctct cagggctggg     2100

aggaccccag tgccctcctc agaagaagca catgggctcc tgcagccctg tcctggcagg     2160

tgatgtgctg ggtatagcat ggacctccca gagaagctca agctatgtgg cactgtagct     2220

ttgccgtgaa tgggatttct gaagatttga ctgaggtctc tcttggcctg gaagaataac     2280

actgaaaaaa cctgacgctg cggtcactta gcagaggctc aggttcttgc cttgggaaac     2340

actactagct ctgaccttcc atacctcact tgggggagca cagggccccg ctgggcctcc     2400

tcaccaacgg cagtgccaaa atcagccccc acatcaaggt ggtgttctct gtgctttctc     2460

tcgtccttcc aaagtcggtt ctggcctaac gcatgtccca acaccttggg ttcatttgcc     2520

cggtgaactc actttaagca ttggattaac ggaaactccc gaactacaga cccctccctg     2580

gtgggttgca tgaatgtgtc tcattactgc tgaaatgtcc tcacatctct ttcactgttc     2640

ttcagagctt tctggctctc tttcccccac aaaattcgac atatttaaaa atctccgtgt     2700

ggctttaaaa aatggttttt tgtttttttg tttttttgag gtgggagagg atgtgtgaaa     2760

atcttttcca gggaaatggg ttcgctgcag aggtaaggat gtgttcctgt atcgatctgc     2820

agacacccag aaggtgggtg cacactgcat gcttgggggt gccaagggat tcgagacctc     2880

caacatactt gtctgaaggt ggtgattctg gccatggccc ctctgccaag cctgtgtgcg     2940

atgcccttgg tgctttagtg caagaagcct aggctcagaa gcacagcagc gccatctttc     3000

cgtttcaggg gttgtgatga aggccaagga aaaacattta tctttactat tttacctacg     3060

tataaagttt tagttcattg ggtgtgcgaa acaccctttt tatcactttt aaatttgcac     3120

tttatttttt ttcttccatg cttgttctct ggacatttgg ggatgtgagt gttagagctg     3180

gtgagagagg agtcaggtgg ccttcccacc gatggtcctg gcctccacct gccctctctt     3240

ccctgcctga tcaccgcttt ccaatttgcc cttcagagaa cttaagtcaa ggagagttga     3300

aattcacagg ccagggcaca tcttttattt atttcattat gttggccaac agaacttgat     3360

tgtaaataat aataaagaaa tctgttatat acttttcaaa ctccaaaaaa a              3411


<210>  95
<211>  607
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human ATG 16 Isoform 1 polypeptide encoded by mRNA variant 1 
       GenBank Accession No.: NP_110430.5  GI:124256480

<400>  95

Met Ser Ser Gly Leu Arg Ala Ala Asp Phe Pro Arg Trp Lys Arg His 
1               5                   10                  15      


Ile Ser Glu Gln Leu Arg Arg Arg Asp Arg Leu Gln Arg Gln Ala Phe 
            20                  25                  30          


Glu Glu Ile Ile Leu Gln Tyr Asn Lys Leu Leu Glu Lys Ser Asp Leu 
        35                  40                  45              


His Ser Val Leu Ala Gln Lys Leu Gln Ala Glu Lys His Asp Val Pro 
    50                  55                  60                  


Asn Arg His Glu Ile Ser Pro Gly His Asp Gly Thr Trp Asn Asp Asn 
65                  70                  75                  80  


Gln Leu Gln Glu Met Ala Gln Leu Arg Ile Lys His Gln Glu Glu Leu 
                85                  90                  95      


Thr Glu Leu His Lys Lys Arg Gly Glu Leu Ala Gln Leu Val Ile Asp 
            100                 105                 110         


Leu Asn Asn Gln Met Gln Arg Lys Asp Arg Glu Met Gln Met Asn Glu 
        115                 120                 125             


Ala Lys Ile Ala Glu Cys Leu Gln Thr Ile Ser Asp Leu Glu Thr Glu 
    130                 135                 140                 


Cys Leu Asp Leu Arg Thr Lys Leu Cys Asp Leu Glu Arg Ala Asn Gln 
145                 150                 155                 160 


Thr Leu Lys Asp Glu Tyr Asp Ala Leu Gln Ile Thr Phe Thr Ala Leu 
                165                 170                 175     


Glu Gly Lys Leu Arg Lys Thr Thr Glu Glu Asn Gln Glu Leu Val Thr 
            180                 185                 190         


Arg Trp Met Ala Glu Lys Ala Gln Glu Ala Asn Arg Leu Asn Ala Glu 
        195                 200                 205             


Asn Glu Lys Asp Ser Arg Arg Arg Gln Ala Arg Leu Gln Lys Glu Leu 
    210                 215                 220                 


Ala Glu Ala Ala Lys Glu Pro Leu Pro Val Glu Gln Asp Asp Asp Ile 
225                 230                 235                 240 


Glu Val Ile Val Asp Glu Thr Ser Asp His Thr Glu Glu Thr Ser Pro 
                245                 250                 255     


Val Arg Ala Ile Ser Arg Ala Ala Thr Lys Arg Leu Ser Gln Pro Ala 
            260                 265                 270         


Gly Gly Leu Leu Asp Ser Ile Thr Asn Ile Phe Gly Arg Arg Ser Val 
        275                 280                 285             


Ser Ser Phe Pro Val Pro Gln Asp Asn Val Asp Thr His Pro Gly Ser 
    290                 295                 300                 


Gly Lys Glu Val Arg Val Pro Ala Thr Ala Leu Cys Val Phe Asp Ala 
305                 310                 315                 320 


His Asp Gly Glu Val Asn Ala Val Gln Phe Ser Pro Gly Ser Arg Leu 
                325                 330                 335     


Leu Ala Thr Gly Gly Met Asp Arg Arg Val Lys Leu Trp Glu Val Phe 
            340                 345                 350         


Gly Glu Lys Cys Glu Phe Lys Gly Ser Leu Ser Gly Ser Asn Ala Gly 
        355                 360                 365             


Ile Thr Ser Ile Glu Phe Asp Ser Ala Gly Ser Tyr Leu Leu Ala Ala 
    370                 375                 380                 


Ser Asn Asp Phe Ala Ser Arg Ile Trp Thr Val Asp Asp Tyr Arg Leu 
385                 390                 395                 400 


Arg His Thr Leu Thr Gly His Ser Gly Lys Val Leu Ser Ala Lys Phe 
                405                 410                 415     


Leu Leu Asp Asn Ala Arg Ile Val Ser Gly Ser His Asp Arg Thr Leu 
            420                 425                 430         


Lys Leu Trp Asp Leu Arg Ser Lys Val Cys Ile Lys Thr Val Phe Ala 
        435                 440                 445             


Gly Ser Ser Cys Asn Asp Ile Val Cys Thr Glu Gln Cys Val Met Ser 
    450                 455                 460                 


Gly His Phe Asp Lys Lys Ile Arg Phe Trp Asp Ile Arg Ser Glu Ser 
465                 470                 475                 480 


Ile Val Arg Glu Met Glu Leu Leu Gly Lys Ile Thr Ala Leu Asp Leu 
                485                 490                 495     


Asn Pro Glu Arg Thr Glu Leu Leu Ser Cys Ser Arg Asp Asp Leu Leu 
            500                 505                 510         


Lys Val Ile Asp Leu Arg Thr Asn Ala Ile Lys Gln Thr Phe Ser Ala 
        515                 520                 525             


Pro Gly Phe Lys Cys Gly Ser Asp Trp Thr Arg Val Val Phe Ser Pro 
    530                 535                 540                 


Asp Gly Ser Tyr Val Ala Ala Gly Ser Ala Glu Gly Ser Leu Tyr Ile 
545                 550                 555                 560 


Trp Ser Val Leu Thr Gly Lys Val Glu Lys Val Leu Ser Lys Gln His 
                565                 570                 575     


Ser Ser Ser Ile Asn Ala Val Ala Trp Ser Pro Ser Gly Ser His Val 
            580                 585                 590         


Val Ser Val Asp Lys Gly Cys Lys Ala Val Leu Trp Ala Gln Tyr 
        595                 600                 605         


<210>  96
<211>  3354
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 16-like 1 (S. cerevisiae) 
       (ATG16L1), transcript variant 2, mRNA NCBI Reference Sequence: 
       NM_017974.3

<400>  96
actagcgagc gccctgcgta ggcaccggct cctgagcccg tgcttcgggt gagggggcgg       60

gtcttccggc cctctcgaaa atcatttccg gcatgagccg gaagaccgtc ccggatggcc      120

tcggggactg ccagtgtgtg gaggtgagct ccgggattgc cggcattccc gcttctgctg      180

gttgcttcat gctgcaggct gcggccgtca gccctcgctc gcattggtgg cgctgaggtg      240

ccggggcagc aagtgacatg tcgtcgggcc tccgcgccgc tgacttcccc cgctggaagc      300

gccacatctc ggagcaactg aggcgccggg accggctgca gagacaggcg ttcgaggaga      360

tcatcctgca gtataacaaa ttgctggaaa agtcagatct tcattcagtg ttggcccaga      420

aactacaggc tgaaaagcat gacgtaccaa acaggcacga gataagtccc ggacatgatg      480

gcacatggaa tgacaatcag ctacaagaaa tggcccaact gaggattaag caccaagagg      540

aactgactga attacacaag aaacgtgggg agttagctca actggtgatt gacctgaata      600

accaaatgca gcggaaggac agggagatgc agatgaatga agcaaaaatt gcagaatgtt      660

tgcagactat ctctgacctg gagacggagt gcctagacct gcgcactaag ctttgtgacc      720

ttgaaagagc caaccagacc ctgaaggatg aatatgatgc cctgcagatc acttttactg      780

ccttggaggg aaaactgagg aaaactacgg aagagaacca ggagctggtc accagatgga      840

tggctgagaa agcccaggaa gccaatcggc ttaatgcaga gaatgaaaaa gactccagga      900

ggcggcaagc ccggctgcag aaagagcttg cagaagcagc aaaggaacct ctaccagtcg      960

aacaggatga tgacattgag gtcattgtgg atgaaacttc tgatcacaca gaagagacct     1020

ctcctgtgcg agccatcagc agagcagcca cgagacgctc tgtctcttcc ttcccagtcc     1080

cccaggacaa tgtggatact catcctggtt ctggtaaaga agtgagggta ccagctactg     1140

ccttgtgtgt cttcgatgca catgatgggg aagtcaacgc tgtgcagttc agtccaggtt     1200

cccggttact ggccactgga ggcatggacc gcagggttaa gctttgggaa gtatttggag     1260

aaaaatgtga gttcaagggt tccctatctg gcagtaatgc aggaattaca agcattgaat     1320

ttgatagtgc tggatcttac ctcttagcag cttcaaatga ttttgcaagc cgaatctgga     1380

ctgtggatga ttatcgatta cggcacacac tcacgggaca cagtgggaaa gtgctgtctg     1440

ctaagttcct gctggacaat gcgcggattg tctcaggaag tcacgaccgg actctcaaac     1500

tctgggatct acgcagcaaa gtctgcataa agacagtgtt tgcaggatcc agttgcaatg     1560

atattgtctg cacagagcaa tgtgtaatga gtggacattt tgacaagaaa attcgtttct     1620

gggacattcg atcagagagc atagttcgag agatggagct gttgggaaag attactgccc     1680

tggacttaaa cccagaaagg actgagctcc tgagctgctc ccgtgatgac ttgctaaaag     1740

ttattgatct ccgaacaaat gctatcaagc agacattcag tgcacctggg ttcaagtgcg     1800

gctctgactg gaccagagtt gtcttcagcc ctgatggcag ttacgtggcg gcaggctctg     1860

ctgagggctc tctgtatatc tggagtgtgc tcacagggaa agtggaaaag gttctttcaa     1920

agcagcacag ctcatccatc aatgcggtgg cgtggtcgcc ctctggctcg cacgttgtca     1980

gtgtggacaa aggatgcaaa gctgtgctgt gggcacagta ctgacggggc tctcagggct     2040

gggaggaccc cagtgccctc ctcagaagaa gcacatgggc tcctgcagcc ctgtcctggc     2100

aggtgatgtg ctgggtatag catggacctc ccagagaagc tcaagctatg tggcactgta     2160

gctttgccgt gaatgggatt tctgaagatt tgactgaggt ctctcttggc ctggaagaat     2220

aacactgaaa aaacctgacg ctgcggtcac ttagcagagg ctcaggttct tgccttggga     2280

aacactacta gctctgacct tccatacctc acttggggga gcacagggcc ccgctgggcc     2340

tcctcaccaa cggcagtgcc aaaatcagcc cccacatcaa ggtggtgttc tctgtgcttt     2400

ctctcgtcct tccaaagtcg gttctggcct aacgcatgtc ccaacacctt gggttcattt     2460

gcccggtgaa ctcactttaa gcattggatt aacggaaact cccgaactac agacccctcc     2520

ctggtgggtt gcatgaatgt gtctcattac tgctgaaatg tcctcacatc tctttcactg     2580

ttcttcagag ctttctggct ctctttcccc cacaaaattc gacatattta aaaatctccg     2640

tgtggcttta aaaaatggtt ttttgttttt ttgttttttt gaggtgggag aggatgtgtg     2700

aaaatctttt ccagggaaat gggttcgctg cagaggtaag gatgtgttcc tgtatcgatc     2760

tgcagacacc cagaaggtgg gtgcacactg catgcttggg ggtgccaagg gattcgagac     2820

ctccaacata cttgtctgaa ggtggtgatt ctggccatgg cccctctgcc aagcctgtgt     2880

gcgatgccct tggtgcttta gtgcaagaag cctaggctca gaagcacagc agcgccatct     2940

ttccgtttca ggggttgtga tgaaggccaa ggaaaaacat ttatctttac tattttacct     3000

acgtataaag ttttagttca ttgggtgtgc gaaacaccct ttttatcact tttaaatttg     3060

cactttattt tttttcttcc atgcttgttc tctggacatt tggggatgtg agtgttagag     3120

ctggtgagag aggagtcagg tggccttccc accgatggtc ctggcctcca cctgccctct     3180

cttccctgcc tgatcaccgc tttccaattt gcccttcaga gaacttaagt caaggagagt     3240

tgaaattcac aggccagggc acatctttta tttatttcat tatgttggcc aacagaactt     3300

gattgtaaat aataataaag aaatctgtta tatacttttc aaactccaaa aaaa           3354


<210>  97
<211>  588
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 16-like 1 (S. cerevisiae) 
       (ATG16L1), transcript variant 2, polypeptide NCBI Reference 
       Sequence: NM_017974.3

<400>  97

Met Ser Ser Gly Leu Arg Ala Ala Asp Phe Pro Arg Trp Lys Arg His 
1               5                   10                  15      


Ile Ser Glu Gln Leu Arg Arg Arg Asp Arg Leu Gln Arg Gln Ala Phe 
            20                  25                  30          


Glu Glu Ile Ile Leu Gln Tyr Asn Lys Leu Leu Glu Lys Ser Asp Leu 
        35                  40                  45              


His Ser Val Leu Ala Gln Lys Leu Gln Ala Glu Lys His Asp Val Pro 
    50                  55                  60                  


Asn Arg His Glu Ile Ser Pro Gly His Asp Gly Thr Trp Asn Asp Asn 
65                  70                  75                  80  


Gln Leu Gln Glu Met Ala Gln Leu Arg Ile Lys His Gln Glu Glu Leu 
                85                  90                  95      


Thr Glu Leu His Lys Lys Arg Gly Glu Leu Ala Gln Leu Val Ile Asp 
            100                 105                 110         


Leu Asn Asn Gln Met Gln Arg Lys Asp Arg Glu Met Gln Met Asn Glu 
        115                 120                 125             


Ala Lys Ile Ala Glu Cys Leu Gln Thr Ile Ser Asp Leu Glu Thr Glu 
    130                 135                 140                 


Cys Leu Asp Leu Arg Thr Lys Leu Cys Asp Leu Glu Arg Ala Asn Gln 
145                 150                 155                 160 


Thr Leu Lys Asp Glu Tyr Asp Ala Leu Gln Ile Thr Phe Thr Ala Leu 
                165                 170                 175     


Glu Gly Lys Leu Arg Lys Thr Thr Glu Glu Asn Gln Glu Leu Val Thr 
            180                 185                 190         


Arg Trp Met Ala Glu Lys Ala Gln Glu Ala Asn Arg Leu Asn Ala Glu 
        195                 200                 205             


Asn Glu Lys Asp Ser Arg Arg Arg Gln Ala Arg Leu Gln Lys Glu Leu 
    210                 215                 220                 


Ala Glu Ala Ala Lys Glu Pro Leu Pro Val Glu Gln Asp Asp Asp Ile 
225                 230                 235                 240 


Glu Val Ile Val Asp Glu Thr Ser Asp His Thr Glu Glu Thr Ser Pro 
                245                 250                 255     


Val Arg Ala Ile Ser Arg Ala Ala Thr Arg Arg Ser Val Ser Ser Phe 
            260                 265                 270         


Pro Val Pro Gln Asp Asn Val Asp Thr His Pro Gly Ser Gly Lys Glu 
        275                 280                 285             


Val Arg Val Pro Ala Thr Ala Leu Cys Val Phe Asp Ala His Asp Gly 
    290                 295                 300                 


Glu Val Asn Ala Val Gln Phe Ser Pro Gly Ser Arg Leu Leu Ala Thr 
305                 310                 315                 320 


Gly Gly Met Asp Arg Arg Val Lys Leu Trp Glu Val Phe Gly Glu Lys 
                325                 330                 335     


Cys Glu Phe Lys Gly Ser Leu Ser Gly Ser Asn Ala Gly Ile Thr Ser 
            340                 345                 350         


Ile Glu Phe Asp Ser Ala Gly Ser Tyr Leu Leu Ala Ala Ser Asn Asp 
        355                 360                 365             


Phe Ala Ser Arg Ile Trp Thr Val Asp Asp Tyr Arg Leu Arg His Thr 
    370                 375                 380                 


Leu Thr Gly His Ser Gly Lys Val Leu Ser Ala Lys Phe Leu Leu Asp 
385                 390                 395                 400 


Asn Ala Arg Ile Val Ser Gly Ser His Asp Arg Thr Leu Lys Leu Trp 
                405                 410                 415     


Asp Leu Arg Ser Lys Val Cys Ile Lys Thr Val Phe Ala Gly Ser Ser 
            420                 425                 430         


Cys Asn Asp Ile Val Cys Thr Glu Gln Cys Val Met Ser Gly His Phe 
        435                 440                 445             


Asp Lys Lys Ile Arg Phe Trp Asp Ile Arg Ser Glu Ser Ile Val Arg 
    450                 455                 460                 


Glu Met Glu Leu Leu Gly Lys Ile Thr Ala Leu Asp Leu Asn Pro Glu 
465                 470                 475                 480 


Arg Thr Glu Leu Leu Ser Cys Ser Arg Asp Asp Leu Leu Lys Val Ile 
                485                 490                 495     


Asp Leu Arg Thr Asn Ala Ile Lys Gln Thr Phe Ser Ala Pro Gly Phe 
            500                 505                 510         


Lys Cys Gly Ser Asp Trp Thr Arg Val Val Phe Ser Pro Asp Gly Ser 
        515                 520                 525             


Tyr Val Ala Ala Gly Ser Ala Glu Gly Ser Leu Tyr Ile Trp Ser Val 
    530                 535                 540                 


Leu Thr Gly Lys Val Glu Lys Val Leu Ser Lys Gln His Ser Ser Ser 
545                 550                 555                 560 


Ile Asn Ala Val Ala Trp Ser Pro Ser Gly Ser His Val Val Ser Val 
                565                 570                 575     


Asp Lys Gly Cys Lys Ala Val Leu Trp Ala Gln Tyr 
            580                 585             


<210>  98
<211>  2922
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 16-like 1 (S. cerevisiae) 
       (ATG16L1), transcript variant 3, mRNA NCBI Reference Sequence: 
       NM_198890.2

<400>  98
actagcgagc gccctgcgta ggcaccggct cctgagcccg tgcttcgggt gagggggcgg       60

gtcttccggc cctctcgaaa atcatttccg gcatgagccg gaagaccgtc ccggatggcc      120

tcggggactg ccagtgtgtg gaggtgagct ccgggattgc cggcattccc gcttctgctg      180

gttgcttcat gctgcaggct gcggccgtca gccctcgctc gcattggtgg cgctgaggtg      240

ccggggcagc aagtgacatg tcgtcgggcc tccgcgccgc tgacttcccc cgctggaagc      300

gccacatctc ggagcaactg aggcgccggg accggctgca gagacaggcg ttcgaggaga      360

tcatcctgca gtataacaaa ttgctggaaa agtcagatct tcattcagtg ttggcccaga      420

aactacaggc tgaaaagcat gacgtaccaa acaggcacga gataaggagg cggcaagccc      480

ggctgcagaa agagcttgca gaagcagcaa aggaacctct accagtcgaa caggatgatg      540

acattgaggt cattgtggat gaaacttctg atcacacaga agagacctct cctgtgcgag      600

ccatcagcag agcagccacg agacgctctg tctcttcctt cccagtcccc caggacaatg      660

tggatactca tcctggttct ggtaaagaag tgagggtacc agctactgcc ttgtgtgtct      720

tcgatgcaca tgatggggaa gtcaacgctg tgcagttcag tccaggttcc cggttactgg      780

ccactggagg catggaccgc agggttaagc tttgggaagt atttggagaa aaatgtgagt      840

tcaagggttc cctatctggc agtaatgcag gaattacaag cattgaattt gatagtgctg      900

gatcttacct cttagcagct tcaaatgatt ttgcaagccg aatctggact gtggatgatt      960

atcgattacg gcacacactc acgggacaca gtgggaaagt gctgtctgct aagttcctgc     1020

tggacaatgc gcggattgtc tcaggaagtc acgaccggac tctcaaactc tgggatctac     1080

gcagcaaagt ctgcataaag acagtgtttg caggatccag ttgcaatgat attgtctgca     1140

cagagcaatg tgtaatgagt ggacattttg acaagaaaat tcgtttctgg gacattcgat     1200

cagagagcat agttcgagag atggagctgt tgggaaagat tactgccctg gacttaaacc     1260

cagaaaggac tgagctcctg agctgctccc gtgatgactt gctaaaagtt attgatctcc     1320

gaacaaatgc tatcaagcag acattcagtg cacctgggtt caagtgcggc tctgactgga     1380

ccagagttgt cttcagccct gatggcagtt acgtggcggc aggctctgct gagggctctc     1440

tgtatatctg gagtgtgctc acagggaaag tggaaaaggt tctttcaaag cagcacagct     1500

catccatcaa tgcggtggcg tggtcgccct ctggctcgca cgttgtcagt gtggacaaag     1560

gatgcaaagc tgtgctgtgg gcacagtact gacggggctc tcagggctgg gaggacccca     1620

gtgccctcct cagaagaagc acatgggctc ctgcagccct gtcctggcag gtgatgtgct     1680

gggtatagca tggacctccc agagaagctc aagctatgtg gcactgtagc tttgccgtga     1740

atgggatttc tgaagatttg actgaggtct ctcttggcct ggaagaataa cactgaaaaa     1800

acctgacgct gcggtcactt agcagaggct caggttcttg ccttgggaaa cactactagc     1860

tctgaccttc catacctcac ttgggggagc acagggcccc gctgggcctc ctcaccaacg     1920

gcagtgccaa aatcagcccc cacatcaagg tggtgttctc tgtgctttct ctcgtccttc     1980

caaagtcggt tctggcctaa cgcatgtccc aacaccttgg gttcatttgc ccggtgaact     2040

cactttaagc attggattaa cggaaactcc cgaactacag acccctccct ggtgggttgc     2100

atgaatgtgt ctcattactg ctgaaatgtc ctcacatctc tttcactgtt cttcagagct     2160

ttctggctct ctttccccca caaaattcga catatttaaa aatctccgtg tggctttaaa     2220

aaatggtttt ttgttttttt gtttttttga ggtgggagag gatgtgtgaa aatcttttcc     2280

agggaaatgg gttcgctgca gaggtaagga tgtgttcctg tatcgatctg cagacaccca     2340

gaaggtgggt gcacactgca tgcttggggg tgccaaggga ttcgagacct ccaacatact     2400

tgtctgaagg tggtgattct ggccatggcc cctctgccaa gcctgtgtgc gatgcccttg     2460

gtgctttagt gcaagaagcc taggctcaga agcacagcag cgccatcttt ccgtttcagg     2520

ggttgtgatg aaggccaagg aaaaacattt atctttacta ttttacctac gtataaagtt     2580

ttagttcatt gggtgtgcga aacacccttt ttatcacttt taaatttgca ctttattttt     2640

tttcttccat gcttgttctc tggacatttg gggatgtgag tgttagagct ggtgagagag     2700

gagtcaggtg gccttcccac cgatggtcct ggcctccacc tgccctctct tccctgcctg     2760

atcaccgctt tccaatttgc ccttcagaga acttaagtca aggagagttg aaattcacag     2820

gccagggcac atcttttatt tatttcatta tgttggccaa cagaacttga ttgtaaataa     2880

taataaagaa atctgttata tacttttcaa actccaaaaa aa                        2922


<210>  99
<211>  444
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 16-like 1 (S. cerevisiae) 
       (ATG16L1), transcript variant 3, polypeptide NCBI Reference 
       Sequence: NM_198890.2

<400>  99

Met Ser Ser Gly Leu Arg Ala Ala Asp Phe Pro Arg Trp Lys Arg His 
1               5                   10                  15      


Ile Ser Glu Gln Leu Arg Arg Arg Asp Arg Leu Gln Arg Gln Ala Phe 
            20                  25                  30          


Glu Glu Ile Ile Leu Gln Tyr Asn Lys Leu Leu Glu Lys Ser Asp Leu 
        35                  40                  45              


His Ser Val Leu Ala Gln Lys Leu Gln Ala Glu Lys His Asp Val Pro 
    50                  55                  60                  


Asn Arg His Glu Ile Arg Arg Arg Gln Ala Arg Leu Gln Lys Glu Leu 
65                  70                  75                  80  


Ala Glu Ala Ala Lys Glu Pro Leu Pro Val Glu Gln Asp Asp Asp Ile 
                85                  90                  95      


Glu Val Ile Val Asp Glu Thr Ser Asp His Thr Glu Glu Thr Ser Pro 
            100                 105                 110         


Val Arg Ala Ile Ser Arg Ala Ala Thr Arg Arg Ser Val Ser Ser Phe 
        115                 120                 125             


Pro Val Pro Gln Asp Asn Val Asp Thr His Pro Gly Ser Gly Lys Glu 
    130                 135                 140                 


Val Arg Val Pro Ala Thr Ala Leu Cys Val Phe Asp Ala His Asp Gly 
145                 150                 155                 160 


Glu Val Asn Ala Val Gln Phe Ser Pro Gly Ser Arg Leu Leu Ala Thr 
                165                 170                 175     


Gly Gly Met Asp Arg Arg Val Lys Leu Trp Glu Val Phe Gly Glu Lys 
            180                 185                 190         


Cys Glu Phe Lys Gly Ser Leu Ser Gly Ser Asn Ala Gly Ile Thr Ser 
        195                 200                 205             


Ile Glu Phe Asp Ser Ala Gly Ser Tyr Leu Leu Ala Ala Ser Asn Asp 
    210                 215                 220                 


Phe Ala Ser Arg Ile Trp Thr Val Asp Asp Tyr Arg Leu Arg His Thr 
225                 230                 235                 240 


Leu Thr Gly His Ser Gly Lys Val Leu Ser Ala Lys Phe Leu Leu Asp 
                245                 250                 255     


Asn Ala Arg Ile Val Ser Gly Ser His Asp Arg Thr Leu Lys Leu Trp 
            260                 265                 270         


Asp Leu Arg Ser Lys Val Cys Ile Lys Thr Val Phe Ala Gly Ser Ser 
        275                 280                 285             


Cys Asn Asp Ile Val Cys Thr Glu Gln Cys Val Met Ser Gly His Phe 
    290                 295                 300                 


Asp Lys Lys Ile Arg Phe Trp Asp Ile Arg Ser Glu Ser Ile Val Arg 
305                 310                 315                 320 


Glu Met Glu Leu Leu Gly Lys Ile Thr Ala Leu Asp Leu Asn Pro Glu 
                325                 330                 335     


Arg Thr Glu Leu Leu Ser Cys Ser Arg Asp Asp Leu Leu Lys Val Ile 
            340                 345                 350         


Asp Leu Arg Thr Asn Ala Ile Lys Gln Thr Phe Ser Ala Pro Gly Phe 
        355                 360                 365             


Lys Cys Gly Ser Asp Trp Thr Arg Val Val Phe Ser Pro Asp Gly Ser 
    370                 375                 380                 


Tyr Val Ala Ala Gly Ser Ala Glu Gly Ser Leu Tyr Ile Trp Ser Val 
385                 390                 395                 400 


Leu Thr Gly Lys Val Glu Lys Val Leu Ser Lys Gln His Ser Ser Ser 
                405                 410                 415     


Ile Asn Ala Val Ala Trp Ser Pro Ser Gly Ser His Val Val Ser Val 
            420                 425                 430         


Asp Lys Gly Cys Lys Ala Val Leu Trp Ala Gln Tyr 
        435                 440                 


<210>  100
<211>  3409
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 16-like 1 (S. cerevisiae) 
       (ATG16L1), transcript variant 4, mRNA NCBI Reference Sequence: 
       NM_001190266.1

<400>  100
actagcgagc gccctgcgta ggcaccggct cctgagcccg tgcttcgggt gagggggcgg       60

gtcttccggc cctctcgaaa atcatttccg gcatgagccg gaagaccgtc ccggatggcc      120

tcggggactg ccagtgtgtg gaggtgagct ccgggattgc cggcattccc gcttctgctg      180

gttgcttcat gctgcaggct gcggccgtca gccctcgctc gcattggtgg cgctgaggtg      240

ccggggcagc aagtgacatg tcgtcgggcc tccgcgccgc tgacttcccc cgctggaagc      300

gccacatctc ggagcaactg aggcgccggg accggctgca gagacaggcg ttcgaggaga      360

tcatcctgca ataacaaatt gctggaaaag tcagatcttc attcagtgtt ggcccagaaa      420

ctacaggctg aaaagcatga cgtaccaaac aggcacgaga taagtcccgg acatgatggc      480

acatggaatg acaatcagct acaagaaatg gcccaactga ggattaagca ccaagaggaa      540

ctgactgaat tacacaagaa acgtggggag ttagctcaac tggtgattga cctgaataac      600

caaatgcagc ggaaggacag ggagatgcag atgaatgaag caaaaattgc agaatgtttg      660

cagactatct ctgacctgga gacggagtgc ctagacctgc gcactaagct ttgtgacctt      720

gaaagagcca accagaccct gaaggatgaa tatgatgccc tgcagatcac ttttactgcc      780

ttggagggaa aactgaggaa aactacggaa gagaaccagg agctggtcac cagatggatg      840

gctgagaaag cccaggaagc caatcggctt aatgcagaga atgaaaaaga ctccaggagg      900

cggcaagccc ggctgcagaa agagcttgca gaagcagcaa aggaacctct accagtcgaa      960

caggatgatg acattgaggt cattgtggat gaaacttctg atcacacaga agagacctct     1020

cctgtgcgag ccatcagcag agcagccact aagcgactct cgcagcctgc tggaggcctt     1080

ctggattcta tcactaatat ctttgggaga cgctctgtct cttccttccc agtcccccag     1140

gacaatgtgg atactcatcc tggttctggt aaagaagtga gggtaccagc tactgccttg     1200

tgtgtcttcg atgcacatga tggggaagtc aacgctgtgc agttcagtcc aggttcccgg     1260

ttactggcca ctggaggcat ggaccgcagg gttaagcttt gggaagtatt tggagaaaaa     1320

tgtgagttca agggttccct atctggcagt aatgcaggaa ttacaagcat tgaatttgat     1380

agtgctggat cttacctctt agcagcttca aatgattttg caagccgaat ctggactgtg     1440

gatgattatc gattacggca cacactcacg ggacacagtg ggaaagtgct gtctgctaag     1500

ttcctgctgg acaatgcgcg gattgtctca ggaagtcacg accggactct caaactctgg     1560

gatctacgca gcaaagtctg cataaagaca gtgtttgcag gatccagttg caatgatatt     1620

gtctgcacag agcaatgtgt aatgagtgga cattttgaca agaaaattcg tttctgggac     1680

attcgatcag agagcatagt tcgagagatg gagctgttgg gaaagattac tgccctggac     1740

ttaaacccag aaaggactga gctcctgagc tgctcccgtg atgacttgct aaaagttatt     1800

gatctccgaa caaatgctat caagcagaca ttcagtgcac ctgggttcaa gtgcggctct     1860

gactggacca gagttgtctt cagccctgat ggcagttacg tggcggcagg ctctgctgag     1920

ggctctctgt atatctggag tgtgctcaca gggaaagtgg aaaaggttct ttcaaagcag     1980

cacagctcat ccatcaatgc ggtggcgtgg tcgccctctg gctcgcacgt tgtcagtgtg     2040

gacaaaggat gcaaagctgt gctgtgggca cagtactgac ggggctctca gggctgggag     2100

gaccccagtg ccctcctcag aagaagcaca tgggctcctg cagccctgtc ctggcaggtg     2160

atgtgctggg tatagcatgg acctcccaga gaagctcaag ctatgtggca ctgtagcttt     2220

gccgtgaatg ggatttctga agatttgact gaggtctctc ttggcctgga agaataacac     2280

tgaaaaaacc tgacgctgcg gtcacttagc agaggctcag gttcttgcct tgggaaacac     2340

tactagctct gaccttccat acctcacttg ggggagcaca gggccccgct gggcctcctc     2400

accaacggca gtgccaaaat cagcccccac atcaaggtgg tgttctctgt gctttctctc     2460

gtccttccaa agtcggttct ggcctaacgc atgtcccaac accttgggtt catttgcccg     2520

gtgaactcac tttaagcatt ggattaacgg aaactcccga actacagacc cctccctggt     2580

gggttgcatg aatgtgtctc attactgctg aaatgtcctc acatctcttt cactgttctt     2640

cagagctttc tggctctctt tcccccacaa aattcgacat atttaaaaat ctccgtgtgg     2700

ctttaaaaaa tggttttttg tttttttgtt tttttgaggt gggagaggat gtgtgaaaat     2760

cttttccagg gaaatgggtt cgctgcagag gtaaggatgt gttcctgtat cgatctgcag     2820

acacccagaa ggtgggtgca cactgcatgc ttgggggtgc caagggattc gagacctcca     2880

acatacttgt ctgaaggtgg tgattctggc catggcccct ctgccaagcc tgtgtgcgat     2940

gcccttggtg ctttagtgca agaagcctag gctcagaagc acagcagcgc catctttccg     3000

tttcaggggt tgtgatgaag gccaaggaaa aacatttatc tttactattt tacctacgta     3060

taaagtttta gttcattggg tgtgcgaaac acccttttta tcacttttaa atttgcactt     3120

tatttttttt cttccatgct tgttctctgg acatttgggg atgtgagtgt tagagctggt     3180

gagagaggag tcaggtggcc ttcccaccga tggtcctggc ctccacctgc cctctcttcc     3240

ctgcctgatc accgctttcc aatttgccct tcagagaact taagtcaagg agagttgaaa     3300

ttcacaggcc agggcacatc ttttatttat ttcattatgt tggccaacag aacttgattg     3360

taaataataa taaagaaatc tgttatatac ttttcaaact ccaaaaaaa                 3409


<210>  101
<211>  523
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 16-like 1 (S. cerevisiae) 
       (ATG16L1), transcript variant 4, polypeptide NCBI Reference 
       Sequence: NM_001190266.1

<400>  101

Met Ala Gln Leu Arg Ile Lys His Gln Glu Glu Leu Thr Glu Leu His 
1               5                   10                  15      


Lys Lys Arg Gly Glu Leu Ala Gln Leu Val Ile Asp Leu Asn Asn Gln 
            20                  25                  30          


Met Gln Arg Lys Asp Arg Glu Met Gln Met Asn Glu Ala Lys Ile Ala 
        35                  40                  45              


Glu Cys Leu Gln Thr Ile Ser Asp Leu Glu Thr Glu Cys Leu Asp Leu 
    50                  55                  60                  


Arg Thr Lys Leu Cys Asp Leu Glu Arg Ala Asn Gln Thr Leu Lys Asp 
65                  70                  75                  80  


Glu Tyr Asp Ala Leu Gln Ile Thr Phe Thr Ala Leu Glu Gly Lys Leu 
                85                  90                  95      


Arg Lys Thr Thr Glu Glu Asn Gln Glu Leu Val Thr Arg Trp Met Ala 
            100                 105                 110         


Glu Lys Ala Gln Glu Ala Asn Arg Leu Asn Ala Glu Asn Glu Lys Asp 
        115                 120                 125             


Ser Arg Arg Arg Gln Ala Arg Leu Gln Lys Glu Leu Ala Glu Ala Ala 
    130                 135                 140                 


Lys Glu Pro Leu Pro Val Glu Gln Asp Asp Asp Ile Glu Val Ile Val 
145                 150                 155                 160 


Asp Glu Thr Ser Asp His Thr Glu Glu Thr Ser Pro Val Arg Ala Ile 
                165                 170                 175     


Ser Arg Ala Ala Thr Lys Arg Leu Ser Gln Pro Ala Gly Gly Leu Leu 
            180                 185                 190         


Asp Ser Ile Thr Asn Ile Phe Gly Arg Arg Ser Val Ser Ser Phe Pro 
        195                 200                 205             


Val Pro Gln Asp Asn Val Asp Thr His Pro Gly Ser Gly Lys Glu Val 
    210                 215                 220                 


Arg Val Pro Ala Thr Ala Leu Cys Val Phe Asp Ala His Asp Gly Glu 
225                 230                 235                 240 


Val Asn Ala Val Gln Phe Ser Pro Gly Ser Arg Leu Leu Ala Thr Gly 
                245                 250                 255     


Gly Met Asp Arg Arg Val Lys Leu Trp Glu Val Phe Gly Glu Lys Cys 
            260                 265                 270         


Glu Phe Lys Gly Ser Leu Ser Gly Ser Asn Ala Gly Ile Thr Ser Ile 
        275                 280                 285             


Glu Phe Asp Ser Ala Gly Ser Tyr Leu Leu Ala Ala Ser Asn Asp Phe 
    290                 295                 300                 


Ala Ser Arg Ile Trp Thr Val Asp Asp Tyr Arg Leu Arg His Thr Leu 
305                 310                 315                 320 


Thr Gly His Ser Gly Lys Val Leu Ser Ala Lys Phe Leu Leu Asp Asn 
                325                 330                 335     


Ala Arg Ile Val Ser Gly Ser His Asp Arg Thr Leu Lys Leu Trp Asp 
            340                 345                 350         


Leu Arg Ser Lys Val Cys Ile Lys Thr Val Phe Ala Gly Ser Ser Cys 
        355                 360                 365             


Asn Asp Ile Val Cys Thr Glu Gln Cys Val Met Ser Gly His Phe Asp 
    370                 375                 380                 


Lys Lys Ile Arg Phe Trp Asp Ile Arg Ser Glu Ser Ile Val Arg Glu 
385                 390                 395                 400 


Met Glu Leu Leu Gly Lys Ile Thr Ala Leu Asp Leu Asn Pro Glu Arg 
                405                 410                 415     


Thr Glu Leu Leu Ser Cys Ser Arg Asp Asp Leu Leu Lys Val Ile Asp 
            420                 425                 430         


Leu Arg Thr Asn Ala Ile Lys Gln Thr Phe Ser Ala Pro Gly Phe Lys 
        435                 440                 445             


Cys Gly Ser Asp Trp Thr Arg Val Val Phe Ser Pro Asp Gly Ser Tyr 
    450                 455                 460                 


Val Ala Ala Gly Ser Ala Glu Gly Ser Leu Tyr Ile Trp Ser Val Leu 
465                 470                 475                 480 


Thr Gly Lys Val Glu Lys Val Leu Ser Lys Gln His Ser Ser Ser Ile 
                485                 490                 495     


Asn Ala Val Ala Trp Ser Pro Ser Gly Ser His Val Val Ser Val Asp 
            500                 505                 510         


Lys Gly Cys Lys Ala Val Leu Trp Ala Gln Tyr 
        515                 520             


<210>  102
<211>  3407
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 16-like 1 (S. cerevisiae) 
       (ATG16L1), transcript variant 5, mRNA NCBI Reference Sequence: 
       NM_001190267.1

<400>  102
actagcgagc gccctgcgta ggcaccggct cctgagcccg tgcttcgggt gagggggcgg       60

gtcttccggc cctctcgaaa atcatttccg gcatgagccg gaagaccgtc ccggatggcc      120

tcggggactg ccagtgtgtg gaggtgagct ccgggattgc cggcattccc gcttctgctg      180

gttgcttcat gctgcaggct gcggccgtca gccctcgctc gcattggtgg cgctgaggtg      240

ccggggcagc aagtgacatg tcgtcgggcc tccgcgccgc tgacttcccc cgctggaagc      300

gccacatctc ggagcaactg aggcgccggg accggctgca gagacaggcg ttcgaggaga      360

tcatcctgca gtataacaaa ttgctggaaa agtcagatct tcattcagtg ttggcccaga      420

aactacaggc tgaaaagcat gacgtaccaa acaggcacga gataagtccc ggacatgatg      480

gcacatggaa tgacaatcag ctacaagaaa tggcccaact gaggattaag caccaagagg      540

aactgactga attacacaag aaacgtgggg agctcaactg gtgattgacc tgaataacca      600

aatgcagcgg aaggacaggg agatgcagat gaatgaagca aaaattgcag aatgtttgca      660

gactatctct gacctggaga cggagtgcct agacctgcgc actaagcttt gtgaccttga      720

aagagccaac cagaccctga aggatgaata tgatgccctg cagatcactt ttactgcctt      780

ggagggaaaa ctgaggaaaa ctacggaaga gaaccaggag ctggtcacca gatggatggc      840

tgagaaagcc caggaagcca atcggcttaa tgcagagaat gaaaaagact ccaggaggcg      900

gcaagcccgg ctgcagaaag agcttgcaga agcagcaaag gaacctctac cagtcgaaca      960

ggatgatgac attgaggtca ttgtggatga aacttctgat cacacagaag agacctctcc     1020

tgtgcgagcc atcagcagag cagccactaa gcgactctcg cagcctgctg gaggccttct     1080

ggattctatc actaatatct ttgggagacg ctctgtctct tccttcccag tcccccagga     1140

caatgtggat actcatcctg gttctggtaa agaagtgagg gtaccagcta ctgccttgtg     1200

tgtcttcgat gcacatgatg gggaagtcaa cgctgtgcag ttcagtccag gttcccggtt     1260

actggccact ggaggcatgg accgcagggt taagctttgg gaagtatttg gagaaaaatg     1320

tgagttcaag ggttccctat ctggcagtaa tgcaggaatt acaagcattg aatttgatag     1380

tgctggatct tacctcttag cagcttcaaa tgattttgca agccgaatct ggactgtgga     1440

tgattatcga ttacggcaca cactcacggg acacagtggg aaagtgctgt ctgctaagtt     1500

cctgctggac aatgcgcgga ttgtctcagg aagtcacgac cggactctca aactctggga     1560

tctacgcagc aaagtctgca taaagacagt gtttgcagga tccagttgca atgatattgt     1620

ctgcacagag caatgtgtaa tgagtggaca ttttgacaag aaaattcgtt tctgggacat     1680

tcgatcagag agcatagttc gagagatgga gctgttggga aagattactg ccctggactt     1740

aaacccagaa aggactgagc tcctgagctg ctcccgtgat gacttgctaa aagttattga     1800

tctccgaaca aatgctatca agcagacatt cagtgcacct gggttcaagt gcggctctga     1860

ctggaccaga gttgtcttca gccctgatgg cagttacgtg gcggcaggct ctgctgaggg     1920

ctctctgtat atctggagtg tgctcacagg gaaagtggaa aaggttcttt caaagcagca     1980

cagctcatcc atcaatgcgg tggcgtggtc gccctctggc tcgcacgttg tcagtgtgga     2040

caaaggatgc aaagctgtgc tgtgggcaca gtactgacgg ggctctcagg gctgggagga     2100

ccccagtgcc ctcctcagaa gaagcacatg ggctcctgca gccctgtcct ggcaggtgat     2160

gtgctgggta tagcatggac ctcccagaga agctcaagct atgtggcact gtagctttgc     2220

cgtgaatggg atttctgaag atttgactga ggtctctctt ggcctggaag aataacactg     2280

aaaaaacctg acgctgcggt cacttagcag aggctcaggt tcttgccttg ggaaacacta     2340

ctagctctga ccttccatac ctcacttggg ggagcacagg gccccgctgg gcctcctcac     2400

caacggcagt gccaaaatca gcccccacat caaggtggtg ttctctgtgc tttctctcgt     2460

ccttccaaag tcggttctgg cctaacgcat gtcccaacac cttgggttca tttgcccggt     2520

gaactcactt taagcattgg attaacggaa actcccgaac tacagacccc tccctggtgg     2580

gttgcatgaa tgtgtctcat tactgctgaa atgtcctcac atctctttca ctgttcttca     2640

gagctttctg gctctctttc ccccacaaaa ttcgacatat ttaaaaatct ccgtgtggct     2700

ttaaaaaatg gttttttgtt tttttgtttt tttgaggtgg gagaggatgt gtgaaaatct     2760

tttccaggga aatgggttcg ctgcagaggt aaggatgtgt tcctgtatcg atctgcagac     2820

acccagaagg tgggtgcaca ctgcatgctt gggggtgcca agggattcga gacctccaac     2880

atacttgtct gaaggtggtg attctggcca tggcccctct gccaagcctg tgtgcgatgc     2940

ccttggtgct ttagtgcaag aagcctaggc tcagaagcac agcagcgcca tctttccgtt     3000

tcaggggttg tgatgaaggc caaggaaaaa catttatctt tactatttta cctacgtata     3060

aagttttagt tcattgggtg tgcgaaacac cctttttatc acttttaaat ttgcacttta     3120

ttttttttct tccatgcttg ttctctggac atttggggat gtgagtgtta gagctggtga     3180

gagaggagtc aggtggcctt cccaccgatg gtcctggcct ccacctgccc tctcttccct     3240

gcctgatcac cgctttccaa tttgcccttc agagaactta agtcaaggag agttgaaatt     3300

cacaggccag ggcacatctt ttatttattt cattatgttg gccaacagaa cttgattgta     3360

aataataata aagaaatctg ttatatactt ttcaaactcc aaaaaaa                   3407


<210>  103
<211>  491
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 16-like 1 (S. cerevisiae) 
       (ATG16L1), transcript variant 5, polypeptide NCBI Reference 
       Sequence: NM_001190267.1

<400>  103

Met Gln Arg Lys Asp Arg Glu Met Gln Met Asn Glu Ala Lys Ile Ala 
1               5                   10                  15      


Glu Cys Leu Gln Thr Ile Ser Asp Leu Glu Thr Glu Cys Leu Asp Leu 
            20                  25                  30          


Arg Thr Lys Leu Cys Asp Leu Glu Arg Ala Asn Gln Thr Leu Lys Asp 
        35                  40                  45              


Glu Tyr Asp Ala Leu Gln Ile Thr Phe Thr Ala Leu Glu Gly Lys Leu 
    50                  55                  60                  


Arg Lys Thr Thr Glu Glu Asn Gln Glu Leu Val Thr Arg Trp Met Ala 
65                  70                  75                  80  


Glu Lys Ala Gln Glu Ala Asn Arg Leu Asn Ala Glu Asn Glu Lys Asp 
                85                  90                  95      


Ser Arg Arg Arg Gln Ala Arg Leu Gln Lys Glu Leu Ala Glu Ala Ala 
            100                 105                 110         


Lys Glu Pro Leu Pro Val Glu Gln Asp Asp Asp Ile Glu Val Ile Val 
        115                 120                 125             


Asp Glu Thr Ser Asp His Thr Glu Glu Thr Ser Pro Val Arg Ala Ile 
    130                 135                 140                 


Ser Arg Ala Ala Thr Lys Arg Leu Ser Gln Pro Ala Gly Gly Leu Leu 
145                 150                 155                 160 


Asp Ser Ile Thr Asn Ile Phe Gly Arg Arg Ser Val Ser Ser Phe Pro 
                165                 170                 175     


Val Pro Gln Asp Asn Val Asp Thr His Pro Gly Ser Gly Lys Glu Val 
            180                 185                 190         


Arg Val Pro Ala Thr Ala Leu Cys Val Phe Asp Ala His Asp Gly Glu 
        195                 200                 205             


Val Asn Ala Val Gln Phe Ser Pro Gly Ser Arg Leu Leu Ala Thr Gly 
    210                 215                 220                 


Gly Met Asp Arg Arg Val Lys Leu Trp Glu Val Phe Gly Glu Lys Cys 
225                 230                 235                 240 


Glu Phe Lys Gly Ser Leu Ser Gly Ser Asn Ala Gly Ile Thr Ser Ile 
                245                 250                 255     


Glu Phe Asp Ser Ala Gly Ser Tyr Leu Leu Ala Ala Ser Asn Asp Phe 
            260                 265                 270         


Ala Ser Arg Ile Trp Thr Val Asp Asp Tyr Arg Leu Arg His Thr Leu 
        275                 280                 285             


Thr Gly His Ser Gly Lys Val Leu Ser Ala Lys Phe Leu Leu Asp Asn 
    290                 295                 300                 


Ala Arg Ile Val Ser Gly Ser His Asp Arg Thr Leu Lys Leu Trp Asp 
305                 310                 315                 320 


Leu Arg Ser Lys Val Cys Ile Lys Thr Val Phe Ala Gly Ser Ser Cys 
                325                 330                 335     


Asn Asp Ile Val Cys Thr Glu Gln Cys Val Met Ser Gly His Phe Asp 
            340                 345                 350         


Lys Lys Ile Arg Phe Trp Asp Ile Arg Ser Glu Ser Ile Val Arg Glu 
        355                 360                 365             


Met Glu Leu Leu Gly Lys Ile Thr Ala Leu Asp Leu Asn Pro Glu Arg 
    370                 375                 380                 


Thr Glu Leu Leu Ser Cys Ser Arg Asp Asp Leu Leu Lys Val Ile Asp 
385                 390                 395                 400 


Leu Arg Thr Asn Ala Ile Lys Gln Thr Phe Ser Ala Pro Gly Phe Lys 
                405                 410                 415     


Cys Gly Ser Asp Trp Thr Arg Val Val Phe Ser Pro Asp Gly Ser Tyr 
            420                 425                 430         


Val Ala Ala Gly Ser Ala Glu Gly Ser Leu Tyr Ile Trp Ser Val Leu 
        435                 440                 445             


Thr Gly Lys Val Glu Lys Val Leu Ser Lys Gln His Ser Ser Ser Ile 
    450                 455                 460                 


Asn Ala Val Ala Trp Ser Pro Ser Gly Ser His Val Val Ser Val Asp 
465                 470                 475                 480 


Lys Gly Cys Lys Ala Val Leu Trp Ala Gln Tyr 
                485                 490     


<210>  104
<211>  4330
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human ATG12 mRNA variant 1 GenBank Accession No.: NM_004707.3
       GI:290560745

<400>  104
ggaaacattt ctaggtccga gccagcggcc taactggagt gtgcgcattc gaccgagcac       60

agacacgttg cccaccgctc ctctcccgag gtctgtagtc gcggagaaac acatgttgcg      120

ttactaacgt tcagaggtct gcgacagctt cgatttgaat gactagccgg gaacaccaag      180

tttcactgtg taattgcgtc cccctactcc ggcgcctcct ttgcgacgct ccctggagaa      240

aagcacgccc actgcacgcg ctcagtcgct acttccgctc tcgagtgtct ccaagcaaga      300

tggcggagga gccgcagtct gtgttgcagc ttcctacttc aattgctgct ggaggggaag      360

gacttacgga tgtctcccca gaaacaacca ccccggagcc cccgtcttcc gctgcagttt      420

ccccgggaac agaggaacct gctggcgaca ccaagaaaaa aattgacatt ttgctaaagg      480

ctgtgggaga cactcctatt atgaaaacaa agaagtgggc agtagagcga acacgaacca      540

tccaaggact cattgacttc atcaaaaagt ttcttaaact tgtggcctca gaacagttgt      600

ttatttatgt gaatcagtcc tttgctcctt ccccagacca agaagttgga actctctatg      660

agtgttttgg cagtgatggt aaactggttt tacattactg caagtctcag gcgtggggat      720

gaaccacaaa gaaaatcaac ttgctactac atgaaatgga ttttcacgga agagacagct      780

ctgaaaagtt ttgatgcttg tggcaagaga cttaacagat gtgatctatt tagtatgtgt      840

ctactctatg tttatgcata agaaaacatc catagcatga atggactcag aaaaatgtga      900

tttgtattaa tgcaccagtc atcataaaag atggtcatga tagtacaccc attgctccta      960

cttgttacta ttattgctgc agatctgcct ccaaggttga aaaggagact aagactgtat     1020

aaacatcttc attgtcagtt ctcaaaatga ctgaaattgt tttcatggta aaagttaata     1080

tactaaaggg ttcctttttt tttaatgttt acatttatct ctatgtttac ctttttagtc     1140

acattgacct gctggctgaa tacctcaaat agtccagtag agggcagtcc accaggcaga     1200

aaaggttagg cgttttggtt tcacatcttt gctggggaat aataggggaa atggctgttt     1260

ttgctaattt ttagctaata tctagccagg agagcaagca cataggacag actgaaagac     1320

tgtaatttta cacaatacac atggcttaat tattttattg ggatacagaa aaatataaat     1380

tctggacaaa taagtcatat acctgttttc agtcctaaca tttaaggatt cttgagtccc     1440

aatcacataa ctgtggtgtt actctgtcat ttatatggtg tcaaaagcac ttgatgagta     1500

aacccagtag catctttttg agtgtttcat aatgcatttt ccaacttgaa aacaataatt     1560

gaaaaatagc cttattgtat attttatgcc atgactaaaa gtgccatttt tactgatgct     1620

attagactga taatttcttg aagtgaaatt taaccttttt ttctctttag tattatgttt     1680

ataatgccat atttttagaa agcattccag atcaggcatg gtggcttaca cctgtaatcc     1740

cagcactttg gaaggctgag gtgtggggat tgcgtgaagc cacaagtttg agaccagcct     1800

ggttagcaag gcaagatccc caactctaca aaaaaataaa aattaaaaaa aaattattag     1860

gctgcagagg caagaggatc ccctgagccc agaagttcaa gggtatagtg agtcgtgatt     1920

gtaccactgc attcctgctg agcaacagag tgagacccca tctcaaaaaa gaaaaaaaaa     1980

ggcattctag taaatcgaat gtaatgtgaa tggaatttca aaacaggatc taagatggta     2040

tgtagtagaa ttcaaagtaa tatcatttta aagttaaatg agtatggaaa aggtctgttc     2100

tctagttttg tccagttcag tttactgaag gaatatattt aattatattc atatatttaa     2160

caaataaaaa tatgttgaat tttcgtattg tttgccactg agggttcaga tgatagacct     2220

caaaaaatcg aaaatactgg ttgaaatttg tagcatccat ttagttattc tttttgacct     2280

aaataactta atagtttatt aaatctaagg ttagctaaat atgtagctaa ccttatttgt     2340

tttctttcct aacaactctg aagaatacat aggactttgc actttttttt tttttttttt     2400

tttttaaaga gacagcgttt cggtcttgtt gcccaggctg gactgtaatg gcacaatctt     2460

ggctcattgc aacctctacc tcccgggttc aagtgattct cctgcctcag cctcctgagt     2520

agctgggatt acaggcacct gccaccacac ccgcctaatt ttttgttatt tttagtggag     2580

acagggtttt accattttgg ccaggctggt ctcaaactcc tgactgacct caggtgatcc     2640

acccgccttg gcctcccaaa gtgctgggat tataggtgtg agccaccgtg ccgggcctgg     2700

actttgtact ttttcaaatc atatttaatt atttcttgat ctttactcaa agaattattc     2760

tgcgctaaat ttgcaacatt aaacaataat taagcctgga tccgataagt tattgggtga     2820

ctagaaccta caacacaatt tatttttaat ttaaactctc aagtctgatt aaacttgggg     2880

gaaaaactga acattcaggt gatattttaa attttctttg tgactaaata gatgtcatta     2940

atatcatgct tcatcttact gtgcatagat gacccatggc acacaaagta tatgtataat     3000

attgcaatat tgtacttcca tctttaaaag tatgccaaag acttcataat gaattcacca     3060

tatttaagga ataaggcaaa ttaatgttta tccgaaatca aaatgagatt ggataattcc     3120

tggaatctga tttctgtaag tttcagtcct aaagggacct taaatatcta ctagaacgaa     3180

gtacagtgta tttctcagat tacatgacat tagtcaacca atttagttgt aacacaaagt     3240

gagaaagcct taggtgttga aggagtgtga aaataaaaat ctcttttcat gcagtttctg     3300

cttcattttc caaggattat aatatgctga gtaaactttt ggcactaagg aagccagcta     3360

caggccacgt aatgaaaact attcagaaaa cagttcagca aatactacta tttgaataca     3420

gttcaaatcg tatttatata aatactctgc ctacattatt taacccaaac tggattattc     3480

accattcttt gaagatgcct tgtgttttct gttatctact tctgctcgtg cagtttactt     3540

acaccttcac cctttcaaat cctaactctt cttcaaggcc tgattcagat tttaactttt     3600

taaaggctat ctgaatcatt caaggagaag ataccctttc tctcataaaa acacttagag     3660

caaactacca ctattaaatc acttattgca tactgcctca gagccatgtg tccagttatt     3720

ctggttttta tctggtaccc acacacacat aatacttagt acatttcctt atatgtaaat     3780

aattagtaat gtaacacatc gagtgaatga aattgtgatt agcaaagtat gacagctaaa     3840

aacaaaggaa attttgtata gtttaagttt ttgtacagca gaaggtaatg gaaaatattt     3900

gcaacatatg acagtgtttt taatgtctta agtatatgaa gaatttgttt caattggtaa     3960

gaaaagacca actgaaaaat aggcaaaaca taccaaattt cgattcacaa aaagatacaa     4020

atggccaaca aaacatgaaa caccttgaac cccattggtt acaaaaaata caaataaaaa     4080

caatgctgac tgacaataaa aataatgtac atcttgctat tagcagagtt gtggggaaat     4140

cctgtcatat tttggtgaat agaatatgag ttggtatatc acttttaggg ggccatttgg     4200

caatatcaaa aaccttgaaa cgtaagtata aacttcagtt tgttcgtata acctgtcatt     4260

cacatattga gtgcctacta tgtcctaaat attggaataa cactaataaa gctcataaga     4320

gactaataaa                                                            4330


<210>  105
<211>  140
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human ATG12 Isoform 1 polypeptide encoded by mRNA variant 1 
       GenBank Accession No.: NP_004698.3 GI:290560746

<400>  105

Met Ala Glu Glu Pro Gln Ser Val Leu Gln Leu Pro Thr Ser Ile Ala 
1               5                   10                  15      


Ala Gly Gly Glu Gly Leu Thr Asp Val Ser Pro Glu Thr Thr Thr Pro 
            20                  25                  30          


Glu Pro Pro Ser Ser Ala Ala Val Ser Pro Gly Thr Glu Glu Pro Ala 
        35                  40                  45              


Gly Asp Thr Lys Lys Lys Ile Asp Ile Leu Leu Lys Ala Val Gly Asp 
    50                  55                  60                  


Thr Pro Ile Met Lys Thr Lys Lys Trp Ala Val Glu Arg Thr Arg Thr 
65                  70                  75                  80  


Ile Gln Gly Leu Ile Asp Phe Ile Lys Lys Phe Leu Lys Leu Val Ala 
                85                  90                  95      


Ser Glu Gln Leu Phe Ile Tyr Val Asn Gln Ser Phe Ala Pro Ser Pro 
            100                 105                 110         


Asp Gln Glu Val Gly Thr Leu Tyr Glu Cys Phe Gly Ser Asp Gly Lys 
        115                 120                 125             


Leu Val Leu His Tyr Cys Lys Ser Gln Ala Trp Gly 
    130                 135                 140 


<210>  106
<211>  4193
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 12 (ATG12), transcript variant 5, 
       mRNA NCBI Reference Sequence: NM_001277783.1

<400>  106
ggaaacattt ctaggtccga gccagcggcc taactggagt gtgcgcattc gaccgagcac       60

agacacgttg cccaccgctc ctctcccgag gtctgtagtc gcggagaaac acatgttgcg      120

ttactaacgt tcagaggtct gcgacagctt cgatttgaat gactagccgg gaacaccaag      180

tttcactgtg taattgcgtc cccctactcc ggcgcctcct ttgcgacgct ccctggagaa      240

aagcacgccc actgcacgcg ctcagtcgct acttccgctc tcgagtgtct ccaagcaaga      300

tggcggagga gccgcagtct gtgttgcagc ttcctacttc aattgctgct ggaggggaag      360

gacttacgga tgtctcccca gaaacaacca ccccggagcc cccgtcttcc gctgcagttt      420

ccccgggaac agaggaacct gctggcgaca ccaagaaaaa aatttattta tgtgaatcag      480

tcctttgctc cttccccaga ccaagaagtt ggaactctct atgagtgttt tggcagtgat      540

ggtaaactgg ttttacatta ctgcaagtct caggcgtggg gatgaaccac aaagaaaatc      600

aacttgctac tacatgaaat ggattttcac ggaagagaca gctctgaaaa gttttgatgc      660

ttgtggcaag agacttaaca gatgtgatct atttagtatg tgtctactct atgtttatgc      720

ataagaaaac atccatagca tgaatggact cagaaaaatg tgatttgtat taatgcacca      780

gtcatcataa aagatggtca tgatagtaca cccattgctc ctacttgtta ctattattgc      840

tgcagatctg cctccaaggt tgaaaaggag actaagactg tataaacatc ttcattgtca      900

gttctcaaaa tgactgaaat tgttttcatg gtaaaagtta atatactaaa gggttccttt      960

ttttttaatg tttacattta tctctatgtt taccttttta gtcacattga cctgctggct     1020

gaatacctca aatagtccag tagagggcag tccaccaggc agaaaaggtt aggcgttttg     1080

gtttcacatc tttgctgggg aataataggg gaaatggctg tttttgctaa tttttagcta     1140

atatctagcc aggagagcaa gcacatagga cagactgaaa gactgtaatt ttacacaata     1200

cacatggctt aattatttta ttgggataca gaaaaatata aattctggac aaataagtca     1260

tatacctgtt ttcagtccta acatttaagg attcttgagt cccaatcaca taactgtggt     1320

gttactctgt catttatatg gtgtcaaaag cacttgatga gtaaacccag tagcatcttt     1380

ttgagtgttt cataatgcat tttccaactt gaaaacaata attgaaaaat agccttattg     1440

tatattttat gccatgacta aaagtgccat ttttactgat gctattagac tgataatttc     1500

ttgaagtgaa atttaacctt tttttctctt tagtattatg tttataatgc catattttta     1560

gaaagcattc cagatcaggc atggtggctt acacctgtaa tcccagcact ttggaaggct     1620

gaggtgtggg gattgcgtga agccacaagt ttgagaccag cctggttagc aaggcaagat     1680

ccccaactct acaaaaaaat aaaaattaaa aaaaaattat taggctgcag aggcaagagg     1740

atcccctgag cccagaagtt caagggtata gtgagtcgtg attgtaccac tgcattcctg     1800

ctgagcaaca gagtgagacc ccatctcaaa aaagaaaaaa aaaggcattc tagtaaatcg     1860

aatgtaatgt gaatggaatt tcaaaacagg atctaagatg gtatgtagta gaattcaaag     1920

taatatcatt ttaaagttaa atgagtatgg aaaaggtctg ttctctagtt ttgtccagtt     1980

cagtttactg aaggaatata tttaattata ttcatatatt taacaaataa aaatatgttg     2040

aattttcgta ttgtttgcca ctgagggttc agatgataga cctcaaaaaa tcgaaaatac     2100

tggttgaaat ttgtagcatc catttagtta ttctttttga cctaaataac ttaatagttt     2160

attaaatcta aggttagcta aatatgtagc taaccttatt tgttttcttt cctaacaact     2220

ctgaagaata cataggactt tgcacttttt tttttttttt ttttttttaa agagacagcg     2280

tttcggtctt gttgcccagg ctggactgta atggcacaat cttggctcat tgcaacctct     2340

acctcccggg ttcaagtgat tctcctgcct cagcctcctg agtagctggg attacaggca     2400

cctgccacca cacccgccta attttttgtt atttttagtg gagacagggt tttaccattt     2460

tggccaggct ggtctcaaac tcctgactga cctcaggtga tccacccgcc ttggcctccc     2520

aaagtgctgg gattataggt gtgagccacc gtgccgggcc tggactttgt actttttcaa     2580

atcatattta attatttctt gatctttact caaagaatta ttctgcgcta aatttgcaac     2640

attaaacaat aattaagcct ggatccgata agttattggg tgactagaac ctacaacaca     2700

atttattttt aatttaaact ctcaagtctg attaaacttg ggggaaaaac tgaacattca     2760

ggtgatattt taaattttct ttgtgactaa atagatgtca ttaatatcat gcttcatctt     2820

actgtgcata gatgacccat ggcacacaaa gtatatgtat aatattgcaa tattgtactt     2880

ccatctttaa aagtatgcca aagacttcat aatgaattca ccatatttaa ggaataaggc     2940

aaattaatgt ttatccgaaa tcaaaatgag attggataat tcctggaatc tgatttctgt     3000

aagtttcagt cctaaaggga ccttaaatat ctactagaac gaagtacagt gtatttctca     3060

gattacatga cattagtcaa ccaatttagt tgtaacacaa agtgagaaag ccttaggtgt     3120

tgaaggagtg tgaaaataaa aatctctttt catgcagttt ctgcttcatt ttccaaggat     3180

tataatatgc tgagtaaact tttggcacta aggaagccag ctacaggcca cgtaatgaaa     3240

actattcaga aaacagttca gcaaatacta ctatttgaat acagttcaaa tcgtatttat     3300

ataaatactc tgcctacatt atttaaccca aactggatta ttcaccattc tttgaagatg     3360

ccttgtgttt tctgttatct acttctgctc gtgcagttta cttacacctt caccctttca     3420

aatcctaact cttcttcaag gcctgattca gattttaact ttttaaaggc tatctgaatc     3480

attcaaggag aagataccct ttctctcata aaaacactta gagcaaacta ccactattaa     3540

atcacttatt gcatactgcc tcagagccat gtgtccagtt attctggttt ttatctggta     3600

cccacacaca cataatactt agtacatttc cttatatgta aataattagt aatgtaacac     3660

atcgagtgaa tgaaattgtg attagcaaag tatgacagct aaaaacaaag gaaattttgt     3720

atagtttaag tttttgtaca gcagaaggta atggaaaata tttgcaacat atgacagtgt     3780

ttttaatgtc ttaagtatat gaagaatttg tttcaattgg taagaaaaga ccaactgaaa     3840

aataggcaaa acataccaaa tttcgattca caaaaagata caaatggcca acaaaacatg     3900

aaacaccttg aaccccattg gttacaaaaa atacaaataa aaacaatgct gactgacaat     3960

aaaaataatg tacatcttgc tattagcaga gttgtgggga aatcctgtca tattttggtg     4020

aatagaatat gagttggtat atcactttta gggggccatt tggcaatatc aaaaaccttg     4080

aaacgtaagt ataaacttca gtttgttcgt ataacctgtc attcacatat tgagtgccta     4140

ctatgtccta aatattggaa taacactaat aaagctcata agagactaat aaa            4193


<210>  107
<211>  74
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 12 (ATG12), transcript variant 5, 
       polypeptide

<400>  107

Met Ala Glu Glu Pro Gln Ser Val Leu Gln Leu Pro Thr Ser Ile Ala 
1               5                   10                  15      


Ala Gly Gly Glu Gly Leu Thr Asp Val Ser Pro Glu Thr Thr Thr Pro 
            20                  25                  30          


Glu Pro Pro Ser Ser Ala Ala Val Ser Pro Gly Thr Glu Glu Pro Ala 
        35                  40                  45              


Gly Asp Thr Lys Lys Lys Ile Tyr Leu Cys Glu Ser Val Leu Cys Ser 
    50                  55                  60                  


Phe Pro Arg Pro Arg Ser Trp Asn Ser Leu 
65                  70                  


<210>  108
<211>  2432
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human ATG10 mRNA variant 3 GenBank Accession No.: NM_001131028.1
       GI:196162716

<400>  108
agtgcgcacg ctccgactcg gccgtggcgg acctgactga aggaggccgc ggacctgact       60

gaaggaggcc acggccactt ctggttggcc tcggggcgcg ctggctcggc tcttcctccg      120

ccctcgaggc ccccgcagtc ccatcattca gttccgtagg gtcaccggcg cggcagtggc      180

ctcgcagggc gctgggtccc tctccccagc tctcctcccc ctggccccgt cgccccgccc      240

tcgccgggct gggctgcggg gtcaggggcc gagcggagag ggtagagacg gggtttcacc      300

gtgttagcca agatggtctc gatctcctga cctcgtgatc cgcccgcctc ggcctcccaa      360

agtgctggga ttacaggcgt gagccactgc gcccggcctg ttgtacagtt attaaagtta      420

tcatttaaca tggaagaaga tgagttcatt ggagaaaaaa cattccaacg ttattgtgca      480

gaattcatta aacattcaca acagataggt gatagttggg aatggagacc atcaaaggac      540

tgttctgatg gctacatgtg caaaatacac tttcaaatta agaatgggtc tgtgatgtca      600

catctaggag catctaccca tggacagaca tgtcttccca tggaggaggc tttcgagcta      660

cccttggatg attgtgaagt gattgaaact gcagcagcgt ccgaagtgat taaatatgag      720

tatcatgtct tatattcctg tagctaccaa gtgcctgtac tttactttag ggcaagcttt      780

ttagatggga gacctttaac tctgaaggac atatgggaag gagttcatga gtgctataag      840

atgcgactgc tacagggacc atgggacact attacgcaac aggaacatcc aatacttggg      900

caaccctttt ttgtacttca tccctgcaag acgaatgaat tcatgactcc tgtattaaag      960

aattctcaga aaatcaataa gaatgtcaac tatatcacat catggctgag cattgtaggg     1020

ccagttgttg ggctgaatct acctctgagt tatgccaaag caacgtctca ggatgaacga     1080

aatgtccctt aacaagattc ttctattgag tttaggaatt gcggcacgaa gaatgccaag     1140

agtttacctg gccagccctg gctttaatag gactgatacc atggaatatt tcatctcacc     1200

aagatgtgac atggattatt tttcccttgg acacaaatgt ctacagcaac tggtgtttga     1260

taggctgaat gtttagaaga aacacttcaa agggatacat catggccagg catggtggct     1320

cacacctgta atccaagcac tttgggaggc caaggtggga gcatcacttg atcctgggag     1380

ttcgagacca gcctgggcaa catggtgaaa ccctgtcggt acaaaaaaat acaaaaattt     1440

gcctgtttat ggtggtgtgt tcctgtagtc ccagctcccc aggaggctga ggtgggaggt     1500

tggctttaac ccaggaggca gaggttgcag tgagctgaga ctgtgccact gcagtccagc     1560

ctgggtgaca gagccagaca ctgtctcggg gaaaaaaaaa aaaaaaaaaa gacacatcac     1620

tataaatagc aaaaaaacaa atctaactta ttaatactag gaataccaac attattaggg     1680

cacttgcagg ttattctttt ctaggccaag tacttcactt ccatttgtct gacatggaga     1740

ttgagggaga aatgtatttg tgtgttcatt ttaatgtaag atatataaaa attaaattac     1800

tggatttacc tgtccctgaa actggtgtta taaacatgac ctatcttaag tgattttccc     1860

acaatcaaac tcaggaacaa tagattattt ctgttttact ccaaaagaga gagagagagt     1920

gagtgtgagt gtgtgtgtgt gtgtgtgtgt atgtgtgggt gtttgtgtag atagttgtaa     1980

aacaaagaaa aaacacaata ttttactgtg agataatatg ttttaccagc aaagtgtggc     2040

atagtaatta gaagttttct aaaaagctat aggagatatt taaacattaa aatttctttt     2100

tgacctatag taataaaaca atggtcattt tacccctctg cttctcaacc ccacagctgc     2160

tctgctgtac tctttgaggg ctcttgagcg agtcttcatg tccctgagac ttattttcct     2220

catctttaat ttgaaactaa caagctacct catagggttg ctgtgagaac cacatgagat     2280

cattaatgca tgataagata ttgtaaagta ttatacgaat attcattaaa tgctcacctt     2340

tcttgtatat aattggtatt cactaaggct gtaaataagt ttcatagcca gttaagtatt     2400

aagataaacc taacctggaa caaaaaaaaa aa                                   2432


<210>  109
<211>  220
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human ATG10 polypeptide encoded by mRNA variant 3 GenBank 
       Accession No.: NP_001124500.1  GI:196162717

<400>  109

Met Glu Glu Asp Glu Phe Ile Gly Glu Lys Thr Phe Gln Arg Tyr Cys 
1               5                   10                  15      


Ala Glu Phe Ile Lys His Ser Gln Gln Ile Gly Asp Ser Trp Glu Trp 
            20                  25                  30          


Arg Pro Ser Lys Asp Cys Ser Asp Gly Tyr Met Cys Lys Ile His Phe 
        35                  40                  45              


Gln Ile Lys Asn Gly Ser Val Met Ser His Leu Gly Ala Ser Thr His 
    50                  55                  60                  


Gly Gln Thr Cys Leu Pro Met Glu Glu Ala Phe Glu Leu Pro Leu Asp 
65                  70                  75                  80  


Asp Cys Glu Val Ile Glu Thr Ala Ala Ala Ser Glu Val Ile Lys Tyr 
                85                  90                  95      


Glu Tyr His Val Leu Tyr Ser Cys Ser Tyr Gln Val Pro Val Leu Tyr 
            100                 105                 110         


Phe Arg Ala Ser Phe Leu Asp Gly Arg Pro Leu Thr Leu Lys Asp Ile 
        115                 120                 125             


Trp Glu Gly Val His Glu Cys Tyr Lys Met Arg Leu Leu Gln Gly Pro 
    130                 135                 140                 


Trp Asp Thr Ile Thr Gln Gln Glu His Pro Ile Leu Gly Gln Pro Phe 
145                 150                 155                 160 


Phe Val Leu His Pro Cys Lys Thr Asn Glu Phe Met Thr Pro Val Leu 
                165                 170                 175     


Lys Asn Ser Gln Lys Ile Asn Lys Asn Val Asn Tyr Ile Thr Ser Trp 
            180                 185                 190         


Leu Ser Ile Val Gly Pro Val Val Gly Leu Asn Leu Pro Leu Ser Tyr 
        195                 200                 205             


Ala Lys Ala Thr Ser Gln Asp Glu Arg Asn Val Pro 
    210                 215                 220 


<210>  110
<211>  2297
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 10 (ATG10), transcript variant 2, 
       mRNA, NCBI Reference Sequence: NM_031482.4

<400>  110
agtgcgcacg ctccgactcg gccgtggcgg acctgactga aggaggccgc ggacctgact       60

gaaggaggcc acggccactt ctggttggcc tcggggcgcg ctggctcggc tcttcctccg      120

ccctcgaggc ccccgcagtc ccatcattca gttccgtagg gtcaccggcg cggcagtggc      180

ctcgcagggc gctgggtccc tctccccagc tctcctcccc ctggccccgt cgccccgccc      240

tcgccgggct gggctgcggg gtcaggggcc gagcggagag ggttatcatt taacatggaa      300

gaagatgagt tcattggaga aaaaacattc caacgttatt gtgcagaatt cattaaacat      360

tcacaacaga taggtgatag ttgggaatgg agaccatcaa aggactgttc tgatggctac      420

atgtgcaaaa tacactttca aattaagaat gggtctgtga tgtcacatct aggagcatct      480

acccatggac agacatgtct tcccatggag gaggctttcg agctaccctt ggatgattgt      540

gaagtgattg aaactgcagc agcgtccgaa gtgattaaat atgagtatca tgtcttatat      600

tcctgtagct accaagtgcc tgtactttac tttagggcaa gctttttaga tgggagacct      660

ttaactctga aggacatatg ggaaggagtt catgagtgct ataagatgcg actgctacag      720

ggaccatggg acactattac gcaacaggaa catccaatac ttgggcaacc cttttttgta      780

cttcatccct gcaagacgaa tgaattcatg actcctgtat taaagaattc tcagaaaatc      840

aataagaatg tcaactatat cacatcatgg ctgagcattg tagggccagt tgttgggctg      900

aatctacctc tgagttatgc caaagcaacg tctcaggatg aacgaaatgt cccttaacaa      960

gattcttcta ttgagtttag gaattgcggc acgaagaatg ccaagagttt acctggccag     1020

ccctggcttt aataggactg ataccatgga atatttcatc tcaccaagat gtgacatgga     1080

ttatttttcc cttggacaca aatgtctaca gcaactggtg tttgataggc tgaatgttta     1140

gaagaaacac ttcaaaggga tacatcatgg ccaggcatgg tggctcacac ctgtaatcca     1200

agcactttgg gaggccaagg tgggagcatc acttgatcct gggagttcga gaccagcctg     1260

ggcaacatgg tgaaaccctg tcggtacaaa aaaatacaaa aatttgcctg tttatggtgg     1320

tgtgttcctg tagtcccagc tccccaggag gctgaggtgg gaggttggct ttaacccagg     1380

aggcagaggt tgcagtgagc tgagactgtg ccactgcagt ccagcctggg tgacagagcc     1440

agacactgtc tcggggaaaa aaaaaaaaaa aaaaagacac atcactataa atagcaaaaa     1500

aacaaatcta acttattaat actaggaata ccaacattat tagggcactt gcaggttatt     1560

cttttctagg ccaagtactt cacttccatt tgtctgacat ggagattgag ggagaaatgt     1620

atttgtgtgt tcattttaat gtaagatata taaaaattaa attactggat ttacctgtcc     1680

ctgaaactgg tgttataaac atgacctatc ttaagtgatt ttcccacaat caaactcagg     1740

aacaatagat tatttctgtt ttactccaaa agagagagag agagtgagtg tgagtgtgtg     1800

tgtgtgtgtg tgtgtatgtg tgggtgtttg tgtagatagt tgtaaaacaa agaaaaaaca     1860

caatatttta ctgtgagata atatgtttta ccagcaaagt gtggcatagt aattagaagt     1920

tttctaaaaa gctataggag atatttaaac attaaaattt ctttttgacc tatagtaata     1980

aaacaatggt cattttaccc ctctgcttct caaccccaca gctgctctgc tgtactcttt     2040

gagggctctt gagcgagtct tcatgtccct gagacttatt ttcctcatct ttaatttgaa     2100

actaacaagc tacctcatag ggttgctgtg agaaccacat gagatcatta atgcatgata     2160

agatattgta aagtattata cgaatattca ttaaatgctc acctttcttg tatataattg     2220

gtattcacta aggctgtaaa taagtttcat agccagttaa gtattaagat aaacctaacc     2280

tggaacaaaa aaaaaaa                                                    2297


<210>  111
<211>  220
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 10 (ATG10), transcript variant 2, 
       polypeptide, NCBI Reference Sequence: NM_031482.4

<400>  111

Met Glu Glu Asp Glu Phe Ile Gly Glu Lys Thr Phe Gln Arg Tyr Cys 
1               5                   10                  15      


Ala Glu Phe Ile Lys His Ser Gln Gln Ile Gly Asp Ser Trp Glu Trp 
            20                  25                  30          


Arg Pro Ser Lys Asp Cys Ser Asp Gly Tyr Met Cys Lys Ile His Phe 
        35                  40                  45              


Gln Ile Lys Asn Gly Ser Val Met Ser His Leu Gly Ala Ser Thr His 
    50                  55                  60                  


Gly Gln Thr Cys Leu Pro Met Glu Glu Ala Phe Glu Leu Pro Leu Asp 
65                  70                  75                  80  


Asp Cys Glu Val Ile Glu Thr Ala Ala Ala Ser Glu Val Ile Lys Tyr 
                85                  90                  95      


Glu Tyr His Val Leu Tyr Ser Cys Ser Tyr Gln Val Pro Val Leu Tyr 
            100                 105                 110         


Phe Arg Ala Ser Phe Leu Asp Gly Arg Pro Leu Thr Leu Lys Asp Ile 
        115                 120                 125             


Trp Glu Gly Val His Glu Cys Tyr Lys Met Arg Leu Leu Gln Gly Pro 
    130                 135                 140                 


Trp Asp Thr Ile Thr Gln Gln Glu His Pro Ile Leu Gly Gln Pro Phe 
145                 150                 155                 160 


Phe Val Leu His Pro Cys Lys Thr Asn Glu Phe Met Thr Pro Val Leu 
                165                 170                 175     


Lys Asn Ser Gln Lys Ile Asn Lys Asn Val Asn Tyr Ile Thr Ser Trp 
            180                 185                 190         


Leu Ser Ile Val Gly Pro Val Val Gly Leu Asn Leu Pro Leu Ser Tyr 
        195                 200                 205             


Ala Lys Ala Thr Ser Gln Asp Glu Arg Asn Val Pro 
    210                 215                 220 


<210>  112
<211>  1572
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human ATG3 mRNA variant 1 GenBank Accession No.: NM_022488.4  
       GI:523704483

<400>  112
tagtgtcacg tgaggccccg gtggcggcgc agctacggca agagagtgag aaggaaggga       60

agccggaagg ggcgcgagtg aagcaaagcg aggacagaca gctcgcagag ggcgaggggt      120

gcgtgtgcgt ccgcttctca cctcaggtct cccttcggcc ccgctgccct ccctcgcggc      180

tgggtgacag ctgggtccgg tccgtcgcgg gctgcctggg gtgcgaggat cgcgcacccc      240

gtcttcgcgc gctgtgcctg ccgccccgcc ccctcgtccc gcccgtcccg tcgcgtcgcg      300

tcccgtcccc tcgggtgctg ccagccgggt gctgatgcga gtcggtggca gcgaggacat      360

tttctgactc cctggcccct gacacggctg cactttccat cccgtcgcgg ggccggccgc      420

tactccggcc ccaggatgca gaatgtgatt aatactgtga agggaaaggc actggaagtg      480

gctgagtacc tgaccccggt cctcaaggaa tcaaagttta aggaaacagg tgtaattacc      540

ccagaagagt ttgtggcagc tggagatcac ctagtccacc actgtccaac atggcaatgg      600

gctacagggg aagaattgaa agtgaaggca tacctaccaa caggcaaaca atttttggta      660

accaaaaatg tgccgtgcta taagcggtgc aaacagatgg aatattcaga tgaattggaa      720

gctatcattg aagaagatga tggtgatggc ggatgggtag atacatatca caacacaggt      780

attacaggaa taacggaagc cgttaaagag atcacactgg aaaataagga caatataagg      840

cttcaagatt gctcagcact atgtgaagag gaagaagatg aagatgaagg agaagctgca      900

gatatggaag aatatgaaga gagtggattg ttggaaacag atgaggctac cctagataca      960

aggaaaatag tagaagcttg taaagccaaa actgatgctg gcggtgaaga tgctattttg     1020

caaaccagaa cttatgacct ttacatcact tatgataaat attaccagac tccacgatta     1080

tggttgtttg gctatgatga gcaacggcag cctttaacag ttgagcacat gtatgaagac     1140

atcagtcagg atcatgtgaa gaaaacagtg accattgaaa atcaccctca tctgccacca     1200

cctcccatgt gttcagttca cccatgcagg catgctgagg tgatgaagaa aatcattgag     1260

actgttgcag aaggaggggg agaacttgga gttcatatgt atcttcttat tttcttgaaa     1320

tttgtacaag ctgtcattcc aacaatagaa tatgactaca caagacactt cacaatgtaa     1380

tgaagagagc ataaaatcta tcctaattat tggttctgat ttttaaagaa ttaacccata     1440

gatgtgacca ttgaccatat tcatcaatat atacagtttc tctaataagg gacttatatg     1500

tttatgcatt aaataaaaat atgttccact accagcctta cttgtttaat aaaaatcagt     1560

gcaaagagag tt                                                         1572


<210>  113
<211>  314
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human ATG3 Isoform 1 polypeptide encoded by mRNA variant 1 
       GenBank Accession No.: NP_071933.2  GI:19526773

<400>  113

Met Gln Asn Val Ile Asn Thr Val Lys Gly Lys Ala Leu Glu Val Ala 
1               5                   10                  15      


Glu Tyr Leu Thr Pro Val Leu Lys Glu Ser Lys Phe Lys Glu Thr Gly 
            20                  25                  30          


Val Ile Thr Pro Glu Glu Phe Val Ala Ala Gly Asp His Leu Val His 
        35                  40                  45              


His Cys Pro Thr Trp Gln Trp Ala Thr Gly Glu Glu Leu Lys Val Lys 
    50                  55                  60                  


Ala Tyr Leu Pro Thr Gly Lys Gln Phe Leu Val Thr Lys Asn Val Pro 
65                  70                  75                  80  


Cys Tyr Lys Arg Cys Lys Gln Met Glu Tyr Ser Asp Glu Leu Glu Ala 
                85                  90                  95      


Ile Ile Glu Glu Asp Asp Gly Asp Gly Gly Trp Val Asp Thr Tyr His 
            100                 105                 110         


Asn Thr Gly Ile Thr Gly Ile Thr Glu Ala Val Lys Glu Ile Thr Leu 
        115                 120                 125             


Glu Asn Lys Asp Asn Ile Arg Leu Gln Asp Cys Ser Ala Leu Cys Glu 
    130                 135                 140                 


Glu Glu Glu Asp Glu Asp Glu Gly Glu Ala Ala Asp Met Glu Glu Tyr 
145                 150                 155                 160 


Glu Glu Ser Gly Leu Leu Glu Thr Asp Glu Ala Thr Leu Asp Thr Arg 
                165                 170                 175     


Lys Ile Val Glu Ala Cys Lys Ala Lys Thr Asp Ala Gly Gly Glu Asp 
            180                 185                 190         


Ala Ile Leu Gln Thr Arg Thr Tyr Asp Leu Tyr Ile Thr Tyr Asp Lys 
        195                 200                 205             


Tyr Tyr Gln Thr Pro Arg Leu Trp Leu Phe Gly Tyr Asp Glu Gln Arg 
    210                 215                 220                 


Gln Pro Leu Thr Val Glu His Met Tyr Glu Asp Ile Ser Gln Asp His 
225                 230                 235                 240 


Val Lys Lys Thr Val Thr Ile Glu Asn His Pro His Leu Pro Pro Pro 
                245                 250                 255     


Pro Met Cys Ser Val His Pro Cys Arg His Ala Glu Val Met Lys Lys 
            260                 265                 270         


Ile Ile Glu Thr Val Ala Glu Gly Gly Gly Glu Leu Gly Val His Met 
        275                 280                 285             


Tyr Leu Leu Ile Phe Leu Lys Phe Val Gln Ala Val Ile Pro Thr Ile 
    290                 295                 300                 


Glu Tyr Asp Tyr Thr Arg His Phe Thr Met 
305                 310                 


<210>  114
<211>  3060
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human ATG3 mRNA variant 2 GenBank Accession No.: NM_001278712.1 
       GI:523704486

<400>  114
tagtgtcacg tgaggccccg gtggcggcgc agctacggca agagagtgag aaggaaggga       60

agccggaagg ggcgcgagtg aagcaaagcg aggacagaca gctcgcagag ggcgaggggt      120

gcgtgtgcgt ccgcttctca cctcaggtct cccttcggcc ccgctgccct ccctcgcggc      180

tgggtgacag ctgggtccgg tccgtcgcgg gctgcctggg gtgcgaggat cgcgcacccc      240

gtcttcgcgc gctgtgcctg ccgccccgcc ccctcgtccc gcccgtcccg tcgcgtcgcg      300

tcccgtcccc tcgggtgctg ccagccgggt gctgatgcga gtcggtggca gcgaggacat      360

tttctgactc cctggcccct gacacggctg cactttccat cccgtcgcgg ggccggccgc      420

tactccggcc ccaggatgca gaatgtgatt aatactgtga agggaaaggc actggaagtg      480

gctgagtacc tgaccccggt cctcaaggaa tcaaagttta aggaaacagg tgtaattacc      540

ccagaagagt ttgtggcagc tggagatcac ctagtccacc actgtccaac atggcaatgg      600

gctacagggg aagaattgaa agtgaaggca tacctaccaa caggcaaaca atttttggta      660

accaaaaatg tgccgtgcta taagcggtgc aaacagatgg aatattcaga tgaattggaa      720

gctatcattg aagaagatga tggtgatggc ggatgggtag atacatatca caacacaggt      780

attacaggaa taacggaagc cgttaaagag atcacactgg aaaataagga caatataagg      840

cttcaagatt gctcagcact atgtgaagag gaagaagatg aagatgaagg agaagctgca      900

gatatggaag aatatgaaga gagtggattg ttggaaacag atgaggctac cctagataca      960

aggaaaatag tagaagcttg taaagccaaa actgatgctg gcggtgaaga tgctattttg     1020

caaaccagaa cttatgacct ttacatcact tatgataaat attaccagac tccacgatta     1080

tggttgtttg gctatgatga gcaacggcag cctttaacag ttgagcacat gtatgaagac     1140

atcagtcagg atcatgtgaa gaaaacagtg accattgaaa atcaccctca tctgccacca     1200

cctcccatgt gttcagttca cccatgcagg catgctgagg tgatgaagaa aatcattgag     1260

actgttgcag aaggaggggg agaacttgga gttcatatgt atccttccct gtatgtaaga     1320

ttagtggcaa aatggctgtt aacgattttt tttttgagaa atttagtgta accttaatga     1380

tacagtagct ataacttgtt ttaacttctc atctgaaaat cctagtagaa gtctttcatt     1440

tatttagaaa atgttatatt gtgatagtta tgccttgctt agttgattag aacatgtccc     1500

catgtgtagc agtagctcac ccttgttaca gaggtaacta ctaatttaga ataataatat     1560

gagtatcttt ttcttatgta catttccagt ttaacatgtt accaaaaatg tactttccca     1620

tcaggtaaaa gtaaaattag gttgtgtcaa tctgaggtgt ctctgaagca aataattcag     1680

cttaaacaaa tacgttaatt aaaagtaatg tactacgtga agtgctttag atattcttaa     1740

aatatagtac gttcattttt ctcaagacag tagcaccttt cttgtatctt actatagttt     1800

tcctactcag ttttcttgga ttgacaaaag taatacattc aagttaatgc ccaagcaaac     1860

tgttggatcc taaagtacat gtactcatag catagattta tcaaaaacag tcaaaatcaa     1920

tgatttacat gccagatagc ttgaattagt ttgatgctag gttaatactt gacttggtgg     1980

tggtttcggc tactatgtgt tagattagat gtggagacta agatgccacg tctaatcaga     2040

ctgtggggca ttgctaataa agtgtgtccc taagaagaac tttgaactga tttgatattg     2100

aaacaaacca aaaacttcca gtccaaacca tatttactcc tgaagtccta ttatgcctaa     2160

ctggcacatc tctgctggat gctaggtagc aacttggaag actgtctata tataattcat     2220

aatagtaatt ggtttccccc aatgaaaagt aagctcctaa tgtttttaag tgcaatgaac     2280

ataaaaaaat ttcagaattt aattaaagac agtccagtag agctcagtct ttggcaaata     2340

catacactaa cctagaatca gtctactata caggattttt atttaatttt tttctctttt     2400

gctatagtac ttcattatag tctattctgc ctcattttct gacataactc tagcaaactt     2460

aaaggagtct agtaaatgag tacattttgt actcatttaa acatgtatag ctttgtcctt     2520

tttatgtcaa agtaattgtg cttatatgaa cacctactca ttaagaagtt tcaaattcaa     2580

taatcgaatg agtggtcagg tagtcttaaa gagcctcatg ttaaatagac acaaatttgc     2640

atagttgaat tctttaatag acttaattta agattttgtg gggttttttt gagaaattaa     2700

tggcttaata aaatggataa ttacaacatg ggcttaaaac atagaactat ttacaaatct     2760

tattttcctt agctaaaaac ctttaggtat cttcttattt tcttgaaatt tgtacaagct     2820

gtcattccaa caatagaata tgactacaca agacacttca caatgtaatg aagagagcat     2880

aaaatctatc ctaattattg gttctgattt ttaaagaatt aacccataga tgtgaccatt     2940

gaccatattc atcaatatat acagtttctc taataaggga cttatatgtt tatgcattaa     3000

ataaaaatat gttccactac cagccttact tgtttaataa aaatcagtgc aaagagagtt     3060


<210>  115
<211>  311
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human ATG3 Isoform 2 polypeptide encoded by mRNA variant 2 
       GenBank Accession No.: NP_001265641.1  GI:523704487

<400>  115

Met Gln Asn Val Ile Asn Thr Val Lys Gly Lys Ala Leu Glu Val Ala 
1               5                   10                  15      


Glu Tyr Leu Thr Pro Val Leu Lys Glu Ser Lys Phe Lys Glu Thr Gly 
            20                  25                  30          


Val Ile Thr Pro Glu Glu Phe Val Ala Ala Gly Asp His Leu Val His 
        35                  40                  45              


His Cys Pro Thr Trp Gln Trp Ala Thr Gly Glu Glu Leu Lys Val Lys 
    50                  55                  60                  


Ala Tyr Leu Pro Thr Gly Lys Gln Phe Leu Val Thr Lys Asn Val Pro 
65                  70                  75                  80  


Cys Tyr Lys Arg Cys Lys Gln Met Glu Tyr Ser Asp Glu Leu Glu Ala 
                85                  90                  95      


Ile Ile Glu Glu Asp Asp Gly Asp Gly Gly Trp Val Asp Thr Tyr His 
            100                 105                 110         


Asn Thr Gly Ile Thr Gly Ile Thr Glu Ala Val Lys Glu Ile Thr Leu 
        115                 120                 125             


Glu Asn Lys Asp Asn Ile Arg Leu Gln Asp Cys Ser Ala Leu Cys Glu 
    130                 135                 140                 


Glu Glu Glu Asp Glu Asp Glu Gly Glu Ala Ala Asp Met Glu Glu Tyr 
145                 150                 155                 160 


Glu Glu Ser Gly Leu Leu Glu Thr Asp Glu Ala Thr Leu Asp Thr Arg 
                165                 170                 175     


Lys Ile Val Glu Ala Cys Lys Ala Lys Thr Asp Ala Gly Gly Glu Asp 
            180                 185                 190         


Ala Ile Leu Gln Thr Arg Thr Tyr Asp Leu Tyr Ile Thr Tyr Asp Lys 
        195                 200                 205             


Tyr Tyr Gln Thr Pro Arg Leu Trp Leu Phe Gly Tyr Asp Glu Gln Arg 
    210                 215                 220                 


Gln Pro Leu Thr Val Glu His Met Tyr Glu Asp Ile Ser Gln Asp His 
225                 230                 235                 240 


Val Lys Lys Thr Val Thr Ile Glu Asn His Pro His Leu Pro Pro Pro 
                245                 250                 255     


Pro Met Cys Ser Val His Pro Cys Arg His Ala Glu Val Met Lys Lys 
            260                 265                 270         


Ile Ile Glu Thr Val Ala Glu Gly Gly Gly Glu Leu Gly Val His Met 
        275                 280                 285             


Tyr Pro Ser Leu Tyr Val Arg Leu Val Ala Lys Trp Leu Leu Thr Ile 
    290                 295                 300                 


Phe Phe Leu Arg Asn Leu Val 
305                 310     


<210>  116
<211>  2326
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ATG4A mRNA variant 1 GenBank Accession No.: NM_052936.3  
       GI:313760675

<400>  116
gagttgcagt ttgagagcag ttccgggcag ggaggcgcct ttgctgccct cacagacttg       60

gcccctagca gtgcagaact acaagtccca gggatcctag cgaccgtccg tccgtagtca      120

agttgccggt ggaattggcc caggatgaca gctggagaat ggagtcagtt ttatccaagt      180

atgaagatca gattactatt ttcactgact acctagaaga atatccagat acagatgagc      240

tggtatggat cttagggaag cagcatctcc ttaaaacaga aaaatctaag ctgttgtctg      300

atataagtgc tcgtctatgg tttacataca gaaggaaatt ttcaccaatt ggtggaacgg      360

gcccttcatc agatgctggt tggggatgta tgctacgctg tggacagatg atgctggctc      420

aagcccttat ctgtagacac ttgggaaggg actggagctg ggagaaacaa aaagaacaac      480

ccaaagaata ccaacgcatc ctacagtgct tcttagatag aaaagattgt tgctactcta      540

tccatcaaat ggcacaaatg ggtgtaggag aagggaaatc aattggagaa tggtttggac      600

caaatacagt tgcacaggtg ttaaaaaaac ttgctttatt tgacgaatgg aattccttgg      660

ctgtttatgt ttcaatggat aacacagtgg tcattgaaga tatcaaaaaa atgtgccgtg      720

tccttccctt gagtgctgac acagctggtg acaggcctcc cgattcttta actgcttcaa      780

accagagtaa gggcacctct gcctactgct cagcctggaa acccctgctg ctcattgtgc      840

cccttcgcct gggcataaac caaatcaatc ctgtctatgt tgatgcattc aaagagtgtt      900

ttaagatgcc acagtcttta ggggcattag gaggaaaacc aaataacgcg tattatttca      960

taggattctt aggtgacgag ctcatcttct tggaccctca tacaacccag acctttgttg     1020

acactgaaga gaatggaacg gttaatgacc agactttcca ttgcctgcag tccccacagc     1080

gaatgaacat cctaaacctg gatccttcag ttgcattggg atttttctgc aaagaagaaa     1140

aagactttga taactggtgt agccttgttc agaaggaaat tctaaaggag aatttaagga     1200

tgtttgaatt agttcagaaa catccatcac actggcctcc ctttgtacct ccagccaagc     1260

cagaagtgac aaccactggg gcagaattca ttgactctac tgagcaactg gaggagtttg     1320

atctggagga agattttgag attctgagtg tgtagaatcc tgggaactca acttgaaggt     1380

ctgtcttcca tctggcacca taaaaacatg aacttattgc ataaaacttt tctagtcagc     1440

aagtgcctga tatgccaata gcatacaaac tcaatagcaa tcatgactga gccaatcact     1500

gtttctcaga aaaacaaaac aaaacaaaac aaatgacagt aacccttccc cggaaagaaa     1560

tagaacaatc atggagccta ggagcagaga gatgaggagg agttcattgc ttcccagctt     1620

gtgttatatg gctacagcaa gtcttcagct gctgcaatga ggaaatgggc atctggaaga     1680

caaacagcaa ctctcagctt gcttcaagaa ccagcagata agagatggtt aagctgttct     1740

tcaccctttc agatgtgacc tcttttggac taagcagcaa tctgttctct tgctcaaata     1800

ataaagtgac tgaatcaggg aggaaaaggt tcttgttaaa ttatttgatt gtgtagttga     1860

agtaattata atttatatca aaacgtttgt caaagaaacg atgtcaaata tacacttctt     1920

gatctccctt ctgtttgcgg ggatcttact atttgatggg tcactgtccc cattcttact     1980

gatacttttg tcagatatca ccctgtcctt aaatcatgat cacttaaatc aggggtcagc     2040

aaactttttc tgtaaagggc cagacgggaa atattttggg ctttgcaggc catgcggcct     2100

ctgtcacatc tactcaactc tgctgttgac atgcaaaagc agcaatagac aatatgcgtg     2160

taaatgagtg tggctgtaat ccaagaaaac tttatttaca aaagcaggtg gagggctggg     2220

tttggcctgc aggctgtagc ttgccaatca gtgacttaaa ttgttgattt ttgtttgata     2280

aattaaaaat aaattgtgtt tgaagtatac cctaaaaaaa aaaaaa                    2326


<210>  117
<211>  398
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  ATG4A isoform A encoded by mRNA variant 1 GenBank Accession No.: 
       NP_443168.2  GI:30795252

<400>  117

Met Glu Ser Val Leu Ser Lys Tyr Glu Asp Gln Ile Thr Ile Phe Thr 
1               5                   10                  15      


Asp Tyr Leu Glu Glu Tyr Pro Asp Thr Asp Glu Leu Val Trp Ile Leu 
            20                  25                  30          


Gly Lys Gln His Leu Leu Lys Thr Glu Lys Ser Lys Leu Leu Ser Asp 
        35                  40                  45              


Ile Ser Ala Arg Leu Trp Phe Thr Tyr Arg Arg Lys Phe Ser Pro Ile 
    50                  55                  60                  


Gly Gly Thr Gly Pro Ser Ser Asp Ala Gly Trp Gly Cys Met Leu Arg 
65                  70                  75                  80  


Cys Gly Gln Met Met Leu Ala Gln Ala Leu Ile Cys Arg His Leu Gly 
                85                  90                  95      


Arg Asp Trp Ser Trp Glu Lys Gln Lys Glu Gln Pro Lys Glu Tyr Gln 
            100                 105                 110         


Arg Ile Leu Gln Cys Phe Leu Asp Arg Lys Asp Cys Cys Tyr Ser Ile 
        115                 120                 125             


His Gln Met Ala Gln Met Gly Val Gly Glu Gly Lys Ser Ile Gly Glu 
    130                 135                 140                 


Trp Phe Gly Pro Asn Thr Val Ala Gln Val Leu Lys Lys Leu Ala Leu 
145                 150                 155                 160 


Phe Asp Glu Trp Asn Ser Leu Ala Val Tyr Val Ser Met Asp Asn Thr 
                165                 170                 175     


Val Val Ile Glu Asp Ile Lys Lys Met Cys Arg Val Leu Pro Leu Ser 
            180                 185                 190         


Ala Asp Thr Ala Gly Asp Arg Pro Pro Asp Ser Leu Thr Ala Ser Asn 
        195                 200                 205             


Gln Ser Lys Gly Thr Ser Ala Tyr Cys Ser Ala Trp Lys Pro Leu Leu 
    210                 215                 220                 


Leu Ile Val Pro Leu Arg Leu Gly Ile Asn Gln Ile Asn Pro Val Tyr 
225                 230                 235                 240 


Val Asp Ala Phe Lys Glu Cys Phe Lys Met Pro Gln Ser Leu Gly Ala 
                245                 250                 255     


Leu Gly Gly Lys Pro Asn Asn Ala Tyr Tyr Phe Ile Gly Phe Leu Gly 
            260                 265                 270         


Asp Glu Leu Ile Phe Leu Asp Pro His Thr Thr Gln Thr Phe Val Asp 
        275                 280                 285             


Thr Glu Glu Asn Gly Thr Val Asn Asp Gln Thr Phe His Cys Leu Gln 
    290                 295                 300                 


Ser Pro Gln Arg Met Asn Ile Leu Asn Leu Asp Pro Ser Val Ala Leu 
305                 310                 315                 320 


Gly Phe Phe Cys Lys Glu Glu Lys Asp Phe Asp Asn Trp Cys Ser Leu 
                325                 330                 335     


Val Gln Lys Glu Ile Leu Lys Glu Asn Leu Arg Met Phe Glu Leu Val 
            340                 345                 350         


Gln Lys His Pro Ser His Trp Pro Pro Phe Val Pro Pro Ala Lys Pro 
        355                 360                 365             


Glu Val Thr Thr Thr Gly Ala Glu Phe Ile Asp Ser Thr Glu Gln Leu 
    370                 375                 380                 


Glu Glu Phe Asp Leu Glu Glu Asp Phe Glu Ile Leu Ser Val 
385                 390                 395             


<210>  118
<211>  2140
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 4A, cysteine peptidase (ATG4A), 
       transcript variant 2, mRNA

<400>  118
gagttgcagt ttgagagcag ttccgggcag ggaggcgcct ttgctgccct cacagacttg       60

gcccctagca gtgcagaact acaagtccca gggatcctag cgaccgtccg tccgtagtca      120

agttgccggt ggaattggcc caggatgaca gctggagaat ggagtcagtt ttatccaagt      180

atgaagatca gattactatt ttcactgact acctagaaga atatccagat acagatgagc      240

tggtatggat cttagggaag cagcatctcc ttaaaacaga aaaatctaag ctgttgtctg      300

atataagtgc tcgtctatgg tttacataca gaaggaaatt ttcaccaatt ggtggaacgg      360

gcccttcatc agatgctggt tggggatgta tgctacgctg tggacagatg atgctggctc      420

aagcccttat ctgtagacac ttgggaaggg actggagctg ggagaaacaa aaagaacaac      480

ccaaagaata ccaacgcatc ctacagtgct tcttagatag aaaagattgt tgctactcta      540

tccatcaaat ggcacaaatg ggtgtaggag aagggaaatc aattggagaa tggtttggac      600

caaatacagt tgcacaggtg ttaaaaaaac ttgctttatt tgacgaatgg aattccttgg      660

ctgtttatgt ttcaatggat aacacagtgg tcattgaaga tatcaaaaaa atgtgccgtg      720

tccttccctt gagtgctgac acagctggtg acaggcctcc cgattcttta actgcttcaa      780

accagagtga cgagctcatc ttcttggacc ctcatacaac ccagaccttt gttgacactg      840

aagagaatgg aacggttaat gaccagactt tccattgcct gcagtcccca cagcgaatga      900

acatcctaaa cctggatcct tcagttgcat tgggattttt ctgcaaagaa gaaaaagact      960

ttgataactg gtgtagcctt gttcagaagg aaattctaaa ggagaattta aggatgtttg     1020

aattagttca gaaacatcca tcacactggc ctccctttgt acctccagcc aagccagaag     1080

tgacaaccac tggggcagaa ttcattgact ctactgagca actggaggag tttgatctgg     1140

aggaagattt tgagattctg agtgtgtaga atcctgggaa ctcaacttga aggtctgtct     1200

tccatctggc accataaaaa catgaactta ttgcataaaa cttttctagt cagcaagtgc     1260

ctgatatgcc aatagcatac aaactcaata gcaatcatga ctgagccaat cactgtttct     1320

cagaaaaaca aaacaaaaca aaacaaatga cagtaaccct tccccggaaa gaaatagaac     1380

aatcatggag cctaggagca gagagatgag gaggagttca ttgcttccca gcttgtgtta     1440

tatggctaca gcaagtcttc agctgctgca atgaggaaat gggcatctgg aagacaaaca     1500

gcaactctca gcttgcttca agaaccagca gataagagat ggttaagctg ttcttcaccc     1560

tttcagatgt gacctctttt ggactaagca gcaatctgtt ctcttgctca aataataaag     1620

tgactgaatc agggaggaaa aggttcttgt taaattattt gattgtgtag ttgaagtaat     1680

tataatttat atcaaaacgt ttgtcaaaga aacgatgtca aatatacact tcttgatctc     1740

ccttctgttt gcggggatct tactatttga tgggtcactg tccccattct tactgatact     1800

tttgtcagat atcaccctgt ccttaaatca tgatcactta aatcaggggt cagcaaactt     1860

tttctgtaaa gggccagacg ggaaatattt tgggctttgc aggccatgcg gcctctgtca     1920

catctactca actctgctgt tgacatgcaa aagcagcaat agacaatatg cgtgtaaatg     1980

agtgtggctg taatccaaga aaactttatt tacaaaagca ggtggagggc tgggtttggc     2040

ctgcaggctg tagcttgcca atcagtgact taaattgttg atttttgttt gataaattaa     2100

aaataaattg tgtttgaagt ataccctaaa aaaaaaaaaa                           2140


<210>  119
<211>  336
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 4A, cysteine peptidase (ATG4A), 
       polypeptide encoded by transcript variant 2 mRNA, NCBI Reference 
       Sequence: NM_178270.2

<400>  119

Met Glu Ser Val Leu Ser Lys Tyr Glu Asp Gln Ile Thr Ile Phe Thr 
1               5                   10                  15      


Asp Tyr Leu Glu Glu Tyr Pro Asp Thr Asp Glu Leu Val Trp Ile Leu 
            20                  25                  30          


Gly Lys Gln His Leu Leu Lys Thr Glu Lys Ser Lys Leu Leu Ser Asp 
        35                  40                  45              


Ile Ser Ala Arg Leu Trp Phe Thr Tyr Arg Arg Lys Phe Ser Pro Ile 
    50                  55                  60                  


Gly Gly Thr Gly Pro Ser Ser Asp Ala Gly Trp Gly Cys Met Leu Arg 
65                  70                  75                  80  


Cys Gly Gln Met Met Leu Ala Gln Ala Leu Ile Cys Arg His Leu Gly 
                85                  90                  95      


Arg Asp Trp Ser Trp Glu Lys Gln Lys Glu Gln Pro Lys Glu Tyr Gln 
            100                 105                 110         


Arg Ile Leu Gln Cys Phe Leu Asp Arg Lys Asp Cys Cys Tyr Ser Ile 
        115                 120                 125             


His Gln Met Ala Gln Met Gly Val Gly Glu Gly Lys Ser Ile Gly Glu 
    130                 135                 140                 


Trp Phe Gly Pro Asn Thr Val Ala Gln Val Leu Lys Lys Leu Ala Leu 
145                 150                 155                 160 


Phe Asp Glu Trp Asn Ser Leu Ala Val Tyr Val Ser Met Asp Asn Thr 
                165                 170                 175     


Val Val Ile Glu Asp Ile Lys Lys Met Cys Arg Val Leu Pro Leu Ser 
            180                 185                 190         


Ala Asp Thr Ala Gly Asp Arg Pro Pro Asp Ser Leu Thr Ala Ser Asn 
        195                 200                 205             


Gln Ser Asp Glu Leu Ile Phe Leu Asp Pro His Thr Thr Gln Thr Phe 
    210                 215                 220                 


Val Asp Thr Glu Glu Asn Gly Thr Val Asn Asp Gln Thr Phe His Cys 
225                 230                 235                 240 


Leu Gln Ser Pro Gln Arg Met Asn Ile Leu Asn Leu Asp Pro Ser Val 
                245                 250                 255     


Ala Leu Gly Phe Phe Cys Lys Glu Glu Lys Asp Phe Asp Asn Trp Cys 
            260                 265                 270         


Ser Leu Val Gln Lys Glu Ile Leu Lys Glu Asn Leu Arg Met Phe Glu 
        275                 280                 285             


Leu Val Gln Lys His Pro Ser His Trp Pro Pro Phe Val Pro Pro Ala 
    290                 295                 300                 


Lys Pro Glu Val Thr Thr Thr Gly Ala Glu Phe Ile Asp Ser Thr Glu 
305                 310                 315                 320 


Gln Leu Glu Glu Phe Asp Leu Glu Glu Asp Phe Glu Ile Leu Ser Val 
                325                 330                 335     


<210>  120
<211>  2892
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ATG4B mRNA variant 1 GenBank Accession No.: NM_013325.4  
       GI:47132610

<400>  120
gcgccggcac acctattggc ccccgcggcg tcccgtcgcc gcgtcgcgtt gctggcccgt       60

cggagcgacg ccgctcgggt cagtcggcgg ccggactggg aagatggacg cagctactct      120

gacctacgac actctccggt ttgctgagtt tgaagatttt cctgagacct cagagcccgt      180

ttggatactg ggtagaaaat acagcatttt cacagaaaag gacgagatct tgtctgatgt      240

ggcatctaga ctttggttta catacaggaa aaactttcca gccattgggg ggacaggccc      300

cacctcggac acaggctggg gctgcatgct gcggtgtgga cagatgatct ttgcccaagc      360

cctggtgtgc cggcacctag gccgagattg gaggtggaca caaaggaaga ggcagccaga      420

cagctacttc agcgtcctca acgcattcat cgacaggaag gacagttact actccattca      480

ccagatagcg caaatgggag ttggcgaagg caagtccata ggccagtggt acgggcccaa      540

cactgtcgcc caggtcctga agaagcttgc tgtcttcgat acgtggagct ccttggcggt      600

ccacattgca atggacaaca ctgttgtgat ggaggaaatc agaaggttgt gcaggaccag      660

cgttccctgt gcaggcgcca ctgcgtttcc tgcagattcc gaccggcact gcaacggatt      720

ccctgccgga gctgaggtca ccaacaggcc gtcgccatgg agacccctgg tacttctcat      780

tcccctgcgc ctggggctca cggacatcaa cgaggcctac gtggagacgc tgaagcactg      840

cttcatgatg ccccagtccc tgggcgtcat cggagggaag cccaacagcg cccactactt      900

catcggctac gttggtgagg agctcatcta cctggacccc cacaccacgc agccagccgt      960

ggagcccact gatggctgct tcatcccgga cgagagcttc cactgccagc acccgccgtg     1020

ccgcatgagc atcgcggagc ttgacccgtc catcgctgtg gggtttttct gtaagactga     1080

agatgacttc aatgattggt gccagcaagt caaaaagctg tctctgcttg gaggtgccct     1140

gcccatgttt gagctggtgg agctgcagcc ttcacatctg gcctgccccg acgtcctgaa     1200

cctgtcccta gattcttctg atgtagagcg actggaaaga ttcttcgact cagaagatga     1260

agactttgaa atcctgtccc tttgaaaatc ctggggtcgg gggtggcacc tgtgagagcc     1320

tggggctcct ggtgccgctg cgtttcatcc atcccgcccg ctcgcctgcc gagggctgcg     1380

ccccgtgctg cctcccccca gagggccacc cgctgtgctc gtggactgag gctgcgctgc     1440

ccgggaggcc ttactgcttg gtgtcagact gcccagctca gagtgcccgt cagggcctgt     1500

gcatccgcac gcggagccgt ctgttaggag cttccagagt gttctctcga cactgccagc     1560

cccgtgttag cacctgggcc tcagtcccac ttgctcccag gcgccggttc tgtggttggt     1620

ttggaattaa agtcctgttt gaagttgtca gacacagaca tgaatttctg ggcgctccct     1680

gagtcagagt ctcagaagac ctgtgcaggc tggcgtgaga ggagcggcag ccacactgcg     1740

gccccacgcc caaggactgg gctgctctcg aggggggcgc gcccaccgct gtgtcctctc     1800

tgcccagcct ggcttaccaa gggctacctc agtgggagat gaggttggag gaacgaaggc     1860

gaggttcctc cttgctttgg ggagaaaagt attcaggaag tgggtgtgtg ggaaacctga     1920

agatggcgtg cacaggacac agcgtgggcg gcctgggcag aagggcggct ggctgtcctg     1980

gagctgctgc tggagcctgc cctcagagtg tccctttcca gtgctgtggc attctgtggc     2040

agcttcccca ggtgtggtga cggggggggg gcggggcctc cacctgtgac agccaggctt     2100

gagggtggac ggcgtgcctc tcccaggagc cttccccatg tccttgcctt gctgagaatt     2160

gccctcccat gccgctgagg tgttaggtgg tttagggcca aaaggggaaa accacttgag     2220

tcttgtggtg tgtggtgggc agacaccaca gggtggcatc acctggtggc atttccagaa     2280

cctcagcccc gattccagca cccaccaccg cctgaccctg tgtaacctgc tgtcccgggt     2340

cccagagtgc actctgcccc gctgctctgc tgcctgtcct gggaaagtat ctttgcccca     2400

ctaggaaatg taaacaggag ggcttgggga gcgtgggcac ttttctcatg agcagctact     2460

gcggcgttgg caggactcgc tgctgctgct gctgcttgtg taggtcgggg agccagagat     2520

ccccgaggac gcgcgccgga cagtcggcac tgaccggccc acctggtagc agaggacacc     2580

cccagccccc caagcattga agacatagtg tatttcctcg tatcctttct cccttgggtg     2640

tagttggggt ggggaagcag ggaaggctgg tgcgatctcc attccttggg ctccacgtcc     2700

gagttcatgg tgcgccgctg tgctgggagc tgcagtggta atgtgtggga caccttgacc     2760

aaaggggagc tttgtctcgt gtgttttgaa aaaggcttaa tgaagagaat gttgttcatt     2820

cttagtagta tagtttgcaa ttcttaatgg caaataataa gtttcagtag aaaacaaaaa     2880

aaaaaaaaaa aa                                                         2892


<210>  121
<211>  393
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  ATG4B isoform A encoded by mRNA variant 1 GenBank Accession No.: 
       NP_037457.3  GI:47132611

<400>  121

Met Asp Ala Ala Thr Leu Thr Tyr Asp Thr Leu Arg Phe Ala Glu Phe 
1               5                   10                  15      


Glu Asp Phe Pro Glu Thr Ser Glu Pro Val Trp Ile Leu Gly Arg Lys 
            20                  25                  30          


Tyr Ser Ile Phe Thr Glu Lys Asp Glu Ile Leu Ser Asp Val Ala Ser 
        35                  40                  45              


Arg Leu Trp Phe Thr Tyr Arg Lys Asn Phe Pro Ala Ile Gly Gly Thr 
    50                  55                  60                  


Gly Pro Thr Ser Asp Thr Gly Trp Gly Cys Met Leu Arg Cys Gly Gln 
65                  70                  75                  80  


Met Ile Phe Ala Gln Ala Leu Val Cys Arg His Leu Gly Arg Asp Trp 
                85                  90                  95      


Arg Trp Thr Gln Arg Lys Arg Gln Pro Asp Ser Tyr Phe Ser Val Leu 
            100                 105                 110         


Asn Ala Phe Ile Asp Arg Lys Asp Ser Tyr Tyr Ser Ile His Gln Ile 
        115                 120                 125             


Ala Gln Met Gly Val Gly Glu Gly Lys Ser Ile Gly Gln Trp Tyr Gly 
    130                 135                 140                 


Pro Asn Thr Val Ala Gln Val Leu Lys Lys Leu Ala Val Phe Asp Thr 
145                 150                 155                 160 


Trp Ser Ser Leu Ala Val His Ile Ala Met Asp Asn Thr Val Val Met 
                165                 170                 175     


Glu Glu Ile Arg Arg Leu Cys Arg Thr Ser Val Pro Cys Ala Gly Ala 
            180                 185                 190         


Thr Ala Phe Pro Ala Asp Ser Asp Arg His Cys Asn Gly Phe Pro Ala 
        195                 200                 205             


Gly Ala Glu Val Thr Asn Arg Pro Ser Pro Trp Arg Pro Leu Val Leu 
    210                 215                 220                 


Leu Ile Pro Leu Arg Leu Gly Leu Thr Asp Ile Asn Glu Ala Tyr Val 
225                 230                 235                 240 


Glu Thr Leu Lys His Cys Phe Met Met Pro Gln Ser Leu Gly Val Ile 
                245                 250                 255     


Gly Gly Lys Pro Asn Ser Ala His Tyr Phe Ile Gly Tyr Val Gly Glu 
            260                 265                 270         


Glu Leu Ile Tyr Leu Asp Pro His Thr Thr Gln Pro Ala Val Glu Pro 
        275                 280                 285             


Thr Asp Gly Cys Phe Ile Pro Asp Glu Ser Phe His Cys Gln His Pro 
    290                 295                 300                 


Pro Cys Arg Met Ser Ile Ala Glu Leu Asp Pro Ser Ile Ala Val Gly 
305                 310                 315                 320 


Phe Phe Cys Lys Thr Glu Asp Asp Phe Asn Asp Trp Cys Gln Gln Val 
                325                 330                 335     


Lys Lys Leu Ser Leu Leu Gly Gly Ala Leu Pro Met Phe Glu Leu Val 
            340                 345                 350         


Glu Leu Gln Pro Ser His Leu Ala Cys Pro Asp Val Leu Asn Leu Ser 
        355                 360                 365             


Leu Asp Ser Ser Asp Val Glu Arg Leu Glu Arg Phe Phe Asp Ser Glu 
    370                 375                 380                 


Asp Glu Asp Phe Glu Ile Leu Ser Leu 
385                 390             


<210>  122
<211>  2912
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 4B, cysteine peptidase (ATG4B), 
       transcript variant 2, mRNA

<400>  122
gcgccggcac acctattggc ccccgcggcg tcccgtcgcc gcgtcgcgtt gctggcccgt       60

cggagcgacg ccgctcgggt cagtcggcgg ccggactggg aagatggacg cagctactct      120

gacctacgac actctccggt ttgctgagtt tgaagatttt cctgagacct cagagcccgt      180

ttggatactg ggtagaaaat acagcatttt cacagaaaag gacgagatct tgtctgatgt      240

ggcatctaga ctttggttta catacaggaa aaactttcca gccattgggg ggacaggccc      300

cacctcggac acaggctggg gctgcatgct gcggtgtgga cagatgatct ttgcccaagc      360

cctggtgtgc cggcacctag gccgagattg gaggtggaca caaaggaaga ggcagccaga      420

cagctacttc agcgtcctca acgcattcat cgacaggaag gacagttact actccattca      480

ccagatagcg caaatgggag ttggcgaagg caagtccata ggccagtggt acgggcccaa      540

cactgtcgcc caggtcctga agaagcttgc tgtcttcgat acgtggagct ccttggcggt      600

ccacattgca atggacaaca ctgttgtgat ggaggaaatc agaaggttgt gcaggaccag      660

cgttccctgt gcaggcgcca ctgcgtttcc tgcagattcc gaccggcact gcaacggatt      720

ccctgccgga gctgaggtca ccaacaggcc gtcgccatgg agacccctgg tacttctcat      780

tcccctgcgc ctggggctca cggacatcaa cgaggcctac gtggagacgc tgaagcactg      840

cttcatgatg ccccagtccc tgggcgtcat cggagggaag cccaacagcg cccactactt      900

catcggctac gttggtgagg agctcatcta cctggacccc cacaccacgc agccagccgt      960

ggagcccact gatggctgct tcatcccgga cgagagcttc cactgccagc acccgccgtg     1020

ccgcatgagc atcgcggagc ttgacccgtc catcgctgtg gggtttttct gtaagactga     1080

agatgacttc aatgattggt gccagcaagt caaaaagctg tctctgcttg gaggtgccct     1140

gcccatgttt gagctggtgg agctgcagcc ttcacatctg gcctgccccg acgtcctgaa     1200

cctgtcccta ggtgagagct gccaagtcca gattcttctg atgtagagcg actggaaaga     1260

ttcttcgact cagaagatga agactttgaa atcctgtccc tttgaaaatc ctggggtcgg     1320

gggtggcacc tgtgagagcc tggggctcct ggtgccgctg cgtttcatcc atcccgcccg     1380

ctcgcctgcc gagggctgcg ccccgtgctg cctcccccca gagggccacc cgctgtgctc     1440

gtggactgag gctgcgctgc ccgggaggcc ttactgcttg gtgtcagact gcccagctca     1500

gagtgcccgt cagggcctgt gcatccgcac gcggagccgt ctgttaggag cttccagagt     1560

gttctctcga cactgccagc cccgtgttag cacctgggcc tcagtcccac ttgctcccag     1620

gcgccggttc tgtggttggt ttggaattaa agtcctgttt gaagttgtca gacacagaca     1680

tgaatttctg ggcgctccct gagtcagagt ctcagaagac ctgtgcaggc tggcgtgaga     1740

ggagcggcag ccacactgcg gccccacgcc caaggactgg gctgctctcg aggggggcgc     1800

gcccaccgct gtgtcctctc tgcccagcct ggcttaccaa gggctacctc agtgggagat     1860

gaggttggag gaacgaaggc gaggttcctc cttgctttgg ggagaaaagt attcaggaag     1920

tgggtgtgtg ggaaacctga agatggcgtg cacaggacac agcgtgggcg gcctgggcag     1980

aagggcggct ggctgtcctg gagctgctgc tggagcctgc cctcagagtg tccctttcca     2040

gtgctgtggc attctgtggc agcttcccca ggtgtggtga cggggggggg gcggggcctc     2100

cacctgtgac agccaggctt gagggtggac ggcgtgcctc tcccaggagc cttccccatg     2160

tccttgcctt gctgagaatt gccctcccat gccgctgagg tgttaggtgg tttagggcca     2220

aaaggggaaa accacttgag tcttgtggtg tgtggtgggc agacaccaca gggtggcatc     2280

acctggtggc atttccagaa cctcagcccc gattccagca cccaccaccg cctgaccctg     2340

tgtaacctgc tgtcccgggt cccagagtgc actctgcccc gctgctctgc tgcctgtcct     2400

gggaaagtat ctttgcccca ctaggaaatg taaacaggag ggcttgggga gcgtgggcac     2460

ttttctcatg agcagctact gcggcgttgg caggactcgc tgctgctgct gctgcttgtg     2520

taggtcgggg agccagagat ccccgaggac gcgcgccgga cagtcggcac tgaccggccc     2580

acctggtagc agaggacacc cccagccccc caagcattga agacatagtg tatttcctcg     2640

tatcctttct cccttgggtg tagttggggt ggggaagcag ggaaggctgg tgcgatctcc     2700

attccttggg ctccacgtcc gagttcatgg tgcgccgctg tgctgggagc tgcagtggta     2760

atgtgtggga caccttgacc aaaggggagc tttgtctcgt gtgttttgaa aaaggcttaa     2820

tgaagagaat gttgttcatt cttagtagta tagtttgcaa ttcttaatgg caaataataa     2880

gtttcagtag aaaacaaaaa aaaaaaaaaa aa                                   2912


<210>  123
<211>  380
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 4B, cysteine peptidase (ATG4B), 
       transcript variant 2, polypeptide
       NCBI Reference Sequence: NM_178326.2

<400>  123

Met Asp Ala Ala Thr Leu Thr Tyr Asp Thr Leu Arg Phe Ala Glu Phe 
1               5                   10                  15      


Glu Asp Phe Pro Glu Thr Ser Glu Pro Val Trp Ile Leu Gly Arg Lys 
            20                  25                  30          


Tyr Ser Ile Phe Thr Glu Lys Asp Glu Ile Leu Ser Asp Val Ala Ser 
        35                  40                  45              


Arg Leu Trp Phe Thr Tyr Arg Lys Asn Phe Pro Ala Ile Gly Gly Thr 
    50                  55                  60                  


Gly Pro Thr Ser Asp Thr Gly Trp Gly Cys Met Leu Arg Cys Gly Gln 
65                  70                  75                  80  


Met Ile Phe Ala Gln Ala Leu Val Cys Arg His Leu Gly Arg Asp Trp 
                85                  90                  95      


Arg Trp Thr Gln Arg Lys Arg Gln Pro Asp Ser Tyr Phe Ser Val Leu 
            100                 105                 110         


Asn Ala Phe Ile Asp Arg Lys Asp Ser Tyr Tyr Ser Ile His Gln Ile 
        115                 120                 125             


Ala Gln Met Gly Val Gly Glu Gly Lys Ser Ile Gly Gln Trp Tyr Gly 
    130                 135                 140                 


Pro Asn Thr Val Ala Gln Val Leu Lys Lys Leu Ala Val Phe Asp Thr 
145                 150                 155                 160 


Trp Ser Ser Leu Ala Val His Ile Ala Met Asp Asn Thr Val Val Met 
                165                 170                 175     


Glu Glu Ile Arg Arg Leu Cys Arg Thr Ser Val Pro Cys Ala Gly Ala 
            180                 185                 190         


Thr Ala Phe Pro Ala Asp Ser Asp Arg His Cys Asn Gly Phe Pro Ala 
        195                 200                 205             


Gly Ala Glu Val Thr Asn Arg Pro Ser Pro Trp Arg Pro Leu Val Leu 
    210                 215                 220                 


Leu Ile Pro Leu Arg Leu Gly Leu Thr Asp Ile Asn Glu Ala Tyr Val 
225                 230                 235                 240 


Glu Thr Leu Lys His Cys Phe Met Met Pro Gln Ser Leu Gly Val Ile 
                245                 250                 255     


Gly Gly Lys Pro Asn Ser Ala His Tyr Phe Ile Gly Tyr Val Gly Glu 
            260                 265                 270         


Glu Leu Ile Tyr Leu Asp Pro His Thr Thr Gln Pro Ala Val Glu Pro 
        275                 280                 285             


Thr Asp Gly Cys Phe Ile Pro Asp Glu Ser Phe His Cys Gln His Pro 
    290                 295                 300                 


Pro Cys Arg Met Ser Ile Ala Glu Leu Asp Pro Ser Ile Ala Val Gly 
305                 310                 315                 320 


Phe Phe Cys Lys Thr Glu Asp Asp Phe Asn Asp Trp Cys Gln Gln Val 
                325                 330                 335     


Lys Lys Leu Ser Leu Leu Gly Gly Ala Leu Pro Met Phe Glu Leu Val 
            340                 345                 350         


Glu Leu Gln Pro Ser His Leu Ala Cys Pro Asp Val Leu Asn Leu Ser 
        355                 360                 365             


Leu Gly Glu Ser Cys Gln Val Gln Ile Leu Leu Met 
    370                 375                 380 


<210>  124
<211>  2738
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ATG4C mRNA variant 1 GenBank Accession No.:  NM_032852.3  
       GI:320118918

<400>  124
ctcttactac ggtggccggg gtgctcaaag tacctgtagc tgcggcgctg aggtcggaac       60

gtctgcgtgt gtgcgggctg gttttgtggc ggctgctgct agagctggag catttgccgg      120

gttggtggct cctgcacatt tttacagttc tccagtcctt ctctttcgtc agtataaaag      180

attaaactct acagaagaat gcaatcaagt gatggctttt cctttagaat ttgaatatgg      240

aggctacagg aacagatgaa gttgacaagc taaaaaccaa atttatatct gcttggaaca      300

acatgaaata tagttgggtg ttgaaaacaa agacgtattt tagtagaaat tctcctgtat      360

tattgcttgg aaaatgttac cattttaaat atgaagatga agataaaacg ttacctgcag      420

agtcgggatg tacaatagag gatcacgtaa ttgcaggaaa tgtagaagaa tttcgtaaag      480

atttcatttc tagaatatgg ctgacctaca gggaagaatt ccctcaaata gaaggctcag      540

ctttgacaac agactgtggg tggggctgca cattgagaac tggccagatg ctcttggctc      600

aaggactcat actacacttt cttggtagag cttggacctg gcctgatgct ttgaatattg      660

aaaattcaga ctctgaatca tggacttccc acactgtcaa aaaatttact gcatcatttg      720

aagcatcact ttcaggggaa agagaattca aaaccccaac aatttctctg aaggaaacaa      780

ttgggaaata ttctgatgat catgaaatgc gaaatgaagt ttatcatagg aaaatcatct      840

cttggtttgg tgattccccc ttggctcttt ttggcttaca tcaactaata gaatatggaa      900

agaagtctgg gaaaaaagca ggagattggt atggaccagc tgtggttgct cacattttaa      960

gaaaagcagt tgaagaagca aggcatcctg atttacaagg aataactatt tatgttgcac     1020

aagattgtac agtttacaat tctgatgtaa ttgataaaca gagtgcttcc atgacttctg     1080

ataatgcaga tgacaaagct gttattattc tagttcctgt tagacttggt ggagaaagaa     1140

ccaacaccga ctacttagaa tttgtgaagg gtattttaag cctggaatat tgtgtgggta     1200

ttattggtgg caaacctaaa cagtcatatt actttgctgg atttcaagat gacagtttga     1260

tttacatgga tcctcattac tgccaatctt ttgtagatgt cagcataaag gatttccctc     1320

ttgagacatt ccactgccct tctcccaaaa agatgtcttt tcgaaaaatg gatcccagct     1380

gtacaatagg attttactgt cgaaatgttc aggacttcaa acgagcttct gaagaaatca     1440

ccaagatgct gaaattttct tctaaggaga aatatccctt atttactttt gtaaatggtc     1500

attccagaga ctatgatttt acatctacta caaccaatga agaagacctt ttttcagagg     1560

atgaaaagaa acaattaaaa agatttagca cggaagagtt tgtcttgctt taaagattag     1620

cacatttgtg cttgataaga agaattccat tgaaagggga aaaatgaaga gaaacaagta     1680

tatctgaaat gtttattttc acaaatatct taattttata tgttctttaa aaaagaacat     1740

ttgaaaatat aacagttaaa gatatttttc taaaagagaa atgatttaat gaatcttgct     1800

ttctaataaa taaattgagt gattctggtt gcattcctat ttccctaaga tctactagtg     1860

ataattctac cttaactgta agccttttag tcttcaaagt cttccacctg agcccattgt     1920

tctcatggag gttttgtgat attaaccctc ccccaaagac tgggatcacc aaatagtttc     1980

aaaattctca gtttgtactg aagaccagaa gatcagagaa ggaaacttta atgctgtcta     2040

gcctcctgct attaatgcaa tcaaagaata cttttgcata tgtcttgata attaaatagt     2100

atttgttaac tgtgatatgc atacacttat ataagcagaa ttatgagtta aagtaatact     2160

tagcaatatg attttataat ggctcctcat tatgcttgct gttgaacctt ttatgaggag     2220

tgaatataaa gtattggttt tccctcacaa atttaaagat tatgttatta atactattat     2280

aactgcatca atcaagtcag ataaaggcaa ctataaaata gtagtagtgt ttgtttccta     2340

tctcaagggc gaaattttat gggaactcaa tttattatgc agtttttaag tttaaaatac     2400

caagaaagat gtcactagat tctcttctat gtgatttttg ttttttatat aaagcagtgt     2460

agtggtgttt agaagctgag gccacctgta aggcaaatct gccttaagtg tattatgtgt     2520

tacttaaagg caaatttgtg atctaaaagt acaagagtga tttttgagct aggattataa     2580

aatacataat aaagatgtga gaagataaaa tgcttttgtt ttggttttaa tgttgggatt     2640

attttaatcc tttcatttga aaaatcagtg tctcaaatga attctgttca tttataataa     2700

atgcatatat tgctctgaaa acaaaaaaaa aaaaaaaa                             2738


<210>  125
<211>  458
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  ATG4C encoded by mRNA variant 1 GenBank Accession No.: 
       NP_116241.2  GI:30410844

<400>  125

Met Glu Ala Thr Gly Thr Asp Glu Val Asp Lys Leu Lys Thr Lys Phe 
1               5                   10                  15      


Ile Ser Ala Trp Asn Asn Met Lys Tyr Ser Trp Val Leu Lys Thr Lys 
            20                  25                  30          


Thr Tyr Phe Ser Arg Asn Ser Pro Val Leu Leu Leu Gly Lys Cys Tyr 
        35                  40                  45              


His Phe Lys Tyr Glu Asp Glu Asp Lys Thr Leu Pro Ala Glu Ser Gly 
    50                  55                  60                  


Cys Thr Ile Glu Asp His Val Ile Ala Gly Asn Val Glu Glu Phe Arg 
65                  70                  75                  80  


Lys Asp Phe Ile Ser Arg Ile Trp Leu Thr Tyr Arg Glu Glu Phe Pro 
                85                  90                  95      


Gln Ile Glu Gly Ser Ala Leu Thr Thr Asp Cys Gly Trp Gly Cys Thr 
            100                 105                 110         


Leu Arg Thr Gly Gln Met Leu Leu Ala Gln Gly Leu Ile Leu His Phe 
        115                 120                 125             


Leu Gly Arg Ala Trp Thr Trp Pro Asp Ala Leu Asn Ile Glu Asn Ser 
    130                 135                 140                 


Asp Ser Glu Ser Trp Thr Ser His Thr Val Lys Lys Phe Thr Ala Ser 
145                 150                 155                 160 


Phe Glu Ala Ser Leu Ser Gly Glu Arg Glu Phe Lys Thr Pro Thr Ile 
                165                 170                 175     


Ser Leu Lys Glu Thr Ile Gly Lys Tyr Ser Asp Asp His Glu Met Arg 
            180                 185                 190         


Asn Glu Val Tyr His Arg Lys Ile Ile Ser Trp Phe Gly Asp Ser Pro 
        195                 200                 205             


Leu Ala Leu Phe Gly Leu His Gln Leu Ile Glu Tyr Gly Lys Lys Ser 
    210                 215                 220                 


Gly Lys Lys Ala Gly Asp Trp Tyr Gly Pro Ala Val Val Ala His Ile 
225                 230                 235                 240 


Leu Arg Lys Ala Val Glu Glu Ala Arg His Pro Asp Leu Gln Gly Ile 
                245                 250                 255     


Thr Ile Tyr Val Ala Gln Asp Cys Thr Val Tyr Asn Ser Asp Val Ile 
            260                 265                 270         


Asp Lys Gln Ser Ala Ser Met Thr Ser Asp Asn Ala Asp Asp Lys Ala 
        275                 280                 285             


Val Ile Ile Leu Val Pro Val Arg Leu Gly Gly Glu Arg Thr Asn Thr 
    290                 295                 300                 


Asp Tyr Leu Glu Phe Val Lys Gly Ile Leu Ser Leu Glu Tyr Cys Val 
305                 310                 315                 320 


Gly Ile Ile Gly Gly Lys Pro Lys Gln Ser Tyr Tyr Phe Ala Gly Phe 
                325                 330                 335     


Gln Asp Asp Ser Leu Ile Tyr Met Asp Pro His Tyr Cys Gln Ser Phe 
            340                 345                 350         


Val Asp Val Ser Ile Lys Asp Phe Pro Leu Glu Thr Phe His Cys Pro 
        355                 360                 365             


Ser Pro Lys Lys Met Ser Phe Arg Lys Met Asp Pro Ser Cys Thr Ile 
    370                 375                 380                 


Gly Phe Tyr Cys Arg Asn Val Gln Asp Phe Lys Arg Ala Ser Glu Glu 
385                 390                 395                 400 


Ile Thr Lys Met Leu Lys Phe Ser Ser Lys Glu Lys Tyr Pro Leu Phe 
                405                 410                 415     


Thr Phe Val Asn Gly His Ser Arg Asp Tyr Asp Phe Thr Ser Thr Thr 
            420                 425                 430         


Thr Asn Glu Glu Asp Leu Phe Ser Glu Asp Glu Lys Lys Gln Leu Lys 
        435                 440                 445             


Arg Phe Ser Thr Glu Glu Phe Val Leu Leu 
    450                 455             


<210>  126
<211>  2690
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 4C, cysteine peptidase (ATG4C), 
       transcript variant 2, mRNA
       NCBI Reference Sequence: NM_178221.2

<400>  126
ctcttactac ggtggccggg gtgctcaaag tacctgtagc tgcggcgctg aggtcggaac       60

gtctgcgtgt gtgcgggctg gttttgtggc ggctgctgct agagctggag catttgccgg      120

tcagtataaa agattaaact ctacagaaga atgcaatcaa gtgatggctt ttcctttaga      180

atttgaatat ggaggctaca ggaacagatg aagttgacaa gctaaaaacc aaatttatat      240

ctgcttggaa caacatgaaa tatagttggg tgttgaaaac aaagacgtat tttagtagaa      300

attctcctgt attattgctt ggaaaatgtt accattttaa atatgaagat gaagataaaa      360

cgttacctgc agagtcggga tgtacaatag aggatcacgt aattgcagga aatgtagaag      420

aatttcgtaa agatttcatt tctagaatat ggctgaccta cagggaagaa ttccctcaaa      480

tagaaggctc agctttgaca acagactgtg ggtggggctg cacattgaga actggccaga      540

tgctcttggc tcaaggactc atactacact ttcttggtag agcttggacc tggcctgatg      600

ctttgaatat tgaaaattca gactctgaat catggacttc ccacactgtc aaaaaattta      660

ctgcatcatt tgaagcatca ctttcagggg aaagagaatt caaaacccca acaatttctc      720

tgaaggaaac aattgggaaa tattctgatg atcatgaaat gcgaaatgaa gtttatcata      780

ggaaaatcat ctcttggttt ggtgattccc ccttggctct ttttggctta catcaactaa      840

tagaatatgg aaagaagtct gggaaaaaag caggagattg gtatggacca gctgtggttg      900

ctcacatttt aagaaaagca gttgaagaag caaggcatcc tgatttacaa ggaataacta      960

tttatgttgc acaagattgt acagtttaca attctgatgt aattgataaa cagagtgctt     1020

ccatgacttc tgataatgca gatgacaaag ctgttattat tctagttcct gttagacttg     1080

gtggagaaag aaccaacacc gactacttag aatttgtgaa gggtatttta agcctggaat     1140

attgtgtggg tattattggt ggcaaaccta aacagtcata ttactttgct ggatttcaag     1200

atgacagttt gatttacatg gatcctcatt actgccaatc ttttgtagat gtcagcataa     1260

aggatttccc tcttgagaca ttccactgcc cttctcccaa aaagatgtct tttcgaaaaa     1320

tggatcccag ctgtacaata ggattttact gtcgaaatgt tcaggacttc aaacgagctt     1380

ctgaagaaat caccaagatg ctgaaatttt cttctaagga gaaatatccc ttatttactt     1440

ttgtaaatgg tcattccaga gactatgatt ttacatctac tacaaccaat gaagaagacc     1500

ttttttcaga ggatgaaaag aaacaattaa aaagatttag cacggaagag tttgtcttgc     1560

tttaaagatt agcacatttg tgcttgataa gaagaattcc attgaaaggg gaaaaatgaa     1620

gagaaacaag tatatctgaa atgtttattt tcacaaatat cttaatttta tatgttcttt     1680

aaaaaagaac atttgaaaat ataacagtta aagatatttt tctaaaagag aaatgattta     1740

atgaatcttg ctttctaata aataaattga gtgattctgg ttgcattcct atttccctaa     1800

gatctactag tgataattct accttaactg taagcctttt agtcttcaaa gtcttccacc     1860

tgagcccatt gttctcatgg aggttttgtg atattaaccc tcccccaaag actgggatca     1920

ccaaatagtt tcaaaattct cagtttgtac tgaagaccag aagatcagag aaggaaactt     1980

taatgctgtc tagcctcctg ctattaatgc aatcaaagaa tacttttgca tatgtcttga     2040

taattaaata gtatttgtta actgtgatat gcatacactt atataagcag aattatgagt     2100

taaagtaata cttagcaata tgattttata atggctcctc attatgcttg ctgttgaacc     2160

ttttatgagg agtgaatata aagtattggt tttccctcac aaatttaaag attatgttat     2220

taatactatt ataactgcat caatcaagtc agataaaggc aactataaaa tagtagtagt     2280

gtttgtttcc tatctcaagg gcgaaatttt atgggaactc aatttattat gcagttttta     2340

agtttaaaat accaagaaag atgtcactag attctcttct atgtgatttt tgttttttat     2400

ataaagcagt gtagtggtgt ttagaagctg aggccacctg taaggcaaat ctgccttaag     2460

tgtattatgt gttacttaaa ggcaaatttg tgatctaaaa gtacaagagt gatttttgag     2520

ctaggattat aaaatacata ataaagatgt gagaagataa aatgcttttg ttttggtttt     2580

aatgttggga ttattttaat cctttcattt gaaaaatcag tgtctcaaat gaattctgtt     2640

catttataat aaatgcatat attgctctga aaacaaaaaa aaaaaaaaaa                2690


<210>  127
<211>  458
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 4C, cysteine peptidase (ATG4C), 
       transcript variant 2, polypeptide

<400>  127

Met Glu Ala Thr Gly Thr Asp Glu Val Asp Lys Leu Lys Thr Lys Phe 
1               5                   10                  15      


Ile Ser Ala Trp Asn Asn Met Lys Tyr Ser Trp Val Leu Lys Thr Lys 
            20                  25                  30          


Thr Tyr Phe Ser Arg Asn Ser Pro Val Leu Leu Leu Gly Lys Cys Tyr 
        35                  40                  45              


His Phe Lys Tyr Glu Asp Glu Asp Lys Thr Leu Pro Ala Glu Ser Gly 
    50                  55                  60                  


Cys Thr Ile Glu Asp His Val Ile Ala Gly Asn Val Glu Glu Phe Arg 
65                  70                  75                  80  


Lys Asp Phe Ile Ser Arg Ile Trp Leu Thr Tyr Arg Glu Glu Phe Pro 
                85                  90                  95      


Gln Ile Glu Gly Ser Ala Leu Thr Thr Asp Cys Gly Trp Gly Cys Thr 
            100                 105                 110         


Leu Arg Thr Gly Gln Met Leu Leu Ala Gln Gly Leu Ile Leu His Phe 
        115                 120                 125             


Leu Gly Arg Ala Trp Thr Trp Pro Asp Ala Leu Asn Ile Glu Asn Ser 
    130                 135                 140                 


Asp Ser Glu Ser Trp Thr Ser His Thr Val Lys Lys Phe Thr Ala Ser 
145                 150                 155                 160 


Phe Glu Ala Ser Leu Ser Gly Glu Arg Glu Phe Lys Thr Pro Thr Ile 
                165                 170                 175     


Ser Leu Lys Glu Thr Ile Gly Lys Tyr Ser Asp Asp His Glu Met Arg 
            180                 185                 190         


Asn Glu Val Tyr His Arg Lys Ile Ile Ser Trp Phe Gly Asp Ser Pro 
        195                 200                 205             


Leu Ala Leu Phe Gly Leu His Gln Leu Ile Glu Tyr Gly Lys Lys Ser 
    210                 215                 220                 


Gly Lys Lys Ala Gly Asp Trp Tyr Gly Pro Ala Val Val Ala His Ile 
225                 230                 235                 240 


Leu Arg Lys Ala Val Glu Glu Ala Arg His Pro Asp Leu Gln Gly Ile 
                245                 250                 255     


Thr Ile Tyr Val Ala Gln Asp Cys Thr Val Tyr Asn Ser Asp Val Ile 
            260                 265                 270         


Asp Lys Gln Ser Ala Ser Met Thr Ser Asp Asn Ala Asp Asp Lys Ala 
        275                 280                 285             


Val Ile Ile Leu Val Pro Val Arg Leu Gly Gly Glu Arg Thr Asn Thr 
    290                 295                 300                 


Asp Tyr Leu Glu Phe Val Lys Gly Ile Leu Ser Leu Glu Tyr Cys Val 
305                 310                 315                 320 


Gly Ile Ile Gly Gly Lys Pro Lys Gln Ser Tyr Tyr Phe Ala Gly Phe 
                325                 330                 335     


Gln Asp Asp Ser Leu Ile Tyr Met Asp Pro His Tyr Cys Gln Ser Phe 
            340                 345                 350         


Val Asp Val Ser Ile Lys Asp Phe Pro Leu Glu Thr Phe His Cys Pro 
        355                 360                 365             


Ser Pro Lys Lys Met Ser Phe Arg Lys Met Asp Pro Ser Cys Thr Ile 
    370                 375                 380                 


Gly Phe Tyr Cys Arg Asn Val Gln Asp Phe Lys Arg Ala Ser Glu Glu 
385                 390                 395                 400 


Ile Thr Lys Met Leu Lys Phe Ser Ser Lys Glu Lys Tyr Pro Leu Phe 
                405                 410                 415     


Thr Phe Val Asn Gly His Ser Arg Asp Tyr Asp Phe Thr Ser Thr Thr 
            420                 425                 430         


Thr Asn Glu Glu Asp Leu Phe Ser Glu Asp Glu Lys Lys Gln Leu Lys 
        435                 440                 445             


Arg Phe Ser Thr Glu Glu Phe Val Leu Leu 
    450                 455             


<210>  128
<211>  1996
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ATG4D mRNA variant 1 GenBank Accession No.:  NM_032885.5  
       GI:528078304

<400>  128
gtcatccggg gtcctggccc gctaagatgg cgatggctgc ggtagcagcg gcggcggctg       60

ttgcctggcc cggtaccctg gggacggggg ccgagtagcg ccttccccgg gccccgtgaa      120

ccggctgcgg gtcgcccttg gggggcagcg gccgcagccc cccacctggg ccctcggtcc      180

gccctcccgg cgcgtccatg aactcagtgt cgccggccgc cgcgcagtac cggagcagca      240

gcccggagga cgcgcgccgc cggcccgagg cccgcaggcc gcggggtccc agaggcccag      300

accccaacgg cctggggcct tccggagcca gcggccccgc tcttggctct cccggggctg      360

gcccgagtga gccggacgaa gtggacaagt tcaaggccaa gttcctgaca gcctggaaca      420

acgtcaagta cggttgggtg gttaaaagcc ggaccagctt tagcaagatc tccagcatcc      480

acctctgtgg ccgccgctac cgtttcgagg gcgagggtga catacagcgt ttccagcggg      540

actttgtgtc ccgcctgtgg ctcacatacc gccgggactt cccgcccctt cctgggggct      600

gcctgacctc ggactgtggc tgggggtgca tgttacgcag cggccagatg atgctggcac      660

agggccttct gctgcatttc ctgcccagag actggacatg ggccgagggc atgggcctgg      720

gcccccctga gctgtcaggg tcagcctctc ccagccggta ccatgggcct gcccgctgga      780

tgcccccacg ctgggcccag ggtgcccctg agctggagca ggaacgccgg caccggcaga      840

ttgtgtcctg gttcgccgac cacccccggg ccccctttgg cctacaccgg ctggtggagc      900

ttgggcagag ctcaggcaag aaggcaggtg actggtatgg gccatcgcta gtggcacaca      960

tcctcaggaa agccgtggag agctgctccg acgtcacccg cctggtggtg tacgtttctc     1020

aggactgcac agtgtacaag gcggatgtgg cacgcctggt ggccaggcca gaccccacag     1080

ccgagtggaa gtctgtggtc atcctggtgc ccgtgcgact gggtggcgag actctcaacc     1140

ccgtgtatgt gccctgcgtg aaggaactcc tgcgttgcga gctgtgcctg ggcatcatgg     1200

gtgggaaacc gcgacactca ctgtacttca ttggctacca agatgacttc ctgctgtacc     1260

tggaccctca ctactgccag cccactgtgg atgtcagcca ggccgacttc cccctggagt     1320

ccttccactg cacctcgccc cgcaagatgg cctttgccaa gatggaccca agctgtaccg     1380

tgggcttcta tgctggagac aggaaggagt ttgagacact ctgctcagag ctgaccaggg     1440

tcctcagctc ctcctcagcc acagagcggt accccatgtt caccctggcc gagggccatg     1500

ctcaggacca cagcctggac gacctctgct cccagctcgc ccagcccaca ctccggctcc     1560

ctcgcacagg gcggctcctc agggccaaac gccccagctc tgaggacttt gtgtttttat     1620

aaagggaggg gatgagggga aagatacaac actatttatt tttttattta tgtcatgtcg     1680

ggtgtgggat cttgagctct ggcagtgatg atggtacttc ctgttgtcag cccctcaagc     1740

ccagctgcaa ccagtctggg gccattcagc cagggacaga gcccacagag cccatacacc     1800

tgtctcccac cagcggggcc ctcctggcag ggtagggaag gaggaccccg ggcacccccc     1860

tcagggcctg actcacgtac tgtagtttgc actggacgcc cgggccctcc ctgtcccaaa     1920

gcccccttgg gggaactgtg gctgctgggg gccaataaag ctgtgtaact tgatcgtgaa     1980

aaaaaaaaaa aaaaaa                                                     1996


<210>  129
<211>  474
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  ATG4D isoform 1 encoded by mRNA transcript variant 1 GenBank 
       Accession No.: NP_116274.3  GI:27903825

<400>  129

Met Asn Ser Val Ser Pro Ala Ala Ala Gln Tyr Arg Ser Ser Ser Pro 
1               5                   10                  15      


Glu Asp Ala Arg Arg Arg Pro Glu Ala Arg Arg Pro Arg Gly Pro Arg 
            20                  25                  30          


Gly Pro Asp Pro Asn Gly Leu Gly Pro Ser Gly Ala Ser Gly Pro Ala 
        35                  40                  45              


Leu Gly Ser Pro Gly Ala Gly Pro Ser Glu Pro Asp Glu Val Asp Lys 
    50                  55                  60                  


Phe Lys Ala Lys Phe Leu Thr Ala Trp Asn Asn Val Lys Tyr Gly Trp 
65                  70                  75                  80  


Val Val Lys Ser Arg Thr Ser Phe Ser Lys Ile Ser Ser Ile His Leu 
                85                  90                  95      


Cys Gly Arg Arg Tyr Arg Phe Glu Gly Glu Gly Asp Ile Gln Arg Phe 
            100                 105                 110         


Gln Arg Asp Phe Val Ser Arg Leu Trp Leu Thr Tyr Arg Arg Asp Phe 
        115                 120                 125             


Pro Pro Leu Pro Gly Gly Cys Leu Thr Ser Asp Cys Gly Trp Gly Cys 
    130                 135                 140                 


Met Leu Arg Ser Gly Gln Met Met Leu Ala Gln Gly Leu Leu Leu His 
145                 150                 155                 160 


Phe Leu Pro Arg Asp Trp Thr Trp Ala Glu Gly Met Gly Leu Gly Pro 
                165                 170                 175     


Pro Glu Leu Ser Gly Ser Ala Ser Pro Ser Arg Tyr His Gly Pro Ala 
            180                 185                 190         


Arg Trp Met Pro Pro Arg Trp Ala Gln Gly Ala Pro Glu Leu Glu Gln 
        195                 200                 205             


Glu Arg Arg His Arg Gln Ile Val Ser Trp Phe Ala Asp His Pro Arg 
    210                 215                 220                 


Ala Pro Phe Gly Leu His Arg Leu Val Glu Leu Gly Gln Ser Ser Gly 
225                 230                 235                 240 


Lys Lys Ala Gly Asp Trp Tyr Gly Pro Ser Leu Val Ala His Ile Leu 
                245                 250                 255     


Arg Lys Ala Val Glu Ser Cys Ser Asp Val Thr Arg Leu Val Val Tyr 
            260                 265                 270         


Val Ser Gln Asp Cys Thr Val Tyr Lys Ala Asp Val Ala Arg Leu Val 
        275                 280                 285             


Ala Arg Pro Asp Pro Thr Ala Glu Trp Lys Ser Val Val Ile Leu Val 
    290                 295                 300                 


Pro Val Arg Leu Gly Gly Glu Thr Leu Asn Pro Val Tyr Val Pro Cys 
305                 310                 315                 320 


Val Lys Glu Leu Leu Arg Cys Glu Leu Cys Leu Gly Ile Met Gly Gly 
                325                 330                 335     


Lys Pro Arg His Ser Leu Tyr Phe Ile Gly Tyr Gln Asp Asp Phe Leu 
            340                 345                 350         


Leu Tyr Leu Asp Pro His Tyr Cys Gln Pro Thr Val Asp Val Ser Gln 
        355                 360                 365             


Ala Asp Phe Pro Leu Glu Ser Phe His Cys Thr Ser Pro Arg Lys Met 
    370                 375                 380                 


Ala Phe Ala Lys Met Asp Pro Ser Cys Thr Val Gly Phe Tyr Ala Gly 
385                 390                 395                 400 


Asp Arg Lys Glu Phe Glu Thr Leu Cys Ser Glu Leu Thr Arg Val Leu 
                405                 410                 415     


Ser Ser Ser Ser Ala Thr Glu Arg Tyr Pro Met Phe Thr Leu Ala Glu 
            420                 425                 430         


Gly His Ala Gln Asp His Ser Leu Asp Asp Leu Cys Ser Gln Leu Ala 
        435                 440                 445             


Gln Pro Thr Leu Arg Leu Pro Arg Thr Gly Arg Leu Leu Arg Ala Lys 
    450                 455                 460                 


Arg Pro Ser Ser Glu Asp Phe Val Phe Leu 
465                 470                 


<210>  130
<211>  2070
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 4D, cysteine peptidase (ATG4D), 
       transcript variant 2, mRNA NCBI Reference Sequence: 
       NM_001281504.1

<400>  130
gtcatccggg gtcctggccc gctaagatgg cgatggctgc ggtagcagcg gcggcggctg       60

ttgcctggcc cggtaccctg gggacggggg ccgagtagcg ccttccccgg gccccgtgaa      120

ccggctgcgg gtcgcccttg gggggcagcg gccgcagccc cccacctggg ccctcggtcc      180

gccctcccgg cgcgtccatg aactcagtgt cgccggccgc cgcgcagtac cggagcagca      240

gcccggagga cgcgcgccgc cggcccgagg cccgcaggcc gcggggtccc agaggcccag      300

accccaacgg cctggggcct tccggagcca gcggccccgc tcttggctct cccggggctg      360

gcccgagtga gccggacgaa gtggacaagt tcaaggccaa gttcctgaca gcctggaaca      420

acgtcaagta cgatggggga atggaagcgc tgttgctgtc atgcaagtgc ttcatctcgc      480

tgggcgctgc ccctacgtct ccccaggttg ggtggttaaa agccggacca gctttagcaa      540

gatctccagc atccacctct gtggccgccg ctaccgtttc gagggcgagg gtgacataca      600

gcgtttccag cgggactttg tgtcccgcct gtggctcaca taccgccggg acttcccgcc      660

ccttcctggg ggctgcctga cctcggactg tggctggggg tgcatgttac gcagcggcca      720

gatgatgctg gcacagggcc ttctgctgca tttcctgccc agagactgga catgggccga      780

gggcatgggc ctgggccccc ctgagctgtc agggtcagcc tctcccagcc ggtaccatgg      840

gcctgcccgc tggatgcccc cacgctgggc ccagggtgcc cctgagctgg agcaggaacg      900

ccggcaccgg cagattgtgt cctggttcgc cgaccacccc cgggccccct ttggcctaca      960

ccggctggtg gagcttgggc agagctcagg caagaaggca ggtgactggt atgggccatc     1020

gctagtggca cacatcctca ggaaagccgt ggagagctgc tccgacgtca cccgcctggt     1080

ggtgtacgtt tctcaggact gcacagtgta caaggcggat gtggcacgcc tggtggccag     1140

gccagacccc acagccgagt ggaagtctgt ggtcatcctg gtgcccgtgc gactgggtgg     1200

cgagactctc aaccccgtgt atgtgccctg cgtgaaggaa ctcctgcgtt gcgagctgtg     1260

cctgggcatc atgggtggga aaccgcgaca ctcactgtac ttcattggct accaagatga     1320

cttcctgctg tacctggacc ctcactactg ccagcccact gtggatgtca gccaggccga     1380

cttccccctg gagtccttcc actgcacctc gccccgcaag atggcctttg ccaagatgga     1440

cccaagctgt accgtgggct tctatgctgg agacaggaag gagtttgaga cactctgctc     1500

agagctgacc agggtcctca gctcctcctc agccacagag cggtacccca tgttcaccct     1560

ggccgagggc catgctcagg accacagcct ggacgacctc tgctcccagc tcgcccagcc     1620

cacactccgg ctccctcgca cagggcggct cctcagggcc aaacgcccca gctctgagga     1680

ctttgtgttt ttataaaggg aggggatgag gggaaagata caacactatt tattttttta     1740

tttatgtcat gtcgggtgtg ggatcttgag ctctggcagt gatgatggta cttcctgttg     1800

tcagcccctc aagcccagct gcaaccagtc tggggccatt cagccaggga cagagcccac     1860

agagcccata cacctgtctc ccaccagcgg ggccctcctg gcagggtagg gaaggaggac     1920

cccgggcacc cccctcaggg cctgactcac gtactgtagt ttgcactgga cgcccgggcc     1980

ctccctgtcc caaagccccc ttgggggaac tgtggctgct gggggccaat aaagctgtgt     2040

aacttgatcg tgaaaaaaaa aaaaaaaaaa                                      2070


<210>  131
<211>  411
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens autophagy related 4D, cysteine peptidase (ATG4D), 
       transcript variant 2, polypeptide NCBI Reference Sequence: 
       NM_001281504.1

<400>  131

Met Gln Val Leu His Leu Ala Gly Arg Cys Pro Tyr Val Ser Pro Gly 
1               5                   10                  15      


Trp Val Val Lys Ser Arg Thr Ser Phe Ser Lys Ile Ser Ser Ile His 
            20                  25                  30          


Leu Cys Gly Arg Arg Tyr Arg Phe Glu Gly Glu Gly Asp Ile Gln Arg 
        35                  40                  45              


Phe Gln Arg Asp Phe Val Ser Arg Leu Trp Leu Thr Tyr Arg Arg Asp 
    50                  55                  60                  


Phe Pro Pro Leu Pro Gly Gly Cys Leu Thr Ser Asp Cys Gly Trp Gly 
65                  70                  75                  80  


Cys Met Leu Arg Ser Gly Gln Met Met Leu Ala Gln Gly Leu Leu Leu 
                85                  90                  95      


His Phe Leu Pro Arg Asp Trp Thr Trp Ala Glu Gly Met Gly Leu Gly 
            100                 105                 110         


Pro Pro Glu Leu Ser Gly Ser Ala Ser Pro Ser Arg Tyr His Gly Pro 
        115                 120                 125             


Ala Arg Trp Met Pro Pro Arg Trp Ala Gln Gly Ala Pro Glu Leu Glu 
    130                 135                 140                 


Gln Glu Arg Arg His Arg Gln Ile Val Ser Trp Phe Ala Asp His Pro 
145                 150                 155                 160 


Arg Ala Pro Phe Gly Leu His Arg Leu Val Glu Leu Gly Gln Ser Ser 
                165                 170                 175     


Gly Lys Lys Ala Gly Asp Trp Tyr Gly Pro Ser Leu Val Ala His Ile 
            180                 185                 190         


Leu Arg Lys Ala Val Glu Ser Cys Ser Asp Val Thr Arg Leu Val Val 
        195                 200                 205             


Tyr Val Ser Gln Asp Cys Thr Val Tyr Lys Ala Asp Val Ala Arg Leu 
    210                 215                 220                 


Val Ala Arg Pro Asp Pro Thr Ala Glu Trp Lys Ser Val Val Ile Leu 
225                 230                 235                 240 


Val Pro Val Arg Leu Gly Gly Glu Thr Leu Asn Pro Val Tyr Val Pro 
                245                 250                 255     


Cys Val Lys Glu Leu Leu Arg Cys Glu Leu Cys Leu Gly Ile Met Gly 
            260                 265                 270         


Gly Lys Pro Arg His Ser Leu Tyr Phe Ile Gly Tyr Gln Asp Asp Phe 
        275                 280                 285             


Leu Leu Tyr Leu Asp Pro His Tyr Cys Gln Pro Thr Val Asp Val Ser 
    290                 295                 300                 


Gln Ala Asp Phe Pro Leu Glu Ser Phe His Cys Thr Ser Pro Arg Lys 
305                 310                 315                 320 


Met Ala Phe Ala Lys Met Asp Pro Ser Cys Thr Val Gly Phe Tyr Ala 
                325                 330                 335     


Gly Asp Arg Lys Glu Phe Glu Thr Leu Cys Ser Glu Leu Thr Arg Val 
            340                 345                 350         


Leu Ser Ser Ser Ser Ala Thr Glu Arg Tyr Pro Met Phe Thr Leu Ala 
        355                 360                 365             


Glu Gly His Ala Gln Asp His Ser Leu Asp Asp Leu Cys Ser Gln Leu 
    370                 375                 380                 


Ala Gln Pro Thr Leu Arg Leu Pro Arg Thr Gly Arg Leu Leu Arg Ala 
385                 390                 395                 400 


Lys Arg Pro Ser Ser Glu Asp Phe Val Phe Leu 
                405                 410     


<210>  132
<211>  4835
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens MDS1 and EVI1 complex locus (MECOM), transcript 
       variant 2, mRNA NCBI Reference Sequence: NM_005241.3

<400>  132
ccttgccaag taacagcttt gctgtccaac atcgtgtgct gcttcgcgag aaagtcacat       60

tcggaccctt tggctagatt atcttagacg aattttacaa tgtgaagttc tgcatagatg      120

ccagtcaacc agatgttgga agctggctca agtacattag attcgctggc tgttatgatc      180

agcacaacct tgttgcatgc cagataaatg atcagatatt ctatagagta gttgcagaca      240

ttgcgccggg agaggagctt ctgctgttca tgaagagcga agactatccc catgaaacta      300

tggcgccgga tatccacgaa gaacggcaat atcgctgcga agactgtgac cagctctttg      360

aatctaaggc tgaactagca gatcaccaaa agtttccatg cagtactcct cactcagcat      420

tttcaatggt tgaagaggac tttcagcaaa aactcgaaag cgagaatgat ctccaagaga      480

tacacacgat ccaggagtgt aaggaatgtg accaagtttt tcctgatttg caaagcctgg      540

agaaacacat gctgtcacat actgaagaga gggaatacaa gtgtgatcag tgtcccaagg      600

catttaactg gaagtccaat ttaattcgcc accagatgtc acatgacagt ggaaagcact      660

atgaatgtga aaactgtgcc aaggttttca cggaccctag caaccttcag cggcacattc      720

gctctcagca tgtcggtgcc cgggcccatg catgcccgga gtgtggcaaa acgtttgcca      780

cttcgtcggg cctcaaacaa cacaagcaca tccacagcag tgtgaagccc tttatctgtg      840

aggtctgcca taaatcctat actcagtttt caaacctttg ccgtcataag cgcatgcatg      900

ctgattgcag aacccaaatc aagtgcaaag actgtggaca aatgttcagc actacgtctt      960

ccttaaataa acacaggagg ttttgtgagg gcaagaacca ttttgcggca ggtggatttt     1020

ttggccaagg catttcactt cctggaaccc cagctatgga taaaacgtcc atggttaata     1080

tgagtcatgc caacccgggc cttgctgact attttggcgc caataggcat cctgctggtc     1140

ttacctttcc aacagctcct ggattttctt ttagcttccc tggtctgttt ccttccggct     1200

tgtaccacag gcctcctttg atacctgcta gttctcctgt taaaggacta tcaagtactg     1260

aacagacaaa caaaagtcaa agtcccctca tgacacatcc tcagatactg ccagctacac     1320

aggatatttt gaaggcacta tctaaacacc catctgtagg ggacaataag ccagtggagc     1380

tccagcccga gaggtcctct gaagagaggc cctttgagaa aatcagtgac cagtcagaga     1440

gtagtgacct tgatgatgtc agtacaccaa gtggcagtga cctggaaaca acctcgggct     1500

ctgatctgga aagtgacatt gaaagtgata aagagaaatt taaagaaaat ggtaaaatgt     1560

tcaaagacaa agtaagccct cttcagaatc tggcttcaat aaataataag aaagaataca     1620

gcaatcattc cattttctca ccatctttag aggagcagac tgcggtgtca ggagctgtga     1680

atgattctat aaaggctatt gcttctattg ctgaaaaata ctttggttca acaggactgg     1740

tggggctgca agacaaaaaa gttggagctt taccttaccc ttccatgttt cccctcccat     1800

tttttccagc attctctcaa tcaatgtacc catttcctga tagagacttg agatcgttac     1860

ctttgaaaat ggaaccccaa tcaccaggtg aagtaaagaa actgcagaag ggcagctctg     1920

agtccccctt tgatctcacc actaagcgaa aggatgagaa gcccttgact ccagtcccct     1980

ccaagcctcc agtgacacct gccacaagcc aagaccagcc cctggatcta agtatgggca     2040

gtaggagtag agccagtggg acaaagctga ctgagcctcg aaaaaaccac gtgtttgggg     2100

gaaaaaaagg aagcaacgtc gaatcaagac ctgcttcaga tggttccttg cagcatgcaa     2160

gacccactcc tttctttatg gaccctattt acagagtaga gaaaagaaaa ctaactgacc     2220

cacttgaagc tttaaaagag aaatacttga ggccttctcc aggattcttg tttcacccac     2280

aattccaact gcctgatcag agaacttgga tgtcagctat tgaaaacatg gcagaaaagc     2340

tagagagctt cagtgccctg aaacctgagg ccagtgagct cttacagtca gtgccctcta     2400

tgttcaactt cagggcgcct cccaatgccc tgccagagaa ccttctgcgg aagggaaagg     2460

agcgctatac ctgcagatac tgtggcaaga tttttccaag gtctgcaaac ctaacacggc     2520

acttgagaac ccacacagga gagcagcctt acagatgcaa atactgtgac agatcattta     2580

gcatatcttc taacttgcaa aggcatgttc gcaacatcca caataaagag aagccattta     2640

agtgtcactt atgtgatagg tgttttggtc aacaaaccaa tttagacaga cacctaaaga     2700

aacatgagaa tgggaacatg tccggtacag caacatcgtc gcctcattct gaactggaaa     2760

gtacaggtgc gattctggat gacaaagaag atgcttactt cacagaaatt cgaaatttca     2820

ttgggaacag caaccatggc agccaatctc ccaggaatgt ggaggagaga atgaatggca     2880

gtcattttaa agatgaaaag gctttggtga ccagtcaaaa ttcagacttg ctggatgatg     2940

aagaagttga agatgaggtg ttgttagatg aggaggatga agacaatgat attactggaa     3000

aaacaggaaa ggaaccagtg acaagtaatt tacatgaagg aaaccctgag gatgactatg     3060

aagaaaccag tgccctggag atgagttgca agacatcccc agtgaggtat aaagaggaag     3120

aatataaaag tggactttct gctctagatc atataaggca cttcacagat agcctcaaaa     3180

tgaggaaaat ggaagataat caatattctg aagctgagct gtcttctttt agtacttccc     3240

atgtgccaga ggaacttaag cagccgttac acagaaagtc caaatcgcag gcatatgcta     3300

tgatgctgtc actgtctgac aaggagtccc tccattctac atcccacagt tcttccaacg     3360

tgtggcacag tatggccagg gctgcggcgg aatccagtgc tatccagtcc ataagccacg     3420

tatgacgtta tcaaggttga ccagagtggg accaagtcca acagtagcat ggctctttca     3480

tataggacta tttacaagac tgctgagcag aatgccttat aaacctgcag ggtcactcat     3540

ctaaagtcta gtgaccttaa actgaatgat ttaaaaaaga aaagaaagaa aaaagaaact     3600

atttattctc gatattttgt tttgcacagc aaaggcagct gctgacttct ggaagatcaa     3660

tcaatgcgac ttaaagtgat tcagtgaaaa caaaaaactt ggtgggctga aggcatcttc     3720

cagtttaccc caccttaggg tatgggtggg tgagaagggc agttgagatg gcagcattga     3780

tatgaatgaa cactccatag aaactgaatt ctcttttgta caagatcacc tgacatgatt     3840

gggaacagtt gcttttaatt acagatttaa tttttttctt cgttaaagtt ttatgtaatt     3900

taaccctttg aagacagaag tagttggatg aaatgcacag tcaattatta tagaaactga     3960

taacagggag tacttgttcc cccttttgcc ttcttaagta cattgtttaa aactagggaa     4020

aaagggtatg tgtatattgt aaactatgga tgttaacact caaagaggtt aagtcagtga     4080

agtaacctat tcatcaccag taccgctgta ccactaataa attgtttgcc aaatccttgt     4140

aataacatct taattttaga caatcatgtc actgttttta atgtttattt ttttgtgtgt     4200

gttgcgtgta tcatgtattt atttgttggc aaactattgt ttgttgatta aaatagcact     4260

gttccagtca gccactactt tatgacgtct gaggcacacc cctttccgaa tttcaaggac     4320

caaggtgacc cgacctgtgt atgagagtgc caaatggtgt ttggcttttc ttaacattcc     4380

tttttgtttg tttgttttgt tttccttctt aatgaactaa atacgaatag atgcaactta     4440

gtttttgtaa tactgaaatc gattcaattg tataaacgat tataatttct ttcatggaag     4500

catgattctt ctgattaaaa actgtactcc atattttatg ctggttgtct gcaagcttgt     4560

gcgatgttat gttcatgtta atcctatttg taaaatgaag tgttcccaac cttatgttaa     4620

aagagagaag taaataacag actgtattca gttattttgc cctttattga ggaaccagat     4680

ttgttttctt tttgtttgta atctcatttt gaaataatca gcaagttgag gtactttctt     4740

caaatgcttt gtacaatata aactgttatg cctttcagtg cattactatg ggaggagcaa     4800

ctaaaaaata aagacttaca aaaaggagta ttttt                                4835


<210>  133
<211>  1051
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens MDS1 and EVI1 complex locus (MECOM), transcript 
       variant 2, polypeptide NCBI Reference Sequence: NM_005241.3

<400>  133

Met Lys Ser Glu Asp Tyr Pro His Glu Thr Met Ala Pro Asp Ile His 
1               5                   10                  15      


Glu Glu Arg Gln Tyr Arg Cys Glu Asp Cys Asp Gln Leu Phe Glu Ser 
            20                  25                  30          


Lys Ala Glu Leu Ala Asp His Gln Lys Phe Pro Cys Ser Thr Pro His 
        35                  40                  45              


Ser Ala Phe Ser Met Val Glu Glu Asp Phe Gln Gln Lys Leu Glu Ser 
    50                  55                  60                  


Glu Asn Asp Leu Gln Glu Ile His Thr Ile Gln Glu Cys Lys Glu Cys 
65                  70                  75                  80  


Asp Gln Val Phe Pro Asp Leu Gln Ser Leu Glu Lys His Met Leu Ser 
                85                  90                  95      


His Thr Glu Glu Arg Glu Tyr Lys Cys Asp Gln Cys Pro Lys Ala Phe 
            100                 105                 110         


Asn Trp Lys Ser Asn Leu Ile Arg His Gln Met Ser His Asp Ser Gly 
        115                 120                 125             


Lys His Tyr Glu Cys Glu Asn Cys Ala Lys Val Phe Thr Asp Pro Ser 
    130                 135                 140                 


Asn Leu Gln Arg His Ile Arg Ser Gln His Val Gly Ala Arg Ala His 
145                 150                 155                 160 


Ala Cys Pro Glu Cys Gly Lys Thr Phe Ala Thr Ser Ser Gly Leu Lys 
                165                 170                 175     


Gln His Lys His Ile His Ser Ser Val Lys Pro Phe Ile Cys Glu Val 
            180                 185                 190         


Cys His Lys Ser Tyr Thr Gln Phe Ser Asn Leu Cys Arg His Lys Arg 
        195                 200                 205             


Met His Ala Asp Cys Arg Thr Gln Ile Lys Cys Lys Asp Cys Gly Gln 
    210                 215                 220                 


Met Phe Ser Thr Thr Ser Ser Leu Asn Lys His Arg Arg Phe Cys Glu 
225                 230                 235                 240 


Gly Lys Asn His Phe Ala Ala Gly Gly Phe Phe Gly Gln Gly Ile Ser 
                245                 250                 255     


Leu Pro Gly Thr Pro Ala Met Asp Lys Thr Ser Met Val Asn Met Ser 
            260                 265                 270         


His Ala Asn Pro Gly Leu Ala Asp Tyr Phe Gly Ala Asn Arg His Pro 
        275                 280                 285             


Ala Gly Leu Thr Phe Pro Thr Ala Pro Gly Phe Ser Phe Ser Phe Pro 
    290                 295                 300                 


Gly Leu Phe Pro Ser Gly Leu Tyr His Arg Pro Pro Leu Ile Pro Ala 
305                 310                 315                 320 


Ser Ser Pro Val Lys Gly Leu Ser Ser Thr Glu Gln Thr Asn Lys Ser 
                325                 330                 335     


Gln Ser Pro Leu Met Thr His Pro Gln Ile Leu Pro Ala Thr Gln Asp 
            340                 345                 350         


Ile Leu Lys Ala Leu Ser Lys His Pro Ser Val Gly Asp Asn Lys Pro 
        355                 360                 365             


Val Glu Leu Gln Pro Glu Arg Ser Ser Glu Glu Arg Pro Phe Glu Lys 
    370                 375                 380                 


Ile Ser Asp Gln Ser Glu Ser Ser Asp Leu Asp Asp Val Ser Thr Pro 
385                 390                 395                 400 


Ser Gly Ser Asp Leu Glu Thr Thr Ser Gly Ser Asp Leu Glu Ser Asp 
                405                 410                 415     


Ile Glu Ser Asp Lys Glu Lys Phe Lys Glu Asn Gly Lys Met Phe Lys 
            420                 425                 430         


Asp Lys Val Ser Pro Leu Gln Asn Leu Ala Ser Ile Asn Asn Lys Lys 
        435                 440                 445             


Glu Tyr Ser Asn His Ser Ile Phe Ser Pro Ser Leu Glu Glu Gln Thr 
    450                 455                 460                 


Ala Val Ser Gly Ala Val Asn Asp Ser Ile Lys Ala Ile Ala Ser Ile 
465                 470                 475                 480 


Ala Glu Lys Tyr Phe Gly Ser Thr Gly Leu Val Gly Leu Gln Asp Lys 
                485                 490                 495     


Lys Val Gly Ala Leu Pro Tyr Pro Ser Met Phe Pro Leu Pro Phe Phe 
            500                 505                 510         


Pro Ala Phe Ser Gln Ser Met Tyr Pro Phe Pro Asp Arg Asp Leu Arg 
        515                 520                 525             


Ser Leu Pro Leu Lys Met Glu Pro Gln Ser Pro Gly Glu Val Lys Lys 
    530                 535                 540                 


Leu Gln Lys Gly Ser Ser Glu Ser Pro Phe Asp Leu Thr Thr Lys Arg 
545                 550                 555                 560 


Lys Asp Glu Lys Pro Leu Thr Pro Val Pro Ser Lys Pro Pro Val Thr 
                565                 570                 575     


Pro Ala Thr Ser Gln Asp Gln Pro Leu Asp Leu Ser Met Gly Ser Arg 
            580                 585                 590         


Ser Arg Ala Ser Gly Thr Lys Leu Thr Glu Pro Arg Lys Asn His Val 
        595                 600                 605             


Phe Gly Gly Lys Lys Gly Ser Asn Val Glu Ser Arg Pro Ala Ser Asp 
    610                 615                 620                 


Gly Ser Leu Gln His Ala Arg Pro Thr Pro Phe Phe Met Asp Pro Ile 
625                 630                 635                 640 


Tyr Arg Val Glu Lys Arg Lys Leu Thr Asp Pro Leu Glu Ala Leu Lys 
                645                 650                 655     


Glu Lys Tyr Leu Arg Pro Ser Pro Gly Phe Leu Phe His Pro Gln Phe 
            660                 665                 670         


Gln Leu Pro Asp Gln Arg Thr Trp Met Ser Ala Ile Glu Asn Met Ala 
        675                 680                 685             


Glu Lys Leu Glu Ser Phe Ser Ala Leu Lys Pro Glu Ala Ser Glu Leu 
    690                 695                 700                 


Leu Gln Ser Val Pro Ser Met Phe Asn Phe Arg Ala Pro Pro Asn Ala 
705                 710                 715                 720 


Leu Pro Glu Asn Leu Leu Arg Lys Gly Lys Glu Arg Tyr Thr Cys Arg 
                725                 730                 735     


Tyr Cys Gly Lys Ile Phe Pro Arg Ser Ala Asn Leu Thr Arg His Leu 
            740                 745                 750         


Arg Thr His Thr Gly Glu Gln Pro Tyr Arg Cys Lys Tyr Cys Asp Arg 
        755                 760                 765             


Ser Phe Ser Ile Ser Ser Asn Leu Gln Arg His Val Arg Asn Ile His 
    770                 775                 780                 


Asn Lys Glu Lys Pro Phe Lys Cys His Leu Cys Asp Arg Cys Phe Gly 
785                 790                 795                 800 


Gln Gln Thr Asn Leu Asp Arg His Leu Lys Lys His Glu Asn Gly Asn 
                805                 810                 815     


Met Ser Gly Thr Ala Thr Ser Ser Pro His Ser Glu Leu Glu Ser Thr 
            820                 825                 830         


Gly Ala Ile Leu Asp Asp Lys Glu Asp Ala Tyr Phe Thr Glu Ile Arg 
        835                 840                 845             


Asn Phe Ile Gly Asn Ser Asn His Gly Ser Gln Ser Pro Arg Asn Val 
    850                 855                 860                 


Glu Glu Arg Met Asn Gly Ser His Phe Lys Asp Glu Lys Ala Leu Val 
865                 870                 875                 880 


Thr Ser Gln Asn Ser Asp Leu Leu Asp Asp Glu Glu Val Glu Asp Glu 
                885                 890                 895     


Val Leu Leu Asp Glu Glu Asp Glu Asp Asn Asp Ile Thr Gly Lys Thr 
            900                 905                 910         


Gly Lys Glu Pro Val Thr Ser Asn Leu His Glu Gly Asn Pro Glu Asp 
        915                 920                 925             


Asp Tyr Glu Glu Thr Ser Ala Leu Glu Met Ser Cys Lys Thr Ser Pro 
    930                 935                 940                 


Val Arg Tyr Lys Glu Glu Glu Tyr Lys Ser Gly Leu Ser Ala Leu Asp 
945                 950                 955                 960 


His Ile Arg His Phe Thr Asp Ser Leu Lys Met Arg Lys Met Glu Asp 
                965                 970                 975     


Asn Gln Tyr Ser Glu Ala Glu Leu Ser Ser Phe Ser Thr Ser His Val 
            980                 985                 990         


Pro Glu Glu Leu Lys Gln Pro Leu  His Arg Lys Ser Lys  Ser Gln Ala 
        995                 1000                 1005             


Tyr Ala  Met Met Leu Ser Leu  Ser Asp Lys Glu Ser  Leu His Ser 
    1010                 1015                 1020             


Thr Ser  His Ser Ser Ser Asn  Val Trp His Ser Met  Ala Arg Ala 
    1025                 1030                 1035             


Ala Ala  Glu Ser Ser Ala Ile  Gln Ser Ile Ser His  Val 
    1040                 1045                 1050     


<210>  134
<211>  4891
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens MDS1 and EVI1 complex locus (MECOM), transcript 
       variant 3, mRNA NCBI Reference Sequence: NM_001105078.3

<400>  134
agccttcttt cctcctcgcc cgcagtctcg cggagccctg ctgcttatct acgttgctaa       60

gccgggcgat ttccttgttc ctcctgcgaa acggtgcggt ctggacacgt ctccggggtg      120

ggtcgtccgg ccttcgatct tagacgaatt ttacaatgtg aagttctgca tagatgccag      180

tcaaccagat gttggaagct ggctcaagta cattagattc gctggctgtt atgatcagca      240

caaccttgtt gcatgccaga taaatgatca gatattctat agagtagttg cagacattgc      300

gccgggagag gagcttctgc tgttcatgaa gagcgaagac tatccccatg aaactatggc      360

gccggatatc cacgaagaac ggcaatatcg ctgcgaagac tgtgaccagc tctttgaatc      420

taaggctgaa ctagcagatc accaaaagtt tccatgcagt actcctcact cagcattttc      480

aatggttgaa gaggactttc agcaaaaact cgaaagcgag aatgatctcc aagagataca      540

cacgatccag gagtgtaagg aatgtgacca agtttttcct gatttgcaaa gcctggagaa      600

acacatgctg tcacatactg aagagaggga atacaagtgt gatcagtgtc ccaaggcatt      660

taactggaag tccaatttaa ttcgccacca gatgtcacat gacagtggaa agcactatga      720

atgtgaaaac tgtgccaagg ttttcacgga ccctagcaac cttcagcggc acattcgctc      780

tcagcatgtc ggtgcccggg cccatgcatg cccggagtgt ggcaaaacgt ttgccacttc      840

gtcgggcctc aaacaacaca agcacatcca cagcagtgtg aagcccttta tctgtgaggt      900

ctgccataaa tcctatactc agttttcaaa cctttgccgt cataagcgca tgcatgctga      960

ttgcagaacc caaatcaagt gcaaagactg tggacaaatg ttcagcacta cgtcttcctt     1020

aaataaacac aggaggtttt gtgagggcaa gaaccatttt gcggcaggtg gattttttgg     1080

ccaaggcatt tcacttcctg gaaccccagc tatggataaa acgtccatgg ttaatatgag     1140

tcatgccaac ccgggccttg ctgactattt tggcgccaat aggcatcctg ctggtcttac     1200

ctttccaaca gctcctggat tttcttttag cttccctggt ctgtttcctt ccggcttgta     1260

ccacaggcct cctttgatac ctgctagttc tcctgttaaa ggactatcaa gtactgaaca     1320

gacaaacaaa agtcaaagtc ccctcatgac acatcctcag atactgccag ctacacagga     1380

tattttgaag gcactatcta aacacccatc tgtaggggac aataagccag tggagctcca     1440

gcccgagagg tcctctgaag agaggccctt tgagaaaatc agtgaccagt cagagagtag     1500

tgaccttgat gatgtcagta caccaagtgg cagtgacctg gaaacaacct cgggctctga     1560

tctggaaagt gacattgaaa gtgataaaga gaaatttaaa gaaaatggta aaatgttcaa     1620

agacaaagta agccctcttc agaatctggc ttcaataaat aataagaaag aatacagcaa     1680

tcattccatt ttctcaccat ctttagagga gcagactgcg gtgtcaggag ctgtgaatga     1740

ttctataaag gctattgctt ctattgctga aaaatacttt ggttcaacag gactggtggg     1800

gctgcaagac aaaaaagttg gagctttacc ttacccttcc atgtttcccc tcccattttt     1860

tccagcattc tctcaatcaa tgtacccatt tcctgataga gacttgagat cgttaccttt     1920

gaaaatggaa ccccaatcac caggtgaagt aaagaaactg cagaagggca gctctgagtc     1980

cccctttgat ctcaccacta agcgaaagga tgagaagccc ttgactccag tcccctccaa     2040

gcctccagtg acacctgcca caagccaaga ccagcccctg gatctaagta tgggcagtag     2100

gagtagagcc agtgggacaa agctgactga gcctcgaaaa aaccacgtgt ttgggggaaa     2160

aaaaggaagc aacgtcgaat caagacctgc ttcagatggt tccttgcagc atgcaagacc     2220

cactcctttc tttatggacc ctatttacag agtagagaaa agaaaactaa ctgacccact     2280

tgaagcttta aaagagaaat acttgaggcc ttctccagga ttcttgtttc acccacaatt     2340

ccaactgcct gatcagagaa cttggatgtc agctattgaa aacatggcag aaaagctaga     2400

gagcttcagt gccctgaaac ctgaggccag tgagctctta cagtcagtgc cctctatgtt     2460

caacttcagg gcgcctccca atgccctgcc agagaacctt ctgcggaagg gaaaggagcg     2520

ctatacctgc agatactgtg gcaagatttt tccaaggtct gcaaacctaa cacggcactt     2580

gagaacccac acaggagagc agccttacag atgcaaatac tgtgacagat catttagcat     2640

atcttctaac ttgcaaaggc atgttcgcaa catccacaat aaagagaagc catttaagtg     2700

tcacttatgt gataggtgtt ttggtcaaca aaccaattta gacagacacc taaagaaaca     2760

tgagaatggg aacatgtccg gtacagcaac atcgtcgcct cattctgaac tggaaagtac     2820

aggtgcgatt ctggatgaca aagaagatgc ttacttcaca gaaattcgaa atttcattgg     2880

gaacagcaac catggcagcc aatctcccag gaatgtggag gagagaatga atggcagtca     2940

ttttaaagat gaaaaggctt tggtgaccag tcaaaattca gacttgctgg atgatgaaga     3000

agttgaagat gaggtgttgt tagatgagga ggatgaagac aatgatatta ctggaaaaac     3060

aggaaaggaa ccagtgacaa gtaatttaca tgaaggaaac cctgaggatg actatgaaga     3120

aaccagtgcc ctggagatga gttgcaagac atccccagtg aggtataaag aggaagaata     3180

taaaagtgga ctttctgctc tagatcatat aaggcacttc acagatagcc tcaaaatgag     3240

gaaaatggaa gataatcaat attctgaagc tgagctgtct tcttttagta cttcccatgt     3300

gccagaggaa cttaagcagc cgttacacag aaagtccaaa tcgcaggcat atgctatgat     3360

gctgtcactg tctgacaagg agtccctcca ttctacatcc cacagttctt ccaacgtgtg     3420

gcacagtatg gccagggctg cggcggaatc cagtgctatc cagtccataa gccacgtatg     3480

acgttatcaa ggttgaccag agtgggacca agtccaacag tagcatggct ctttcatata     3540

ggactattta caagactgct gagcagaatg ccttataaac ctgcagggtc actcatctaa     3600

agtctagtga ccttaaactg aatgatttaa aaaagaaaag aaagaaaaaa gaaactattt     3660

attctcgata ttttgttttg cacagcaaag gcagctgctg acttctggaa gatcaatcaa     3720

tgcgacttaa agtgattcag tgaaaacaaa aaacttggtg ggctgaaggc atcttccagt     3780

ttaccccacc ttagggtatg ggtgggtgag aagggcagtt gagatggcag cattgatatg     3840

aatgaacact ccatagaaac tgaattctct tttgtacaag atcacctgac atgattggga     3900

acagttgctt ttaattacag atttaatttt tttcttcgtt aaagttttat gtaatttaac     3960

cctttgaaga cagaagtagt tggatgaaat gcacagtcaa ttattataga aactgataac     4020

agggagtact tgttccccct tttgccttct taagtacatt gtttaaaact agggaaaaag     4080

ggtatgtgta tattgtaaac tatggatgtt aacactcaaa gaggttaagt cagtgaagta     4140

acctattcat caccagtacc gctgtaccac taataaattg tttgccaaat ccttgtaata     4200

acatcttaat tttagacaat catgtcactg tttttaatgt ttattttttt gtgtgtgttg     4260

cgtgtatcat gtatttattt gttggcaaac tattgtttgt tgattaaaat agcactgttc     4320

cagtcagcca ctactttatg acgtctgagg cacacccctt tccgaatttc aaggaccaag     4380

gtgacccgac ctgtgtatga gagtgccaaa tggtgtttgg cttttcttaa cattcctttt     4440

tgtttgtttg ttttgttttc cttcttaatg aactaaatac gaatagatgc aacttagttt     4500

ttgtaatact gaaatcgatt caattgtata aacgattata atttctttca tggaagcatg     4560

attcttctga ttaaaaactg tactccatat tttatgctgg ttgtctgcaa gcttgtgcga     4620

tgttatgttc atgttaatcc tatttgtaaa atgaagtgtt cccaacctta tgttaaaaga     4680

gagaagtaaa taacagactg tattcagtta ttttgccctt tattgaggaa ccagatttgt     4740

tttctttttg tttgtaatct cattttgaaa taatcagcaa gttgaggtac tttcttcaaa     4800

tgctttgtac aatataaact gttatgcctt tcagtgcatt actatgggag gagcaactaa     4860

aaaataaaga cttacaaaaa ggagtatttt t                                    4891


<210>  135
<211>  1051
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens MDS1 and EVI1 complex locus (MECOM), transcript 
       variant 3, polypeptide NCBI Reference Sequence: NM_001105078.3

<400>  135

Met Lys Ser Glu Asp Tyr Pro His Glu Thr Met Ala Pro Asp Ile His 
1               5                   10                  15      


Glu Glu Arg Gln Tyr Arg Cys Glu Asp Cys Asp Gln Leu Phe Glu Ser 
            20                  25                  30          


Lys Ala Glu Leu Ala Asp His Gln Lys Phe Pro Cys Ser Thr Pro His 
        35                  40                  45              


Ser Ala Phe Ser Met Val Glu Glu Asp Phe Gln Gln Lys Leu Glu Ser 
    50                  55                  60                  


Glu Asn Asp Leu Gln Glu Ile His Thr Ile Gln Glu Cys Lys Glu Cys 
65                  70                  75                  80  


Asp Gln Val Phe Pro Asp Leu Gln Ser Leu Glu Lys His Met Leu Ser 
                85                  90                  95      


His Thr Glu Glu Arg Glu Tyr Lys Cys Asp Gln Cys Pro Lys Ala Phe 
            100                 105                 110         


Asn Trp Lys Ser Asn Leu Ile Arg His Gln Met Ser His Asp Ser Gly 
        115                 120                 125             


Lys His Tyr Glu Cys Glu Asn Cys Ala Lys Val Phe Thr Asp Pro Ser 
    130                 135                 140                 


Asn Leu Gln Arg His Ile Arg Ser Gln His Val Gly Ala Arg Ala His 
145                 150                 155                 160 


Ala Cys Pro Glu Cys Gly Lys Thr Phe Ala Thr Ser Ser Gly Leu Lys 
                165                 170                 175     


Gln His Lys His Ile His Ser Ser Val Lys Pro Phe Ile Cys Glu Val 
            180                 185                 190         


Cys His Lys Ser Tyr Thr Gln Phe Ser Asn Leu Cys Arg His Lys Arg 
        195                 200                 205             


Met His Ala Asp Cys Arg Thr Gln Ile Lys Cys Lys Asp Cys Gly Gln 
    210                 215                 220                 


Met Phe Ser Thr Thr Ser Ser Leu Asn Lys His Arg Arg Phe Cys Glu 
225                 230                 235                 240 


Gly Lys Asn His Phe Ala Ala Gly Gly Phe Phe Gly Gln Gly Ile Ser 
                245                 250                 255     


Leu Pro Gly Thr Pro Ala Met Asp Lys Thr Ser Met Val Asn Met Ser 
            260                 265                 270         


His Ala Asn Pro Gly Leu Ala Asp Tyr Phe Gly Ala Asn Arg His Pro 
        275                 280                 285             


Ala Gly Leu Thr Phe Pro Thr Ala Pro Gly Phe Ser Phe Ser Phe Pro 
    290                 295                 300                 


Gly Leu Phe Pro Ser Gly Leu Tyr His Arg Pro Pro Leu Ile Pro Ala 
305                 310                 315                 320 


Ser Ser Pro Val Lys Gly Leu Ser Ser Thr Glu Gln Thr Asn Lys Ser 
                325                 330                 335     


Gln Ser Pro Leu Met Thr His Pro Gln Ile Leu Pro Ala Thr Gln Asp 
            340                 345                 350         


Ile Leu Lys Ala Leu Ser Lys His Pro Ser Val Gly Asp Asn Lys Pro 
        355                 360                 365             


Val Glu Leu Gln Pro Glu Arg Ser Ser Glu Glu Arg Pro Phe Glu Lys 
    370                 375                 380                 


Ile Ser Asp Gln Ser Glu Ser Ser Asp Leu Asp Asp Val Ser Thr Pro 
385                 390                 395                 400 


Ser Gly Ser Asp Leu Glu Thr Thr Ser Gly Ser Asp Leu Glu Ser Asp 
                405                 410                 415     


Ile Glu Ser Asp Lys Glu Lys Phe Lys Glu Asn Gly Lys Met Phe Lys 
            420                 425                 430         


Asp Lys Val Ser Pro Leu Gln Asn Leu Ala Ser Ile Asn Asn Lys Lys 
        435                 440                 445             


Glu Tyr Ser Asn His Ser Ile Phe Ser Pro Ser Leu Glu Glu Gln Thr 
    450                 455                 460                 


Ala Val Ser Gly Ala Val Asn Asp Ser Ile Lys Ala Ile Ala Ser Ile 
465                 470                 475                 480 


Ala Glu Lys Tyr Phe Gly Ser Thr Gly Leu Val Gly Leu Gln Asp Lys 
                485                 490                 495     


Lys Val Gly Ala Leu Pro Tyr Pro Ser Met Phe Pro Leu Pro Phe Phe 
            500                 505                 510         


Pro Ala Phe Ser Gln Ser Met Tyr Pro Phe Pro Asp Arg Asp Leu Arg 
        515                 520                 525             


Ser Leu Pro Leu Lys Met Glu Pro Gln Ser Pro Gly Glu Val Lys Lys 
    530                 535                 540                 


Leu Gln Lys Gly Ser Ser Glu Ser Pro Phe Asp Leu Thr Thr Lys Arg 
545                 550                 555                 560 


Lys Asp Glu Lys Pro Leu Thr Pro Val Pro Ser Lys Pro Pro Val Thr 
                565                 570                 575     


Pro Ala Thr Ser Gln Asp Gln Pro Leu Asp Leu Ser Met Gly Ser Arg 
            580                 585                 590         


Ser Arg Ala Ser Gly Thr Lys Leu Thr Glu Pro Arg Lys Asn His Val 
        595                 600                 605             


Phe Gly Gly Lys Lys Gly Ser Asn Val Glu Ser Arg Pro Ala Ser Asp 
    610                 615                 620                 


Gly Ser Leu Gln His Ala Arg Pro Thr Pro Phe Phe Met Asp Pro Ile 
625                 630                 635                 640 


Tyr Arg Val Glu Lys Arg Lys Leu Thr Asp Pro Leu Glu Ala Leu Lys 
                645                 650                 655     


Glu Lys Tyr Leu Arg Pro Ser Pro Gly Phe Leu Phe His Pro Gln Phe 
            660                 665                 670         


Gln Leu Pro Asp Gln Arg Thr Trp Met Ser Ala Ile Glu Asn Met Ala 
        675                 680                 685             


Glu Lys Leu Glu Ser Phe Ser Ala Leu Lys Pro Glu Ala Ser Glu Leu 
    690                 695                 700                 


Leu Gln Ser Val Pro Ser Met Phe Asn Phe Arg Ala Pro Pro Asn Ala 
705                 710                 715                 720 


Leu Pro Glu Asn Leu Leu Arg Lys Gly Lys Glu Arg Tyr Thr Cys Arg 
                725                 730                 735     


Tyr Cys Gly Lys Ile Phe Pro Arg Ser Ala Asn Leu Thr Arg His Leu 
            740                 745                 750         


Arg Thr His Thr Gly Glu Gln Pro Tyr Arg Cys Lys Tyr Cys Asp Arg 
        755                 760                 765             


Ser Phe Ser Ile Ser Ser Asn Leu Gln Arg His Val Arg Asn Ile His 
    770                 775                 780                 


Asn Lys Glu Lys Pro Phe Lys Cys His Leu Cys Asp Arg Cys Phe Gly 
785                 790                 795                 800 


Gln Gln Thr Asn Leu Asp Arg His Leu Lys Lys His Glu Asn Gly Asn 
                805                 810                 815     


Met Ser Gly Thr Ala Thr Ser Ser Pro His Ser Glu Leu Glu Ser Thr 
            820                 825                 830         


Gly Ala Ile Leu Asp Asp Lys Glu Asp Ala Tyr Phe Thr Glu Ile Arg 
        835                 840                 845             


Asn Phe Ile Gly Asn Ser Asn His Gly Ser Gln Ser Pro Arg Asn Val 
    850                 855                 860                 


Glu Glu Arg Met Asn Gly Ser His Phe Lys Asp Glu Lys Ala Leu Val 
865                 870                 875                 880 


Thr Ser Gln Asn Ser Asp Leu Leu Asp Asp Glu Glu Val Glu Asp Glu 
                885                 890                 895     


Val Leu Leu Asp Glu Glu Asp Glu Asp Asn Asp Ile Thr Gly Lys Thr 
            900                 905                 910         


Gly Lys Glu Pro Val Thr Ser Asn Leu His Glu Gly Asn Pro Glu Asp 
        915                 920                 925             


Asp Tyr Glu Glu Thr Ser Ala Leu Glu Met Ser Cys Lys Thr Ser Pro 
    930                 935                 940                 


Val Arg Tyr Lys Glu Glu Glu Tyr Lys Ser Gly Leu Ser Ala Leu Asp 
945                 950                 955                 960 


His Ile Arg His Phe Thr Asp Ser Leu Lys Met Arg Lys Met Glu Asp 
                965                 970                 975     


Asn Gln Tyr Ser Glu Ala Glu Leu Ser Ser Phe Ser Thr Ser His Val 
            980                 985                 990         


Pro Glu Glu Leu Lys Gln Pro Leu  His Arg Lys Ser Lys  Ser Gln Ala 
        995                 1000                 1005             


Tyr Ala  Met Met Leu Ser Leu  Ser Asp Lys Glu Ser  Leu His Ser 
    1010                 1015                 1020             


Thr Ser  His Ser Ser Ser Asn  Val Trp His Ser Met  Ala Arg Ala 
    1025                 1030                 1035             


Ala Ala  Glu Ser Ser Ala Ile  Gln Ser Ile Ser His  Val 
    1040                 1045                 1050     


<210>  136
<211>  5533
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens MDS1 and EVI1 complex locus (MECOM), transcript 
       variant 4, mRNA NCBI Reference Sequence: NM_004991.3

<400>  136
gattgccatc tgacaagatc tccaaatcaa agtgataaat cgctccaaac tttttttggc       60

ggcgctgaga tgttggaggg gcgtctagcg cgcatgtgcg aaggtgtcca aactgacaat      120

gctggagaga tagcgagtgt ggattgagag aaagggagag agggagggag agagagtgaa      180

agaagaaaat acagagagtg agtgtgtgga agagagagag aaacaggaga gaaacaggag      240

ggagggagag agagagagag agagagagag agagagagag agagagagag agagagagag      300

acaggagaga gagggaggga gcgagaggga gagcaaaaga aggaaaggat ccaagaaaaa      360

aaagccccaa ccacacacca gcggctgcag gactgggcac agcatgagat ccaaaggcag      420

ggcaaggaaa ctggccacaa ataatgagtg tgtatatggc aactaccctg aaataccttt      480

ggaagaaatg ccagatgcag atggagtagc cagcactccc tccctcaata ttcaagagcc      540

atgctctcct gccacatcca gtgaagcatt cactccaaag gagggttctc cttacaaagc      600

ccccatctac atccctgatg atatccccat tcctgctgag tttgaacttc gagagtcaaa      660

tatgcctggg gcaggactag gaatatggac caaaaggaag atcgaagtag gtgaaaagtt      720

tgggccttat gtgggagagc agaggtcaaa cctgaaagac cccagttatg gatgggagat      780

cttagacgaa ttttacaatg tgaagttctg catagatgcc agtcaaccag atgttggaag      840

ctggctcaag tacattagat tcgctggctg ttatgatcag cacaaccttg ttgcatgcca      900

gataaatgat cagatattct atagagtagt tgcagacatt gcgccgggag aggagcttct      960

gctgttcatg aagagcgaag actatcccca tgaaactatg gcgccggata tccacgaaga     1020

acggcaatat cgctgcgaag actgtgacca gctctttgaa tctaaggctg aactagcaga     1080

tcaccaaaag tttccatgca gtactcctca ctcagcattt tcaatggttg aagaggactt     1140

tcagcaaaaa ctcgaaagcg agaatgatct ccaagagata cacacgatcc aggagtgtaa     1200

ggaatgtgac caagtttttc ctgatttgca aagcctggag aaacacatgc tgtcacatac     1260

tgaagagagg gaatacaagt gtgatcagtg tcccaaggca tttaactgga agtccaattt     1320

aattcgccac cagatgtcac atgacagtgg aaagcactat gaatgtgaaa actgtgccaa     1380

ggttttcacg gaccctagca accttcagcg gcacattcgc tctcagcatg tcggtgcccg     1440

ggcccatgca tgcccggagt gtggcaaaac gtttgccact tcgtcgggcc tcaaacaaca     1500

caagcacatc cacagcagtg tgaagccctt tatctgtgag gtctgccata aatcctatac     1560

tcagttttca aacctttgcc gtcataagcg catgcatgct gattgcagaa cccaaatcaa     1620

gtgcaaagac tgtggacaaa tgttcagcac tacgtcttcc ttaaataaac acaggaggtt     1680

ttgtgagggc aagaaccatt ttgcggcagg tggatttttt ggccaaggca tttcacttcc     1740

tggaacccca gctatggata aaacgtccat ggttaatatg agtcatgcca acccgggcct     1800

tgctgactat tttggcgcca ataggcatcc tgctggtctt acctttccaa cagctcctgg     1860

attttctttt agcttccctg gtctgtttcc ttccggcttg taccacaggc ctcctttgat     1920

acctgctagt tctcctgtta aaggactatc aagtactgaa cagacaaaca aaagtcaaag     1980

tcccctcatg acacatcctc agatactgcc agctacacag gatattttga aggcactatc     2040

taaacaccca tctgtagggg acaataagcc agtggagctc cagcccgaga ggtcctctga     2100

agagaggccc tttgagaaaa tcagtgacca gtcagagagt agtgaccttg atgatgtcag     2160

tacaccaagt ggcagtgacc tggaaacaac ctcgggctct gatctggaaa gtgacattga     2220

aagtgataaa gagaaattta aagaaaatgg taaaatgttc aaagacaaag taagccctct     2280

tcagaatctg gcttcaataa ataataagaa agaatacagc aatcattcca ttttctcacc     2340

atctttagag gagcagactg cggtgtcagg agctgtgaat gattctataa aggctattgc     2400

ttctattgct gaaaaatact ttggttcaac aggactggtg gggctgcaag acaaaaaagt     2460

tggagcttta ccttaccctt ccatgtttcc cctcccattt tttccagcat tctctcaatc     2520

aatgtaccca tttcctgata gagacttgag atcgttacct ttgaaaatgg aaccccaatc     2580

accaggtgaa gtaaagaaac tgcagaaggg cagctctgag tccccctttg atctcaccac     2640

taagcgaaag gatgagaagc ccttgactcc agtcccctcc aagcctccag tgacacctgc     2700

cacaagccaa gaccagcccc tggatctaag tatgggcagt aggagtagag ccagtgggac     2760

aaagctgact gagcctcgaa aaaaccacgt gtttggggga aaaaaaggaa gcaacgtcga     2820

atcaagacct gcttcagatg gttccttgca gcatgcaaga cccactcctt tctttatgga     2880

ccctatttac agagtagaga aaagaaaact aactgaccca cttgaagctt taaaagagaa     2940

atacttgagg ccttctccag gattcttgtt tcacccacaa ttccaactgc ctgatcagag     3000

aacttggatg tcagctattg aaaacatggc agaaaagcta gagagcttca gtgccctgaa     3060

acctgaggcc agtgagctct tacagtcagt gccctctatg ttcaacttca gggcgcctcc     3120

caatgccctg ccagagaacc ttctgcggaa gggaaaggag cgctatacct gcagatactg     3180

tggcaagatt tttccaaggt ctgcaaacct aacacggcac ttgagaaccc acacaggaga     3240

gcagccttac agatgcaaat actgtgacag atcatttagc atatcttcta acttgcaaag     3300

gcatgttcgc aacatccaca ataaagagaa gccatttaag tgtcacttat gtgataggtg     3360

ttttggtcaa caaaccaatt tagacagaca cctaaagaaa catgagaatg ggaacatgtc     3420

cggtacagca acatcgtcgc ctcattctga actggaaagt acaggtgcga ttctggatga     3480

caaagaagat gcttacttca cagaaattcg aaatttcatt gggaacagca accatggcag     3540

ccaatctccc aggaatgtgg aggagagaat gaatggcagt cattttaaag atgaaaaggc     3600

tttggtgacc agtcaaaatt cagacttgct ggatgatgaa gaagttgaag atgaggtgtt     3660

gttagatgag gaggatgaag acaatgatat tactggaaaa acaggaaagg aaccagtgac     3720

aagtaattta catgaaggaa accctgagga tgactatgaa gaaaccagtg ccctggagat     3780

gagttgcaag acatccccag tgaggtataa agaggaagaa tataaaagtg gactttctgc     3840

tctagatcat ataaggcact tcacagatag cctcaaaatg aggaaaatgg aagataatca     3900

atattctgaa gctgagctgt cttcttttag tacttcccat gtgccagagg aacttaagca     3960

gccgttacac agaaagtcca aatcgcaggc atatgctatg atgctgtcac tgtctgacaa     4020

ggagtccctc cattctacat cccacagttc ttccaacgtg tggcacagta tggccagggc     4080

tgcggcggaa tccagtgcta tccagtccat aagccacgta tgacgttatc aaggttgacc     4140

agagtgggac caagtccaac agtagcatgg ctctttcata taggactatt tacaagactg     4200

ctgagcagaa tgccttataa acctgcaggg tcactcatct aaagtctagt gaccttaaac     4260

tgaatgattt aaaaaagaaa agaaagaaaa aagaaactat ttattctcga tattttgttt     4320

tgcacagcaa aggcagctgc tgacttctgg aagatcaatc aatgcgactt aaagtgattc     4380

agtgaaaaca aaaaacttgg tgggctgaag gcatcttcca gtttacccca ccttagggta     4440

tgggtgggtg agaagggcag ttgagatggc agcattgata tgaatgaaca ctccatagaa     4500

actgaattct cttttgtaca agatcacctg acatgattgg gaacagttgc ttttaattac     4560

agatttaatt tttttcttcg ttaaagtttt atgtaattta accctttgaa gacagaagta     4620

gttggatgaa atgcacagtc aattattata gaaactgata acagggagta cttgttcccc     4680

cttttgcctt cttaagtaca ttgtttaaaa ctagggaaaa agggtatgtg tatattgtaa     4740

actatggatg ttaacactca aagaggttaa gtcagtgaag taacctattc atcaccagta     4800

ccgctgtacc actaataaat tgtttgccaa atccttgtaa taacatctta attttagaca     4860

atcatgtcac tgtttttaat gtttattttt ttgtgtgtgt tgcgtgtatc atgtatttat     4920

ttgttggcaa actattgttt gttgattaaa atagcactgt tccagtcagc cactacttta     4980

tgacgtctga ggcacacccc tttccgaatt tcaaggacca aggtgacccg acctgtgtat     5040

gagagtgcca aatggtgttt ggcttttctt aacattcctt tttgtttgtt tgttttgttt     5100

tccttcttaa tgaactaaat acgaatagat gcaacttagt ttttgtaata ctgaaatcga     5160

ttcaattgta taaacgatta taatttcttt catggaagca tgattcttct gattaaaaac     5220

tgtactccat attttatgct ggttgtctgc aagcttgtgc gatgttatgt tcatgttaat     5280

cctatttgta aaatgaagtg ttcccaacct tatgttaaaa gagagaagta aataacagac     5340

tgtattcagt tattttgccc tttattgagg aaccagattt gttttctttt tgtttgtaat     5400

ctcattttga aataatcagc aagttgaggt actttcttca aatgctttgt acaatataaa     5460

ctgttatgcc tttcagtgca ttactatggg aggagcaact aaaaaataaa gacttacaaa     5520

aaggagtatt ttt                                                        5533


<210>  137
<211>  1239
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens MDS1 and EVI1 complex locus (MECOM), transcript 
       variant 4, polypeptide NCBI Reference Sequence: NM_004991.3

<400>  137

Met Arg Ser Lys Gly Arg Ala Arg Lys Leu Ala Thr Asn Asn Glu Cys 
1               5                   10                  15      


Val Tyr Gly Asn Tyr Pro Glu Ile Pro Leu Glu Glu Met Pro Asp Ala 
            20                  25                  30          


Asp Gly Val Ala Ser Thr Pro Ser Leu Asn Ile Gln Glu Pro Cys Ser 
        35                  40                  45              


Pro Ala Thr Ser Ser Glu Ala Phe Thr Pro Lys Glu Gly Ser Pro Tyr 
    50                  55                  60                  


Lys Ala Pro Ile Tyr Ile Pro Asp Asp Ile Pro Ile Pro Ala Glu Phe 
65                  70                  75                  80  


Glu Leu Arg Glu Ser Asn Met Pro Gly Ala Gly Leu Gly Ile Trp Thr 
                85                  90                  95      


Lys Arg Lys Ile Glu Val Gly Glu Lys Phe Gly Pro Tyr Val Gly Glu 
            100                 105                 110         


Gln Arg Ser Asn Leu Lys Asp Pro Ser Tyr Gly Trp Glu Ile Leu Asp 
        115                 120                 125             


Glu Phe Tyr Asn Val Lys Phe Cys Ile Asp Ala Ser Gln Pro Asp Val 
    130                 135                 140                 


Gly Ser Trp Leu Lys Tyr Ile Arg Phe Ala Gly Cys Tyr Asp Gln His 
145                 150                 155                 160 


Asn Leu Val Ala Cys Gln Ile Asn Asp Gln Ile Phe Tyr Arg Val Val 
                165                 170                 175     


Ala Asp Ile Ala Pro Gly Glu Glu Leu Leu Leu Phe Met Lys Ser Glu 
            180                 185                 190         


Asp Tyr Pro His Glu Thr Met Ala Pro Asp Ile His Glu Glu Arg Gln 
        195                 200                 205             


Tyr Arg Cys Glu Asp Cys Asp Gln Leu Phe Glu Ser Lys Ala Glu Leu 
    210                 215                 220                 


Ala Asp His Gln Lys Phe Pro Cys Ser Thr Pro His Ser Ala Phe Ser 
225                 230                 235                 240 


Met Val Glu Glu Asp Phe Gln Gln Lys Leu Glu Ser Glu Asn Asp Leu 
                245                 250                 255     


Gln Glu Ile His Thr Ile Gln Glu Cys Lys Glu Cys Asp Gln Val Phe 
            260                 265                 270         


Pro Asp Leu Gln Ser Leu Glu Lys His Met Leu Ser His Thr Glu Glu 
        275                 280                 285             


Arg Glu Tyr Lys Cys Asp Gln Cys Pro Lys Ala Phe Asn Trp Lys Ser 
    290                 295                 300                 


Asn Leu Ile Arg His Gln Met Ser His Asp Ser Gly Lys His Tyr Glu 
305                 310                 315                 320 


Cys Glu Asn Cys Ala Lys Val Phe Thr Asp Pro Ser Asn Leu Gln Arg 
                325                 330                 335     


His Ile Arg Ser Gln His Val Gly Ala Arg Ala His Ala Cys Pro Glu 
            340                 345                 350         


Cys Gly Lys Thr Phe Ala Thr Ser Ser Gly Leu Lys Gln His Lys His 
        355                 360                 365             


Ile His Ser Ser Val Lys Pro Phe Ile Cys Glu Val Cys His Lys Ser 
    370                 375                 380                 


Tyr Thr Gln Phe Ser Asn Leu Cys Arg His Lys Arg Met His Ala Asp 
385                 390                 395                 400 


Cys Arg Thr Gln Ile Lys Cys Lys Asp Cys Gly Gln Met Phe Ser Thr 
                405                 410                 415     


Thr Ser Ser Leu Asn Lys His Arg Arg Phe Cys Glu Gly Lys Asn His 
            420                 425                 430         


Phe Ala Ala Gly Gly Phe Phe Gly Gln Gly Ile Ser Leu Pro Gly Thr 
        435                 440                 445             


Pro Ala Met Asp Lys Thr Ser Met Val Asn Met Ser His Ala Asn Pro 
    450                 455                 460                 


Gly Leu Ala Asp Tyr Phe Gly Ala Asn Arg His Pro Ala Gly Leu Thr 
465                 470                 475                 480 


Phe Pro Thr Ala Pro Gly Phe Ser Phe Ser Phe Pro Gly Leu Phe Pro 
                485                 490                 495     


Ser Gly Leu Tyr His Arg Pro Pro Leu Ile Pro Ala Ser Ser Pro Val 
            500                 505                 510         


Lys Gly Leu Ser Ser Thr Glu Gln Thr Asn Lys Ser Gln Ser Pro Leu 
        515                 520                 525             


Met Thr His Pro Gln Ile Leu Pro Ala Thr Gln Asp Ile Leu Lys Ala 
    530                 535                 540                 


Leu Ser Lys His Pro Ser Val Gly Asp Asn Lys Pro Val Glu Leu Gln 
545                 550                 555                 560 


Pro Glu Arg Ser Ser Glu Glu Arg Pro Phe Glu Lys Ile Ser Asp Gln 
                565                 570                 575     


Ser Glu Ser Ser Asp Leu Asp Asp Val Ser Thr Pro Ser Gly Ser Asp 
            580                 585                 590         


Leu Glu Thr Thr Ser Gly Ser Asp Leu Glu Ser Asp Ile Glu Ser Asp 
        595                 600                 605             


Lys Glu Lys Phe Lys Glu Asn Gly Lys Met Phe Lys Asp Lys Val Ser 
    610                 615                 620                 


Pro Leu Gln Asn Leu Ala Ser Ile Asn Asn Lys Lys Glu Tyr Ser Asn 
625                 630                 635                 640 


His Ser Ile Phe Ser Pro Ser Leu Glu Glu Gln Thr Ala Val Ser Gly 
                645                 650                 655     


Ala Val Asn Asp Ser Ile Lys Ala Ile Ala Ser Ile Ala Glu Lys Tyr 
            660                 665                 670         


Phe Gly Ser Thr Gly Leu Val Gly Leu Gln Asp Lys Lys Val Gly Ala 
        675                 680                 685             


Leu Pro Tyr Pro Ser Met Phe Pro Leu Pro Phe Phe Pro Ala Phe Ser 
    690                 695                 700                 


Gln Ser Met Tyr Pro Phe Pro Asp Arg Asp Leu Arg Ser Leu Pro Leu 
705                 710                 715                 720 


Lys Met Glu Pro Gln Ser Pro Gly Glu Val Lys Lys Leu Gln Lys Gly 
                725                 730                 735     


Ser Ser Glu Ser Pro Phe Asp Leu Thr Thr Lys Arg Lys Asp Glu Lys 
            740                 745                 750         


Pro Leu Thr Pro Val Pro Ser Lys Pro Pro Val Thr Pro Ala Thr Ser 
        755                 760                 765             


Gln Asp Gln Pro Leu Asp Leu Ser Met Gly Ser Arg Ser Arg Ala Ser 
    770                 775                 780                 


Gly Thr Lys Leu Thr Glu Pro Arg Lys Asn His Val Phe Gly Gly Lys 
785                 790                 795                 800 


Lys Gly Ser Asn Val Glu Ser Arg Pro Ala Ser Asp Gly Ser Leu Gln 
                805                 810                 815     


His Ala Arg Pro Thr Pro Phe Phe Met Asp Pro Ile Tyr Arg Val Glu 
            820                 825                 830         


Lys Arg Lys Leu Thr Asp Pro Leu Glu Ala Leu Lys Glu Lys Tyr Leu 
        835                 840                 845             


Arg Pro Ser Pro Gly Phe Leu Phe His Pro Gln Phe Gln Leu Pro Asp 
    850                 855                 860                 


Gln Arg Thr Trp Met Ser Ala Ile Glu Asn Met Ala Glu Lys Leu Glu 
865                 870                 875                 880 


Ser Phe Ser Ala Leu Lys Pro Glu Ala Ser Glu Leu Leu Gln Ser Val 
                885                 890                 895     


Pro Ser Met Phe Asn Phe Arg Ala Pro Pro Asn Ala Leu Pro Glu Asn 
            900                 905                 910         


Leu Leu Arg Lys Gly Lys Glu Arg Tyr Thr Cys Arg Tyr Cys Gly Lys 
        915                 920                 925             


Ile Phe Pro Arg Ser Ala Asn Leu Thr Arg His Leu Arg Thr His Thr 
    930                 935                 940                 


Gly Glu Gln Pro Tyr Arg Cys Lys Tyr Cys Asp Arg Ser Phe Ser Ile 
945                 950                 955                 960 


Ser Ser Asn Leu Gln Arg His Val Arg Asn Ile His Asn Lys Glu Lys 
                965                 970                 975     


Pro Phe Lys Cys His Leu Cys Asp Arg Cys Phe Gly Gln Gln Thr Asn 
            980                 985                 990         


Leu Asp Arg His Leu Lys Lys His  Glu Asn Gly Asn Met  Ser Gly Thr 
        995                 1000                 1005             


Ala Thr  Ser Ser Pro His Ser  Glu Leu Glu Ser Thr  Gly Ala Ile 
    1010                 1015                 1020             


Leu Asp  Asp Lys Glu Asp Ala  Tyr Phe Thr Glu Ile  Arg Asn Phe 
    1025                 1030                 1035             


Ile Gly  Asn Ser Asn His Gly  Ser Gln Ser Pro Arg  Asn Val Glu 
    1040                 1045                 1050             


Glu Arg  Met Asn Gly Ser His  Phe Lys Asp Glu Lys  Ala Leu Val 
    1055                 1060                 1065             


Thr Ser  Gln Asn Ser Asp Leu  Leu Asp Asp Glu Glu  Val Glu Asp 
    1070                 1075                 1080             


Glu Val  Leu Leu Asp Glu Glu  Asp Glu Asp Asn Asp  Ile Thr Gly 
    1085                 1090                 1095             


Lys Thr  Gly Lys Glu Pro Val  Thr Ser Asn Leu His  Glu Gly Asn 
    1100                 1105                 1110             


Pro Glu  Asp Asp Tyr Glu Glu  Thr Ser Ala Leu Glu  Met Ser Cys 
    1115                 1120                 1125             


Lys Thr  Ser Pro Val Arg Tyr  Lys Glu Glu Glu Tyr  Lys Ser Gly 
    1130                 1135                 1140             


Leu Ser  Ala Leu Asp His Ile  Arg His Phe Thr Asp  Ser Leu Lys 
    1145                 1150                 1155             


Met Arg  Lys Met Glu Asp Asn  Gln Tyr Ser Glu Ala  Glu Leu Ser 
    1160                 1165                 1170             


Ser Phe  Ser Thr Ser His Val  Pro Glu Glu Leu Lys  Gln Pro Leu 
    1175                 1180                 1185             


His Arg  Lys Ser Lys Ser Gln  Ala Tyr Ala Met Met  Leu Ser Leu 
    1190                 1195                 1200             


Ser Asp  Lys Glu Ser Leu His  Ser Thr Ser His Ser  Ser Ser Asn 
    1205                 1210                 1215             


Val Trp  His Ser Met Ala Arg  Ala Ala Ala Glu Ser  Ser Ala Ile 
    1220                 1225                 1230             


Gln Ser  Ile Ser His Val 
    1235                 


<210>  138
<211>  4811
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens MDS1 and EVI1 complex locus (MECOM), transcript 
       variant 5, mRNA NCBI Reference Sequence: NM_001163999.1

<400>  138
ccttgccaag taacagcttt gctgtccaac atcgtgtgct gcttcgcgag aaagtcacat       60

tcggaccctt tggctagatt atcttagacg aattttacaa tgtgaagttc tgcatagatg      120

ccagtcaacc agatgttgga agctggctca agtacattag attcgctggc tgttatgatc      180

agcacaacct tgttgcatgc cagataaatg atcagatatt ctatagagta gttgcagaca      240

ttgcgccggg agaggagctt ctgctgttca tgaagagcga agactatccc catgaaacta      300

tggcgccgga tatccacgaa gaacggcaat atcgctgcga agactgtgac cagctctttg      360

aatctaaggc tgaactagca gatcaccaaa agtttccatg cagtactcct cactcagcat      420

tttcaatggt tgaagaggac tttcagcaaa aactcgaaag cgagaatgat ctccaagaga      480

tacacacgat ccaggagtgt aaggaatgtg accaagtttt tcctgatttg caaagcctgg      540

agaaacacat gctgtcacat actgaagaga gggaatacaa gtgtgatcag tgtcccaagg      600

catttaactg gaagtccaat ttaattcgcc accagatgtc acatgacagt ggaaagcact      660

atgaatgtga aaactgtgcc aagcaggttt tcacggaccc tagcaacctt cagcggcaca      720

ttcgctctca gcatgtcggt gcccgggccc atgcatgccc ggagtgtggc aaaacgtttg      780

ccacttcgtc gggcctcaaa caacacaagc acatccacag cagtgtgaag ccctttatct      840

gtgaggtctg ccataaatcc tatactcagt tttcaaacct ttgccgtcat aagcgcatgc      900

atgctgattg cagaacccaa atcaagtgca aagactgtgg acaaatgttc agcactacgt      960

cttccttaaa taaacacagg aggttttgtg agggcaagaa ccattttgcg gcaggtggat     1020

tttttggcca aggcatttca cttcctggaa ccccagctat ggataaaacg tccatggtta     1080

atatgagtca tgccaacccg ggccttgctg actattttgg cgccaatagg catcctgctg     1140

gtcttacctt tccaacagct cctggatttt cttttagctt ccctggtctg tttccttccg     1200

gcttgtacca caggcctcct ttgatacctg ctagttctcc tgttaaagga ctatcaagta     1260

ctgaacagac aaacaaaagt caaagtcccc tcatgacaca tcctcagata ctgccagcta     1320

cacaggatat tttgaaggca ctatctaaac acccatctgt aggggacaat aagccagtgg     1380

agctccagcc cgagaggtcc tctgaagaga ggccctttga gaaaatcagt gaccagtcag     1440

agagtagtga ccttgatgat gtcagtacac caagtggcag tgacctggaa acaacctcgg     1500

gctctgatct ggaaagtgac attgaaagtg ataaagagaa atttaaagaa aatggtaaaa     1560

tgttcaaaga caaagtaagc cctcttcaga atctggcttc aataaataat aagaaagaat     1620

acagcaatca ttccattttc tcaccatctt tagaggagca gactgcggtg tcaggagctg     1680

tgaatgattc tataaaggct attgcttcta ttgctgaaaa atactttggt tcaacaggac     1740

tggtggggct gcaagacaaa aaagttggag ctttacctta cccttccatg tttcccctcc     1800

cattttttcc agcattctct caatcaatgt acccatttcc tgatagagac ttgagatcgt     1860

tacctttgaa aatggaaccc caatcaccag gtgaagtaaa gaaactgcag aagggcagct     1920

ctgagtcccc ctttgatctc accactaagc gaaaggatga gaagcccttg actccagtcc     1980

cctccaagcc tccagtgaca cctgccacaa gccaagacca gcccctggat ctaagtatgg     2040

gcagtaggag tagagccagt gggacaaagc tgactgagcc tcgaaaaaac cacgtgtttg     2100

ggggaaaaaa aggaagcaac gtcgaatcaa gacctgcttc agatggttcc ttgcagcatg     2160

caagacccac tcctttcttt atggacccta tttacagagt agagaaaaga aaactaactg     2220

acccacttga agctttaaaa gagaaatact tgaggccttc tccaggattc ttgtttcacc     2280

cacaaatgtc agctattgaa aacatggcag aaaagctaga gagcttcagt gccctgaaac     2340

ctgaggccag tgagctctta cagtcagtgc cctctatgtt caacttcagg gcgcctccca     2400

atgccctgcc agagaacctt ctgcggaagg gaaaggagcg ctatacctgc agatactgtg     2460

gcaagatttt tccaaggtct gcaaacctaa cacggcactt gagaacccac acaggagagc     2520

agccttacag atgcaaatac tgtgacagat catttagcat atcttctaac ttgcaaaggc     2580

atgttcgcaa catccacaat aaagagaagc catttaagtg tcacttatgt gataggtgtt     2640

ttggtcaaca aaccaattta gacagacacc taaagaaaca tgagaatggg aacatgtccg     2700

gtacagcaac atcgtcgcct cattctgaac tggaaagtac aggtgcgatt ctggatgaca     2760

aagaagatgc ttacttcaca gaaattcgaa atttcattgg gaacagcaac catggcagcc     2820

aatctcccag gaatgtggag gagagaatga atggcagtca ttttaaagat gaaaaggctt     2880

tggtgaccag tcaaaattca gacttgctgg atgatgaaga agttgaagat gaggtgttgt     2940

tagatgagga ggatgaagac aatgatatta ctggaaaaac aggaaaggaa ccagtgacaa     3000

gtaatttaca tgaaggaaac cctgaggatg actatgaaga aaccagtgcc ctggagatga     3060

gttgcaagac atccccagtg aggtataaag aggaagaata taaaagtgga ctttctgctc     3120

tagatcatat aaggcacttc acagatagcc tcaaaatgag gaaaatggaa gataatcaat     3180

attctgaagc tgagctgtct tcttttagta cttcccatgt gccagaggaa cttaagcagc     3240

cgttacacag aaagtccaaa tcgcaggcat atgctatgat gctgtcactg tctgacaagg     3300

agtccctcca ttctacatcc cacagttctt ccaacgtgtg gcacagtatg gccagggctg     3360

cggcggaatc cagtgctatc cagtccataa gccacgtatg acgttatcaa ggttgaccag     3420

agtgggacca agtccaacag tagcatggct ctttcatata ggactattta caagactgct     3480

gagcagaatg ccttataaac ctgcagggtc actcatctaa agtctagtga ccttaaactg     3540

aatgatttaa aaaagaaaag aaagaaaaaa gaaactattt attctcgata ttttgttttg     3600

cacagcaaag gcagctgctg acttctggaa gatcaatcaa tgcgacttaa agtgattcag     3660

tgaaaacaaa aaacttggtg ggctgaaggc atcttccagt ttaccccacc ttagggtatg     3720

ggtgggtgag aagggcagtt gagatggcag cattgatatg aatgaacact ccatagaaac     3780

tgaattctct tttgtacaag atcacctgac atgattggga acagttgctt ttaattacag     3840

atttaatttt tttcttcgtt aaagttttat gtaatttaac cctttgaaga cagaagtagt     3900

tggatgaaat gcacagtcaa ttattataga aactgataac agggagtact tgttccccct     3960

tttgccttct taagtacatt gtttaaaact agggaaaaag ggtatgtgta tattgtaaac     4020

tatggatgtt aacactcaaa gaggttaagt cagtgaagta acctattcat caccagtacc     4080

gctgtaccac taataaattg tttgccaaat ccttgtaata acatcttaat tttagacaat     4140

catgtcactg tttttaatgt ttattttttt gtgtgtgttg cgtgtatcat gtatttattt     4200

gttggcaaac tattgtttgt tgattaaaat agcactgttc cagtcagcca ctactttatg     4260

acgtctgagg cacacccctt tccgaatttc aaggaccaag gtgacccgac ctgtgtatga     4320

gagtgccaaa tggtgtttgg cttttcttaa cattcctttt tgtttgtttg ttttgttttc     4380

cttcttaatg aactaaatac gaatagatgc aacttagttt ttgtaatact gaaatcgatt     4440

caattgtata aacgattata atttctttca tggaagcatg attcttctga ttaaaaactg     4500

tactccatat tttatgctgg ttgtctgcaa gcttgtgcga tgttatgttc atgttaatcc     4560

tatttgtaaa atgaagtgtt cccaacctta tgttaaaaga gagaagtaaa taacagactg     4620

tattcagtta ttttgccctt tattgaggaa ccagatttgt tttctttttg tttgtaatct     4680

cattttgaaa taatcagcaa gttgaggtac tttcttcaaa tgctttgtac aatataaact     4740

gttatgcctt tcagtgcatt actatgggag gagcaactaa aaaataaaga cttacaaaaa     4800

ggagtatttt t                                                          4811


<210>  139
<211>  1043
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens MDS1 and EVI1 complex locus (MECOM), transcript 
       variant 5, polypeptide NCBI Reference Sequence: NM_001163999.1

<400>  139

Met Lys Ser Glu Asp Tyr Pro His Glu Thr Met Ala Pro Asp Ile His 
1               5                   10                  15      


Glu Glu Arg Gln Tyr Arg Cys Glu Asp Cys Asp Gln Leu Phe Glu Ser 
            20                  25                  30          


Lys Ala Glu Leu Ala Asp His Gln Lys Phe Pro Cys Ser Thr Pro His 
        35                  40                  45              


Ser Ala Phe Ser Met Val Glu Glu Asp Phe Gln Gln Lys Leu Glu Ser 
    50                  55                  60                  


Glu Asn Asp Leu Gln Glu Ile His Thr Ile Gln Glu Cys Lys Glu Cys 
65                  70                  75                  80  


Asp Gln Val Phe Pro Asp Leu Gln Ser Leu Glu Lys His Met Leu Ser 
                85                  90                  95      


His Thr Glu Glu Arg Glu Tyr Lys Cys Asp Gln Cys Pro Lys Ala Phe 
            100                 105                 110         


Asn Trp Lys Ser Asn Leu Ile Arg His Gln Met Ser His Asp Ser Gly 
        115                 120                 125             


Lys His Tyr Glu Cys Glu Asn Cys Ala Lys Gln Val Phe Thr Asp Pro 
    130                 135                 140                 


Ser Asn Leu Gln Arg His Ile Arg Ser Gln His Val Gly Ala Arg Ala 
145                 150                 155                 160 


His Ala Cys Pro Glu Cys Gly Lys Thr Phe Ala Thr Ser Ser Gly Leu 
                165                 170                 175     


Lys Gln His Lys His Ile His Ser Ser Val Lys Pro Phe Ile Cys Glu 
            180                 185                 190         


Val Cys His Lys Ser Tyr Thr Gln Phe Ser Asn Leu Cys Arg His Lys 
        195                 200                 205             


Arg Met His Ala Asp Cys Arg Thr Gln Ile Lys Cys Lys Asp Cys Gly 
    210                 215                 220                 


Gln Met Phe Ser Thr Thr Ser Ser Leu Asn Lys His Arg Arg Phe Cys 
225                 230                 235                 240 


Glu Gly Lys Asn His Phe Ala Ala Gly Gly Phe Phe Gly Gln Gly Ile 
                245                 250                 255     


Ser Leu Pro Gly Thr Pro Ala Met Asp Lys Thr Ser Met Val Asn Met 
            260                 265                 270         


Ser His Ala Asn Pro Gly Leu Ala Asp Tyr Phe Gly Ala Asn Arg His 
        275                 280                 285             


Pro Ala Gly Leu Thr Phe Pro Thr Ala Pro Gly Phe Ser Phe Ser Phe 
    290                 295                 300                 


Pro Gly Leu Phe Pro Ser Gly Leu Tyr His Arg Pro Pro Leu Ile Pro 
305                 310                 315                 320 


Ala Ser Ser Pro Val Lys Gly Leu Ser Ser Thr Glu Gln Thr Asn Lys 
                325                 330                 335     


Ser Gln Ser Pro Leu Met Thr His Pro Gln Ile Leu Pro Ala Thr Gln 
            340                 345                 350         


Asp Ile Leu Lys Ala Leu Ser Lys His Pro Ser Val Gly Asp Asn Lys 
        355                 360                 365             


Pro Val Glu Leu Gln Pro Glu Arg Ser Ser Glu Glu Arg Pro Phe Glu 
    370                 375                 380                 


Lys Ile Ser Asp Gln Ser Glu Ser Ser Asp Leu Asp Asp Val Ser Thr 
385                 390                 395                 400 


Pro Ser Gly Ser Asp Leu Glu Thr Thr Ser Gly Ser Asp Leu Glu Ser 
                405                 410                 415     


Asp Ile Glu Ser Asp Lys Glu Lys Phe Lys Glu Asn Gly Lys Met Phe 
            420                 425                 430         


Lys Asp Lys Val Ser Pro Leu Gln Asn Leu Ala Ser Ile Asn Asn Lys 
        435                 440                 445             


Lys Glu Tyr Ser Asn His Ser Ile Phe Ser Pro Ser Leu Glu Glu Gln 
    450                 455                 460                 


Thr Ala Val Ser Gly Ala Val Asn Asp Ser Ile Lys Ala Ile Ala Ser 
465                 470                 475                 480 


Ile Ala Glu Lys Tyr Phe Gly Ser Thr Gly Leu Val Gly Leu Gln Asp 
                485                 490                 495     


Lys Lys Val Gly Ala Leu Pro Tyr Pro Ser Met Phe Pro Leu Pro Phe 
            500                 505                 510         


Phe Pro Ala Phe Ser Gln Ser Met Tyr Pro Phe Pro Asp Arg Asp Leu 
        515                 520                 525             


Arg Ser Leu Pro Leu Lys Met Glu Pro Gln Ser Pro Gly Glu Val Lys 
    530                 535                 540                 


Lys Leu Gln Lys Gly Ser Ser Glu Ser Pro Phe Asp Leu Thr Thr Lys 
545                 550                 555                 560 


Arg Lys Asp Glu Lys Pro Leu Thr Pro Val Pro Ser Lys Pro Pro Val 
                565                 570                 575     


Thr Pro Ala Thr Ser Gln Asp Gln Pro Leu Asp Leu Ser Met Gly Ser 
            580                 585                 590         


Arg Ser Arg Ala Ser Gly Thr Lys Leu Thr Glu Pro Arg Lys Asn His 
        595                 600                 605             


Val Phe Gly Gly Lys Lys Gly Ser Asn Val Glu Ser Arg Pro Ala Ser 
    610                 615                 620                 


Asp Gly Ser Leu Gln His Ala Arg Pro Thr Pro Phe Phe Met Asp Pro 
625                 630                 635                 640 


Ile Tyr Arg Val Glu Lys Arg Lys Leu Thr Asp Pro Leu Glu Ala Leu 
                645                 650                 655     


Lys Glu Lys Tyr Leu Arg Pro Ser Pro Gly Phe Leu Phe His Pro Gln 
            660                 665                 670         


Met Ser Ala Ile Glu Asn Met Ala Glu Lys Leu Glu Ser Phe Ser Ala 
        675                 680                 685             


Leu Lys Pro Glu Ala Ser Glu Leu Leu Gln Ser Val Pro Ser Met Phe 
    690                 695                 700                 


Asn Phe Arg Ala Pro Pro Asn Ala Leu Pro Glu Asn Leu Leu Arg Lys 
705                 710                 715                 720 


Gly Lys Glu Arg Tyr Thr Cys Arg Tyr Cys Gly Lys Ile Phe Pro Arg 
                725                 730                 735     


Ser Ala Asn Leu Thr Arg His Leu Arg Thr His Thr Gly Glu Gln Pro 
            740                 745                 750         


Tyr Arg Cys Lys Tyr Cys Asp Arg Ser Phe Ser Ile Ser Ser Asn Leu 
        755                 760                 765             


Gln Arg His Val Arg Asn Ile His Asn Lys Glu Lys Pro Phe Lys Cys 
    770                 775                 780                 


His Leu Cys Asp Arg Cys Phe Gly Gln Gln Thr Asn Leu Asp Arg His 
785                 790                 795                 800 


Leu Lys Lys His Glu Asn Gly Asn Met Ser Gly Thr Ala Thr Ser Ser 
                805                 810                 815     


Pro His Ser Glu Leu Glu Ser Thr Gly Ala Ile Leu Asp Asp Lys Glu 
            820                 825                 830         


Asp Ala Tyr Phe Thr Glu Ile Arg Asn Phe Ile Gly Asn Ser Asn His 
        835                 840                 845             


Gly Ser Gln Ser Pro Arg Asn Val Glu Glu Arg Met Asn Gly Ser His 
    850                 855                 860                 


Phe Lys Asp Glu Lys Ala Leu Val Thr Ser Gln Asn Ser Asp Leu Leu 
865                 870                 875                 880 


Asp Asp Glu Glu Val Glu Asp Glu Val Leu Leu Asp Glu Glu Asp Glu 
                885                 890                 895     


Asp Asn Asp Ile Thr Gly Lys Thr Gly Lys Glu Pro Val Thr Ser Asn 
            900                 905                 910         


Leu His Glu Gly Asn Pro Glu Asp Asp Tyr Glu Glu Thr Ser Ala Leu 
        915                 920                 925             


Glu Met Ser Cys Lys Thr Ser Pro Val Arg Tyr Lys Glu Glu Glu Tyr 
    930                 935                 940                 


Lys Ser Gly Leu Ser Ala Leu Asp His Ile Arg His Phe Thr Asp Ser 
945                 950                 955                 960 


Leu Lys Met Arg Lys Met Glu Asp Asn Gln Tyr Ser Glu Ala Glu Leu 
                965                 970                 975     


Ser Ser Phe Ser Thr Ser His Val Pro Glu Glu Leu Lys Gln Pro Leu 
            980                 985                 990         


His Arg Lys Ser Lys Ser Gln Ala  Tyr Ala Met Met Leu  Ser Leu Ser 
        995                 1000                 1005             


Asp Lys  Glu Ser Leu His Ser  Thr Ser His Ser Ser  Ser Asn Val 
    1010                 1015                 1020             


Trp His  Ser Met Ala Arg Ala  Ala Ala Glu Ser Ser  Ala Ile Gln 
    1025                 1030                 1035             


Ser Ile  Ser His Val 
    1040             


<210>  140
<211>  5740
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens MDS1 and EVI1 complex locus (MECOM), transcript 
       variant 6, mRNA NCBI Reference Sequence: NM_001164000.1

<400>  140
cacacacaca cacacacacc acacttgtgc tttcaagaca tcgaaacgga ggctatttcc       60

ctggggaaag aaatcctgcc tggcgagatc tccccattgg ttgtttaccc ggagaaatct      120

acatgtttaa gggggatggt gcatccataa tcagtctgtc cctataggac ttgggtcttg      180

gcgacctttt tgtgacctct cccgccagag gaggctgctg tcactttaaa aatttaaaag      240

aggagcccgt ctggcttccg atcagatcac tctgggcggc gggagatagc tccctttctc      300

cctcgccccg ggttctttct ggatggccga gcagatcctc tttaaagaga cagttcatga      360

aatagaaacc cggcggctga gcttggagtt gcgaaagggg acgatcccgt gagggtccag      420

gacccgcgaa ggcgctgcgg aggatctgaa agggggatag agctcccctc gcctccccag      480

gccccccacc ttttcaaact ctcctcctcc tgcttgtttt cccccattgg aactgggaag      540

gagaagtaga agttttagtg ggtttcagat aactttcatt acacatcggg ctgataagag      600

caagagaaag tgagaaaaga gggaggtgat gtgaaccaga aggaatagct ccgagctcat      660

ttaggaaggg ggaaaaagcc aaaacacacc aaacccgggt cacccagacg aaagaagact      720

tcatttcttg tattaaaaat acactgttgg cggacaataa atccgaaacg cgtggtcctg      780

gagagcagat cctagagacg gacaaagttg tcagagaccc atttggaaat cgagacgcga      840

ggcttttaaa aaattattat tattattttt aaacatctct aaatgttgct cgggatcgtt      900

tgaaaggatt ttcgtgcagg agcgttgggg gctgctgatt attttatttt gtttattttg      960

attcttctgt gaatgcctat tattgctgag ttgaggccat agaaatctaa agatcttaga     1020

cgaattttac aatgtgaagt tctgcataga tgccagtcaa ccagatgttg gaagctggct     1080

caagtacatt agattcgctg gctgttatga tcagcacaac cttgttgcat gccagataaa     1140

tgatcagata ttctatagag tagttgcaga cattgcgccg ggagaggagc ttctgctgtt     1200

catgaagagc gaagactatc cccatgaaac tatggcgccg gatatccacg aagaacggca     1260

atatcgctgc gaagactgtg accagctctt tgaatctaag gctgaactag cagatcacca     1320

aaagtttcca tgcagtactc ctcactcagc attttcaatg gttgaagagg actttcagca     1380

aaaactcgaa agcgagaatg atctccaaga gatacacacg atccaggagt gtaaggaatg     1440

tgaccaagtt tttcctgatt tgcaaagcct ggagaaacac atgctgtcac atactgaaga     1500

gagggaatac aagtgtgatc agtgtcccaa ggcatttaac tggaagtcca atttaattcg     1560

ccaccagatg tcacatgaca gtggaaagca ctatgaatgt gaaaactgtg ccaaggtttt     1620

cacggaccct agcaaccttc agcggcacat tcgctctcag catgtcggtg cccgggccca     1680

tgcatgcccg gagtgtggca aaacgtttgc cacttcgtcg ggcctcaaac aacacaagca     1740

catccacagc agtgtgaagc cctttatctg tgaggtctgc cataaatcct atactcagtt     1800

ttcaaacctt tgccgtcata agcgcatgca tgctgattgc agaacccaaa tcaagtgcaa     1860

agactgtgga caaatgttca gcactacgtc ttccttaaat aaacacagga ggttttgtga     1920

gggcaagaac cattttgcgg caggtggatt ttttggccaa ggcatttcac ttcctggaac     1980

cccagctatg gataaaacgt ccatggttaa tatgagtcat gccaacccgg gccttgctga     2040

ctattttggc gccaataggc atcctgctgg tcttaccttt ccaacagctc ctggattttc     2100

ttttagcttc cctggtctgt ttccttccgg cttgtaccac aggcctcctt tgatacctgc     2160

tagttctcct gttaaaggac tatcaagtac tgaacagaca aacaaaagtc aaagtcccct     2220

catgacacat cctcagatac tgccagctac acaggatatt ttgaaggcac tatctaaaca     2280

cccatctgta ggggacaata agccagtgga gctccagccc gagaggtcct ctgaagagag     2340

gccctttgag aaaatcagtg accagtcaga gagtagtgac cttgatgatg tcagtacacc     2400

aagtggcagt gacctggaaa caacctcggg ctctgatctg gaaagtgaca ttgaaagtga     2460

taaagagaaa tttaaagaaa atggtaaaat gttcaaagac aaagtaagcc ctcttcagaa     2520

tctggcttca ataaataata agaaagaata cagcaatcat tccattttct caccatcttt     2580

agaggagcag actgcggtgt caggagctgt gaatgattct ataaaggcta ttgcttctat     2640

tgctgaaaaa tactttggtt caacaggact ggtggggctg caagacaaaa aagttggagc     2700

tttaccttac ccttccatgt ttcccctccc attttttcca gcattctctc aatcaatgta     2760

cccatttcct gatagagact tgagatcgtt acctttgaaa atggaacccc aatcaccagg     2820

tgaagtaaag aaactgcaga agggcagctc tgagtccccc tttgatctca ccactaagcg     2880

aaaggatgag aagcccttga ctccagtccc ctccaagcct ccagtgacac ctgccacaag     2940

ccaagaccag cccctggatc taagtatggg cagtaggagt agagccagtg ggacaaagct     3000

gactgagcct cgaaaaaacc acgtgtttgg gggaaaaaaa ggaagcaacg tcgaatcaag     3060

acctgcttca gatggttcct tgcagcatgc aagacccact cctttcttta tggaccctat     3120

ttacagagta gagaaaagaa aactaactga cccacttgaa gctttaaaag agaaatactt     3180

gaggccttct ccaggattct tgtttcaccc acaaatgtca gctattgaaa acatggcaga     3240

aaagctagag agcttcagtg ccctgaaacc tgaggccagt gagctcttac agtcagtgcc     3300

ctctatgttc aacttcaggg cgcctcccaa tgccctgcca gagaaccttc tgcggaaggg     3360

aaaggagcgc tatacctgca gatactgtgg caagattttt ccaaggtctg caaacctaac     3420

acggcacttg agaacccaca caggagagca gccttacaga tgcaaatact gtgacagatc     3480

atttagcata tcttctaact tgcaaaggca tgttcgcaac atccacaata aagagaagcc     3540

atttaagtgt cacttatgtg ataggtgttt tggtcaacaa accaatttag acagacacct     3600

aaagaaacat gagaatggga acatgtccgg tacagcaaca tcgtcgcctc attctgaact     3660

ggaaagtaca ggtgcgattc tggatgacaa agaagatgct tacttcacag aaattcgaaa     3720

tttcattggg aacagcaacc atggcagcca atctcccagg aatgtggagg agagaatgaa     3780

tggcagtcat tttaaagatg aaaaggcttt ggtgaccagt caaaattcag acttgctgga     3840

tgatgaagaa gttgaagatg aggtgttgtt agatgaggag gatgaagaca atgatattac     3900

tggaaaaaca ggaaaggaac cagtgacaag taatttacat gaaggaaacc ctgaggatga     3960

ctatgaagaa accagtgccc tggagatgag ttgcaagaca tccccagtga ggtataaaga     4020

ggaagaatat aaaagtggac tttctgctct agatcatata aggcacttca cagatagcct     4080

caaaatgagg aaaatggaag ataatcaata ttctgaagct gagctgtctt cttttagtac     4140

ttcccatgtg ccagaggaac ttaagcagcc gttacacaga aagtccaaat cgcaggcata     4200

tgctatgatg ctgtcactgt ctgacaagga gtccctccat tctacatccc acagttcttc     4260

caacgtgtgg cacagtatgg ccagggctgc ggcggaatcc agtgctatcc agtccataag     4320

ccacgtatga cgttatcaag gttgaccaga gtgggaccaa gtccaacagt agcatggctc     4380

tttcatatag gactatttac aagactgctg agcagaatgc cttataaacc tgcagggtca     4440

ctcatctaaa gtctagtgac cttaaactga atgatttaaa aaagaaaaga aagaaaaaag     4500

aaactattta ttctcgatat tttgttttgc acagcaaagg cagctgctga cttctggaag     4560

atcaatcaat gcgacttaaa gtgattcagt gaaaacaaaa aacttggtgg gctgaaggca     4620

tcttccagtt taccccacct tagggtatgg gtgggtgaga agggcagttg agatggcagc     4680

attgatatga atgaacactc catagaaact gaattctctt ttgtacaaga tcacctgaca     4740

tgattgggaa cagttgcttt taattacaga tttaattttt ttcttcgtta aagttttatg     4800

taatttaacc ctttgaagac agaagtagtt ggatgaaatg cacagtcaat tattatagaa     4860

actgataaca gggagtactt gttccccctt ttgccttctt aagtacattg tttaaaacta     4920

gggaaaaagg gtatgtgtat attgtaaact atggatgtta acactcaaag aggttaagtc     4980

agtgaagtaa cctattcatc accagtaccg ctgtaccact aataaattgt ttgccaaatc     5040

cttgtaataa catcttaatt ttagacaatc atgtcactgt ttttaatgtt tatttttttg     5100

tgtgtgttgc gtgtatcatg tatttatttg ttggcaaact attgtttgtt gattaaaata     5160

gcactgttcc agtcagccac tactttatga cgtctgaggc acaccccttt ccgaatttca     5220

aggaccaagg tgacccgacc tgtgtatgag agtgccaaat ggtgtttggc ttttcttaac     5280

attccttttt gtttgtttgt tttgttttcc ttcttaatga actaaatacg aatagatgca     5340

acttagtttt tgtaatactg aaatcgattc aattgtataa acgattataa tttctttcat     5400

ggaagcatga ttcttctgat taaaaactgt actccatatt ttatgctggt tgtctgcaag     5460

cttgtgcgat gttatgttca tgttaatcct atttgtaaaa tgaagtgttc ccaaccttat     5520

gttaaaagag agaagtaaat aacagactgt attcagttat tttgcccttt attgaggaac     5580

cagatttgtt ttctttttgt ttgtaatctc attttgaaat aatcagcaag ttgaggtact     5640

ttcttcaaat gctttgtaca atataaactg ttatgccttt cagtgcatta ctatgggagg     5700

agcaactaaa aaataaagac ttacaaaaag gagtattttt                           5740


<210>  141
<211>  1042
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens MDS1 and EVI1 complex locus (MECOM), transcript 
       variant 6, polypeptide NCBI Reference Sequence: NM_001164000.1

<400>  141

Met Lys Ser Glu Asp Tyr Pro His Glu Thr Met Ala Pro Asp Ile His 
1               5                   10                  15      


Glu Glu Arg Gln Tyr Arg Cys Glu Asp Cys Asp Gln Leu Phe Glu Ser 
            20                  25                  30          


Lys Ala Glu Leu Ala Asp His Gln Lys Phe Pro Cys Ser Thr Pro His 
        35                  40                  45              


Ser Ala Phe Ser Met Val Glu Glu Asp Phe Gln Gln Lys Leu Glu Ser 
    50                  55                  60                  


Glu Asn Asp Leu Gln Glu Ile His Thr Ile Gln Glu Cys Lys Glu Cys 
65                  70                  75                  80  


Asp Gln Val Phe Pro Asp Leu Gln Ser Leu Glu Lys His Met Leu Ser 
                85                  90                  95      


His Thr Glu Glu Arg Glu Tyr Lys Cys Asp Gln Cys Pro Lys Ala Phe 
            100                 105                 110         


Asn Trp Lys Ser Asn Leu Ile Arg His Gln Met Ser His Asp Ser Gly 
        115                 120                 125             


Lys His Tyr Glu Cys Glu Asn Cys Ala Lys Val Phe Thr Asp Pro Ser 
    130                 135                 140                 


Asn Leu Gln Arg His Ile Arg Ser Gln His Val Gly Ala Arg Ala His 
145                 150                 155                 160 


Ala Cys Pro Glu Cys Gly Lys Thr Phe Ala Thr Ser Ser Gly Leu Lys 
                165                 170                 175     


Gln His Lys His Ile His Ser Ser Val Lys Pro Phe Ile Cys Glu Val 
            180                 185                 190         


Cys His Lys Ser Tyr Thr Gln Phe Ser Asn Leu Cys Arg His Lys Arg 
        195                 200                 205             


Met His Ala Asp Cys Arg Thr Gln Ile Lys Cys Lys Asp Cys Gly Gln 
    210                 215                 220                 


Met Phe Ser Thr Thr Ser Ser Leu Asn Lys His Arg Arg Phe Cys Glu 
225                 230                 235                 240 


Gly Lys Asn His Phe Ala Ala Gly Gly Phe Phe Gly Gln Gly Ile Ser 
                245                 250                 255     


Leu Pro Gly Thr Pro Ala Met Asp Lys Thr Ser Met Val Asn Met Ser 
            260                 265                 270         


His Ala Asn Pro Gly Leu Ala Asp Tyr Phe Gly Ala Asn Arg His Pro 
        275                 280                 285             


Ala Gly Leu Thr Phe Pro Thr Ala Pro Gly Phe Ser Phe Ser Phe Pro 
    290                 295                 300                 


Gly Leu Phe Pro Ser Gly Leu Tyr His Arg Pro Pro Leu Ile Pro Ala 
305                 310                 315                 320 


Ser Ser Pro Val Lys Gly Leu Ser Ser Thr Glu Gln Thr Asn Lys Ser 
                325                 330                 335     


Gln Ser Pro Leu Met Thr His Pro Gln Ile Leu Pro Ala Thr Gln Asp 
            340                 345                 350         


Ile Leu Lys Ala Leu Ser Lys His Pro Ser Val Gly Asp Asn Lys Pro 
        355                 360                 365             


Val Glu Leu Gln Pro Glu Arg Ser Ser Glu Glu Arg Pro Phe Glu Lys 
    370                 375                 380                 


Ile Ser Asp Gln Ser Glu Ser Ser Asp Leu Asp Asp Val Ser Thr Pro 
385                 390                 395                 400 


Ser Gly Ser Asp Leu Glu Thr Thr Ser Gly Ser Asp Leu Glu Ser Asp 
                405                 410                 415     


Ile Glu Ser Asp Lys Glu Lys Phe Lys Glu Asn Gly Lys Met Phe Lys 
            420                 425                 430         


Asp Lys Val Ser Pro Leu Gln Asn Leu Ala Ser Ile Asn Asn Lys Lys 
        435                 440                 445             


Glu Tyr Ser Asn His Ser Ile Phe Ser Pro Ser Leu Glu Glu Gln Thr 
    450                 455                 460                 


Ala Val Ser Gly Ala Val Asn Asp Ser Ile Lys Ala Ile Ala Ser Ile 
465                 470                 475                 480 


Ala Glu Lys Tyr Phe Gly Ser Thr Gly Leu Val Gly Leu Gln Asp Lys 
                485                 490                 495     


Lys Val Gly Ala Leu Pro Tyr Pro Ser Met Phe Pro Leu Pro Phe Phe 
            500                 505                 510         


Pro Ala Phe Ser Gln Ser Met Tyr Pro Phe Pro Asp Arg Asp Leu Arg 
        515                 520                 525             


Ser Leu Pro Leu Lys Met Glu Pro Gln Ser Pro Gly Glu Val Lys Lys 
    530                 535                 540                 


Leu Gln Lys Gly Ser Ser Glu Ser Pro Phe Asp Leu Thr Thr Lys Arg 
545                 550                 555                 560 


Lys Asp Glu Lys Pro Leu Thr Pro Val Pro Ser Lys Pro Pro Val Thr 
                565                 570                 575     


Pro Ala Thr Ser Gln Asp Gln Pro Leu Asp Leu Ser Met Gly Ser Arg 
            580                 585                 590         


Ser Arg Ala Ser Gly Thr Lys Leu Thr Glu Pro Arg Lys Asn His Val 
        595                 600                 605             


Phe Gly Gly Lys Lys Gly Ser Asn Val Glu Ser Arg Pro Ala Ser Asp 
    610                 615                 620                 


Gly Ser Leu Gln His Ala Arg Pro Thr Pro Phe Phe Met Asp Pro Ile 
625                 630                 635                 640 


Tyr Arg Val Glu Lys Arg Lys Leu Thr Asp Pro Leu Glu Ala Leu Lys 
                645                 650                 655     


Glu Lys Tyr Leu Arg Pro Ser Pro Gly Phe Leu Phe His Pro Gln Met 
            660                 665                 670         


Ser Ala Ile Glu Asn Met Ala Glu Lys Leu Glu Ser Phe Ser Ala Leu 
        675                 680                 685             


Lys Pro Glu Ala Ser Glu Leu Leu Gln Ser Val Pro Ser Met Phe Asn 
    690                 695                 700                 


Phe Arg Ala Pro Pro Asn Ala Leu Pro Glu Asn Leu Leu Arg Lys Gly 
705                 710                 715                 720 


Lys Glu Arg Tyr Thr Cys Arg Tyr Cys Gly Lys Ile Phe Pro Arg Ser 
                725                 730                 735     


Ala Asn Leu Thr Arg His Leu Arg Thr His Thr Gly Glu Gln Pro Tyr 
            740                 745                 750         


Arg Cys Lys Tyr Cys Asp Arg Ser Phe Ser Ile Ser Ser Asn Leu Gln 
        755                 760                 765             


Arg His Val Arg Asn Ile His Asn Lys Glu Lys Pro Phe Lys Cys His 
    770                 775                 780                 


Leu Cys Asp Arg Cys Phe Gly Gln Gln Thr Asn Leu Asp Arg His Leu 
785                 790                 795                 800 


Lys Lys His Glu Asn Gly Asn Met Ser Gly Thr Ala Thr Ser Ser Pro 
                805                 810                 815     


His Ser Glu Leu Glu Ser Thr Gly Ala Ile Leu Asp Asp Lys Glu Asp 
            820                 825                 830         


Ala Tyr Phe Thr Glu Ile Arg Asn Phe Ile Gly Asn Ser Asn His Gly 
        835                 840                 845             


Ser Gln Ser Pro Arg Asn Val Glu Glu Arg Met Asn Gly Ser His Phe 
    850                 855                 860                 


Lys Asp Glu Lys Ala Leu Val Thr Ser Gln Asn Ser Asp Leu Leu Asp 
865                 870                 875                 880 


Asp Glu Glu Val Glu Asp Glu Val Leu Leu Asp Glu Glu Asp Glu Asp 
                885                 890                 895     


Asn Asp Ile Thr Gly Lys Thr Gly Lys Glu Pro Val Thr Ser Asn Leu 
            900                 905                 910         


His Glu Gly Asn Pro Glu Asp Asp Tyr Glu Glu Thr Ser Ala Leu Glu 
        915                 920                 925             


Met Ser Cys Lys Thr Ser Pro Val Arg Tyr Lys Glu Glu Glu Tyr Lys 
    930                 935                 940                 


Ser Gly Leu Ser Ala Leu Asp His Ile Arg His Phe Thr Asp Ser Leu 
945                 950                 955                 960 


Lys Met Arg Lys Met Glu Asp Asn Gln Tyr Ser Glu Ala Glu Leu Ser 
                965                 970                 975     


Ser Phe Ser Thr Ser His Val Pro Glu Glu Leu Lys Gln Pro Leu His 
            980                 985                 990         


Arg Lys Ser Lys Ser Gln Ala Tyr  Ala Met Met Leu Ser  Leu Ser Asp 
        995                 1000                 1005             


Lys Glu  Ser Leu His Ser Thr  Ser His Ser Ser Ser  Asn Val Trp 
    1010                 1015                 1020             


His Ser  Met Ala Arg Ala Ala  Ala Glu Ser Ser Ala  Ile Gln Ser 
    1025                 1030                 1035             


Ile Ser  His Val 
    1040         


<210>  142
<211>  5195
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens MDS1 and EVI1 complex locus (MECOM), transcript 
       variant 7, mRNA NCBI Reference Sequence: NM_001205194.1

<400>  142
gattgccatc tgacaagatc tccaaatcaa agtgataaat cgctccaaac tttttttggc       60

ggcgctgaga tgttggaggg gcgtctagcg cgcatgtgcg aaggtgtcca aactgacaat      120

gctggagaga tagcgagtgt ggattgagag aaagggagag agggagggag agagagtgaa      180

agaagaaaat acagagagtg agtgtgtgga agagagagag aaacaggaga gaaacaggag      240

ggagggagag agagagagag agagagagag agagagagag agagagagag agagagagag      300

acaggagaga gagggaggga gcgagaggga gagcaaaaga aggaaaggat ccaagaaaaa      360

aaagccccaa ccacacacca gcggctgcag gactgggcac agcatgagat ccaaaggcag      420

ggcaaggaaa ctggccacaa atcttagacg aattttacaa tgtgaagttc tgcatagatg      480

ccagtcaacc agatgttgga agctggctca agtacattag attcgctggc tgttatgatc      540

agcacaacct tgttgcatgc cagataaatg atcagatatt ctatagagta gttgcagaca      600

ttgcgccggg agaggagctt ctgctgttca tgaagagcga agactatccc catgaaacta      660

tggcgccgga tatccacgaa gaacggcaat atcgctgcga agactgtgac cagctctttg      720

aatctaaggc tgaactagca gatcaccaaa agtttccatg cagtactcct cactcagcat      780

tttcaatggt tgaagaggac tttcagcaaa aactcgaaag cgagaatgat ctccaagaga      840

tacacacgat ccaggagtgt aaggaatgtg accaagtttt tcctgatttg caaagcctgg      900

agaaacacat gctgtcacat actgaagaga gggaatacaa gtgtgatcag tgtcccaagg      960

catttaactg gaagtccaat ttaattcgcc accagatgtc acatgacagt ggaaagcact     1020

atgaatgtga aaactgtgcc aaggttttca cggaccctag caaccttcag cggcacattc     1080

gctctcagca tgtcggtgcc cgggcccatg catgcccgga gtgtggcaaa acgtttgcca     1140

cttcgtcggg cctcaaacaa cacaagcaca tccacagcag tgtgaagccc tttatctgtg     1200

aggtctgcca taaatcctat actcagtttt caaacctttg ccgtcataag cgcatgcatg     1260

ctgattgcag aacccaaatc aagtgcaaag actgtggaca aatgttcagc actacgtctt     1320

ccttaaataa acacaggagg ttttgtgagg gcaagaacca ttttgcggca ggtggatttt     1380

ttggccaagg catttcactt cctggaaccc cagctatgga taaaacgtcc atggttaata     1440

tgagtcatgc caacccgggc cttgctgact attttggcgc caataggcat cctgctggtc     1500

ttacctttcc aacagctcct ggattttctt ttagcttccc tggtctgttt ccttccggct     1560

tgtaccacag gcctcctttg atacctgcta gttctcctgt taaaggacta tcaagtactg     1620

aacagacaaa caaaagtcaa agtcccctca tgacacatcc tcagatactg ccagctacac     1680

aggatatttt gaaggcacta tctaaacacc catctgtagg ggacaataag ccagtggagc     1740

tccagcccga gaggtcctct gaagagaggc cctttgagaa aatcagtgac cagtcagaga     1800

gtagtgacct tgatgatgtc agtacaccaa gtggcagtga cctggaaaca acctcgggct     1860

ctgatctgga aagtgacatt gaaagtgata aagagaaatt taaagaaaat ggtaaaatgt     1920

tcaaagacaa agtaagccct cttcagaatc tggcttcaat aaataataag aaagaataca     1980

gcaatcattc cattttctca ccatctttag aggagcagac tgcggtgtca ggagctgtga     2040

atgattctat aaaggctatt gcttctattg ctgaaaaata ctttggttca acaggactgg     2100

tggggctgca agacaaaaaa gttggagctt taccttaccc ttccatgttt cccctcccat     2160

tttttccagc attctctcaa tcaatgtacc catttcctga tagagacttg agatcgttac     2220

ctttgaaaat ggaaccccaa tcaccaggtg aagtaaagaa actgcagaag ggcagctctg     2280

agtccccctt tgatctcacc actaagcgaa aggatgagaa gcccttgact ccagtcccct     2340

ccaagcctcc agtgacacct gccacaagcc aagaccagcc cctggatcta agtatgggca     2400

gtaggagtag agccagtggg acaaagctga ctgagcctcg aaaaaaccac gtgtttgggg     2460

gaaaaaaagg aagcaacgtc gaatcaagac ctgcttcaga tggttccttg cagcatgcaa     2520

gacccactcc tttctttatg gaccctattt acagagtaga gaaaagaaaa ctaactgacc     2580

cacttgaagc tttaaaagag aaatacttga ggccttctcc aggattcttg tttcacccac     2640

aattccaact gcctgatcag agaacttgga tgtcagctat tgaaaacatg gcagaaaagc     2700

tagagagctt cagtgccctg aaacctgagg ccagtgagct cttacagtca gtgccctcta     2760

tgttcaactt cagggcgcct cccaatgccc tgccagagaa ccttctgcgg aagggaaagg     2820

agcgctatac ctgcagatac tgtggcaaga tttttccaag gtctgcaaac ctaacacggc     2880

acttgagaac ccacacagga gagcagcctt acagatgcaa atactgtgac agatcattta     2940

gcatatcttc taacttgcaa aggcatgttc gcaacatcca caataaagag aagccattta     3000

agtgtcactt atgtgatagg tgttttggtc aacaaaccaa tttagacaga cacctaaaga     3060

aacatgagaa tgggaacatg tccggtacag caacatcgtc gcctcattct gaactggaaa     3120

gtacaggtgc gattctggat gacaaagaag atgcttactt cacagaaatt cgaaatttca     3180

ttgggaacag caaccatggc agccaatctc ccaggaatgt ggaggagaga atgaatggca     3240

gtcattttaa agatgaaaag gctttggtga ccagtcaaaa ttcagacttg ctggatgatg     3300

aagaagttga agatgaggtg ttgttagatg aggaggatga agacaatgat attactggaa     3360

aaacaggaaa ggaaccagtg acaagtaatt tacatgaagg aaaccctgag gatgactatg     3420

aagaaaccag tgccctggag atgagttgca agacatcccc agtgaggtat aaagaggaag     3480

aatataaaag tggactttct gctctagatc atataaggca cttcacagat agcctcaaaa     3540

tgaggaaaat ggaagataat caatattctg aagctgagct gtcttctttt agtacttccc     3600

atgtgccaga ggaacttaag cagccgttac acagaaagtc caaatcgcag gcatatgcta     3660

tgatgctgtc actgtctgac aaggagtccc tccattctac atcccacagt tcttccaacg     3720

tgtggcacag tatggccagg gctgcggcgg aatccagtgc tatccagtcc ataagccacg     3780

tatgacgtta tcaaggttga ccagagtggg accaagtcca acagtagcat ggctctttca     3840

tataggacta tttacaagac tgctgagcag aatgccttat aaacctgcag ggtcactcat     3900

ctaaagtcta gtgaccttaa actgaatgat ttaaaaaaga aaagaaagaa aaaagaaact     3960

atttattctc gatattttgt tttgcacagc aaaggcagct gctgacttct ggaagatcaa     4020

tcaatgcgac ttaaagtgat tcagtgaaaa caaaaaactt ggtgggctga aggcatcttc     4080

cagtttaccc caccttaggg tatgggtggg tgagaagggc agttgagatg gcagcattga     4140

tatgaatgaa cactccatag aaactgaatt ctcttttgta caagatcacc tgacatgatt     4200

gggaacagtt gcttttaatt acagatttaa tttttttctt cgttaaagtt ttatgtaatt     4260

taaccctttg aagacagaag tagttggatg aaatgcacag tcaattatta tagaaactga     4320

taacagggag tacttgttcc cccttttgcc ttcttaagta cattgtttaa aactagggaa     4380

aaagggtatg tgtatattgt aaactatgga tgttaacact caaagaggtt aagtcagtga     4440

agtaacctat tcatcaccag taccgctgta ccactaataa attgtttgcc aaatccttgt     4500

aataacatct taattttaga caatcatgtc actgttttta atgtttattt ttttgtgtgt     4560

gttgcgtgta tcatgtattt atttgttggc aaactattgt ttgttgatta aaatagcact     4620

gttccagtca gccactactt tatgacgtct gaggcacacc cctttccgaa tttcaaggac     4680

caaggtgacc cgacctgtgt atgagagtgc caaatggtgt ttggcttttc ttaacattcc     4740

tttttgtttg tttgttttgt tttccttctt aatgaactaa atacgaatag atgcaactta     4800

gtttttgtaa tactgaaatc gattcaattg tataaacgat tataatttct ttcatggaag     4860

catgattctt ctgattaaaa actgtactcc atattttatg ctggttgtct gcaagcttgt     4920

gcgatgttat gttcatgtta atcctatttg taaaatgaag tgttcccaac cttatgttaa     4980

aagagagaag taaataacag actgtattca gttattttgc cctttattga ggaaccagat     5040

ttgttttctt tttgtttgta atctcatttt gaaataatca gcaagttgag gtactttctt     5100

caaatgcttt gtacaatata aactgttatg cctttcagtg cattactatg ggaggagcaa     5160

ctaaaaaata aagacttaca aaaaggagta ttttt                                5195


<210>  143
<211>  1051
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens MDS1 and EVI1 complex locus (MECOM), transcript 
       variant 7, polypeptide NCBI Reference Sequence: NM_001205194.1

<400>  143

Met Lys Ser Glu Asp Tyr Pro His Glu Thr Met Ala Pro Asp Ile His 
1               5                   10                  15      


Glu Glu Arg Gln Tyr Arg Cys Glu Asp Cys Asp Gln Leu Phe Glu Ser 
            20                  25                  30          


Lys Ala Glu Leu Ala Asp His Gln Lys Phe Pro Cys Ser Thr Pro His 
        35                  40                  45              


Ser Ala Phe Ser Met Val Glu Glu Asp Phe Gln Gln Lys Leu Glu Ser 
    50                  55                  60                  


Glu Asn Asp Leu Gln Glu Ile His Thr Ile Gln Glu Cys Lys Glu Cys 
65                  70                  75                  80  


Asp Gln Val Phe Pro Asp Leu Gln Ser Leu Glu Lys His Met Leu Ser 
                85                  90                  95      


His Thr Glu Glu Arg Glu Tyr Lys Cys Asp Gln Cys Pro Lys Ala Phe 
            100                 105                 110         


Asn Trp Lys Ser Asn Leu Ile Arg His Gln Met Ser His Asp Ser Gly 
        115                 120                 125             


Lys His Tyr Glu Cys Glu Asn Cys Ala Lys Val Phe Thr Asp Pro Ser 
    130                 135                 140                 


Asn Leu Gln Arg His Ile Arg Ser Gln His Val Gly Ala Arg Ala His 
145                 150                 155                 160 


Ala Cys Pro Glu Cys Gly Lys Thr Phe Ala Thr Ser Ser Gly Leu Lys 
                165                 170                 175     


Gln His Lys His Ile His Ser Ser Val Lys Pro Phe Ile Cys Glu Val 
            180                 185                 190         


Cys His Lys Ser Tyr Thr Gln Phe Ser Asn Leu Cys Arg His Lys Arg 
        195                 200                 205             


Met His Ala Asp Cys Arg Thr Gln Ile Lys Cys Lys Asp Cys Gly Gln 
    210                 215                 220                 


Met Phe Ser Thr Thr Ser Ser Leu Asn Lys His Arg Arg Phe Cys Glu 
225                 230                 235                 240 


Gly Lys Asn His Phe Ala Ala Gly Gly Phe Phe Gly Gln Gly Ile Ser 
                245                 250                 255     


Leu Pro Gly Thr Pro Ala Met Asp Lys Thr Ser Met Val Asn Met Ser 
            260                 265                 270         


His Ala Asn Pro Gly Leu Ala Asp Tyr Phe Gly Ala Asn Arg His Pro 
        275                 280                 285             


Ala Gly Leu Thr Phe Pro Thr Ala Pro Gly Phe Ser Phe Ser Phe Pro 
    290                 295                 300                 


Gly Leu Phe Pro Ser Gly Leu Tyr His Arg Pro Pro Leu Ile Pro Ala 
305                 310                 315                 320 


Ser Ser Pro Val Lys Gly Leu Ser Ser Thr Glu Gln Thr Asn Lys Ser 
                325                 330                 335     


Gln Ser Pro Leu Met Thr His Pro Gln Ile Leu Pro Ala Thr Gln Asp 
            340                 345                 350         


Ile Leu Lys Ala Leu Ser Lys His Pro Ser Val Gly Asp Asn Lys Pro 
        355                 360                 365             


Val Glu Leu Gln Pro Glu Arg Ser Ser Glu Glu Arg Pro Phe Glu Lys 
    370                 375                 380                 


Ile Ser Asp Gln Ser Glu Ser Ser Asp Leu Asp Asp Val Ser Thr Pro 
385                 390                 395                 400 


Ser Gly Ser Asp Leu Glu Thr Thr Ser Gly Ser Asp Leu Glu Ser Asp 
                405                 410                 415     


Ile Glu Ser Asp Lys Glu Lys Phe Lys Glu Asn Gly Lys Met Phe Lys 
            420                 425                 430         


Asp Lys Val Ser Pro Leu Gln Asn Leu Ala Ser Ile Asn Asn Lys Lys 
        435                 440                 445             


Glu Tyr Ser Asn His Ser Ile Phe Ser Pro Ser Leu Glu Glu Gln Thr 
    450                 455                 460                 


Ala Val Ser Gly Ala Val Asn Asp Ser Ile Lys Ala Ile Ala Ser Ile 
465                 470                 475                 480 


Ala Glu Lys Tyr Phe Gly Ser Thr Gly Leu Val Gly Leu Gln Asp Lys 
                485                 490                 495     


Lys Val Gly Ala Leu Pro Tyr Pro Ser Met Phe Pro Leu Pro Phe Phe 
            500                 505                 510         


Pro Ala Phe Ser Gln Ser Met Tyr Pro Phe Pro Asp Arg Asp Leu Arg 
        515                 520                 525             


Ser Leu Pro Leu Lys Met Glu Pro Gln Ser Pro Gly Glu Val Lys Lys 
    530                 535                 540                 


Leu Gln Lys Gly Ser Ser Glu Ser Pro Phe Asp Leu Thr Thr Lys Arg 
545                 550                 555                 560 


Lys Asp Glu Lys Pro Leu Thr Pro Val Pro Ser Lys Pro Pro Val Thr 
                565                 570                 575     


Pro Ala Thr Ser Gln Asp Gln Pro Leu Asp Leu Ser Met Gly Ser Arg 
            580                 585                 590         


Ser Arg Ala Ser Gly Thr Lys Leu Thr Glu Pro Arg Lys Asn His Val 
        595                 600                 605             


Phe Gly Gly Lys Lys Gly Ser Asn Val Glu Ser Arg Pro Ala Ser Asp 
    610                 615                 620                 


Gly Ser Leu Gln His Ala Arg Pro Thr Pro Phe Phe Met Asp Pro Ile 
625                 630                 635                 640 


Tyr Arg Val Glu Lys Arg Lys Leu Thr Asp Pro Leu Glu Ala Leu Lys 
                645                 650                 655     


Glu Lys Tyr Leu Arg Pro Ser Pro Gly Phe Leu Phe His Pro Gln Phe 
            660                 665                 670         


Gln Leu Pro Asp Gln Arg Thr Trp Met Ser Ala Ile Glu Asn Met Ala 
        675                 680                 685             


Glu Lys Leu Glu Ser Phe Ser Ala Leu Lys Pro Glu Ala Ser Glu Leu 
    690                 695                 700                 


Leu Gln Ser Val Pro Ser Met Phe Asn Phe Arg Ala Pro Pro Asn Ala 
705                 710                 715                 720 


Leu Pro Glu Asn Leu Leu Arg Lys Gly Lys Glu Arg Tyr Thr Cys Arg 
                725                 730                 735     


Tyr Cys Gly Lys Ile Phe Pro Arg Ser Ala Asn Leu Thr Arg His Leu 
            740                 745                 750         


Arg Thr His Thr Gly Glu Gln Pro Tyr Arg Cys Lys Tyr Cys Asp Arg 
        755                 760                 765             


Ser Phe Ser Ile Ser Ser Asn Leu Gln Arg His Val Arg Asn Ile His 
    770                 775                 780                 


Asn Lys Glu Lys Pro Phe Lys Cys His Leu Cys Asp Arg Cys Phe Gly 
785                 790                 795                 800 


Gln Gln Thr Asn Leu Asp Arg His Leu Lys Lys His Glu Asn Gly Asn 
                805                 810                 815     


Met Ser Gly Thr Ala Thr Ser Ser Pro His Ser Glu Leu Glu Ser Thr 
            820                 825                 830         


Gly Ala Ile Leu Asp Asp Lys Glu Asp Ala Tyr Phe Thr Glu Ile Arg 
        835                 840                 845             


Asn Phe Ile Gly Asn Ser Asn His Gly Ser Gln Ser Pro Arg Asn Val 
    850                 855                 860                 


Glu Glu Arg Met Asn Gly Ser His Phe Lys Asp Glu Lys Ala Leu Val 
865                 870                 875                 880 


Thr Ser Gln Asn Ser Asp Leu Leu Asp Asp Glu Glu Val Glu Asp Glu 
                885                 890                 895     


Val Leu Leu Asp Glu Glu Asp Glu Asp Asn Asp Ile Thr Gly Lys Thr 
            900                 905                 910         


Gly Lys Glu Pro Val Thr Ser Asn Leu His Glu Gly Asn Pro Glu Asp 
        915                 920                 925             


Asp Tyr Glu Glu Thr Ser Ala Leu Glu Met Ser Cys Lys Thr Ser Pro 
    930                 935                 940                 


Val Arg Tyr Lys Glu Glu Glu Tyr Lys Ser Gly Leu Ser Ala Leu Asp 
945                 950                 955                 960 


His Ile Arg His Phe Thr Asp Ser Leu Lys Met Arg Lys Met Glu Asp 
                965                 970                 975     


Asn Gln Tyr Ser Glu Ala Glu Leu Ser Ser Phe Ser Thr Ser His Val 
            980                 985                 990         


Pro Glu Glu Leu Lys Gln Pro Leu  His Arg Lys Ser Lys  Ser Gln Ala 
        995                 1000                 1005             


Tyr Ala  Met Met Leu Ser Leu  Ser Asp Lys Glu Ser  Leu His Ser 
    1010                 1015                 1020             


Thr Ser  His Ser Ser Ser Asn  Val Trp His Ser Met  Ala Arg Ala 
    1025                 1030                 1035             


Ala Ala  Glu Ser Ser Ala Ile  Gln Ser Ile Ser His  Val 
    1040                 1045                 1050     


<210>  144
<211>  4785
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human RON mRNA transcript variant 1 GenBank Accession No.: 
       NM_002447.2  GI:153946392

<400>  144
agtgtacagc ggcggctggg gcggcaggtg aggcggctgg ggcgttgctg tcgtgcgtcc       60

gcaggcgtca ggtgctcaga cccgagggcc gggaagggat ttgggtttca caggaacctg      120

gggcgggggt ccgctatctt ggggctgtcg ggaccgctgc ttaaatttgg cccagtccag      180

acctcgagtc gggcccccag ccaggcccac gcccaggtcc aggcccaggc cggtagggat      240

cctctagggt cccagctcgc ctcgatggag ctcctcccgc cgctgcctca gtccttcctg      300

ttgctgctgc tgttgcctgc caagcccgcg gcgggcgagg actggcagtg cccgcgcacc      360

ccctacgcgg cctctcgcga ctttgacgtg aagtacgtgg tgcccagctt ctccgccgga      420

ggcctggtac aggccatggt gacctacgag ggcgacagaa atgagagtgc tgtgtttgta      480

gccatacgca atcgcctgca tgtgcttggg cctgacctga agtctgtcca gagcctggcc      540

acgggccctg ctggagaccc tggctgccag acgtgtgcag cctgtggccc aggaccccac      600

ggccctcccg gtgacacaga cacaaaggtg ctggtgctgg atcccgcgct gcctgcgctg      660

gtcagttgtg gctccagcct gcagggccgc tgcttcctgc atgacctaga gccccaaggg      720

acagccgtgc atctggcagc gccagcctgc ctcttctcag cccaccataa ccggcccgat      780

gactgccccg actgtgtggc cagcccattg ggcacccgtg taactgtggt tgagcaaggc      840

caggcctcct atttctacgt ggcatcctca ctggacgcag ccgtggctgc cagcttcagc      900

ccacgctcag tgtctatcag gcgtctcaag gctgacgcct cgggattcgc accgggcttt      960

gtggcgttgt cagtgctgcc caagcatctt gtctcctaca gtattgaata cgtgcacagc     1020

ttccacacgg gagccttcgt atacttcctg actgtacagc cggccagcgt gacagatgat     1080

cctagtgccc tgcacacacg cctggcacgg cttagcgcca ctgagccaga gttgggtgac     1140

tatcgggagc tggtcctcga ctgcagattt gctccaaaac gcaggcgccg gggggcccca     1200

gaaggcggac agccctaccc tgtgctgcgg gtggcccact ccgctccagt gggtgcccaa     1260

cttgccactg agctgagcat cgccgagggc caggaagtac tatttggggt ctttgtgact     1320

ggcaaggatg gtggtcctgg cgtgggcccc aactctgtcg tctgtgcctt ccccattgac     1380

ctgctggaca cactaattga tgagggtgtg gagcgctgtt gtgaatcccc agtccatcca     1440

ggcctccggc gaggcctcga cttcttccag tcgcccagtt tttgccccaa cccgcctggc     1500

ctggaagccc tcagccccaa caccagctgc cgccacttcc ctctgctggt cagtagcagc     1560

ttctcacgtg tggacctatt caatgggctg ttgggaccag tacaggtcac tgcattgtat     1620

gtgacacgcc ttgacaacgt cacagtggca cacatgggca caatggatgg gcgtatcctg     1680

caggtggagc tggtcaggtc actaaactac ttgctgtatg tgtccaactt ctcactgggt     1740

gacagtgggc agcccgtgca gcgggatgtc agtcgtcttg gggaccacct actctttgcc     1800

tctggggacc aggttttcca ggtacctatc caaggccctg gctgccgcca cttcctgacc     1860

tgtgggcgtt gcctaagggc atggcatttc atgggctgtg gctggtgtgg gaacatgtgc     1920

ggccagcaga aggagtgtcc tggctcctgg caacaggacc actgcccacc taagcttact     1980

gagttccacc cccacagtgg acctctaagg ggcagtacaa ggctgaccct gtgtggctcc     2040

aacttctacc ttcacccttc tggtctggtg cctgagggaa cccatcaggt cactgtgggc     2100

caaagtccct gccggccact gcccaaggac agctcaaaac tcagaccagt gccccggaaa     2160

gactttgtag aggagtttga gtgtgaactg gagcccttgg gcacccaggc agtggggcct     2220

accaacgtca gcctcaccgt gactaacatg ccaccgggca agcacttccg ggtagacggc     2280

acctccgtgc tgagaggctt ctctttcatg gagccagtgc tgatagcagt gcaacccctc     2340

tttggcccac gggcaggagg cacctgtctc actcttgaag gccagagtct gtctgtaggc     2400

accagccggg ctgtgctggt caatgggact gagtgtctgc tagcacgggt cagtgagggg     2460

cagcttttat gtgccacacc ccctggggcc acggtggcca gtgtccccct tagcctgcag     2520

gtggggggtg cccaggtacc tggttcctgg accttccagt acagagaaga ccctgtcgtg     2580

ctaagcatca gccccaactg tggctacatc aactcccaca tcaccatctg tggccagcat     2640

ctaacttcag catggcactt agtgctgtca ttccatgacg ggcttagggc agtggaaagc     2700

aggtgtgaga ggcagcttcc agagcagcag ctgtgccgcc ttcctgaata tgtggtccga     2760

gacccccagg gatgggtggc agggaatctg agtgcccgag gggatggagc tgctggcttt     2820

acactgcctg gctttcgctt cctaccccca ccccatccac ccagtgccaa cctagttcca     2880

ctgaagcctg aggagcatgc cattaagttt gagtatattg ggctgggcgc tgtggctgac     2940

tgtgtgggta tcaacgtgac cgtgggtggt gagagctgcc agcacgagtt ccggggggac     3000

atggttgtct gccccctgcc cccatccctg cagcttggcc aggatggtgc cccattgcag     3060

gtctgcgtag atggtgaatg tcatatcctg ggtagagtgg tgcggccagg gccagatggg     3120

gtcccacaga gcacgctcct tggtatcctg ctgcctttgc tgctgcttgt ggctgcactg     3180

gcgactgcac tggtcttcag ctactggtgg cggaggaagc agctagttct tcctcccaac     3240

ctgaatgacc tggcatccct ggaccagact gctggagcca cacccctgcc tattctgtac     3300

tcgggctctg actacagaag tggccttgca ctccctgcca ttgatggtct ggattccacc     3360

acttgtgtcc atggagcatc cttctccgat agtgaagatg aatcctgtgt gccactgctg     3420

cggaaagagt ccatccagct aagggacctg gactctgcgc tcttggctga ggtcaaggat     3480

gtgctgattc cccatgagcg ggtggtcacc cacagtgacc gagtcattgg caaaggccac     3540

tttggagttg tctaccacgg agaatacata gaccaggccc agaatcgaat ccaatgtgcc     3600

atcaagtcac taagtcgcat cacagagatg cagcaggtgg aggccttcct gcgagagggg     3660

ctgctcatgc gtggcctgaa ccacccgaat gtgctggctc tcattggtat catgttgcca     3720

cctgagggcc tgccccatgt gctgctgccc tatatgtgcc acggtgacct gctccagttc     3780

atccgctcac ctcagcggaa ccccaccgtg aaggacctca tcagctttgg cctgcaggta     3840

gcccgcggca tggagtacct ggcagagcag aagtttgtgc acagggacct ggctgcgcgg     3900

aactgcatgc tggacgagtc attcacagtc aaggtggctg actttggttt ggcccgcgac     3960

atcctggaca gggagtacta tagtgttcaa cagcatcgcc acgctcgcct acctgtgaag     4020

tggatggcgc tggagagcct gcagacctat agatttacca ccaagtctga tgtgtggtca     4080

tttggtgtgc tgctgtggga actgctgaca cggggtgccc caccataccg ccacattgac     4140

ccttttgacc ttacccactt cctggcccag ggtcggcgcc tgccccagcc tgagtattgc     4200

cctgattctc tgtaccaagt gatgcagcaa tgctgggagg cagacccagc agtgcgaccc     4260

accttcagag tactagtggg ggaggtggag cagatagtgt ctgcactgct tggggaccat     4320

tatgtgcagc tgccagcaac ctacatgaac ttgggcccca gcacctcgca tgagatgaat     4380

gtgcgtccag aacagccgca gttctcaccc atgccaggga atgtacgccg gccccggcca     4440

ctctcagagc ctcctcggcc cacttgactt agttcttggg ctggacctgc ttagctgcct     4500

tgagctaacc ccaagctgcc tctgggccat gccaggccag agggcagtgg ccctccacct     4560

tgttcctgcc ctttaacttt cagaggcaat aggtaaatgg ggcccattag gtccctcact     4620

ccacagagtg agccagtgag ggcagtcctg caacatgtat ttatggagtg cctgctgtgg     4680

accctgtctt ctgggcacag tggactcagc agtgaccaca ccaacactga cccttgaacc     4740

aataaaggaa caaatgacta ttaaagcaca aaaaaaaaaa aaaaa                     4785


<210>  145
<211>  1400
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  RON polypeptide encoded by mRNA transcript variant 1 GenBank 
       Accession No.: NP_002438.2  GI:153946393

<400>  145

Met Glu Leu Leu Pro Pro Leu Pro Gln Ser Phe Leu Leu Leu Leu Leu 
1               5                   10                  15      


Leu Pro Ala Lys Pro Ala Ala Gly Glu Asp Trp Gln Cys Pro Arg Thr 
            20                  25                  30          


Pro Tyr Ala Ala Ser Arg Asp Phe Asp Val Lys Tyr Val Val Pro Ser 
        35                  40                  45              


Phe Ser Ala Gly Gly Leu Val Gln Ala Met Val Thr Tyr Glu Gly Asp 
    50                  55                  60                  


Arg Asn Glu Ser Ala Val Phe Val Ala Ile Arg Asn Arg Leu His Val 
65                  70                  75                  80  


Leu Gly Pro Asp Leu Lys Ser Val Gln Ser Leu Ala Thr Gly Pro Ala 
                85                  90                  95      


Gly Asp Pro Gly Cys Gln Thr Cys Ala Ala Cys Gly Pro Gly Pro His 
            100                 105                 110         


Gly Pro Pro Gly Asp Thr Asp Thr Lys Val Leu Val Leu Asp Pro Ala 
        115                 120                 125             


Leu Pro Ala Leu Val Ser Cys Gly Ser Ser Leu Gln Gly Arg Cys Phe 
    130                 135                 140                 


Leu His Asp Leu Glu Pro Gln Gly Thr Ala Val His Leu Ala Ala Pro 
145                 150                 155                 160 


Ala Cys Leu Phe Ser Ala His His Asn Arg Pro Asp Asp Cys Pro Asp 
                165                 170                 175     


Cys Val Ala Ser Pro Leu Gly Thr Arg Val Thr Val Val Glu Gln Gly 
            180                 185                 190         


Gln Ala Ser Tyr Phe Tyr Val Ala Ser Ser Leu Asp Ala Ala Val Ala 
        195                 200                 205             


Ala Ser Phe Ser Pro Arg Ser Val Ser Ile Arg Arg Leu Lys Ala Asp 
    210                 215                 220                 


Ala Ser Gly Phe Ala Pro Gly Phe Val Ala Leu Ser Val Leu Pro Lys 
225                 230                 235                 240 


His Leu Val Ser Tyr Ser Ile Glu Tyr Val His Ser Phe His Thr Gly 
                245                 250                 255     


Ala Phe Val Tyr Phe Leu Thr Val Gln Pro Ala Ser Val Thr Asp Asp 
            260                 265                 270         


Pro Ser Ala Leu His Thr Arg Leu Ala Arg Leu Ser Ala Thr Glu Pro 
        275                 280                 285             


Glu Leu Gly Asp Tyr Arg Glu Leu Val Leu Asp Cys Arg Phe Ala Pro 
    290                 295                 300                 


Lys Arg Arg Arg Arg Gly Ala Pro Glu Gly Gly Gln Pro Tyr Pro Val 
305                 310                 315                 320 


Leu Arg Val Ala His Ser Ala Pro Val Gly Ala Gln Leu Ala Thr Glu 
                325                 330                 335     


Leu Ser Ile Ala Glu Gly Gln Glu Val Leu Phe Gly Val Phe Val Thr 
            340                 345                 350         


Gly Lys Asp Gly Gly Pro Gly Val Gly Pro Asn Ser Val Val Cys Ala 
        355                 360                 365             


Phe Pro Ile Asp Leu Leu Asp Thr Leu Ile Asp Glu Gly Val Glu Arg 
    370                 375                 380                 


Cys Cys Glu Ser Pro Val His Pro Gly Leu Arg Arg Gly Leu Asp Phe 
385                 390                 395                 400 


Phe Gln Ser Pro Ser Phe Cys Pro Asn Pro Pro Gly Leu Glu Ala Leu 
                405                 410                 415     


Ser Pro Asn Thr Ser Cys Arg His Phe Pro Leu Leu Val Ser Ser Ser 
            420                 425                 430         


Phe Ser Arg Val Asp Leu Phe Asn Gly Leu Leu Gly Pro Val Gln Val 
        435                 440                 445             


Thr Ala Leu Tyr Val Thr Arg Leu Asp Asn Val Thr Val Ala His Met 
    450                 455                 460                 


Gly Thr Met Asp Gly Arg Ile Leu Gln Val Glu Leu Val Arg Ser Leu 
465                 470                 475                 480 


Asn Tyr Leu Leu Tyr Val Ser Asn Phe Ser Leu Gly Asp Ser Gly Gln 
                485                 490                 495     


Pro Val Gln Arg Asp Val Ser Arg Leu Gly Asp His Leu Leu Phe Ala 
            500                 505                 510         


Ser Gly Asp Gln Val Phe Gln Val Pro Ile Gln Gly Pro Gly Cys Arg 
        515                 520                 525             


His Phe Leu Thr Cys Gly Arg Cys Leu Arg Ala Trp His Phe Met Gly 
    530                 535                 540                 


Cys Gly Trp Cys Gly Asn Met Cys Gly Gln Gln Lys Glu Cys Pro Gly 
545                 550                 555                 560 


Ser Trp Gln Gln Asp His Cys Pro Pro Lys Leu Thr Glu Phe His Pro 
                565                 570                 575     


His Ser Gly Pro Leu Arg Gly Ser Thr Arg Leu Thr Leu Cys Gly Ser 
            580                 585                 590         


Asn Phe Tyr Leu His Pro Ser Gly Leu Val Pro Glu Gly Thr His Gln 
        595                 600                 605             


Val Thr Val Gly Gln Ser Pro Cys Arg Pro Leu Pro Lys Asp Ser Ser 
    610                 615                 620                 


Lys Leu Arg Pro Val Pro Arg Lys Asp Phe Val Glu Glu Phe Glu Cys 
625                 630                 635                 640 


Glu Leu Glu Pro Leu Gly Thr Gln Ala Val Gly Pro Thr Asn Val Ser 
                645                 650                 655     


Leu Thr Val Thr Asn Met Pro Pro Gly Lys His Phe Arg Val Asp Gly 
            660                 665                 670         


Thr Ser Val Leu Arg Gly Phe Ser Phe Met Glu Pro Val Leu Ile Ala 
        675                 680                 685             


Val Gln Pro Leu Phe Gly Pro Arg Ala Gly Gly Thr Cys Leu Thr Leu 
    690                 695                 700                 


Glu Gly Gln Ser Leu Ser Val Gly Thr Ser Arg Ala Val Leu Val Asn 
705                 710                 715                 720 


Gly Thr Glu Cys Leu Leu Ala Arg Val Ser Glu Gly Gln Leu Leu Cys 
                725                 730                 735     


Ala Thr Pro Pro Gly Ala Thr Val Ala Ser Val Pro Leu Ser Leu Gln 
            740                 745                 750         


Val Gly Gly Ala Gln Val Pro Gly Ser Trp Thr Phe Gln Tyr Arg Glu 
        755                 760                 765             


Asp Pro Val Val Leu Ser Ile Ser Pro Asn Cys Gly Tyr Ile Asn Ser 
    770                 775                 780                 


His Ile Thr Ile Cys Gly Gln His Leu Thr Ser Ala Trp His Leu Val 
785                 790                 795                 800 


Leu Ser Phe His Asp Gly Leu Arg Ala Val Glu Ser Arg Cys Glu Arg 
                805                 810                 815     


Gln Leu Pro Glu Gln Gln Leu Cys Arg Leu Pro Glu Tyr Val Val Arg 
            820                 825                 830         


Asp Pro Gln Gly Trp Val Ala Gly Asn Leu Ser Ala Arg Gly Asp Gly 
        835                 840                 845             


Ala Ala Gly Phe Thr Leu Pro Gly Phe Arg Phe Leu Pro Pro Pro His 
    850                 855                 860                 


Pro Pro Ser Ala Asn Leu Val Pro Leu Lys Pro Glu Glu His Ala Ile 
865                 870                 875                 880 


Lys Phe Glu Tyr Ile Gly Leu Gly Ala Val Ala Asp Cys Val Gly Ile 
                885                 890                 895     


Asn Val Thr Val Gly Gly Glu Ser Cys Gln His Glu Phe Arg Gly Asp 
            900                 905                 910         


Met Val Val Cys Pro Leu Pro Pro Ser Leu Gln Leu Gly Gln Asp Gly 
        915                 920                 925             


Ala Pro Leu Gln Val Cys Val Asp Gly Glu Cys His Ile Leu Gly Arg 
    930                 935                 940                 


Val Val Arg Pro Gly Pro Asp Gly Val Pro Gln Ser Thr Leu Leu Gly 
945                 950                 955                 960 


Ile Leu Leu Pro Leu Leu Leu Leu Val Ala Ala Leu Ala Thr Ala Leu 
                965                 970                 975     


Val Phe Ser Tyr Trp Trp Arg Arg Lys Gln Leu Val Leu Pro Pro Asn 
            980                 985                 990         


Leu Asn Asp Leu Ala Ser Leu Asp  Gln Thr Ala Gly Ala  Thr Pro Leu 
        995                 1000                 1005             


Pro Ile  Leu Tyr Ser Gly Ser  Asp Tyr Arg Ser Gly  Leu Ala Leu 
    1010                 1015                 1020             


Pro Ala  Ile Asp Gly Leu Asp  Ser Thr Thr Cys Val  His Gly Ala 
    1025                 1030                 1035             


Ser Phe  Ser Asp Ser Glu Asp  Glu Ser Cys Val Pro  Leu Leu Arg 
    1040                 1045                 1050             


Lys Glu  Ser Ile Gln Leu Arg  Asp Leu Asp Ser Ala  Leu Leu Ala 
    1055                 1060                 1065             


Glu Val  Lys Asp Val Leu Ile  Pro His Glu Arg Val  Val Thr His 
    1070                 1075                 1080             


Ser Asp  Arg Val Ile Gly Lys  Gly His Phe Gly Val  Val Tyr His 
    1085                 1090                 1095             


Gly Glu  Tyr Ile Asp Gln Ala  Gln Asn Arg Ile Gln  Cys Ala Ile 
    1100                 1105                 1110             


Lys Ser  Leu Ser Arg Ile Thr  Glu Met Gln Gln Val  Glu Ala Phe 
    1115                 1120                 1125             


Leu Arg  Glu Gly Leu Leu Met  Arg Gly Leu Asn His  Pro Asn Val 
    1130                 1135                 1140             


Leu Ala  Leu Ile Gly Ile Met  Leu Pro Pro Glu Gly  Leu Pro His 
    1145                 1150                 1155             


Val Leu  Leu Pro Tyr Met Cys  His Gly Asp Leu Leu  Gln Phe Ile 
    1160                 1165                 1170             


Arg Ser  Pro Gln Arg Asn Pro  Thr Val Lys Asp Leu  Ile Ser Phe 
    1175                 1180                 1185             


Gly Leu  Gln Val Ala Arg Gly  Met Glu Tyr Leu Ala  Glu Gln Lys 
    1190                 1195                 1200             


Phe Val  His Arg Asp Leu Ala  Ala Arg Asn Cys Met  Leu Asp Glu 
    1205                 1210                 1215             


Ser Phe  Thr Val Lys Val Ala  Asp Phe Gly Leu Ala  Arg Asp Ile 
    1220                 1225                 1230             


Leu Asp  Arg Glu Tyr Tyr Ser  Val Gln Gln His Arg  His Ala Arg 
    1235                 1240                 1245             


Leu Pro  Val Lys Trp Met Ala  Leu Glu Ser Leu Gln  Thr Tyr Arg 
    1250                 1255                 1260             


Phe Thr  Thr Lys Ser Asp Val  Trp Ser Phe Gly Val  Leu Leu Trp 
    1265                 1270                 1275             


Glu Leu  Leu Thr Arg Gly Ala  Pro Pro Tyr Arg His  Ile Asp Pro 
    1280                 1285                 1290             


Phe Asp  Leu Thr His Phe Leu  Ala Gln Gly Arg Arg  Leu Pro Gln 
    1295                 1300                 1305             


Pro Glu  Tyr Cys Pro Asp Ser  Leu Tyr Gln Val Met  Gln Gln Cys 
    1310                 1315                 1320             


Trp Glu  Ala Asp Pro Ala Val  Arg Pro Thr Phe Arg  Val Leu Val 
    1325                 1330                 1335             


Gly Glu  Val Glu Gln Ile Val  Ser Ala Leu Leu Gly  Asp His Tyr 
    1340                 1345                 1350             


Val Gln  Leu Pro Ala Thr Tyr  Met Asn Leu Gly Pro  Ser Thr Ser 
    1355                 1360                 1365             


His Glu  Met Asn Val Arg Pro  Glu Gln Pro Gln Phe  Ser Pro Met 
    1370                 1375                 1380             


Pro Gly  Asn Val Arg Arg Pro  Arg Pro Leu Ser Glu  Pro Pro Arg 
    1385                 1390                 1395             


Pro Thr  
    1400 


<210>  146
<211>  4638
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens macrophage stimulating 1 receptor (c-met-related 
       tyrosine kinase) (MST1R), transcript variant 2, mRNA NCBI 
       Reference Sequence: NM_001244937.1

<400>  146
agtgtacagc ggcggctggg gcggcaggtg aggcggctgg ggcgttgctg tcgtgcgtcc       60

gcaggcgtca ggtgctcaga cccgagggcc gggaagggat ttgggtttca caggaacctg      120

gggcgggggt ccgctatctt ggggctgtcg ggaccgctgc ttaaatttgg cccagtccag      180

acctcgagtc gggcccccag ccaggcccac gcccaggtcc aggcccaggc cggtagggat      240

cctctagggt cccagctcgc ctcgatggag ctcctcccgc cgctgcctca gtccttcctg      300

ttgctgctgc tgttgcctgc caagcccgcg gcgggcgagg actggcagtg cccgcgcacc      360

ccctacgcgg cctctcgcga ctttgacgtg aagtacgtgg tgcccagctt ctccgccgga      420

ggcctggtac aggccatggt gacctacgag ggcgacagaa atgagagtgc tgtgtttgta      480

gccatacgca atcgcctgca tgtgcttggg cctgacctga agtctgtcca gagcctggcc      540

acgggccctg ctggagaccc tggctgccag acgtgtgcag cctgtggccc aggaccccac      600

ggccctcccg gtgacacaga cacaaaggtg ctggtgctgg atcccgcgct gcctgcgctg      660

gtcagttgtg gctccagcct gcagggccgc tgcttcctgc atgacctaga gccccaaggg      720

acagccgtgc atctggcagc gccagcctgc ctcttctcag cccaccataa ccggcccgat      780

gactgccccg actgtgtggc cagcccattg ggcacccgtg taactgtggt tgagcaaggc      840

caggcctcct atttctacgt ggcatcctca ctggacgcag ccgtggctgc cagcttcagc      900

ccacgctcag tgtctatcag gcgtctcaag gctgacgcct cgggattcgc accgggcttt      960

gtggcgttgt cagtgctgcc caagcatctt gtctcctaca gtattgaata cgtgcacagc     1020

ttccacacgg gagccttcgt atacttcctg actgtacagc cggccagcgt gacagatgat     1080

cctagtgccc tgcacacacg cctggcacgg cttagcgcca ctgagccaga gttgggtgac     1140

tatcgggagc tggtcctcga ctgcagattt gctccaaaac gcaggcgccg gggggcccca     1200

gaaggcggac agccctaccc tgtgctgcgg gtggcccact ccgctccagt gggtgcccaa     1260

cttgccactg agctgagcat cgccgagggc caggaagtac tatttggggt ctttgtgact     1320

ggcaaggatg gtggtcctgg cgtgggcccc aactctgtcg tctgtgcctt ccccattgac     1380

ctgctggaca cactaattga tgagggtgtg gagcgctgtt gtgaatcccc agtccatcca     1440

ggcctccggc gaggcctcga cttcttccag tcgcccagtt tttgccccaa cccgcctggc     1500

ctggaagccc tcagccccaa caccagctgc cgccacttcc ctctgctggt cagtagcagc     1560

ttctcacgtg tggacctatt caatgggctg ttgggaccag tacaggtcac tgcattgtat     1620

gtgacacgcc ttgacaacgt cacagtggca cacatgggca caatggatgg gcgtatcctg     1680

caggtggagc tggtcaggtc actaaactac ttgctgtatg tgtccaactt ctcactgggt     1740

gacagtgggc agcccgtgca gcgggatgtc agtcgtcttg gggaccacct actctttgcc     1800

tctggggacc aggttttcca ggtacctatc caaggccctg gctgccgcca cttcctgacc     1860

tgtgggcgtt gcctaagggc atggcatttc atgggctgtg gctggtgtgg gaacatgtgc     1920

ggccagcaga aggagtgtcc tggctcctgg caacaggacc actgcccacc taagcttact     1980

gagttccacc cccacagtgg acctctaagg ggcagtacaa ggctgaccct gtgtggctcc     2040

aacttctacc ttcacccttc tggtctggtg cctgagggaa cccatcaggt cactgtgggc     2100

caaagtccct gccggccact gcccaaggac agctcaaaac tcagaccagt gccccggaaa     2160

gactttgtag aggagtttga gtgtgaactg gagcccttgg gcacccaggc agtggggcct     2220

accaacgtca gcctcaccgt gactaacatg ccaccgggca agcacttccg ggtagacggc     2280

acctccgtgc tgagaggctt ctctttcatg gagccagtgc tgatagcagt gcaacccctc     2340

tttggcccac gggcaggagg cacctgtctc actcttgaag gccagagtct gtctgtaggc     2400

accagccggg ctgtgctggt caatgggact gagtgtctgc tagcacgggt cagtgagggg     2460

cagcttttat gtgccacacc ccctggggcc acggtggcca gtgtccccct tagcctgcag     2520

gtggggggtg cccaggtacc tggttcctgg accttccagt acagagaaga ccctgtcgtg     2580

ctaagcatca gccccaactg tggctacatc aactcccaca tcaccatctg tggccagcat     2640

ctaacttcag catggcactt agtgctgtca ttccatgacg ggcttagggc agtggaaagc     2700

aggtgtgaga ggcagcttcc agagcagcag ctgtgccgcc ttcctgaata tgtggtccga     2760

gacccccagg gatgggtggc agggaatctg agtgcccgag gggatggagc tgctggcttt     2820

acactgcctg gctttcgctt cctaccccca ccccatccac ccagtgccaa cctagttcca     2880

ctgaagcctg aggagcatgc cattaagttt gaggtctgcg tagatggtga atgtcatatc     2940

ctgggtagag tggtgcggcc agggccagat ggggtcccac agagcacgct ccttggtatc     3000

ctgctgcctt tgctgctgct tgtggctgca ctggcgactg cactggtctt cagctactgg     3060

tggcggagga agcagctagt tcttcctccc aacctgaatg acctggcatc cctggaccag     3120

actgctggag ccacacccct gcctattctg tactcgggct ctgactacag aagtggcctt     3180

gcactccctg ccattgatgg tctggattcc accacttgtg tccatggagc atccttctcc     3240

gatagtgaag atgaatcctg tgtgccactg ctgcggaaag agtccatcca gctaagggac     3300

ctggactctg cgctcttggc tgaggtcaag gatgtgctga ttccccatga gcgggtggtc     3360

acccacagtg accgagtcat tggcaaaggc cactttggag ttgtctacca cggagaatac     3420

atagaccagg cccagaatcg aatccaatgt gccatcaagt cactaagtcg catcacagag     3480

atgcagcagg tggaggcctt cctgcgagag gggctgctca tgcgtggcct gaaccacccg     3540

aatgtgctgg ctctcattgg tatcatgttg ccacctgagg gcctgcccca tgtgctgctg     3600

ccctatatgt gccacggtga cctgctccag ttcatccgct cacctcagcg gaaccccacc     3660

gtgaaggacc tcatcagctt tggcctgcag gtagcccgcg gcatggagta cctggcagag     3720

cagaagtttg tgcacaggga cctggctgcg cggaactgca tgctggacga gtcattcaca     3780

gtcaaggtgg ctgactttgg tttggcccgc gacatcctgg acagggagta ctatagtgtt     3840

caacagcatc gccacgctcg cctacctgtg aagtggatgg cgctggagag cctgcagacc     3900

tatagattta ccaccaagtc tgatgtgtgg tcatttggtg tgctgctgtg ggaactgctg     3960

acacggggtg ccccaccata ccgccacatt gacccttttg accttaccca cttcctggcc     4020

cagggtcggc gcctgcccca gcctgagtat tgccctgatt ctctgtacca agtgatgcag     4080

caatgctggg aggcagaccc agcagtgcga cccaccttca gagtactagt gggggaggtg     4140

gagcagatag tgtctgcact gcttggggac cattatgtgc agctgccagc aacctacatg     4200

aacttgggcc ccagcacctc gcatgagatg aatgtgcgtc cagaacagcc gcagttctca     4260

cccatgccag ggaatgtacg ccggccccgg ccactctcag agcctcctcg gcccacttga     4320

cttagttctt gggctggacc tgcttagctg ccttgagcta accccaagct gcctctgggc     4380

catgccaggc cagagggcag tggccctcca ccttgttcct gccctttaac tttcagaggc     4440

aataggtaaa tggggcccat taggtccctc actccacaga gtgagccagt gagggcagtc     4500

ctgcaacatg tatttatgga gtgcctgctg tggaccctgt cttctgggca cagtggactc     4560

agcagtgacc acaccaacac tgacccttga accaataaag gaacaaatga ctattaaagc     4620

acaaaaaaaa aaaaaaaa                                                   4638


<210>  147
<211>  1351
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens macrophage stimulating 1 receptor (c-met-related 
       tyrosine kinase) (MST1R), transcript variant 2, polypeptide NCBI 
       Reference Sequence: NM_001244937.1

<400>  147

Met Glu Leu Leu Pro Pro Leu Pro Gln Ser Phe Leu Leu Leu Leu Leu 
1               5                   10                  15      


Leu Pro Ala Lys Pro Ala Ala Gly Glu Asp Trp Gln Cys Pro Arg Thr 
            20                  25                  30          


Pro Tyr Ala Ala Ser Arg Asp Phe Asp Val Lys Tyr Val Val Pro Ser 
        35                  40                  45              


Phe Ser Ala Gly Gly Leu Val Gln Ala Met Val Thr Tyr Glu Gly Asp 
    50                  55                  60                  


Arg Asn Glu Ser Ala Val Phe Val Ala Ile Arg Asn Arg Leu His Val 
65                  70                  75                  80  


Leu Gly Pro Asp Leu Lys Ser Val Gln Ser Leu Ala Thr Gly Pro Ala 
                85                  90                  95      


Gly Asp Pro Gly Cys Gln Thr Cys Ala Ala Cys Gly Pro Gly Pro His 
            100                 105                 110         


Gly Pro Pro Gly Asp Thr Asp Thr Lys Val Leu Val Leu Asp Pro Ala 
        115                 120                 125             


Leu Pro Ala Leu Val Ser Cys Gly Ser Ser Leu Gln Gly Arg Cys Phe 
    130                 135                 140                 


Leu His Asp Leu Glu Pro Gln Gly Thr Ala Val His Leu Ala Ala Pro 
145                 150                 155                 160 


Ala Cys Leu Phe Ser Ala His His Asn Arg Pro Asp Asp Cys Pro Asp 
                165                 170                 175     


Cys Val Ala Ser Pro Leu Gly Thr Arg Val Thr Val Val Glu Gln Gly 
            180                 185                 190         


Gln Ala Ser Tyr Phe Tyr Val Ala Ser Ser Leu Asp Ala Ala Val Ala 
        195                 200                 205             


Ala Ser Phe Ser Pro Arg Ser Val Ser Ile Arg Arg Leu Lys Ala Asp 
    210                 215                 220                 


Ala Ser Gly Phe Ala Pro Gly Phe Val Ala Leu Ser Val Leu Pro Lys 
225                 230                 235                 240 


His Leu Val Ser Tyr Ser Ile Glu Tyr Val His Ser Phe His Thr Gly 
                245                 250                 255     


Ala Phe Val Tyr Phe Leu Thr Val Gln Pro Ala Ser Val Thr Asp Asp 
            260                 265                 270         


Pro Ser Ala Leu His Thr Arg Leu Ala Arg Leu Ser Ala Thr Glu Pro 
        275                 280                 285             


Glu Leu Gly Asp Tyr Arg Glu Leu Val Leu Asp Cys Arg Phe Ala Pro 
    290                 295                 300                 


Lys Arg Arg Arg Arg Gly Ala Pro Glu Gly Gly Gln Pro Tyr Pro Val 
305                 310                 315                 320 


Leu Arg Val Ala His Ser Ala Pro Val Gly Ala Gln Leu Ala Thr Glu 
                325                 330                 335     


Leu Ser Ile Ala Glu Gly Gln Glu Val Leu Phe Gly Val Phe Val Thr 
            340                 345                 350         


Gly Lys Asp Gly Gly Pro Gly Val Gly Pro Asn Ser Val Val Cys Ala 
        355                 360                 365             


Phe Pro Ile Asp Leu Leu Asp Thr Leu Ile Asp Glu Gly Val Glu Arg 
    370                 375                 380                 


Cys Cys Glu Ser Pro Val His Pro Gly Leu Arg Arg Gly Leu Asp Phe 
385                 390                 395                 400 


Phe Gln Ser Pro Ser Phe Cys Pro Asn Pro Pro Gly Leu Glu Ala Leu 
                405                 410                 415     


Ser Pro Asn Thr Ser Cys Arg His Phe Pro Leu Leu Val Ser Ser Ser 
            420                 425                 430         


Phe Ser Arg Val Asp Leu Phe Asn Gly Leu Leu Gly Pro Val Gln Val 
        435                 440                 445             


Thr Ala Leu Tyr Val Thr Arg Leu Asp Asn Val Thr Val Ala His Met 
    450                 455                 460                 


Gly Thr Met Asp Gly Arg Ile Leu Gln Val Glu Leu Val Arg Ser Leu 
465                 470                 475                 480 


Asn Tyr Leu Leu Tyr Val Ser Asn Phe Ser Leu Gly Asp Ser Gly Gln 
                485                 490                 495     


Pro Val Gln Arg Asp Val Ser Arg Leu Gly Asp His Leu Leu Phe Ala 
            500                 505                 510         


Ser Gly Asp Gln Val Phe Gln Val Pro Ile Gln Gly Pro Gly Cys Arg 
        515                 520                 525             


His Phe Leu Thr Cys Gly Arg Cys Leu Arg Ala Trp His Phe Met Gly 
    530                 535                 540                 


Cys Gly Trp Cys Gly Asn Met Cys Gly Gln Gln Lys Glu Cys Pro Gly 
545                 550                 555                 560 


Ser Trp Gln Gln Asp His Cys Pro Pro Lys Leu Thr Glu Phe His Pro 
                565                 570                 575     


His Ser Gly Pro Leu Arg Gly Ser Thr Arg Leu Thr Leu Cys Gly Ser 
            580                 585                 590         


Asn Phe Tyr Leu His Pro Ser Gly Leu Val Pro Glu Gly Thr His Gln 
        595                 600                 605             


Val Thr Val Gly Gln Ser Pro Cys Arg Pro Leu Pro Lys Asp Ser Ser 
    610                 615                 620                 


Lys Leu Arg Pro Val Pro Arg Lys Asp Phe Val Glu Glu Phe Glu Cys 
625                 630                 635                 640 


Glu Leu Glu Pro Leu Gly Thr Gln Ala Val Gly Pro Thr Asn Val Ser 
                645                 650                 655     


Leu Thr Val Thr Asn Met Pro Pro Gly Lys His Phe Arg Val Asp Gly 
            660                 665                 670         


Thr Ser Val Leu Arg Gly Phe Ser Phe Met Glu Pro Val Leu Ile Ala 
        675                 680                 685             


Val Gln Pro Leu Phe Gly Pro Arg Ala Gly Gly Thr Cys Leu Thr Leu 
    690                 695                 700                 


Glu Gly Gln Ser Leu Ser Val Gly Thr Ser Arg Ala Val Leu Val Asn 
705                 710                 715                 720 


Gly Thr Glu Cys Leu Leu Ala Arg Val Ser Glu Gly Gln Leu Leu Cys 
                725                 730                 735     


Ala Thr Pro Pro Gly Ala Thr Val Ala Ser Val Pro Leu Ser Leu Gln 
            740                 745                 750         


Val Gly Gly Ala Gln Val Pro Gly Ser Trp Thr Phe Gln Tyr Arg Glu 
        755                 760                 765             


Asp Pro Val Val Leu Ser Ile Ser Pro Asn Cys Gly Tyr Ile Asn Ser 
    770                 775                 780                 


His Ile Thr Ile Cys Gly Gln His Leu Thr Ser Ala Trp His Leu Val 
785                 790                 795                 800 


Leu Ser Phe His Asp Gly Leu Arg Ala Val Glu Ser Arg Cys Glu Arg 
                805                 810                 815     


Gln Leu Pro Glu Gln Gln Leu Cys Arg Leu Pro Glu Tyr Val Val Arg 
            820                 825                 830         


Asp Pro Gln Gly Trp Val Ala Gly Asn Leu Ser Ala Arg Gly Asp Gly 
        835                 840                 845             


Ala Ala Gly Phe Thr Leu Pro Gly Phe Arg Phe Leu Pro Pro Pro His 
    850                 855                 860                 


Pro Pro Ser Ala Asn Leu Val Pro Leu Lys Pro Glu Glu His Ala Ile 
865                 870                 875                 880 


Lys Phe Glu Val Cys Val Asp Gly Glu Cys His Ile Leu Gly Arg Val 
                885                 890                 895     


Val Arg Pro Gly Pro Asp Gly Val Pro Gln Ser Thr Leu Leu Gly Ile 
            900                 905                 910         


Leu Leu Pro Leu Leu Leu Leu Val Ala Ala Leu Ala Thr Ala Leu Val 
        915                 920                 925             


Phe Ser Tyr Trp Trp Arg Arg Lys Gln Leu Val Leu Pro Pro Asn Leu 
    930                 935                 940                 


Asn Asp Leu Ala Ser Leu Asp Gln Thr Ala Gly Ala Thr Pro Leu Pro 
945                 950                 955                 960 


Ile Leu Tyr Ser Gly Ser Asp Tyr Arg Ser Gly Leu Ala Leu Pro Ala 
                965                 970                 975     


Ile Asp Gly Leu Asp Ser Thr Thr Cys Val His Gly Ala Ser Phe Ser 
            980                 985                 990         


Asp Ser Glu Asp Glu Ser Cys Val  Pro Leu Leu Arg Lys  Glu Ser Ile 
        995                 1000                 1005             


Gln Leu  Arg Asp Leu Asp Ser  Ala Leu Leu Ala Glu  Val Lys Asp 
    1010                 1015                 1020             


Val Leu  Ile Pro His Glu Arg  Val Val Thr His Ser  Asp Arg Val 
    1025                 1030                 1035             


Ile Gly  Lys Gly His Phe Gly  Val Val Tyr His Gly  Glu Tyr Ile 
    1040                 1045                 1050             


Asp Gln  Ala Gln Asn Arg Ile  Gln Cys Ala Ile Lys  Ser Leu Ser 
    1055                 1060                 1065             


Arg Ile  Thr Glu Met Gln Gln  Val Glu Ala Phe Leu  Arg Glu Gly 
    1070                 1075                 1080             


Leu Leu  Met Arg Gly Leu Asn  His Pro Asn Val Leu  Ala Leu Ile 
    1085                 1090                 1095             


Gly Ile  Met Leu Pro Pro Glu  Gly Leu Pro His Val  Leu Leu Pro 
    1100                 1105                 1110             


Tyr Met  Cys His Gly Asp Leu  Leu Gln Phe Ile Arg  Ser Pro Gln 
    1115                 1120                 1125             


Arg Asn  Pro Thr Val Lys Asp  Leu Ile Ser Phe Gly  Leu Gln Val 
    1130                 1135                 1140             


Ala Arg  Gly Met Glu Tyr Leu  Ala Glu Gln Lys Phe  Val His Arg 
    1145                 1150                 1155             


Asp Leu  Ala Ala Arg Asn Cys  Met Leu Asp Glu Ser  Phe Thr Val 
    1160                 1165                 1170             


Lys Val  Ala Asp Phe Gly Leu  Ala Arg Asp Ile Leu  Asp Arg Glu 
    1175                 1180                 1185             


Tyr Tyr  Ser Val Gln Gln His  Arg His Ala Arg Leu  Pro Val Lys 
    1190                 1195                 1200             


Trp Met  Ala Leu Glu Ser Leu  Gln Thr Tyr Arg Phe  Thr Thr Lys 
    1205                 1210                 1215             


Ser Asp  Val Trp Ser Phe Gly  Val Leu Leu Trp Glu  Leu Leu Thr 
    1220                 1225                 1230             


Arg Gly  Ala Pro Pro Tyr Arg  His Ile Asp Pro Phe  Asp Leu Thr 
    1235                 1240                 1245             


His Phe  Leu Ala Gln Gly Arg  Arg Leu Pro Gln Pro  Glu Tyr Cys 
    1250                 1255                 1260             


Pro Asp  Ser Leu Tyr Gln Val  Met Gln Gln Cys Trp  Glu Ala Asp 
    1265                 1270                 1275             


Pro Ala  Val Arg Pro Thr Phe  Arg Val Leu Val Gly  Glu Val Glu 
    1280                 1285                 1290             


Gln Ile  Val Ser Ala Leu Leu  Gly Asp His Tyr Val  Gln Leu Pro 
    1295                 1300                 1305             


Ala Thr  Tyr Met Asn Leu Gly  Pro Ser Thr Ser His  Glu Met Asn 
    1310                 1315                 1320             


Val Arg  Pro Glu Gln Pro Gln  Phe Ser Pro Met Pro  Gly Asn Val 
    1325                 1330                 1335             


Arg Arg  Pro Arg Pro Leu Ser  Glu Pro Pro Arg Pro  Thr 
    1340                 1345                 1350     


<210>  148
<211>  1488
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens soluble RON variant 1 (RON) mRNA, complete cds, 
       alternatively spliced GenBank: EU826582.1

<400>  148
atggagctcc tcccgccgct gcctcagtcc ttcctgttgc tgctgctgtt gcctgccaag       60

cccgcggcgg gcgaggactg gcagtgcccg cgcaccccct acgcggcctc tcgcgacttt      120

gacgtgaagt acgtggtgcc cagcttctcc gccggaggcc tggtacaggc catggtgacc      180

tacgagggcg acagaaatga gagtgctgtg tttgtagcca tacgcaatcg cctgcatgtg      240

cttgggcctg acctgaagtc tgtccagagc ctggccacgg gccctgctgg agaccctggc      300

tgccagacgt gtgcagcctg tggcccagga ccccacggcc ctcccggtga cacagacaca      360

aaggtgctgg tgctggatcc cgcgctgcct gcgctggtca gttgtggctc cagcctgcag      420

ggccgctgct tcctgcatga cctagagccc caagggacag ccgtgcatct ggcagcgcca      480

gcctgcctct tctcagccca ccataaccgg cccgatgact gccccgactg tgtggccagc      540

ccattgggca cccgtgtaac tgtggttgag caaggccagg cctcctattt ctacgtggca      600

tcctcactgg acgcagccgt ggctgccagc ttcagcccac gctcagtgtc tatcaggcgt      660

ctcaaggctg acgcctcggg attcgcaccg ggctttgtgg cgttgtcagt gctgcccaag      720

catcttgtct cctacagtat tgaatacgtg cacagcttcc acacgggagc cttcgtatac      780

ttcctgactg tacagccggc cagcgtgaca gatgatccta gtgccctgca cacacgcctg      840

gcacggctta gcgccactga gccagagttg ggtgactatc gggagctggt cctcgactgc      900

agatttgctc caaaacgcag gcgccggggg gccccagaag gcggacagcc ctaccctgtg      960

ctgcgggtgg cccactccgc tccagtgggt gcccaacttg ccactgagct gagcatcgcc     1020

gagggccagg aagtactatt tggggtcttt gtgactggca aggatggtgg tcctggcgtg     1080

ggccccaact ctgtcgtctg tgccttcccc attgacctgc tggacacact aattgatgag     1140

ggtgtggagc gctgttgtga atccccagtc catccaggcc tccggcgagg cctcgacttc     1200

ttccagtcgc ccagtttttg ccccaacccg cctggcctgg aagccctcag ccccaacacc     1260

agctgccgcc acttccctct gctggtcagt agcagcttct cacgtgtgga cctattcaat     1320

gggctgttgg gaccagtaca ggtcactgca ttgtatgtga cacgccttga caacgtcaca     1380

gtggcacaca tgggcacaat ggatgggcgt atcctgcagg tgggtcctca tccccacagt     1440

cccctagccc tgggtccttg tctccatccc cattttgctc acatctga                  1488


<210>  149
<211>  495
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens soluble RON variant 1 (RON) polypeptide, complete 
       cds, alternatively spliced GenBank: EU826582.1

<400>  149

Met Glu Leu Leu Pro Pro Leu Pro Gln Ser Phe Leu Leu Leu Leu Leu 
1               5                   10                  15      


Leu Pro Ala Lys Pro Ala Ala Gly Glu Asp Trp Gln Cys Pro Arg Thr 
            20                  25                  30          


Pro Tyr Ala Ala Ser Arg Asp Phe Asp Val Lys Tyr Val Val Pro Ser 
        35                  40                  45              


Phe Ser Ala Gly Gly Leu Val Gln Ala Met Val Thr Tyr Glu Gly Asp 
    50                  55                  60                  


Arg Asn Glu Ser Ala Val Phe Val Ala Ile Arg Asn Arg Leu His Val 
65                  70                  75                  80  


Leu Gly Pro Asp Leu Lys Ser Val Gln Ser Leu Ala Thr Gly Pro Ala 
                85                  90                  95      


Gly Asp Pro Gly Cys Gln Thr Cys Ala Ala Cys Gly Pro Gly Pro His 
            100                 105                 110         


Gly Pro Pro Gly Asp Thr Asp Thr Lys Val Leu Val Leu Asp Pro Ala 
        115                 120                 125             


Leu Pro Ala Leu Val Ser Cys Gly Ser Ser Leu Gln Gly Arg Cys Phe 
    130                 135                 140                 


Leu His Asp Leu Glu Pro Gln Gly Thr Ala Val His Leu Ala Ala Pro 
145                 150                 155                 160 


Ala Cys Leu Phe Ser Ala His His Asn Arg Pro Asp Asp Cys Pro Asp 
                165                 170                 175     


Cys Val Ala Ser Pro Leu Gly Thr Arg Val Thr Val Val Glu Gln Gly 
            180                 185                 190         


Gln Ala Ser Tyr Phe Tyr Val Ala Ser Ser Leu Asp Ala Ala Val Ala 
        195                 200                 205             


Ala Ser Phe Ser Pro Arg Ser Val Ser Ile Arg Arg Leu Lys Ala Asp 
    210                 215                 220                 


Ala Ser Gly Phe Ala Pro Gly Phe Val Ala Leu Ser Val Leu Pro Lys 
225                 230                 235                 240 


His Leu Val Ser Tyr Ser Ile Glu Tyr Val His Ser Phe His Thr Gly 
                245                 250                 255     


Ala Phe Val Tyr Phe Leu Thr Val Gln Pro Ala Ser Val Thr Asp Asp 
            260                 265                 270         


Pro Ser Ala Leu His Thr Arg Leu Ala Arg Leu Ser Ala Thr Glu Pro 
        275                 280                 285             


Glu Leu Gly Asp Tyr Arg Glu Leu Val Leu Asp Cys Arg Phe Ala Pro 
    290                 295                 300                 


Lys Arg Arg Arg Arg Gly Ala Pro Glu Gly Gly Gln Pro Tyr Pro Val 
305                 310                 315                 320 


Leu Arg Val Ala His Ser Ala Pro Val Gly Ala Gln Leu Ala Thr Glu 
                325                 330                 335     


Leu Ser Ile Ala Glu Gly Gln Glu Val Leu Phe Gly Val Phe Val Thr 
            340                 345                 350         


Gly Lys Asp Gly Gly Pro Gly Val Gly Pro Asn Ser Val Val Cys Ala 
        355                 360                 365             


Phe Pro Ile Asp Leu Leu Asp Thr Leu Ile Asp Glu Gly Val Glu Arg 
    370                 375                 380                 


Cys Cys Glu Ser Pro Val His Pro Gly Leu Arg Arg Gly Leu Asp Phe 
385                 390                 395                 400 


Phe Gln Ser Pro Ser Phe Cys Pro Asn Pro Pro Gly Leu Glu Ala Leu 
                405                 410                 415     


Ser Pro Asn Thr Ser Cys Arg His Phe Pro Leu Leu Val Ser Ser Ser 
            420                 425                 430         


Phe Ser Arg Val Asp Leu Phe Asn Gly Leu Leu Gly Pro Val Gln Val 
        435                 440                 445             


Thr Ala Leu Tyr Val Thr Arg Leu Asp Asn Val Thr Val Ala His Met 
    450                 455                 460                 


Gly Thr Met Asp Gly Arg Ile Leu Gln Val Gly Pro His Pro His Ser 
465                 470                 475                 480 


Pro Leu Ala Leu Gly Pro Cys Leu His Pro His Phe Ala His Ile 
                485                 490                 495 


<210>  150
<211>  1626
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens soluble RON variant 2 (RON) mRNA, complete cds, 
       alternatively spliced GenBank: EU826583.1

<400>  150
atggagctcc tcccgccgct gcctcagtcc ttcctgttgc tgctgctgtt gcctgccaag       60

cccgcggcgg gcgaggactg gcagtgcccg cgcaccccct acgcggcctc tcgcgacttt      120

gacgtgaagt acgtggtgcc cagcttctcc gccggaggcc tggtacaggc catggtgacc      180

tacgagggcg acagaaatga gagtgctgtg tttgtagcca tacgcaatcg cctgcatgtg      240

cttgggcctg acctgaagtc tgtccagagc ctggccacgg gccctgctgg agaccctggc      300

tgccagacgt gtgcagcctg tggcccagga ccccacggcc ctcccggtga cacagacaca      360

aaggtgctgg tgctggatcc cgcgctgcct gcgctggtca gttgtggctc cagcctgcag      420

ggccgctgct tcctgcatga cctagagccc caagggacag ccgtgcatct ggcagcgcca      480

gcctgcctct tctcagccca ccataaccgg cccgatgact gccccgactg tgtggccagc      540

ccattgggca cccgtgtaac tgtggttgag caaggccagg cctcctattt ctacgtggca      600

tcctcactgg acgcagccgt ggctgccagc ttcagcccac gctcagtgtc tatcaggcgt      660

ctcaaggctg acgcctcggg attcgcaccg ggctttgtgg cgttgtcagt gctgcccaag      720

catcttgtct cctacagtat tgaatacgtg cacagcttcc acacgggagc cttcgtatac      780

ttcctgactg tacagccggc cagcgtgaca gatgatccta gtgccctgca cacacgcctg      840

gcacggctta gcgccactga gccagagttg ggtgactatc gggagctggt cctcgactgc      900

agatttgctc caaaacgcag gcgccggggg gccccagaag gcggacagcc ctaccctgtg      960

ctgcgggtgg cccactccgc tccagtgggt gcccaacttg ccactgagct gagcatcgcc     1020

gagggccagg aagtactatt tggggtcttt gtgactggca aggatggtgg tcctggcgtg     1080

ggccccaact ctgtcgtctg tgccttcccc attgacctgc tggacacact aattgatgag     1140

ggtgtggagc gctgttgtga atccccagtc catccaggcc tccggcgagg cctcgacttc     1200

ttccagtcgc ccagtttttg ccccaacccg gttttccagg tacctatcca aggccctggc     1260

tgccgccact tcctgacctg tgggcgttgc ctaagggcat ggcatttcat gggctgtggc     1320

tggtgtggga acatgtgcgg ccagcagaag gagtgtcctg gctcctggca acaggaccac     1380

tgcccaccta agcttactga gttccacccc cacagtggac ctctaagggg cagtacaagg     1440

ctgaccctgt gtggctccaa cttctacctt cacccttctg gtctggtgcc tgagggaacc     1500

catcaggtca ctgtgggcca aagtccctgc cggccactgc ccaaggacag ctcaaaactc     1560

aggtacaatc tggtccctcc cctccctttc cctgaagggg gaaaccaagc agccccttcc     1620

ccatga                                                                1626


<210>  151
<211>  541
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens soluble RON variant 2 (RON) polypeptide, complete 
       cds, alternatively spliced GenBank: EU826583.1

<400>  151

Met Glu Leu Leu Pro Pro Leu Pro Gln Ser Phe Leu Leu Leu Leu Leu 
1               5                   10                  15      


Leu Pro Ala Lys Pro Ala Ala Gly Glu Asp Trp Gln Cys Pro Arg Thr 
            20                  25                  30          


Pro Tyr Ala Ala Ser Arg Asp Phe Asp Val Lys Tyr Val Val Pro Ser 
        35                  40                  45              


Phe Ser Ala Gly Gly Leu Val Gln Ala Met Val Thr Tyr Glu Gly Asp 
    50                  55                  60                  


Arg Asn Glu Ser Ala Val Phe Val Ala Ile Arg Asn Arg Leu His Val 
65                  70                  75                  80  


Leu Gly Pro Asp Leu Lys Ser Val Gln Ser Leu Ala Thr Gly Pro Ala 
                85                  90                  95      


Gly Asp Pro Gly Cys Gln Thr Cys Ala Ala Cys Gly Pro Gly Pro His 
            100                 105                 110         


Gly Pro Pro Gly Asp Thr Asp Thr Lys Val Leu Val Leu Asp Pro Ala 
        115                 120                 125             


Leu Pro Ala Leu Val Ser Cys Gly Ser Ser Leu Gln Gly Arg Cys Phe 
    130                 135                 140                 


Leu His Asp Leu Glu Pro Gln Gly Thr Ala Val His Leu Ala Ala Pro 
145                 150                 155                 160 


Ala Cys Leu Phe Ser Ala His His Asn Arg Pro Asp Asp Cys Pro Asp 
                165                 170                 175     


Cys Val Ala Ser Pro Leu Gly Thr Arg Val Thr Val Val Glu Gln Gly 
            180                 185                 190         


Gln Ala Ser Tyr Phe Tyr Val Ala Ser Ser Leu Asp Ala Ala Val Ala 
        195                 200                 205             


Ala Ser Phe Ser Pro Arg Ser Val Ser Ile Arg Arg Leu Lys Ala Asp 
    210                 215                 220                 


Ala Ser Gly Phe Ala Pro Gly Phe Val Ala Leu Ser Val Leu Pro Lys 
225                 230                 235                 240 


His Leu Val Ser Tyr Ser Ile Glu Tyr Val His Ser Phe His Thr Gly 
                245                 250                 255     


Ala Phe Val Tyr Phe Leu Thr Val Gln Pro Ala Ser Val Thr Asp Asp 
            260                 265                 270         


Pro Ser Ala Leu His Thr Arg Leu Ala Arg Leu Ser Ala Thr Glu Pro 
        275                 280                 285             


Glu Leu Gly Asp Tyr Arg Glu Leu Val Leu Asp Cys Arg Phe Ala Pro 
    290                 295                 300                 


Lys Arg Arg Arg Arg Gly Ala Pro Glu Gly Gly Gln Pro Tyr Pro Val 
305                 310                 315                 320 


Leu Arg Val Ala His Ser Ala Pro Val Gly Ala Gln Leu Ala Thr Glu 
                325                 330                 335     


Leu Ser Ile Ala Glu Gly Gln Glu Val Leu Phe Gly Val Phe Val Thr 
            340                 345                 350         


Gly Lys Asp Gly Gly Pro Gly Val Gly Pro Asn Ser Val Val Cys Ala 
        355                 360                 365             


Phe Pro Ile Asp Leu Leu Asp Thr Leu Ile Asp Glu Gly Val Glu Arg 
    370                 375                 380                 


Cys Cys Glu Ser Pro Val His Pro Gly Leu Arg Arg Gly Leu Asp Phe 
385                 390                 395                 400 


Phe Gln Ser Pro Ser Phe Cys Pro Asn Pro Val Phe Gln Val Pro Ile 
                405                 410                 415     


Gln Gly Pro Gly Cys Arg His Phe Leu Thr Cys Gly Arg Cys Leu Arg 
            420                 425                 430         


Ala Trp His Phe Met Gly Cys Gly Trp Cys Gly Asn Met Cys Gly Gln 
        435                 440                 445             


Gln Lys Glu Cys Pro Gly Ser Trp Gln Gln Asp His Cys Pro Pro Lys 
    450                 455                 460                 


Leu Thr Glu Phe His Pro His Ser Gly Pro Leu Arg Gly Ser Thr Arg 
465                 470                 475                 480 


Leu Thr Leu Cys Gly Ser Asn Phe Tyr Leu His Pro Ser Gly Leu Val 
                485                 490                 495     


Pro Glu Gly Thr His Gln Val Thr Val Gly Gln Ser Pro Cys Arg Pro 
            500                 505                 510         


Leu Pro Lys Asp Ser Ser Lys Leu Arg Tyr Asn Leu Val Pro Pro Leu 
        515                 520                 525             


Pro Phe Pro Glu Gly Gly Asn Gln Ala Ala Pro Ser Pro 
    530                 535                 540     


<210>  152
<211>  2727
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens soluble RON variant 3 (RON) mRNA, complete cds, 
       alternatively spliced GenBank: EU826584.1

<400>  152
atggagctcc tcccgccgct gcctcagtcc ttcctgttgc tgctgctgtt gcctgccaag       60

cccgcggcgg gcgaggactg gcagtgcccg cgcaccccct acgcggcctc tcgcgacttt      120

gacgtgaagt acgtggtgcc cagcttctcc gccggaggcc tggtacaggc catggtgacc      180

tacgagggcg acagaaatga gagtgctgtg tttgtagcca tacgcaatcg cctgcatgtg      240

cttgggcctg acctgaagtc tgtccagagc ctggccacgg gccctgctgg agaccctggc      300

tgccagacgt gtgcagcctg tggcccagga ccccacggcc ctcccggtga cacagacaca      360

aaggtgctgg tgctggatcc cgcgctgcct gcgctggtca gttgtggctc cagcctgcag      420

ggccgctgct tcctgcatga cctagagccc caagggacag ccgtgcatct ggcagcgcca      480

gcctgcctct tctcagccca ccataaccgg cccgatgact gccccgactg tgtggccagc      540

ccattgggca cccgtgtaac tgtggttgag caaggccagg cctcctattt ctacgtggca      600

tcctcactgg acgcagccgt ggctgccagc ttcagcccac gctcagtgtc tatcaggcgt      660

ctcaaggctg acgcctcggg attcgcaccg ggctttgtgg cgttgtcagt gctgcccaag      720

catcttgtct cctacagtat tgaatacgtg cacagcttcc acacgggagc cttcgtatac      780

ttcctgactg tacagccggc cagcgtgaca gatgatccta gtgccctgca cacacgcctg      840

gcacggctta gcgccactga gccagagttg ggtgactatc gggagctggt cctcgactgc      900

agatttgctc caaaacgcag gcgccggggg gccccagaag gcggacagcc ctaccctgtg      960

ctgcgggtgg cccactccgc tccagtgggt gcccaacttg ccactgagct gagcatcgcc     1020

gagggccagg aagtactatt tggggtcttt gtgactggca aggatggtgg tcctggcgtg     1080

ggccccaact ctgtcgtctg tgccttcccc attgacctgc tggacacact aattgatgag     1140

ggtgtggagc gctgttgtga atccccagtc catccaggcc tccggcgagg cctcgacttc     1200

ttccagtcgc ccagtttttg ccccaacccg cctggcctgg aagccctcag ccccaacacc     1260

agctgccgcc acttccctct gctggtcagt agcagcttct cacgtgtgga cctattcaat     1320

gggctgttgg gaccagtaca ggtcactgca ttgtatgtga cacgccttga caacgtcaca     1380

gtggcacaca tgggcacaat ggatgggcgt atcctgcagg tggagctggt caggtcacta     1440

aactacttgc tgtatgtgtc caacttctca ctgggtgaca gtgggcagcc cgtgcagcgg     1500

gatgtcagtc gtcttgggga ccacctactc tttgcctctg gggaccaggt tttccaggta     1560

cctatccaag gccctggctg ccgccacttc ctgacctgtg ggcgttgcct aagggcatgg     1620

catttcatgg gctgtggctg gtgtgggaac atgtgcggcc agcagaagga gtgtcctggc     1680

tcctggcaac aggaccactg cccacctaag cttactgagt tccaccccca cagtggacct     1740

ctaaggggca gtacaaggct gaccctgtgt ggctccaact tctaccttca cccttctggt     1800

ctggtgcctg agggaaccca tcaggtcact gtgggccaaa gtccctgccg gccactgccc     1860

aaggacagct caaaactcag accagtgccc cggaaagact ttgtagagga gtttgagtgt     1920

gaactggagc ccttgggcac ccaggcagtg gggcctacca acgtcagcct caccgtgact     1980

aacatgccac cgggcaagca cttccgggta gacggcacct ccgtgctgag aggcttctct     2040

ttcatggagc cagtgctgat agcagtgcaa cccctctttg gcccacgggc aggaggcacc     2100

tgtctcactc ttgaaggcca gagtctgtct gtaggcacca gccgggctgt gctggtcaat     2160

gggactgagt gtctgctagc acgggtcagt gaggggcagc ttttatgtgc cacaccccct     2220

ggggccacgg tggccagtgt cccccttagc ctgcaggtgg ggggtgccca ggtacctggt     2280

tcctggacct tccagtacag agaagaccct gtcgtgctaa gcatcagccc caactgtggc     2340

tacatcaact cccacatcac catctgtggc cagcatctaa cttcagcatg gcacttagtg     2400

ctgtcattcc atgacgggct tagggcagtg gaaagcaggc agtgtgagag gcagcttcca     2460

gagcagcagc tgtgccgcct tcctgaatat gtggtccgag acccccaggg atgggtggca     2520

gggaatctga gtgcccgagg ggatggagct gctggcttta cactgcctgg ctttcgcttc     2580

ctacccccac cccatccacc cagtgccaac ctagttccac tgaagcctga ggagcatgcc     2640

attaagtttg aggtaagtgt aagggatagg ggcagggaca gttggggatc tgaaagtagg     2700

ggccagccta ctggctggtc ctcatga                                         2727


<210>  153
<211>  908
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens soluble RON variant 3 (RON) polypeptide, complete 
       cds, alternatively spliced GenBank: EU826584.1

<400>  153

Met Glu Leu Leu Pro Pro Leu Pro Gln Ser Phe Leu Leu Leu Leu Leu 
1               5                   10                  15      


Leu Pro Ala Lys Pro Ala Ala Gly Glu Asp Trp Gln Cys Pro Arg Thr 
            20                  25                  30          


Pro Tyr Ala Ala Ser Arg Asp Phe Asp Val Lys Tyr Val Val Pro Ser 
        35                  40                  45              


Phe Ser Ala Gly Gly Leu Val Gln Ala Met Val Thr Tyr Glu Gly Asp 
    50                  55                  60                  


Arg Asn Glu Ser Ala Val Phe Val Ala Ile Arg Asn Arg Leu His Val 
65                  70                  75                  80  


Leu Gly Pro Asp Leu Lys Ser Val Gln Ser Leu Ala Thr Gly Pro Ala 
                85                  90                  95      


Gly Asp Pro Gly Cys Gln Thr Cys Ala Ala Cys Gly Pro Gly Pro His 
            100                 105                 110         


Gly Pro Pro Gly Asp Thr Asp Thr Lys Val Leu Val Leu Asp Pro Ala 
        115                 120                 125             


Leu Pro Ala Leu Val Ser Cys Gly Ser Ser Leu Gln Gly Arg Cys Phe 
    130                 135                 140                 


Leu His Asp Leu Glu Pro Gln Gly Thr Ala Val His Leu Ala Ala Pro 
145                 150                 155                 160 


Ala Cys Leu Phe Ser Ala His His Asn Arg Pro Asp Asp Cys Pro Asp 
                165                 170                 175     


Cys Val Ala Ser Pro Leu Gly Thr Arg Val Thr Val Val Glu Gln Gly 
            180                 185                 190         


Gln Ala Ser Tyr Phe Tyr Val Ala Ser Ser Leu Asp Ala Ala Val Ala 
        195                 200                 205             


Ala Ser Phe Ser Pro Arg Ser Val Ser Ile Arg Arg Leu Lys Ala Asp 
    210                 215                 220                 


Ala Ser Gly Phe Ala Pro Gly Phe Val Ala Leu Ser Val Leu Pro Lys 
225                 230                 235                 240 


His Leu Val Ser Tyr Ser Ile Glu Tyr Val His Ser Phe His Thr Gly 
                245                 250                 255     


Ala Phe Val Tyr Phe Leu Thr Val Gln Pro Ala Ser Val Thr Asp Asp 
            260                 265                 270         


Pro Ser Ala Leu His Thr Arg Leu Ala Arg Leu Ser Ala Thr Glu Pro 
        275                 280                 285             


Glu Leu Gly Asp Tyr Arg Glu Leu Val Leu Asp Cys Arg Phe Ala Pro 
    290                 295                 300                 


Lys Arg Arg Arg Arg Gly Ala Pro Glu Gly Gly Gln Pro Tyr Pro Val 
305                 310                 315                 320 


Leu Arg Val Ala His Ser Ala Pro Val Gly Ala Gln Leu Ala Thr Glu 
                325                 330                 335     


Leu Ser Ile Ala Glu Gly Gln Glu Val Leu Phe Gly Val Phe Val Thr 
            340                 345                 350         


Gly Lys Asp Gly Gly Pro Gly Val Gly Pro Asn Ser Val Val Cys Ala 
        355                 360                 365             


Phe Pro Ile Asp Leu Leu Asp Thr Leu Ile Asp Glu Gly Val Glu Arg 
    370                 375                 380                 


Cys Cys Glu Ser Pro Val His Pro Gly Leu Arg Arg Gly Leu Asp Phe 
385                 390                 395                 400 


Phe Gln Ser Pro Ser Phe Cys Pro Asn Pro Pro Gly Leu Glu Ala Leu 
                405                 410                 415     


Ser Pro Asn Thr Ser Cys Arg His Phe Pro Leu Leu Val Ser Ser Ser 
            420                 425                 430         


Phe Ser Arg Val Asp Leu Phe Asn Gly Leu Leu Gly Pro Val Gln Val 
        435                 440                 445             


Thr Ala Leu Tyr Val Thr Arg Leu Asp Asn Val Thr Val Ala His Met 
    450                 455                 460                 


Gly Thr Met Asp Gly Arg Ile Leu Gln Val Glu Leu Val Arg Ser Leu 
465                 470                 475                 480 


Asn Tyr Leu Leu Tyr Val Ser Asn Phe Ser Leu Gly Asp Ser Gly Gln 
                485                 490                 495     


Pro Val Gln Arg Asp Val Ser Arg Leu Gly Asp His Leu Leu Phe Ala 
            500                 505                 510         


Ser Gly Asp Gln Val Phe Gln Val Pro Ile Gln Gly Pro Gly Cys Arg 
        515                 520                 525             


His Phe Leu Thr Cys Gly Arg Cys Leu Arg Ala Trp His Phe Met Gly 
    530                 535                 540                 


Cys Gly Trp Cys Gly Asn Met Cys Gly Gln Gln Lys Glu Cys Pro Gly 
545                 550                 555                 560 


Ser Trp Gln Gln Asp His Cys Pro Pro Lys Leu Thr Glu Phe His Pro 
                565                 570                 575     


His Ser Gly Pro Leu Arg Gly Ser Thr Arg Leu Thr Leu Cys Gly Ser 
            580                 585                 590         


Asn Phe Tyr Leu His Pro Ser Gly Leu Val Pro Glu Gly Thr His Gln 
        595                 600                 605             


Val Thr Val Gly Gln Ser Pro Cys Arg Pro Leu Pro Lys Asp Ser Ser 
    610                 615                 620                 


Lys Leu Arg Pro Val Pro Arg Lys Asp Phe Val Glu Glu Phe Glu Cys 
625                 630                 635                 640 


Glu Leu Glu Pro Leu Gly Thr Gln Ala Val Gly Pro Thr Asn Val Ser 
                645                 650                 655     


Leu Thr Val Thr Asn Met Pro Pro Gly Lys His Phe Arg Val Asp Gly 
            660                 665                 670         


Thr Ser Val Leu Arg Gly Phe Ser Phe Met Glu Pro Val Leu Ile Ala 
        675                 680                 685             


Val Gln Pro Leu Phe Gly Pro Arg Ala Gly Gly Thr Cys Leu Thr Leu 
    690                 695                 700                 


Glu Gly Gln Ser Leu Ser Val Gly Thr Ser Arg Ala Val Leu Val Asn 
705                 710                 715                 720 


Gly Thr Glu Cys Leu Leu Ala Arg Val Ser Glu Gly Gln Leu Leu Cys 
                725                 730                 735     


Ala Thr Pro Pro Gly Ala Thr Val Ala Ser Val Pro Leu Ser Leu Gln 
            740                 745                 750         


Val Gly Gly Ala Gln Val Pro Gly Ser Trp Thr Phe Gln Tyr Arg Glu 
        755                 760                 765             


Asp Pro Val Val Leu Ser Ile Ser Pro Asn Cys Gly Tyr Ile Asn Ser 
    770                 775                 780                 


His Ile Thr Ile Cys Gly Gln His Leu Thr Ser Ala Trp His Leu Val 
785                 790                 795                 800 


Leu Ser Phe His Asp Gly Leu Arg Ala Val Glu Ser Arg Gln Cys Glu 
                805                 810                 815     


Arg Gln Leu Pro Glu Gln Gln Leu Cys Arg Leu Pro Glu Tyr Val Val 
            820                 825                 830         


Arg Asp Pro Gln Gly Trp Val Ala Gly Asn Leu Ser Ala Arg Gly Asp 
        835                 840                 845             


Gly Ala Ala Gly Phe Thr Leu Pro Gly Phe Arg Phe Leu Pro Pro Pro 
    850                 855                 860                 


His Pro Pro Ser Ala Asn Leu Val Pro Leu Lys Pro Glu Glu His Ala 
865                 870                 875                 880 


Ile Lys Phe Glu Val Ser Val Arg Asp Arg Gly Arg Asp Ser Trp Gly 
                885                 890                 895     


Ser Glu Ser Arg Gly Gln Pro Thr Gly Trp Ser Ser 
            900                 905             


<210>  154
<211>  1944
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens soluble RON variant 4 (RON) mRNA, complete cds, 
       alternatively spliced GenBank: EU826585.1

<400>  154
atggagctcc tcccgccgct gcctcagtcc ttcctgttgc tgctgctgtt gcctgccaag       60

cccgcggcgg gcgaggactg gcagtgcccg cgcaccccct acgcggcctc tcgcgacttt      120

gacgtgaagt acgtggtgcc cagcttctcc gccggaggcc tggtacaggc catggtgacc      180

tacgagggcg acagaaatga gagtgctgtg tttgtagcca tacgcaatcg cctgcatgtg      240

cttgggcctg acctgaagtc tgtccagagc ctggccacgg gccctgctgg agaccctggc      300

tgccagacgt gtgcagcctg tggcccagga ccccacggcc ctcccggtga cacagacaca      360

aaggtgctgg tgctggatcc cgcgctgcct gcgctggtca gttgtggctc cagcctgcag      420

ggccgctgct tcctgcatga cctagagccc caagggacag ccgtgcatct ggcagcgcca      480

gcctgcctct tctcagccca ccataaccgg cccgatgact gccccgactg tgtggccagc      540

ccattgggca cccgtgtaac tgtggttgag caaggccagg cctcctattt ctacgtggca      600

tcctcactgg acgcagccgt ggctgccagc ttcagcccac gctcagtgtc tatcaggcgt      660

ctcaaggctg acgcctcggg attcgcaccg ggctttgtgg cgttgtcagt gctgcccaag      720

catcttgtct cctacagtat tgaatacgtg cacagcttcc acacgggagc cttcgtatac      780

ttcctgactg tacagccggc cagcgtgaca gatgatccta gtgccctgca cacacgcctg      840

gcacggctta gcgccactga gccagagttg ggtgactatc gggagctggt cctcgactgc      900

agatttgctc caaaacgcag gcgccggggg gccccagaag gcggacagcc ctaccctgtg      960

ctgcgggtgg cccactccgc tccagtgggt gcccaacttg ccactgagct gagcatcgcc     1020

gagggccagg aagtactatt tggggtcttt gtgactggca aggatggtgg tcctggcgtg     1080

ggccccaact ctgtcgtctg tgccttcccc attgacctgc tggacacact aattgatgag     1140

ggtgtggagc gctgttgtga atccccagtc catccaggcc tccggcgagg cctcgacttc     1200

ttccagtcgc ccagtttttg ccccaacccg cctggcctgg aagccctcag ccccaacacc     1260

agctgccgcc acttccctct gctggtcagt agcagcttct cacgtgtgga cctattcaat     1320

gggctgttgg gaccagtaca ggtcactgca ttgtatgtga cacgccttga caacgtcaca     1380

gtggcacaca tgggcacaat ggatgggcgt atcctgcagg tggagctggt caggtcacta     1440

aactacttgc tgtatgtgtc caacttctca ctgggtgaca gtgggcagcc cgtgcagcgg     1500

gatgtcagtc gtcttgggga ccacctactc tttgcctctg gggaccaggt tttccaggta     1560

cctatccaag gccctggctg ccgccacttc ctgacctgtg ggcgttgcct aagggcatgg     1620

catttcatgg gctgtggctg gtgtgggaac atgtgcggcc agcagaagga gtgtcctggc     1680

tcctggcaac aggaccactg cccacctaag cttactgagt tccaccccca cagtggacct     1740

ctaaggggca gtacaaggct gaccctgtgt ggctccaact tctaccttca cccttctggt     1800

ctggtgcctg agggaaccca tcaggtcact gtgggccaaa gtccctgccg gccactgccc     1860

aaggacagct caaaactcag gtacaatctg gtccctcccc tccctttccc tgaaggggga     1920

aaccaagcag ccccttcccc atga                                            1944


<210>  155
<211>  647
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens soluble RON variant 4 (RON) polypeptide, complete 
       cds, alternatively spliced GenBank: EU826585.1

<400>  155

Met Glu Leu Leu Pro Pro Leu Pro Gln Ser Phe Leu Leu Leu Leu Leu 
1               5                   10                  15      


Leu Pro Ala Lys Pro Ala Ala Gly Glu Asp Trp Gln Cys Pro Arg Thr 
            20                  25                  30          


Pro Tyr Ala Ala Ser Arg Asp Phe Asp Val Lys Tyr Val Val Pro Ser 
        35                  40                  45              


Phe Ser Ala Gly Gly Leu Val Gln Ala Met Val Thr Tyr Glu Gly Asp 
    50                  55                  60                  


Arg Asn Glu Ser Ala Val Phe Val Ala Ile Arg Asn Arg Leu His Val 
65                  70                  75                  80  


Leu Gly Pro Asp Leu Lys Ser Val Gln Ser Leu Ala Thr Gly Pro Ala 
                85                  90                  95      


Gly Asp Pro Gly Cys Gln Thr Cys Ala Ala Cys Gly Pro Gly Pro His 
            100                 105                 110         


Gly Pro Pro Gly Asp Thr Asp Thr Lys Val Leu Val Leu Asp Pro Ala 
        115                 120                 125             


Leu Pro Ala Leu Val Ser Cys Gly Ser Ser Leu Gln Gly Arg Cys Phe 
    130                 135                 140                 


Leu His Asp Leu Glu Pro Gln Gly Thr Ala Val His Leu Ala Ala Pro 
145                 150                 155                 160 


Ala Cys Leu Phe Ser Ala His His Asn Arg Pro Asp Asp Cys Pro Asp 
                165                 170                 175     


Cys Val Ala Ser Pro Leu Gly Thr Arg Val Thr Val Val Glu Gln Gly 
            180                 185                 190         


Gln Ala Ser Tyr Phe Tyr Val Ala Ser Ser Leu Asp Ala Ala Val Ala 
        195                 200                 205             


Ala Ser Phe Ser Pro Arg Ser Val Ser Ile Arg Arg Leu Lys Ala Asp 
    210                 215                 220                 


Ala Ser Gly Phe Ala Pro Gly Phe Val Ala Leu Ser Val Leu Pro Lys 
225                 230                 235                 240 


His Leu Val Ser Tyr Ser Ile Glu Tyr Val His Ser Phe His Thr Gly 
                245                 250                 255     


Ala Phe Val Tyr Phe Leu Thr Val Gln Pro Ala Ser Val Thr Asp Asp 
            260                 265                 270         


Pro Ser Ala Leu His Thr Arg Leu Ala Arg Leu Ser Ala Thr Glu Pro 
        275                 280                 285             


Glu Leu Gly Asp Tyr Arg Glu Leu Val Leu Asp Cys Arg Phe Ala Pro 
    290                 295                 300                 


Lys Arg Arg Arg Arg Gly Ala Pro Glu Gly Gly Gln Pro Tyr Pro Val 
305                 310                 315                 320 


Leu Arg Val Ala His Ser Ala Pro Val Gly Ala Gln Leu Ala Thr Glu 
                325                 330                 335     


Leu Ser Ile Ala Glu Gly Gln Glu Val Leu Phe Gly Val Phe Val Thr 
            340                 345                 350         


Gly Lys Asp Gly Gly Pro Gly Val Gly Pro Asn Ser Val Val Cys Ala 
        355                 360                 365             


Phe Pro Ile Asp Leu Leu Asp Thr Leu Ile Asp Glu Gly Val Glu Arg 
    370                 375                 380                 


Cys Cys Glu Ser Pro Val His Pro Gly Leu Arg Arg Gly Leu Asp Phe 
385                 390                 395                 400 


Phe Gln Ser Pro Ser Phe Cys Pro Asn Pro Pro Gly Leu Glu Ala Leu 
                405                 410                 415     


Ser Pro Asn Thr Ser Cys Arg His Phe Pro Leu Leu Val Ser Ser Ser 
            420                 425                 430         


Phe Ser Arg Val Asp Leu Phe Asn Gly Leu Leu Gly Pro Val Gln Val 
        435                 440                 445             


Thr Ala Leu Tyr Val Thr Arg Leu Asp Asn Val Thr Val Ala His Met 
    450                 455                 460                 


Gly Thr Met Asp Gly Arg Ile Leu Gln Val Glu Leu Val Arg Ser Leu 
465                 470                 475                 480 


Asn Tyr Leu Leu Tyr Val Ser Asn Phe Ser Leu Gly Asp Ser Gly Gln 
                485                 490                 495     


Pro Val Gln Arg Asp Val Ser Arg Leu Gly Asp His Leu Leu Phe Ala 
            500                 505                 510         


Ser Gly Asp Gln Val Phe Gln Val Pro Ile Gln Gly Pro Gly Cys Arg 
        515                 520                 525             


His Phe Leu Thr Cys Gly Arg Cys Leu Arg Ala Trp His Phe Met Gly 
    530                 535                 540                 


Cys Gly Trp Cys Gly Asn Met Cys Gly Gln Gln Lys Glu Cys Pro Gly 
545                 550                 555                 560 


Ser Trp Gln Gln Asp His Cys Pro Pro Lys Leu Thr Glu Phe His Pro 
                565                 570                 575     


His Ser Gly Pro Leu Arg Gly Ser Thr Arg Leu Thr Leu Cys Gly Ser 
            580                 585                 590         


Asn Phe Tyr Leu His Pro Ser Gly Leu Val Pro Glu Gly Thr His Gln 
        595                 600                 605             


Val Thr Val Gly Gln Ser Pro Cys Arg Pro Leu Pro Lys Asp Ser Ser 
    610                 615                 620                 


Lys Leu Arg Tyr Asn Leu Val Pro Pro Leu Pro Phe Pro Glu Gly Gly 
625                 630                 635                 640 


Asn Gln Ala Ala Pro Ser Pro 
                645         


<210>  156
<211>  5616
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human EGFR mRNA transcript variant 1 GenBank Accession No.: 
       NM_005228.3  GI:41327737

<400>  156
ccccggcgca gcgcggccgc agcagcctcc gccccccgca cggtgtgagc gcccgacgcg       60

gccgaggcgg ccggagtccc gagctagccc cggcggccgc cgccgcccag accggacgac      120

aggccacctc gtcggcgtcc gcccgagtcc ccgcctcgcc gccaacgcca caaccaccgc      180

gcacggcccc ctgactccgt ccagtattga tcgggagagc cggagcgagc tcttcgggga      240

gcagcgatgc gaccctccgg gacggccggg gcagcgctcc tggcgctgct ggctgcgctc      300

tgcccggcga gtcgggctct ggaggaaaag aaagtttgcc aaggcacgag taacaagctc      360

acgcagttgg gcacttttga agatcatttt ctcagcctcc agaggatgtt caataactgt      420

gaggtggtcc ttgggaattt ggaaattacc tatgtgcaga ggaattatga tctttccttc      480

ttaaagacca tccaggaggt ggctggttat gtcctcattg ccctcaacac agtggagcga      540

attcctttgg aaaacctgca gatcatcaga ggaaatatgt actacgaaaa ttcctatgcc      600

ttagcagtct tatctaacta tgatgcaaat aaaaccggac tgaaggagct gcccatgaga      660

aatttacagg aaatcctgca tggcgccgtg cggttcagca acaaccctgc cctgtgcaac      720

gtggagagca tccagtggcg ggacatagtc agcagtgact ttctcagcaa catgtcgatg      780

gacttccaga accacctggg cagctgccaa aagtgtgatc caagctgtcc caatgggagc      840

tgctggggtg caggagagga gaactgccag aaactgacca aaatcatctg tgcccagcag      900

tgctccgggc gctgccgtgg caagtccccc agtgactgct gccacaacca gtgtgctgca      960

ggctgcacag gcccccggga gagcgactgc ctggtctgcc gcaaattccg agacgaagcc     1020

acgtgcaagg acacctgccc cccactcatg ctctacaacc ccaccacgta ccagatggat     1080

gtgaaccccg agggcaaata cagctttggt gccacctgcg tgaagaagtg tccccgtaat     1140

tatgtggtga cagatcacgg ctcgtgcgtc cgagcctgtg gggccgacag ctatgagatg     1200

gaggaagacg gcgtccgcaa gtgtaagaag tgcgaagggc cttgccgcaa agtgtgtaac     1260

ggaataggta ttggtgaatt taaagactca ctctccataa atgctacgaa tattaaacac     1320

ttcaaaaact gcacctccat cagtggcgat ctccacatcc tgccggtggc atttaggggt     1380

gactccttca cacatactcc tcctctggat ccacaggaac tggatattct gaaaaccgta     1440

aaggaaatca cagggttttt gctgattcag gcttggcctg aaaacaggac ggacctccat     1500

gcctttgaga acctagaaat catacgcggc aggaccaagc aacatggtca gttttctctt     1560

gcagtcgtca gcctgaacat aacatccttg ggattacgct ccctcaagga gataagtgat     1620

ggagatgtga taatttcagg aaacaaaaat ttgtgctatg caaatacaat aaactggaaa     1680

aaactgtttg ggacctccgg tcagaaaacc aaaattataa gcaacagagg tgaaaacagc     1740

tgcaaggcca caggccaggt ctgccatgcc ttgtgctccc ccgagggctg ctggggcccg     1800

gagcccaggg actgcgtctc ttgccggaat gtcagccgag gcagggaatg cgtggacaag     1860

tgcaaccttc tggagggtga gccaagggag tttgtggaga actctgagtg catacagtgc     1920

cacccagagt gcctgcctca ggccatgaac atcacctgca caggacgggg accagacaac     1980

tgtatccagt gtgcccacta cattgacggc ccccactgcg tcaagacctg cccggcagga     2040

gtcatgggag aaaacaacac cctggtctgg aagtacgcag acgccggcca tgtgtgccac     2100

ctgtgccatc caaactgcac ctacggatgc actgggccag gtcttgaagg ctgtccaacg     2160

aatgggccta agatcccgtc catcgccact gggatggtgg gggccctcct cttgctgctg     2220

gtggtggccc tggggatcgg cctcttcatg cgaaggcgcc acatcgttcg gaagcgcacg     2280

ctgcggaggc tgctgcagga gagggagctt gtggagcctc ttacacccag tggagaagct     2340

cccaaccaag ctctcttgag gatcttgaag gaaactgaat tcaaaaagat caaagtgctg     2400

ggctccggtg cgttcggcac ggtgtataag ggactctgga tcccagaagg tgagaaagtt     2460

aaaattcccg tcgctatcaa ggaattaaga gaagcaacat ctccgaaagc caacaaggaa     2520

atcctcgatg aagcctacgt gatggccagc gtggacaacc cccacgtgtg ccgcctgctg     2580

ggcatctgcc tcacctccac cgtgcagctc atcacgcagc tcatgccctt cggctgcctc     2640

ctggactatg tccgggaaca caaagacaat attggctccc agtacctgct caactggtgt     2700

gtgcagatcg caaagggcat gaactacttg gaggaccgtc gcttggtgca ccgcgacctg     2760

gcagccagga acgtactggt gaaaacaccg cagcatgtca agatcacaga ttttgggctg     2820

gccaaactgc tgggtgcgga agagaaagaa taccatgcag aaggaggcaa agtgcctatc     2880

aagtggatgg cattggaatc aattttacac agaatctata cccaccagag tgatgtctgg     2940

agctacgggg tgaccgtttg ggagttgatg acctttggat ccaagccata tgacggaatc     3000

cctgccagcg agatctcctc catcctggag aaaggagaac gcctccctca gccacccata     3060

tgtaccatcg atgtctacat gatcatggtc aagtgctgga tgatagacgc agatagtcgc     3120

ccaaagttcc gtgagttgat catcgaattc tccaaaatgg cccgagaccc ccagcgctac     3180

cttgtcattc agggggatga aagaatgcat ttgccaagtc ctacagactc caacttctac     3240

cgtgccctga tggatgaaga agacatggac gacgtggtgg atgccgacga gtacctcatc     3300

ccacagcagg gcttcttcag cagcccctcc acgtcacgga ctcccctcct gagctctctg     3360

agtgcaacca gcaacaattc caccgtggct tgcattgata gaaatgggct gcaaagctgt     3420

cccatcaagg aagacagctt cttgcagcga tacagctcag accccacagg cgccttgact     3480

gaggacagca tagacgacac cttcctccca gtgcctgaat acataaacca gtccgttccc     3540

aaaaggcccg ctggctctgt gcagaatcct gtctatcaca atcagcctct gaaccccgcg     3600

cccagcagag acccacacta ccaggacccc cacagcactg cagtgggcaa ccccgagtat     3660

ctcaacactg tccagcccac ctgtgtcaac agcacattcg acagccctgc ccactgggcc     3720

cagaaaggca gccaccaaat tagcctggac aaccctgact accagcagga cttctttccc     3780

aaggaagcca agccaaatgg catctttaag ggctccacag ctgaaaatgc agaataccta     3840

agggtcgcgc cacaaagcag tgaatttatt ggagcatgac cacggaggat agtatgagcc     3900

ctaaaaatcc agactctttc gatacccagg accaagccac agcaggtcct ccatcccaac     3960

agccatgccc gcattagctc ttagacccac agactggttt tgcaacgttt acaccgacta     4020

gccaggaagt acttccacct cgggcacatt ttgggaagtt gcattccttt gtcttcaaac     4080

tgtgaagcat ttacagaaac gcatccagca agaatattgt ccctttgagc agaaatttat     4140

ctttcaaaga ggtatatttg aaaaaaaaaa aaagtatatg tgaggatttt tattgattgg     4200

ggatcttgga gtttttcatt gtcgctattg atttttactt caatgggctc ttccaacaag     4260

gaagaagctt gctggtagca cttgctaccc tgagttcatc caggcccaac tgtgagcaag     4320

gagcacaagc cacaagtctt ccagaggatg cttgattcca gtggttctgc ttcaaggctt     4380

ccactgcaaa acactaaaga tccaagaagg ccttcatggc cccagcaggc cggatcggta     4440

ctgtatcaag tcatggcagg tacagtagga taagccactc tgtcccttcc tgggcaaaga     4500

agaaacggag gggatggaat tcttccttag acttactttt gtaaaaatgt ccccacggta     4560

cttactcccc actgatggac cagtggtttc cagtcatgag cgttagactg acttgtttgt     4620

cttccattcc attgttttga aactcagtat gctgcccctg tcttgctgtc atgaaatcag     4680

caagagagga tgacacatca aataataact cggattccag cccacattgg attcatcagc     4740

atttggacca atagcccaca gctgagaatg tggaatacct aaggatagca ccgcttttgt     4800

tctcgcaaaa acgtatctcc taatttgagg ctcagatgaa atgcatcagg tcctttgggg     4860

catagatcag aagactacaa aaatgaagct gctctgaaat ctcctttagc catcacccca     4920

accccccaaa attagtttgt gttacttatg gaagatagtt ttctcctttt acttcacttc     4980

aaaagctttt tactcaaaga gtatatgttc cctccaggtc agctgccccc aaaccccctc     5040

cttacgcttt gtcacacaaa aagtgtctct gccttgagtc atctattcaa gcacttacag     5100

ctctggccac aacagggcat tttacaggtg cgaatgacag tagcattatg agtagtgtgg     5160

aattcaggta gtaaatatga aactagggtt tgaaattgat aatgctttca caacatttgc     5220

agatgtttta gaaggaaaaa agttccttcc taaaataatt tctctacaat tggaagattg     5280

gaagattcag ctagttagga gcccaccttt tttcctaatc tgtgtgtgcc ctgtaacctg     5340

actggttaac agcagtcctt tgtaaacagt gttttaaact ctcctagtca atatccaccc     5400

catccaattt atcaaggaag aaatggttca gaaaatattt tcagcctaca gttatgttca     5460

gtcacacaca catacaaaat gttccttttg cttttaaagt aatttttgac tcccagatca     5520

gtcagagccc ctacagcatt gttaagaaag tatttgattt ttgtctcaat gaaaataaaa     5580

ctatattcat ttccactcta aaaaaaaaaa aaaaaa                               5616


<210>  157
<211>  1210
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  EGFR isoform A precursor encoded by mRNA transcript variant 1 
       GenBank Accession No.: NP_005219.2  GI:29725609

<400>  157

Met Arg Pro Ser Gly Thr Ala Gly Ala Ala Leu Leu Ala Leu Leu Ala 
1               5                   10                  15      


Ala Leu Cys Pro Ala Ser Arg Ala Leu Glu Glu Lys Lys Val Cys Gln 
            20                  25                  30          


Gly Thr Ser Asn Lys Leu Thr Gln Leu Gly Thr Phe Glu Asp His Phe 
        35                  40                  45              


Leu Ser Leu Gln Arg Met Phe Asn Asn Cys Glu Val Val Leu Gly Asn 
    50                  55                  60                  


Leu Glu Ile Thr Tyr Val Gln Arg Asn Tyr Asp Leu Ser Phe Leu Lys 
65                  70                  75                  80  


Thr Ile Gln Glu Val Ala Gly Tyr Val Leu Ile Ala Leu Asn Thr Val 
                85                  90                  95      


Glu Arg Ile Pro Leu Glu Asn Leu Gln Ile Ile Arg Gly Asn Met Tyr 
            100                 105                 110         


Tyr Glu Asn Ser Tyr Ala Leu Ala Val Leu Ser Asn Tyr Asp Ala Asn 
        115                 120                 125             


Lys Thr Gly Leu Lys Glu Leu Pro Met Arg Asn Leu Gln Glu Ile Leu 
    130                 135                 140                 


His Gly Ala Val Arg Phe Ser Asn Asn Pro Ala Leu Cys Asn Val Glu 
145                 150                 155                 160 


Ser Ile Gln Trp Arg Asp Ile Val Ser Ser Asp Phe Leu Ser Asn Met 
                165                 170                 175     


Ser Met Asp Phe Gln Asn His Leu Gly Ser Cys Gln Lys Cys Asp Pro 
            180                 185                 190         


Ser Cys Pro Asn Gly Ser Cys Trp Gly Ala Gly Glu Glu Asn Cys Gln 
        195                 200                 205             


Lys Leu Thr Lys Ile Ile Cys Ala Gln Gln Cys Ser Gly Arg Cys Arg 
    210                 215                 220                 


Gly Lys Ser Pro Ser Asp Cys Cys His Asn Gln Cys Ala Ala Gly Cys 
225                 230                 235                 240 


Thr Gly Pro Arg Glu Ser Asp Cys Leu Val Cys Arg Lys Phe Arg Asp 
                245                 250                 255     


Glu Ala Thr Cys Lys Asp Thr Cys Pro Pro Leu Met Leu Tyr Asn Pro 
            260                 265                 270         


Thr Thr Tyr Gln Met Asp Val Asn Pro Glu Gly Lys Tyr Ser Phe Gly 
        275                 280                 285             


Ala Thr Cys Val Lys Lys Cys Pro Arg Asn Tyr Val Val Thr Asp His 
    290                 295                 300                 


Gly Ser Cys Val Arg Ala Cys Gly Ala Asp Ser Tyr Glu Met Glu Glu 
305                 310                 315                 320 


Asp Gly Val Arg Lys Cys Lys Lys Cys Glu Gly Pro Cys Arg Lys Val 
                325                 330                 335     


Cys Asn Gly Ile Gly Ile Gly Glu Phe Lys Asp Ser Leu Ser Ile Asn 
            340                 345                 350         


Ala Thr Asn Ile Lys His Phe Lys Asn Cys Thr Ser Ile Ser Gly Asp 
        355                 360                 365             


Leu His Ile Leu Pro Val Ala Phe Arg Gly Asp Ser Phe Thr His Thr 
    370                 375                 380                 


Pro Pro Leu Asp Pro Gln Glu Leu Asp Ile Leu Lys Thr Val Lys Glu 
385                 390                 395                 400 


Ile Thr Gly Phe Leu Leu Ile Gln Ala Trp Pro Glu Asn Arg Thr Asp 
                405                 410                 415     


Leu His Ala Phe Glu Asn Leu Glu Ile Ile Arg Gly Arg Thr Lys Gln 
            420                 425                 430         


His Gly Gln Phe Ser Leu Ala Val Val Ser Leu Asn Ile Thr Ser Leu 
        435                 440                 445             


Gly Leu Arg Ser Leu Lys Glu Ile Ser Asp Gly Asp Val Ile Ile Ser 
    450                 455                 460                 


Gly Asn Lys Asn Leu Cys Tyr Ala Asn Thr Ile Asn Trp Lys Lys Leu 
465                 470                 475                 480 


Phe Gly Thr Ser Gly Gln Lys Thr Lys Ile Ile Ser Asn Arg Gly Glu 
                485                 490                 495     


Asn Ser Cys Lys Ala Thr Gly Gln Val Cys His Ala Leu Cys Ser Pro 
            500                 505                 510         


Glu Gly Cys Trp Gly Pro Glu Pro Arg Asp Cys Val Ser Cys Arg Asn 
        515                 520                 525             


Val Ser Arg Gly Arg Glu Cys Val Asp Lys Cys Asn Leu Leu Glu Gly 
    530                 535                 540                 


Glu Pro Arg Glu Phe Val Glu Asn Ser Glu Cys Ile Gln Cys His Pro 
545                 550                 555                 560 


Glu Cys Leu Pro Gln Ala Met Asn Ile Thr Cys Thr Gly Arg Gly Pro 
                565                 570                 575     


Asp Asn Cys Ile Gln Cys Ala His Tyr Ile Asp Gly Pro His Cys Val 
            580                 585                 590         


Lys Thr Cys Pro Ala Gly Val Met Gly Glu Asn Asn Thr Leu Val Trp 
        595                 600                 605             


Lys Tyr Ala Asp Ala Gly His Val Cys His Leu Cys His Pro Asn Cys 
    610                 615                 620                 


Thr Tyr Gly Cys Thr Gly Pro Gly Leu Glu Gly Cys Pro Thr Asn Gly 
625                 630                 635                 640 


Pro Lys Ile Pro Ser Ile Ala Thr Gly Met Val Gly Ala Leu Leu Leu 
                645                 650                 655     


Leu Leu Val Val Ala Leu Gly Ile Gly Leu Phe Met Arg Arg Arg His 
            660                 665                 670         


Ile Val Arg Lys Arg Thr Leu Arg Arg Leu Leu Gln Glu Arg Glu Leu 
        675                 680                 685             


Val Glu Pro Leu Thr Pro Ser Gly Glu Ala Pro Asn Gln Ala Leu Leu 
    690                 695                 700                 


Arg Ile Leu Lys Glu Thr Glu Phe Lys Lys Ile Lys Val Leu Gly Ser 
705                 710                 715                 720 


Gly Ala Phe Gly Thr Val Tyr Lys Gly Leu Trp Ile Pro Glu Gly Glu 
                725                 730                 735     


Lys Val Lys Ile Pro Val Ala Ile Lys Glu Leu Arg Glu Ala Thr Ser 
            740                 745                 750         


Pro Lys Ala Asn Lys Glu Ile Leu Asp Glu Ala Tyr Val Met Ala Ser 
        755                 760                 765             


Val Asp Asn Pro His Val Cys Arg Leu Leu Gly Ile Cys Leu Thr Ser 
    770                 775                 780                 


Thr Val Gln Leu Ile Thr Gln Leu Met Pro Phe Gly Cys Leu Leu Asp 
785                 790                 795                 800 


Tyr Val Arg Glu His Lys Asp Asn Ile Gly Ser Gln Tyr Leu Leu Asn 
                805                 810                 815     


Trp Cys Val Gln Ile Ala Lys Gly Met Asn Tyr Leu Glu Asp Arg Arg 
            820                 825                 830         


Leu Val His Arg Asp Leu Ala Ala Arg Asn Val Leu Val Lys Thr Pro 
        835                 840                 845             


Gln His Val Lys Ile Thr Asp Phe Gly Leu Ala Lys Leu Leu Gly Ala 
    850                 855                 860                 


Glu Glu Lys Glu Tyr His Ala Glu Gly Gly Lys Val Pro Ile Lys Trp 
865                 870                 875                 880 


Met Ala Leu Glu Ser Ile Leu His Arg Ile Tyr Thr His Gln Ser Asp 
                885                 890                 895     


Val Trp Ser Tyr Gly Val Thr Val Trp Glu Leu Met Thr Phe Gly Ser 
            900                 905                 910         


Lys Pro Tyr Asp Gly Ile Pro Ala Ser Glu Ile Ser Ser Ile Leu Glu 
        915                 920                 925             


Lys Gly Glu Arg Leu Pro Gln Pro Pro Ile Cys Thr Ile Asp Val Tyr 
    930                 935                 940                 


Met Ile Met Val Lys Cys Trp Met Ile Asp Ala Asp Ser Arg Pro Lys 
945                 950                 955                 960 


Phe Arg Glu Leu Ile Ile Glu Phe Ser Lys Met Ala Arg Asp Pro Gln 
                965                 970                 975     


Arg Tyr Leu Val Ile Gln Gly Asp Glu Arg Met His Leu Pro Ser Pro 
            980                 985                 990         


Thr Asp Ser Asn Phe Tyr Arg Ala  Leu Met Asp Glu Glu  Asp Met Asp 
        995                 1000                 1005             


Asp Val  Val Asp Ala Asp Glu  Tyr Leu Ile Pro Gln  Gln Gly Phe 
    1010                 1015                 1020             


Phe Ser  Ser Pro Ser Thr Ser  Arg Thr Pro Leu Leu  Ser Ser Leu 
    1025                 1030                 1035             


Ser Ala  Thr Ser Asn Asn Ser  Thr Val Ala Cys Ile  Asp Arg Asn 
    1040                 1045                 1050             


Gly Leu  Gln Ser Cys Pro Ile  Lys Glu Asp Ser Phe  Leu Gln Arg 
    1055                 1060                 1065             


Tyr Ser  Ser Asp Pro Thr Gly  Ala Leu Thr Glu Asp  Ser Ile Asp 
    1070                 1075                 1080             


Asp Thr  Phe Leu Pro Val Pro  Glu Tyr Ile Asn Gln  Ser Val Pro 
    1085                 1090                 1095             


Lys Arg  Pro Ala Gly Ser Val  Gln Asn Pro Val Tyr  His Asn Gln 
    1100                 1105                 1110             


Pro Leu  Asn Pro Ala Pro Ser  Arg Asp Pro His Tyr  Gln Asp Pro 
    1115                 1120                 1125             


His Ser  Thr Ala Val Gly Asn  Pro Glu Tyr Leu Asn  Thr Val Gln 
    1130                 1135                 1140             


Pro Thr  Cys Val Asn Ser Thr  Phe Asp Ser Pro Ala  His Trp Ala 
    1145                 1150                 1155             


Gln Lys  Gly Ser His Gln Ile  Ser Leu Asp Asn Pro  Asp Tyr Gln 
    1160                 1165                 1170             


Gln Asp  Phe Phe Pro Lys Glu  Ala Lys Pro Asn Gly  Ile Phe Lys 
    1175                 1180                 1185             


Gly Ser  Thr Ala Glu Asn Ala  Glu Tyr Leu Arg Val  Ala Pro Gln 
    1190                 1195                 1200             


Ser Ser  Glu Phe Ile Gly Ala  
    1205                 1210 


<210>  158
<211>  2239
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens epidermal growth factor receptor (EGFR), transcript 
       variant 2, mRNA; NCBI Reference Sequence: NM_201282.1

<400>  158
ccccggcgca gcgcggccgc agcagcctcc gccccccgca cggtgtgagc gcccgacgcg       60

gccgaggcgg ccggagtccc gagctagccc cggcggccgc cgccgcccag accggacgac      120

aggccacctc gtcggcgtcc gcccgagtcc ccgcctcgcc gccaacgcca caaccaccgc      180

gcacggcccc ctgactccgt ccagtattga tcgggagagc cggagcgagc tcttcgggga      240

gcagcgatgc gaccctccgg gacggccggg gcagcgctcc tggcgctgct ggctgcgctc      300

tgcccggcga gtcgggctct ggaggaaaag aaagtttgcc aaggcacgag taacaagctc      360

acgcagttgg gcacttttga agatcatttt ctcagcctcc agaggatgtt caataactgt      420

gaggtggtcc ttgggaattt ggaaattacc tatgtgcaga ggaattatga tctttccttc      480

ttaaagacca tccaggaggt ggctggttat gtcctcattg ccctcaacac agtggagcga      540

attcctttgg aaaacctgca gatcatcaga ggaaatatgt actacgaaaa ttcctatgcc      600

ttagcagtct tatctaacta tgatgcaaat aaaaccggac tgaaggagct gcccatgaga      660

aatttacagg aaatcctgca tggcgccgtg cggttcagca acaaccctgc cctgtgcaac      720

gtggagagca tccagtggcg ggacatagtc agcagtgact ttctcagcaa catgtcgatg      780

gacttccaga accacctggg cagctgccaa aagtgtgatc caagctgtcc caatgggagc      840

tgctggggtg caggagagga gaactgccag aaactgacca aaatcatctg tgcccagcag      900

tgctccgggc gctgccgtgg caagtccccc agtgactgct gccacaacca gtgtgctgca      960

ggctgcacag gcccccggga gagcgactgc ctggtctgcc gcaaattccg agacgaagcc     1020

acgtgcaagg acacctgccc cccactcatg ctctacaacc ccaccacgta ccagatggat     1080

gtgaaccccg agggcaaata cagctttggt gccacctgcg tgaagaagtg tccccgtaat     1140

tatgtggtga cagatcacgg ctcgtgcgtc cgagcctgtg gggccgacag ctatgagatg     1200

gaggaagacg gcgtccgcaa gtgtaagaag tgcgaagggc cttgccgcaa agtgtgtaac     1260

ggaataggta ttggtgaatt taaagactca ctctccataa atgctacgaa tattaaacac     1320

ttcaaaaact gcacctccat cagtggcgat ctccacatcc tgccggtggc atttaggggt     1380

gactccttca cacatactcc tcctctggat ccacaggaac tggatattct gaaaaccgta     1440

aaggaaatca cagggttttt gctgattcag gcttggcctg aaaacaggac ggacctccat     1500

gcctttgaga acctagaaat catacgcggc aggaccaagc aacatggtca gttttctctt     1560

gcagtcgtca gcctgaacat aacatccttg ggattacgct ccctcaagga gataagtgat     1620

ggagatgtga taatttcagg aaacaaaaat ttgtgctatg caaatacaat aaactggaaa     1680

aaactgtttg ggacctccgg tcagaaaacc aaaattataa gcaacagagg tgaaaacagc     1740

tgcaaggcca caggccaggt ctgccatgcc ttgtgctccc ccgagggctg ctggggcccg     1800

gagcccaggg actgcgtctc ttgccggaat gtcagccgag gcagggaatg cgtggacaag     1860

tgcaaccttc tggagggtga gccaagggag tttgtggaga actctgagtg catacagtgc     1920

cacccagagt gcctgcctca ggccatgaac atcacctgca caggacgggg accagacaac     1980

tgtatccagt gtgcccacta cattgacggc ccccactgcg tcaagacctg cccggcagga     2040

gtcatgggag aaaacaacac cctggtctgg aagtacgcag acgccggcca tgtgtgccac     2100

ctgtgccatc caaactgcac ctacgggtcc taataaatct tcactgtctg actttagtct     2160

cccactaaaa ctgcatttcc tttctacaat ttcaatttct ccctttgctt caaataaagt     2220

cctgacacta ttcatttga                                                  2239


<210>  159
<211>  628
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens epidermal growth factor receptor (EGFR), transcript 
       variant 2, polypeptide encoded by NCBI Reference Sequence: 
       NM_201282.1

<400>  159

Met Arg Pro Ser Gly Thr Ala Gly Ala Ala Leu Leu Ala Leu Leu Ala 
1               5                   10                  15      


Ala Leu Cys Pro Ala Ser Arg Ala Leu Glu Glu Lys Lys Val Cys Gln 
            20                  25                  30          


Gly Thr Ser Asn Lys Leu Thr Gln Leu Gly Thr Phe Glu Asp His Phe 
        35                  40                  45              


Leu Ser Leu Gln Arg Met Phe Asn Asn Cys Glu Val Val Leu Gly Asn 
    50                  55                  60                  


Leu Glu Ile Thr Tyr Val Gln Arg Asn Tyr Asp Leu Ser Phe Leu Lys 
65                  70                  75                  80  


Thr Ile Gln Glu Val Ala Gly Tyr Val Leu Ile Ala Leu Asn Thr Val 
                85                  90                  95      


Glu Arg Ile Pro Leu Glu Asn Leu Gln Ile Ile Arg Gly Asn Met Tyr 
            100                 105                 110         


Tyr Glu Asn Ser Tyr Ala Leu Ala Val Leu Ser Asn Tyr Asp Ala Asn 
        115                 120                 125             


Lys Thr Gly Leu Lys Glu Leu Pro Met Arg Asn Leu Gln Glu Ile Leu 
    130                 135                 140                 


His Gly Ala Val Arg Phe Ser Asn Asn Pro Ala Leu Cys Asn Val Glu 
145                 150                 155                 160 


Ser Ile Gln Trp Arg Asp Ile Val Ser Ser Asp Phe Leu Ser Asn Met 
                165                 170                 175     


Ser Met Asp Phe Gln Asn His Leu Gly Ser Cys Gln Lys Cys Asp Pro 
            180                 185                 190         


Ser Cys Pro Asn Gly Ser Cys Trp Gly Ala Gly Glu Glu Asn Cys Gln 
        195                 200                 205             


Lys Leu Thr Lys Ile Ile Cys Ala Gln Gln Cys Ser Gly Arg Cys Arg 
    210                 215                 220                 


Gly Lys Ser Pro Ser Asp Cys Cys His Asn Gln Cys Ala Ala Gly Cys 
225                 230                 235                 240 


Thr Gly Pro Arg Glu Ser Asp Cys Leu Val Cys Arg Lys Phe Arg Asp 
                245                 250                 255     


Glu Ala Thr Cys Lys Asp Thr Cys Pro Pro Leu Met Leu Tyr Asn Pro 
            260                 265                 270         


Thr Thr Tyr Gln Met Asp Val Asn Pro Glu Gly Lys Tyr Ser Phe Gly 
        275                 280                 285             


Ala Thr Cys Val Lys Lys Cys Pro Arg Asn Tyr Val Val Thr Asp His 
    290                 295                 300                 


Gly Ser Cys Val Arg Ala Cys Gly Ala Asp Ser Tyr Glu Met Glu Glu 
305                 310                 315                 320 


Asp Gly Val Arg Lys Cys Lys Lys Cys Glu Gly Pro Cys Arg Lys Val 
                325                 330                 335     


Cys Asn Gly Ile Gly Ile Gly Glu Phe Lys Asp Ser Leu Ser Ile Asn 
            340                 345                 350         


Ala Thr Asn Ile Lys His Phe Lys Asn Cys Thr Ser Ile Ser Gly Asp 
        355                 360                 365             


Leu His Ile Leu Pro Val Ala Phe Arg Gly Asp Ser Phe Thr His Thr 
    370                 375                 380                 


Pro Pro Leu Asp Pro Gln Glu Leu Asp Ile Leu Lys Thr Val Lys Glu 
385                 390                 395                 400 


Ile Thr Gly Phe Leu Leu Ile Gln Ala Trp Pro Glu Asn Arg Thr Asp 
                405                 410                 415     


Leu His Ala Phe Glu Asn Leu Glu Ile Ile Arg Gly Arg Thr Lys Gln 
            420                 425                 430         


His Gly Gln Phe Ser Leu Ala Val Val Ser Leu Asn Ile Thr Ser Leu 
        435                 440                 445             


Gly Leu Arg Ser Leu Lys Glu Ile Ser Asp Gly Asp Val Ile Ile Ser 
    450                 455                 460                 


Gly Asn Lys Asn Leu Cys Tyr Ala Asn Thr Ile Asn Trp Lys Lys Leu 
465                 470                 475                 480 


Phe Gly Thr Ser Gly Gln Lys Thr Lys Ile Ile Ser Asn Arg Gly Glu 
                485                 490                 495     


Asn Ser Cys Lys Ala Thr Gly Gln Val Cys His Ala Leu Cys Ser Pro 
            500                 505                 510         


Glu Gly Cys Trp Gly Pro Glu Pro Arg Asp Cys Val Ser Cys Arg Asn 
        515                 520                 525             


Val Ser Arg Gly Arg Glu Cys Val Asp Lys Cys Asn Leu Leu Glu Gly 
    530                 535                 540                 


Glu Pro Arg Glu Phe Val Glu Asn Ser Glu Cys Ile Gln Cys His Pro 
545                 550                 555                 560 


Glu Cys Leu Pro Gln Ala Met Asn Ile Thr Cys Thr Gly Arg Gly Pro 
                565                 570                 575     


Asp Asn Cys Ile Gln Cys Ala His Tyr Ile Asp Gly Pro His Cys Val 
            580                 585                 590         


Lys Thr Cys Pro Ala Gly Val Met Gly Glu Asn Asn Thr Leu Val Trp 
        595                 600                 605             


Lys Tyr Ala Asp Ala Gly His Val Cys His Leu Cys His Pro Asn Cys 
    610                 615                 620                 


Thr Tyr Gly Ser 
625             


<210>  160
<211>  1595
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens epidermal growth factor receptor (EGFR), transcript 
       variant 3, mRNA; NCBI Reference Sequence: NM_201283.1

<400>  160
ccccggcgca gcgcggccgc agcagcctcc gccccccgca cggtgtgagc gcccgacgcg       60

gccgaggcgg ccggagtccc gagctagccc cggcggccgc cgccgcccag accggacgac      120

aggccacctc gtcggcgtcc gcccgagtcc ccgcctcgcc gccaacgcca caaccaccgc      180

gcacggcccc ctgactccgt ccagtattga tcgggagagc cggagcgagc tcttcgggga      240

gcagcgatgc gaccctccgg gacggccggg gcagcgctcc tggcgctgct ggctgcgctc      300

tgcccggcga gtcgggctct ggaggaaaag aaagtttgcc aaggcacgag taacaagctc      360

acgcagttgg gcacttttga agatcatttt ctcagcctcc agaggatgtt caataactgt      420

gaggtggtcc ttgggaattt ggaaattacc tatgtgcaga ggaattatga tctttccttc      480

ttaaagacca tccaggaggt ggctggttat gtcctcattg ccctcaacac agtggagcga      540

attcctttgg aaaacctgca gatcatcaga ggaaatatgt actacgaaaa ttcctatgcc      600

ttagcagtct tatctaacta tgatgcaaat aaaaccggac tgaaggagct gcccatgaga      660

aatttacagg aaatcctgca tggcgccgtg cggttcagca acaaccctgc cctgtgcaac      720

gtggagagca tccagtggcg ggacatagtc agcagtgact ttctcagcaa catgtcgatg      780

gacttccaga accacctggg cagctgccaa aagtgtgatc caagctgtcc caatgggagc      840

tgctggggtg caggagagga gaactgccag aaactgacca aaatcatctg tgcccagcag      900

tgctccgggc gctgccgtgg caagtccccc agtgactgct gccacaacca gtgtgctgca      960

ggctgcacag gcccccggga gagcgactgc ctggtctgcc gcaaattccg agacgaagcc     1020

acgtgcaagg acacctgccc cccactcatg ctctacaacc ccaccacgta ccagatggat     1080

gtgaaccccg agggcaaata cagctttggt gccacctgcg tgaagaagtg tccccgtaat     1140

tatgtggtga cagatcacgg ctcgtgcgtc cgagcctgtg gggccgacag ctatgagatg     1200

gaggaagacg gcgtccgcaa gtgtaagaag tgcgaagggc cttgccgcaa agtgtgtaac     1260

ggaataggta ttggtgaatt taaagactca ctctccataa atgctacgaa tattaaacac     1320

ttcaaaaact gcacctccat cagtggcgat ctccacatcc tgccggtggc atttaggggt     1380

gactccttca cacatactcc tcctctggat ccacaggaac tggatattct gaaaaccgta     1440

aaggaaatca caggtttgag ctgaattatc acatgaatat aaatgggaaa tcagtgtttt     1500

agagagagaa cttttcgaca tatttcctgt tcccttggaa taaaaacatt tcttctgaaa     1560

ttttaccgtt aaaaaaaaaa aaaaaaaaaa aaaaa                                1595


<210>  161
<211>  405
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens epidermal growth factor receptor (EGFR), transcript 
       variant 3, polypeptide encoded by NCBI Reference Sequence: 
       NM_201283.1

<400>  161

Met Arg Pro Ser Gly Thr Ala Gly Ala Ala Leu Leu Ala Leu Leu Ala 
1               5                   10                  15      


Ala Leu Cys Pro Ala Ser Arg Ala Leu Glu Glu Lys Lys Val Cys Gln 
            20                  25                  30          


Gly Thr Ser Asn Lys Leu Thr Gln Leu Gly Thr Phe Glu Asp His Phe 
        35                  40                  45              


Leu Ser Leu Gln Arg Met Phe Asn Asn Cys Glu Val Val Leu Gly Asn 
    50                  55                  60                  


Leu Glu Ile Thr Tyr Val Gln Arg Asn Tyr Asp Leu Ser Phe Leu Lys 
65                  70                  75                  80  


Thr Ile Gln Glu Val Ala Gly Tyr Val Leu Ile Ala Leu Asn Thr Val 
                85                  90                  95      


Glu Arg Ile Pro Leu Glu Asn Leu Gln Ile Ile Arg Gly Asn Met Tyr 
            100                 105                 110         


Tyr Glu Asn Ser Tyr Ala Leu Ala Val Leu Ser Asn Tyr Asp Ala Asn 
        115                 120                 125             


Lys Thr Gly Leu Lys Glu Leu Pro Met Arg Asn Leu Gln Glu Ile Leu 
    130                 135                 140                 


His Gly Ala Val Arg Phe Ser Asn Asn Pro Ala Leu Cys Asn Val Glu 
145                 150                 155                 160 


Ser Ile Gln Trp Arg Asp Ile Val Ser Ser Asp Phe Leu Ser Asn Met 
                165                 170                 175     


Ser Met Asp Phe Gln Asn His Leu Gly Ser Cys Gln Lys Cys Asp Pro 
            180                 185                 190         


Ser Cys Pro Asn Gly Ser Cys Trp Gly Ala Gly Glu Glu Asn Cys Gln 
        195                 200                 205             


Lys Leu Thr Lys Ile Ile Cys Ala Gln Gln Cys Ser Gly Arg Cys Arg 
    210                 215                 220                 


Gly Lys Ser Pro Ser Asp Cys Cys His Asn Gln Cys Ala Ala Gly Cys 
225                 230                 235                 240 


Thr Gly Pro Arg Glu Ser Asp Cys Leu Val Cys Arg Lys Phe Arg Asp 
                245                 250                 255     


Glu Ala Thr Cys Lys Asp Thr Cys Pro Pro Leu Met Leu Tyr Asn Pro 
            260                 265                 270         


Thr Thr Tyr Gln Met Asp Val Asn Pro Glu Gly Lys Tyr Ser Phe Gly 
        275                 280                 285             


Ala Thr Cys Val Lys Lys Cys Pro Arg Asn Tyr Val Val Thr Asp His 
    290                 295                 300                 


Gly Ser Cys Val Arg Ala Cys Gly Ala Asp Ser Tyr Glu Met Glu Glu 
305                 310                 315                 320 


Asp Gly Val Arg Lys Cys Lys Lys Cys Glu Gly Pro Cys Arg Lys Val 
                325                 330                 335     


Cys Asn Gly Ile Gly Ile Gly Glu Phe Lys Asp Ser Leu Ser Ile Asn 
            340                 345                 350         


Ala Thr Asn Ile Lys His Phe Lys Asn Cys Thr Ser Ile Ser Gly Asp 
        355                 360                 365             


Leu His Ile Leu Pro Val Ala Phe Arg Gly Asp Ser Phe Thr His Thr 
    370                 375                 380                 


Pro Pro Leu Asp Pro Gln Glu Leu Asp Ile Leu Lys Thr Val Lys Glu 
385                 390                 395                 400 


Ile Thr Gly Leu Ser 
                405 


<210>  162
<211>  2865
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens epidermal growth factor receptor (EGFR), transcript 
       variant 4, mRNA; NCBI Reference Sequence: NM_201284.1

<400>  162
ccccggcgca gcgcggccgc agcagcctcc gccccccgca cggtgtgagc gcccgacgcg       60

gccgaggcgg ccggagtccc gagctagccc cggcggccgc cgccgcccag accggacgac      120

aggccacctc gtcggcgtcc gcccgagtcc ccgcctcgcc gccaacgcca caaccaccgc      180

gcacggcccc ctgactccgt ccagtattga tcgggagagc cggagcgagc tcttcgggga      240

gcagcgatgc gaccctccgg gacggccggg gcagcgctcc tggcgctgct ggctgcgctc      300

tgcccggcga gtcgggctct ggaggaaaag aaagtttgcc aaggcacgag taacaagctc      360

acgcagttgg gcacttttga agatcatttt ctcagcctcc agaggatgtt caataactgt      420

gaggtggtcc ttgggaattt ggaaattacc tatgtgcaga ggaattatga tctttccttc      480

ttaaagacca tccaggaggt ggctggttat gtcctcattg ccctcaacac agtggagcga      540

attcctttgg aaaacctgca gatcatcaga ggaaatatgt actacgaaaa ttcctatgcc      600

ttagcagtct tatctaacta tgatgcaaat aaaaccggac tgaaggagct gcccatgaga      660

aatttacagg aaatcctgca tggcgccgtg cggttcagca acaaccctgc cctgtgcaac      720

gtggagagca tccagtggcg ggacatagtc agcagtgact ttctcagcaa catgtcgatg      780

gacttccaga accacctggg cagctgccaa aagtgtgatc caagctgtcc caatgggagc      840

tgctggggtg caggagagga gaactgccag aaactgacca aaatcatctg tgcccagcag      900

tgctccgggc gctgccgtgg caagtccccc agtgactgct gccacaacca gtgtgctgca      960

ggctgcacag gcccccggga gagcgactgc ctggtctgcc gcaaattccg agacgaagcc     1020

acgtgcaagg acacctgccc cccactcatg ctctacaacc ccaccacgta ccagatggat     1080

gtgaaccccg agggcaaata cagctttggt gccacctgcg tgaagaagtg tccccgtaat     1140

tatgtggtga cagatcacgg ctcgtgcgtc cgagcctgtg gggccgacag ctatgagatg     1200

gaggaagacg gcgtccgcaa gtgtaagaag tgcgaagggc cttgccgcaa agtgtgtaac     1260

ggaataggta ttggtgaatt taaagactca ctctccataa atgctacgaa tattaaacac     1320

ttcaaaaact gcacctccat cagtggcgat ctccacatcc tgccggtggc atttaggggt     1380

gactccttca cacatactcc tcctctggat ccacaggaac tggatattct gaaaaccgta     1440

aaggaaatca cagggttttt gctgattcag gcttggcctg aaaacaggac ggacctccat     1500

gcctttgaga acctagaaat catacgcggc aggaccaagc aacatggtca gttttctctt     1560

gcagtcgtca gcctgaacat aacatccttg ggattacgct ccctcaagga gataagtgat     1620

ggagatgtga taatttcagg aaacaaaaat ttgtgctatg caaatacaat aaactggaaa     1680

aaactgtttg ggacctccgg tcagaaaacc aaaattataa gcaacagagg tgaaaacagc     1740

tgcaaggcca caggccaggt ctgccatgcc ttgtgctccc ccgagggctg ctggggcccg     1800

gagcccaggg actgcgtctc ttgccggaat gtcagccgag gcagggaatg cgtggacaag     1860

tgcaaccttc tggagggtga gccaagggag tttgtggaga actctgagtg catacagtgc     1920

cacccagagt gcctgcctca ggccatgaac atcacctgca caggacgggg accagacaac     1980

tgtatccagt gtgcccacta cattgacggc ccccactgcg tcaagacctg cccggcagga     2040

gtcatgggag aaaacaacac cctggtctgg aagtacgcag acgccggcca tgtgtgccac     2100

ctgtgccatc caaactgcac ctacgggcca ggaaatgaga gtctcaaagc catgttattc     2160

tgccttttta aactatcatc ctgtaatcaa agtaatgatg gcagcgtgtc ccaccagagc     2220

gggagcccag ctgctcagga gtcatgctta ggatggatcc cttctcttct gccgtcagag     2280

tttcagctgg gttggggtgg atgcagccac ctccatgcct ggccttctgc atctgtgatc     2340

atcacggcct cctcctgcca ctgagcctca tgccttcacg tgtctgttcc ccccgctttt     2400

cctttctgcc acccctgcac gtgggccgcc aggttcccaa gagtatccta cccatttcct     2460

tccttccact ccctttgcca gtgcctctca ccccaactag tagctaacca tcacccccag     2520

gactgacctc ttcctcctcg ctgccagatg attgttcaaa gcacagaatt tgtcagaaac     2580

ctgcagggac tccatgctgc cagccttctc cgtaattagc atggccccag tccatgcttc     2640

tagccttggt tccttctgcc cctctgtttg aaattctaga gccagctgtg ggacaattat     2700

ctgtgtcaaa agccagatgt gaaaacatct caataacaaa ctggctgctt tgttcaatgc     2760

tagaacaacg cctgtcacag agtagaaact caaaaatatt tgctgagtga atgaacaaat     2820

gaataaatgc ataataaata attaaccacc aatccaacat ccaga                     2865


<210>  163
<211>  705
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens epidermal growth factor receptor (EGFR), transcript 
       variant 4, polypeptide encoded by NCBI Reference Sequence: 
       NM_201284.1

<400>  163

Met Arg Pro Ser Gly Thr Ala Gly Ala Ala Leu Leu Ala Leu Leu Ala 
1               5                   10                  15      


Ala Leu Cys Pro Ala Ser Arg Ala Leu Glu Glu Lys Lys Val Cys Gln 
            20                  25                  30          


Gly Thr Ser Asn Lys Leu Thr Gln Leu Gly Thr Phe Glu Asp His Phe 
        35                  40                  45              


Leu Ser Leu Gln Arg Met Phe Asn Asn Cys Glu Val Val Leu Gly Asn 
    50                  55                  60                  


Leu Glu Ile Thr Tyr Val Gln Arg Asn Tyr Asp Leu Ser Phe Leu Lys 
65                  70                  75                  80  


Thr Ile Gln Glu Val Ala Gly Tyr Val Leu Ile Ala Leu Asn Thr Val 
                85                  90                  95      


Glu Arg Ile Pro Leu Glu Asn Leu Gln Ile Ile Arg Gly Asn Met Tyr 
            100                 105                 110         


Tyr Glu Asn Ser Tyr Ala Leu Ala Val Leu Ser Asn Tyr Asp Ala Asn 
        115                 120                 125             


Lys Thr Gly Leu Lys Glu Leu Pro Met Arg Asn Leu Gln Glu Ile Leu 
    130                 135                 140                 


His Gly Ala Val Arg Phe Ser Asn Asn Pro Ala Leu Cys Asn Val Glu 
145                 150                 155                 160 


Ser Ile Gln Trp Arg Asp Ile Val Ser Ser Asp Phe Leu Ser Asn Met 
                165                 170                 175     


Ser Met Asp Phe Gln Asn His Leu Gly Ser Cys Gln Lys Cys Asp Pro 
            180                 185                 190         


Ser Cys Pro Asn Gly Ser Cys Trp Gly Ala Gly Glu Glu Asn Cys Gln 
        195                 200                 205             


Lys Leu Thr Lys Ile Ile Cys Ala Gln Gln Cys Ser Gly Arg Cys Arg 
    210                 215                 220                 


Gly Lys Ser Pro Ser Asp Cys Cys His Asn Gln Cys Ala Ala Gly Cys 
225                 230                 235                 240 


Thr Gly Pro Arg Glu Ser Asp Cys Leu Val Cys Arg Lys Phe Arg Asp 
                245                 250                 255     


Glu Ala Thr Cys Lys Asp Thr Cys Pro Pro Leu Met Leu Tyr Asn Pro 
            260                 265                 270         


Thr Thr Tyr Gln Met Asp Val Asn Pro Glu Gly Lys Tyr Ser Phe Gly 
        275                 280                 285             


Ala Thr Cys Val Lys Lys Cys Pro Arg Asn Tyr Val Val Thr Asp His 
    290                 295                 300                 


Gly Ser Cys Val Arg Ala Cys Gly Ala Asp Ser Tyr Glu Met Glu Glu 
305                 310                 315                 320 


Asp Gly Val Arg Lys Cys Lys Lys Cys Glu Gly Pro Cys Arg Lys Val 
                325                 330                 335     


Cys Asn Gly Ile Gly Ile Gly Glu Phe Lys Asp Ser Leu Ser Ile Asn 
            340                 345                 350         


Ala Thr Asn Ile Lys His Phe Lys Asn Cys Thr Ser Ile Ser Gly Asp 
        355                 360                 365             


Leu His Ile Leu Pro Val Ala Phe Arg Gly Asp Ser Phe Thr His Thr 
    370                 375                 380                 


Pro Pro Leu Asp Pro Gln Glu Leu Asp Ile Leu Lys Thr Val Lys Glu 
385                 390                 395                 400 


Ile Thr Gly Phe Leu Leu Ile Gln Ala Trp Pro Glu Asn Arg Thr Asp 
                405                 410                 415     


Leu His Ala Phe Glu Asn Leu Glu Ile Ile Arg Gly Arg Thr Lys Gln 
            420                 425                 430         


His Gly Gln Phe Ser Leu Ala Val Val Ser Leu Asn Ile Thr Ser Leu 
        435                 440                 445             


Gly Leu Arg Ser Leu Lys Glu Ile Ser Asp Gly Asp Val Ile Ile Ser 
    450                 455                 460                 


Gly Asn Lys Asn Leu Cys Tyr Ala Asn Thr Ile Asn Trp Lys Lys Leu 
465                 470                 475                 480 


Phe Gly Thr Ser Gly Gln Lys Thr Lys Ile Ile Ser Asn Arg Gly Glu 
                485                 490                 495     


Asn Ser Cys Lys Ala Thr Gly Gln Val Cys His Ala Leu Cys Ser Pro 
            500                 505                 510         


Glu Gly Cys Trp Gly Pro Glu Pro Arg Asp Cys Val Ser Cys Arg Asn 
        515                 520                 525             


Val Ser Arg Gly Arg Glu Cys Val Asp Lys Cys Asn Leu Leu Glu Gly 
    530                 535                 540                 


Glu Pro Arg Glu Phe Val Glu Asn Ser Glu Cys Ile Gln Cys His Pro 
545                 550                 555                 560 


Glu Cys Leu Pro Gln Ala Met Asn Ile Thr Cys Thr Gly Arg Gly Pro 
                565                 570                 575     


Asp Asn Cys Ile Gln Cys Ala His Tyr Ile Asp Gly Pro His Cys Val 
            580                 585                 590         


Lys Thr Cys Pro Ala Gly Val Met Gly Glu Asn Asn Thr Leu Val Trp 
        595                 600                 605             


Lys Tyr Ala Asp Ala Gly His Val Cys His Leu Cys His Pro Asn Cys 
    610                 615                 620                 


Thr Tyr Gly Pro Gly Asn Glu Ser Leu Lys Ala Met Leu Phe Cys Leu 
625                 630                 635                 640 


Phe Lys Leu Ser Ser Cys Asn Gln Ser Asn Asp Gly Ser Val Ser His 
                645                 650                 655     


Gln Ser Gly Ser Pro Ala Ala Gln Glu Ser Cys Leu Gly Trp Ile Pro 
            660                 665                 670         


Ser Leu Leu Pro Ser Glu Phe Gln Leu Gly Trp Gly Gly Cys Ser His 
        675                 680                 685             


Leu His Ala Trp Pro Ser Ala Ser Val Ile Ile Thr Ala Ser Ser Cys 
    690                 695                 700                 


His 
705 


<210>  164
<211>  7202
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human SnoN (alias SKIL) mRNA, transcript variant 1; GenBank 
       Accession No. NM_005414.4, GI:351721977

<400>  164
gatgtgtgtg gggttcggag ccgcgccggc acagccgaag ggagcgggcg agcggcgacg       60

gcggcggcgg cgggcacaga ttaattaaaa gaagaatgaa ctataatcct tgaagataac      120

tgggcaattt tttaagtcgg aggctgttct tactggtgtg aggatttaca cacgtcttca      180

gtttttcagc acagaccagc agaccatcat ttttagagga aatactccct ctgccctcct      240

ttttggtttc cttggtggta aagattaaat ttggttgcat cattttgact tgtgtttgag      300

tctagatttt atggcacaag gaatggcata aacttttcat gtgttttggt taaaacaaac      360

cagaccattg cattgaccct ggacatcttt aattgagaaa ttggtaactt tattttaata      420

tgtatatctg aagaattcaa gaaaacaaag gcatcctcag aggtgtgcct cttttcttta      480

ttattagagg caaaacgaac aattttatag gatttgtagt gaaattatac cagattataa      540

ggagaaccaa aactaagtcg caaaatttat taatttaagg ggctctcgct ttgaaagttt      600

gagagtaagt tacgataggc atttgtatcc attcattact ttcctctttt caaataagca      660

actaaataga aatgctaatc tcagacttaa ttatttaaca gaagagtgta ccatggaaaa      720

cctccagaca aatttctcct tggttcaggg ctcaactaaa aaactgaatg ggatgggaga      780

tgatggcagc cccccagcga aaaaaatgat aacggacatt catgcaaatg gaaaaacgat      840

aaacaaggtg ccaacagtta agaaggaaca cttggatgac tatggagaag caccagtgga      900

aactgatgga gagcatgtta agcgaacctg tacttctgtt cctgaaactt tgcatttaaa      960

tcccagtttg aaacacacat tggcacaatt ccatttaagt agtcagagct cgctgggtgg     1020

accagcagca ttttctgctc ggcattccca agaaagcatg tcgcctactg tatttctgcc     1080

tcttccatca cctcaggttc ttcctggccc attgctcatc ccttcagata gctccacaga     1140

actcactcag actgtgttgg aaggggaatc tatttcttgt tttcaagttg gaggagaaaa     1200

gagactctgt ttgccccaag tcttaaattc tgttctccga gaatttacac tccagcaaat     1260

aaatacagtg tgtgatgaac tgtacatata ttgttcaagg tgtacttcag accagcttca     1320

tatcttaaag gtactgggca tacttccatt caatgcccca tcctgtgggc tgattacatt     1380

aactgatgca caaagattat gtaatgcttt attgcggcca cgaacttttc ctcaaaatgg     1440

tagcgtactt cctgctaaaa gctcattggc ccagttaaag gaaactggca gtgcctttga     1500

agtggagcat gaatgcctag gcaaatgtca gggtttattt gcaccccagt tttatgttca     1560

gcctgatgct ccgtgtattc aatgtctgga gtgttgtgga atgtttgcac cccagacgtt     1620

tgtgatgcat tctcacagat cacctgacaa aagaacttgc cactggggct ttgaatcagc     1680

taaatggcat tgctatcttc atgtgaacca aaaatactta ggaacacctg aagaaaagaa     1740

actgaagata attttagaag aaatgaagga gaagtttagc atgagaagtg gaaagagaaa     1800

tcaatccaag acagatgcac catcaggaat ggaattacag tcatggtatc ctgttataaa     1860

gcaggaaggt gaccatgttt ctcagacaca ttcattttta caccccagct actacttata     1920

catgtgtgat aaagtggttg ccccaaatgt gtcacttact tctgctgtat cccagtctaa     1980

agagctcaca aagacagagg caagtaagtc catatcaaga cagtcagaga aggctcacag     2040

tagtggtaaa cttcaaaaaa cagtgtctta tccagatgtc tcacttgagg aacaggagaa     2100

aatggattta aaaacaagta gagaattatg tagccgttta gatgcatcaa tctcaaataa     2160

ttctacaagt aaaaggaaat ctgagtctgc cacttgcaac ttagtcagag acataaacaa     2220

agtgggaatt ggccttgttg ctgccgcttc atctccgctt cttgtgaaag atgtcatttg     2280

tgaggatgat aagggaaaaa tcatggaaga agtaatgaga acttatttaa aacaacagga     2340

aaaactaaac ttgattttgc aaaagaagca acaacttcag atggaagtaa aaatgttgag     2400

tagttcaaaa tctatgaagg aactcactga agaacagcag aatttacaga aagagcttga     2460

atctttgcag aatgaacatg ctcaaagaat ggaagaattt tatgttgaac agaaagactt     2520

agagaaaaaa ttggagcaga taatgaagca aaaatgtacc tgtgactcaa atttagaaaa     2580

agacaaagag gctgaatatg caggacagtt ggcagaactg aggcagagat tggaccatgc     2640

tgaggccgat aggcaagaac tccaagatga actcagacag gaacgggaag caagacagaa     2700

gttagagatg atgataaaag agctaaagct gcaaattctg aaatcatcaa agactgctaa     2760

agaatagaaa ctgttaaaga gattcatctg tgtattactg acaaggtttt ttttgtttgt     2820

tgcttgcttt ggtaattgaa ttctgaagaa tttatctgca tgacgataac taggcattct     2880

atccatttgt agatcagaga aagtgaagag attatatatt agtacttaaa tttttacatt     2940

ttccaaatga atgaaaatgt atgtttcttt gtactttttt aaaaaaatca gcttagtaac     3000

aatactatat ggtttcaact agtaggtaat ctgcttatat ttctaatgca aacttaacaa     3060

ttgtgtactt tttaaaagct gcaatatgtg ttggaaaata gctgtggtca attttgttat     3120

ccatatttca gactcaattt tagatacaat ggtggcttta tattttaagt atatagagct     3180

actcaaggag ttgaatctcc ccttttctca ttaacacaat ttttctaagt tgatatggtg     3240

tactcattaa catacaccaa atttactttt actttgttca gattgtggaa tgaatttcca     3300

ccagttctct tctttttaat gtgtacccta ggaggaattt tactgaggtt atagcatacc     3360

ccatgagcac agtggggaag aagaatgtgt tgttatgtgc tgctgctaaa cagaagcagc     3420

agttgtaatt tgtttttcag tttaaatgtg gttatagtta gatttttttt taagcagcaa     3480

cttttcaaaa ataaaatgtg ataatttctg aacttttgtt tgtgttgtta atagtggtgt     3540

gaaaatatta acgttcttga gaaaaactga taccactgtt gtgtatcagt ttctatacaa     3600

tccataatcc tcctgtacag tttttacatg tagttatgag tcttactaaa atttatataa     3660

tggacttgtt ttcctttaag ttgtaaaatg ttaaacacct tgaaggttat tttggacttc     3720

tgtatgttta aatgttgtct taccaaaatt tgcacgaatg gaccattttc atttactact     3780

taatatcaaa atcaggaatt tacagtcaac tgatagtaca tgataggtgc atataggaca     3840

gtttagttac ctgctactaa aagattttta gataagtttt agaagataaa ggaattccat     3900

agtttcagga gggacaacat cttctgcact ttttttttgc acagaaaagt ctgtcattct     3960

ctaatggcaa atttcatatt tgttaattct tggctcaaaa tatattaggt aaaattctta     4020

gatctgtttt taaagggagt ttcctgaaac tatcattaat tgacattatt accccatgga     4080

ttttatggga taataaatgt ttttcatgtt ctcttataag atactatgta tgaaattact     4140

tcagagagct atatttattt taaaataaat tagctagggt taaggttata ttctatttcc     4200

agcatagaag gtagataatc taatggtgta gaaagaatca ctaggttgtc atttaaccag     4260

ttattttcat attttgctta atagtacata tccaaaaaga attttgtact tccccaaatg     4320

taatttattt actaaattga gtataaccta aatgtgtgtt ttctattttc catttaaatt     4380

ttgctatatt aagactaatt taattcgttg agtcttggaa tcttctcaag gaggaacaaa     4440

tattaaaatg acatgtagaa acaaattttt tttttttttt tttttttttt ttttttgaga     4500

cagagtctcg ctgtctccca ggctggagtt cagtggtgca atctcggctc actgcaagct     4560

ctgcctcctg ggttcaagcc attctcctgc ctcagcctcc cgagtagctg ggactacagg     4620

cacccaccac cactcccggc taatttttag aaacaaatat ttaaaatgac atattctccc     4680

aatacaatct atttagatct ggagaaggaa aaatcagata tttatgatat agttttattt     4740

taattttgaa ttatttgtgt cacagctcag ctttttggaa gacaaactca aacacctata     4800

atttcattta tatttctaat tcacttggaa cctttctgct ttatgttacc tagaaaatga     4860

taatttgttt aacccaaaac ttctaaaata aattgcttaa tccttgaaat atgttattgg     4920

aaaattttaa gcagtgctta aacaccatta aattattatg aacttgtaat tcagaattga     4980

gtaaagaaat attttttcta gtccttcata tattgaaaac ttgccacatg acattgtatc     5040

gtcttcattt tccagaagat gcgttggtgt gccataggtt tctaacttcc ttgaaaatag     5100

ttttttaagt caattgtaaa tatacgtatt attgttaaaa gtaactttaa actgcaacac     5160

atagcttcaa aacaatatag agattttgta ataccttata agtggagttg gctaaaatac     5220

cttatccata taaaacttat tctattcttt gcatgcttat tttgtgtgtt ggttgctagc     5280

ttaaagtttg atttgttgtt actctttgtg tgccaaattc actaggcaag cggatttttc     5340

ctcagacttc aaaaaataat tcttttaaga aaaaatgtaa aaatgtttat tctaaaaagc     5400

tgcattaaag ggacaaccta taaaaagttt tgctagctca tctttagaag gaagaaagaa     5460

tattagcttg ggtgatgttt aatttgggtg gcgatagttt ctgtaggcta aactttatga     5520

gaaaagtgta cctactctat aaaggtaata aatgtaaaac ctcttgctgt tattgaggaa     5580

gctcttcaac taccctaaat ttcacaaatg taacttataa cactatgaaa agatttgacc     5640

aacaatttac gtttgctgtg tgctttagtt tttgtttaag catattcttt tgcttgaatt     5700

tctgtgttca tgagagttag ggtgttttat gcttcttgaa ctaattttat aacatattta     5760

atatattacc agttaagata taaaatcatt tgtacatagc gaattgtaaa gcagctatta     5820

aagtaggtga aataaagtat atatttgccg gttatccata tcttttagaa gtcctgacag     5880

aacaaccagt ttatttgcac ataggtagct tctgtttgaa ggaaggtaaa gttataagga     5940

aactcaaata ctataagatg tgtcaaggta tttctccaga attaattgca aagctagtgc     6000

tgaaggattt taatcagctt ctaaaatttt cttctcaata aggcatatgt tttgattact     6060

tagggaagat tcctcatttt tatttgccct ttatgcattt aatccacatg ataggacatt     6120

aaaaattaat ataaagaaaa atcgtgctca tactgtacat ctgtttctgt gcttggaact     6180

acttgttaat agtttttatc gaagctgtca gcaataaggg acataaaact gctgtattat     6240

acattgtgga attgaataaa cagcctaatt ttttttttct agtatagggt acttaagcat     6300

ttccactttt ggaagaaaag tgtattagta ttttatattg catttcattt aaaaggacag     6360

tttttttttt ttttttgtaa atccattcat tgaaatggtt tctaaactgt ataatgtaat     6420

ttggagccta tttagtaata gaattaaatg tcctatgtag tgctacaatt tttgaattag     6480

aaagtgatca aatgtaagaa aaaaatttaa aaattcagcc cagaaaacaa aatagtgtat     6540

taaattagtt taatgtaaaa ggaatttata agattttttt cctcaatata gatacctcac     6600

ttgaaaagaa agcacagcat acttaaagta gttctagtaa acatgtccta gaaaacagtt     6660

gctaaatgta ggacatcttt tgaggaatta gtttatgaga aataaaattt tacttgtttt     6720

tactatcctg ttagaagtat ttgtttatcc tgataatttt aagccaacat agtagtctta     6780

aattactttt gaatttctaa tctgtgaagg cagtaaatga aatatctgtt ctgcaactgt     6840

tgaaacaaat aattggctac attgaccata attaaagtta aaattttgcc aatgatgtac     6900

agttttatgg ttaaagttgc tgtggttggt tgcattacat gacacagaaa actgtcctct     6960

acctcacgtg aaataaatat tttatatggt tttactaaaa ataagactca tgtatctggt     7020

cacctagttt acaaattttg aattatattt attgaaacat gacatactgt gctctgagct     7080

tatacctcaa ttgtattttg tgctgttttc cattttcatg ccttgtaaat aacttgtata     7140

gattgtggat caaatactaa ataaaaactt ttaatgccaa ttaaatttga ttcaagttaa     7200

aa                                                                    7202


<210>  165
<211>  684
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human SnoN (alias SKIL) polypeptide encoded by mRNA transcript 
       variant 1; GenBank Accession No. NP_005405.2, GI:223029418

<400>  165

Met Glu Asn Leu Gln Thr Asn Phe Ser Leu Val Gln Gly Ser Thr Lys 
1               5                   10                  15      


Lys Leu Asn Gly Met Gly Asp Asp Gly Ser Pro Pro Ala Lys Lys Met 
            20                  25                  30          


Ile Thr Asp Ile His Ala Asn Gly Lys Thr Ile Asn Lys Val Pro Thr 
        35                  40                  45              


Val Lys Lys Glu His Leu Asp Asp Tyr Gly Glu Ala Pro Val Glu Thr 
    50                  55                  60                  


Asp Gly Glu His Val Lys Arg Thr Cys Thr Ser Val Pro Glu Thr Leu 
65                  70                  75                  80  


His Leu Asn Pro Ser Leu Lys His Thr Leu Ala Gln Phe His Leu Ser 
                85                  90                  95      


Ser Gln Ser Ser Leu Gly Gly Pro Ala Ala Phe Ser Ala Arg His Ser 
            100                 105                 110         


Gln Glu Ser Met Ser Pro Thr Val Phe Leu Pro Leu Pro Ser Pro Gln 
        115                 120                 125             


Val Leu Pro Gly Pro Leu Leu Ile Pro Ser Asp Ser Ser Thr Glu Leu 
    130                 135                 140                 


Thr Gln Thr Val Leu Glu Gly Glu Ser Ile Ser Cys Phe Gln Val Gly 
145                 150                 155                 160 


Gly Glu Lys Arg Leu Cys Leu Pro Gln Val Leu Asn Ser Val Leu Arg 
                165                 170                 175     


Glu Phe Thr Leu Gln Gln Ile Asn Thr Val Cys Asp Glu Leu Tyr Ile 
            180                 185                 190         


Tyr Cys Ser Arg Cys Thr Ser Asp Gln Leu His Ile Leu Lys Val Leu 
        195                 200                 205             


Gly Ile Leu Pro Phe Asn Ala Pro Ser Cys Gly Leu Ile Thr Leu Thr 
    210                 215                 220                 


Asp Ala Gln Arg Leu Cys Asn Ala Leu Leu Arg Pro Arg Thr Phe Pro 
225                 230                 235                 240 


Gln Asn Gly Ser Val Leu Pro Ala Lys Ser Ser Leu Ala Gln Leu Lys 
                245                 250                 255     


Glu Thr Gly Ser Ala Phe Glu Val Glu His Glu Cys Leu Gly Lys Cys 
            260                 265                 270         


Gln Gly Leu Phe Ala Pro Gln Phe Tyr Val Gln Pro Asp Ala Pro Cys 
        275                 280                 285             


Ile Gln Cys Leu Glu Cys Cys Gly Met Phe Ala Pro Gln Thr Phe Val 
    290                 295                 300                 


Met His Ser His Arg Ser Pro Asp Lys Arg Thr Cys His Trp Gly Phe 
305                 310                 315                 320 


Glu Ser Ala Lys Trp His Cys Tyr Leu His Val Asn Gln Lys Tyr Leu 
                325                 330                 335     


Gly Thr Pro Glu Glu Lys Lys Leu Lys Ile Ile Leu Glu Glu Met Lys 
            340                 345                 350         


Glu Lys Phe Ser Met Arg Ser Gly Lys Arg Asn Gln Ser Lys Thr Asp 
        355                 360                 365             


Ala Pro Ser Gly Met Glu Leu Gln Ser Trp Tyr Pro Val Ile Lys Gln 
    370                 375                 380                 


Glu Gly Asp His Val Ser Gln Thr His Ser Phe Leu His Pro Ser Tyr 
385                 390                 395                 400 


Tyr Leu Tyr Met Cys Asp Lys Val Val Ala Pro Asn Val Ser Leu Thr 
                405                 410                 415     


Ser Ala Val Ser Gln Ser Lys Glu Leu Thr Lys Thr Glu Ala Ser Lys 
            420                 425                 430         


Ser Ile Ser Arg Gln Ser Glu Lys Ala His Ser Ser Gly Lys Leu Gln 
        435                 440                 445             


Lys Thr Val Ser Tyr Pro Asp Val Ser Leu Glu Glu Gln Glu Lys Met 
    450                 455                 460                 


Asp Leu Lys Thr Ser Arg Glu Leu Cys Ser Arg Leu Asp Ala Ser Ile 
465                 470                 475                 480 


Ser Asn Asn Ser Thr Ser Lys Arg Lys Ser Glu Ser Ala Thr Cys Asn 
                485                 490                 495     


Leu Val Arg Asp Ile Asn Lys Val Gly Ile Gly Leu Val Ala Ala Ala 
            500                 505                 510         


Ser Ser Pro Leu Leu Val Lys Asp Val Ile Cys Glu Asp Asp Lys Gly 
        515                 520                 525             


Lys Ile Met Glu Glu Val Met Arg Thr Tyr Leu Lys Gln Gln Glu Lys 
    530                 535                 540                 


Leu Asn Leu Ile Leu Gln Lys Lys Gln Gln Leu Gln Met Glu Val Lys 
545                 550                 555                 560 


Met Leu Ser Ser Ser Lys Ser Met Lys Glu Leu Thr Glu Glu Gln Gln 
                565                 570                 575     


Asn Leu Gln Lys Glu Leu Glu Ser Leu Gln Asn Glu His Ala Gln Arg 
            580                 585                 590         


Met Glu Glu Phe Tyr Val Glu Gln Lys Asp Leu Glu Lys Lys Leu Glu 
        595                 600                 605             


Gln Ile Met Lys Gln Lys Cys Thr Cys Asp Ser Asn Leu Glu Lys Asp 
    610                 615                 620                 


Lys Glu Ala Glu Tyr Ala Gly Gln Leu Ala Glu Leu Arg Gln Arg Leu 
625                 630                 635                 640 


Asp His Ala Glu Ala Asp Arg Gln Glu Leu Gln Asp Glu Leu Arg Gln 
                645                 650                 655     


Glu Arg Glu Ala Arg Gln Lys Leu Glu Met Met Ile Lys Glu Leu Lys 
            660                 665                 670         


Leu Gln Ile Leu Lys Ser Ser Lys Thr Ala Lys Glu 
        675                 680                 


<210>  166
<211>  7061
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens SKI-like proto-oncogene (SKIL), transcript variant 
       2, mRNA
       NCBI Reference Sequence: NM_001145097.2

<400>  166
ggtttcaaat tggccctttg gcctctggag caaattcaaa tgtaactctt ccccaacccc       60

ccttctcttc ttccagatta attaaaagaa gaatgaacta taatccttga agataactgg      120

gcaatttttt aagtcggagg ctgttcttac tggtgtgagg atttacacac gtcttcagtt      180

tttcagcaca gaccagcaga ccatcatttt tagaggaaat actccctctg ccctcctttt      240

tggtttcctt ggtggtaaag attaaatttg gttgcatcat tttgacttgt gtttgagtct      300

agattttatg gcacaaggaa tggcataaac ttttcatgtg ttttggttaa aacaaaccag      360

accattgcat tgaccctgga catctttaat tgagaaattg gtaactttat tttaatatgt      420

atatctgaag aattcaagaa aacaaaggca tcctcagagg tgtgcctctt ttctttatta      480

ttagaggcaa aacgaacaat tttataggat ttgtagtgaa attataccag attataagga      540

gaaccaaaac taagtcgcaa aatttattaa tttaaggggc tctcgctttg aaagtttgag      600

agtaagttac gataggcatt tgtatccatt cattactttc ctcttttcaa ataagcaact      660

aaatagaaat gctaatctca gacttaatta tttaacagaa gagtgtacca tggaaaacct      720

ccagacaaat ttctccttgg ttcagggctc aactaaaaaa ctgaatggga tgggagatga      780

tggcagcccc ccagcgaaaa aaatgataac ggacattcat gcaaatggaa aaacgataaa      840

caaggtgcca acagttaaga aggaacactt ggatgactat ggagaagcac cagtggaaac      900

tgatggagag catgttaagc gaacctgtac ttctgttcct gaaactttgc atttaaatcc      960

cagtttgaaa cacacattgg cacaattcca tttaagtagt cagagctcgc tgggtggacc     1020

agcagcattt tctgctcggc attcccaaga aagcatgtcg cctactgtat ttctgcctct     1080

tccatcacct caggttcttc ctggcccatt gctcatccct tcagatagct ccacagaact     1140

cactcagact gtgttggaag gggaatctat ttcttgtttt caagttggag gagaaaagag     1200

actctgtttg ccccaagtct taaattctgt tctccgagaa tttacactcc agcaaataaa     1260

tacagtgtgt gatgaactgt acatatattg ttcaaggtgt acttcagacc agcttcatat     1320

cttaaaggta ctgggcatac ttccattcaa tgccccatcc tgtgggctga ttacattaac     1380

tgatgcacaa agattatgta atgctttatt gcggccacga acttttcctc aaaatggtag     1440

cgtacttcct gctaaaagct cattggccca gttaaaggaa actggcagtg cctttgaagt     1500

ggagcatgaa tgcctaggca aatgtcaggg tttatttgca ccccagtttt atgttcagcc     1560

tgatgctccg tgtattcaat gtctggagtg ttgtggaatg tttgcacccc agacgtttgt     1620

gatgcattct cacagatcac ctgacaaaag aacttgccac tggggctttg aatcagctaa     1680

atggcattgc tatcttcatg tgaaccaaaa atacttagga acacctgaag aaaagaaact     1740

gaagataatt ttagaagaaa tgaaggagaa gtttagcatg agaagtggaa agagaaatca     1800

atccaagaca gatgcaccat caggaatgga attacagtca tggtatcctg ttataaagca     1860

ggaaggtgac catgtttctc agacacattc atttttacac cccagctact acttatacat     1920

gtgtgataaa gtggttgccc caaatgtgtc acttacttct gctgtatccc agtctaaaga     1980

gctcacaaag acagaggcaa atgcatcaat ctcaaataat tctacaagta aaaggaaatc     2040

tgagtctgcc acttgcaact tagtcagaga cataaacaaa gtgggaattg gccttgttgc     2100

tgccgcttca tctccgcttc ttgtgaaaga tgtcatttgt gaggatgata agggaaaaat     2160

catggaagaa gtaatgagaa cttatttaaa acaacaggaa aaactaaact tgattttgca     2220

aaagaagcaa caacttcaga tggaagtaaa aatgttgagt agttcaaaat ctatgaagga     2280

actcactgaa gaacagcaga atttacagaa agagcttgaa tctttgcaga atgaacatgc     2340

tcaaagaatg gaagaatttt atgttgaaca gaaagactta gagaaaaaat tggagcagat     2400

aatgaagcaa aaatgtacct gtgactcaaa tttagaaaaa gacaaagagg ctgaatatgc     2460

aggacagttg gcagaactga ggcagagatt ggaccatgct gaggccgata ggcaagaact     2520

ccaagatgaa ctcagacagg aacgggaagc aagacagaag ttagagatga tgataaaaga     2580

gctaaagctg caaattctga aatcatcaaa gactgctaaa gaatagaaac tgttaaagag     2640

attcatctgt gtattactga caaggttttt tttgtttgtt gcttgctttg gtaattgaat     2700

tctgaagaat ttatctgcat gacgataact aggcattcta tccatttgta gatcagagaa     2760

agtgaagaga ttatatatta gtacttaaat ttttacattt tccaaatgaa tgaaaatgta     2820

tgtttctttg tactttttta aaaaaatcag cttagtaaca atactatatg gtttcaacta     2880

gtaggtaatc tgcttatatt tctaatgcaa acttaacaat tgtgtacttt ttaaaagctg     2940

caatatgtgt tggaaaatag ctgtggtcaa ttttgttatc catatttcag actcaatttt     3000

agatacaatg gtggctttat attttaagta tatagagcta ctcaaggagt tgaatctccc     3060

cttttctcat taacacaatt tttctaagtt gatatggtgt actcattaac atacaccaaa     3120

tttactttta ctttgttcag attgtggaat gaatttccac cagttctctt ctttttaatg     3180

tgtaccctag gaggaatttt actgaggtta tagcataccc catgagcaca gtggggaaga     3240

agaatgtgtt gttatgtgct gctgctaaac agaagcagca gttgtaattt gtttttcagt     3300

ttaaatgtgg ttatagttag attttttttt aagcagcaac ttttcaaaaa taaaatgtga     3360

taatttctga acttttgttt gtgttgttaa tagtggtgtg aaaatattaa cgttcttgag     3420

aaaaactgat accactgttg tgtatcagtt tctatacaat ccataatcct cctgtacagt     3480

ttttacatgt agttatgagt cttactaaaa tttatataat ggacttgttt tcctttaagt     3540

tgtaaaatgt taaacacctt gaaggttatt ttggacttct gtatgtttaa atgttgtctt     3600

accaaaattt gcacgaatgg accattttca tttactactt aatatcaaaa tcaggaattt     3660

acagtcaact gatagtacat gataggtgca tataggacag tttagttacc tgctactaaa     3720

agatttttag ataagtttta gaagataaag gaattccata gtttcaggag ggacaacatc     3780

ttctgcactt tttttttgca cagaaaagtc tgtcattctc taatggcaaa tttcatattt     3840

gttaattctt ggctcaaaat atattaggta aaattcttag atctgttttt aaagggagtt     3900

tcctgaaact atcattaatt gacattatta ccccatggat tttatgggat aataaatgtt     3960

tttcatgttc tcttataaga tactatgtat gaaattactt cagagagcta tatttatttt     4020

aaaataaatt agctagggtt aaggttatat tctatttcca gcatagaagg tagataatct     4080

aatggtgtag aaagaatcac taggttgtca tttaaccagt tattttcata ttttgcttaa     4140

tagtacatat ccaaaaagaa ttttgtactt ccccaaatgt aatttattta ctaaattgag     4200

tataacctaa atgtgtgttt tctattttcc atttaaattt tgctatatta agactaattt     4260

aattcgttga gtcttggaat cttctcaagg aggaacaaat attaaaatga catgtagaaa     4320

caaatttttt tttttttttt tttttttttt tttttgagac agagtctcgc tgtctcccag     4380

gctggagttc agtggtgcaa tctcggctca ctgcaagctc tgcctcctgg gttcaagcca     4440

ttctcctgcc tcagcctccc gagtagctgg gactacaggc acccaccacc actcccggct     4500

aatttttaga aacaaatatt taaaatgaca tattctccca atacaatcta tttagatctg     4560

gagaaggaaa aatcagatat ttatgatata gttttatttt aattttgaat tatttgtgtc     4620

acagctcagc tttttggaag acaaactcaa acacctataa tttcatttat atttctaatt     4680

cacttggaac ctttctgctt tatgttacct agaaaatgat aatttgttta acccaaaact     4740

tctaaaataa attgcttaat ccttgaaata tgttattgga aaattttaag cagtgcttaa     4800

acaccattaa attattatga acttgtaatt cagaattgag taaagaaata ttttttctag     4860

tccttcatat attgaaaact tgccacatga cattgtatcg tcttcatttt ccagaagatg     4920

cgttggtgtg ccataggttt ctaacttcct tgaaaatagt tttttaagtc aattgtaaat     4980

atacgtatta ttgttaaaag taactttaaa ctgcaacaca tagcttcaaa acaatataga     5040

gattttgtaa taccttataa gtggagttgg ctaaaatacc ttatccatat aaaacttatt     5100

ctattctttg catgcttatt ttgtgtgttg gttgctagct taaagtttga tttgttgtta     5160

ctctttgtgt gccaaattca ctaggcaagc ggatttttcc tcagacttca aaaaataatt     5220

cttttaagaa aaaatgtaaa aatgtttatt ctaaaaagct gcattaaagg gacaacctat     5280

aaaaagtttt gctagctcat ctttagaagg aagaaagaat attagcttgg gtgatgttta     5340

atttgggtgg cgatagtttc tgtaggctaa actttatgag aaaagtgtac ctactctata     5400

aaggtaataa atgtaaaacc tcttgctgtt attgaggaag ctcttcaact accctaaatt     5460

tcacaaatgt aacttataac actatgaaaa gatttgacca acaatttacg tttgctgtgt     5520

gctttagttt ttgtttaagc atattctttt gcttgaattt ctgtgttcat gagagttagg     5580

gtgttttatg cttcttgaac taattttata acatatttaa tatattacca gttaagatat     5640

aaaatcattt gtacatagcg aattgtaaag cagctattaa agtaggtgaa ataaagtata     5700

tatttgccgg ttatccatat cttttagaag tcctgacaga acaaccagtt tatttgcaca     5760

taggtagctt ctgtttgaag gaaggtaaag ttataaggaa actcaaatac tataagatgt     5820

gtcaaggtat ttctccagaa ttaattgcaa agctagtgct gaaggatttt aatcagcttc     5880

taaaattttc ttctcaataa ggcatatgtt ttgattactt agggaagatt cctcattttt     5940

atttgccctt tatgcattta atccacatga taggacatta aaaattaata taaagaaaaa     6000

tcgtgctcat actgtacatc tgtttctgtg cttggaacta cttgttaata gtttttatcg     6060

aagctgtcag caataaggga cataaaactg ctgtattata cattgtggaa ttgaataaac     6120

agcctaattt tttttttcta gtatagggta cttaagcatt tccacttttg gaagaaaagt     6180

gtattagtat tttatattgc atttcattta aaaggacagt tttttttttt tttttgtaaa     6240

tccattcatt gaaatggttt ctaaactgta taatgtaatt tggagcctat ttagtaatag     6300

aattaaatgt cctatgtagt gctacaattt ttgaattaga aagtgatcaa atgtaagaaa     6360

aaaatttaaa aattcagccc agaaaacaaa atagtgtatt aaattagttt aatgtaaaag     6420

gaatttataa gatttttttc ctcaatatag atacctcact tgaaaagaaa gcacagcata     6480

cttaaagtag ttctagtaaa catgtcctag aaaacagttg ctaaatgtag gacatctttt     6540

gaggaattag tttatgagaa ataaaatttt acttgttttt actatcctgt tagaagtatt     6600

tgtttatcct gataatttta agccaacata gtagtcttaa attacttttg aatttctaat     6660

ctgtgaaggc agtaaatgaa atatctgttc tgcaactgtt gaaacaaata attggctaca     6720

ttgaccataa ttaaagttaa aattttgcca atgatgtaca gttttatggt taaagttgct     6780

gtggttggtt gcattacatg acacagaaaa ctgtcctcta cctcacgtga aataaatatt     6840

ttatatggtt ttactaaaaa taagactcat gtatctggtc acctagttta caaattttga     6900

attatattta ttgaaacatg acatactgtg ctctgagctt atacctcaat tgtattttgt     6960

gctgttttcc attttcatgc cttgtaaata acttgtatag attgtggatc aaatactaaa     7020

taaaaacttt taatgccaat taaatttgat tcaagttaaa a                         7061


<210>  167
<211>  638
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens SKI-like proto-oncogene (SKIL), transcript variant 
       2, polypeptide
       NCBI Reference Sequence: NM_001145097.2

<400>  167

Met Glu Asn Leu Gln Thr Asn Phe Ser Leu Val Gln Gly Ser Thr Lys 
1               5                   10                  15      


Lys Leu Asn Gly Met Gly Asp Asp Gly Ser Pro Pro Ala Lys Lys Met 
            20                  25                  30          


Ile Thr Asp Ile His Ala Asn Gly Lys Thr Ile Asn Lys Val Pro Thr 
        35                  40                  45              


Val Lys Lys Glu His Leu Asp Asp Tyr Gly Glu Ala Pro Val Glu Thr 
    50                  55                  60                  


Asp Gly Glu His Val Lys Arg Thr Cys Thr Ser Val Pro Glu Thr Leu 
65                  70                  75                  80  


His Leu Asn Pro Ser Leu Lys His Thr Leu Ala Gln Phe His Leu Ser 
                85                  90                  95      


Ser Gln Ser Ser Leu Gly Gly Pro Ala Ala Phe Ser Ala Arg His Ser 
            100                 105                 110         


Gln Glu Ser Met Ser Pro Thr Val Phe Leu Pro Leu Pro Ser Pro Gln 
        115                 120                 125             


Val Leu Pro Gly Pro Leu Leu Ile Pro Ser Asp Ser Ser Thr Glu Leu 
    130                 135                 140                 


Thr Gln Thr Val Leu Glu Gly Glu Ser Ile Ser Cys Phe Gln Val Gly 
145                 150                 155                 160 


Gly Glu Lys Arg Leu Cys Leu Pro Gln Val Leu Asn Ser Val Leu Arg 
                165                 170                 175     


Glu Phe Thr Leu Gln Gln Ile Asn Thr Val Cys Asp Glu Leu Tyr Ile 
            180                 185                 190         


Tyr Cys Ser Arg Cys Thr Ser Asp Gln Leu His Ile Leu Lys Val Leu 
        195                 200                 205             


Gly Ile Leu Pro Phe Asn Ala Pro Ser Cys Gly Leu Ile Thr Leu Thr 
    210                 215                 220                 


Asp Ala Gln Arg Leu Cys Asn Ala Leu Leu Arg Pro Arg Thr Phe Pro 
225                 230                 235                 240 


Gln Asn Gly Ser Val Leu Pro Ala Lys Ser Ser Leu Ala Gln Leu Lys 
                245                 250                 255     


Glu Thr Gly Ser Ala Phe Glu Val Glu His Glu Cys Leu Gly Lys Cys 
            260                 265                 270         


Gln Gly Leu Phe Ala Pro Gln Phe Tyr Val Gln Pro Asp Ala Pro Cys 
        275                 280                 285             


Ile Gln Cys Leu Glu Cys Cys Gly Met Phe Ala Pro Gln Thr Phe Val 
    290                 295                 300                 


Met His Ser His Arg Ser Pro Asp Lys Arg Thr Cys His Trp Gly Phe 
305                 310                 315                 320 


Glu Ser Ala Lys Trp His Cys Tyr Leu His Val Asn Gln Lys Tyr Leu 
                325                 330                 335     


Gly Thr Pro Glu Glu Lys Lys Leu Lys Ile Ile Leu Glu Glu Met Lys 
            340                 345                 350         


Glu Lys Phe Ser Met Arg Ser Gly Lys Arg Asn Gln Ser Lys Thr Asp 
        355                 360                 365             


Ala Pro Ser Gly Met Glu Leu Gln Ser Trp Tyr Pro Val Ile Lys Gln 
    370                 375                 380                 


Glu Gly Asp His Val Ser Gln Thr His Ser Phe Leu His Pro Ser Tyr 
385                 390                 395                 400 


Tyr Leu Tyr Met Cys Asp Lys Val Val Ala Pro Asn Val Ser Leu Thr 
                405                 410                 415     


Ser Ala Val Ser Gln Ser Lys Glu Leu Thr Lys Thr Glu Ala Asn Ala 
            420                 425                 430         


Ser Ile Ser Asn Asn Ser Thr Ser Lys Arg Lys Ser Glu Ser Ala Thr 
        435                 440                 445             


Cys Asn Leu Val Arg Asp Ile Asn Lys Val Gly Ile Gly Leu Val Ala 
    450                 455                 460                 


Ala Ala Ser Ser Pro Leu Leu Val Lys Asp Val Ile Cys Glu Asp Asp 
465                 470                 475                 480 


Lys Gly Lys Ile Met Glu Glu Val Met Arg Thr Tyr Leu Lys Gln Gln 
                485                 490                 495     


Glu Lys Leu Asn Leu Ile Leu Gln Lys Lys Gln Gln Leu Gln Met Glu 
            500                 505                 510         


Val Lys Met Leu Ser Ser Ser Lys Ser Met Lys Glu Leu Thr Glu Glu 
        515                 520                 525             


Gln Gln Asn Leu Gln Lys Glu Leu Glu Ser Leu Gln Asn Glu His Ala 
    530                 535                 540                 


Gln Arg Met Glu Glu Phe Tyr Val Glu Gln Lys Asp Leu Glu Lys Lys 
545                 550                 555                 560 


Leu Glu Gln Ile Met Lys Gln Lys Cys Thr Cys Asp Ser Asn Leu Glu 
                565                 570                 575     


Lys Asp Lys Glu Ala Glu Tyr Ala Gly Gln Leu Ala Glu Leu Arg Gln 
            580                 585                 590         


Arg Leu Asp His Ala Glu Ala Asp Arg Gln Glu Leu Gln Asp Glu Leu 
        595                 600                 605             


Arg Gln Glu Arg Glu Ala Arg Gln Lys Leu Glu Met Met Ile Lys Glu 
    610                 615                 620                 


Leu Lys Leu Gln Ile Leu Lys Ser Ser Lys Thr Ala Lys Glu 
625                 630                 635             


<210>  168
<211>  6711
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens SKI-like proto-oncogene (SKIL), transcript variant 
       3, mRNA
       NCBI Reference Sequence: NM_001145098.2

<400>  168
gatgtgtgtg gggttcggag ccgcgccggc acagccgaag ggagcgggcg agcggcgacg       60

gcggcggcgg cgggcacaga ttaattaaaa gaagaatgaa ctataatcct tgaagataac      120

tgggcaattt tttaagtcgg aggctgttct tactggtgtg aggatttaca cacgtcttca      180

gtttttcagc acagaccagc agaccatcat ttttagagga aatactccct ctgccctcct      240

ttttggtttc cttggtgggc tcaactaaaa aactgaatgg gatgggagat gatggcagcc      300

ccccagcgaa aaaaatgata acggacattc atgcaaatgg aaaaacgata aacaaggtgc      360

caacagttaa gaaggaacac ttggatgact atggagaagc accagtggaa actgatggag      420

agcatgttaa gcgaacctgt acttctgttc ctgaaacttt gcatttaaat cccagtttga      480

aacacacatt ggcacaattc catttaagta gtcagagctc gctgggtgga ccagcagcat      540

tttctgctcg gcattcccaa gaaagcatgt cgcctactgt atttctgcct cttccatcac      600

ctcaggttct tcctggccca ttgctcatcc cttcagatag ctccacagaa ctcactcaga      660

ctgtgttgga aggggaatct atttcttgtt ttcaagttgg aggagaaaag agactctgtt      720

tgccccaagt cttaaattct gttctccgag aatttacact ccagcaaata aatacagtgt      780

gtgatgaact gtacatatat tgttcaaggt gtacttcaga ccagcttcat atcttaaagg      840

tactgggcat acttccattc aatgccccat cctgtgggct gattacatta actgatgcac      900

aaagattatg taatgcttta ttgcggccac gaacttttcc tcaaaatggt agcgtacttc      960

ctgctaaaag ctcattggcc cagttaaagg aaactggcag tgcctttgaa gtggagcatg     1020

aatgcctagg caaatgtcag ggtttatttg caccccagtt ttatgttcag cctgatgctc     1080

cgtgtattca atgtctggag tgttgtggaa tgtttgcacc ccagacgttt gtgatgcatt     1140

ctcacagatc acctgacaaa agaacttgcc actggggctt tgaatcagct aaatggcatt     1200

gctatcttca tgtgaaccaa aaatacttag gaacacctga agaaaagaaa ctgaagataa     1260

ttttagaaga aatgaaggag aagtttagca tgagaagtgg aaagagaaat caatccaaga     1320

cagatgcacc atcaggaatg gaattacagt catggtatcc tgttataaag caggaaggtg     1380

accatgtttc tcagacacat tcatttttac accccagcta ctacttatac atgtgtgata     1440

aagtggttgc cccaaatgtg tcacttactt ctgctgtatc ccagtctaaa gagctcacaa     1500

agacagaggc aagtaagtcc atatcaagac agtcagagaa ggctcacagt agtggtaaac     1560

ttcaaaaaac agtgtcttat ccagatgtct cacttgagga acaggagaaa atggatttaa     1620

aaacaagtag agaattatgt agccgtttag atgcatcaat ctcaaataat tctacaagta     1680

aaaggaaatc tgagtctgcc acttgcaact tagtcagaga cataaacaaa gtgggaattg     1740

gccttgttgc tgccgcttca tctccgcttc ttgtgaaaga tgtcatttgt gaggatgata     1800

agggaaaaat catggaagaa gtaatgagaa cttatttaaa acaacaggaa aaactaaact     1860

tgattttgca aaagaagcaa caacttcaga tggaagtaaa aatgttgagt agttcaaaat     1920

ctatgaagga actcactgaa gaacagcaga atttacagaa agagcttgaa tctttgcaga     1980

atgaacatgc tcaaagaatg gaagaatttt atgttgaaca gaaagactta gagaaaaaat     2040

tggagcagat aatgaagcaa aaatgtacct gtgactcaaa tttagaaaaa gacaaagagg     2100

ctgaatatgc aggacagttg gcagaactga ggcagagatt ggaccatgct gaggccgata     2160

ggcaagaact ccaagatgaa ctcagacagg aacgggaagc aagacagaag ttagagatga     2220

tgataaaaga gctaaagctg caaattctga aatcatcaaa gactgctaaa gaatagaaac     2280

tgttaaagag attcatctgt gtattactga caaggttttt tttgtttgtt gcttgctttg     2340

gtaattgaat tctgaagaat ttatctgcat gacgataact aggcattcta tccatttgta     2400

gatcagagaa agtgaagaga ttatatatta gtacttaaat ttttacattt tccaaatgaa     2460

tgaaaatgta tgtttctttg tactttttta aaaaaatcag cttagtaaca atactatatg     2520

gtttcaacta gtaggtaatc tgcttatatt tctaatgcaa acttaacaat tgtgtacttt     2580

ttaaaagctg caatatgtgt tggaaaatag ctgtggtcaa ttttgttatc catatttcag     2640

actcaatttt agatacaatg gtggctttat attttaagta tatagagcta ctcaaggagt     2700

tgaatctccc cttttctcat taacacaatt tttctaagtt gatatggtgt actcattaac     2760

atacaccaaa tttactttta ctttgttcag attgtggaat gaatttccac cagttctctt     2820

ctttttaatg tgtaccctag gaggaatttt actgaggtta tagcataccc catgagcaca     2880

gtggggaaga agaatgtgtt gttatgtgct gctgctaaac agaagcagca gttgtaattt     2940

gtttttcagt ttaaatgtgg ttatagttag attttttttt aagcagcaac ttttcaaaaa     3000

taaaatgtga taatttctga acttttgttt gtgttgttaa tagtggtgtg aaaatattaa     3060

cgttcttgag aaaaactgat accactgttg tgtatcagtt tctatacaat ccataatcct     3120

cctgtacagt ttttacatgt agttatgagt cttactaaaa tttatataat ggacttgttt     3180

tcctttaagt tgtaaaatgt taaacacctt gaaggttatt ttggacttct gtatgtttaa     3240

atgttgtctt accaaaattt gcacgaatgg accattttca tttactactt aatatcaaaa     3300

tcaggaattt acagtcaact gatagtacat gataggtgca tataggacag tttagttacc     3360

tgctactaaa agatttttag ataagtttta gaagataaag gaattccata gtttcaggag     3420

ggacaacatc ttctgcactt tttttttgca cagaaaagtc tgtcattctc taatggcaaa     3480

tttcatattt gttaattctt ggctcaaaat atattaggta aaattcttag atctgttttt     3540

aaagggagtt tcctgaaact atcattaatt gacattatta ccccatggat tttatgggat     3600

aataaatgtt tttcatgttc tcttataaga tactatgtat gaaattactt cagagagcta     3660

tatttatttt aaaataaatt agctagggtt aaggttatat tctatttcca gcatagaagg     3720

tagataatct aatggtgtag aaagaatcac taggttgtca tttaaccagt tattttcata     3780

ttttgcttaa tagtacatat ccaaaaagaa ttttgtactt ccccaaatgt aatttattta     3840

ctaaattgag tataacctaa atgtgtgttt tctattttcc atttaaattt tgctatatta     3900

agactaattt aattcgttga gtcttggaat cttctcaagg aggaacaaat attaaaatga     3960

catgtagaaa caaatttttt tttttttttt tttttttttt tttttgagac agagtctcgc     4020

tgtctcccag gctggagttc agtggtgcaa tctcggctca ctgcaagctc tgcctcctgg     4080

gttcaagcca ttctcctgcc tcagcctccc gagtagctgg gactacaggc acccaccacc     4140

actcccggct aatttttaga aacaaatatt taaaatgaca tattctccca atacaatcta     4200

tttagatctg gagaaggaaa aatcagatat ttatgatata gttttatttt aattttgaat     4260

tatttgtgtc acagctcagc tttttggaag acaaactcaa acacctataa tttcatttat     4320

atttctaatt cacttggaac ctttctgctt tatgttacct agaaaatgat aatttgttta     4380

acccaaaact tctaaaataa attgcttaat ccttgaaata tgttattgga aaattttaag     4440

cagtgcttaa acaccattaa attattatga acttgtaatt cagaattgag taaagaaata     4500

ttttttctag tccttcatat attgaaaact tgccacatga cattgtatcg tcttcatttt     4560

ccagaagatg cgttggtgtg ccataggttt ctaacttcct tgaaaatagt tttttaagtc     4620

aattgtaaat atacgtatta ttgttaaaag taactttaaa ctgcaacaca tagcttcaaa     4680

acaatataga gattttgtaa taccttataa gtggagttgg ctaaaatacc ttatccatat     4740

aaaacttatt ctattctttg catgcttatt ttgtgtgttg gttgctagct taaagtttga     4800

tttgttgtta ctctttgtgt gccaaattca ctaggcaagc ggatttttcc tcagacttca     4860

aaaaataatt cttttaagaa aaaatgtaaa aatgtttatt ctaaaaagct gcattaaagg     4920

gacaacctat aaaaagtttt gctagctcat ctttagaagg aagaaagaat attagcttgg     4980

gtgatgttta atttgggtgg cgatagtttc tgtaggctaa actttatgag aaaagtgtac     5040

ctactctata aaggtaataa atgtaaaacc tcttgctgtt attgaggaag ctcttcaact     5100

accctaaatt tcacaaatgt aacttataac actatgaaaa gatttgacca acaatttacg     5160

tttgctgtgt gctttagttt ttgtttaagc atattctttt gcttgaattt ctgtgttcat     5220

gagagttagg gtgttttatg cttcttgaac taattttata acatatttaa tatattacca     5280

gttaagatat aaaatcattt gtacatagcg aattgtaaag cagctattaa agtaggtgaa     5340

ataaagtata tatttgccgg ttatccatat cttttagaag tcctgacaga acaaccagtt     5400

tatttgcaca taggtagctt ctgtttgaag gaaggtaaag ttataaggaa actcaaatac     5460

tataagatgt gtcaaggtat ttctccagaa ttaattgcaa agctagtgct gaaggatttt     5520

aatcagcttc taaaattttc ttctcaataa ggcatatgtt ttgattactt agggaagatt     5580

cctcattttt atttgccctt tatgcattta atccacatga taggacatta aaaattaata     5640

taaagaaaaa tcgtgctcat actgtacatc tgtttctgtg cttggaacta cttgttaata     5700

gtttttatcg aagctgtcag caataaggga cataaaactg ctgtattata cattgtggaa     5760

ttgaataaac agcctaattt tttttttcta gtatagggta cttaagcatt tccacttttg     5820

gaagaaaagt gtattagtat tttatattgc atttcattta aaaggacagt tttttttttt     5880

tttttgtaaa tccattcatt gaaatggttt ctaaactgta taatgtaatt tggagcctat     5940

ttagtaatag aattaaatgt cctatgtagt gctacaattt ttgaattaga aagtgatcaa     6000

atgtaagaaa aaaatttaaa aattcagccc agaaaacaaa atagtgtatt aaattagttt     6060

aatgtaaaag gaatttataa gatttttttc ctcaatatag atacctcact tgaaaagaaa     6120

gcacagcata cttaaagtag ttctagtaaa catgtcctag aaaacagttg ctaaatgtag     6180

gacatctttt gaggaattag tttatgagaa ataaaatttt acttgttttt actatcctgt     6240

tagaagtatt tgtttatcct gataatttta agccaacata gtagtcttaa attacttttg     6300

aatttctaat ctgtgaaggc agtaaatgaa atatctgttc tgcaactgtt gaaacaaata     6360

attggctaca ttgaccataa ttaaagttaa aattttgcca atgatgtaca gttttatggt     6420

taaagttgct gtggttggtt gcattacatg acacagaaaa ctgtcctcta cctcacgtga     6480

aataaatatt ttatatggtt ttactaaaaa taagactcat gtatctggtc acctagttta     6540

caaattttga attatattta ttgaaacatg acatactgtg ctctgagctt atacctcaat     6600

tgtattttgt gctgttttcc attttcatgc cttgtaaata acttgtatag attgtggatc     6660

aaatactaaa taaaaacttt taatgccaat taaatttgat tcaagttaaa a              6711


<210>  169
<211>  664
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens SKI-like proto-oncogene (SKIL), transcript variant 
       3, polypeptide NCBI Reference Sequence: NM_001145098.2

<400>  169

Met Gly Asp Asp Gly Ser Pro Pro Ala Lys Lys Met Ile Thr Asp Ile 
1               5                   10                  15      


His Ala Asn Gly Lys Thr Ile Asn Lys Val Pro Thr Val Lys Lys Glu 
            20                  25                  30          


His Leu Asp Asp Tyr Gly Glu Ala Pro Val Glu Thr Asp Gly Glu His 
        35                  40                  45              


Val Lys Arg Thr Cys Thr Ser Val Pro Glu Thr Leu His Leu Asn Pro 
    50                  55                  60                  


Ser Leu Lys His Thr Leu Ala Gln Phe His Leu Ser Ser Gln Ser Ser 
65                  70                  75                  80  


Leu Gly Gly Pro Ala Ala Phe Ser Ala Arg His Ser Gln Glu Ser Met 
                85                  90                  95      


Ser Pro Thr Val Phe Leu Pro Leu Pro Ser Pro Gln Val Leu Pro Gly 
            100                 105                 110         


Pro Leu Leu Ile Pro Ser Asp Ser Ser Thr Glu Leu Thr Gln Thr Val 
        115                 120                 125             


Leu Glu Gly Glu Ser Ile Ser Cys Phe Gln Val Gly Gly Glu Lys Arg 
    130                 135                 140                 


Leu Cys Leu Pro Gln Val Leu Asn Ser Val Leu Arg Glu Phe Thr Leu 
145                 150                 155                 160 


Gln Gln Ile Asn Thr Val Cys Asp Glu Leu Tyr Ile Tyr Cys Ser Arg 
                165                 170                 175     


Cys Thr Ser Asp Gln Leu His Ile Leu Lys Val Leu Gly Ile Leu Pro 
            180                 185                 190         


Phe Asn Ala Pro Ser Cys Gly Leu Ile Thr Leu Thr Asp Ala Gln Arg 
        195                 200                 205             


Leu Cys Asn Ala Leu Leu Arg Pro Arg Thr Phe Pro Gln Asn Gly Ser 
    210                 215                 220                 


Val Leu Pro Ala Lys Ser Ser Leu Ala Gln Leu Lys Glu Thr Gly Ser 
225                 230                 235                 240 


Ala Phe Glu Val Glu His Glu Cys Leu Gly Lys Cys Gln Gly Leu Phe 
                245                 250                 255     


Ala Pro Gln Phe Tyr Val Gln Pro Asp Ala Pro Cys Ile Gln Cys Leu 
            260                 265                 270         


Glu Cys Cys Gly Met Phe Ala Pro Gln Thr Phe Val Met His Ser His 
        275                 280                 285             


Arg Ser Pro Asp Lys Arg Thr Cys His Trp Gly Phe Glu Ser Ala Lys 
    290                 295                 300                 


Trp His Cys Tyr Leu His Val Asn Gln Lys Tyr Leu Gly Thr Pro Glu 
305                 310                 315                 320 


Glu Lys Lys Leu Lys Ile Ile Leu Glu Glu Met Lys Glu Lys Phe Ser 
                325                 330                 335     


Met Arg Ser Gly Lys Arg Asn Gln Ser Lys Thr Asp Ala Pro Ser Gly 
            340                 345                 350         


Met Glu Leu Gln Ser Trp Tyr Pro Val Ile Lys Gln Glu Gly Asp His 
        355                 360                 365             


Val Ser Gln Thr His Ser Phe Leu His Pro Ser Tyr Tyr Leu Tyr Met 
    370                 375                 380                 


Cys Asp Lys Val Val Ala Pro Asn Val Ser Leu Thr Ser Ala Val Ser 
385                 390                 395                 400 


Gln Ser Lys Glu Leu Thr Lys Thr Glu Ala Ser Lys Ser Ile Ser Arg 
                405                 410                 415     


Gln Ser Glu Lys Ala His Ser Ser Gly Lys Leu Gln Lys Thr Val Ser 
            420                 425                 430         


Tyr Pro Asp Val Ser Leu Glu Glu Gln Glu Lys Met Asp Leu Lys Thr 
        435                 440                 445             


Ser Arg Glu Leu Cys Ser Arg Leu Asp Ala Ser Ile Ser Asn Asn Ser 
    450                 455                 460                 


Thr Ser Lys Arg Lys Ser Glu Ser Ala Thr Cys Asn Leu Val Arg Asp 
465                 470                 475                 480 


Ile Asn Lys Val Gly Ile Gly Leu Val Ala Ala Ala Ser Ser Pro Leu 
                485                 490                 495     


Leu Val Lys Asp Val Ile Cys Glu Asp Asp Lys Gly Lys Ile Met Glu 
            500                 505                 510         


Glu Val Met Arg Thr Tyr Leu Lys Gln Gln Glu Lys Leu Asn Leu Ile 
        515                 520                 525             


Leu Gln Lys Lys Gln Gln Leu Gln Met Glu Val Lys Met Leu Ser Ser 
    530                 535                 540                 


Ser Lys Ser Met Lys Glu Leu Thr Glu Glu Gln Gln Asn Leu Gln Lys 
545                 550                 555                 560 


Glu Leu Glu Ser Leu Gln Asn Glu His Ala Gln Arg Met Glu Glu Phe 
                565                 570                 575     


Tyr Val Glu Gln Lys Asp Leu Glu Lys Lys Leu Glu Gln Ile Met Lys 
            580                 585                 590         


Gln Lys Cys Thr Cys Asp Ser Asn Leu Glu Lys Asp Lys Glu Ala Glu 
        595                 600                 605             


Tyr Ala Gly Gln Leu Ala Glu Leu Arg Gln Arg Leu Asp His Ala Glu 
    610                 615                 620                 


Ala Asp Arg Gln Glu Leu Gln Asp Glu Leu Arg Gln Glu Arg Glu Ala 
625                 630                 635                 640 


Arg Gln Lys Leu Glu Met Met Ile Lys Glu Leu Lys Leu Gln Ile Leu 
                645                 650                 655     


Lys Ser Ser Lys Thr Ala Lys Glu 
            660                 


<210>  170
<211>  7199
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens SKI-like proto-oncogene (SKIL), transcript variant 
       4, mRNA NCBI Reference Sequence: NM_001248008.1

<400>  170
ggtttcaaat tggccctttg gcctctggag caaattcaaa tgtaactctt ccccaacccc       60

ccttctcttc ttccagatta attaaaagaa gaatgaacta taatccttga agataactgg      120

gcaatttttt aagtcggagg ctgttcttac tggtgtgagg atttacacac gtcttcagtt      180

tttcagcaca gaccagcaga ccatcatttt tagaggaaat actccctctg ccctcctttt      240

tggtttcctt ggtggtaaag attaaatttg gttgcatcat tttgacttgt gtttgagtct      300

agattttatg gcacaaggaa tggcataaac ttttcatgtg ttttggttaa aacaaaccag      360

accattgcat tgaccctgga catctttaat tgagaaattg gtaactttat tttaatatgt      420

atatctgaag aattcaagaa aacaaaggca tcctcagagg tgtgcctctt ttctttatta      480

ttagaggcaa aacgaacaat tttataggat ttgtagtgaa attataccag attataagga      540

gaaccaaaac taagtcgcaa aatttattaa tttaaggggc tctcgctttg aaagtttgag      600

agtaagttac gataggcatt tgtatccatt cattactttc ctcttttcaa ataagcaact      660

aaatagaaat gctaatctca gacttaatta tttaacagaa gagtgtacca tggaaaacct      720

ccagacaaat ttctccttgg ttcagggctc aactaaaaaa ctgaatggga tgggagatga      780

tggcagcccc ccagcgaaaa aaatgataac ggacattcat gcaaatggaa aaacgataaa      840

caaggtgcca acagttaaga aggaacactt ggatgactat ggagaagcac cagtggaaac      900

tgatggagag catgttaagc gaacctgtac ttctgttcct gaaactttgc atttaaatcc      960

cagtttgaaa cacacattgg cacaattcca tttaagtagt cagagctcgc tgggtggacc     1020

agcagcattt tctgctcggc attcccaaga aagcatgtcg cctactgtat ttctgcctct     1080

tccatcacct caggttcttc ctggcccatt gctcatccct tcagatagct ccacagaact     1140

cactcagact gtgttggaag gggaatctat ttcttgtttt caagttggag gagaaaagag     1200

actctgtttg ccccaagtct taaattctgt tctccgagaa tttacactcc agcaaataaa     1260

tacagtgtgt gatgaactgt acatatattg ttcaaggtgt acttcagacc agcttcatat     1320

cttaaaggta ctgggcatac ttccattcaa tgccccatcc tgtgggctga ttacattaac     1380

tgatgcacaa agattatgta atgctttatt gcggccacga acttttcctc aaaatggtag     1440

cgtacttcct gctaaaagct cattggccca gttaaaggaa actggcagtg cctttgaagt     1500

ggagcatgaa tgcctaggca aatgtcaggg tttatttgca ccccagtttt atgttcagcc     1560

tgatgctccg tgtattcaat gtctggagtg ttgtggaatg tttgcacccc agacgtttgt     1620

gatgcattct cacagatcac ctgacaaaag aacttgccac tggggctttg aatcagctaa     1680

atggcattgc tatcttcatg tgaaccaaaa atacttagga acacctgaag aaaagaaact     1740

gaagataatt ttagaagaaa tgaaggagaa gtttagcatg agaagtggaa agagaaatca     1800

atccaagaca gatgcaccat caggaatgga attacagtca tggtatcctg ttataaagca     1860

ggaaggtgac catgtttctc agacacattc atttttacac cccagctact acttatacat     1920

gtgtgataaa gtggttgccc caaatgtgtc acttacttct gctgtatccc agtctaaaga     1980

gctcacaaag acagaggcaa gtaagtccat atcaagacag tcagagaagg ctcacagtag     2040

tggtaaactt caaaaaacag tgtcttatcc agatgtctca cttgaggaac aggagaaaat     2100

ggatttaaaa acaagtagag aattatgtag ccgtttagat gcatcaatct caaataattc     2160

tacaagtaaa aggaaatctg agtctgccac ttgcaactta gtcagagaca taaacaaagt     2220

gggaattggc cttgttgctg ccgcttcatc tccgcttctt gtgaaagatg tcatttgtga     2280

ggatgataag ggaaaaatca tggaagaagt aatgagaact tatttaaaac aacaggaaaa     2340

actaaacttg attttgcaaa agaagcaaca acttcagatg gaagtaaaaa tgttgagtag     2400

ttcaaaatct atgaaggaac tcactgaaga acagcagaat ttacagaaag agcttgaatc     2460

tttgcagaat gaacatgctc aaagaatgga agaattttat gttgaacaga aagacttaga     2520

gaaaaaattg gagcagataa tgaagcaaaa atgtacctgt gactcaaatt tagaaaaaga     2580

caaagaggct gaatatgcag gacagttggc agaactgagg cagagattgg accatgctga     2640

ggccgatagg caagaactcc aagatgaact cagacaggaa cgggaagcaa gacagaagtt     2700

agagatgatg ataaaagagc taaagctgca aattctgaaa tcatcaaaga ctgctaaaga     2760

atagaaactg ttaaagagat tcatctgtgt attactgaca aggttttttt tgtttgttgc     2820

ttgctttggt aattgaattc tgaagaattt atctgcatga cgataactag gcattctatc     2880

catttgtaga tcagagaaag tgaagagatt atatattagt acttaaattt ttacattttc     2940

caaatgaatg aaaatgtatg tttctttgta cttttttaaa aaaatcagct tagtaacaat     3000

actatatggt ttcaactagt aggtaatctg cttatatttc taatgcaaac ttaacaattg     3060

tgtacttttt aaaagctgca atatgtgttg gaaaatagct gtggtcaatt ttgttatcca     3120

tatttcagac tcaattttag atacaatggt ggctttatat tttaagtata tagagctact     3180

caaggagttg aatctcccct tttctcatta acacaatttt tctaagttga tatggtgtac     3240

tcattaacat acaccaaatt tacttttact ttgttcagat tgtggaatga atttccacca     3300

gttctcttct ttttaatgtg taccctagga ggaattttac tgaggttata gcatacccca     3360

tgagcacagt ggggaagaag aatgtgttgt tatgtgctgc tgctaaacag aagcagcagt     3420

tgtaatttgt ttttcagttt aaatgtggtt atagttagat ttttttttaa gcagcaactt     3480

ttcaaaaata aaatgtgata atttctgaac ttttgtttgt gttgttaata gtggtgtgaa     3540

aatattaacg ttcttgagaa aaactgatac cactgttgtg tatcagtttc tatacaatcc     3600

ataatcctcc tgtacagttt ttacatgtag ttatgagtct tactaaaatt tatataatgg     3660

acttgttttc ctttaagttg taaaatgtta aacaccttga aggttatttt ggacttctgt     3720

atgtttaaat gttgtcttac caaaatttgc acgaatggac cattttcatt tactacttaa     3780

tatcaaaatc aggaatttac agtcaactga tagtacatga taggtgcata taggacagtt     3840

tagttacctg ctactaaaag atttttagat aagttttaga agataaagga attccatagt     3900

ttcaggaggg acaacatctt ctgcactttt tttttgcaca gaaaagtctg tcattctcta     3960

atggcaaatt tcatatttgt taattcttgg ctcaaaatat attaggtaaa attcttagat     4020

ctgtttttaa agggagtttc ctgaaactat cattaattga cattattacc ccatggattt     4080

tatgggataa taaatgtttt tcatgttctc ttataagata ctatgtatga aattacttca     4140

gagagctata tttattttaa aataaattag ctagggttaa ggttatattc tatttccagc     4200

atagaaggta gataatctaa tggtgtagaa agaatcacta ggttgtcatt taaccagtta     4260

ttttcatatt ttgcttaata gtacatatcc aaaaagaatt ttgtacttcc ccaaatgtaa     4320

tttatttact aaattgagta taacctaaat gtgtgttttc tattttccat ttaaattttg     4380

ctatattaag actaatttaa ttcgttgagt cttggaatct tctcaaggag gaacaaatat     4440

taaaatgaca tgtagaaaca aatttttttt tttttttttt tttttttttt tttgagacag     4500

agtctcgctg tctcccaggc tggagttcag tggtgcaatc tcggctcact gcaagctctg     4560

cctcctgggt tcaagccatt ctcctgcctc agcctcccga gtagctggga ctacaggcac     4620

ccaccaccac tcccggctaa tttttagaaa caaatattta aaatgacata ttctcccaat     4680

acaatctatt tagatctgga gaaggaaaaa tcagatattt atgatatagt tttattttaa     4740

ttttgaatta tttgtgtcac agctcagctt tttggaagac aaactcaaac acctataatt     4800

tcatttatat ttctaattca cttggaacct ttctgcttta tgttacctag aaaatgataa     4860

tttgtttaac ccaaaacttc taaaataaat tgcttaatcc ttgaaatatg ttattggaaa     4920

attttaagca gtgcttaaac accattaaat tattatgaac ttgtaattca gaattgagta     4980

aagaaatatt ttttctagtc cttcatatat tgaaaacttg ccacatgaca ttgtatcgtc     5040

ttcattttcc agaagatgcg ttggtgtgcc ataggtttct aacttccttg aaaatagttt     5100

tttaagtcaa ttgtaaatat acgtattatt gttaaaagta actttaaact gcaacacata     5160

gcttcaaaac aatatagaga ttttgtaata ccttataagt ggagttggct aaaatacctt     5220

atccatataa aacttattct attctttgca tgcttatttt gtgtgttggt tgctagctta     5280

aagtttgatt tgttgttact ctttgtgtgc caaattcact aggcaagcgg atttttcctc     5340

agacttcaaa aaataattct tttaagaaaa aatgtaaaaa tgtttattct aaaaagctgc     5400

attaaaggga caacctataa aaagttttgc tagctcatct ttagaaggaa gaaagaatat     5460

tagcttgggt gatgtttaat ttgggtggcg atagtttctg taggctaaac tttatgagaa     5520

aagtgtacct actctataaa ggtaataaat gtaaaacctc ttgctgttat tgaggaagct     5580

cttcaactac cctaaatttc acaaatgtaa cttataacac tatgaaaaga tttgaccaac     5640

aatttacgtt tgctgtgtgc tttagttttt gtttaagcat attcttttgc ttgaatttct     5700

gtgttcatga gagttagggt gttttatgct tcttgaacta attttataac atatttaata     5760

tattaccagt taagatataa aatcatttgt acatagcgaa ttgtaaagca gctattaaag     5820

taggtgaaat aaagtatata tttgccggtt atccatatct tttagaagtc ctgacagaac     5880

aaccagttta tttgcacata ggtagcttct gtttgaagga aggtaaagtt ataaggaaac     5940

tcaaatacta taagatgtgt caaggtattt ctccagaatt aattgcaaag ctagtgctga     6000

aggattttaa tcagcttcta aaattttctt ctcaataagg catatgtttt gattacttag     6060

ggaagattcc tcatttttat ttgcccttta tgcatttaat ccacatgata ggacattaaa     6120

aattaatata aagaaaaatc gtgctcatac tgtacatctg tttctgtgct tggaactact     6180

tgttaatagt ttttatcgaa gctgtcagca ataagggaca taaaactgct gtattataca     6240

ttgtggaatt gaataaacag cctaattttt tttttctagt atagggtact taagcatttc     6300

cacttttgga agaaaagtgt attagtattt tatattgcat ttcatttaaa aggacagttt     6360

tttttttttt tttgtaaatc cattcattga aatggtttct aaactgtata atgtaatttg     6420

gagcctattt agtaatagaa ttaaatgtcc tatgtagtgc tacaattttt gaattagaaa     6480

gtgatcaaat gtaagaaaaa aatttaaaaa ttcagcccag aaaacaaaat agtgtattaa     6540

attagtttaa tgtaaaagga atttataaga tttttttcct caatatagat acctcacttg     6600

aaaagaaagc acagcatact taaagtagtt ctagtaaaca tgtcctagaa aacagttgct     6660

aaatgtagga catcttttga ggaattagtt tatgagaaat aaaattttac ttgtttttac     6720

tatcctgtta gaagtatttg tttatcctga taattttaag ccaacatagt agtcttaaat     6780

tacttttgaa tttctaatct gtgaaggcag taaatgaaat atctgttctg caactgttga     6840

aacaaataat tggctacatt gaccataatt aaagttaaaa ttttgccaat gatgtacagt     6900

tttatggtta aagttgctgt ggttggttgc attacatgac acagaaaact gtcctctacc     6960

tcacgtgaaa taaatatttt atatggtttt actaaaaata agactcatgt atctggtcac     7020

ctagtttaca aattttgaat tatatttatt gaaacatgac atactgtgct ctgagcttat     7080

acctcaattg tattttgtgc tgttttccat tttcatgcct tgtaaataac ttgtatagat     7140

tgtggatcaa atactaaata aaaactttta atgccaatta aatttgattc aagttaaaa      7199


<210>  171
<211>  684
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens SKI-like proto-oncogene (SKIL), transcript 
       variant 4, polypeptide NCBI Reference Sequence: NM_001248008.1

<400>  171

Met Glu Asn Leu Gln Thr Asn Phe Ser Leu Val Gln Gly Ser Thr Lys 
1               5                   10                  15      


Lys Leu Asn Gly Met Gly Asp Asp Gly Ser Pro Pro Ala Lys Lys Met 
            20                  25                  30          


Ile Thr Asp Ile His Ala Asn Gly Lys Thr Ile Asn Lys Val Pro Thr 
        35                  40                  45              


Val Lys Lys Glu His Leu Asp Asp Tyr Gly Glu Ala Pro Val Glu Thr 
    50                  55                  60                  


Asp Gly Glu His Val Lys Arg Thr Cys Thr Ser Val Pro Glu Thr Leu 
65                  70                  75                  80  


His Leu Asn Pro Ser Leu Lys His Thr Leu Ala Gln Phe His Leu Ser 
                85                  90                  95      


Ser Gln Ser Ser Leu Gly Gly Pro Ala Ala Phe Ser Ala Arg His Ser 
            100                 105                 110         


Gln Glu Ser Met Ser Pro Thr Val Phe Leu Pro Leu Pro Ser Pro Gln 
        115                 120                 125             


Val Leu Pro Gly Pro Leu Leu Ile Pro Ser Asp Ser Ser Thr Glu Leu 
    130                 135                 140                 


Thr Gln Thr Val Leu Glu Gly Glu Ser Ile Ser Cys Phe Gln Val Gly 
145                 150                 155                 160 


Gly Glu Lys Arg Leu Cys Leu Pro Gln Val Leu Asn Ser Val Leu Arg 
                165                 170                 175     


Glu Phe Thr Leu Gln Gln Ile Asn Thr Val Cys Asp Glu Leu Tyr Ile 
            180                 185                 190         


Tyr Cys Ser Arg Cys Thr Ser Asp Gln Leu His Ile Leu Lys Val Leu 
        195                 200                 205             


Gly Ile Leu Pro Phe Asn Ala Pro Ser Cys Gly Leu Ile Thr Leu Thr 
    210                 215                 220                 


Asp Ala Gln Arg Leu Cys Asn Ala Leu Leu Arg Pro Arg Thr Phe Pro 
225                 230                 235                 240 


Gln Asn Gly Ser Val Leu Pro Ala Lys Ser Ser Leu Ala Gln Leu Lys 
                245                 250                 255     


Glu Thr Gly Ser Ala Phe Glu Val Glu His Glu Cys Leu Gly Lys Cys 
            260                 265                 270         


Gln Gly Leu Phe Ala Pro Gln Phe Tyr Val Gln Pro Asp Ala Pro Cys 
        275                 280                 285             


Ile Gln Cys Leu Glu Cys Cys Gly Met Phe Ala Pro Gln Thr Phe Val 
    290                 295                 300                 


Met His Ser His Arg Ser Pro Asp Lys Arg Thr Cys His Trp Gly Phe 
305                 310                 315                 320 


Glu Ser Ala Lys Trp His Cys Tyr Leu His Val Asn Gln Lys Tyr Leu 
                325                 330                 335     


Gly Thr Pro Glu Glu Lys Lys Leu Lys Ile Ile Leu Glu Glu Met Lys 
            340                 345                 350         


Glu Lys Phe Ser Met Arg Ser Gly Lys Arg Asn Gln Ser Lys Thr Asp 
        355                 360                 365             


Ala Pro Ser Gly Met Glu Leu Gln Ser Trp Tyr Pro Val Ile Lys Gln 
    370                 375                 380                 


Glu Gly Asp His Val Ser Gln Thr His Ser Phe Leu His Pro Ser Tyr 
385                 390                 395                 400 


Tyr Leu Tyr Met Cys Asp Lys Val Val Ala Pro Asn Val Ser Leu Thr 
                405                 410                 415     


Ser Ala Val Ser Gln Ser Lys Glu Leu Thr Lys Thr Glu Ala Ser Lys 
            420                 425                 430         


Ser Ile Ser Arg Gln Ser Glu Lys Ala His Ser Ser Gly Lys Leu Gln 
        435                 440                 445             


Lys Thr Val Ser Tyr Pro Asp Val Ser Leu Glu Glu Gln Glu Lys Met 
    450                 455                 460                 


Asp Leu Lys Thr Ser Arg Glu Leu Cys Ser Arg Leu Asp Ala Ser Ile 
465                 470                 475                 480 


Ser Asn Asn Ser Thr Ser Lys Arg Lys Ser Glu Ser Ala Thr Cys Asn 
                485                 490                 495     


Leu Val Arg Asp Ile Asn Lys Val Gly Ile Gly Leu Val Ala Ala Ala 
            500                 505                 510         


Ser Ser Pro Leu Leu Val Lys Asp Val Ile Cys Glu Asp Asp Lys Gly 
        515                 520                 525             


Lys Ile Met Glu Glu Val Met Arg Thr Tyr Leu Lys Gln Gln Glu Lys 
    530                 535                 540                 


Leu Asn Leu Ile Leu Gln Lys Lys Gln Gln Leu Gln Met Glu Val Lys 
545                 550                 555                 560 


Met Leu Ser Ser Ser Lys Ser Met Lys Glu Leu Thr Glu Glu Gln Gln 
                565                 570                 575     


Asn Leu Gln Lys Glu Leu Glu Ser Leu Gln Asn Glu His Ala Gln Arg 
            580                 585                 590         


Met Glu Glu Phe Tyr Val Glu Gln Lys Asp Leu Glu Lys Lys Leu Glu 
        595                 600                 605             


Gln Ile Met Lys Gln Lys Cys Thr Cys Asp Ser Asn Leu Glu Lys Asp 
    610                 615                 620                 


Lys Glu Ala Glu Tyr Ala Gly Gln Leu Ala Glu Leu Arg Gln Arg Leu 
625                 630                 635                 640 


Asp His Ala Glu Ala Asp Arg Gln Glu Leu Gln Asp Glu Leu Arg Gln 
                645                 650                 655     


Glu Arg Glu Ala Arg Gln Lys Leu Glu Met Met Ile Lys Glu Leu Lys 
            660                 665                 670         


Leu Gln Ile Leu Lys Ser Ser Lys Thr Ala Lys Glu 
        675                 680                 


<210>  172
<211>  5966
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human TGFβ2 mRNA transcript variant 1 GenBank Accession No.: 
       NM_001135599.2  GI:305682568

<400>  172
gtgatgttat ctgctggcag cagaaggttc gctccgagcg gagctccaga agctcctgac       60

aagagaaaga cagattgaga tagagataga aagagaaaga gagaaagaga cagcagagcg      120

agagcgcaag tgaaagaggc aggggagggg gatggagaat attagcctga cggtctaggg      180

agtcatccag gaacaaactg aggggctgcc cggctgcaga caggaggaga cagagaggat      240

ctattttagg gtggcaagtg cctacctacc ctaagcgagc aattccacgt tggggagaag      300

ccagcagagg ttgggaaagg gtgggagtcc aagggagccc ctgcgcaacc ccctcaggaa      360

taaaactccc cagccagggt gtcgcaaggg ctgccgttgt gatccgcagg gggtgaacgc      420

aaccgcgacg gctgatcgtc tgtggctggg ttggcgtttg gagcaagaga aggaggagca      480

ggagaaggag ggagctggag gctggaagcg tttgcaagcg gcggcggcag caacgtggag      540

taaccaagcg ggtcagcgcg cgcccgccag ggtgtaggcc acggagcgca gctcccagag      600

caggatccgc gccgcctcag cagcctctgc ggcccctgcg gcacccgacc gagtaccgag      660

cgccctgcga agcgcaccct cctccccgcg gtgcgctggg ctcgccccca gcgcgcgcac      720

acgcacacac acacacacac acacacacgc acgcacacac gtgtgcgctt ctctgctccg      780

gagctgctgc tgctcctgct ctcagcgccg cagtggaagg caggaccgaa ccgctccttc      840

tttaaatata taaatttcag cccaggtcag cctcggcggc ccccctcacc gcgctcccgg      900

cgcccctccc gtcagttcgc cagctgccag ccccgggacc ttttcatctc ttcccttttg      960

gccggaggag ccgagttcag atccgccact ccgcacccga gactgacaca ctgaactcca     1020

cttcctcctc ttaaatttat ttctacttaa tagccactcg tctctttttt tccccatctc     1080

attgctccaa gaattttttt cttcttactc gccaaagtca gggttccctc tgcccgtccc     1140

gtattaatat ttccactttt ggaactactg gccttttctt tttaaaggaa ttcaagcagg     1200

atacgttttt ctgttgggca ttgactagat tgtttgcaaa agtttcgcat caaaaacaac     1260

aacaacaaaa aaccaaacaa ctctccttga tctatacttt gagaattgtt gatttctttt     1320

ttttattctg acttttaaaa acaacttttt tttccacttt tttaaaaaat gcactactgt     1380

gtgctgagcg cttttctgat cctgcatctg gtcacggtcg cgctcagcct gtctacctgc     1440

agcacactcg atatggacca gttcatgcgc aagaggatcg aggcgatccg cgggcagatc     1500

ctgagcaagc tgaagctcac cagtccccca gaagactatc ctgagcccga ggaagtcccc     1560

ccggaggtga tttccatcta caacagcacc agggacttgc tccaggagaa ggcgagccgg     1620

agggcggccg cctgcgagcg cgagaggagc gacgaagagt actacgccaa ggaggtttac     1680

aaaatagaca tgccgccctt cttcccctcc gaaactgtct gcccagttgt tacaacaccc     1740

tctggctcag tgggcagctt gtgctccaga cagtcccagg tgctctgtgg gtaccttgat     1800

gccatcccgc ccactttcta cagaccctac ttcagaattg ttcgatttga cgtctcagca     1860

atggagaaga atgcttccaa tttggtgaaa gcagagttca gagtctttcg tttgcagaac     1920

ccaaaagcca gagtgcctga acaacggatt gagctatatc agattctcaa gtccaaagat     1980

ttaacatctc caacccagcg ctacatcgac agcaaagttg tgaaaacaag agcagaaggc     2040

gaatggctct ccttcgatgt aactgatgct gttcatgaat ggcttcacca taaagacagg     2100

aacctgggat ttaaaataag cttacactgt ccctgctgca cttttgtacc atctaataat     2160

tacatcatcc caaataaaag tgaagaacta gaagcaagat ttgcaggtat tgatggcacc     2220

tccacatata ccagtggtga tcagaaaact ataaagtcca ctaggaaaaa aaacagtggg     2280

aagaccccac atctcctgct aatgttattg ccctcctaca gacttgagtc acaacagacc     2340

aaccggcgga agaagcgtgc tttggatgcg gcctattgct ttagaaatgt gcaggataat     2400

tgctgcctac gtccacttta cattgatttc aagagggatc tagggtggaa atggatacac     2460

gaacccaaag ggtacaatgc caacttctgt gctggagcat gcccgtattt atggagttca     2520

gacactcagc acagcagggt cctgagctta tataatacca taaatccaga agcatctgct     2580

tctccttgct gcgtgtccca agatttagaa cctctaacca ttctctacta cattggcaaa     2640

acacccaaga ttgaacagct ttctaatatg attgtaaagt cttgcaaatg cagctaaaat     2700

tcttggaaaa gtggcaagac caaaatgaca atgatgatga taatgatgat gacgacgaca     2760

acgatgatgc ttgtaacaag aaaacataag agagccttgg ttcatcagtg ttaaaaaatt     2820

tttgaaaagg cggtactagt tcagacactt tggaagtttg tgttctgttt gttaaaactg     2880

gcatctgaca caaaaaaagt tgaaggcctt attctacatt tcacctactt tgtaagtgag     2940

agagacaaga agcaaatttt ttttaaagaa aaaaataaac actggaagaa tttattagtg     3000

ttaattatgt gaacaacgac aacaacaaca acaacaacaa acaggaaaat cccattaagt     3060

ggagttgctg tacgtaccgt tcctatcccg cgcctcactt gatttttctg tattgctatg     3120

caataggcac ccttcccatt cttactctta gagttaacag tgagttattt attgtgtgtt     3180

actatataat gaacgtttca ttgcccttgg aaaataaaac aggtgtataa agtggagacc     3240

aaatactttg ccagaaactc atggatggct taaggaactt gaactcaaac gagccagaaa     3300

aaaagaggtc atattaatgg gatgaaaacc caagtgagtt attatatgac cgagaaagtc     3360

tgcattaaga taaagaccct gaaaacacat gttatgtatc agctgcctaa ggaagcttct     3420

tgtaaggtcc aaaaactaaa aagactgtta ataaaagaaa ctttcagtca gaataagtct     3480

gtaagttttt ttttttcttt ttaattgtaa atggttcttt gtcagtttag taaaccagtg     3540

aaatgttgaa atgttttgac atgtactggt caaacttcag accttaaaat attgctgtat     3600

agctatgcta taggtttttt cctttgtttt ggtatatgta accataccta tattattaaa     3660

atagatggat atagaagcca gcataattga aaacacatct gcagatctct tttgcaaact     3720

attaaatcaa aacattaact actttatgtg taatgtgtaa atttttacca tattttttat     3780

attctgtaat aatgtcaact atgatttaga ttgacttaaa tttgggctct ttttaatgat     3840

cactcacaaa tgtatgtttc ttttagctgg ccagtacttt tgagtaaagc ccctatagtt     3900

tgacttgcac tacaaatgca tttttttttt aataacattt gccctacttg tgctttgtgt     3960

ttctttcatt attatgacat aagctacctg ggtccacttg tcttttcttt tttttgtttc     4020

acagaaaaga tgggttcgag ttcagtggtc ttcatcttcc aagcatcatt actaaccaag     4080

tcagacgtta acaaattttt atgttaggaa aaggaggaat gttatagata catagaaaat     4140

tgaagtaaaa tgttttcatt ttagcaagga tttagggttc taactaaaac tcagaatctt     4200

tattgagtta agaaaagttt ctctaccttg gtttaatcaa tatttttgta aaatcctatt     4260

gttattacaa agaggacact tcataggaaa catctttttc tttagtcagg tttttaatat     4320

tcagggggaa attgaaagat atatatttta gtcgattttt caaaagggga aaaaagtcca     4380

ggtcagcata agtcattttg tgtatttcac tgaagttata aggtttttat aaatgttctt     4440

tgaaggggaa aaggcacaag ccaatttttc ctatgatcaa aaaattcttt ctttcctctg     4500

agtgagagtt atctatatct gaggctaaag tttaccttgc tttaataaat aatttgccac     4560

atcattgcag aagaggtatc ctcatgctgg ggttaataga atatgtcagt ttatcacttg     4620

tcgcttattt agctttaaaa taaaaattaa taggcaaagc aatggaatat ttgcagtttc     4680

acctaaagag cagcataagg aggcgggaat ccaaagtgaa gttgtttgat atggtctact     4740

tcttttttgg aatttcctga ccattaatta aagaattgga tttgcaagtt tgaaaactgg     4800

aaaagcaaga gatgggatgc cataatagta aacagccctt gtgttggatg taacccaatc     4860

ccagatttga gtgtgtgttg attatttttt tgtcttccac ttttctatta tgtgtaaatc     4920

acttttattt ctgcagacat tttcctctca gataggatga cattttgttt tgtattattt     4980

tgtctttcct catgaatgca ctgataatat tttaaatgct ctattttaag atctcttgaa     5040

tctgtttttt ttttttttaa tttgggggtt ctgtaaggtc tttatttccc ataagtaaat     5100

attgccatgg gaggggggtg gaggtggcaa ggaaggggtg aagtgctagt atgcaagtgg     5160

gcagcaatta tttttgtgtt aatcagcagt acaatttgat cgttggcatg gttaaaaaat     5220

ggaatataag attagctgtt ttgtattttg atgaccaatt acgctgtatt ttaacacgat     5280

gtatgtctgt ttttgtggtg ctctagtggt aaataaatta tttcgatgat atgtggatgt     5340

ctttttccta tcagtaccat catcgagtct agaaaacacc tgtgatgcaa taagactatc     5400

tcaagctgga aaagtcatac cacctttccg attgccctct gtgctttctc ccttaaggac     5460

agtcacttca gaagtcatgc tttaaagcac aagagtcagg ccatatccat caaggataga     5520

agaaatccct gtgccgtctt tttattccct tatttattgc tatttggtaa ttgtttgaga     5580

tttagtttcc atccagcttg actgccgacc agaaaaaatg cagagagatg tttgcaccat     5640

gctttggctt tctggttcta tgttctgcca acgccagggc caaaagaact ggtctagaca     5700

gtatcccctg tagccccata acttggatag ttgctgagcc agccagatat aacaagagcc     5760

acgtgctttc tggggttggt tgtttgggat cagctacttg cctgtcagtt tcactggtac     5820

cactgcacca caaacaaaaa aacccaccct atttcctcca atttttttgg ctgctaccta     5880

caagaccaga ctcctcaaac gagttgccaa tctcttaata aataggatta ataaaaaaag     5940

taattgtgac tcaaaaaaaa aaaaaa                                          5966


<210>  173
<211>  442
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human TGFβ2 polypeptide encoded by mRNA transcript variant 1 
       GenBank Accession No.: NP_001129071.1  GI:208022653

<400>  173

Met His Tyr Cys Val Leu Ser Ala Phe Leu Ile Leu His Leu Val Thr 
1               5                   10                  15      


Val Ala Leu Ser Leu Ser Thr Cys Ser Thr Leu Asp Met Asp Gln Phe 
            20                  25                  30          


Met Arg Lys Arg Ile Glu Ala Ile Arg Gly Gln Ile Leu Ser Lys Leu 
        35                  40                  45              


Lys Leu Thr Ser Pro Pro Glu Asp Tyr Pro Glu Pro Glu Glu Val Pro 
    50                  55                  60                  


Pro Glu Val Ile Ser Ile Tyr Asn Ser Thr Arg Asp Leu Leu Gln Glu 
65                  70                  75                  80  


Lys Ala Ser Arg Arg Ala Ala Ala Cys Glu Arg Glu Arg Ser Asp Glu 
                85                  90                  95      


Glu Tyr Tyr Ala Lys Glu Val Tyr Lys Ile Asp Met Pro Pro Phe Phe 
            100                 105                 110         


Pro Ser Glu Thr Val Cys Pro Val Val Thr Thr Pro Ser Gly Ser Val 
        115                 120                 125             


Gly Ser Leu Cys Ser Arg Gln Ser Gln Val Leu Cys Gly Tyr Leu Asp 
    130                 135                 140                 


Ala Ile Pro Pro Thr Phe Tyr Arg Pro Tyr Phe Arg Ile Val Arg Phe 
145                 150                 155                 160 


Asp Val Ser Ala Met Glu Lys Asn Ala Ser Asn Leu Val Lys Ala Glu 
                165                 170                 175     


Phe Arg Val Phe Arg Leu Gln Asn Pro Lys Ala Arg Val Pro Glu Gln 
            180                 185                 190         


Arg Ile Glu Leu Tyr Gln Ile Leu Lys Ser Lys Asp Leu Thr Ser Pro 
        195                 200                 205             


Thr Gln Arg Tyr Ile Asp Ser Lys Val Val Lys Thr Arg Ala Glu Gly 
    210                 215                 220                 


Glu Trp Leu Ser Phe Asp Val Thr Asp Ala Val His Glu Trp Leu His 
225                 230                 235                 240 


His Lys Asp Arg Asn Leu Gly Phe Lys Ile Ser Leu His Cys Pro Cys 
                245                 250                 255     


Cys Thr Phe Val Pro Ser Asn Asn Tyr Ile Ile Pro Asn Lys Ser Glu 
            260                 265                 270         


Glu Leu Glu Ala Arg Phe Ala Gly Ile Asp Gly Thr Ser Thr Tyr Thr 
        275                 280                 285             


Ser Gly Asp Gln Lys Thr Ile Lys Ser Thr Arg Lys Lys Asn Ser Gly 
    290                 295                 300                 


Lys Thr Pro His Leu Leu Leu Met Leu Leu Pro Ser Tyr Arg Leu Glu 
305                 310                 315                 320 


Ser Gln Gln Thr Asn Arg Arg Lys Lys Arg Ala Leu Asp Ala Ala Tyr 
                325                 330                 335     


Cys Phe Arg Asn Val Gln Asp Asn Cys Cys Leu Arg Pro Leu Tyr Ile 
            340                 345                 350         


Asp Phe Lys Arg Asp Leu Gly Trp Lys Trp Ile His Glu Pro Lys Gly 
        355                 360                 365             


Tyr Asn Ala Asn Phe Cys Ala Gly Ala Cys Pro Tyr Leu Trp Ser Ser 
    370                 375                 380                 


Asp Thr Gln His Ser Arg Val Leu Ser Leu Tyr Asn Thr Ile Asn Pro 
385                 390                 395                 400 


Glu Ala Ser Ala Ser Pro Cys Cys Val Ser Gln Asp Leu Glu Pro Leu 
                405                 410                 415     


Thr Ile Leu Tyr Tyr Ile Gly Lys Thr Pro Lys Ile Glu Gln Leu Ser 
            420                 425                 430         


Asn Met Ile Val Lys Ser Cys Lys Cys Ser 
        435                 440         


<210>  174
<211>  5882
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens transforming growth factor, beta 2 (TGFB2), 
       transcript variant 2, mRNA NCBI Reference Sequence: NM_003238.3

<400>  174
gtgatgttat ctgctggcag cagaaggttc gctccgagcg gagctccaga agctcctgac       60

aagagaaaga cagattgaga tagagataga aagagaaaga gagaaagaga cagcagagcg      120

agagcgcaag tgaaagaggc aggggagggg gatggagaat attagcctga cggtctaggg      180

agtcatccag gaacaaactg aggggctgcc cggctgcaga caggaggaga cagagaggat      240

ctattttagg gtggcaagtg cctacctacc ctaagcgagc aattccacgt tggggagaag      300

ccagcagagg ttgggaaagg gtgggagtcc aagggagccc ctgcgcaacc ccctcaggaa      360

taaaactccc cagccagggt gtcgcaaggg ctgccgttgt gatccgcagg gggtgaacgc      420

aaccgcgacg gctgatcgtc tgtggctggg ttggcgtttg gagcaagaga aggaggagca      480

ggagaaggag ggagctggag gctggaagcg tttgcaagcg gcggcggcag caacgtggag      540

taaccaagcg ggtcagcgcg cgcccgccag ggtgtaggcc acggagcgca gctcccagag      600

caggatccgc gccgcctcag cagcctctgc ggcccctgcg gcacccgacc gagtaccgag      660

cgccctgcga agcgcaccct cctccccgcg gtgcgctggg ctcgccccca gcgcgcgcac      720

acgcacacac acacacacac acacacacgc acgcacacac gtgtgcgctt ctctgctccg      780

gagctgctgc tgctcctgct ctcagcgccg cagtggaagg caggaccgaa ccgctccttc      840

tttaaatata taaatttcag cccaggtcag cctcggcggc ccccctcacc gcgctcccgg      900

cgcccctccc gtcagttcgc cagctgccag ccccgggacc ttttcatctc ttcccttttg      960

gccggaggag ccgagttcag atccgccact ccgcacccga gactgacaca ctgaactcca     1020

cttcctcctc ttaaatttat ttctacttaa tagccactcg tctctttttt tccccatctc     1080

attgctccaa gaattttttt cttcttactc gccaaagtca gggttccctc tgcccgtccc     1140

gtattaatat ttccactttt ggaactactg gccttttctt tttaaaggaa ttcaagcagg     1200

atacgttttt ctgttgggca ttgactagat tgtttgcaaa agtttcgcat caaaaacaac     1260

aacaacaaaa aaccaaacaa ctctccttga tctatacttt gagaattgtt gatttctttt     1320

ttttattctg acttttaaaa acaacttttt tttccacttt tttaaaaaat gcactactgt     1380

gtgctgagcg cttttctgat cctgcatctg gtcacggtcg cgctcagcct gtctacctgc     1440

agcacactcg atatggacca gttcatgcgc aagaggatcg aggcgatccg cgggcagatc     1500

ctgagcaagc tgaagctcac cagtccccca gaagactatc ctgagcccga ggaagtcccc     1560

ccggaggtga tttccatcta caacagcacc agggacttgc tccaggagaa ggcgagccgg     1620

agggcggccg cctgcgagcg cgagaggagc gacgaagagt actacgccaa ggaggtttac     1680

aaaatagaca tgccgccctt cttcccctcc gaaaatgcca tcccgcccac tttctacaga     1740

ccctacttca gaattgttcg atttgacgtc tcagcaatgg agaagaatgc ttccaatttg     1800

gtgaaagcag agttcagagt ctttcgtttg cagaacccaa aagccagagt gcctgaacaa     1860

cggattgagc tatatcagat tctcaagtcc aaagatttaa catctccaac ccagcgctac     1920

atcgacagca aagttgtgaa aacaagagca gaaggcgaat ggctctcctt cgatgtaact     1980

gatgctgttc atgaatggct tcaccataaa gacaggaacc tgggatttaa aataagctta     2040

cactgtccct gctgcacttt tgtaccatct aataattaca tcatcccaaa taaaagtgaa     2100

gaactagaag caagatttgc aggtattgat ggcacctcca catataccag tggtgatcag     2160

aaaactataa agtccactag gaaaaaaaac agtgggaaga ccccacatct cctgctaatg     2220

ttattgccct cctacagact tgagtcacaa cagaccaacc ggcggaagaa gcgtgctttg     2280

gatgcggcct attgctttag aaatgtgcag gataattgct gcctacgtcc actttacatt     2340

gatttcaaga gggatctagg gtggaaatgg atacacgaac ccaaagggta caatgccaac     2400

ttctgtgctg gagcatgccc gtatttatgg agttcagaca ctcagcacag cagggtcctg     2460

agcttatata ataccataaa tccagaagca tctgcttctc cttgctgcgt gtcccaagat     2520

ttagaacctc taaccattct ctactacatt ggcaaaacac ccaagattga acagctttct     2580

aatatgattg taaagtcttg caaatgcagc taaaattctt ggaaaagtgg caagaccaaa     2640

atgacaatga tgatgataat gatgatgacg acgacaacga tgatgcttgt aacaagaaaa     2700

cataagagag ccttggttca tcagtgttaa aaaatttttg aaaaggcggt actagttcag     2760

acactttgga agtttgtgtt ctgtttgtta aaactggcat ctgacacaaa aaaagttgaa     2820

ggccttattc tacatttcac ctactttgta agtgagagag acaagaagca aatttttttt     2880

aaagaaaaaa ataaacactg gaagaattta ttagtgttaa ttatgtgaac aacgacaaca     2940

acaacaacaa caacaaacag gaaaatccca ttaagtggag ttgctgtacg taccgttcct     3000

atcccgcgcc tcacttgatt tttctgtatt gctatgcaat aggcaccctt cccattctta     3060

ctcttagagt taacagtgag ttatttattg tgtgttacta tataatgaac gtttcattgc     3120

ccttggaaaa taaaacaggt gtataaagtg gagaccaaat actttgccag aaactcatgg     3180

atggcttaag gaacttgaac tcaaacgagc cagaaaaaaa gaggtcatat taatgggatg     3240

aaaacccaag tgagttatta tatgaccgag aaagtctgca ttaagataaa gaccctgaaa     3300

acacatgtta tgtatcagct gcctaaggaa gcttcttgta aggtccaaaa actaaaaaga     3360

ctgttaataa aagaaacttt cagtcagaat aagtctgtaa gttttttttt ttctttttaa     3420

ttgtaaatgg ttctttgtca gtttagtaaa ccagtgaaat gttgaaatgt tttgacatgt     3480

actggtcaaa cttcagacct taaaatattg ctgtatagct atgctatagg ttttttcctt     3540

tgttttggta tatgtaacca tacctatatt attaaaatag atggatatag aagccagcat     3600

aattgaaaac acatctgcag atctcttttg caaactatta aatcaaaaca ttaactactt     3660

tatgtgtaat gtgtaaattt ttaccatatt ttttatattc tgtaataatg tcaactatga     3720

tttagattga cttaaatttg ggctcttttt aatgatcact cacaaatgta tgtttctttt     3780

agctggccag tacttttgag taaagcccct atagtttgac ttgcactaca aatgcatttt     3840

ttttttaata acatttgccc tacttgtgct ttgtgtttct ttcattatta tgacataagc     3900

tacctgggtc cacttgtctt ttcttttttt tgtttcacag aaaagatggg ttcgagttca     3960

gtggtcttca tcttccaagc atcattacta accaagtcag acgttaacaa atttttatgt     4020

taggaaaagg aggaatgtta tagatacata gaaaattgaa gtaaaatgtt ttcattttag     4080

caaggattta gggttctaac taaaactcag aatctttatt gagttaagaa aagtttctct     4140

accttggttt aatcaatatt tttgtaaaat cctattgtta ttacaaagag gacacttcat     4200

aggaaacatc tttttcttta gtcaggtttt taatattcag ggggaaattg aaagatatat     4260

attttagtcg atttttcaaa aggggaaaaa agtccaggtc agcataagtc attttgtgta     4320

tttcactgaa gttataaggt ttttataaat gttctttgaa ggggaaaagg cacaagccaa     4380

tttttcctat gatcaaaaaa ttctttcttt cctctgagtg agagttatct atatctgagg     4440

ctaaagttta ccttgcttta ataaataatt tgccacatca ttgcagaaga ggtatcctca     4500

tgctggggtt aatagaatat gtcagtttat cacttgtcgc ttatttagct ttaaaataaa     4560

aattaatagg caaagcaatg gaatatttgc agtttcacct aaagagcagc ataaggaggc     4620

gggaatccaa agtgaagttg tttgatatgg tctacttctt ttttggaatt tcctgaccat     4680

taattaaaga attggatttg caagtttgaa aactggaaaa gcaagagatg ggatgccata     4740

atagtaaaca gcccttgtgt tggatgtaac ccaatcccag atttgagtgt gtgttgatta     4800

tttttttgtc ttccactttt ctattatgtg taaatcactt ttatttctgc agacattttc     4860

ctctcagata ggatgacatt ttgttttgta ttattttgtc tttcctcatg aatgcactga     4920

taatatttta aatgctctat tttaagatct cttgaatctg tttttttttt ttttaatttg     4980

ggggttctgt aaggtcttta tttcccataa gtaaatattg ccatgggagg ggggtggagg     5040

tggcaaggaa ggggtgaagt gctagtatgc aagtgggcag caattatttt tgtgttaatc     5100

agcagtacaa tttgatcgtt ggcatggtta aaaaatggaa tataagatta gctgttttgt     5160

attttgatga ccaattacgc tgtattttaa cacgatgtat gtctgttttt gtggtgctct     5220

agtggtaaat aaattatttc gatgatatgt ggatgtcttt ttcctatcag taccatcatc     5280

gagtctagaa aacacctgtg atgcaataag actatctcaa gctggaaaag tcataccacc     5340

tttccgattg ccctctgtgc tttctccctt aaggacagtc acttcagaag tcatgcttta     5400

aagcacaaga gtcaggccat atccatcaag gatagaagaa atccctgtgc cgtcttttta     5460

ttcccttatt tattgctatt tggtaattgt ttgagattta gtttccatcc agcttgactg     5520

ccgaccagaa aaaatgcaga gagatgtttg caccatgctt tggctttctg gttctatgtt     5580

ctgccaacgc cagggccaaa agaactggtc tagacagtat cccctgtagc cccataactt     5640

ggatagttgc tgagccagcc agatataaca agagccacgt gctttctggg gttggttgtt     5700

tgggatcagc tacttgcctg tcagtttcac tggtaccact gcaccacaaa caaaaaaacc     5760

caccctattt cctccaattt ttttggctgc tacctacaag accagactcc tcaaacgagt     5820

tgccaatctc ttaataaata ggattaataa aaaaagtaat tgtgactcaa aaaaaaaaaa     5880

aa                                                                    5882


<210>  175
<211>  414
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens transforming growth factor, beta 2 (TGFB2), 
       transcript variant 2, polypeptide NCBI Reference Sequence: 
       NM_003238.3

<400>  175

Met His Tyr Cys Val Leu Ser Ala Phe Leu Ile Leu His Leu Val Thr 
1               5                   10                  15      


Val Ala Leu Ser Leu Ser Thr Cys Ser Thr Leu Asp Met Asp Gln Phe 
            20                  25                  30          


Met Arg Lys Arg Ile Glu Ala Ile Arg Gly Gln Ile Leu Ser Lys Leu 
        35                  40                  45              


Lys Leu Thr Ser Pro Pro Glu Asp Tyr Pro Glu Pro Glu Glu Val Pro 
    50                  55                  60                  


Pro Glu Val Ile Ser Ile Tyr Asn Ser Thr Arg Asp Leu Leu Gln Glu 
65                  70                  75                  80  


Lys Ala Ser Arg Arg Ala Ala Ala Cys Glu Arg Glu Arg Ser Asp Glu 
                85                  90                  95      


Glu Tyr Tyr Ala Lys Glu Val Tyr Lys Ile Asp Met Pro Pro Phe Phe 
            100                 105                 110         


Pro Ser Glu Asn Ala Ile Pro Pro Thr Phe Tyr Arg Pro Tyr Phe Arg 
        115                 120                 125             


Ile Val Arg Phe Asp Val Ser Ala Met Glu Lys Asn Ala Ser Asn Leu 
    130                 135                 140                 


Val Lys Ala Glu Phe Arg Val Phe Arg Leu Gln Asn Pro Lys Ala Arg 
145                 150                 155                 160 


Val Pro Glu Gln Arg Ile Glu Leu Tyr Gln Ile Leu Lys Ser Lys Asp 
                165                 170                 175     


Leu Thr Ser Pro Thr Gln Arg Tyr Ile Asp Ser Lys Val Val Lys Thr 
            180                 185                 190         


Arg Ala Glu Gly Glu Trp Leu Ser Phe Asp Val Thr Asp Ala Val His 
        195                 200                 205             


Glu Trp Leu His His Lys Asp Arg Asn Leu Gly Phe Lys Ile Ser Leu 
    210                 215                 220                 


His Cys Pro Cys Cys Thr Phe Val Pro Ser Asn Asn Tyr Ile Ile Pro 
225                 230                 235                 240 


Asn Lys Ser Glu Glu Leu Glu Ala Arg Phe Ala Gly Ile Asp Gly Thr 
                245                 250                 255     


Ser Thr Tyr Thr Ser Gly Asp Gln Lys Thr Ile Lys Ser Thr Arg Lys 
            260                 265                 270         


Lys Asn Ser Gly Lys Thr Pro His Leu Leu Leu Met Leu Leu Pro Ser 
        275                 280                 285             


Tyr Arg Leu Glu Ser Gln Gln Thr Asn Arg Arg Lys Lys Arg Ala Leu 
    290                 295                 300                 


Asp Ala Ala Tyr Cys Phe Arg Asn Val Gln Asp Asn Cys Cys Leu Arg 
305                 310                 315                 320 


Pro Leu Tyr Ile Asp Phe Lys Arg Asp Leu Gly Trp Lys Trp Ile His 
                325                 330                 335     


Glu Pro Lys Gly Tyr Asn Ala Asn Phe Cys Ala Gly Ala Cys Pro Tyr 
            340                 345                 350         


Leu Trp Ser Ser Asp Thr Gln His Ser Arg Val Leu Ser Leu Tyr Asn 
        355                 360                 365             


Thr Ile Asn Pro Glu Ala Ser Ala Ser Pro Cys Cys Val Ser Gln Asp 
    370                 375                 380                 


Leu Glu Pro Leu Thr Ile Leu Tyr Tyr Ile Gly Lys Thr Pro Lys Ile 
385                 390                 395                 400 


Glu Gln Leu Ser Asn Met Ile Val Lys Ser Cys Lys Cys Ser 
                405                 410                 


<210>  176
<211>  2591
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human p53 mRNA transcript variant 1 GenBank Accession No.: 
       NM_000546.5  GI:371502114

<400>  176
gatgggattg gggttttccc ctcccatgtg ctcaagactg gcgctaaaag ttttgagctt       60

ctcaaaagtc tagagccacc gtccagggag caggtagctg ctgggctccg gggacacttt      120

gcgttcgggc tgggagcgtg ctttccacga cggtgacacg cttccctgga ttggcagcca      180

gactgccttc cgggtcactg ccatggagga gccgcagtca gatcctagcg tcgagccccc      240

tctgagtcag gaaacatttt cagacctatg gaaactactt cctgaaaaca acgttctgtc      300

ccccttgccg tcccaagcaa tggatgattt gatgctgtcc ccggacgata ttgaacaatg      360

gttcactgaa gacccaggtc cagatgaagc tcccagaatg ccagaggctg ctccccccgt      420

ggcccctgca ccagcagctc ctacaccggc ggcccctgca ccagccccct cctggcccct      480

gtcatcttct gtcccttccc agaaaaccta ccagggcagc tacggtttcc gtctgggctt      540

cttgcattct gggacagcca agtctgtgac ttgcacgtac tcccctgccc tcaacaagat      600

gttttgccaa ctggccaaga cctgccctgt gcagctgtgg gttgattcca cacccccgcc      660

cggcacccgc gtccgcgcca tggccatcta caagcagtca cagcacatga cggaggttgt      720

gaggcgctgc ccccaccatg agcgctgctc agatagcgat ggtctggccc ctcctcagca      780

tcttatccga gtggaaggaa atttgcgtgt ggagtatttg gatgacagaa acacttttcg      840

acatagtgtg gtggtgccct atgagccgcc tgaggttggc tctgactgta ccaccatcca      900

ctacaactac atgtgtaaca gttcctgcat gggcggcatg aaccggaggc ccatcctcac      960

catcatcaca ctggaagact ccagtggtaa tctactggga cggaacagct ttgaggtgcg     1020

tgtttgtgcc tgtcctggga gagaccggcg cacagaggaa gagaatctcc gcaagaaagg     1080

ggagcctcac cacgagctgc ccccagggag cactaagcga gcactgccca acaacaccag     1140

ctcctctccc cagccaaaga agaaaccact ggatggagaa tatttcaccc ttcagatccg     1200

tgggcgtgag cgcttcgaga tgttccgaga gctgaatgag gccttggaac tcaaggatgc     1260

ccaggctggg aaggagccag gggggagcag ggctcactcc agccacctga agtccaaaaa     1320

gggtcagtct acctcccgcc ataaaaaact catgttcaag acagaagggc ctgactcaga     1380

ctgacattct ccacttcttg ttccccactg acagcctccc acccccatct ctccctcccc     1440

tgccattttg ggttttgggt ctttgaaccc ttgcttgcaa taggtgtgcg tcagaagcac     1500

ccaggacttc catttgcttt gtcccggggc tccactgaac aagttggcct gcactggtgt     1560

tttgttgtgg ggaggaggat ggggagtagg acataccagc ttagatttta aggtttttac     1620

tgtgagggat gtttgggaga tgtaagaaat gttcttgcag ttaagggtta gtttacaatc     1680

agccacattc taggtagggg cccacttcac cgtactaacc agggaagctg tccctcactg     1740

ttgaattttc tctaacttca aggcccatat ctgtgaaatg ctggcatttg cacctacctc     1800

acagagtgca ttgtgagggt taatgaaata atgtacatct ggccttgaaa ccacctttta     1860

ttacatgggg tctagaactt gacccccttg agggtgcttg ttccctctcc ctgttggtcg     1920

gtgggttggt agtttctaca gttgggcagc tggttaggta gagggagttg tcaagtctct     1980

gctggcccag ccaaaccctg tctgacaacc tcttggtgaa ccttagtacc taaaaggaaa     2040

tctcacccca tcccacaccc tggaggattt catctcttgt atatgatgat ctggatccac     2100

caagacttgt tttatgctca gggtcaattt cttttttctt tttttttttt ttttttcttt     2160

ttctttgaga ctgggtctcg ctttgttgcc caggctggag tggagtggcg tgatcttggc     2220

ttactgcagc ctttgcctcc ccggctcgag cagtcctgcc tcagcctccg gagtagctgg     2280

gaccacaggt tcatgccacc atggccagcc aacttttgca tgttttgtag agatggggtc     2340

tcacagtgtt gcccaggctg gtctcaaact cctgggctca ggcgatccac ctgtctcagc     2400

ctcccagagt gctgggatta caattgtgag ccaccacgtc cagctggaag ggtcaacatc     2460

ttttacattc tgcaagcaca tctgcatttt caccccaccc ttcccctcct tctccctttt     2520

tatatcccat ttttatatcg atctcttatt ttacaataaa actttgctgc cacctgtgtg     2580

tctgaggggt g                                                          2591


<210>  177
<211>  393
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human p53 isoform a polypeptide encoded by mRNA transcript 
       variant 1 GenBank Accession No.: NP_000537.3  GI:120407068

<400>  177

Met Glu Glu Pro Gln Ser Asp Pro Ser Val Glu Pro Pro Leu Ser Gln 
1               5                   10                  15      


Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn Val Leu 
            20                  25                  30          


Ser Pro Leu Pro Ser Gln Ala Met Asp Asp Leu Met Leu Ser Pro Asp 
        35                  40                  45              


Asp Ile Glu Gln Trp Phe Thr Glu Asp Pro Gly Pro Asp Glu Ala Pro 
    50                  55                  60                  


Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro Ala Pro Ala Ala Pro 
65                  70                  75                  80  


Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser Trp Pro Leu Ser Ser Ser 
                85                  90                  95      


Val Pro Ser Gln Lys Thr Tyr Gln Gly Ser Tyr Gly Phe Arg Leu Gly 
            100                 105                 110         


Phe Leu His Ser Gly Thr Ala Lys Ser Val Thr Cys Thr Tyr Ser Pro 
        115                 120                 125             


Ala Leu Asn Lys Met Phe Cys Gln Leu Ala Lys Thr Cys Pro Val Gln 
    130                 135                 140                 


Leu Trp Val Asp Ser Thr Pro Pro Pro Gly Thr Arg Val Arg Ala Met 
145                 150                 155                 160 


Ala Ile Tyr Lys Gln Ser Gln His Met Thr Glu Val Val Arg Arg Cys 
                165                 170                 175     


Pro His His Glu Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gln 
            180                 185                 190         


His Leu Ile Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Asp 
        195                 200                 205             


Arg Asn Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu 
    210                 215                 220                 


Val Gly Ser Asp Cys Thr Thr Ile His Tyr Asn Tyr Met Cys Asn Ser 
225                 230                 235                 240 


Ser Cys Met Gly Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile Thr 
                245                 250                 255     


Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe Glu Val 
            260                 265                 270         


Arg Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Thr Glu Glu Glu Asn 
        275                 280                 285             


Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro Pro Gly Ser Thr 
    290                 295                 300                 


Lys Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser Pro Gln Pro Lys Lys 
305                 310                 315                 320 


Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu Gln Ile Arg Gly Arg Glu 
                325                 330                 335     


Arg Phe Glu Met Phe Arg Glu Leu Asn Glu Ala Leu Glu Leu Lys Asp 
            340                 345                 350         


Ala Gln Ala Gly Lys Glu Pro Gly Gly Ser Arg Ala His Ser Ser His 
        355                 360                 365             


Leu Lys Ser Lys Lys Gly Gln Ser Thr Ser Arg His Lys Lys Leu Met 
    370                 375                 380                 


Phe Lys Thr Glu Gly Pro Asp Ser Asp 
385                 390             


<210>  178
<211>  2588
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 2, 
       mRNA NCBI Reference Sequence: NM_001126112.2

<400>  178
gatgggattg gggttttccc ctcccatgtg ctcaagactg gcgctaaaag ttttgagctt       60

ctcaaaagtc tagagccacc gtccagggag caggtagctg ctgggctccg gggacacttt      120

gcgttcgggc tgggagcgtg ctttccacga cggtgacacg cttccctgga ttggccagac      180

tgccttccgg gtcactgcca tggaggagcc gcagtcagat cctagcgtcg agccccctct      240

gagtcaggaa acattttcag acctatggaa actacttcct gaaaacaacg ttctgtcccc      300

cttgccgtcc caagcaatgg atgatttgat gctgtccccg gacgatattg aacaatggtt      360

cactgaagac ccaggtccag atgaagctcc cagaatgcca gaggctgctc cccccgtggc      420

ccctgcacca gcagctccta caccggcggc ccctgcacca gccccctcct ggcccctgtc      480

atcttctgtc ccttcccaga aaacctacca gggcagctac ggtttccgtc tgggcttctt      540

gcattctggg acagccaagt ctgtgacttg cacgtactcc cctgccctca acaagatgtt      600

ttgccaactg gccaagacct gccctgtgca gctgtgggtt gattccacac ccccgcccgg      660

cacccgcgtc cgcgccatgg ccatctacaa gcagtcacag cacatgacgg aggttgtgag      720

gcgctgcccc caccatgagc gctgctcaga tagcgatggt ctggcccctc ctcagcatct      780

tatccgagtg gaaggaaatt tgcgtgtgga gtatttggat gacagaaaca cttttcgaca      840

tagtgtggtg gtgccctatg agccgcctga ggttggctct gactgtacca ccatccacta      900

caactacatg tgtaacagtt cctgcatggg cggcatgaac cggaggccca tcctcaccat      960

catcacactg gaagactcca gtggtaatct actgggacgg aacagctttg aggtgcgtgt     1020

ttgtgcctgt cctgggagag accggcgcac agaggaagag aatctccgca agaaagggga     1080

gcctcaccac gagctgcccc cagggagcac taagcgagca ctgcccaaca acaccagctc     1140

ctctccccag ccaaagaaga aaccactgga tggagaatat ttcacccttc agatccgtgg     1200

gcgtgagcgc ttcgagatgt tccgagagct gaatgaggcc ttggaactca aggatgccca     1260

ggctgggaag gagccagggg ggagcagggc tcactccagc cacctgaagt ccaaaaaggg     1320

tcagtctacc tcccgccata aaaaactcat gttcaagaca gaagggcctg actcagactg     1380

acattctcca cttcttgttc cccactgaca gcctcccacc cccatctctc cctcccctgc     1440

cattttgggt tttgggtctt tgaacccttg cttgcaatag gtgtgcgtca gaagcaccca     1500

ggacttccat ttgctttgtc ccggggctcc actgaacaag ttggcctgca ctggtgtttt     1560

gttgtgggga ggaggatggg gagtaggaca taccagctta gattttaagg tttttactgt     1620

gagggatgtt tgggagatgt aagaaatgtt cttgcagtta agggttagtt tacaatcagc     1680

cacattctag gtaggggccc acttcaccgt actaaccagg gaagctgtcc ctcactgttg     1740

aattttctct aacttcaagg cccatatctg tgaaatgctg gcatttgcac ctacctcaca     1800

gagtgcattg tgagggttaa tgaaataatg tacatctggc cttgaaacca ccttttatta     1860

catggggtct agaacttgac ccccttgagg gtgcttgttc cctctccctg ttggtcggtg     1920

ggttggtagt ttctacagtt gggcagctgg ttaggtagag ggagttgtca agtctctgct     1980

ggcccagcca aaccctgtct gacaacctct tggtgaacct tagtacctaa aaggaaatct     2040

caccccatcc cacaccctgg aggatttcat ctcttgtata tgatgatctg gatccaccaa     2100

gacttgtttt atgctcaggg tcaatttctt ttttcttttt tttttttttt tttctttttc     2160

tttgagactg ggtctcgctt tgttgcccag gctggagtgg agtggcgtga tcttggctta     2220

ctgcagcctt tgcctccccg gctcgagcag tcctgcctca gcctccggag tagctgggac     2280

cacaggttca tgccaccatg gccagccaac ttttgcatgt tttgtagaga tggggtctca     2340

cagtgttgcc caggctggtc tcaaactcct gggctcaggc gatccacctg tctcagcctc     2400

ccagagtgct gggattacaa ttgtgagcca ccacgtccag ctggaagggt caacatcttt     2460

tacattctgc aagcacatct gcattttcac cccacccttc ccctccttct ccctttttat     2520

atcccatttt tatatcgatc tcttatttta caataaaact ttgctgccac ctgtgtgtct     2580

gaggggtg                                                              2588


<210>  179
<211>  393
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 2, 
       polypeptide NCBI Reference Sequence: NM_001126112.2

<400>  179

Met Glu Glu Pro Gln Ser Asp Pro Ser Val Glu Pro Pro Leu Ser Gln 
1               5                   10                  15      


Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn Val Leu 
            20                  25                  30          


Ser Pro Leu Pro Ser Gln Ala Met Asp Asp Leu Met Leu Ser Pro Asp 
        35                  40                  45              


Asp Ile Glu Gln Trp Phe Thr Glu Asp Pro Gly Pro Asp Glu Ala Pro 
    50                  55                  60                  


Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro Ala Pro Ala Ala Pro 
65                  70                  75                  80  


Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser Trp Pro Leu Ser Ser Ser 
                85                  90                  95      


Val Pro Ser Gln Lys Thr Tyr Gln Gly Ser Tyr Gly Phe Arg Leu Gly 
            100                 105                 110         


Phe Leu His Ser Gly Thr Ala Lys Ser Val Thr Cys Thr Tyr Ser Pro 
        115                 120                 125             


Ala Leu Asn Lys Met Phe Cys Gln Leu Ala Lys Thr Cys Pro Val Gln 
    130                 135                 140                 


Leu Trp Val Asp Ser Thr Pro Pro Pro Gly Thr Arg Val Arg Ala Met 
145                 150                 155                 160 


Ala Ile Tyr Lys Gln Ser Gln His Met Thr Glu Val Val Arg Arg Cys 
                165                 170                 175     


Pro His His Glu Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gln 
            180                 185                 190         


His Leu Ile Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Asp 
        195                 200                 205             


Arg Asn Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu 
    210                 215                 220                 


Val Gly Ser Asp Cys Thr Thr Ile His Tyr Asn Tyr Met Cys Asn Ser 
225                 230                 235                 240 


Ser Cys Met Gly Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile Thr 
                245                 250                 255     


Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe Glu Val 
            260                 265                 270         


Arg Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Thr Glu Glu Glu Asn 
        275                 280                 285             


Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro Pro Gly Ser Thr 
    290                 295                 300                 


Lys Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser Pro Gln Pro Lys Lys 
305                 310                 315                 320 


Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu Gln Ile Arg Gly Arg Glu 
                325                 330                 335     


Arg Phe Glu Met Phe Arg Glu Leu Asn Glu Ala Leu Glu Leu Lys Asp 
            340                 345                 350         


Ala Gln Ala Gly Lys Glu Pro Gly Gly Ser Arg Ala His Ser Ser His 
        355                 360                 365             


Leu Lys Ser Lys Lys Gly Gln Ser Thr Ser Arg His Lys Lys Leu Met 
    370                 375                 380                 


Phe Lys Thr Glu Gly Pro Asp Ser Asp 
385                 390             


<210>  180
<211>  2588
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 1, 
       mRNA NCBI Reference Sequence: NM_001276760.1

<400>  180
gatgggattg gggttttccc ctcccatgtg ctcaagactg gcgctaaaag ttttgagctt       60

ctcaaaagtc tagagccacc gtccagggag caggtagctg ctgggctccg gggacacttt      120

gcgttcgggc tgggagcgtg ctttccacga cggtgacacg cttccctgga ttggccagac      180

tgccttccgg gtcactgcca tggaggagcc gcagtcagat cctagcgtcg agccccctct      240

gagtcaggaa acattttcag acctatggaa actacttcct gaaaacaacg ttctgtcccc      300

cttgccgtcc caagcaatgg atgatttgat gctgtccccg gacgatattg aacaatggtt      360

cactgaagac ccaggtccag atgaagctcc cagaatgcca gaggctgctc cccccgtggc      420

ccctgcacca gcagctccta caccggcggc ccctgcacca gccccctcct ggcccctgtc      480

atcttctgtc ccttcccaga aaacctacca gggcagctac ggtttccgtc tgggcttctt      540

gcattctggg acagccaagt ctgtgacttg cacgtactcc cctgccctca acaagatgtt      600

ttgccaactg gccaagacct gccctgtgca gctgtgggtt gattccacac ccccgcccgg      660

cacccgcgtc cgcgccatgg ccatctacaa gcagtcacag cacatgacgg aggttgtgag      720

gcgctgcccc caccatgagc gctgctcaga tagcgatggt ctggcccctc ctcagcatct      780

tatccgagtg gaaggaaatt tgcgtgtgga gtatttggat gacagaaaca cttttcgaca      840

tagtgtggtg gtgccctatg agccgcctga ggttggctct gactgtacca ccatccacta      900

caactacatg tgtaacagtt cctgcatggg cggcatgaac cggaggccca tcctcaccat      960

catcacactg gaagactcca gtggtaatct actgggacgg aacagctttg aggtgcgtgt     1020

ttgtgcctgt cctgggagag accggcgcac agaggaagag aatctccgca agaaagggga     1080

gcctcaccac gagctgcccc cagggagcac taagcgagca ctgcccaaca acaccagctc     1140

ctctccccag ccaaagaaga aaccactgga tggagaatat ttcacccttc agatccgtgg     1200

gcgtgagcgc ttcgagatgt tccgagagct gaatgaggcc ttggaactca aggatgccca     1260

ggctgggaag gagccagggg ggagcagggc tcactccagc cacctgaagt ccaaaaaggg     1320

tcagtctacc tcccgccata aaaaactcat gttcaagaca gaagggcctg actcagactg     1380

acattctcca cttcttgttc cccactgaca gcctcccacc cccatctctc cctcccctgc     1440

cattttgggt tttgggtctt tgaacccttg cttgcaatag gtgtgcgtca gaagcaccca     1500

ggacttccat ttgctttgtc ccggggctcc actgaacaag ttggcctgca ctggtgtttt     1560

gttgtgggga ggaggatggg gagtaggaca taccagctta gattttaagg tttttactgt     1620

gagggatgtt tgggagatgt aagaaatgtt cttgcagtta agggttagtt tacaatcagc     1680

cacattctag gtaggggccc acttcaccgt actaaccagg gaagctgtcc ctcactgttg     1740

aattttctct aacttcaagg cccatatctg tgaaatgctg gcatttgcac ctacctcaca     1800

gagtgcattg tgagggttaa tgaaataatg tacatctggc cttgaaacca ccttttatta     1860

catggggtct agaacttgac ccccttgagg gtgcttgttc cctctccctg ttggtcggtg     1920

ggttggtagt ttctacagtt gggcagctgg ttaggtagag ggagttgtca agtctctgct     1980

ggcccagcca aaccctgtct gacaacctct tggtgaacct tagtacctaa aaggaaatct     2040

caccccatcc cacaccctgg aggatttcat ctcttgtata tgatgatctg gatccaccaa     2100

gacttgtttt atgctcaggg tcaatttctt ttttcttttt tttttttttt tttctttttc     2160

tttgagactg ggtctcgctt tgttgcccag gctggagtgg agtggcgtga tcttggctta     2220

ctgcagcctt tgcctccccg gctcgagcag tcctgcctca gcctccggag tagctgggac     2280

cacaggttca tgccaccatg gccagccaac ttttgcatgt tttgtagaga tggggtctca     2340

cagtgttgcc caggctggtc tcaaactcct gggctcaggc gatccacctg tctcagcctc     2400

ccagagtgct gggattacaa ttgtgagcca ccacgtccag ctggaagggt caacatcttt     2460

tacattctgc aagcacatct gcattttcac cccacccttc ccctccttct ccctttttat     2520

atcccatttt tatatcgatc tcttatttta caataaaact ttgctgccac ctgtgtgtct     2580

gaggggtg                                                              2588


<210>  181
<211>  354
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 1, 
       polypeptide NCBI Reference Sequence: NM_001276760.1

<400>  181

Met Asp Asp Leu Met Leu Ser Pro Asp Asp Ile Glu Gln Trp Phe Thr 
1               5                   10                  15      


Glu Asp Pro Gly Pro Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro 
            20                  25                  30          


Pro Val Ala Pro Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro 
        35                  40                  45              


Ala Pro Ser Trp Pro Leu Ser Ser Ser Val Pro Ser Gln Lys Thr Tyr 
    50                  55                  60                  


Gln Gly Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser Gly Thr Ala 
65                  70                  75                  80  


Lys Ser Val Thr Cys Thr Tyr Ser Pro Ala Leu Asn Lys Met Phe Cys 
                85                  90                  95      


Gln Leu Ala Lys Thr Cys Pro Val Gln Leu Trp Val Asp Ser Thr Pro 
            100                 105                 110         


Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gln Ser Gln 
        115                 120                 125             


His Met Thr Glu Val Val Arg Arg Cys Pro His His Glu Arg Cys Ser 
    130                 135                 140                 


Asp Ser Asp Gly Leu Ala Pro Pro Gln His Leu Ile Arg Val Glu Gly 
145                 150                 155                 160 


Asn Leu Arg Val Glu Tyr Leu Asp Asp Arg Asn Thr Phe Arg His Ser 
                165                 170                 175     


Val Val Val Pro Tyr Glu Pro Pro Glu Val Gly Ser Asp Cys Thr Thr 
            180                 185                 190         


Ile His Tyr Asn Tyr Met Cys Asn Ser Ser Cys Met Gly Gly Met Asn 
        195                 200                 205             


Arg Arg Pro Ile Leu Thr Ile Ile Thr Leu Glu Asp Ser Ser Gly Asn 
    210                 215                 220                 


Leu Leu Gly Arg Asn Ser Phe Glu Val Arg Val Cys Ala Cys Pro Gly 
225                 230                 235                 240 


Arg Asp Arg Arg Thr Glu Glu Glu Asn Leu Arg Lys Lys Gly Glu Pro 
                245                 250                 255     


His His Glu Leu Pro Pro Gly Ser Thr Lys Arg Ala Leu Pro Asn Asn 
            260                 265                 270         


Thr Ser Ser Ser Pro Gln Pro Lys Lys Lys Pro Leu Asp Gly Glu Tyr 
        275                 280                 285             


Phe Thr Leu Gln Ile Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu 
    290                 295                 300                 


Leu Asn Glu Ala Leu Glu Leu Lys Asp Ala Gln Ala Gly Lys Glu Pro 
305                 310                 315                 320 


Gly Gly Ser Arg Ala His Ser Ser His Leu Lys Ser Lys Lys Gly Gln 
                325                 330                 335     


Ser Thr Ser Arg His Lys Lys Leu Met Phe Lys Thr Glu Gly Pro Asp 
            340                 345                 350         


Ser Asp 
        


<210>  182
<211>  2588
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 2, mRNA

<400>  182
gatgggattg gggttttccc ctcccatgtg ctcaagactg gcgctaaaag ttttgagctt       60

ctcaaaagtc tagagccacc gtccagggag caggtagctg ctgggctccg gggacacttt      120

gcgttcgggc tgggagcgtg ctttccacga cggtgacacg cttccctgga ttggccagac      180

tgccttccgg gtcactgcca tggaggagcc gcagtcagat cctagcgtcg agccccctct      240

gagtcaggaa acattttcag acctatggaa actacttcct gaaaacaacg ttctgtcccc      300

cttgccgtcc caagcaatgg atgatttgat gctgtccccg gacgatattg aacaatggtt      360

cactgaagac ccaggtccag atgaagctcc cagaatgcca gaggctgctc cccccgtggc      420

ccctgcacca gcagctccta caccggcggc ccctgcacca gccccctcct ggcccctgtc      480

atcttctgtc ccttcccaga aaacctacca gggcagctac ggtttccgtc tgggcttctt      540

gcattctggg acagccaagt ctgtgacttg cacgtactcc cctgccctca acaagatgtt      600

ttgccaactg gccaagacct gccctgtgca gctgtgggtt gattccacac ccccgcccgg      660

cacccgcgtc cgcgccatgg ccatctacaa gcagtcacag cacatgacgg aggttgtgag      720

gcgctgcccc caccatgagc gctgctcaga tagcgatggt ctggcccctc ctcagcatct      780

tatccgagtg gaaggaaatt tgcgtgtgga gtatttggat gacagaaaca cttttcgaca      840

tagtgtggtg gtgccctatg agccgcctga ggttggctct gactgtacca ccatccacta      900

caactacatg tgtaacagtt cctgcatggg cggcatgaac cggaggccca tcctcaccat      960

catcacactg gaagactcca gtggtaatct actgggacgg aacagctttg aggtgcgtgt     1020

ttgtgcctgt cctgggagag accggcgcac agaggaagag aatctccgca agaaagggga     1080

gcctcaccac gagctgcccc cagggagcac taagcgagca ctgcccaaca acaccagctc     1140

ctctccccag ccaaagaaga aaccactgga tggagaatat ttcacccttc agatccgtgg     1200

gcgtgagcgc ttcgagatgt tccgagagct gaatgaggcc ttggaactca aggatgccca     1260

ggctgggaag gagccagggg ggagcagggc tcactccagc cacctgaagt ccaaaaaggg     1320

tcagtctacc tcccgccata aaaaactcat gttcaagaca gaagggcctg actcagactg     1380

acattctcca cttcttgttc cccactgaca gcctcccacc cccatctctc cctcccctgc     1440

cattttgggt tttgggtctt tgaacccttg cttgcaatag gtgtgcgtca gaagcaccca     1500

ggacttccat ttgctttgtc ccggggctcc actgaacaag ttggcctgca ctggtgtttt     1560

gttgtgggga ggaggatggg gagtaggaca taccagctta gattttaagg tttttactgt     1620

gagggatgtt tgggagatgt aagaaatgtt cttgcagtta agggttagtt tacaatcagc     1680

cacattctag gtaggggccc acttcaccgt actaaccagg gaagctgtcc ctcactgttg     1740

aattttctct aacttcaagg cccatatctg tgaaatgctg gcatttgcac ctacctcaca     1800

gagtgcattg tgagggttaa tgaaataatg tacatctggc cttgaaacca ccttttatta     1860

catggggtct agaacttgac ccccttgagg gtgcttgttc cctctccctg ttggtcggtg     1920

ggttggtagt ttctacagtt gggcagctgg ttaggtagag ggagttgtca agtctctgct     1980

ggcccagcca aaccctgtct gacaacctct tggtgaacct tagtacctaa aaggaaatct     2040

caccccatcc cacaccctgg aggatttcat ctcttgtata tgatgatctg gatccaccaa     2100

gacttgtttt atgctcaggg tcaatttctt ttttcttttt tttttttttt tttctttttc     2160

tttgagactg ggtctcgctt tgttgcccag gctggagtgg agtggcgtga tcttggctta     2220

ctgcagcctt tgcctccccg gctcgagcag tcctgcctca gcctccggag tagctgggac     2280

cacaggttca tgccaccatg gccagccaac ttttgcatgt tttgtagaga tggggtctca     2340

cagtgttgcc caggctggtc tcaaactcct gggctcaggc gatccacctg tctcagcctc     2400

ccagagtgct gggattacaa ttgtgagcca ccacgtccag ctggaagggt caacatcttt     2460

tacattctgc aagcacatct gcattttcac cccacccttc ccctccttct ccctttttat     2520

atcccatttt tatatcgatc tcttatttta caataaaact ttgctgccac ctgtgtgtct     2580

gaggggtg                                                              2588


<210>  183
<211>  354
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 2, 
       polypeptide

NCBI Reference Sequence: NM_001276761.1

<400>  183

Met Asp Asp Leu Met Leu Ser Pro Asp Asp Ile Glu Gln Trp Phe Thr 
1               5                   10                  15      


Glu Asp Pro Gly Pro Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro 
            20                  25                  30          


Pro Val Ala Pro Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro 
        35                  40                  45              


Ala Pro Ser Trp Pro Leu Ser Ser Ser Val Pro Ser Gln Lys Thr Tyr 
    50                  55                  60                  


Gln Gly Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser Gly Thr Ala 
65                  70                  75                  80  


Lys Ser Val Thr Cys Thr Tyr Ser Pro Ala Leu Asn Lys Met Phe Cys 
                85                  90                  95      


Gln Leu Ala Lys Thr Cys Pro Val Gln Leu Trp Val Asp Ser Thr Pro 
            100                 105                 110         


Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gln Ser Gln 
        115                 120                 125             


His Met Thr Glu Val Val Arg Arg Cys Pro His His Glu Arg Cys Ser 
    130                 135                 140                 


Asp Ser Asp Gly Leu Ala Pro Pro Gln His Leu Ile Arg Val Glu Gly 
145                 150                 155                 160 


Asn Leu Arg Val Glu Tyr Leu Asp Asp Arg Asn Thr Phe Arg His Ser 
                165                 170                 175     


Val Val Val Pro Tyr Glu Pro Pro Glu Val Gly Ser Asp Cys Thr Thr 
            180                 185                 190         


Ile His Tyr Asn Tyr Met Cys Asn Ser Ser Cys Met Gly Gly Met Asn 
        195                 200                 205             


Arg Arg Pro Ile Leu Thr Ile Ile Thr Leu Glu Asp Ser Ser Gly Asn 
    210                 215                 220                 


Leu Leu Gly Arg Asn Ser Phe Glu Val Arg Val Cys Ala Cys Pro Gly 
225                 230                 235                 240 


Arg Asp Arg Arg Thr Glu Glu Glu Asn Leu Arg Lys Lys Gly Glu Pro 
                245                 250                 255     


His His Glu Leu Pro Pro Gly Ser Thr Lys Arg Ala Leu Pro Asn Asn 
            260                 265                 270         


Thr Ser Ser Ser Pro Gln Pro Lys Lys Lys Pro Leu Asp Gly Glu Tyr 
        275                 280                 285             


Phe Thr Leu Gln Ile Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu 
    290                 295                 300                 


Leu Asn Glu Ala Leu Glu Leu Lys Asp Ala Gln Ala Gly Lys Glu Pro 
305                 310                 315                 320 


Gly Gly Ser Arg Ala His Ser Ser His Leu Lys Ser Lys Lys Gly Gln 
                325                 330                 335     


Ser Thr Ser Arg His Lys Lys Leu Met Phe Lys Thr Glu Gly Pro Asp 
            340                 345                 350         


Ser Asp 
        


<210>  184
<211>  2724
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 3, 
       mRNA

NCBI Reference Sequence: NM_001126114.2

<400>  184
gatgggattg gggttttccc ctcccatgtg ctcaagactg gcgctaaaag ttttgagctt       60

ctcaaaagtc tagagccacc gtccagggag caggtagctg ctgggctccg gggacacttt      120

gcgttcgggc tgggagcgtg ctttccacga cggtgacacg cttccctgga ttggcagcca      180

gactgccttc cgggtcactg ccatggagga gccgcagtca gatcctagcg tcgagccccc      240

tctgagtcag gaaacatttt cagacctatg gaaactactt cctgaaaaca acgttctgtc      300

ccccttgccg tcccaagcaa tggatgattt gatgctgtcc ccggacgata ttgaacaatg      360

gttcactgaa gacccaggtc cagatgaagc tcccagaatg ccagaggctg ctccccccgt      420

ggcccctgca ccagcagctc ctacaccggc ggcccctgca ccagccccct cctggcccct      480

gtcatcttct gtcccttccc agaaaaccta ccagggcagc tacggtttcc gtctgggctt      540

cttgcattct gggacagcca agtctgtgac ttgcacgtac tcccctgccc tcaacaagat      600

gttttgccaa ctggccaaga cctgccctgt gcagctgtgg gttgattcca cacccccgcc      660

cggcacccgc gtccgcgcca tggccatcta caagcagtca cagcacatga cggaggttgt      720

gaggcgctgc ccccaccatg agcgctgctc agatagcgat ggtctggccc ctcctcagca      780

tcttatccga gtggaaggaa atttgcgtgt ggagtatttg gatgacagaa acacttttcg      840

acatagtgtg gtggtgccct atgagccgcc tgaggttggc tctgactgta ccaccatcca      900

ctacaactac atgtgtaaca gttcctgcat gggcggcatg aaccggaggc ccatcctcac      960

catcatcaca ctggaagact ccagtggtaa tctactggga cggaacagct ttgaggtgcg     1020

tgtttgtgcc tgtcctggga gagaccggcg cacagaggaa gagaatctcc gcaagaaagg     1080

ggagcctcac cacgagctgc ccccagggag cactaagcga gcactgccca acaacaccag     1140

ctcctctccc cagccaaaga agaaaccact ggatggagaa tatttcaccc ttcaggacca     1200

gaccagcttt caaaaagaaa attgttaaag agagcatgaa aatggttcta tgactttgcc     1260

tgatacagat gctacttgac ttacgatggt gttacttcct gataaactcg tcgtaagttg     1320

aaaatattat ccgtgggcgt gagcgcttcg agatgttccg agagctgaat gaggccttgg     1380

aactcaagga tgcccaggct gggaaggagc caggggggag cagggctcac tccagccacc     1440

tgaagtccaa aaagggtcag tctacctccc gccataaaaa actcatgttc aagacagaag     1500

ggcctgactc agactgacat tctccacttc ttgttcccca ctgacagcct cccaccccca     1560

tctctccctc ccctgccatt ttgggttttg ggtctttgaa cccttgcttg caataggtgt     1620

gcgtcagaag cacccaggac ttccatttgc tttgtcccgg ggctccactg aacaagttgg     1680

cctgcactgg tgttttgttg tggggaggag gatggggagt aggacatacc agcttagatt     1740

ttaaggtttt tactgtgagg gatgtttggg agatgtaaga aatgttcttg cagttaaggg     1800

ttagtttaca atcagccaca ttctaggtag gggcccactt caccgtacta accagggaag     1860

ctgtccctca ctgttgaatt ttctctaact tcaaggccca tatctgtgaa atgctggcat     1920

ttgcacctac ctcacagagt gcattgtgag ggttaatgaa ataatgtaca tctggccttg     1980

aaaccacctt ttattacatg gggtctagaa cttgaccccc ttgagggtgc ttgttccctc     2040

tccctgttgg tcggtgggtt ggtagtttct acagttgggc agctggttag gtagagggag     2100

ttgtcaagtc tctgctggcc cagccaaacc ctgtctgaca acctcttggt gaaccttagt     2160

acctaaaagg aaatctcacc ccatcccaca ccctggagga tttcatctct tgtatatgat     2220

gatctggatc caccaagact tgttttatgc tcagggtcaa tttctttttt cttttttttt     2280

tttttttttc tttttctttg agactgggtc tcgctttgtt gcccaggctg gagtggagtg     2340

gcgtgatctt ggcttactgc agcctttgcc tccccggctc gagcagtcct gcctcagcct     2400

ccggagtagc tgggaccaca ggttcatgcc accatggcca gccaactttt gcatgttttg     2460

tagagatggg gtctcacagt gttgcccagg ctggtctcaa actcctgggc tcaggcgatc     2520

cacctgtctc agcctcccag agtgctggga ttacaattgt gagccaccac gtccagctgg     2580

aagggtcaac atcttttaca ttctgcaagc acatctgcat tttcacccca cccttcccct     2640

ccttctccct ttttatatcc catttttata tcgatctctt attttacaat aaaactttgc     2700

tgccacctgt gtgtctgagg ggtg                                            2724


<210>  185
<211>  341
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 3, 
       polypeptide

       NCBI Reference Sequence: NM_001126114.2

       ACCESSION   NM_001126114 XR_243566

<400>  185

Met Glu Glu Pro Gln Ser Asp Pro Ser Val Glu Pro Pro Leu Ser Gln 
1               5                   10                  15      


Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn Val Leu 
            20                  25                  30          


Ser Pro Leu Pro Ser Gln Ala Met Asp Asp Leu Met Leu Ser Pro Asp 
        35                  40                  45              


Asp Ile Glu Gln Trp Phe Thr Glu Asp Pro Gly Pro Asp Glu Ala Pro 
    50                  55                  60                  


Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro Ala Pro Ala Ala Pro 
65                  70                  75                  80  


Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser Trp Pro Leu Ser Ser Ser 
                85                  90                  95      


Val Pro Ser Gln Lys Thr Tyr Gln Gly Ser Tyr Gly Phe Arg Leu Gly 
            100                 105                 110         


Phe Leu His Ser Gly Thr Ala Lys Ser Val Thr Cys Thr Tyr Ser Pro 
        115                 120                 125             


Ala Leu Asn Lys Met Phe Cys Gln Leu Ala Lys Thr Cys Pro Val Gln 
    130                 135                 140                 


Leu Trp Val Asp Ser Thr Pro Pro Pro Gly Thr Arg Val Arg Ala Met 
145                 150                 155                 160 


Ala Ile Tyr Lys Gln Ser Gln His Met Thr Glu Val Val Arg Arg Cys 
                165                 170                 175     


Pro His His Glu Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gln 
            180                 185                 190         


His Leu Ile Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Asp 
        195                 200                 205             


Arg Asn Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu 
    210                 215                 220                 


Val Gly Ser Asp Cys Thr Thr Ile His Tyr Asn Tyr Met Cys Asn Ser 
225                 230                 235                 240 


Ser Cys Met Gly Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile Thr 
                245                 250                 255     


Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe Glu Val 
            260                 265                 270         


Arg Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Thr Glu Glu Glu Asn 
        275                 280                 285             


Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro Pro Gly Ser Thr 
    290                 295                 300                 


Lys Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser Pro Gln Pro Lys Lys 
305                 310                 315                 320 


Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu Gln Asp Gln Thr Ser Phe 
                325                 330                 335     


Gln Lys Glu Asn Cys 
            340     


<210>  186
<211>  2651
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 4, 
       mRNA

NCBI Reference Sequence: NM_001126113.2

<400>  186
gatgggattg gggttttccc ctcccatgtg ctcaagactg gcgctaaaag ttttgagctt       60

ctcaaaagtc tagagccacc gtccagggag caggtagctg ctgggctccg gggacacttt      120

gcgttcgggc tgggagcgtg ctttccacga cggtgacacg cttccctgga ttggcagcca      180

gactgccttc cgggtcactg ccatggagga gccgcagtca gatcctagcg tcgagccccc      240

tctgagtcag gaaacatttt cagacctatg gaaactactt cctgaaaaca acgttctgtc      300

ccccttgccg tcccaagcaa tggatgattt gatgctgtcc ccggacgata ttgaacaatg      360

gttcactgaa gacccaggtc cagatgaagc tcccagaatg ccagaggctg ctccccccgt      420

ggcccctgca ccagcagctc ctacaccggc ggcccctgca ccagccccct cctggcccct      480

gtcatcttct gtcccttccc agaaaaccta ccagggcagc tacggtttcc gtctgggctt      540

cttgcattct gggacagcca agtctgtgac ttgcacgtac tcccctgccc tcaacaagat      600

gttttgccaa ctggccaaga cctgccctgt gcagctgtgg gttgattcca cacccccgcc      660

cggcacccgc gtccgcgcca tggccatcta caagcagtca cagcacatga cggaggttgt      720

gaggcgctgc ccccaccatg agcgctgctc agatagcgat ggtctggccc ctcctcagca      780

tcttatccga gtggaaggaa atttgcgtgt ggagtatttg gatgacagaa acacttttcg      840

acatagtgtg gtggtgccct atgagccgcc tgaggttggc tctgactgta ccaccatcca      900

ctacaactac atgtgtaaca gttcctgcat gggcggcatg aaccggaggc ccatcctcac      960

catcatcaca ctggaagact ccagtggtaa tctactggga cggaacagct ttgaggtgcg     1020

tgtttgtgcc tgtcctggga gagaccggcg cacagaggaa gagaatctcc gcaagaaagg     1080

ggagcctcac cacgagctgc ccccagggag cactaagcga gcactgccca acaacaccag     1140

ctcctctccc cagccaaaga agaaaccact ggatggagaa tatttcaccc ttcagatgct     1200

acttgactta cgatggtgtt acttcctgat aaactcgtcg taagttgaaa atattatccg     1260

tgggcgtgag cgcttcgaga tgttccgaga gctgaatgag gccttggaac tcaaggatgc     1320

ccaggctggg aaggagccag gggggagcag ggctcactcc agccacctga agtccaaaaa     1380

gggtcagtct acctcccgcc ataaaaaact catgttcaag acagaagggc ctgactcaga     1440

ctgacattct ccacttcttg ttccccactg acagcctccc acccccatct ctccctcccc     1500

tgccattttg ggttttgggt ctttgaaccc ttgcttgcaa taggtgtgcg tcagaagcac     1560

ccaggacttc catttgcttt gtcccggggc tccactgaac aagttggcct gcactggtgt     1620

tttgttgtgg ggaggaggat ggggagtagg acataccagc ttagatttta aggtttttac     1680

tgtgagggat gtttgggaga tgtaagaaat gttcttgcag ttaagggtta gtttacaatc     1740

agccacattc taggtagggg cccacttcac cgtactaacc agggaagctg tccctcactg     1800

ttgaattttc tctaacttca aggcccatat ctgtgaaatg ctggcatttg cacctacctc     1860

acagagtgca ttgtgagggt taatgaaata atgtacatct ggccttgaaa ccacctttta     1920

ttacatgggg tctagaactt gacccccttg agggtgcttg ttccctctcc ctgttggtcg     1980

gtgggttggt agtttctaca gttgggcagc tggttaggta gagggagttg tcaagtctct     2040

gctggcccag ccaaaccctg tctgacaacc tcttggtgaa ccttagtacc taaaaggaaa     2100

tctcacccca tcccacaccc tggaggattt catctcttgt atatgatgat ctggatccac     2160

caagacttgt tttatgctca gggtcaattt cttttttctt tttttttttt ttttttcttt     2220

ttctttgaga ctgggtctcg ctttgttgcc caggctggag tggagtggcg tgatcttggc     2280

ttactgcagc ctttgcctcc ccggctcgag cagtcctgcc tcagcctccg gagtagctgg     2340

gaccacaggt tcatgccacc atggccagcc aacttttgca tgttttgtag agatggggtc     2400

tcacagtgtt gcccaggctg gtctcaaact cctgggctca ggcgatccac ctgtctcagc     2460

ctcccagagt gctgggatta caattgtgag ccaccacgtc cagctggaag ggtcaacatc     2520

ttttacattc tgcaagcaca tctgcatttt caccccaccc ttcccctcct tctccctttt     2580

tatatcccat ttttatatcg atctcttatt ttacaataaa actttgctgc cacctgtgtg     2640

tctgaggggt g                                                          2651


<210>  187
<211>  346
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 4, 
       polypeptide

NCBI Reference Sequence: NM_001126113.2

<400>  187

Met Glu Glu Pro Gln Ser Asp Pro Ser Val Glu Pro Pro Leu Ser Gln 
1               5                   10                  15      


Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn Val Leu 
            20                  25                  30          


Ser Pro Leu Pro Ser Gln Ala Met Asp Asp Leu Met Leu Ser Pro Asp 
        35                  40                  45              


Asp Ile Glu Gln Trp Phe Thr Glu Asp Pro Gly Pro Asp Glu Ala Pro 
    50                  55                  60                  


Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro Ala Pro Ala Ala Pro 
65                  70                  75                  80  


Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser Trp Pro Leu Ser Ser Ser 
                85                  90                  95      


Val Pro Ser Gln Lys Thr Tyr Gln Gly Ser Tyr Gly Phe Arg Leu Gly 
            100                 105                 110         


Phe Leu His Ser Gly Thr Ala Lys Ser Val Thr Cys Thr Tyr Ser Pro 
        115                 120                 125             


Ala Leu Asn Lys Met Phe Cys Gln Leu Ala Lys Thr Cys Pro Val Gln 
    130                 135                 140                 


Leu Trp Val Asp Ser Thr Pro Pro Pro Gly Thr Arg Val Arg Ala Met 
145                 150                 155                 160 


Ala Ile Tyr Lys Gln Ser Gln His Met Thr Glu Val Val Arg Arg Cys 
                165                 170                 175     


Pro His His Glu Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gln 
            180                 185                 190         


His Leu Ile Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Asp 
        195                 200                 205             


Arg Asn Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu 
    210                 215                 220                 


Val Gly Ser Asp Cys Thr Thr Ile His Tyr Asn Tyr Met Cys Asn Ser 
225                 230                 235                 240 


Ser Cys Met Gly Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile Thr 
                245                 250                 255     


Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe Glu Val 
            260                 265                 270         


Arg Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Thr Glu Glu Glu Asn 
        275                 280                 285             


Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro Pro Gly Ser Thr 
    290                 295                 300                 


Lys Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser Pro Gln Pro Lys Lys 
305                 310                 315                 320 


Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu Gln Met Leu Leu Asp Leu 
                325                 330                 335     


Arg Trp Cys Tyr Phe Leu Ile Asn Ser Ser 
            340                 345     


<210>  188
<211>  2271
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 5, 
       mRNA

NCBI Reference Sequence: NM_001126115.1

<400>  188
tgaggccagg agatggaggc tgcagtgagc tgtgatcaca ccactgtgct ccagcctgag       60

tgacagagca agaccctatc tcaaaaaaaa aaaaaaaaaa gaaaagctcc tgaggtgtag      120

acgccaactc tctctagctc gctagtgggt tgcaggaggt gcttacgcat gtttgtttct      180

ttgctgccgt cttccagttg ctttatctgt tcacttgtgc cctgactttc aactctgtct      240

ccttcctctt cctacagtac tcccctgccc tcaacaagat gttttgccaa ctggccaaga      300

cctgccctgt gcagctgtgg gttgattcca cacccccgcc cggcacccgc gtccgcgcca      360

tggccatcta caagcagtca cagcacatga cggaggttgt gaggcgctgc ccccaccatg      420

agcgctgctc agatagcgat ggtctggccc ctcctcagca tcttatccga gtggaaggaa      480

atttgcgtgt ggagtatttg gatgacagaa acacttttcg acatagtgtg gtggtgccct      540

atgagccgcc tgaggttggc tctgactgta ccaccatcca ctacaactac atgtgtaaca      600

gttcctgcat gggcggcatg aaccggaggc ccatcctcac catcatcaca ctggaagact      660

ccagtggtaa tctactggga cggaacagct ttgaggtgcg tgtttgtgcc tgtcctggga      720

gagaccggcg cacagaggaa gagaatctcc gcaagaaagg ggagcctcac cacgagctgc      780

ccccagggag cactaagcga gcactgccca acaacaccag ctcctctccc cagccaaaga      840

agaaaccact ggatggagaa tatttcaccc ttcagatccg tgggcgtgag cgcttcgaga      900

tgttccgaga gctgaatgag gccttggaac tcaaggatgc ccaggctggg aaggagccag      960

gggggagcag ggctcactcc agccacctga agtccaaaaa gggtcagtct acctcccgcc     1020

ataaaaaact catgttcaag acagaagggc ctgactcaga ctgacattct ccacttcttg     1080

ttccccactg acagcctccc acccccatct ctccctcccc tgccattttg ggttttgggt     1140

ctttgaaccc ttgcttgcaa taggtgtgcg tcagaagcac ccaggacttc catttgcttt     1200

gtcccggggc tccactgaac aagttggcct gcactggtgt tttgttgtgg ggaggaggat     1260

ggggagtagg acataccagc ttagatttta aggtttttac tgtgagggat gtttgggaga     1320

tgtaagaaat gttcttgcag ttaagggtta gtttacaatc agccacattc taggtagggg     1380

cccacttcac cgtactaacc agggaagctg tccctcactg ttgaattttc tctaacttca     1440

aggcccatat ctgtgaaatg ctggcatttg cacctacctc acagagtgca ttgtgagggt     1500

taatgaaata atgtacatct ggccttgaaa ccacctttta ttacatgggg tctagaactt     1560

gacccccttg agggtgcttg ttccctctcc ctgttggtcg gtgggttggt agtttctaca     1620

gttgggcagc tggttaggta gagggagttg tcaagtctct gctggcccag ccaaaccctg     1680

tctgacaacc tcttggtgaa ccttagtacc taaaaggaaa tctcacccca tcccacaccc     1740

tggaggattt catctcttgt atatgatgat ctggatccac caagacttgt tttatgctca     1800

gggtcaattt cttttttctt tttttttttt ttttttcttt ttctttgaga ctgggtctcg     1860

ctttgttgcc caggctggag tggagtggcg tgatcttggc ttactgcagc ctttgcctcc     1920

ccggctcgag cagtcctgcc tcagcctccg gagtagctgg gaccacaggt tcatgccacc     1980

atggccagcc aacttttgca tgttttgtag agatggggtc tcacagtgtt gcccaggctg     2040

gtctcaaact cctgggctca ggcgatccac ctgtctcagc ctcccagagt gctgggatta     2100

caattgtgag ccaccacgtc cagctggaag ggtcaacatc ttttacattc tgcaagcaca     2160

tctgcatttt caccccaccc ttcccctcct tctccctttt tatatcccat ttttatatcg     2220

atctcttatt ttacaataaa actttgctgc cacctgtgtg tctgaggggt g              2271


<210>  189
<211>  261
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 5, 
       polypeptide

NCBI Reference Sequence: NM_001126115.1

<400>  189

Met Phe Cys Gln Leu Ala Lys Thr Cys Pro Val Gln Leu Trp Val Asp 
1               5                   10                  15      


Ser Thr Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys 
            20                  25                  30          


Gln Ser Gln His Met Thr Glu Val Val Arg Arg Cys Pro His His Glu 
        35                  40                  45              


Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gln His Leu Ile Arg 
    50                  55                  60                  


Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Asp Arg Asn Thr Phe 
65                  70                  75                  80  


Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val Gly Ser Asp 
                85                  90                  95      


Cys Thr Thr Ile His Tyr Asn Tyr Met Cys Asn Ser Ser Cys Met Gly 
            100                 105                 110         


Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile Thr Leu Glu Asp Ser 
        115                 120                 125             


Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe Glu Val Arg Val Cys Ala 
    130                 135                 140                 


Cys Pro Gly Arg Asp Arg Arg Thr Glu Glu Glu Asn Leu Arg Lys Lys 
145                 150                 155                 160 


Gly Glu Pro His His Glu Leu Pro Pro Gly Ser Thr Lys Arg Ala Leu 
                165                 170                 175     


Pro Asn Asn Thr Ser Ser Ser Pro Gln Pro Lys Lys Lys Pro Leu Asp 
            180                 185                 190         


Gly Glu Tyr Phe Thr Leu Gln Ile Arg Gly Arg Glu Arg Phe Glu Met 
        195                 200                 205             


Phe Arg Glu Leu Asn Glu Ala Leu Glu Leu Lys Asp Ala Gln Ala Gly 
    210                 215                 220                 


Lys Glu Pro Gly Gly Ser Arg Ala His Ser Ser His Leu Lys Ser Lys 
225                 230                 235                 240 


Lys Gly Gln Ser Thr Ser Arg His Lys Lys Leu Met Phe Lys Thr Glu 
                245                 250                 255     


Gly Pro Asp Ser Asp 
            260     


<210>  190
<211>  2271
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 5, 
       mRNA

NCBI Reference Sequence: NM_001276697.1

<400>  190
tgaggccagg agatggaggc tgcagtgagc tgtgatcaca ccactgtgct ccagcctgag       60

tgacagagca agaccctatc tcaaaaaaaa aaaaaaaaaa gaaaagctcc tgaggtgtag      120

acgccaactc tctctagctc gctagtgggt tgcaggaggt gcttacgcat gtttgtttct      180

ttgctgccgt cttccagttg ctttatctgt tcacttgtgc cctgactttc aactctgtct      240

ccttcctctt cctacagtac tcccctgccc tcaacaagat gttttgccaa ctggccaaga      300

cctgccctgt gcagctgtgg gttgattcca cacccccgcc cggcacccgc gtccgcgcca      360

tggccatcta caagcagtca cagcacatga cggaggttgt gaggcgctgc ccccaccatg      420

agcgctgctc agatagcgat ggtctggccc ctcctcagca tcttatccga gtggaaggaa      480

atttgcgtgt ggagtatttg gatgacagaa acacttttcg acatagtgtg gtggtgccct      540

atgagccgcc tgaggttggc tctgactgta ccaccatcca ctacaactac atgtgtaaca      600

gttcctgcat gggcggcatg aaccggaggc ccatcctcac catcatcaca ctggaagact      660

ccagtggtaa tctactggga cggaacagct ttgaggtgcg tgtttgtgcc tgtcctggga      720

gagaccggcg cacagaggaa gagaatctcc gcaagaaagg ggagcctcac cacgagctgc      780

ccccagggag cactaagcga gcactgccca acaacaccag ctcctctccc cagccaaaga      840

agaaaccact ggatggagaa tatttcaccc ttcagatccg tgggcgtgag cgcttcgaga      900

tgttccgaga gctgaatgag gccttggaac tcaaggatgc ccaggctggg aaggagccag      960

gggggagcag ggctcactcc agccacctga agtccaaaaa gggtcagtct acctcccgcc     1020

ataaaaaact catgttcaag acagaagggc ctgactcaga ctgacattct ccacttcttg     1080

ttccccactg acagcctccc acccccatct ctccctcccc tgccattttg ggttttgggt     1140

ctttgaaccc ttgcttgcaa taggtgtgcg tcagaagcac ccaggacttc catttgcttt     1200

gtcccggggc tccactgaac aagttggcct gcactggtgt tttgttgtgg ggaggaggat     1260

ggggagtagg acataccagc ttagatttta aggtttttac tgtgagggat gtttgggaga     1320

tgtaagaaat gttcttgcag ttaagggtta gtttacaatc agccacattc taggtagggg     1380

cccacttcac cgtactaacc agggaagctg tccctcactg ttgaattttc tctaacttca     1440

aggcccatat ctgtgaaatg ctggcatttg cacctacctc acagagtgca ttgtgagggt     1500

taatgaaata atgtacatct ggccttgaaa ccacctttta ttacatgggg tctagaactt     1560

gacccccttg agggtgcttg ttccctctcc ctgttggtcg gtgggttggt agtttctaca     1620

gttgggcagc tggttaggta gagggagttg tcaagtctct gctggcccag ccaaaccctg     1680

tctgacaacc tcttggtgaa ccttagtacc taaaaggaaa tctcacccca tcccacaccc     1740

tggaggattt catctcttgt atatgatgat ctggatccac caagacttgt tttatgctca     1800

gggtcaattt cttttttctt tttttttttt ttttttcttt ttctttgaga ctgggtctcg     1860

ctttgttgcc caggctggag tggagtggcg tgatcttggc ttactgcagc ctttgcctcc     1920

ccggctcgag cagtcctgcc tcagcctccg gagtagctgg gaccacaggt tcatgccacc     1980

atggccagcc aacttttgca tgttttgtag agatggggtc tcacagtgtt gcccaggctg     2040

gtctcaaact cctgggctca ggcgatccac ctgtctcagc ctcccagagt gctgggatta     2100

caattgtgag ccaccacgtc cagctggaag ggtcaacatc ttttacattc tgcaagcaca     2160

tctgcatttt caccccaccc ttcccctcct tctccctttt tatatcccat ttttatatcg     2220

atctcttatt ttacaataaa actttgctgc cacctgtgtg tctgaggggt g              2271


<210>  191
<211>  234
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 5, 
       polypeptide

NCBI Reference Sequence: NM_001276697.1

<400>  191

Met Ala Ile Tyr Lys Gln Ser Gln His Met Thr Glu Val Val Arg Arg 
1               5                   10                  15      


Cys Pro His His Glu Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro 
            20                  25                  30          


Gln His Leu Ile Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp 
        35                  40                  45              


Asp Arg Asn Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro 
    50                  55                  60                  


Glu Val Gly Ser Asp Cys Thr Thr Ile His Tyr Asn Tyr Met Cys Asn 
65                  70                  75                  80  


Ser Ser Cys Met Gly Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile 
                85                  90                  95      


Thr Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe Glu 
            100                 105                 110         


Val Arg Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Thr Glu Glu Glu 
        115                 120                 125             


Asn Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro Pro Gly Ser 
    130                 135                 140                 


Thr Lys Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser Pro Gln Pro Lys 
145                 150                 155                 160 


Lys Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu Gln Ile Arg Gly Arg 
                165                 170                 175     


Glu Arg Phe Glu Met Phe Arg Glu Leu Asn Glu Ala Leu Glu Leu Lys 
            180                 185                 190         


Asp Ala Gln Ala Gly Lys Glu Pro Gly Gly Ser Arg Ala His Ser Ser 
        195                 200                 205             


His Leu Lys Ser Lys Lys Gly Gln Ser Thr Ser Arg His Lys Lys Leu 
    210                 215                 220                 


Met Phe Lys Thr Glu Gly Pro Asp Ser Asp 
225                 230                 


<210>  192
<211>  2708
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 8, 
       mRNA

NCBI Reference Sequence: NM_001126118.1

<400>  192
gatgggattg gggttttccc ctcccatgtg ctcaagactg gcgctaaaag ttttgagctt       60

ctcaaaagtc tagagccacc gtccagggag caggtagctg ctgggctccg gggacacttt      120

gcgttcgggc tgggagcgtg ctttccacga cggtgacacg cttccctgga ttggcagcca      180

gactgccttc cgggtcactg ccatggagga gccgcagtca gatcctagcg tcgagccccc      240

tctgagtcag gaaacatttt cagacctatg gaaactgtga gtggatccat tggaagggca      300

ggcccaccac ccccacccca accccagccc cctagcagag acctgtggga agcgaaaatt      360

ccatgggact gactttctgc tcttgtcttt cagacttcct gaaaacaacg ttctgtcccc      420

cttgccgtcc caagcaatgg atgatttgat gctgtccccg gacgatattg aacaatggtt      480

cactgaagac ccaggtccag atgaagctcc cagaatgcca gaggctgctc cccccgtggc      540

ccctgcacca gcagctccta caccggcggc ccctgcacca gccccctcct ggcccctgtc      600

atcttctgtc ccttcccaga aaacctacca gggcagctac ggtttccgtc tgggcttctt      660

gcattctggg acagccaagt ctgtgacttg cacgtactcc cctgccctca acaagatgtt      720

ttgccaactg gccaagacct gccctgtgca gctgtgggtt gattccacac ccccgcccgg      780

cacccgcgtc cgcgccatgg ccatctacaa gcagtcacag cacatgacgg aggttgtgag      840

gcgctgcccc caccatgagc gctgctcaga tagcgatggt ctggcccctc ctcagcatct      900

tatccgagtg gaaggaaatt tgcgtgtgga gtatttggat gacagaaaca cttttcgaca      960

tagtgtggtg gtgccctatg agccgcctga ggttggctct gactgtacca ccatccacta     1020

caactacatg tgtaacagtt cctgcatggg cggcatgaac cggaggccca tcctcaccat     1080

catcacactg gaagactcca gtggtaatct actgggacgg aacagctttg aggtgcgtgt     1140

ttgtgcctgt cctgggagag accggcgcac agaggaagag aatctccgca agaaagggga     1200

gcctcaccac gagctgcccc cagggagcac taagcgagca ctgcccaaca acaccagctc     1260

ctctccccag ccaaagaaga aaccactgga tggagaatat ttcacccttc agatccgtgg     1320

gcgtgagcgc ttcgagatgt tccgagagct gaatgaggcc ttggaactca aggatgccca     1380

ggctgggaag gagccagggg ggagcagggc tcactccagc cacctgaagt ccaaaaaggg     1440

tcagtctacc tcccgccata aaaaactcat gttcaagaca gaagggcctg actcagactg     1500

acattctcca cttcttgttc cccactgaca gcctcccacc cccatctctc cctcccctgc     1560

cattttgggt tttgggtctt tgaacccttg cttgcaatag gtgtgcgtca gaagcaccca     1620

ggacttccat ttgctttgtc ccggggctcc actgaacaag ttggcctgca ctggtgtttt     1680

gttgtgggga ggaggatggg gagtaggaca taccagctta gattttaagg tttttactgt     1740

gagggatgtt tgggagatgt aagaaatgtt cttgcagtta agggttagtt tacaatcagc     1800

cacattctag gtaggggccc acttcaccgt actaaccagg gaagctgtcc ctcactgttg     1860

aattttctct aacttcaagg cccatatctg tgaaatgctg gcatttgcac ctacctcaca     1920

gagtgcattg tgagggttaa tgaaataatg tacatctggc cttgaaacca ccttttatta     1980

catggggtct agaacttgac ccccttgagg gtgcttgttc cctctccctg ttggtcggtg     2040

ggttggtagt ttctacagtt gggcagctgg ttaggtagag ggagttgtca agtctctgct     2100

ggcccagcca aaccctgtct gacaacctct tggtgaacct tagtacctaa aaggaaatct     2160

caccccatcc cacaccctgg aggatttcat ctcttgtata tgatgatctg gatccaccaa     2220

gacttgtttt atgctcaggg tcaatttctt ttttcttttt tttttttttt tttctttttc     2280

tttgagactg ggtctcgctt tgttgcccag gctggagtgg agtggcgtga tcttggctta     2340

ctgcagcctt tgcctccccg gctcgagcag tcctgcctca gcctccggag tagctgggac     2400

cacaggttca tgccaccatg gccagccaac ttttgcatgt tttgtagaga tggggtctca     2460

cagtgttgcc caggctggtc tcaaactcct gggctcaggc gatccacctg tctcagcctc     2520

ccagagtgct gggattacaa ttgtgagcca ccacgtccag ctggaagggt caacatcttt     2580

tacattctgc aagcacatct gcattttcac cccacccttc ccctccttct ccctttttat     2640

atcccatttt tatatcgatc tcttatttta caataaaact ttgctgccac ctgtgtgtct     2700

gaggggtg                                                              2708


<210>  193
<211>  354
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 8, 
       polypeptide

NCBI Reference Sequence: NM_001126118.1

<400>  193

Met Asp Asp Leu Met Leu Ser Pro Asp Asp Ile Glu Gln Trp Phe Thr 
1               5                   10                  15      


Glu Asp Pro Gly Pro Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro 
            20                  25                  30          


Pro Val Ala Pro Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro 
        35                  40                  45              


Ala Pro Ser Trp Pro Leu Ser Ser Ser Val Pro Ser Gln Lys Thr Tyr 
    50                  55                  60                  


Gln Gly Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser Gly Thr Ala 
65                  70                  75                  80  


Lys Ser Val Thr Cys Thr Tyr Ser Pro Ala Leu Asn Lys Met Phe Cys 
                85                  90                  95      


Gln Leu Ala Lys Thr Cys Pro Val Gln Leu Trp Val Asp Ser Thr Pro 
            100                 105                 110         


Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gln Ser Gln 
        115                 120                 125             


His Met Thr Glu Val Val Arg Arg Cys Pro His His Glu Arg Cys Ser 
    130                 135                 140                 


Asp Ser Asp Gly Leu Ala Pro Pro Gln His Leu Ile Arg Val Glu Gly 
145                 150                 155                 160 


Asn Leu Arg Val Glu Tyr Leu Asp Asp Arg Asn Thr Phe Arg His Ser 
                165                 170                 175     


Val Val Val Pro Tyr Glu Pro Pro Glu Val Gly Ser Asp Cys Thr Thr 
            180                 185                 190         


Ile His Tyr Asn Tyr Met Cys Asn Ser Ser Cys Met Gly Gly Met Asn 
        195                 200                 205             


Arg Arg Pro Ile Leu Thr Ile Ile Thr Leu Glu Asp Ser Ser Gly Asn 
    210                 215                 220                 


Leu Leu Gly Arg Asn Ser Phe Glu Val Arg Val Cys Ala Cys Pro Gly 
225                 230                 235                 240 


Arg Asp Arg Arg Thr Glu Glu Glu Asn Leu Arg Lys Lys Gly Glu Pro 
                245                 250                 255     


His His Glu Leu Pro Pro Gly Ser Thr Lys Arg Ala Leu Pro Asn Asn 
            260                 265                 270         


Thr Ser Ser Ser Pro Gln Pro Lys Lys Lys Pro Leu Asp Gly Glu Tyr 
        275                 280                 285             


Phe Thr Leu Gln Ile Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu 
    290                 295                 300                 


Leu Asn Glu Ala Leu Glu Leu Lys Asp Ala Gln Ala Gly Lys Glu Pro 
305                 310                 315                 320 


Gly Gly Ser Arg Ala His Ser Ser His Leu Lys Ser Lys Lys Gly Gln 
                325                 330                 335     


Ser Thr Ser Arg His Lys Lys Leu Met Phe Lys Thr Glu Gly Pro Asp 
            340                 345                 350         


Ser Asp 
        


<210>  194
<211>  2331
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 7, 
       mRNA

NCBI Reference Sequence: NM_001276699.1

<400>  194
tgaggccagg agatggaggc tgcagtgagc tgtgatcaca ccactgtgct ccagcctgag       60

tgacagagca agaccctatc tcaaaaaaaa aaaaaaaaaa gaaaagctcc tgaggtgtag      120

acgccaactc tctctagctc gctagtgggt tgcaggaggt gcttacgcat gtttgtttct      180

ttgctgccgt cttccagttg ctttatctgt tcacttgtgc cctgactttc aactctgtct      240

ccttcctctt cctacagtac tcccctgccc tcaacaagat gttttgccaa ctggccaaga      300

cctgccctgt gcagctgtgg gttgattcca cacccccgcc cggcacccgc gtccgcgcca      360

tggccatcta caagcagtca cagcacatga cggaggttgt gaggcgctgc ccccaccatg      420

agcgctgctc agatagcgat ggtctggccc ctcctcagca tcttatccga gtggaaggaa      480

atttgcgtgt ggagtatttg gatgacagaa acacttttcg acatagtgtg gtggtgccct      540

atgagccgcc tgaggttggc tctgactgta ccaccatcca ctacaactac atgtgtaaca      600

gttcctgcat gggcggcatg aaccggaggc ccatcctcac catcatcaca ctggaagact      660

ccagtggtaa tctactggga cggaacagct ttgaggtgcg tgtttgtgcc tgtcctggga      720

gagaccggcg cacagaggaa gagaatctcc gcaagaaagg ggagcctcac cacgagctgc      780

ccccagggag cactaagcga gcactgccca acaacaccag ctcctctccc cagccaaaga      840

agaaaccact ggatggagaa tatttcaccc ttcagatgct acttgactta cgatggtgtt      900

acttcctgat aaactcgtcg taagttgaaa atattatccg tgggcgtgag cgcttcgaga      960

tgttccgaga gctgaatgag gccttggaac tcaaggatgc ccaggctggg aaggagccag     1020

gggggagcag ggctcactcc agccacctga agtccaaaaa gggtcagtct acctcccgcc     1080

ataaaaaact catgttcaag acagaagggc ctgactcaga ctgacattct ccacttcttg     1140

ttccccactg acagcctccc acccccatct ctccctcccc tgccattttg ggttttgggt     1200

ctttgaaccc ttgcttgcaa taggtgtgcg tcagaagcac ccaggacttc catttgcttt     1260

gtcccggggc tccactgaac aagttggcct gcactggtgt tttgttgtgg ggaggaggat     1320

ggggagtagg acataccagc ttagatttta aggtttttac tgtgagggat gtttgggaga     1380

tgtaagaaat gttcttgcag ttaagggtta gtttacaatc agccacattc taggtagggg     1440

cccacttcac cgtactaacc agggaagctg tccctcactg ttgaattttc tctaacttca     1500

aggcccatat ctgtgaaatg ctggcatttg cacctacctc acagagtgca ttgtgagggt     1560

taatgaaata atgtacatct ggccttgaaa ccacctttta ttacatgggg tctagaactt     1620

gacccccttg agggtgcttg ttccctctcc ctgttggtcg gtgggttggt agtttctaca     1680

gttgggcagc tggttaggta gagggagttg tcaagtctct gctggcccag ccaaaccctg     1740

tctgacaacc tcttggtgaa ccttagtacc taaaaggaaa tctcacccca tcccacaccc     1800

tggaggattt catctcttgt atatgatgat ctggatccac caagacttgt tttatgctca     1860

gggtcaattt cttttttctt tttttttttt ttttttcttt ttctttgaga ctgggtctcg     1920

ctttgttgcc caggctggag tggagtggcg tgatcttggc ttactgcagc ctttgcctcc     1980

ccggctcgag cagtcctgcc tcagcctccg gagtagctgg gaccacaggt tcatgccacc     2040

atggccagcc aacttttgca tgttttgtag agatggggtc tcacagtgtt gcccaggctg     2100

gtctcaaact cctgggctca ggcgatccac ctgtctcagc ctcccagagt gctgggatta     2160

caattgtgag ccaccacgtc cagctggaag ggtcaacatc ttttacattc tgcaagcaca     2220

tctgcatttt caccccaccc ttcccctcct tctccctttt tatatcccat ttttatatcg     2280

atctcttatt ttacaataaa actttgctgc cacctgtgtg tctgaggggt g              2331


<210>  195
<211>  187
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 7, 
       polypeptide

NCBI Reference Sequence: NM_001276699.1

<400>  195

Met Ala Ile Tyr Lys Gln Ser Gln His Met Thr Glu Val Val Arg Arg 
1               5                   10                  15      


Cys Pro His His Glu Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro 
            20                  25                  30          


Gln His Leu Ile Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp 
        35                  40                  45              


Asp Arg Asn Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro 
    50                  55                  60                  


Glu Val Gly Ser Asp Cys Thr Thr Ile His Tyr Asn Tyr Met Cys Asn 
65                  70                  75                  80  


Ser Ser Cys Met Gly Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile 
                85                  90                  95      


Thr Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe Glu 
            100                 105                 110         


Val Arg Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Thr Glu Glu Glu 
        115                 120                 125             


Asn Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro Pro Gly Ser 
    130                 135                 140                 


Thr Lys Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser Pro Gln Pro Lys 
145                 150                 155                 160 


Lys Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu Gln Met Leu Leu Asp 
                165                 170                 175     


Leu Arg Trp Cys Tyr Phe Leu Ile Asn Ser Ser 
            180                 185         


<210>  196
<211>  2404
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 6, 
       mRNA

NCBI Reference Sequence: NM_001276698.1

<400>  196
tgaggccagg agatggaggc tgcagtgagc tgtgatcaca ccactgtgct ccagcctgag       60

tgacagagca agaccctatc tcaaaaaaaa aaaaaaaaaa gaaaagctcc tgaggtgtag      120

acgccaactc tctctagctc gctagtgggt tgcaggaggt gcttacgcat gtttgtttct      180

ttgctgccgt cttccagttg ctttatctgt tcacttgtgc cctgactttc aactctgtct      240

ccttcctctt cctacagtac tcccctgccc tcaacaagat gttttgccaa ctggccaaga      300

cctgccctgt gcagctgtgg gttgattcca cacccccgcc cggcacccgc gtccgcgcca      360

tggccatcta caagcagtca cagcacatga cggaggttgt gaggcgctgc ccccaccatg      420

agcgctgctc agatagcgat ggtctggccc ctcctcagca tcttatccga gtggaaggaa      480

atttgcgtgt ggagtatttg gatgacagaa acacttttcg acatagtgtg gtggtgccct      540

atgagccgcc tgaggttggc tctgactgta ccaccatcca ctacaactac atgtgtaaca      600

gttcctgcat gggcggcatg aaccggaggc ccatcctcac catcatcaca ctggaagact      660

ccagtggtaa tctactggga cggaacagct ttgaggtgcg tgtttgtgcc tgtcctggga      720

gagaccggcg cacagaggaa gagaatctcc gcaagaaagg ggagcctcac cacgagctgc      780

ccccagggag cactaagcga gcactgccca acaacaccag ctcctctccc cagccaaaga      840

agaaaccact ggatggagaa tatttcaccc ttcaggacca gaccagcttt caaaaagaaa      900

attgttaaag agagcatgaa aatggttcta tgactttgcc tgatacagat gctacttgac      960

ttacgatggt gttacttcct gataaactcg tcgtaagttg aaaatattat ccgtgggcgt     1020

gagcgcttcg agatgttccg agagctgaat gaggccttgg aactcaagga tgcccaggct     1080

gggaaggagc caggggggag cagggctcac tccagccacc tgaagtccaa aaagggtcag     1140

tctacctccc gccataaaaa actcatgttc aagacagaag ggcctgactc agactgacat     1200

tctccacttc ttgttcccca ctgacagcct cccaccccca tctctccctc ccctgccatt     1260

ttgggttttg ggtctttgaa cccttgcttg caataggtgt gcgtcagaag cacccaggac     1320

ttccatttgc tttgtcccgg ggctccactg aacaagttgg cctgcactgg tgttttgttg     1380

tggggaggag gatggggagt aggacatacc agcttagatt ttaaggtttt tactgtgagg     1440

gatgtttggg agatgtaaga aatgttcttg cagttaaggg ttagtttaca atcagccaca     1500

ttctaggtag gggcccactt caccgtacta accagggaag ctgtccctca ctgttgaatt     1560

ttctctaact tcaaggccca tatctgtgaa atgctggcat ttgcacctac ctcacagagt     1620

gcattgtgag ggttaatgaa ataatgtaca tctggccttg aaaccacctt ttattacatg     1680

gggtctagaa cttgaccccc ttgagggtgc ttgttccctc tccctgttgg tcggtgggtt     1740

ggtagtttct acagttgggc agctggttag gtagagggag ttgtcaagtc tctgctggcc     1800

cagccaaacc ctgtctgaca acctcttggt gaaccttagt acctaaaagg aaatctcacc     1860

ccatcccaca ccctggagga tttcatctct tgtatatgat gatctggatc caccaagact     1920

tgttttatgc tcagggtcaa tttctttttt cttttttttt tttttttttc tttttctttg     1980

agactgggtc tcgctttgtt gcccaggctg gagtggagtg gcgtgatctt ggcttactgc     2040

agcctttgcc tccccggctc gagcagtcct gcctcagcct ccggagtagc tgggaccaca     2100

ggttcatgcc accatggcca gccaactttt gcatgttttg tagagatggg gtctcacagt     2160

gttgcccagg ctggtctcaa actcctgggc tcaggcgatc cacctgtctc agcctcccag     2220

agtgctggga ttacaattgt gagccaccac gtccagctgg aagggtcaac atcttttaca     2280

ttctgcaagc acatctgcat tttcacccca cccttcccct ccttctccct ttttatatcc     2340

catttttata tcgatctctt attttacaat aaaactttgc tgccacctgt gtgtctgagg     2400

ggtg                                                                  2404


<210>  197
<211>  182
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 6, 
       polypeptide

NCBI Reference Sequence: NM_001276698.1

<400>  197

Met Ala Ile Tyr Lys Gln Ser Gln His Met Thr Glu Val Val Arg Arg 
1               5                   10                  15      


Cys Pro His His Glu Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro 
            20                  25                  30          


Gln His Leu Ile Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp 
        35                  40                  45              


Asp Arg Asn Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro 
    50                  55                  60                  


Glu Val Gly Ser Asp Cys Thr Thr Ile His Tyr Asn Tyr Met Cys Asn 
65                  70                  75                  80  


Ser Ser Cys Met Gly Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile 
                85                  90                  95      


Thr Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe Glu 
            100                 105                 110         


Val Arg Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Thr Glu Glu Glu 
        115                 120                 125             


Asn Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro Pro Gly Ser 
    130                 135                 140                 


Thr Lys Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser Pro Gln Pro Lys 
145                 150                 155                 160 


Lys Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu Gln Asp Gln Thr Ser 
                165                 170                 175     


Phe Gln Lys Glu Asn Cys 
            180         


<210>  198
<211>  2724
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 3, 
       mRNA

NCBI Reference Sequence: NM_001276696.1

<400>  198
gatgggattg gggttttccc ctcccatgtg ctcaagactg gcgctaaaag ttttgagctt       60

ctcaaaagtc tagagccacc gtccagggag caggtagctg ctgggctccg gggacacttt      120

gcgttcgggc tgggagcgtg ctttccacga cggtgacacg cttccctgga ttggcagcca      180

gactgccttc cgggtcactg ccatggagga gccgcagtca gatcctagcg tcgagccccc      240

tctgagtcag gaaacatttt cagacctatg gaaactactt cctgaaaaca acgttctgtc      300

ccccttgccg tcccaagcaa tggatgattt gatgctgtcc ccggacgata ttgaacaatg      360

gttcactgaa gacccaggtc cagatgaagc tcccagaatg ccagaggctg ctccccccgt      420

ggcccctgca ccagcagctc ctacaccggc ggcccctgca ccagccccct cctggcccct      480

gtcatcttct gtcccttccc agaaaaccta ccagggcagc tacggtttcc gtctgggctt      540

cttgcattct gggacagcca agtctgtgac ttgcacgtac tcccctgccc tcaacaagat      600

gttttgccaa ctggccaaga cctgccctgt gcagctgtgg gttgattcca cacccccgcc      660

cggcacccgc gtccgcgcca tggccatcta caagcagtca cagcacatga cggaggttgt      720

gaggcgctgc ccccaccatg agcgctgctc agatagcgat ggtctggccc ctcctcagca      780

tcttatccga gtggaaggaa atttgcgtgt ggagtatttg gatgacagaa acacttttcg      840

acatagtgtg gtggtgccct atgagccgcc tgaggttggc tctgactgta ccaccatcca      900

ctacaactac atgtgtaaca gttcctgcat gggcggcatg aaccggaggc ccatcctcac      960

catcatcaca ctggaagact ccagtggtaa tctactggga cggaacagct ttgaggtgcg     1020

tgtttgtgcc tgtcctggga gagaccggcg cacagaggaa gagaatctcc gcaagaaagg     1080

ggagcctcac cacgagctgc ccccagggag cactaagcga gcactgccca acaacaccag     1140

ctcctctccc cagccaaaga agaaaccact ggatggagaa tatttcaccc ttcaggacca     1200

gaccagcttt caaaaagaaa attgttaaag agagcatgaa aatggttcta tgactttgcc     1260

tgatacagat gctacttgac ttacgatggt gttacttcct gataaactcg tcgtaagttg     1320

aaaatattat ccgtgggcgt gagcgcttcg agatgttccg agagctgaat gaggccttgg     1380

aactcaagga tgcccaggct gggaaggagc caggggggag cagggctcac tccagccacc     1440

tgaagtccaa aaagggtcag tctacctccc gccataaaaa actcatgttc aagacagaag     1500

ggcctgactc agactgacat tctccacttc ttgttcccca ctgacagcct cccaccccca     1560

tctctccctc ccctgccatt ttgggttttg ggtctttgaa cccttgcttg caataggtgt     1620

gcgtcagaag cacccaggac ttccatttgc tttgtcccgg ggctccactg aacaagttgg     1680

cctgcactgg tgttttgttg tggggaggag gatggggagt aggacatacc agcttagatt     1740

ttaaggtttt tactgtgagg gatgtttggg agatgtaaga aatgttcttg cagttaaggg     1800

ttagtttaca atcagccaca ttctaggtag gggcccactt caccgtacta accagggaag     1860

ctgtccctca ctgttgaatt ttctctaact tcaaggccca tatctgtgaa atgctggcat     1920

ttgcacctac ctcacagagt gcattgtgag ggttaatgaa ataatgtaca tctggccttg     1980

aaaccacctt ttattacatg gggtctagaa cttgaccccc ttgagggtgc ttgttccctc     2040

tccctgttgg tcggtgggtt ggtagtttct acagttgggc agctggttag gtagagggag     2100

ttgtcaagtc tctgctggcc cagccaaacc ctgtctgaca acctcttggt gaaccttagt     2160

acctaaaagg aaatctcacc ccatcccaca ccctggagga tttcatctct tgtatatgat     2220

gatctggatc caccaagact tgttttatgc tcagggtcaa tttctttttt cttttttttt     2280

tttttttttc tttttctttg agactgggtc tcgctttgtt gcccaggctg gagtggagtg     2340

gcgtgatctt ggcttactgc agcctttgcc tccccggctc gagcagtcct gcctcagcct     2400

ccggagtagc tgggaccaca ggttcatgcc accatggcca gccaactttt gcatgttttg     2460

tagagatggg gtctcacagt gttgcccagg ctggtctcaa actcctgggc tcaggcgatc     2520

cacctgtctc agcctcccag agtgctggga ttacaattgt gagccaccac gtccagctgg     2580

aagggtcaac atcttttaca ttctgcaagc acatctgcat tttcacccca cccttcccct     2640

ccttctccct ttttatatcc catttttata tcgatctctt attttacaat aaaactttgc     2700

tgccacctgt gtgtctgagg ggtg                                            2724


<210>  199
<211>  302
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 3, 
       polypeptide

NCBI Reference Sequence: NM_001276696.1

<400>  199

Met Asp Asp Leu Met Leu Ser Pro Asp Asp Ile Glu Gln Trp Phe Thr 
1               5                   10                  15      


Glu Asp Pro Gly Pro Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro 
            20                  25                  30          


Pro Val Ala Pro Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro 
        35                  40                  45              


Ala Pro Ser Trp Pro Leu Ser Ser Ser Val Pro Ser Gln Lys Thr Tyr 
    50                  55                  60                  


Gln Gly Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser Gly Thr Ala 
65                  70                  75                  80  


Lys Ser Val Thr Cys Thr Tyr Ser Pro Ala Leu Asn Lys Met Phe Cys 
                85                  90                  95      


Gln Leu Ala Lys Thr Cys Pro Val Gln Leu Trp Val Asp Ser Thr Pro 
            100                 105                 110         


Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gln Ser Gln 
        115                 120                 125             


His Met Thr Glu Val Val Arg Arg Cys Pro His His Glu Arg Cys Ser 
    130                 135                 140                 


Asp Ser Asp Gly Leu Ala Pro Pro Gln His Leu Ile Arg Val Glu Gly 
145                 150                 155                 160 


Asn Leu Arg Val Glu Tyr Leu Asp Asp Arg Asn Thr Phe Arg His Ser 
                165                 170                 175     


Val Val Val Pro Tyr Glu Pro Pro Glu Val Gly Ser Asp Cys Thr Thr 
            180                 185                 190         


Ile His Tyr Asn Tyr Met Cys Asn Ser Ser Cys Met Gly Gly Met Asn 
        195                 200                 205             


Arg Arg Pro Ile Leu Thr Ile Ile Thr Leu Glu Asp Ser Ser Gly Asn 
    210                 215                 220                 


Leu Leu Gly Arg Asn Ser Phe Glu Val Arg Val Cys Ala Cys Pro Gly 
225                 230                 235                 240 


Arg Asp Arg Arg Thr Glu Glu Glu Asn Leu Arg Lys Lys Gly Glu Pro 
                245                 250                 255     


His His Glu Leu Pro Pro Gly Ser Thr Lys Arg Ala Leu Pro Asn Asn 
            260                 265                 270         


Thr Ser Ser Ser Pro Gln Pro Lys Lys Lys Pro Leu Asp Gly Glu Tyr 
        275                 280                 285             


Phe Thr Leu Gln Asp Gln Thr Ser Phe Gln Lys Glu Asn Cys 
    290                 295                 300         


<210>  200
<211>  2651
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 4, 
       mRNA

NCBI Reference Sequence: NM_001276695.1

<400>  200
gatgggattg gggttttccc ctcccatgtg ctcaagactg gcgctaaaag ttttgagctt       60

ctcaaaagtc tagagccacc gtccagggag caggtagctg ctgggctccg gggacacttt      120

gcgttcgggc tgggagcgtg ctttccacga cggtgacacg cttccctgga ttggcagcca      180

gactgccttc cgggtcactg ccatggagga gccgcagtca gatcctagcg tcgagccccc      240

tctgagtcag gaaacatttt cagacctatg gaaactactt cctgaaaaca acgttctgtc      300

ccccttgccg tcccaagcaa tggatgattt gatgctgtcc ccggacgata ttgaacaatg      360

gttcactgaa gacccaggtc cagatgaagc tcccagaatg ccagaggctg ctccccccgt      420

ggcccctgca ccagcagctc ctacaccggc ggcccctgca ccagccccct cctggcccct      480

gtcatcttct gtcccttccc agaaaaccta ccagggcagc tacggtttcc gtctgggctt      540

cttgcattct gggacagcca agtctgtgac ttgcacgtac tcccctgccc tcaacaagat      600

gttttgccaa ctggccaaga cctgccctgt gcagctgtgg gttgattcca cacccccgcc      660

cggcacccgc gtccgcgcca tggccatcta caagcagtca cagcacatga cggaggttgt      720

gaggcgctgc ccccaccatg agcgctgctc agatagcgat ggtctggccc ctcctcagca      780

tcttatccga gtggaaggaa atttgcgtgt ggagtatttg gatgacagaa acacttttcg      840

acatagtgtg gtggtgccct atgagccgcc tgaggttggc tctgactgta ccaccatcca      900

ctacaactac atgtgtaaca gttcctgcat gggcggcatg aaccggaggc ccatcctcac      960

catcatcaca ctggaagact ccagtggtaa tctactggga cggaacagct ttgaggtgcg     1020

tgtttgtgcc tgtcctggga gagaccggcg cacagaggaa gagaatctcc gcaagaaagg     1080

ggagcctcac cacgagctgc ccccagggag cactaagcga gcactgccca acaacaccag     1140

ctcctctccc cagccaaaga agaaaccact ggatggagaa tatttcaccc ttcagatgct     1200

acttgactta cgatggtgtt acttcctgat aaactcgtcg taagttgaaa atattatccg     1260

tgggcgtgag cgcttcgaga tgttccgaga gctgaatgag gccttggaac tcaaggatgc     1320

ccaggctggg aaggagccag gggggagcag ggctcactcc agccacctga agtccaaaaa     1380

gggtcagtct acctcccgcc ataaaaaact catgttcaag acagaagggc ctgactcaga     1440

ctgacattct ccacttcttg ttccccactg acagcctccc acccccatct ctccctcccc     1500

tgccattttg ggttttgggt ctttgaaccc ttgcttgcaa taggtgtgcg tcagaagcac     1560

ccaggacttc catttgcttt gtcccggggc tccactgaac aagttggcct gcactggtgt     1620

tttgttgtgg ggaggaggat ggggagtagg acataccagc ttagatttta aggtttttac     1680

tgtgagggat gtttgggaga tgtaagaaat gttcttgcag ttaagggtta gtttacaatc     1740

agccacattc taggtagggg cccacttcac cgtactaacc agggaagctg tccctcactg     1800

ttgaattttc tctaacttca aggcccatat ctgtgaaatg ctggcatttg cacctacctc     1860

acagagtgca ttgtgagggt taatgaaata atgtacatct ggccttgaaa ccacctttta     1920

ttacatgggg tctagaactt gacccccttg agggtgcttg ttccctctcc ctgttggtcg     1980

gtgggttggt agtttctaca gttgggcagc tggttaggta gagggagttg tcaagtctct     2040

gctggcccag ccaaaccctg tctgacaacc tcttggtgaa ccttagtacc taaaaggaaa     2100

tctcacccca tcccacaccc tggaggattt catctcttgt atatgatgat ctggatccac     2160

caagacttgt tttatgctca gggtcaattt cttttttctt tttttttttt ttttttcttt     2220

ttctttgaga ctgggtctcg ctttgttgcc caggctggag tggagtggcg tgatcttggc     2280

ttactgcagc ctttgcctcc ccggctcgag cagtcctgcc tcagcctccg gagtagctgg     2340

gaccacaggt tcatgccacc atggccagcc aacttttgca tgttttgtag agatggggtc     2400

tcacagtgtt gcccaggctg gtctcaaact cctgggctca ggcgatccac ctgtctcagc     2460

ctcccagagt gctgggatta caattgtgag ccaccacgtc cagctggaag ggtcaacatc     2520

ttttacattc tgcaagcaca tctgcatttt caccccaccc ttcccctcct tctccctttt     2580

tatatcccat ttttatatcg atctcttatt ttacaataaa actttgctgc cacctgtgtg     2640

tctgaggggt g                                                          2651


<210>  201
<211>  307
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 4, 
       polypeptide

NCBI Reference Sequence: NM_001276695.1

<400>  201

Met Asp Asp Leu Met Leu Ser Pro Asp Asp Ile Glu Gln Trp Phe Thr 
1               5                   10                  15      


Glu Asp Pro Gly Pro Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro 
            20                  25                  30          


Pro Val Ala Pro Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro 
        35                  40                  45              


Ala Pro Ser Trp Pro Leu Ser Ser Ser Val Pro Ser Gln Lys Thr Tyr 
    50                  55                  60                  


Gln Gly Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser Gly Thr Ala 
65                  70                  75                  80  


Lys Ser Val Thr Cys Thr Tyr Ser Pro Ala Leu Asn Lys Met Phe Cys 
                85                  90                  95      


Gln Leu Ala Lys Thr Cys Pro Val Gln Leu Trp Val Asp Ser Thr Pro 
            100                 105                 110         


Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gln Ser Gln 
        115                 120                 125             


His Met Thr Glu Val Val Arg Arg Cys Pro His His Glu Arg Cys Ser 
    130                 135                 140                 


Asp Ser Asp Gly Leu Ala Pro Pro Gln His Leu Ile Arg Val Glu Gly 
145                 150                 155                 160 


Asn Leu Arg Val Glu Tyr Leu Asp Asp Arg Asn Thr Phe Arg His Ser 
                165                 170                 175     


Val Val Val Pro Tyr Glu Pro Pro Glu Val Gly Ser Asp Cys Thr Thr 
            180                 185                 190         


Ile His Tyr Asn Tyr Met Cys Asn Ser Ser Cys Met Gly Gly Met Asn 
        195                 200                 205             


Arg Arg Pro Ile Leu Thr Ile Ile Thr Leu Glu Asp Ser Ser Gly Asn 
    210                 215                 220                 


Leu Leu Gly Arg Asn Ser Phe Glu Val Arg Val Cys Ala Cys Pro Gly 
225                 230                 235                 240 


Arg Asp Arg Arg Thr Glu Glu Glu Asn Leu Arg Lys Lys Gly Glu Pro 
                245                 250                 255     


His His Glu Leu Pro Pro Gly Ser Thr Lys Arg Ala Leu Pro Asn Asn 
            260                 265                 270         


Thr Ser Ser Ser Pro Gln Pro Lys Lys Lys Pro Leu Asp Gly Glu Tyr 
        275                 280                 285             


Phe Thr Leu Gln Met Leu Leu Asp Leu Arg Trp Cys Tyr Phe Leu Ile 
    290                 295                 300                 


Asn Ser Ser 
305         


<210>  202
<211>  2331
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 7, 
       mRNA

NCBI Reference Sequence: NM_001126117.1

<400>  202
tgaggccagg agatggaggc tgcagtgagc tgtgatcaca ccactgtgct ccagcctgag       60

tgacagagca agaccctatc tcaaaaaaaa aaaaaaaaaa gaaaagctcc tgaggtgtag      120

acgccaactc tctctagctc gctagtgggt tgcaggaggt gcttacgcat gtttgtttct      180

ttgctgccgt cttccagttg ctttatctgt tcacttgtgc cctgactttc aactctgtct      240

ccttcctctt cctacagtac tcccctgccc tcaacaagat gttttgccaa ctggccaaga      300

cctgccctgt gcagctgtgg gttgattcca cacccccgcc cggcacccgc gtccgcgcca      360

tggccatcta caagcagtca cagcacatga cggaggttgt gaggcgctgc ccccaccatg      420

agcgctgctc agatagcgat ggtctggccc ctcctcagca tcttatccga gtggaaggaa      480

atttgcgtgt ggagtatttg gatgacagaa acacttttcg acatagtgtg gtggtgccct      540

atgagccgcc tgaggttggc tctgactgta ccaccatcca ctacaactac atgtgtaaca      600

gttcctgcat gggcggcatg aaccggaggc ccatcctcac catcatcaca ctggaagact      660

ccagtggtaa tctactggga cggaacagct ttgaggtgcg tgtttgtgcc tgtcctggga      720

gagaccggcg cacagaggaa gagaatctcc gcaagaaagg ggagcctcac cacgagctgc      780

ccccagggag cactaagcga gcactgccca acaacaccag ctcctctccc cagccaaaga      840

agaaaccact ggatggagaa tatttcaccc ttcagatgct acttgactta cgatggtgtt      900

acttcctgat aaactcgtcg taagttgaaa atattatccg tgggcgtgag cgcttcgaga      960

tgttccgaga gctgaatgag gccttggaac tcaaggatgc ccaggctggg aaggagccag     1020

gggggagcag ggctcactcc agccacctga agtccaaaaa gggtcagtct acctcccgcc     1080

ataaaaaact catgttcaag acagaagggc ctgactcaga ctgacattct ccacttcttg     1140

ttccccactg acagcctccc acccccatct ctccctcccc tgccattttg ggttttgggt     1200

ctttgaaccc ttgcttgcaa taggtgtgcg tcagaagcac ccaggacttc catttgcttt     1260

gtcccggggc tccactgaac aagttggcct gcactggtgt tttgttgtgg ggaggaggat     1320

ggggagtagg acataccagc ttagatttta aggtttttac tgtgagggat gtttgggaga     1380

tgtaagaaat gttcttgcag ttaagggtta gtttacaatc agccacattc taggtagggg     1440

cccacttcac cgtactaacc agggaagctg tccctcactg ttgaattttc tctaacttca     1500

aggcccatat ctgtgaaatg ctggcatttg cacctacctc acagagtgca ttgtgagggt     1560

taatgaaata atgtacatct ggccttgaaa ccacctttta ttacatgggg tctagaactt     1620

gacccccttg agggtgcttg ttccctctcc ctgttggtcg gtgggttggt agtttctaca     1680

gttgggcagc tggttaggta gagggagttg tcaagtctct gctggcccag ccaaaccctg     1740

tctgacaacc tcttggtgaa ccttagtacc taaaaggaaa tctcacccca tcccacaccc     1800

tggaggattt catctcttgt atatgatgat ctggatccac caagacttgt tttatgctca     1860

gggtcaattt cttttttctt tttttttttt ttttttcttt ttctttgaga ctgggtctcg     1920

ctttgttgcc caggctggag tggagtggcg tgatcttggc ttactgcagc ctttgcctcc     1980

ccggctcgag cagtcctgcc tcagcctccg gagtagctgg gaccacaggt tcatgccacc     2040

atggccagcc aacttttgca tgttttgtag agatggggtc tcacagtgtt gcccaggctg     2100

gtctcaaact cctgggctca ggcgatccac ctgtctcagc ctcccagagt gctgggatta     2160

caattgtgag ccaccacgtc cagctggaag ggtcaacatc ttttacattc tgcaagcaca     2220

tctgcatttt caccccaccc ttcccctcct tctccctttt tatatcccat ttttatatcg     2280

atctcttatt ttacaataaa actttgctgc cacctgtgtg tctgaggggt g              2331


<210>  203
<211>  214
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 7, 
       polypeptide

NCBI Reference Sequence: NM_001126117.1

<400>  203

Met Phe Cys Gln Leu Ala Lys Thr Cys Pro Val Gln Leu Trp Val Asp 
1               5                   10                  15      


Ser Thr Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys 
            20                  25                  30          


Gln Ser Gln His Met Thr Glu Val Val Arg Arg Cys Pro His His Glu 
        35                  40                  45              


Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gln His Leu Ile Arg 
    50                  55                  60                  


Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Asp Arg Asn Thr Phe 
65                  70                  75                  80  


Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val Gly Ser Asp 
                85                  90                  95      


Cys Thr Thr Ile His Tyr Asn Tyr Met Cys Asn Ser Ser Cys Met Gly 
            100                 105                 110         


Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile Thr Leu Glu Asp Ser 
        115                 120                 125             


Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe Glu Val Arg Val Cys Ala 
    130                 135                 140                 


Cys Pro Gly Arg Asp Arg Arg Thr Glu Glu Glu Asn Leu Arg Lys Lys 
145                 150                 155                 160 


Gly Glu Pro His His Glu Leu Pro Pro Gly Ser Thr Lys Arg Ala Leu 
                165                 170                 175     


Pro Asn Asn Thr Ser Ser Ser Pro Gln Pro Lys Lys Lys Pro Leu Asp 
            180                 185                 190         


Gly Glu Tyr Phe Thr Leu Gln Met Leu Leu Asp Leu Arg Trp Cys Tyr 
        195                 200                 205             


Phe Leu Ile Asn Ser Ser 
    210                 


<210>  204
<211>  2404
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 6, 
       mRNA

NCBI Reference Sequence: NM_001126116.1

<400>  204
tgaggccagg agatggaggc tgcagtgagc tgtgatcaca ccactgtgct ccagcctgag       60

tgacagagca agaccctatc tcaaaaaaaa aaaaaaaaaa gaaaagctcc tgaggtgtag      120

acgccaactc tctctagctc gctagtgggt tgcaggaggt gcttacgcat gtttgtttct      180

ttgctgccgt cttccagttg ctttatctgt tcacttgtgc cctgactttc aactctgtct      240

ccttcctctt cctacagtac tcccctgccc tcaacaagat gttttgccaa ctggccaaga      300

cctgccctgt gcagctgtgg gttgattcca cacccccgcc cggcacccgc gtccgcgcca      360

tggccatcta caagcagtca cagcacatga cggaggttgt gaggcgctgc ccccaccatg      420

agcgctgctc agatagcgat ggtctggccc ctcctcagca tcttatccga gtggaaggaa      480

atttgcgtgt ggagtatttg gatgacagaa acacttttcg acatagtgtg gtggtgccct      540

atgagccgcc tgaggttggc tctgactgta ccaccatcca ctacaactac atgtgtaaca      600

gttcctgcat gggcggcatg aaccggaggc ccatcctcac catcatcaca ctggaagact      660

ccagtggtaa tctactggga cggaacagct ttgaggtgcg tgtttgtgcc tgtcctggga      720

gagaccggcg cacagaggaa gagaatctcc gcaagaaagg ggagcctcac cacgagctgc      780

ccccagggag cactaagcga gcactgccca acaacaccag ctcctctccc cagccaaaga      840

agaaaccact ggatggagaa tatttcaccc ttcaggacca gaccagcttt caaaaagaaa      900

attgttaaag agagcatgaa aatggttcta tgactttgcc tgatacagat gctacttgac      960

ttacgatggt gttacttcct gataaactcg tcgtaagttg aaaatattat ccgtgggcgt     1020

gagcgcttcg agatgttccg agagctgaat gaggccttgg aactcaagga tgcccaggct     1080

gggaaggagc caggggggag cagggctcac tccagccacc tgaagtccaa aaagggtcag     1140

tctacctccc gccataaaaa actcatgttc aagacagaag ggcctgactc agactgacat     1200

tctccacttc ttgttcccca ctgacagcct cccaccccca tctctccctc ccctgccatt     1260

ttgggttttg ggtctttgaa cccttgcttg caataggtgt gcgtcagaag cacccaggac     1320

ttccatttgc tttgtcccgg ggctccactg aacaagttgg cctgcactgg tgttttgttg     1380

tggggaggag gatggggagt aggacatacc agcttagatt ttaaggtttt tactgtgagg     1440

gatgtttggg agatgtaaga aatgttcttg cagttaaggg ttagtttaca atcagccaca     1500

ttctaggtag gggcccactt caccgtacta accagggaag ctgtccctca ctgttgaatt     1560

ttctctaact tcaaggccca tatctgtgaa atgctggcat ttgcacctac ctcacagagt     1620

gcattgtgag ggttaatgaa ataatgtaca tctggccttg aaaccacctt ttattacatg     1680

gggtctagaa cttgaccccc ttgagggtgc ttgttccctc tccctgttgg tcggtgggtt     1740

ggtagtttct acagttgggc agctggttag gtagagggag ttgtcaagtc tctgctggcc     1800

cagccaaacc ctgtctgaca acctcttggt gaaccttagt acctaaaagg aaatctcacc     1860

ccatcccaca ccctggagga tttcatctct tgtatatgat gatctggatc caccaagact     1920

tgttttatgc tcagggtcaa tttctttttt cttttttttt tttttttttc tttttctttg     1980

agactgggtc tcgctttgtt gcccaggctg gagtggagtg gcgtgatctt ggcttactgc     2040

agcctttgcc tccccggctc gagcagtcct gcctcagcct ccggagtagc tgggaccaca     2100

ggttcatgcc accatggcca gccaactttt gcatgttttg tagagatggg gtctcacagt     2160

gttgcccagg ctggtctcaa actcctgggc tcaggcgatc cacctgtctc agcctcccag     2220

agtgctggga ttacaattgt gagccaccac gtccagctgg aagggtcaac atcttttaca     2280

ttctgcaagc acatctgcat tttcacccca cccttcccct ccttctccct ttttatatcc     2340

catttttata tcgatctctt attttacaat aaaactttgc tgccacctgt gtgtctgagg     2400

ggtg                                                                  2404


<210>  205
<211>  209
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens tumor protein p53 (TP53), transcript variant 6, 
       polypeptide

NCBI Reference Sequence: NM_001126116.1

<400>  205

Met Phe Cys Gln Leu Ala Lys Thr Cys Pro Val Gln Leu Trp Val Asp 
1               5                   10                  15      


Ser Thr Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys 
            20                  25                  30          


Gln Ser Gln His Met Thr Glu Val Val Arg Arg Cys Pro His His Glu 
        35                  40                  45              


Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gln His Leu Ile Arg 
    50                  55                  60                  


Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Asp Arg Asn Thr Phe 
65                  70                  75                  80  


Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val Gly Ser Asp 
                85                  90                  95      


Cys Thr Thr Ile His Tyr Asn Tyr Met Cys Asn Ser Ser Cys Met Gly 
            100                 105                 110         


Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile Thr Leu Glu Asp Ser 
        115                 120                 125             


Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe Glu Val Arg Val Cys Ala 
    130                 135                 140                 


Cys Pro Gly Arg Asp Arg Arg Thr Glu Glu Glu Asn Leu Arg Lys Lys 
145                 150                 155                 160 


Gly Glu Pro His His Glu Leu Pro Pro Gly Ser Thr Lys Arg Ala Leu 
                165                 170                 175     


Pro Asn Asn Thr Ser Ser Ser Pro Gln Pro Lys Lys Lys Pro Leu Asp 
            180                 185                 190         


Gly Glu Tyr Phe Thr Leu Gln Asp Gln Thr Ser Phe Gln Lys Glu Asn 
        195                 200                 205             


Cys 
    


<210>  206
<211>  10443
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human Smad2 mRNA transcript variant 1 GenBank Accession No.: 
       NM_005901.5  GI:385862243

<400>  206
ccctctcctc ccctcccctc ccctctcttc ccctaccctc ccgcgcgccc gggccgccgg       60

ccgggcccgg gcctgggggc ggggcgggaa gacggcggcc gggagtgttt tcagttccgc      120

ctccaatcgc ccattcccct cttcccctcc cagccccctc catcccatcg gaagaggaag      180

gaacaaaagg tcccggaccc cccggatctg acggggcggg acctggcgcc accttgcagg      240

ttcgatacaa gaggctgttt tcctagcgtg gcttgctgcc tttggtaaga acatgtcgtc      300

catcttgcca ttcacgccgc cagttgtgaa gagactgctg ggatggaaga agtcagctgg      360

tgggtctgga ggagcaggcg gaggagagca gaatgggcag gaagaaaagt ggtgtgagaa      420

agcagtgaaa agtctggtga agaagctaaa gaaaacagga cgattagatg agcttgagaa      480

agccatcacc actcaaaact gtaatactaa atgtgttacc ataccaagca cttgctctga      540

aatttgggga ctgagtacac caaatacgat agatcagtgg gatacaacag gcctttacag      600

cttctctgaa caaaccaggt ctcttgatgg tcgtctccag gtatcccatc gaaaaggatt      660

gccacatgtt atatattgcc gattatggcg ctggcctgat cttcacagtc atcatgaact      720

caaggcaatt gaaaactgcg aatatgcttt taatcttaaa aaggatgaag tatgtgtaaa      780

cccttaccac tatcagagag ttgagacacc agttttgcct ccagtattag tgccccgaca      840

caccgagatc ctaacagaac ttccgcctct ggatgactat actcactcca ttccagaaaa      900

cactaacttc ccagcaggaa ttgagccaca gagtaattat attccagaaa cgccacctcc      960

tggatatatc agtgaagatg gagaaacaag tgaccaacag ttgaatcaaa gtatggacac     1020

aggctctcca gcagaactat ctcctactac tctttcccct gttaatcata gcttggattt     1080

acagccagtt acttactcag aacctgcatt ttggtgttcg atagcatatt atgaattaaa     1140

tcagagggtt ggagaaacct tccatgcatc acagccctca ctcactgtag atggctttac     1200

agacccatca aattcagaga ggttctgctt aggtttactc tccaatgtta accgaaatgc     1260

cacggtagaa atgacaagaa ggcatatagg aagaggagtg cgcttatact acataggtgg     1320

ggaagttttt gctgagtgcc taagtgatag tgcaatcttt gtgcagagcc ccaattgtaa     1380

tcagagatat ggctggcacc ctgcaacagt gtgtaaaatt ccaccaggct gtaatctgaa     1440

gatcttcaac aaccaggaat ttgctgctct tctggctcag tctgttaatc agggttttga     1500

agccgtctat cagctaacta gaatgtgcac cataagaatg agttttgtga aagggtgggg     1560

agcagaatac cgaaggcaga cggtaacaag tactccttgc tggattgaac ttcatctgaa     1620

tggacctcta cagtggttgg acaaagtatt aactcagatg ggatcccctt cagtgcgttg     1680

ctcaagcatg tcataaagct tcaccaatca agtcccatga aaagacttaa tgtaacaact     1740

cttctgtcat agcattgtgt gtggtcccta tggactgttt actatccaaa agttcaagag     1800

agaaaacagc acttgaggtc tcatcaatta aagcaccttg tggaatctgt ttcctatatt     1860

tgaatattag atgggaaaat tagtgtctag aaatactctc ccattaaaga ggaagagaag     1920

attttaaaga cttaatgatg tcttattggg cataaaactg agtgtcccaa aggtttatta     1980

ataacagtag tagttatgtg tacaggtaat gtatcatgat ccagtatcac agtattgtgc     2040

tgtttatata catttttagt ttgcatagat gaggtgtgtg tgtgcgctgc ttcttgatct     2100

aggcaaacct ttataaagtt gcagtaccta atctgttatt cccacttctc tgttattttt     2160

gtgtgtcttt tttaatatat aatatatatc aagattttca aattatttag aagcagattt     2220

tcctgtagaa aaactaattt ttctgccttt taccaaaaat aaactcttgg gggaagaaaa     2280

gtggattaac ttttgaaatc cttgacctta atgtgttcag tggggcttaa acagtcattc     2340

tttttgtggt tttttgtttt tttttgtttt tttttttaac tgctaaatct tattataagg     2400

aaaccatact gaaaaccttt ccaagcctct tttttccatt cccatttttg tcctcataat     2460

caaaacagca taacatgaca tcatcaccag taatagttgc attgatactg ctggcaccag     2520

ttaattctgg gatacagtaa gaattcatat ggagaaagtc cctttgtctt atgcccaaat     2580

ttcaacagga ataattggct tgtataatct agcagtctgt tgatttatcc ttccacctca     2640

taaaaaatgc ataggtggca gtataattat tttcagggat atgctagaat tacttccaca     2700

tatttatccc tttttaaaaa agctaatcta taaataccgt ttttccaaag gtattttaca     2760

atatttcaac agcagacctt ctgctcttcg agtagtttga tttggtttag taaccagatt     2820

gcattatgaa atgggccttt tgtaaatgta attgtttctg caaaatacct agaaaagtga     2880

tgctgaggta ggatcagcag atatgggcca tctgttttta aagtatgttg tattcagttt     2940

ataaattgat tgttattcta cacataatta tgaattcaga attttaaaaa ttgggggaaa     3000

agccatttat ttagcaagtt ttttagctta taagttacct gcagtctgag ctgttcttaa     3060

ctgatcctgg ttttgtgatt gacaatattt catgctctgt agtgagagga gatttccgaa     3120

actctgttgc tagttcattc tgcagcaaat aattattatg tctgatgttg actcattgca     3180

gtttaaacat ttcttcttgt ttgcatctta gtagaaatgg aaaataacca ctcctggtcg     3240

tcttttcata aattttcata tttttgaagc tgtctttggt acttgttctt tgaaatcata     3300

tccacctgtc tctataggta tcattttcaa tactttcaac atttggtggt tttctattgg     3360

gtactcccca ttttcctata tttgtgtgta tatgtatgtg ttcatgtaaa tttggtatag     3420

taatttttta ttcattcaac aaatatttat tgttcacctg tttgtaccag gaacttttct     3480

tagtctttgg gtaaaggtga acaagacaac tacagttcct gcctttgctg agacagcagt     3540

tacactaacc cttaattatc ttacttgtct atgaaggaga taaacagggt actgtactgg     3600

agaataacag atgggatgct tcaggtagga catcaaggaa agcctctaag gaaaggatgc     3660

atgagctaac acctgacatt aaagaagcaa gccaagtgag gagccagggg agataagcat     3720

tcctggcaaa gagaatagca tcaaatgcaa aaaggttcac actaaaggaa actcctgatt     3780

aggtattaat gctttataca gaaacctcta tacaaatcca aacttgaaga tcagaatggt     3840

tctacagttc ataacatttt gaaggtggcc ttattttgtg atagtctgct tcatgtgatt     3900

ctcactaaca tatctccttc ctcaaccttt gctgtaaaaa tttcatttgc accacatcag     3960

tactacttaa tttaacaagc ttttgttgtg taagctctca ctgttttagt gccctgctgc     4020

ttgcttccag actttgtgct gtccagtaat tatgtcttcc actacccatc ttgtgagcag     4080

agtaaatgtc ctaggtaata ccactatcag gcctgtagga gatactcagt ggagcctctg     4140

cccttctttt tcttacttga gaacttgtaa tggtgttagg gaacagttgt aggggcagaa     4200

aacaactctg aaagtggtag aaggtcctga tcttggtggt tactcttgca ttactgtgtt     4260

aggtcaagca gtgcctacta tgctgtttca gtagtggagc gcatctctac agttctgatg     4320

cgatttttct gtacagtatg aaattgggac tcaactcttt gaaaacacct attgagcagt     4380

tatacctgtt gagcagttta cttcctggtt gtaattacat ttgtgtgaat gtgtttgatg     4440

ctttttaacg agatgatgtt ttttgtattt tatctactgt ggcctgattt tttttttgtt     4500

ttctgcccct ccccccattt ataggtgtgg ttttcatttt tctaagtgat agaatcccct     4560

ctttgttgaa tttttgtctt tatttaaatt agcaacatta cttaggattt attcttcaca     4620

atactgttaa ttttctagga atgatgacct gagaaccgaa tggccatgct ttctatcaca     4680

tttctaagat gagtaatatt ttttccagta ggttccacag agacaccttg ggggctggct     4740

taggggaggc tgttggagtt ctcactgact tagtggcata tttattctgt actgaagaac     4800

tgcatggggt ttcttttgga aagagtttca ttgctttaaa aagaagctca gaaagtcttt     4860

ataaccactg gtcaacgatt agaaaaatat aactggattt aggcctacct tctggaatac     4920

cgctgattgt gctcttttta tcctacttta aagaagcttt catgattaga tttgagctat     4980

atcagttata ccgattatac cttataatac acattcagtt agtaaacatt tattgatgcc     5040

tgttgtttgc ccagccactg tgatggatat tgaataataa aaagatgact aggacggggc     5100

cctgaccctt gagctgtgct tggtcttgta gaggttgtgt tttttttcct caggacctgt     5160

cactttggca gaaggaaatc tgcctaattt ttcttgaaag ctaaattttc tttgtaagtt     5220

tttacaaatt gtttaatacc tagttgtatt ttttacctta agccacattg agttttgctt     5280

gatttgtctg tcttttaaac actgtcaaat gctttccctt ttgttaaaat tattttaatt     5340

tcactttttt tgtgcccttg tcaatttaag actaagactt tgaaggtaaa acaaacaaac     5400

aaacatcagt cttagtctct tgctagttga aatcaaataa aagaaaatat atacccagtt     5460

ggtttctcta cctcttaaaa gcttcccata tataccttta agatccttct cttttttctt     5520

taactactaa ataggttcag catttattca gtgttagata ccctcttcgt ctgagggtgg     5580

cgtaggttta tgttgggata taaagtaaca caagacaatc ttcactgtac ataaaatatg     5640

tcttcatgta cagtctttac tttaaaagct gaacattcca atttgcgcct tccctcccaa     5700

gcccctgccc accaagtatc tctttagata tctagtctgt ggacatgaac aatgaatact     5760

tttttcttac tctgatcgaa ggcattgata cttagacata tcaaacattt cttcctttca     5820

tatgctttac tttgctaaat ctattatatt cattgcctga attttattct tcctttctac     5880

ctgacaacac acatccaggt ggtacttgct ggttatcctc tttcttgtta gccttgtttt     5940

ttgttttttt tttttttttt tgagagggag tctcgctctg ttgcccaacc tggagtgcag     6000

tggtgcgatc ttggttcact gcaagctccg cctcccgggt tcacgccatg cttctgcctc     6060

agcctcccaa gtagctggga ctacaggcgc ccaccaccac actcggctaa ttttttgtat     6120

ttttagtaga gacggggttt caccgtgttg gccaggatgg tctcgatctc ctgacctcgt     6180

gatctgtcca cctcggcttc ccaaagtgct gggattacag gcatgagcca ccgcgcccag     6240

cctagccata tttttatctg catatatcag aatgtttctc tcctttgaac ttattaacaa     6300

aaaaggaaca tgcttttcat acctagagtc ctaatttctt catcatgaag gttgctattc     6360

aaattgatca atcattttaa ttttacaaat ggctcaaaaa ttctgttcag taaatgtctt     6420

tgtgactggc aaatggcata aattatgttt aagattatga acttttctga cagttgcagc     6480

caatgttttc cctacgatac cagatttcca tcttggggca tattggattg ttgtatttaa     6540

gacagtcaga ataatgatag tgtgtggtct ccagaggtag tcagaatcct gctattgagt     6600

tctttttata tcttcctttt caatttttta ttaccatttt gtttgtttag actacacttt     6660

gtagggattg aggggcaaat tatctcttgg agtggaattc ctgtgttttg agccttacaa     6720

ccaggaaata tgagctatac tagatagcct catgatagca tttacgataa gaacttatct     6780

cgtgtgttca tgtaattttt tgagtaggaa ctgttttatc ttgaatattg tagctaacta     6840

tatatagcag aactgcctca gtctttttaa gaaggaaata aataatatat gtgtatgaat     6900

ttatatatac atatacactc atagacaaac ttaacagttg gggtcattct aacagttaaa     6960

acaattgttc cattgtttaa atctcagatc ctggtaaaat gttcttaatt tgtctgtgta     7020

cattttcctt tcatggacag accattggag tacattaatt ttcttaatct gccatttggc     7080

agttcattta atataccatt ttttggcaac ttggtaacta agaatcacag ccaaaatttg     7140

ttaacatcaa agaaagctct gccatatacc ccgttactaa attattatac atccagcaga     7200

ttctgggatg tactaactta gggttaactt tgttgttgtt gataatacta gattgctccc     7260

tctttaattc ttcttctggt gcaaggttgc tgcttaagtt accctgggaa atactactac     7320

aaggtcaaat tttctagtat cttacagcct gattgaaggt gattcagatc tttgctcaat     7380

ataaatggat tttccaagat tctctgggcc atccttgacc cacaggtgat ctcgctggag     7440

tatattaact taacttcagt gccagttggt ttggtgccat gagatccata atgaatccag     7500

aacttcacca ttgcttagat ataagagtcc cttggaagaa taatgccact gatgatgggg     7560

gtcagaaggt gtattaactc aacatagagg gcttttagat ttttcttcaa aaaaatttcg     7620

agaaaagtat tcttttaccc tccaaacagt taacagctct tagtttctcc aaatatgctc     7680

tttgatttac ttatttttaa ttaaagatgg taatttattg aacaatgaaa tccgtaatat     7740

attgatttaa ggacaaaagt gaagttttag aattataaaa gtacttaaat attatatatt     7800

ttccatttca taattgtttt cctttctctg tggctttaaa gtttttgact attttacaat     7860

gttaatcact aggtaacttg ccatatttct ggttctatat taagttctat cctttataat     7920

gctgttatta taaagctggt ttttagcatt tgtctgtagc aatagaaatt ttactaagtc     7980

tctgttctcc cagtaagttt tttcttttct cagtaagtcc ctaagaaaac atttgtttgc     8040

cactcttact attcccaatc ttggattgtt cgagctgaaa aaaaatttga tgagaaacag     8100

gaggatcctt ttctggtgaa tataggttcc tgctttaaga atgtggaaat ccattgcttt     8160

atataactaa tatacacaca gattaattaa aattgtgaga aataattcac acatgacaag     8220

taggtaacat gcatgagttt tgaatttttt taaaaaccca actgtttgac aaaatataga     8280

acccaaattg gtactttctt agaccagtgt aacctcacac ctcagttttg cttttccaac     8340

cctgacttga aaggcatatt tgtatctttt tattagtgat agtgaagctg tgacactaac     8400

cttttataca aaagagtaaa gaaagaaaaa ctacagcgat taagatgaga acagttctgc     8460

agttgttgaa ctagatcaca gcattgtagg cagaataaaa aatgttcata tctgagaata     8520

ttcctttcgc catcttttcc caaggccaga cctcctggtg gagcacagtt aaaagtaaca     8580

ttctgggcct ttgtaatcgg agggctgtgt ctccagctgg cagcctttgt tttaatatat     8640

aatgcaggac tgtggaaaac agttggcata gaatattttc acctaaaaaa gaaagaaaag     8700

acatacaaaa ctggattaat tgcaaaaaga gaatacagta aaataccata taactggaca     8760

aagctagaag aacctttaga agatttgtct gaaaacagat ttcaagagtg agcttttata     8820

cactgctcac taatttgctt gattactacc aactcttctt aaagttaaca cgtttaaggt     8880

atttctggac ttcctagcct tttagcaagc ttagaggaac tagccattag ctagtgatgt     8940

aaaaatattt tggggactga tgcccttaaa ggttatgccc ttgaaagttc ttaccttttc     9000

tctagtgata ttaaggaacg agtgggtagt gttctcaggg tgaccagctg ccctaaagtg     9060

cctgggattg agggtttccc tggatgcggg actttccctg gatacaaaac ttttagcaga     9120

gttttgtata tatgtggatt tttctgataa gtagcacatc agaggcctta accactgccc     9180

aaaagcgatt ctccattgag agtacatatc ttgaacttaa gaaattcatt tgctctgatt     9240

tttaatcttg taaagttttt gctaaactca aaacaagtcc caggcacacc agaaggagct     9300

gaccacctta ggtgttcttg tgatttatcc ttacttccct atgttgtcat agttgcttct     9360

aaactcagct gcactatggc tgtcaacatt tctgatactt attgggatat gtgccatcca     9420

gtcatttagt actttgaatg gaacatgaga tttataacac aggtaatagc tgaaggtacc     9480

agtatggtgg tgagactcac acttagtgat ccagctaagg taactgatgt tataatggaa     9540

cagagaagag gccaactaga tagctaagtt cttctgaacc tatgtgtata tgtaagtaca     9600

aatcatgcgt ccttatgggg ttaaacttaa tctgaaattt acatttttca tagtaaaagg     9660

aaaccaattg ttgcagattt cttttcttgt gaggaaatac atggcctttg atgctctggc     9720

gtctactgca tttcccagtc tgttctgctc gagaagccag aatgtgttgt taacattttt     9780

ccgtgaatgt tgtgttaaaa tgattaaatg catcagccaa tggcaagtga aggaattggg     9840

tgtcctgatg cagactgagc agtttctctc aattgtagcc tcatactcat aaggtgctta     9900

ccagctagaa cattgagcac gtgaggtgag attttttttc tctgatggca ttaactttgt     9960

aatgcaatat gatggatgca gaccctgttc ttgtttccct ctggaagtcc ttagtggctg    10020

catccttggt gcactgtgat ggagatatta aatgtgttct ttgtgagctt tcgttctatg    10080

attgtcaaaa gtacgatgtg gttccttttt tatttttatt aaacaatgag ctgaggcttt    10140

attacagctg gttttcaagt taaaattgtt gaatactgat gtctttctcc cacctacacc    10200

aaatatttta gtctatttaa agtacaaaaa aagttctgct taagaaaaca ttgcttacat    10260

gtcctgtgat ttctggtcaa tttttatata tatttgtgtg catcatctgt atgtgctttc    10320

actttttacc ttgtttgctc ttacctgtgt taacagccct gtcaccgttg aaaggtggac    10380

agttttccta gcattaaaag aaagccattt gagttgttta ccatgttaaa aaaaaaaaaa    10440

aaa                                                                  10443


<210>  207
<211>  467
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human Smad2 Isoform 1 polypeptide encoded by mRNA transcript 
       variant 1 GenBank Accession No.: NP_005892.1  GI:5174511

<400>  207

Met Ser Ser Ile Leu Pro Phe Thr Pro Pro Val Val Lys Arg Leu Leu 
1               5                   10                  15      


Gly Trp Lys Lys Ser Ala Gly Gly Ser Gly Gly Ala Gly Gly Gly Glu 
            20                  25                  30          


Gln Asn Gly Gln Glu Glu Lys Trp Cys Glu Lys Ala Val Lys Ser Leu 
        35                  40                  45              


Val Lys Lys Leu Lys Lys Thr Gly Arg Leu Asp Glu Leu Glu Lys Ala 
    50                  55                  60                  


Ile Thr Thr Gln Asn Cys Asn Thr Lys Cys Val Thr Ile Pro Ser Thr 
65                  70                  75                  80  


Cys Ser Glu Ile Trp Gly Leu Ser Thr Pro Asn Thr Ile Asp Gln Trp 
                85                  90                  95      


Asp Thr Thr Gly Leu Tyr Ser Phe Ser Glu Gln Thr Arg Ser Leu Asp 
            100                 105                 110         


Gly Arg Leu Gln Val Ser His Arg Lys Gly Leu Pro His Val Ile Tyr 
        115                 120                 125             


Cys Arg Leu Trp Arg Trp Pro Asp Leu His Ser His His Glu Leu Lys 
    130                 135                 140                 


Ala Ile Glu Asn Cys Glu Tyr Ala Phe Asn Leu Lys Lys Asp Glu Val 
145                 150                 155                 160 


Cys Val Asn Pro Tyr His Tyr Gln Arg Val Glu Thr Pro Val Leu Pro 
                165                 170                 175     


Pro Val Leu Val Pro Arg His Thr Glu Ile Leu Thr Glu Leu Pro Pro 
            180                 185                 190         


Leu Asp Asp Tyr Thr His Ser Ile Pro Glu Asn Thr Asn Phe Pro Ala 
        195                 200                 205             


Gly Ile Glu Pro Gln Ser Asn Tyr Ile Pro Glu Thr Pro Pro Pro Gly 
    210                 215                 220                 


Tyr Ile Ser Glu Asp Gly Glu Thr Ser Asp Gln Gln Leu Asn Gln Ser 
225                 230                 235                 240 


Met Asp Thr Gly Ser Pro Ala Glu Leu Ser Pro Thr Thr Leu Ser Pro 
                245                 250                 255     


Val Asn His Ser Leu Asp Leu Gln Pro Val Thr Tyr Ser Glu Pro Ala 
            260                 265                 270         


Phe Trp Cys Ser Ile Ala Tyr Tyr Glu Leu Asn Gln Arg Val Gly Glu 
        275                 280                 285             


Thr Phe His Ala Ser Gln Pro Ser Leu Thr Val Asp Gly Phe Thr Asp 
    290                 295                 300                 


Pro Ser Asn Ser Glu Arg Phe Cys Leu Gly Leu Leu Ser Asn Val Asn 
305                 310                 315                 320 


Arg Asn Ala Thr Val Glu Met Thr Arg Arg His Ile Gly Arg Gly Val 
                325                 330                 335     


Arg Leu Tyr Tyr Ile Gly Gly Glu Val Phe Ala Glu Cys Leu Ser Asp 
            340                 345                 350         


Ser Ala Ile Phe Val Gln Ser Pro Asn Cys Asn Gln Arg Tyr Gly Trp 
        355                 360                 365             


His Pro Ala Thr Val Cys Lys Ile Pro Pro Gly Cys Asn Leu Lys Ile 
    370                 375                 380                 


Phe Asn Asn Gln Glu Phe Ala Ala Leu Leu Ala Gln Ser Val Asn Gln 
385                 390                 395                 400 


Gly Phe Glu Ala Val Tyr Gln Leu Thr Arg Met Cys Thr Ile Arg Met 
                405                 410                 415     


Ser Phe Val Lys Gly Trp Gly Ala Glu Tyr Arg Arg Gln Thr Val Thr 
            420                 425                 430         


Ser Thr Pro Cys Trp Ile Glu Leu His Leu Asn Gly Pro Leu Gln Trp 
        435                 440                 445             


Leu Asp Lys Val Leu Thr Gln Met Gly Ser Pro Ser Val Arg Cys Ser 
    450                 455                 460                 


Ser Met Ser 
465         


<210>  208
<211>  10551
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens SMAD family member 2 (SMAD2), transcript variant 2, 
       mRNA

NCBI Reference Sequence: NM_001003652.3

<400>  208
cggccgggag gcggggcggg ccgtaggcaa agggaggtgg ggaggcggtg gccggcgact       60

ccccgcgccc cgctcgcccc ccggcccttc ccgcggtgct cggcctcgtt cctttcctcc      120

tccgctccct ccgtcttcca tacccgcccc gcgcggcttt cggccggcgt gcctcgcgcc      180

ctaacgggcg gctggaggcg ccaatcagcg ggcggcaggg tgccagcccc ggggctgcgc      240

cggcgaatcg gcggggcccg cggcccaggg tggcaggcgg gtctacccgc gcggccgcgg      300

cggcggagaa gcagctcgcc agccagcagc ccgccagccg ccgggaggtt cgatacaaga      360

ggctgttttc ctagcgtggc ttgctgcctt tggtaagaac atgtcgtcca tcttgccatt      420

cacgccgcca gttgtgaaga gactgctggg atggaagaag tcagctggtg ggtctggagg      480

agcaggcgga ggagagcaga atgggcagga agaaaagtgg tgtgagaaag cagtgaaaag      540

tctggtgaag aagctaaaga aaacaggacg attagatgag cttgagaaag ccatcaccac      600

tcaaaactgt aatactaaat gtgttaccat accaagcact tgctctgaaa tttggggact      660

gagtacacca aatacgatag atcagtggga tacaacaggc ctttacagct tctctgaaca      720

aaccaggtct cttgatggtc gtctccaggt atcccatcga aaaggattgc cacatgttat      780

atattgccga ttatggcgct ggcctgatct tcacagtcat catgaactca aggcaattga      840

aaactgcgaa tatgctttta atcttaaaaa ggatgaagta tgtgtaaacc cttaccacta      900

tcagagagtt gagacaccag ttttgcctcc agtattagtg ccccgacaca ccgagatcct      960

aacagaactt ccgcctctgg atgactatac tcactccatt ccagaaaaca ctaacttccc     1020

agcaggaatt gagccacaga gtaattatat tccagaaacg ccacctcctg gatatatcag     1080

tgaagatgga gaaacaagtg accaacagtt gaatcaaagt atggacacag gctctccagc     1140

agaactatct cctactactc tttcccctgt taatcatagc ttggatttac agccagttac     1200

ttactcagaa cctgcatttt ggtgttcgat agcatattat gaattaaatc agagggttgg     1260

agaaaccttc catgcatcac agccctcact cactgtagat ggctttacag acccatcaaa     1320

ttcagagagg ttctgcttag gtttactctc caatgttaac cgaaatgcca cggtagaaat     1380

gacaagaagg catataggaa gaggagtgcg cttatactac ataggtgggg aagtttttgc     1440

tgagtgccta agtgatagtg caatctttgt gcagagcccc aattgtaatc agagatatgg     1500

ctggcaccct gcaacagtgt gtaaaattcc accaggctgt aatctgaaga tcttcaacaa     1560

ccaggaattt gctgctcttc tggctcagtc tgttaatcag ggttttgaag ccgtctatca     1620

gctaactaga atgtgcacca taagaatgag ttttgtgaaa gggtggggag cagaataccg     1680

aaggcagacg gtaacaagta ctccttgctg gattgaactt catctgaatg gacctctaca     1740

gtggttggac aaagtattaa ctcagatggg atccccttca gtgcgttgct caagcatgtc     1800

ataaagcttc accaatcaag tcccatgaaa agacttaatg taacaactct tctgtcatag     1860

cattgtgtgt ggtccctatg gactgtttac tatccaaaag ttcaagagag aaaacagcac     1920

ttgaggtctc atcaattaaa gcaccttgtg gaatctgttt cctatatttg aatattagat     1980

gggaaaatta gtgtctagaa atactctccc attaaagagg aagagaagat tttaaagact     2040

taatgatgtc ttattgggca taaaactgag tgtcccaaag gtttattaat aacagtagta     2100

gttatgtgta caggtaatgt atcatgatcc agtatcacag tattgtgctg tttatataca     2160

tttttagttt gcatagatga ggtgtgtgtg tgcgctgctt cttgatctag gcaaaccttt     2220

ataaagttgc agtacctaat ctgttattcc cacttctctg ttatttttgt gtgtcttttt     2280

taatatataa tatatatcaa gattttcaaa ttatttagaa gcagattttc ctgtagaaaa     2340

actaattttt ctgcctttta ccaaaaataa actcttgggg gaagaaaagt ggattaactt     2400

ttgaaatcct tgaccttaat gtgttcagtg gggcttaaac agtcattctt tttgtggttt     2460

tttgtttttt tttgtttttt tttttaactg ctaaatctta ttataaggaa accatactga     2520

aaacctttcc aagcctcttt tttccattcc catttttgtc ctcataatca aaacagcata     2580

acatgacatc atcaccagta atagttgcat tgatactgct ggcaccagtt aattctggga     2640

tacagtaaga attcatatgg agaaagtccc tttgtcttat gcccaaattt caacaggaat     2700

aattggcttg tataatctag cagtctgttg atttatcctt ccacctcata aaaaatgcat     2760

aggtggcagt ataattattt tcagggatat gctagaatta cttccacata tttatccctt     2820

tttaaaaaag ctaatctata aataccgttt ttccaaaggt attttacaat atttcaacag     2880

cagaccttct gctcttcgag tagtttgatt tggtttagta accagattgc attatgaaat     2940

gggccttttg taaatgtaat tgtttctgca aaatacctag aaaagtgatg ctgaggtagg     3000

atcagcagat atgggccatc tgtttttaaa gtatgttgta ttcagtttat aaattgattg     3060

ttattctaca cataattatg aattcagaat tttaaaaatt gggggaaaag ccatttattt     3120

agcaagtttt ttagcttata agttacctgc agtctgagct gttcttaact gatcctggtt     3180

ttgtgattga caatatttca tgctctgtag tgagaggaga tttccgaaac tctgttgcta     3240

gttcattctg cagcaaataa ttattatgtc tgatgttgac tcattgcagt ttaaacattt     3300

cttcttgttt gcatcttagt agaaatggaa aataaccact cctggtcgtc ttttcataaa     3360

ttttcatatt tttgaagctg tctttggtac ttgttctttg aaatcatatc cacctgtctc     3420

tataggtatc attttcaata ctttcaacat ttggtggttt tctattgggt actccccatt     3480

ttcctatatt tgtgtgtata tgtatgtgtt catgtaaatt tggtatagta attttttatt     3540

cattcaacaa atatttattg ttcacctgtt tgtaccagga acttttctta gtctttgggt     3600

aaaggtgaac aagacaacta cagttcctgc ctttgctgag acagcagtta cactaaccct     3660

taattatctt acttgtctat gaaggagata aacagggtac tgtactggag aataacagat     3720

gggatgcttc aggtaggaca tcaaggaaag cctctaagga aaggatgcat gagctaacac     3780

ctgacattaa agaagcaagc caagtgagga gccaggggag ataagcattc ctggcaaaga     3840

gaatagcatc aaatgcaaaa aggttcacac taaaggaaac tcctgattag gtattaatgc     3900

tttatacaga aacctctata caaatccaaa cttgaagatc agaatggttc tacagttcat     3960

aacattttga aggtggcctt attttgtgat agtctgcttc atgtgattct cactaacata     4020

tctccttcct caacctttgc tgtaaaaatt tcatttgcac cacatcagta ctacttaatt     4080

taacaagctt ttgttgtgta agctctcact gttttagtgc cctgctgctt gcttccagac     4140

tttgtgctgt ccagtaatta tgtcttccac tacccatctt gtgagcagag taaatgtcct     4200

aggtaatacc actatcaggc ctgtaggaga tactcagtgg agcctctgcc cttctttttc     4260

ttacttgaga acttgtaatg gtgttaggga acagttgtag gggcagaaaa caactctgaa     4320

agtggtagaa ggtcctgatc ttggtggtta ctcttgcatt actgtgttag gtcaagcagt     4380

gcctactatg ctgtttcagt agtggagcgc atctctacag ttctgatgcg atttttctgt     4440

acagtatgaa attgggactc aactctttga aaacacctat tgagcagtta tacctgttga     4500

gcagtttact tcctggttgt aattacattt gtgtgaatgt gtttgatgct ttttaacgag     4560

atgatgtttt ttgtatttta tctactgtgg cctgattttt tttttgtttt ctgcccctcc     4620

ccccatttat aggtgtggtt ttcatttttc taagtgatag aatcccctct ttgttgaatt     4680

tttgtcttta tttaaattag caacattact taggatttat tcttcacaat actgttaatt     4740

ttctaggaat gatgacctga gaaccgaatg gccatgcttt ctatcacatt tctaagatga     4800

gtaatatttt ttccagtagg ttccacagag acaccttggg ggctggctta ggggaggctg     4860

ttggagttct cactgactta gtggcatatt tattctgtac tgaagaactg catggggttt     4920

cttttggaaa gagtttcatt gctttaaaaa gaagctcaga aagtctttat aaccactggt     4980

caacgattag aaaaatataa ctggatttag gcctaccttc tggaataccg ctgattgtgc     5040

tctttttatc ctactttaaa gaagctttca tgattagatt tgagctatat cagttatacc     5100

gattatacct tataatacac attcagttag taaacattta ttgatgcctg ttgtttgccc     5160

agccactgtg atggatattg aataataaaa agatgactag gacggggccc tgacccttga     5220

gctgtgcttg gtcttgtaga ggttgtgttt tttttcctca ggacctgtca ctttggcaga     5280

aggaaatctg cctaattttt cttgaaagct aaattttctt tgtaagtttt tacaaattgt     5340

ttaataccta gttgtatttt ttaccttaag ccacattgag ttttgcttga tttgtctgtc     5400

ttttaaacac tgtcaaatgc tttccctttt gttaaaatta ttttaatttc actttttttg     5460

tgcccttgtc aatttaagac taagactttg aaggtaaaac aaacaaacaa acatcagtct     5520

tagtctcttg ctagttgaaa tcaaataaaa gaaaatatat acccagttgg tttctctacc     5580

tcttaaaagc ttcccatata tacctttaag atccttctct tttttcttta actactaaat     5640

aggttcagca tttattcagt gttagatacc ctcttcgtct gagggtggcg taggtttatg     5700

ttgggatata aagtaacaca agacaatctt cactgtacat aaaatatgtc ttcatgtaca     5760

gtctttactt taaaagctga acattccaat ttgcgccttc cctcccaagc ccctgcccac     5820

caagtatctc tttagatatc tagtctgtgg acatgaacaa tgaatacttt tttcttactc     5880

tgatcgaagg cattgatact tagacatatc aaacatttct tcctttcata tgctttactt     5940

tgctaaatct attatattca ttgcctgaat tttattcttc ctttctacct gacaacacac     6000

atccaggtgg tacttgctgg ttatcctctt tcttgttagc cttgtttttt gttttttttt     6060

tttttttttg agagggagtc tcgctctgtt gcccaacctg gagtgcagtg gtgcgatctt     6120

ggttcactgc aagctccgcc tcccgggttc acgccatgct tctgcctcag cctcccaagt     6180

agctgggact acaggcgccc accaccacac tcggctaatt ttttgtattt ttagtagaga     6240

cggggtttca ccgtgttggc caggatggtc tcgatctcct gacctcgtga tctgtccacc     6300

tcggcttccc aaagtgctgg gattacaggc atgagccacc gcgcccagcc tagccatatt     6360

tttatctgca tatatcagaa tgtttctctc ctttgaactt attaacaaaa aaggaacatg     6420

cttttcatac ctagagtcct aatttcttca tcatgaaggt tgctattcaa attgatcaat     6480

cattttaatt ttacaaatgg ctcaaaaatt ctgttcagta aatgtctttg tgactggcaa     6540

atggcataaa ttatgtttaa gattatgaac ttttctgaca gttgcagcca atgttttccc     6600

tacgatacca gatttccatc ttggggcata ttggattgtt gtatttaaga cagtcagaat     6660

aatgatagtg tgtggtctcc agaggtagtc agaatcctgc tattgagttc tttttatatc     6720

ttccttttca attttttatt accattttgt ttgtttagac tacactttgt agggattgag     6780

gggcaaatta tctcttggag tggaattcct gtgttttgag ccttacaacc aggaaatatg     6840

agctatacta gatagcctca tgatagcatt tacgataaga acttatctcg tgtgttcatg     6900

taattttttg agtaggaact gttttatctt gaatattgta gctaactata tatagcagaa     6960

ctgcctcagt ctttttaaga aggaaataaa taatatatgt gtatgaattt atatatacat     7020

atacactcat agacaaactt aacagttggg gtcattctaa cagttaaaac aattgttcca     7080

ttgtttaaat ctcagatcct ggtaaaatgt tcttaatttg tctgtgtaca ttttcctttc     7140

atggacagac cattggagta cattaatttt cttaatctgc catttggcag ttcatttaat     7200

ataccatttt ttggcaactt ggtaactaag aatcacagcc aaaatttgtt aacatcaaag     7260

aaagctctgc catatacccc gttactaaat tattatacat ccagcagatt ctgggatgta     7320

ctaacttagg gttaactttg ttgttgttga taatactaga ttgctccctc tttaattctt     7380

cttctggtgc aaggttgctg cttaagttac cctgggaaat actactacaa ggtcaaattt     7440

tctagtatct tacagcctga ttgaaggtga ttcagatctt tgctcaatat aaatggattt     7500

tccaagattc tctgggccat ccttgaccca caggtgatct cgctggagta tattaactta     7560

acttcagtgc cagttggttt ggtgccatga gatccataat gaatccagaa cttcaccatt     7620

gcttagatat aagagtccct tggaagaata atgccactga tgatgggggt cagaaggtgt     7680

attaactcaa catagagggc ttttagattt ttcttcaaaa aaatttcgag aaaagtattc     7740

ttttaccctc caaacagtta acagctctta gtttctccaa atatgctctt tgatttactt     7800

atttttaatt aaagatggta atttattgaa caatgaaatc cgtaatatat tgatttaagg     7860

acaaaagtga agttttagaa ttataaaagt acttaaatat tatatatttt ccatttcata     7920

attgttttcc tttctctgtg gctttaaagt ttttgactat tttacaatgt taatcactag     7980

gtaacttgcc atatttctgg ttctatatta agttctatcc tttataatgc tgttattata     8040

aagctggttt ttagcatttg tctgtagcaa tagaaatttt actaagtctc tgttctccca     8100

gtaagttttt tcttttctca gtaagtccct aagaaaacat ttgtttgcca ctcttactat     8160

tcccaatctt ggattgttcg agctgaaaaa aaatttgatg agaaacagga ggatcctttt     8220

ctggtgaata taggttcctg ctttaagaat gtggaaatcc attgctttat ataactaata     8280

tacacacaga ttaattaaaa ttgtgagaaa taattcacac atgacaagta ggtaacatgc     8340

atgagttttg aattttttta aaaacccaac tgtttgacaa aatatagaac ccaaattggt     8400

actttcttag accagtgtaa cctcacacct cagttttgct tttccaaccc tgacttgaaa     8460

ggcatatttg tatcttttta ttagtgatag tgaagctgtg acactaacct tttatacaaa     8520

agagtaaaga aagaaaaact acagcgatta agatgagaac agttctgcag ttgttgaact     8580

agatcacagc attgtaggca gaataaaaaa tgttcatatc tgagaatatt cctttcgcca     8640

tcttttccca aggccagacc tcctggtgga gcacagttaa aagtaacatt ctgggccttt     8700

gtaatcggag ggctgtgtct ccagctggca gcctttgttt taatatataa tgcaggactg     8760

tggaaaacag ttggcataga atattttcac ctaaaaaaga aagaaaagac atacaaaact     8820

ggattaattg caaaaagaga atacagtaaa ataccatata actggacaaa gctagaagaa     8880

cctttagaag atttgtctga aaacagattt caagagtgag cttttataca ctgctcacta     8940

atttgcttga ttactaccaa ctcttcttaa agttaacacg tttaaggtat ttctggactt     9000

cctagccttt tagcaagctt agaggaacta gccattagct agtgatgtaa aaatattttg     9060

gggactgatg cccttaaagg ttatgccctt gaaagttctt accttttctc tagtgatatt     9120

aaggaacgag tgggtagtgt tctcagggtg accagctgcc ctaaagtgcc tgggattgag     9180

ggtttccctg gatgcgggac tttccctgga tacaaaactt ttagcagagt tttgtatata     9240

tgtggatttt tctgataagt agcacatcag aggccttaac cactgcccaa aagcgattct     9300

ccattgagag tacatatctt gaacttaaga aattcatttg ctctgatttt taatcttgta     9360

aagtttttgc taaactcaaa acaagtccca ggcacaccag aaggagctga ccaccttagg     9420

tgttcttgtg atttatcctt acttccctat gttgtcatag ttgcttctaa actcagctgc     9480

actatggctg tcaacatttc tgatacttat tgggatatgt gccatccagt catttagtac     9540

tttgaatgga acatgagatt tataacacag gtaatagctg aaggtaccag tatggtggtg     9600

agactcacac ttagtgatcc agctaaggta actgatgtta taatggaaca gagaagaggc     9660

caactagata gctaagttct tctgaaccta tgtgtatatg taagtacaaa tcatgcgtcc     9720

ttatggggtt aaacttaatc tgaaatttac atttttcata gtaaaaggaa accaattgtt     9780

gcagatttct tttcttgtga ggaaatacat ggcctttgat gctctggcgt ctactgcatt     9840

tcccagtctg ttctgctcga gaagccagaa tgtgttgtta acatttttcc gtgaatgttg     9900

tgttaaaatg attaaatgca tcagccaatg gcaagtgaag gaattgggtg tcctgatgca     9960

gactgagcag tttctctcaa ttgtagcctc atactcataa ggtgcttacc agctagaaca    10020

ttgagcacgt gaggtgagat tttttttctc tgatggcatt aactttgtaa tgcaatatga    10080

tggatgcaga ccctgttctt gtttccctct ggaagtcctt agtggctgca tccttggtgc    10140

actgtgatgg agatattaaa tgtgttcttt gtgagctttc gttctatgat tgtcaaaagt    10200

acgatgtggt tcctttttta tttttattaa acaatgagct gaggctttat tacagctggt    10260

tttcaagtta aaattgttga atactgatgt ctttctccca cctacaccaa atattttagt    10320

ctatttaaag tacaaaaaaa gttctgctta agaaaacatt gcttacatgt cctgtgattt    10380

ctggtcaatt tttatatata tttgtgtgca tcatctgtat gtgctttcac tttttacctt    10440

gtttgctctt acctgtgtta acagccctgt caccgttgaa aggtggacag ttttcctagc    10500

attaaaagaa agccatttga gttgtttacc atgttaaaaa aaaaaaaaaa a             10551


<210>  209
<211>  467
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens SMAD family member 2 (SMAD2), transcript variant 2,
       polypeptide; NCBI Reference Sequence: NM_001003652.3

<400>  209

Met Ser Ser Ile Leu Pro Phe Thr Pro Pro Val Val Lys Arg Leu Leu 
1               5                   10                  15      


Gly Trp Lys Lys Ser Ala Gly Gly Ser Gly Gly Ala Gly Gly Gly Glu 
            20                  25                  30          


Gln Asn Gly Gln Glu Glu Lys Trp Cys Glu Lys Ala Val Lys Ser Leu 
        35                  40                  45              


Val Lys Lys Leu Lys Lys Thr Gly Arg Leu Asp Glu Leu Glu Lys Ala 
    50                  55                  60                  


Ile Thr Thr Gln Asn Cys Asn Thr Lys Cys Val Thr Ile Pro Ser Thr 
65                  70                  75                  80  


Cys Ser Glu Ile Trp Gly Leu Ser Thr Pro Asn Thr Ile Asp Gln Trp 
                85                  90                  95      


Asp Thr Thr Gly Leu Tyr Ser Phe Ser Glu Gln Thr Arg Ser Leu Asp 
            100                 105                 110         


Gly Arg Leu Gln Val Ser His Arg Lys Gly Leu Pro His Val Ile Tyr 
        115                 120                 125             


Cys Arg Leu Trp Arg Trp Pro Asp Leu His Ser His His Glu Leu Lys 
    130                 135                 140                 


Ala Ile Glu Asn Cys Glu Tyr Ala Phe Asn Leu Lys Lys Asp Glu Val 
145                 150                 155                 160 


Cys Val Asn Pro Tyr His Tyr Gln Arg Val Glu Thr Pro Val Leu Pro 
                165                 170                 175     


Pro Val Leu Val Pro Arg His Thr Glu Ile Leu Thr Glu Leu Pro Pro 
            180                 185                 190         


Leu Asp Asp Tyr Thr His Ser Ile Pro Glu Asn Thr Asn Phe Pro Ala 
        195                 200                 205             


Gly Ile Glu Pro Gln Ser Asn Tyr Ile Pro Glu Thr Pro Pro Pro Gly 
    210                 215                 220                 


Tyr Ile Ser Glu Asp Gly Glu Thr Ser Asp Gln Gln Leu Asn Gln Ser 
225                 230                 235                 240 


Met Asp Thr Gly Ser Pro Ala Glu Leu Ser Pro Thr Thr Leu Ser Pro 
                245                 250                 255     


Val Asn His Ser Leu Asp Leu Gln Pro Val Thr Tyr Ser Glu Pro Ala 
            260                 265                 270         


Phe Trp Cys Ser Ile Ala Tyr Tyr Glu Leu Asn Gln Arg Val Gly Glu 
        275                 280                 285             


Thr Phe His Ala Ser Gln Pro Ser Leu Thr Val Asp Gly Phe Thr Asp 
    290                 295                 300                 


Pro Ser Asn Ser Glu Arg Phe Cys Leu Gly Leu Leu Ser Asn Val Asn 
305                 310                 315                 320 


Arg Asn Ala Thr Val Glu Met Thr Arg Arg His Ile Gly Arg Gly Val 
                325                 330                 335     


Arg Leu Tyr Tyr Ile Gly Gly Glu Val Phe Ala Glu Cys Leu Ser Asp 
            340                 345                 350         


Ser Ala Ile Phe Val Gln Ser Pro Asn Cys Asn Gln Arg Tyr Gly Trp 
        355                 360                 365             


His Pro Ala Thr Val Cys Lys Ile Pro Pro Gly Cys Asn Leu Lys Ile 
    370                 375                 380                 


Phe Asn Asn Gln Glu Phe Ala Ala Leu Leu Ala Gln Ser Val Asn Gln 
385                 390                 395                 400 


Gly Phe Glu Ala Val Tyr Gln Leu Thr Arg Met Cys Thr Ile Arg Met 
                405                 410                 415     


Ser Phe Val Lys Gly Trp Gly Ala Glu Tyr Arg Arg Gln Thr Val Thr 
            420                 425                 430         


Ser Thr Pro Cys Trp Ile Glu Leu His Leu Asn Gly Pro Leu Gln Trp 
        435                 440                 445             


Leu Asp Lys Val Leu Thr Gln Met Gly Ser Pro Ser Val Arg Cys Ser 
    450                 455                 460                 


Ser Met Ser 
465         


<210>  210
<211>  10461
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens SMAD family member 2 (SMAD2), transcript variant 3,
       mRNA; NCBI Reference Sequence: NM_001135937.2

<400>  210
cggccgggag gcggggcggg ccgtaggcaa agggaggtgg ggaggcggtg gccggcgact       60

ccccgcgccc cgctcgcccc ccggcccttc ccgcggtgct cggcctcgtt cctttcctcc      120

tccgctccct ccgtcttcca tacccgcccc gcgcggcttt cggccggcgt gcctcgcgcc      180

ctaacgggcg gctggaggcg ccaatcagcg ggcggcaggg tgccagcccc ggggctgcgc      240

cggcgaatcg gcggggcccg cggcccaggg tggcaggcgg gtctacccgc gcggccgcgg      300

cggcggagaa gcagctcgcc agccagcagc ccgccagccg ccgggaggtt cgatacaaga      360

ggctgttttc ctagcgtggc ttgctgcctt tggtaagaac atgtcgtcca tcttgccatt      420

cacgccgcca gttgtgaaga gactgctggg atggaagaag tcagctggtg ggtctggagg      480

agcaggcgga ggagagcaga atgggcagga agaaaagtgg tgtgagaaag cagtgaaaag      540

tctggtgaag aagctaaaga aaacaggacg attagatgag cttgagaaag ccatcaccac      600

tcaaaactgt aatactaaat gtgttaccat accaaggtct cttgatggtc gtctccaggt      660

atcccatcga aaaggattgc cacatgttat atattgccga ttatggcgct ggcctgatct      720

tcacagtcat catgaactca aggcaattga aaactgcgaa tatgctttta atcttaaaaa      780

ggatgaagta tgtgtaaacc cttaccacta tcagagagtt gagacaccag ttttgcctcc      840

agtattagtg ccccgacaca ccgagatcct aacagaactt ccgcctctgg atgactatac      900

tcactccatt ccagaaaaca ctaacttccc agcaggaatt gagccacaga gtaattatat      960

tccagaaacg ccacctcctg gatatatcag tgaagatgga gaaacaagtg accaacagtt     1020

gaatcaaagt atggacacag gctctccagc agaactatct cctactactc tttcccctgt     1080

taatcatagc ttggatttac agccagttac ttactcagaa cctgcatttt ggtgttcgat     1140

agcatattat gaattaaatc agagggttgg agaaaccttc catgcatcac agccctcact     1200

cactgtagat ggctttacag acccatcaaa ttcagagagg ttctgcttag gtttactctc     1260

caatgttaac cgaaatgcca cggtagaaat gacaagaagg catataggaa gaggagtgcg     1320

cttatactac ataggtgggg aagtttttgc tgagtgccta agtgatagtg caatctttgt     1380

gcagagcccc aattgtaatc agagatatgg ctggcaccct gcaacagtgt gtaaaattcc     1440

accaggctgt aatctgaaga tcttcaacaa ccaggaattt gctgctcttc tggctcagtc     1500

tgttaatcag ggttttgaag ccgtctatca gctaactaga atgtgcacca taagaatgag     1560

ttttgtgaaa gggtggggag cagaataccg aaggcagacg gtaacaagta ctccttgctg     1620

gattgaactt catctgaatg gacctctaca gtggttggac aaagtattaa ctcagatggg     1680

atccccttca gtgcgttgct caagcatgtc ataaagcttc accaatcaag tcccatgaaa     1740

agacttaatg taacaactct tctgtcatag cattgtgtgt ggtccctatg gactgtttac     1800

tatccaaaag ttcaagagag aaaacagcac ttgaggtctc atcaattaaa gcaccttgtg     1860

gaatctgttt cctatatttg aatattagat gggaaaatta gtgtctagaa atactctccc     1920

attaaagagg aagagaagat tttaaagact taatgatgtc ttattgggca taaaactgag     1980

tgtcccaaag gtttattaat aacagtagta gttatgtgta caggtaatgt atcatgatcc     2040

agtatcacag tattgtgctg tttatataca tttttagttt gcatagatga ggtgtgtgtg     2100

tgcgctgctt cttgatctag gcaaaccttt ataaagttgc agtacctaat ctgttattcc     2160

cacttctctg ttatttttgt gtgtcttttt taatatataa tatatatcaa gattttcaaa     2220

ttatttagaa gcagattttc ctgtagaaaa actaattttt ctgcctttta ccaaaaataa     2280

actcttgggg gaagaaaagt ggattaactt ttgaaatcct tgaccttaat gtgttcagtg     2340

gggcttaaac agtcattctt tttgtggttt tttgtttttt tttgtttttt tttttaactg     2400

ctaaatctta ttataaggaa accatactga aaacctttcc aagcctcttt tttccattcc     2460

catttttgtc ctcataatca aaacagcata acatgacatc atcaccagta atagttgcat     2520

tgatactgct ggcaccagtt aattctggga tacagtaaga attcatatgg agaaagtccc     2580

tttgtcttat gcccaaattt caacaggaat aattggcttg tataatctag cagtctgttg     2640

atttatcctt ccacctcata aaaaatgcat aggtggcagt ataattattt tcagggatat     2700

gctagaatta cttccacata tttatccctt tttaaaaaag ctaatctata aataccgttt     2760

ttccaaaggt attttacaat atttcaacag cagaccttct gctcttcgag tagtttgatt     2820

tggtttagta accagattgc attatgaaat gggccttttg taaatgtaat tgtttctgca     2880

aaatacctag aaaagtgatg ctgaggtagg atcagcagat atgggccatc tgtttttaaa     2940

gtatgttgta ttcagtttat aaattgattg ttattctaca cataattatg aattcagaat     3000

tttaaaaatt gggggaaaag ccatttattt agcaagtttt ttagcttata agttacctgc     3060

agtctgagct gttcttaact gatcctggtt ttgtgattga caatatttca tgctctgtag     3120

tgagaggaga tttccgaaac tctgttgcta gttcattctg cagcaaataa ttattatgtc     3180

tgatgttgac tcattgcagt ttaaacattt cttcttgttt gcatcttagt agaaatggaa     3240

aataaccact cctggtcgtc ttttcataaa ttttcatatt tttgaagctg tctttggtac     3300

ttgttctttg aaatcatatc cacctgtctc tataggtatc attttcaata ctttcaacat     3360

ttggtggttt tctattgggt actccccatt ttcctatatt tgtgtgtata tgtatgtgtt     3420

catgtaaatt tggtatagta attttttatt cattcaacaa atatttattg ttcacctgtt     3480

tgtaccagga acttttctta gtctttgggt aaaggtgaac aagacaacta cagttcctgc     3540

ctttgctgag acagcagtta cactaaccct taattatctt acttgtctat gaaggagata     3600

aacagggtac tgtactggag aataacagat gggatgcttc aggtaggaca tcaaggaaag     3660

cctctaagga aaggatgcat gagctaacac ctgacattaa agaagcaagc caagtgagga     3720

gccaggggag ataagcattc ctggcaaaga gaatagcatc aaatgcaaaa aggttcacac     3780

taaaggaaac tcctgattag gtattaatgc tttatacaga aacctctata caaatccaaa     3840

cttgaagatc agaatggttc tacagttcat aacattttga aggtggcctt attttgtgat     3900

agtctgcttc atgtgattct cactaacata tctccttcct caacctttgc tgtaaaaatt     3960

tcatttgcac cacatcagta ctacttaatt taacaagctt ttgttgtgta agctctcact     4020

gttttagtgc cctgctgctt gcttccagac tttgtgctgt ccagtaatta tgtcttccac     4080

tacccatctt gtgagcagag taaatgtcct aggtaatacc actatcaggc ctgtaggaga     4140

tactcagtgg agcctctgcc cttctttttc ttacttgaga acttgtaatg gtgttaggga     4200

acagttgtag gggcagaaaa caactctgaa agtggtagaa ggtcctgatc ttggtggtta     4260

ctcttgcatt actgtgttag gtcaagcagt gcctactatg ctgtttcagt agtggagcgc     4320

atctctacag ttctgatgcg atttttctgt acagtatgaa attgggactc aactctttga     4380

aaacacctat tgagcagtta tacctgttga gcagtttact tcctggttgt aattacattt     4440

gtgtgaatgt gtttgatgct ttttaacgag atgatgtttt ttgtatttta tctactgtgg     4500

cctgattttt tttttgtttt ctgcccctcc ccccatttat aggtgtggtt ttcatttttc     4560

taagtgatag aatcccctct ttgttgaatt tttgtcttta tttaaattag caacattact     4620

taggatttat tcttcacaat actgttaatt ttctaggaat gatgacctga gaaccgaatg     4680

gccatgcttt ctatcacatt tctaagatga gtaatatttt ttccagtagg ttccacagag     4740

acaccttggg ggctggctta ggggaggctg ttggagttct cactgactta gtggcatatt     4800

tattctgtac tgaagaactg catggggttt cttttggaaa gagtttcatt gctttaaaaa     4860

gaagctcaga aagtctttat aaccactggt caacgattag aaaaatataa ctggatttag     4920

gcctaccttc tggaataccg ctgattgtgc tctttttatc ctactttaaa gaagctttca     4980

tgattagatt tgagctatat cagttatacc gattatacct tataatacac attcagttag     5040

taaacattta ttgatgcctg ttgtttgccc agccactgtg atggatattg aataataaaa     5100

agatgactag gacggggccc tgacccttga gctgtgcttg gtcttgtaga ggttgtgttt     5160

tttttcctca ggacctgtca ctttggcaga aggaaatctg cctaattttt cttgaaagct     5220

aaattttctt tgtaagtttt tacaaattgt ttaataccta gttgtatttt ttaccttaag     5280

ccacattgag ttttgcttga tttgtctgtc ttttaaacac tgtcaaatgc tttccctttt     5340

gttaaaatta ttttaatttc actttttttg tgcccttgtc aatttaagac taagactttg     5400

aaggtaaaac aaacaaacaa acatcagtct tagtctcttg ctagttgaaa tcaaataaaa     5460

gaaaatatat acccagttgg tttctctacc tcttaaaagc ttcccatata tacctttaag     5520

atccttctct tttttcttta actactaaat aggttcagca tttattcagt gttagatacc     5580

ctcttcgtct gagggtggcg taggtttatg ttgggatata aagtaacaca agacaatctt     5640

cactgtacat aaaatatgtc ttcatgtaca gtctttactt taaaagctga acattccaat     5700

ttgcgccttc cctcccaagc ccctgcccac caagtatctc tttagatatc tagtctgtgg     5760

acatgaacaa tgaatacttt tttcttactc tgatcgaagg cattgatact tagacatatc     5820

aaacatttct tcctttcata tgctttactt tgctaaatct attatattca ttgcctgaat     5880

tttattcttc ctttctacct gacaacacac atccaggtgg tacttgctgg ttatcctctt     5940

tcttgttagc cttgtttttt gttttttttt tttttttttg agagggagtc tcgctctgtt     6000

gcccaacctg gagtgcagtg gtgcgatctt ggttcactgc aagctccgcc tcccgggttc     6060

acgccatgct tctgcctcag cctcccaagt agctgggact acaggcgccc accaccacac     6120

tcggctaatt ttttgtattt ttagtagaga cggggtttca ccgtgttggc caggatggtc     6180

tcgatctcct gacctcgtga tctgtccacc tcggcttccc aaagtgctgg gattacaggc     6240

atgagccacc gcgcccagcc tagccatatt tttatctgca tatatcagaa tgtttctctc     6300

ctttgaactt attaacaaaa aaggaacatg cttttcatac ctagagtcct aatttcttca     6360

tcatgaaggt tgctattcaa attgatcaat cattttaatt ttacaaatgg ctcaaaaatt     6420

ctgttcagta aatgtctttg tgactggcaa atggcataaa ttatgtttaa gattatgaac     6480

ttttctgaca gttgcagcca atgttttccc tacgatacca gatttccatc ttggggcata     6540

ttggattgtt gtatttaaga cagtcagaat aatgatagtg tgtggtctcc agaggtagtc     6600

agaatcctgc tattgagttc tttttatatc ttccttttca attttttatt accattttgt     6660

ttgtttagac tacactttgt agggattgag gggcaaatta tctcttggag tggaattcct     6720

gtgttttgag ccttacaacc aggaaatatg agctatacta gatagcctca tgatagcatt     6780

tacgataaga acttatctcg tgtgttcatg taattttttg agtaggaact gttttatctt     6840

gaatattgta gctaactata tatagcagaa ctgcctcagt ctttttaaga aggaaataaa     6900

taatatatgt gtatgaattt atatatacat atacactcat agacaaactt aacagttggg     6960

gtcattctaa cagttaaaac aattgttcca ttgtttaaat ctcagatcct ggtaaaatgt     7020

tcttaatttg tctgtgtaca ttttcctttc atggacagac cattggagta cattaatttt     7080

cttaatctgc catttggcag ttcatttaat ataccatttt ttggcaactt ggtaactaag     7140

aatcacagcc aaaatttgtt aacatcaaag aaagctctgc catatacccc gttactaaat     7200

tattatacat ccagcagatt ctgggatgta ctaacttagg gttaactttg ttgttgttga     7260

taatactaga ttgctccctc tttaattctt cttctggtgc aaggttgctg cttaagttac     7320

cctgggaaat actactacaa ggtcaaattt tctagtatct tacagcctga ttgaaggtga     7380

ttcagatctt tgctcaatat aaatggattt tccaagattc tctgggccat ccttgaccca     7440

caggtgatct cgctggagta tattaactta acttcagtgc cagttggttt ggtgccatga     7500

gatccataat gaatccagaa cttcaccatt gcttagatat aagagtccct tggaagaata     7560

atgccactga tgatgggggt cagaaggtgt attaactcaa catagagggc ttttagattt     7620

ttcttcaaaa aaatttcgag aaaagtattc ttttaccctc caaacagtta acagctctta     7680

gtttctccaa atatgctctt tgatttactt atttttaatt aaagatggta atttattgaa     7740

caatgaaatc cgtaatatat tgatttaagg acaaaagtga agttttagaa ttataaaagt     7800

acttaaatat tatatatttt ccatttcata attgttttcc tttctctgtg gctttaaagt     7860

ttttgactat tttacaatgt taatcactag gtaacttgcc atatttctgg ttctatatta     7920

agttctatcc tttataatgc tgttattata aagctggttt ttagcatttg tctgtagcaa     7980

tagaaatttt actaagtctc tgttctccca gtaagttttt tcttttctca gtaagtccct     8040

aagaaaacat ttgtttgcca ctcttactat tcccaatctt ggattgttcg agctgaaaaa     8100

aaatttgatg agaaacagga ggatcctttt ctggtgaata taggttcctg ctttaagaat     8160

gtggaaatcc attgctttat ataactaata tacacacaga ttaattaaaa ttgtgagaaa     8220

taattcacac atgacaagta ggtaacatgc atgagttttg aattttttta aaaacccaac     8280

tgtttgacaa aatatagaac ccaaattggt actttcttag accagtgtaa cctcacacct     8340

cagttttgct tttccaaccc tgacttgaaa ggcatatttg tatcttttta ttagtgatag     8400

tgaagctgtg acactaacct tttatacaaa agagtaaaga aagaaaaact acagcgatta     8460

agatgagaac agttctgcag ttgttgaact agatcacagc attgtaggca gaataaaaaa     8520

tgttcatatc tgagaatatt cctttcgcca tcttttccca aggccagacc tcctggtgga     8580

gcacagttaa aagtaacatt ctgggccttt gtaatcggag ggctgtgtct ccagctggca     8640

gcctttgttt taatatataa tgcaggactg tggaaaacag ttggcataga atattttcac     8700

ctaaaaaaga aagaaaagac atacaaaact ggattaattg caaaaagaga atacagtaaa     8760

ataccatata actggacaaa gctagaagaa cctttagaag atttgtctga aaacagattt     8820

caagagtgag cttttataca ctgctcacta atttgcttga ttactaccaa ctcttcttaa     8880

agttaacacg tttaaggtat ttctggactt cctagccttt tagcaagctt agaggaacta     8940

gccattagct agtgatgtaa aaatattttg gggactgatg cccttaaagg ttatgccctt     9000

gaaagttctt accttttctc tagtgatatt aaggaacgag tgggtagtgt tctcagggtg     9060

accagctgcc ctaaagtgcc tgggattgag ggtttccctg gatgcgggac tttccctgga     9120

tacaaaactt ttagcagagt tttgtatata tgtggatttt tctgataagt agcacatcag     9180

aggccttaac cactgcccaa aagcgattct ccattgagag tacatatctt gaacttaaga     9240

aattcatttg ctctgatttt taatcttgta aagtttttgc taaactcaaa acaagtccca     9300

ggcacaccag aaggagctga ccaccttagg tgttcttgtg atttatcctt acttccctat     9360

gttgtcatag ttgcttctaa actcagctgc actatggctg tcaacatttc tgatacttat     9420

tgggatatgt gccatccagt catttagtac tttgaatgga acatgagatt tataacacag     9480

gtaatagctg aaggtaccag tatggtggtg agactcacac ttagtgatcc agctaaggta     9540

actgatgtta taatggaaca gagaagaggc caactagata gctaagttct tctgaaccta     9600

tgtgtatatg taagtacaaa tcatgcgtcc ttatggggtt aaacttaatc tgaaatttac     9660

atttttcata gtaaaaggaa accaattgtt gcagatttct tttcttgtga ggaaatacat     9720

ggcctttgat gctctggcgt ctactgcatt tcccagtctg ttctgctcga gaagccagaa     9780

tgtgttgtta acatttttcc gtgaatgttg tgttaaaatg attaaatgca tcagccaatg     9840

gcaagtgaag gaattgggtg tcctgatgca gactgagcag tttctctcaa ttgtagcctc     9900

atactcataa ggtgcttacc agctagaaca ttgagcacgt gaggtgagat tttttttctc     9960

tgatggcatt aactttgtaa tgcaatatga tggatgcaga ccctgttctt gtttccctct    10020

ggaagtcctt agtggctgca tccttggtgc actgtgatgg agatattaaa tgtgttcttt    10080

gtgagctttc gttctatgat tgtcaaaagt acgatgtggt tcctttttta tttttattaa    10140

acaatgagct gaggctttat tacagctggt tttcaagtta aaattgttga atactgatgt    10200

ctttctccca cctacaccaa atattttagt ctatttaaag tacaaaaaaa gttctgctta    10260

agaaaacatt gcttacatgt cctgtgattt ctggtcaatt tttatatata tttgtgtgca    10320

tcatctgtat gtgctttcac tttttacctt gtttgctctt acctgtgtta acagccctgt    10380

caccgttgaa aggtggacag ttttcctagc attaaaagaa agccatttga gttgtttacc    10440

atgttaaaaa aaaaaaaaaa a                                              10461


<210>  211
<211>  437
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens SMAD family member 2 (SMAD2), transcript variant 3,
       polypeptide; NCBI Reference Sequence: NM_001135937.2

<400>  211

Met Ser Ser Ile Leu Pro Phe Thr Pro Pro Val Val Lys Arg Leu Leu 
1               5                   10                  15      


Gly Trp Lys Lys Ser Ala Gly Gly Ser Gly Gly Ala Gly Gly Gly Glu 
            20                  25                  30          


Gln Asn Gly Gln Glu Glu Lys Trp Cys Glu Lys Ala Val Lys Ser Leu 
        35                  40                  45              


Val Lys Lys Leu Lys Lys Thr Gly Arg Leu Asp Glu Leu Glu Lys Ala 
    50                  55                  60                  


Ile Thr Thr Gln Asn Cys Asn Thr Lys Cys Val Thr Ile Pro Arg Ser 
65                  70                  75                  80  


Leu Asp Gly Arg Leu Gln Val Ser His Arg Lys Gly Leu Pro His Val 
                85                  90                  95      


Ile Tyr Cys Arg Leu Trp Arg Trp Pro Asp Leu His Ser His His Glu 
            100                 105                 110         


Leu Lys Ala Ile Glu Asn Cys Glu Tyr Ala Phe Asn Leu Lys Lys Asp 
        115                 120                 125             


Glu Val Cys Val Asn Pro Tyr His Tyr Gln Arg Val Glu Thr Pro Val 
    130                 135                 140                 


Leu Pro Pro Val Leu Val Pro Arg His Thr Glu Ile Leu Thr Glu Leu 
145                 150                 155                 160 


Pro Pro Leu Asp Asp Tyr Thr His Ser Ile Pro Glu Asn Thr Asn Phe 
                165                 170                 175     


Pro Ala Gly Ile Glu Pro Gln Ser Asn Tyr Ile Pro Glu Thr Pro Pro 
            180                 185                 190         


Pro Gly Tyr Ile Ser Glu Asp Gly Glu Thr Ser Asp Gln Gln Leu Asn 
        195                 200                 205             


Gln Ser Met Asp Thr Gly Ser Pro Ala Glu Leu Ser Pro Thr Thr Leu 
    210                 215                 220                 


Ser Pro Val Asn His Ser Leu Asp Leu Gln Pro Val Thr Tyr Ser Glu 
225                 230                 235                 240 


Pro Ala Phe Trp Cys Ser Ile Ala Tyr Tyr Glu Leu Asn Gln Arg Val 
                245                 250                 255     


Gly Glu Thr Phe His Ala Ser Gln Pro Ser Leu Thr Val Asp Gly Phe 
            260                 265                 270         


Thr Asp Pro Ser Asn Ser Glu Arg Phe Cys Leu Gly Leu Leu Ser Asn 
        275                 280                 285             


Val Asn Arg Asn Ala Thr Val Glu Met Thr Arg Arg His Ile Gly Arg 
    290                 295                 300                 


Gly Val Arg Leu Tyr Tyr Ile Gly Gly Glu Val Phe Ala Glu Cys Leu 
305                 310                 315                 320 


Ser Asp Ser Ala Ile Phe Val Gln Ser Pro Asn Cys Asn Gln Arg Tyr 
                325                 330                 335     


Gly Trp His Pro Ala Thr Val Cys Lys Ile Pro Pro Gly Cys Asn Leu 
            340                 345                 350         


Lys Ile Phe Asn Asn Gln Glu Phe Ala Ala Leu Leu Ala Gln Ser Val 
        355                 360                 365             


Asn Gln Gly Phe Glu Ala Val Tyr Gln Leu Thr Arg Met Cys Thr Ile 
    370                 375                 380                 


Arg Met Ser Phe Val Lys Gly Trp Gly Ala Glu Tyr Arg Arg Gln Thr 
385                 390                 395                 400 


Val Thr Ser Thr Pro Cys Trp Ile Glu Leu His Leu Asn Gly Pro Leu 
                405                 410                 415     


Gln Trp Leu Asp Lys Val Leu Thr Gln Met Gly Ser Pro Ser Val Arg 
            420                 425                 430         


Cys Ser Ser Met Ser 
        435         


<210>  212
<211>  6256
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human Smad3 (paralog of Smad2) mRNA transcript variant 1 GenBank 
       Accession No.: NM_005902.3  GI:52352808

<400>  212
gcggccgccg cctccgcccc gcgttcgggg ccttcccgac cctgcactgc tgccgtccgc       60

ccgcccggcc gctcttctct tcgccgtggg agccgctccg ggcgcagggc cgcgcgccga      120

gccccgcagg ctgcagcgcc gcggcccggc ccggcgcccc ggcaacttcg ccgagagttg      180

aggcgaagtt tgggcgaccg cggcaggccc cggccgagct cccctctgcg cccccggcgt      240

cccgtcgagc ccagccccgc cgggggcgct cctcgccgcc cgcgcgccct ccccagccat      300

gtcgtccatc ctgcctttca ctcccccgat cgtgaagcgc ctgctgggct ggaagaaggg      360

cgagcagaac gggcaggagg agaaatggtg cgagaaggcg gtcaagagcc tggtcaagaa      420

actcaagaag acggggcagc tggacgagct ggagaaggcc atcaccacgc agaacgtcaa      480

caccaagtgc atcaccatcc ccaggtccct ggatggccgg ttgcaggtgt cccatcggaa      540

ggggctccct catgtcatct actgccgcct gtggcgatgg ccagacctgc acagccacca      600

cgagctacgg gccatggagc tgtgtgagtt cgccttcaat atgaagaagg acgaggtctg      660

cgtgaatccc taccactacc agagagtaga gacaccagtt ctacctcctg tgttggtgcc      720

acgccacaca gagatcccgg ccgagttccc cccactggac gactacagcc attccatccc      780

cgaaaacact aacttccccg caggcatcga gccccagagc aatattccag agaccccacc      840

ccctggctac ctgagtgaag atggagaaac cagtgaccac cagatgaacc acagcatgga      900

cgcaggttct ccaaacctat ccccgaatcc gatgtcccca gcacataata acttggacct      960

gcagccagtt acctactgcg agccggcctt ctggtgctcc atctcctact acgagctgaa     1020

ccagcgcgtc ggggagacat tccacgcctc gcagccatcc atgactgtgg atggcttcac     1080

cgacccctcc aattcggagc gcttctgcct agggctgctc tccaatgtca acaggaatgc     1140

agcagtggag ctgacacgga gacacatcgg aagaggcgtg cggctctact acatcggagg     1200

ggaggtcttc gcagagtgcc tcagtgacag cgctattttt gtccagtctc ccaactgtaa     1260

ccagcgctat ggctggcacc cggccaccgt ctgcaagatc ccaccaggat gcaacctgaa     1320

gatcttcaac aaccaggagt tcgctgccct cctggcccag tcggtcaacc agggctttga     1380

ggctgtctac cagttgaccc gaatgtgcac catccgcatg agcttcgtca aaggctgggg     1440

agcggagtac aggagacaga ctgtgaccag taccccctgc tggattgagc tgcacctgaa     1500

tgggcctttg cagtggcttg acaaggtcct cacccagatg ggctccccaa gcatccgctg     1560

ttccagtgtg tcttagagac atcaagtatg gtaggggagg gcaggcttgg ggaaaatggc     1620

catgcaggag gtggagaaaa ttggaactct actcaaccca ttgttgtcaa ggaagaagaa     1680

atctttctcc ctcaactgaa ggggtgcacc cacctgtttt ctgaaacaca cgagcaaacc     1740

cagaggtgga tgttatgaac agctgtgtct gccaaacaca tttacccttt ggccccactt     1800

tgaagggcaa gaaatggcgt ctgctctggt ggcttaagtg agcagaacag gtagtattac     1860

accaccggcc ccctcccccc agactctttt tttgagtgac agctttctgg gatgtcacag     1920

tccaaccaga aacacccctc tgtctaggac tgcagtgtgg agttcacctt ggaagggcgt     1980

tctaggtagg aagagcccgc agggccatgc agacctcatg cccagctctc tgacgcttgt     2040

gacagtgcct cttccagtga acattcccag cccagccccg ccccgccccg ccccaccact     2100

ccagcagacc ttgccccttg tgagctggat agacttggga tggggaggga gggagttttg     2160

tctgtctccc tcccctctca gaacatactg attgggaggt gcgtgttcag cagaacctgc     2220

acacaggaca gcgggaaaaa tcgatgagcg ccacctcttt aaaaactcac ttacgtttgt     2280

cctttttcac tttgaaaagt tggaaggatc tgctgaggcc cagtgcatat gcaatgtata     2340

gtgtctatta tcacattaat ctcaaagaga ttcgaatgac ggtaagtgtt ctcatgaagc     2400

aggaggccct tgtcgtggga tggcatttgg tctcaggcag caccacactg ggtgcgtctc     2460

cagtcatctg taagagcttg ctccagattc tgatgcatac ggctatattg gtttatgtag     2520

tcagttgcat tcattaaatc aactttatca tatgctcttt taaatgtttg gtttatatat     2580

tttctttaaa aatcctgggc tggcacattg actgggaaac ctgagtgaga cccagcaact     2640

gcttctctcc cttctctctc ctgaggtgaa gcttttccag gttttgttga agagatacct     2700

gccagcactt ctgcaagctg aaatttacag aagcaaattc accagaaggg aaacatctca     2760

ggccaacata ggcaaatgaa aagggctatt aaaatatttt tacacctttg aaaattgcag     2820

gcttggtaca aagaggtctg tcatcttccc cctgggatat aagatgatct agctcccggt     2880

agaggatcac cggtgacaac tatagcagtt gtattgtgta acaagtactg ctcccagcag     2940

caattaggga gaaaactagt ctaaattatt tcaactggaa aaaagaaaaa agagtcctct     3000

tcttttccca gccttttgca gaacacagta gacagaactg ccaccttcaa ttggtacttt     3060

attctttgct gctgtttttg tataaaatga cctatcccac gtttttgcat gaatttatag     3120

caggaaaaat caagggattt cctatggaag tcctgcttta ttccaggtga agggaaggaa     3180

gtgtatatac ttttggcaag tcatacagct caaatgtgat gagatttctg atgttagagg     3240

gagatggaga ggcttcctga tgcctcatct gcagggtcct gtgcctctga agttctagcc     3300

atgaggtttc caggtaggac agctgctccc caagcctcct gaggacacag gaagagacgg     3360

aaggagcacc ttgacagact tgtgtgagtc ttctcgaagg agggttgact cagaacccag     3420

agacaataca aaacccctca cttcctctga gagggccaaa tgctgtgagt ctgaagtatg     3480

tgcctggtgt gaaatgatct atggcctgtt tcttacacag gaagccccct gaacctcctg     3540

tacatgtgtt catgttccca gccagctctg agacccagga accaaatatt ccattttggc     3600

ttctgctaga gcagtcatgg ttcctctcct aaaagccatg ggcagcagtt tccgagggcc     3660

tgcatgatcc acctgctgca cgatcctatg agggcttcct gtggcacaca gccctctggg     3720

tgcttgggaa ctagcttcag gcacagcctg attctggtga tccagtgatc tatggaagtc     3780

gtgtcttact ccaggtgaag ggggaaaaaa aaagcctata ctttggcagg ttatgaactt     3840

tgaatgtgat gaaatgacac gtttggctgc atttggatgg tgtcttagaa ccctcattgc     3900

tcagacctga aggctacttc taggagcatg aagtttgagt tttgtgtttt tccaaaggat     3960

acttccttgg ccctttttct ttattgacta gaccaccaga ggaggatgtg tgggattgta     4020

ggcaaaccca cctgtggcat cactgaaaat aaatttgatc atacctaaga ggttaggaaa     4080

tggtgccatt cccaccttag agtgctacat aggtgctttg ggcgtatgta acattagtgt     4140

ccttccttga agccacaagc tagttttctt agttttaaaa tcctgttgta tgaatggcat     4200

ttgtatatta aaacactttt ttaaaggaca gttgaaaagg gcaagaggaa accagggcag     4260

ttctagagga gtgctggtga ctggatagca gttttaagtg gcgttcacct agtcaacacg     4320

accgcgtgtg ttgcccctgc cctgggctcc ccgccatgac atcttcacct tgcagcttgt     4380

gctgagactg acccaagtgc agctagcact gggacacaga tccttgtctt cagcaccttc     4440

caaggagcca acttttattc cctttcctct ctcccctccc cacctcgctt cttcccaatt     4500

tagtaactta gatgcttcca gcacatacgt aggtagctac cccagccggt ttggattaca     4560

ggcctgtgct ggaacatcat ctcagttggc caccttcctg gcaggctgta gacctgacat     4620

tttgagacaa gcctagagtc aggagcaggg actttgactc ttaggaagag cacacatgag     4680

ggcaaggctg ctggcagacg tctccattgt ccttatgttg tctgtgttgt attttttttt     4740

ttttattgac catggtgatt atttttttaa accatcgtta atatactgaa gtgagctata     4800

gcacatatca tgtgcttagt ttgtttattt ttctccatct ccccttggct tcctagagtt     4860

tggacatatt ccaggctaaa tgcttttact caagactaca gaaaggtttg aagtagtgtg     4920

tgcatggcat gcacgtatgt aagtaatctg gggaagaagc aaagatctgt ttcattctta     4980

gcctcaggcc tcatgagggt ctccacaggg ccggagctca ggttacacca ctccttcgtc     5040

cttacaggag atgtagggag aagaatctgc aggctgcttg taggactgtt caccaagggg     5100

gataccagca gcaagagagt gcacccgttt agccctggac cctgtttctt actgtgtgac     5160

ttggctagag ttgggagttc ccccaaaata aacgtgtccc cattttacca gaaccaaacc     5220

tcaacacagc gaagctgtac tgtctttgtg tggcaaagat gttcccttgt aggccccttt     5280

caggtaaccg tcttcacaat gtattttcat cacagtttaa ggagcatcag ccgcttctca     5340

agtgggtagg gaaagcagaa aaacgtacgc aagaggacat ggatccaaaa tgatgatgaa     5400

gcatctccca tggggaggtg atggtgggga gatgatgggc taaacaggca acttttcaaa     5460

aacacagcta tcatagaaaa gaaacttgcc tcatgtaaac tggattgaga aattctcagt     5520

gattctgcaa tggatttttt tttaatgcag aagtaatgta tactctagta ttctggtgtt     5580

tttatattta tgtaataatt tcttaaaacc attcagacag ataactattt aatttttttt     5640

aagaaagttg gaaaggtctc tcctcccaag gacagtggct ggaagagttg gggcacagcc     5700

agttctgaat gttggtggag ggtgtagtgg ctttttggct cagcatccag aaacaccaaa     5760

ccaggctggc taaacaagtg gccgcgtgta aaaacagaca gctctgagtc aaatctgggc     5820

ccttccacaa gggtcctctg aaccaagccc cactcccttg ctaggggtga aagcattaca     5880

gagagatgga gccatctatc caagaagcct tcactcacct tcactgctgc tgttgcaact     5940

cggctgttct ggactctgat gtgtgtggag ggatggggaa tagaacattg actgtgttga     6000

ttaccttcac tattcggcca gcctgacctt ttaataactt tgtaaaaagc atgtatgtat     6060

ttatagtgtt ttagattttt ctaactttta tatcttaaaa gcagagcacc tgtttaagca     6120

ttgtacccct attgttaaag atttgtgtcc tctcattccc tctcttcctc ttgtaagtgc     6180

ccttctaata aacttttcat ggaaaagctc ctgtgccagg agctcagtct gaaaaaaaaa     6240

aaaaaaaaaa aaaaaa                                                     6256


<210>  213
<211>  425
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human Smad3 (paralog of Smad2) polypeptide encoded by mRNA 
       transcript variant 1 GenBank Accession No.: NP_005893.1  
       GI:5174513

<400>  213

Met Ser Ser Ile Leu Pro Phe Thr Pro Pro Ile Val Lys Arg Leu Leu 
1               5                   10                  15      


Gly Trp Lys Lys Gly Glu Gln Asn Gly Gln Glu Glu Lys Trp Cys Glu 
            20                  25                  30          


Lys Ala Val Lys Ser Leu Val Lys Lys Leu Lys Lys Thr Gly Gln Leu 
        35                  40                  45              


Asp Glu Leu Glu Lys Ala Ile Thr Thr Gln Asn Val Asn Thr Lys Cys 
    50                  55                  60                  


Ile Thr Ile Pro Arg Ser Leu Asp Gly Arg Leu Gln Val Ser His Arg 
65                  70                  75                  80  


Lys Gly Leu Pro His Val Ile Tyr Cys Arg Leu Trp Arg Trp Pro Asp 
                85                  90                  95      


Leu His Ser His His Glu Leu Arg Ala Met Glu Leu Cys Glu Phe Ala 
            100                 105                 110         


Phe Asn Met Lys Lys Asp Glu Val Cys Val Asn Pro Tyr His Tyr Gln 
        115                 120                 125             


Arg Val Glu Thr Pro Val Leu Pro Pro Val Leu Val Pro Arg His Thr 
    130                 135                 140                 


Glu Ile Pro Ala Glu Phe Pro Pro Leu Asp Asp Tyr Ser His Ser Ile 
145                 150                 155                 160 


Pro Glu Asn Thr Asn Phe Pro Ala Gly Ile Glu Pro Gln Ser Asn Ile 
                165                 170                 175     


Pro Glu Thr Pro Pro Pro Gly Tyr Leu Ser Glu Asp Gly Glu Thr Ser 
            180                 185                 190         


Asp His Gln Met Asn His Ser Met Asp Ala Gly Ser Pro Asn Leu Ser 
        195                 200                 205             


Pro Asn Pro Met Ser Pro Ala His Asn Asn Leu Asp Leu Gln Pro Val 
    210                 215                 220                 


Thr Tyr Cys Glu Pro Ala Phe Trp Cys Ser Ile Ser Tyr Tyr Glu Leu 
225                 230                 235                 240 


Asn Gln Arg Val Gly Glu Thr Phe His Ala Ser Gln Pro Ser Met Thr 
                245                 250                 255     


Val Asp Gly Phe Thr Asp Pro Ser Asn Ser Glu Arg Phe Cys Leu Gly 
            260                 265                 270         


Leu Leu Ser Asn Val Asn Arg Asn Ala Ala Val Glu Leu Thr Arg Arg 
        275                 280                 285             


His Ile Gly Arg Gly Val Arg Leu Tyr Tyr Ile Gly Gly Glu Val Phe 
    290                 295                 300                 


Ala Glu Cys Leu Ser Asp Ser Ala Ile Phe Val Gln Ser Pro Asn Cys 
305                 310                 315                 320 


Asn Gln Arg Tyr Gly Trp His Pro Ala Thr Val Cys Lys Ile Pro Pro 
                325                 330                 335     


Gly Cys Asn Leu Lys Ile Phe Asn Asn Gln Glu Phe Ala Ala Leu Leu 
            340                 345                 350         


Ala Gln Ser Val Asn Gln Gly Phe Glu Ala Val Tyr Gln Leu Thr Arg 
        355                 360                 365             


Met Cys Thr Ile Arg Met Ser Phe Val Lys Gly Trp Gly Ala Glu Tyr 
    370                 375                 380                 


Arg Arg Gln Thr Val Thr Ser Thr Pro Cys Trp Ile Glu Leu His Leu 
385                 390                 395                 400 


Asn Gly Pro Leu Gln Trp Leu Asp Lys Val Leu Thr Gln Met Gly Ser 
                405                 410                 415     


Pro Ser Ile Arg Cys Ser Ser Val Ser 
            420                 425 


<210>  214
<211>  10551
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens SMAD family member 2 (SMAD2), transcript variant 2, 
       mRNA; NCBI Reference Sequence: NM_001003652.3

<400>  214
cggccgggag gcggggcggg ccgtaggcaa agggaggtgg ggaggcggtg gccggcgact       60

ccccgcgccc cgctcgcccc ccggcccttc ccgcggtgct cggcctcgtt cctttcctcc      120

tccgctccct ccgtcttcca tacccgcccc gcgcggcttt cggccggcgt gcctcgcgcc      180

ctaacgggcg gctggaggcg ccaatcagcg ggcggcaggg tgccagcccc ggggctgcgc      240

cggcgaatcg gcggggcccg cggcccaggg tggcaggcgg gtctacccgc gcggccgcgg      300

cggcggagaa gcagctcgcc agccagcagc ccgccagccg ccgggaggtt cgatacaaga      360

ggctgttttc ctagcgtggc ttgctgcctt tggtaagaac atgtcgtcca tcttgccatt      420

cacgccgcca gttgtgaaga gactgctggg atggaagaag tcagctggtg ggtctggagg      480

agcaggcgga ggagagcaga atgggcagga agaaaagtgg tgtgagaaag cagtgaaaag      540

tctggtgaag aagctaaaga aaacaggacg attagatgag cttgagaaag ccatcaccac      600

tcaaaactgt aatactaaat gtgttaccat accaagcact tgctctgaaa tttggggact      660

gagtacacca aatacgatag atcagtggga tacaacaggc ctttacagct tctctgaaca      720

aaccaggtct cttgatggtc gtctccaggt atcccatcga aaaggattgc cacatgttat      780

atattgccga ttatggcgct ggcctgatct tcacagtcat catgaactca aggcaattga      840

aaactgcgaa tatgctttta atcttaaaaa ggatgaagta tgtgtaaacc cttaccacta      900

tcagagagtt gagacaccag ttttgcctcc agtattagtg ccccgacaca ccgagatcct      960

aacagaactt ccgcctctgg atgactatac tcactccatt ccagaaaaca ctaacttccc     1020

agcaggaatt gagccacaga gtaattatat tccagaaacg ccacctcctg gatatatcag     1080

tgaagatgga gaaacaagtg accaacagtt gaatcaaagt atggacacag gctctccagc     1140

agaactatct cctactactc tttcccctgt taatcatagc ttggatttac agccagttac     1200

ttactcagaa cctgcatttt ggtgttcgat agcatattat gaattaaatc agagggttgg     1260

agaaaccttc catgcatcac agccctcact cactgtagat ggctttacag acccatcaaa     1320

ttcagagagg ttctgcttag gtttactctc caatgttaac cgaaatgcca cggtagaaat     1380

gacaagaagg catataggaa gaggagtgcg cttatactac ataggtgggg aagtttttgc     1440

tgagtgccta agtgatagtg caatctttgt gcagagcccc aattgtaatc agagatatgg     1500

ctggcaccct gcaacagtgt gtaaaattcc accaggctgt aatctgaaga tcttcaacaa     1560

ccaggaattt gctgctcttc tggctcagtc tgttaatcag ggttttgaag ccgtctatca     1620

gctaactaga atgtgcacca taagaatgag ttttgtgaaa gggtggggag cagaataccg     1680

aaggcagacg gtaacaagta ctccttgctg gattgaactt catctgaatg gacctctaca     1740

gtggttggac aaagtattaa ctcagatggg atccccttca gtgcgttgct caagcatgtc     1800

ataaagcttc accaatcaag tcccatgaaa agacttaatg taacaactct tctgtcatag     1860

cattgtgtgt ggtccctatg gactgtttac tatccaaaag ttcaagagag aaaacagcac     1920

ttgaggtctc atcaattaaa gcaccttgtg gaatctgttt cctatatttg aatattagat     1980

gggaaaatta gtgtctagaa atactctccc attaaagagg aagagaagat tttaaagact     2040

taatgatgtc ttattgggca taaaactgag tgtcccaaag gtttattaat aacagtagta     2100

gttatgtgta caggtaatgt atcatgatcc agtatcacag tattgtgctg tttatataca     2160

tttttagttt gcatagatga ggtgtgtgtg tgcgctgctt cttgatctag gcaaaccttt     2220

ataaagttgc agtacctaat ctgttattcc cacttctctg ttatttttgt gtgtcttttt     2280

taatatataa tatatatcaa gattttcaaa ttatttagaa gcagattttc ctgtagaaaa     2340

actaattttt ctgcctttta ccaaaaataa actcttgggg gaagaaaagt ggattaactt     2400

ttgaaatcct tgaccttaat gtgttcagtg gggcttaaac agtcattctt tttgtggttt     2460

tttgtttttt tttgtttttt tttttaactg ctaaatctta ttataaggaa accatactga     2520

aaacctttcc aagcctcttt tttccattcc catttttgtc ctcataatca aaacagcata     2580

acatgacatc atcaccagta atagttgcat tgatactgct ggcaccagtt aattctggga     2640

tacagtaaga attcatatgg agaaagtccc tttgtcttat gcccaaattt caacaggaat     2700

aattggcttg tataatctag cagtctgttg atttatcctt ccacctcata aaaaatgcat     2760

aggtggcagt ataattattt tcagggatat gctagaatta cttccacata tttatccctt     2820

tttaaaaaag ctaatctata aataccgttt ttccaaaggt attttacaat atttcaacag     2880

cagaccttct gctcttcgag tagtttgatt tggtttagta accagattgc attatgaaat     2940

gggccttttg taaatgtaat tgtttctgca aaatacctag aaaagtgatg ctgaggtagg     3000

atcagcagat atgggccatc tgtttttaaa gtatgttgta ttcagtttat aaattgattg     3060

ttattctaca cataattatg aattcagaat tttaaaaatt gggggaaaag ccatttattt     3120

agcaagtttt ttagcttata agttacctgc agtctgagct gttcttaact gatcctggtt     3180

ttgtgattga caatatttca tgctctgtag tgagaggaga tttccgaaac tctgttgcta     3240

gttcattctg cagcaaataa ttattatgtc tgatgttgac tcattgcagt ttaaacattt     3300

cttcttgttt gcatcttagt agaaatggaa aataaccact cctggtcgtc ttttcataaa     3360

ttttcatatt tttgaagctg tctttggtac ttgttctttg aaatcatatc cacctgtctc     3420

tataggtatc attttcaata ctttcaacat ttggtggttt tctattgggt actccccatt     3480

ttcctatatt tgtgtgtata tgtatgtgtt catgtaaatt tggtatagta attttttatt     3540

cattcaacaa atatttattg ttcacctgtt tgtaccagga acttttctta gtctttgggt     3600

aaaggtgaac aagacaacta cagttcctgc ctttgctgag acagcagtta cactaaccct     3660

taattatctt acttgtctat gaaggagata aacagggtac tgtactggag aataacagat     3720

gggatgcttc aggtaggaca tcaaggaaag cctctaagga aaggatgcat gagctaacac     3780

ctgacattaa agaagcaagc caagtgagga gccaggggag ataagcattc ctggcaaaga     3840

gaatagcatc aaatgcaaaa aggttcacac taaaggaaac tcctgattag gtattaatgc     3900

tttatacaga aacctctata caaatccaaa cttgaagatc agaatggttc tacagttcat     3960

aacattttga aggtggcctt attttgtgat agtctgcttc atgtgattct cactaacata     4020

tctccttcct caacctttgc tgtaaaaatt tcatttgcac cacatcagta ctacttaatt     4080

taacaagctt ttgttgtgta agctctcact gttttagtgc cctgctgctt gcttccagac     4140

tttgtgctgt ccagtaatta tgtcttccac tacccatctt gtgagcagag taaatgtcct     4200

aggtaatacc actatcaggc ctgtaggaga tactcagtgg agcctctgcc cttctttttc     4260

ttacttgaga acttgtaatg gtgttaggga acagttgtag gggcagaaaa caactctgaa     4320

agtggtagaa ggtcctgatc ttggtggtta ctcttgcatt actgtgttag gtcaagcagt     4380

gcctactatg ctgtttcagt agtggagcgc atctctacag ttctgatgcg atttttctgt     4440

acagtatgaa attgggactc aactctttga aaacacctat tgagcagtta tacctgttga     4500

gcagtttact tcctggttgt aattacattt gtgtgaatgt gtttgatgct ttttaacgag     4560

atgatgtttt ttgtatttta tctactgtgg cctgattttt tttttgtttt ctgcccctcc     4620

ccccatttat aggtgtggtt ttcatttttc taagtgatag aatcccctct ttgttgaatt     4680

tttgtcttta tttaaattag caacattact taggatttat tcttcacaat actgttaatt     4740

ttctaggaat gatgacctga gaaccgaatg gccatgcttt ctatcacatt tctaagatga     4800

gtaatatttt ttccagtagg ttccacagag acaccttggg ggctggctta ggggaggctg     4860

ttggagttct cactgactta gtggcatatt tattctgtac tgaagaactg catggggttt     4920

cttttggaaa gagtttcatt gctttaaaaa gaagctcaga aagtctttat aaccactggt     4980

caacgattag aaaaatataa ctggatttag gcctaccttc tggaataccg ctgattgtgc     5040

tctttttatc ctactttaaa gaagctttca tgattagatt tgagctatat cagttatacc     5100

gattatacct tataatacac attcagttag taaacattta ttgatgcctg ttgtttgccc     5160

agccactgtg atggatattg aataataaaa agatgactag gacggggccc tgacccttga     5220

gctgtgcttg gtcttgtaga ggttgtgttt tttttcctca ggacctgtca ctttggcaga     5280

aggaaatctg cctaattttt cttgaaagct aaattttctt tgtaagtttt tacaaattgt     5340

ttaataccta gttgtatttt ttaccttaag ccacattgag ttttgcttga tttgtctgtc     5400

ttttaaacac tgtcaaatgc tttccctttt gttaaaatta ttttaatttc actttttttg     5460

tgcccttgtc aatttaagac taagactttg aaggtaaaac aaacaaacaa acatcagtct     5520

tagtctcttg ctagttgaaa tcaaataaaa gaaaatatat acccagttgg tttctctacc     5580

tcttaaaagc ttcccatata tacctttaag atccttctct tttttcttta actactaaat     5640

aggttcagca tttattcagt gttagatacc ctcttcgtct gagggtggcg taggtttatg     5700

ttgggatata aagtaacaca agacaatctt cactgtacat aaaatatgtc ttcatgtaca     5760

gtctttactt taaaagctga acattccaat ttgcgccttc cctcccaagc ccctgcccac     5820

caagtatctc tttagatatc tagtctgtgg acatgaacaa tgaatacttt tttcttactc     5880

tgatcgaagg cattgatact tagacatatc aaacatttct tcctttcata tgctttactt     5940

tgctaaatct attatattca ttgcctgaat tttattcttc ctttctacct gacaacacac     6000

atccaggtgg tacttgctgg ttatcctctt tcttgttagc cttgtttttt gttttttttt     6060

tttttttttg agagggagtc tcgctctgtt gcccaacctg gagtgcagtg gtgcgatctt     6120

ggttcactgc aagctccgcc tcccgggttc acgccatgct tctgcctcag cctcccaagt     6180

agctgggact acaggcgccc accaccacac tcggctaatt ttttgtattt ttagtagaga     6240

cggggtttca ccgtgttggc caggatggtc tcgatctcct gacctcgtga tctgtccacc     6300

tcggcttccc aaagtgctgg gattacaggc atgagccacc gcgcccagcc tagccatatt     6360

tttatctgca tatatcagaa tgtttctctc ctttgaactt attaacaaaa aaggaacatg     6420

cttttcatac ctagagtcct aatttcttca tcatgaaggt tgctattcaa attgatcaat     6480

cattttaatt ttacaaatgg ctcaaaaatt ctgttcagta aatgtctttg tgactggcaa     6540

atggcataaa ttatgtttaa gattatgaac ttttctgaca gttgcagcca atgttttccc     6600

tacgatacca gatttccatc ttggggcata ttggattgtt gtatttaaga cagtcagaat     6660

aatgatagtg tgtggtctcc agaggtagtc agaatcctgc tattgagttc tttttatatc     6720

ttccttttca attttttatt accattttgt ttgtttagac tacactttgt agggattgag     6780

gggcaaatta tctcttggag tggaattcct gtgttttgag ccttacaacc aggaaatatg     6840

agctatacta gatagcctca tgatagcatt tacgataaga acttatctcg tgtgttcatg     6900

taattttttg agtaggaact gttttatctt gaatattgta gctaactata tatagcagaa     6960

ctgcctcagt ctttttaaga aggaaataaa taatatatgt gtatgaattt atatatacat     7020

atacactcat agacaaactt aacagttggg gtcattctaa cagttaaaac aattgttcca     7080

ttgtttaaat ctcagatcct ggtaaaatgt tcttaatttg tctgtgtaca ttttcctttc     7140

atggacagac cattggagta cattaatttt cttaatctgc catttggcag ttcatttaat     7200

ataccatttt ttggcaactt ggtaactaag aatcacagcc aaaatttgtt aacatcaaag     7260

aaagctctgc catatacccc gttactaaat tattatacat ccagcagatt ctgggatgta     7320

ctaacttagg gttaactttg ttgttgttga taatactaga ttgctccctc tttaattctt     7380

cttctggtgc aaggttgctg cttaagttac cctgggaaat actactacaa ggtcaaattt     7440

tctagtatct tacagcctga ttgaaggtga ttcagatctt tgctcaatat aaatggattt     7500

tccaagattc tctgggccat ccttgaccca caggtgatct cgctggagta tattaactta     7560

acttcagtgc cagttggttt ggtgccatga gatccataat gaatccagaa cttcaccatt     7620

gcttagatat aagagtccct tggaagaata atgccactga tgatgggggt cagaaggtgt     7680

attaactcaa catagagggc ttttagattt ttcttcaaaa aaatttcgag aaaagtattc     7740

ttttaccctc caaacagtta acagctctta gtttctccaa atatgctctt tgatttactt     7800

atttttaatt aaagatggta atttattgaa caatgaaatc cgtaatatat tgatttaagg     7860

acaaaagtga agttttagaa ttataaaagt acttaaatat tatatatttt ccatttcata     7920

attgttttcc tttctctgtg gctttaaagt ttttgactat tttacaatgt taatcactag     7980

gtaacttgcc atatttctgg ttctatatta agttctatcc tttataatgc tgttattata     8040

aagctggttt ttagcatttg tctgtagcaa tagaaatttt actaagtctc tgttctccca     8100

gtaagttttt tcttttctca gtaagtccct aagaaaacat ttgtttgcca ctcttactat     8160

tcccaatctt ggattgttcg agctgaaaaa aaatttgatg agaaacagga ggatcctttt     8220

ctggtgaata taggttcctg ctttaagaat gtggaaatcc attgctttat ataactaata     8280

tacacacaga ttaattaaaa ttgtgagaaa taattcacac atgacaagta ggtaacatgc     8340

atgagttttg aattttttta aaaacccaac tgtttgacaa aatatagaac ccaaattggt     8400

actttcttag accagtgtaa cctcacacct cagttttgct tttccaaccc tgacttgaaa     8460

ggcatatttg tatcttttta ttagtgatag tgaagctgtg acactaacct tttatacaaa     8520

agagtaaaga aagaaaaact acagcgatta agatgagaac agttctgcag ttgttgaact     8580

agatcacagc attgtaggca gaataaaaaa tgttcatatc tgagaatatt cctttcgcca     8640

tcttttccca aggccagacc tcctggtgga gcacagttaa aagtaacatt ctgggccttt     8700

gtaatcggag ggctgtgtct ccagctggca gcctttgttt taatatataa tgcaggactg     8760

tggaaaacag ttggcataga atattttcac ctaaaaaaga aagaaaagac atacaaaact     8820

ggattaattg caaaaagaga atacagtaaa ataccatata actggacaaa gctagaagaa     8880

cctttagaag atttgtctga aaacagattt caagagtgag cttttataca ctgctcacta     8940

atttgcttga ttactaccaa ctcttcttaa agttaacacg tttaaggtat ttctggactt     9000

cctagccttt tagcaagctt agaggaacta gccattagct agtgatgtaa aaatattttg     9060

gggactgatg cccttaaagg ttatgccctt gaaagttctt accttttctc tagtgatatt     9120

aaggaacgag tgggtagtgt tctcagggtg accagctgcc ctaaagtgcc tgggattgag     9180

ggtttccctg gatgcgggac tttccctgga tacaaaactt ttagcagagt tttgtatata     9240

tgtggatttt tctgataagt agcacatcag aggccttaac cactgcccaa aagcgattct     9300

ccattgagag tacatatctt gaacttaaga aattcatttg ctctgatttt taatcttgta     9360

aagtttttgc taaactcaaa acaagtccca ggcacaccag aaggagctga ccaccttagg     9420

tgttcttgtg atttatcctt acttccctat gttgtcatag ttgcttctaa actcagctgc     9480

actatggctg tcaacatttc tgatacttat tgggatatgt gccatccagt catttagtac     9540

tttgaatgga acatgagatt tataacacag gtaatagctg aaggtaccag tatggtggtg     9600

agactcacac ttagtgatcc agctaaggta actgatgtta taatggaaca gagaagaggc     9660

caactagata gctaagttct tctgaaccta tgtgtatatg taagtacaaa tcatgcgtcc     9720

ttatggggtt aaacttaatc tgaaatttac atttttcata gtaaaaggaa accaattgtt     9780

gcagatttct tttcttgtga ggaaatacat ggcctttgat gctctggcgt ctactgcatt     9840

tcccagtctg ttctgctcga gaagccagaa tgtgttgtta acatttttcc gtgaatgttg     9900

tgttaaaatg attaaatgca tcagccaatg gcaagtgaag gaattgggtg tcctgatgca     9960

gactgagcag tttctctcaa ttgtagcctc atactcataa ggtgcttacc agctagaaca    10020

ttgagcacgt gaggtgagat tttttttctc tgatggcatt aactttgtaa tgcaatatga    10080

tggatgcaga ccctgttctt gtttccctct ggaagtcctt agtggctgca tccttggtgc    10140

actgtgatgg agatattaaa tgtgttcttt gtgagctttc gttctatgat tgtcaaaagt    10200

acgatgtggt tcctttttta tttttattaa acaatgagct gaggctttat tacagctggt    10260

tttcaagtta aaattgttga atactgatgt ctttctccca cctacaccaa atattttagt    10320

ctatttaaag tacaaaaaaa gttctgctta agaaaacatt gcttacatgt cctgtgattt    10380

ctggtcaatt tttatatata tttgtgtgca tcatctgtat gtgctttcac tttttacctt    10440

gtttgctctt acctgtgtta acagccctgt caccgttgaa aggtggacag ttttcctagc    10500

attaaaagaa agccatttga gttgtttacc atgttaaaaa aaaaaaaaaa a             10551


<210>  215
<211>  397
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens SMAD family member 2 (SMAD2), transcript variant 2, 
       polypeptide; NCBI Reference Sequence: NM_001003652.3

<400>  215

Asn Thr Lys Cys Val Thr Ile Pro Ser Thr Cys Ser Glu Ile Trp Gly 
1               5                   10                  15      


Leu Ser Thr Pro Asn Thr Ile Asp Gln Trp Asp Thr Thr Gly Leu Tyr 
            20                  25                  30          


Ser Phe Ser Glu Gln Thr Arg Ser Leu Asp Gly Arg Leu Gln Val Ser 
        35                  40                  45              


His Arg Lys Gly Leu Pro His Val Ile Tyr Cys Arg Leu Trp Arg Trp 
    50                  55                  60                  


Pro Asp Leu His Ser His His Glu Leu Lys Ala Ile Glu Asn Cys Glu 
65                  70                  75                  80  


Tyr Ala Phe Asn Leu Lys Lys Asp Glu Val Cys Val Asn Pro Tyr His 
                85                  90                  95      


Tyr Gln Arg Val Glu Thr Pro Val Leu Pro Pro Val Leu Val Pro Arg 
            100                 105                 110         


His Thr Glu Ile Leu Thr Glu Leu Pro Pro Leu Asp Asp Tyr Thr His 
        115                 120                 125             


Ser Ile Pro Glu Asn Thr Asn Phe Pro Ala Gly Ile Glu Pro Gln Ser 
    130                 135                 140                 


Asn Tyr Ile Pro Glu Thr Pro Pro Pro Gly Tyr Ile Ser Glu Asp Gly 
145                 150                 155                 160 


Glu Thr Ser Asp Gln Gln Leu Asn Gln Ser Met Asp Thr Gly Ser Pro 
                165                 170                 175     


Ala Glu Leu Ser Pro Thr Thr Leu Ser Pro Val Asn His Ser Leu Asp 
            180                 185                 190         


Leu Gln Pro Val Thr Tyr Ser Glu Pro Ala Phe Trp Cys Ser Ile Ala 
        195                 200                 205             


Tyr Tyr Glu Leu Asn Gln Arg Val Gly Glu Thr Phe His Ala Ser Gln 
    210                 215                 220                 


Pro Ser Leu Thr Val Asp Gly Phe Thr Asp Pro Ser Asn Ser Glu Arg 
225                 230                 235                 240 


Phe Cys Leu Gly Leu Leu Ser Asn Val Asn Arg Asn Ala Thr Val Glu 
                245                 250                 255     


Met Thr Arg Arg His Ile Gly Arg Gly Val Arg Leu Tyr Tyr Ile Gly 
            260                 265                 270         


Gly Glu Val Phe Ala Glu Cys Leu Ser Asp Ser Ala Ile Phe Val Gln 
        275                 280                 285             


Ser Pro Asn Cys Asn Gln Arg Tyr Gly Trp His Pro Ala Thr Val Cys 
    290                 295                 300                 


Lys Ile Pro Pro Gly Cys Asn Leu Lys Ile Phe Asn Asn Gln Glu Phe 
305                 310                 315                 320 


Ala Ala Leu Leu Ala Gln Ser Val Asn Gln Gly Phe Glu Ala Val Tyr 
                325                 330                 335     


Gln Leu Thr Arg Met Cys Thr Ile Arg Met Ser Phe Val Lys Gly Trp 
            340                 345                 350         


Gly Ala Glu Tyr Arg Arg Gln Thr Val Thr Ser Thr Pro Cys Trp Ile 
        355                 360                 365             


Glu Leu His Leu Asn Gly Pro Leu Gln Trp Leu Asp Lys Val Leu Thr 
    370                 375                 380                 


Gln Met Gly Ser Pro Ser Val Arg Cys Ser Ser Met Ser 
385                 390                 395         


<210>  216
<211>  10461
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens SMAD family member 2 (SMAD2), transcript variant 3, 
       mRNA; NCBI Reference Sequence: NM_001135937.2

<400>  216
cggccgggag gcggggcggg ccgtaggcaa agggaggtgg ggaggcggtg gccggcgact       60

ccccgcgccc cgctcgcccc ccggcccttc ccgcggtgct cggcctcgtt cctttcctcc      120

tccgctccct ccgtcttcca tacccgcccc gcgcggcttt cggccggcgt gcctcgcgcc      180

ctaacgggcg gctggaggcg ccaatcagcg ggcggcaggg tgccagcccc ggggctgcgc      240

cggcgaatcg gcggggcccg cggcccaggg tggcaggcgg gtctacccgc gcggccgcgg      300

cggcggagaa gcagctcgcc agccagcagc ccgccagccg ccgggaggtt cgatacaaga      360

ggctgttttc ctagcgtggc ttgctgcctt tggtaagaac atgtcgtcca tcttgccatt      420

cacgccgcca gttgtgaaga gactgctggg atggaagaag tcagctggtg ggtctggagg      480

agcaggcgga ggagagcaga atgggcagga agaaaagtgg tgtgagaaag cagtgaaaag      540

tctggtgaag aagctaaaga aaacaggacg attagatgag cttgagaaag ccatcaccac      600

tcaaaactgt aatactaaat gtgttaccat accaaggtct cttgatggtc gtctccaggt      660

atcccatcga aaaggattgc cacatgttat atattgccga ttatggcgct ggcctgatct      720

tcacagtcat catgaactca aggcaattga aaactgcgaa tatgctttta atcttaaaaa      780

ggatgaagta tgtgtaaacc cttaccacta tcagagagtt gagacaccag ttttgcctcc      840

agtattagtg ccccgacaca ccgagatcct aacagaactt ccgcctctgg atgactatac      900

tcactccatt ccagaaaaca ctaacttccc agcaggaatt gagccacaga gtaattatat      960

tccagaaacg ccacctcctg gatatatcag tgaagatgga gaaacaagtg accaacagtt     1020

gaatcaaagt atggacacag gctctccagc agaactatct cctactactc tttcccctgt     1080

taatcatagc ttggatttac agccagttac ttactcagaa cctgcatttt ggtgttcgat     1140

agcatattat gaattaaatc agagggttgg agaaaccttc catgcatcac agccctcact     1200

cactgtagat ggctttacag acccatcaaa ttcagagagg ttctgcttag gtttactctc     1260

caatgttaac cgaaatgcca cggtagaaat gacaagaagg catataggaa gaggagtgcg     1320

cttatactac ataggtgggg aagtttttgc tgagtgccta agtgatagtg caatctttgt     1380

gcagagcccc aattgtaatc agagatatgg ctggcaccct gcaacagtgt gtaaaattcc     1440

accaggctgt aatctgaaga tcttcaacaa ccaggaattt gctgctcttc tggctcagtc     1500

tgttaatcag ggttttgaag ccgtctatca gctaactaga atgtgcacca taagaatgag     1560

ttttgtgaaa gggtggggag cagaataccg aaggcagacg gtaacaagta ctccttgctg     1620

gattgaactt catctgaatg gacctctaca gtggttggac aaagtattaa ctcagatggg     1680

atccccttca gtgcgttgct caagcatgtc ataaagcttc accaatcaag tcccatgaaa     1740

agacttaatg taacaactct tctgtcatag cattgtgtgt ggtccctatg gactgtttac     1800

tatccaaaag ttcaagagag aaaacagcac ttgaggtctc atcaattaaa gcaccttgtg     1860

gaatctgttt cctatatttg aatattagat gggaaaatta gtgtctagaa atactctccc     1920

attaaagagg aagagaagat tttaaagact taatgatgtc ttattgggca taaaactgag     1980

tgtcccaaag gtttattaat aacagtagta gttatgtgta caggtaatgt atcatgatcc     2040

agtatcacag tattgtgctg tttatataca tttttagttt gcatagatga ggtgtgtgtg     2100

tgcgctgctt cttgatctag gcaaaccttt ataaagttgc agtacctaat ctgttattcc     2160

cacttctctg ttatttttgt gtgtcttttt taatatataa tatatatcaa gattttcaaa     2220

ttatttagaa gcagattttc ctgtagaaaa actaattttt ctgcctttta ccaaaaataa     2280

actcttgggg gaagaaaagt ggattaactt ttgaaatcct tgaccttaat gtgttcagtg     2340

gggcttaaac agtcattctt tttgtggttt tttgtttttt tttgtttttt tttttaactg     2400

ctaaatctta ttataaggaa accatactga aaacctttcc aagcctcttt tttccattcc     2460

catttttgtc ctcataatca aaacagcata acatgacatc atcaccagta atagttgcat     2520

tgatactgct ggcaccagtt aattctggga tacagtaaga attcatatgg agaaagtccc     2580

tttgtcttat gcccaaattt caacaggaat aattggcttg tataatctag cagtctgttg     2640

atttatcctt ccacctcata aaaaatgcat aggtggcagt ataattattt tcagggatat     2700

gctagaatta cttccacata tttatccctt tttaaaaaag ctaatctata aataccgttt     2760

ttccaaaggt attttacaat atttcaacag cagaccttct gctcttcgag tagtttgatt     2820

tggtttagta accagattgc attatgaaat gggccttttg taaatgtaat tgtttctgca     2880

aaatacctag aaaagtgatg ctgaggtagg atcagcagat atgggccatc tgtttttaaa     2940

gtatgttgta ttcagtttat aaattgattg ttattctaca cataattatg aattcagaat     3000

tttaaaaatt gggggaaaag ccatttattt agcaagtttt ttagcttata agttacctgc     3060

agtctgagct gttcttaact gatcctggtt ttgtgattga caatatttca tgctctgtag     3120

tgagaggaga tttccgaaac tctgttgcta gttcattctg cagcaaataa ttattatgtc     3180

tgatgttgac tcattgcagt ttaaacattt cttcttgttt gcatcttagt agaaatggaa     3240

aataaccact cctggtcgtc ttttcataaa ttttcatatt tttgaagctg tctttggtac     3300

ttgttctttg aaatcatatc cacctgtctc tataggtatc attttcaata ctttcaacat     3360

ttggtggttt tctattgggt actccccatt ttcctatatt tgtgtgtata tgtatgtgtt     3420

catgtaaatt tggtatagta attttttatt cattcaacaa atatttattg ttcacctgtt     3480

tgtaccagga acttttctta gtctttgggt aaaggtgaac aagacaacta cagttcctgc     3540

ctttgctgag acagcagtta cactaaccct taattatctt acttgtctat gaaggagata     3600

aacagggtac tgtactggag aataacagat gggatgcttc aggtaggaca tcaaggaaag     3660

cctctaagga aaggatgcat gagctaacac ctgacattaa agaagcaagc caagtgagga     3720

gccaggggag ataagcattc ctggcaaaga gaatagcatc aaatgcaaaa aggttcacac     3780

taaaggaaac tcctgattag gtattaatgc tttatacaga aacctctata caaatccaaa     3840

cttgaagatc agaatggttc tacagttcat aacattttga aggtggcctt attttgtgat     3900

agtctgcttc atgtgattct cactaacata tctccttcct caacctttgc tgtaaaaatt     3960

tcatttgcac cacatcagta ctacttaatt taacaagctt ttgttgtgta agctctcact     4020

gttttagtgc cctgctgctt gcttccagac tttgtgctgt ccagtaatta tgtcttccac     4080

tacccatctt gtgagcagag taaatgtcct aggtaatacc actatcaggc ctgtaggaga     4140

tactcagtgg agcctctgcc cttctttttc ttacttgaga acttgtaatg gtgttaggga     4200

acagttgtag gggcagaaaa caactctgaa agtggtagaa ggtcctgatc ttggtggtta     4260

ctcttgcatt actgtgttag gtcaagcagt gcctactatg ctgtttcagt agtggagcgc     4320

atctctacag ttctgatgcg atttttctgt acagtatgaa attgggactc aactctttga     4380

aaacacctat tgagcagtta tacctgttga gcagtttact tcctggttgt aattacattt     4440

gtgtgaatgt gtttgatgct ttttaacgag atgatgtttt ttgtatttta tctactgtgg     4500

cctgattttt tttttgtttt ctgcccctcc ccccatttat aggtgtggtt ttcatttttc     4560

taagtgatag aatcccctct ttgttgaatt tttgtcttta tttaaattag caacattact     4620

taggatttat tcttcacaat actgttaatt ttctaggaat gatgacctga gaaccgaatg     4680

gccatgcttt ctatcacatt tctaagatga gtaatatttt ttccagtagg ttccacagag     4740

acaccttggg ggctggctta ggggaggctg ttggagttct cactgactta gtggcatatt     4800

tattctgtac tgaagaactg catggggttt cttttggaaa gagtttcatt gctttaaaaa     4860

gaagctcaga aagtctttat aaccactggt caacgattag aaaaatataa ctggatttag     4920

gcctaccttc tggaataccg ctgattgtgc tctttttatc ctactttaaa gaagctttca     4980

tgattagatt tgagctatat cagttatacc gattatacct tataatacac attcagttag     5040

taaacattta ttgatgcctg ttgtttgccc agccactgtg atggatattg aataataaaa     5100

agatgactag gacggggccc tgacccttga gctgtgcttg gtcttgtaga ggttgtgttt     5160

tttttcctca ggacctgtca ctttggcaga aggaaatctg cctaattttt cttgaaagct     5220

aaattttctt tgtaagtttt tacaaattgt ttaataccta gttgtatttt ttaccttaag     5280

ccacattgag ttttgcttga tttgtctgtc ttttaaacac tgtcaaatgc tttccctttt     5340

gttaaaatta ttttaatttc actttttttg tgcccttgtc aatttaagac taagactttg     5400

aaggtaaaac aaacaaacaa acatcagtct tagtctcttg ctagttgaaa tcaaataaaa     5460

gaaaatatat acccagttgg tttctctacc tcttaaaagc ttcccatata tacctttaag     5520

atccttctct tttttcttta actactaaat aggttcagca tttattcagt gttagatacc     5580

ctcttcgtct gagggtggcg taggtttatg ttgggatata aagtaacaca agacaatctt     5640

cactgtacat aaaatatgtc ttcatgtaca gtctttactt taaaagctga acattccaat     5700

ttgcgccttc cctcccaagc ccctgcccac caagtatctc tttagatatc tagtctgtgg     5760

acatgaacaa tgaatacttt tttcttactc tgatcgaagg cattgatact tagacatatc     5820

aaacatttct tcctttcata tgctttactt tgctaaatct attatattca ttgcctgaat     5880

tttattcttc ctttctacct gacaacacac atccaggtgg tacttgctgg ttatcctctt     5940

tcttgttagc cttgtttttt gttttttttt tttttttttg agagggagtc tcgctctgtt     6000

gcccaacctg gagtgcagtg gtgcgatctt ggttcactgc aagctccgcc tcccgggttc     6060

acgccatgct tctgcctcag cctcccaagt agctgggact acaggcgccc accaccacac     6120

tcggctaatt ttttgtattt ttagtagaga cggggtttca ccgtgttggc caggatggtc     6180

tcgatctcct gacctcgtga tctgtccacc tcggcttccc aaagtgctgg gattacaggc     6240

atgagccacc gcgcccagcc tagccatatt tttatctgca tatatcagaa tgtttctctc     6300

ctttgaactt attaacaaaa aaggaacatg cttttcatac ctagagtcct aatttcttca     6360

tcatgaaggt tgctattcaa attgatcaat cattttaatt ttacaaatgg ctcaaaaatt     6420

ctgttcagta aatgtctttg tgactggcaa atggcataaa ttatgtttaa gattatgaac     6480

ttttctgaca gttgcagcca atgttttccc tacgatacca gatttccatc ttggggcata     6540

ttggattgtt gtatttaaga cagtcagaat aatgatagtg tgtggtctcc agaggtagtc     6600

agaatcctgc tattgagttc tttttatatc ttccttttca attttttatt accattttgt     6660

ttgtttagac tacactttgt agggattgag gggcaaatta tctcttggag tggaattcct     6720

gtgttttgag ccttacaacc aggaaatatg agctatacta gatagcctca tgatagcatt     6780

tacgataaga acttatctcg tgtgttcatg taattttttg agtaggaact gttttatctt     6840

gaatattgta gctaactata tatagcagaa ctgcctcagt ctttttaaga aggaaataaa     6900

taatatatgt gtatgaattt atatatacat atacactcat agacaaactt aacagttggg     6960

gtcattctaa cagttaaaac aattgttcca ttgtttaaat ctcagatcct ggtaaaatgt     7020

tcttaatttg tctgtgtaca ttttcctttc atggacagac cattggagta cattaatttt     7080

cttaatctgc catttggcag ttcatttaat ataccatttt ttggcaactt ggtaactaag     7140

aatcacagcc aaaatttgtt aacatcaaag aaagctctgc catatacccc gttactaaat     7200

tattatacat ccagcagatt ctgggatgta ctaacttagg gttaactttg ttgttgttga     7260

taatactaga ttgctccctc tttaattctt cttctggtgc aaggttgctg cttaagttac     7320

cctgggaaat actactacaa ggtcaaattt tctagtatct tacagcctga ttgaaggtga     7380

ttcagatctt tgctcaatat aaatggattt tccaagattc tctgggccat ccttgaccca     7440

caggtgatct cgctggagta tattaactta acttcagtgc cagttggttt ggtgccatga     7500

gatccataat gaatccagaa cttcaccatt gcttagatat aagagtccct tggaagaata     7560

atgccactga tgatgggggt cagaaggtgt attaactcaa catagagggc ttttagattt     7620

ttcttcaaaa aaatttcgag aaaagtattc ttttaccctc caaacagtta acagctctta     7680

gtttctccaa atatgctctt tgatttactt atttttaatt aaagatggta atttattgaa     7740

caatgaaatc cgtaatatat tgatttaagg acaaaagtga agttttagaa ttataaaagt     7800

acttaaatat tatatatttt ccatttcata attgttttcc tttctctgtg gctttaaagt     7860

ttttgactat tttacaatgt taatcactag gtaacttgcc atatttctgg ttctatatta     7920

agttctatcc tttataatgc tgttattata aagctggttt ttagcatttg tctgtagcaa     7980

tagaaatttt actaagtctc tgttctccca gtaagttttt tcttttctca gtaagtccct     8040

aagaaaacat ttgtttgcca ctcttactat tcccaatctt ggattgttcg agctgaaaaa     8100

aaatttgatg agaaacagga ggatcctttt ctggtgaata taggttcctg ctttaagaat     8160

gtggaaatcc attgctttat ataactaata tacacacaga ttaattaaaa ttgtgagaaa     8220

taattcacac atgacaagta ggtaacatgc atgagttttg aattttttta aaaacccaac     8280

tgtttgacaa aatatagaac ccaaattggt actttcttag accagtgtaa cctcacacct     8340

cagttttgct tttccaaccc tgacttgaaa ggcatatttg tatcttttta ttagtgatag     8400

tgaagctgtg acactaacct tttatacaaa agagtaaaga aagaaaaact acagcgatta     8460

agatgagaac agttctgcag ttgttgaact agatcacagc attgtaggca gaataaaaaa     8520

tgttcatatc tgagaatatt cctttcgcca tcttttccca aggccagacc tcctggtgga     8580

gcacagttaa aagtaacatt ctgggccttt gtaatcggag ggctgtgtct ccagctggca     8640

gcctttgttt taatatataa tgcaggactg tggaaaacag ttggcataga atattttcac     8700

ctaaaaaaga aagaaaagac atacaaaact ggattaattg caaaaagaga atacagtaaa     8760

ataccatata actggacaaa gctagaagaa cctttagaag atttgtctga aaacagattt     8820

caagagtgag cttttataca ctgctcacta atttgcttga ttactaccaa ctcttcttaa     8880

agttaacacg tttaaggtat ttctggactt cctagccttt tagcaagctt agaggaacta     8940

gccattagct agtgatgtaa aaatattttg gggactgatg cccttaaagg ttatgccctt     9000

gaaagttctt accttttctc tagtgatatt aaggaacgag tgggtagtgt tctcagggtg     9060

accagctgcc ctaaagtgcc tgggattgag ggtttccctg gatgcgggac tttccctgga     9120

tacaaaactt ttagcagagt tttgtatata tgtggatttt tctgataagt agcacatcag     9180

aggccttaac cactgcccaa aagcgattct ccattgagag tacatatctt gaacttaaga     9240

aattcatttg ctctgatttt taatcttgta aagtttttgc taaactcaaa acaagtccca     9300

ggcacaccag aaggagctga ccaccttagg tgttcttgtg atttatcctt acttccctat     9360

gttgtcatag ttgcttctaa actcagctgc actatggctg tcaacatttc tgatacttat     9420

tgggatatgt gccatccagt catttagtac tttgaatgga acatgagatt tataacacag     9480

gtaatagctg aaggtaccag tatggtggtg agactcacac ttagtgatcc agctaaggta     9540

actgatgtta taatggaaca gagaagaggc caactagata gctaagttct tctgaaccta     9600

tgtgtatatg taagtacaaa tcatgcgtcc ttatggggtt aaacttaatc tgaaatttac     9660

atttttcata gtaaaaggaa accaattgtt gcagatttct tttcttgtga ggaaatacat     9720

ggcctttgat gctctggcgt ctactgcatt tcccagtctg ttctgctcga gaagccagaa     9780

tgtgttgtta acatttttcc gtgaatgttg tgttaaaatg attaaatgca tcagccaatg     9840

gcaagtgaag gaattgggtg tcctgatgca gactgagcag tttctctcaa ttgtagcctc     9900

atactcataa ggtgcttacc agctagaaca ttgagcacgt gaggtgagat tttttttctc     9960

tgatggcatt aactttgtaa tgcaatatga tggatgcaga ccctgttctt gtttccctct    10020

ggaagtcctt agtggctgca tccttggtgc actgtgatgg agatattaaa tgtgttcttt    10080

gtgagctttc gttctatgat tgtcaaaagt acgatgtggt tcctttttta tttttattaa    10140

acaatgagct gaggctttat tacagctggt tttcaagtta aaattgttga atactgatgt    10200

ctttctccca cctacaccaa atattttagt ctatttaaag tacaaaaaaa gttctgctta    10260

agaaaacatt gcttacatgt cctgtgattt ctggtcaatt tttatatata tttgtgtgca    10320

tcatctgtat gtgctttcac tttttacctt gtttgctctt acctgtgtta acagccctgt    10380

caccgttgaa aggtggacag ttttcctagc attaaaagaa agccatttga gttgtttacc    10440

atgttaaaaa aaaaaaaaaa a                                              10461


<210>  217
<211>  437
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens SMAD family member 2 (SMAD2), transcript variant 3, 
       polypeptide NCBI Reference Sequence: NM_001135937.2

<400>  217

Met Ser Ser Ile Leu Pro Phe Thr Pro Pro Val Val Lys Arg Leu Leu 
1               5                   10                  15      


Gly Trp Lys Lys Ser Ala Gly Gly Ser Gly Gly Ala Gly Gly Gly Glu 
            20                  25                  30          


Gln Asn Gly Gln Glu Glu Lys Trp Cys Glu Lys Ala Val Lys Ser Leu 
        35                  40                  45              


Val Lys Lys Leu Lys Lys Thr Gly Arg Leu Asp Glu Leu Glu Lys Ala 
    50                  55                  60                  


Ile Thr Thr Gln Asn Cys Asn Thr Lys Cys Val Thr Ile Pro Arg Ser 
65                  70                  75                  80  


Leu Asp Gly Arg Leu Gln Val Ser His Arg Lys Gly Leu Pro His Val 
                85                  90                  95      


Ile Tyr Cys Arg Leu Trp Arg Trp Pro Asp Leu His Ser His His Glu 
            100                 105                 110         


Leu Lys Ala Ile Glu Asn Cys Glu Tyr Ala Phe Asn Leu Lys Lys Asp 
        115                 120                 125             


Glu Val Cys Val Asn Pro Tyr His Tyr Gln Arg Val Glu Thr Pro Val 
    130                 135                 140                 


Leu Pro Pro Val Leu Val Pro Arg His Thr Glu Ile Leu Thr Glu Leu 
145                 150                 155                 160 


Pro Pro Leu Asp Asp Tyr Thr His Ser Ile Pro Glu Asn Thr Asn Phe 
                165                 170                 175     


Pro Ala Gly Ile Glu Pro Gln Ser Asn Tyr Ile Pro Glu Thr Pro Pro 
            180                 185                 190         


Pro Gly Tyr Ile Ser Glu Asp Gly Glu Thr Ser Asp Gln Gln Leu Asn 
        195                 200                 205             


Gln Ser Met Asp Thr Gly Ser Pro Ala Glu Leu Ser Pro Thr Thr Leu 
    210                 215                 220                 


Ser Pro Val Asn His Ser Leu Asp Leu Gln Pro Val Thr Tyr Ser Glu 
225                 230                 235                 240 


Pro Ala Phe Trp Cys Ser Ile Ala Tyr Tyr Glu Leu Asn Gln Arg Val 
                245                 250                 255     


Gly Glu Thr Phe His Ala Ser Gln Pro Ser Leu Thr Val Asp Gly Phe 
            260                 265                 270         


Thr Asp Pro Ser Asn Ser Glu Arg Phe Cys Leu Gly Leu Leu Ser Asn 
        275                 280                 285             


Val Asn Arg Asn Ala Thr Val Glu Met Thr Arg Arg His Ile Gly Arg 
    290                 295                 300                 


Gly Val Arg Leu Tyr Tyr Ile Gly Gly Glu Val Phe Ala Glu Cys Leu 
305                 310                 315                 320 


Ser Asp Ser Ala Ile Phe Val Gln Ser Pro Asn Cys Asn Gln Arg Tyr 
                325                 330                 335     


Gly Trp His Pro Ala Thr Val Cys Lys Ile Pro Pro Gly Cys Asn Leu 
            340                 345                 350         


Lys Ile Phe Asn Asn Gln Glu Phe Ala Ala Leu Leu Ala Gln Ser Val 
        355                 360                 365             


Asn Gln Gly Phe Glu Ala Val Tyr Gln Leu Thr Arg Met Cys Thr Ile 
    370                 375                 380                 


Arg Met Ser Phe Val Lys Gly Trp Gly Ala Glu Tyr Arg Arg Gln Thr 
385                 390                 395                 400 


Val Thr Ser Thr Pro Cys Trp Ile Glu Leu His Leu Asn Gly Pro Leu 
                405                 410                 415     


Gln Trp Leu Asp Lys Val Leu Thr Gln Met Gly Ser Pro Ser Val Arg 
            420                 425                 430         


Cys Ser Ser Met Ser 
        435         


<210>  218
<211>  5441
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens SMAD family member 3 (SMAD3), transcript variant 4, 
       mRNA

<400>  218
cttctcagat cctttgcggg tagccctggc gtcccgcgga gaccccaccc cctggctacc       60

tgagtgaaga tggagaaacc agtgaccacc agatgaacca cagcatggac gcaggttctc      120

caaacctatc cccgaatccg atgtccccag cacataataa cttggacctg cagccagtta      180

cctactgcga gccggccttc tggtgctcca tctcctacta cgagctgaac cagcgcgtcg      240

gggagacatt ccacgcctcg cagccatcca tgactgtgga tggcttcacc gacccctcca      300

attcggagcg cttctgccta gggctgctct ccaatgtcaa caggaatgca gcagtggagc      360

tgacacggag acacatcgga agaggcgtgc ggctctacta catcggaggg gaggtcttcg      420

cagagtgcct cagtgacagc gctatttttg tccagtctcc caactgtaac cagcgctatg      480

gctggcaccc ggccaccgtc tgcaagatcc caccaggatg caacctgaag atcttcaaca      540

accaggagtt cgctgccctc ctggcccagt cggtcaacca gggctttgag gctgtctacc      600

agttgacccg aatgtgcacc atccgcatga gcttcgtcaa aggctgggga gcggagtaca      660

ggagacagac tgtgaccagt accccctgct ggattgagct gcacctgaat gggcctttgc      720

agtggcttga caaggtcctc acccagatgg gctccccaag catccgctgt tccagtgtgt      780

cttagagaca tcaagtatgg taggggaggg caggcttggg gaaaatggcc atgcaggagg      840

tggagaaaat tggaactcta ctcaacccat tgttgtcaag gaagaagaaa tctttctccc      900

tcaactgaag gggtgcaccc acctgttttc tgaaacacac gagcaaaccc agaggtggat      960

gttatgaaca gctgtgtctg ccaaacacat ttaccctttg gccccacttt gaagggcaag     1020

aaatggcgtc tgctctggtg gcttaagtga gcagaacagg tagtattaca ccaccggccc     1080

cctcccccca gactcttttt ttgagtgaca gctttctggg atgtcacagt ccaaccagaa     1140

acacccctct gtctaggact gcagtgtgga gttcaccttg gaagggcgtt ctaggtagga     1200

agagcccgca gggccatgca gacctcatgc ccagctctct gacgcttgtg acagtgcctc     1260

ttccagtgaa cattcccagc ccagccccgc cccgccccgc cccaccactc cagcagacct     1320

tgccccttgt gagctggata gacttgggat ggggagggag ggagttttgt ctgtctccct     1380

cccctctcag aacatactga ttgggaggtg cgtgttcagc agaacctgca cacaggacag     1440

cgggaaaaat cgatgagcgc cacctcttta aaaactcact tacgtttgtc ctttttcact     1500

ttgaaaagtt ggaaggatct gctgaggccc agtgcatatg caatgtatag tgtctattat     1560

cacattaatc tcaaagagat tcgaatgacg gtaagtgttc tcatgaagca ggaggccctt     1620

gtcgtgggat ggcatttggt ctcaggcagc accacactgg gtgcgtctcc agtcatctgt     1680

aagagcttgc tccagattct gatgcatacg gctatattgg tttatgtagt cagttgcatt     1740

cattaaatca actttatcat atgctctttt aaatgtttgg tttatatatt ttctttaaaa     1800

atcctgggct ggcacattga ctgggaaacc tgagtgagac ccagcaactg cttctctccc     1860

ttctctctcc tgaggtgaag cttttccagg ttttgttgaa gagatacctg ccagcacttc     1920

tgcaagctga aatttacaga agcaaattca ccagaaggga aacatctcag gccaacatag     1980

gcaaatgaaa agggctatta aaatattttt acacctttga aaattgcagg cttggtacaa     2040

agaggtctgt catcttcccc ctgggatata agatgatcta gctcccggta gaggatcacc     2100

ggtgacaact atagcagttg tattgtgtaa caagtactgc tcccagcagc aattagggag     2160

aaaactagtc taaattattt caactggaaa aaagaaaaaa gagtcctctt cttttcccag     2220

ccttttgcag aacacagtag acagaactgc caccttcaat tggtacttta ttctttgctg     2280

ctgtttttgt ataaaatgac ctatcccacg tttttgcatg aatttatagc aggaaaaatc     2340

aagggatttc ctatggaagt cctgctttat tccaggtgaa gggaaggaag tgtatatact     2400

tttggcaagt catacagctc aaatgtgatg agatttctga tgttagaggg agatggagag     2460

gcttcctgat gcctcatctg cagggtcctg tgcctctgaa gttctagcca tgaggtttcc     2520

aggtaggaca gctgctcccc aagcctcctg aggacacagg aagagacgga aggagcacct     2580

tgacagactt gtgtgagtct tctcgaagga gggttgactc agaacccaga gacaatacaa     2640

aacccctcac ttcctctgag agggccaaat gctgtgagtc tgaagtatgt gcctggtgtg     2700

aaatgatcta tggcctgttt cttacacagg aagccccctg aacctcctgt acatgtgttc     2760

atgttcccag ccagctctga gacccaggaa ccaaatattc cattttggct tctgctagag     2820

cagtcatggt tcctctccta aaagccatgg gcagcagttt ccgagggcct gcatgatcca     2880

cctgctgcac gatcctatga gggcttcctg tggcacacag ccctctgggt gcttgggaac     2940

tagcttcagg cacagcctga ttctggtgat ccagtgatct atggaagtcg tgtcttactc     3000

caggtgaagg gggaaaaaaa aagcctatac tttggcaggt tatgaacttt gaatgtgatg     3060

aaatgacacg tttggctgca tttggatggt gtcttagaac cctcattgct cagacctgaa     3120

ggctacttct aggagcatga agtttgagtt ttgtgttttt ccaaaggata cttccttggc     3180

cctttttctt tattgactag accaccagag gaggatgtgt gggattgtag gcaaacccac     3240

ctgtggcatc actgaaaata aatttgatca tacctaagag gttaggaaat ggtgccattc     3300

ccaccttaga gtgctacata ggtgctttgg gcgtatgtaa cattagtgtc cttccttgaa     3360

gccacaagct agttttctta gttttaaaat cctgttgtat gaatggcatt tgtatattaa     3420

aacacttttt taaaggacag ttgaaaaggg caagaggaaa ccagggcagt tctagaggag     3480

tgctggtgac tggatagcag ttttaagtgg cgttcaccta gtcaacacga ccgcgtgtgt     3540

tgcccctgcc ctgggctccc cgccatgaca tcttcacctt gcagcttgtg ctgagactga     3600

cccaagtgca gctagcactg ggacacagat ccttgtcttc agcaccttcc aaggagccaa     3660

cttttattcc ctttcctctc tcccctcccc acctcgcttc ttcccaattt agtaacttag     3720

atgcttccag cacatacgta ggtagctacc ccagccggtt tggattacag gcctgtgctg     3780

gaacatcatc tcagttggcc accttcctgg caggctgtag acctgacatt ttgagacaag     3840

cctagagtca ggagcaggga ctttgactct taggaagagc acacatgagg gcaaggctgc     3900

tggcagacgt ctccattgtc cttatgttgt ctgtgttgta tttttttttt tttattgacc     3960

atggtgatta tttttttaaa ccatcgttaa tatactgaag tgagctatag cacatatcat     4020

gtgcttagtt tgtttatttt tctccatctc cccttggctt cctagagttt ggacatattc     4080

caggctaaat gcttttactc aagactacag aaaggtttga agtagtgtgt gcatggcatg     4140

cacgtatgta agtaatctgg ggaagaagca aagatctgtt tcattcttag cctcaggcct     4200

catgagggtc tccacagggc cggagctcag gttacaccac tccttcgtcc ttacaggaga     4260

tgtagggaga agaatctgca ggctgcttgt aggactgttc accaaggggg ataccagcag     4320

caagagagtg cacccgttta gccctggacc ctgtttctta ctgtgtgact tggctagagt     4380

tgggagttcc cccaaaataa acgtgtcccc attttaccag aaccaaacct caacacagcg     4440

aagctgtact gtctttgtgt ggcaaagatg ttcccttgta ggcccctttc aggtaaccgt     4500

cttcacaatg tattttcatc acagtttaag gagcatcagc cgcttctcaa gtgggtaggg     4560

aaagcagaaa aacgtacgca agaggacatg gatccaaaat gatgatgaag catctcccat     4620

ggggaggtga tggtggggag atgatgggct aaacaggcaa cttttcaaaa acacagctat     4680

catagaaaag aaacttgcct catgtaaact ggattgagaa attctcagtg attctgcaat     4740

ggattttttt ttaatgcaga agtaatgtat actctagtat tctggtgttt ttatatttat     4800

gtaataattt cttaaaacca ttcagacaga taactattta atttttttta agaaagttgg     4860

aaaggtctct cctcccaagg acagtggctg gaagagttgg ggcacagcca gttctgaatg     4920

ttggtggagg gtgtagtggc tttttggctc agcatccaga aacaccaaac caggctggct     4980

aaacaagtgg ccgcgtgtaa aaacagacag ctctgagtca aatctgggcc cttccacaag     5040

ggtcctctga accaagcccc actcccttgc taggggtgaa agcattacag agagatggag     5100

ccatctatcc aagaagcctt cactcacctt cactgctgct gttgcaactc ggctgttctg     5160

gactctgatg tgtgtggagg gatggggaat agaacattga ctgtgttgat taccttcact     5220

attcggccag cctgaccttt taataacttt gtaaaaagca tgtatgtatt tatagtgttt     5280

tagatttttc taacttttat atcttaaaag cagagcacct gtttaagcat tgtaccccta     5340

ttgttaaaga tttgtgtcct ctcattccct ctcttcctct tgtaagtgcc cttctaataa     5400

acttttcatg gaaaagctcc tgtgccagga gctcagtctg a                         5441


<210>  219
<211>  230
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens SMAD family member 3 (SMAD3), transcript variant 4, 
       polypeptide NCBI Reference Sequence: NM_001145104.1

<400>  219

Met Asn His Ser Met Asp Ala Gly Ser Pro Asn Leu Ser Pro Asn Pro 
1               5                   10                  15      


Met Ser Pro Ala His Asn Asn Leu Asp Leu Gln Pro Val Thr Tyr Cys 
            20                  25                  30          


Glu Pro Ala Phe Trp Cys Ser Ile Ser Tyr Tyr Glu Leu Asn Gln Arg 
        35                  40                  45              


Val Gly Glu Thr Phe His Ala Ser Gln Pro Ser Met Thr Val Asp Gly 
    50                  55                  60                  


Phe Thr Asp Pro Ser Asn Ser Glu Arg Phe Cys Leu Gly Leu Leu Ser 
65                  70                  75                  80  


Asn Val Asn Arg Asn Ala Ala Val Glu Leu Thr Arg Arg His Ile Gly 
                85                  90                  95      


Arg Gly Val Arg Leu Tyr Tyr Ile Gly Gly Glu Val Phe Ala Glu Cys 
            100                 105                 110         


Leu Ser Asp Ser Ala Ile Phe Val Gln Ser Pro Asn Cys Asn Gln Arg 
        115                 120                 125             


Tyr Gly Trp His Pro Ala Thr Val Cys Lys Ile Pro Pro Gly Cys Asn 
    130                 135                 140                 


Leu Lys Ile Phe Asn Asn Gln Glu Phe Ala Ala Leu Leu Ala Gln Ser 
145                 150                 155                 160 


Val Asn Gln Gly Phe Glu Ala Val Tyr Gln Leu Thr Arg Met Cys Thr 
                165                 170                 175     


Ile Arg Met Ser Phe Val Lys Gly Trp Gly Ala Glu Tyr Arg Arg Gln 
            180                 185                 190         


Thr Val Thr Ser Thr Pro Cys Trp Ile Glu Leu His Leu Asn Gly Pro 
        195                 200                 205             


Leu Gln Trp Leu Asp Lys Val Leu Thr Gln Met Gly Ser Pro Ser Ile 
    210                 215                 220                 


Arg Cys Ser Ser Val Ser 
225                 230 


<210>  220
<211>  5916
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human ERK2 (alias MAPK1) mRNA transcript variant 1 GenBank 
       Accession No.: NM_002745.4  GI:75709178

<400>  220
gcccctccct ccgcccgccc gccggcccgc ccgtcagtct ggcaggcagg caggcaatcg       60

gtccgagtgg ctgtcggctc ttcagctctc ccgctcggcg tcttccttcc tcctcccggt      120

cagcgtcggc ggctgcaccg gcggcggcgc agtccctgcg ggaggggcga caagagctga      180

gcggcggccg ccgagcgtcg agctcagcgc ggcggaggcg gcggcggccc ggcagccaac      240

atggcggcgg cggcggcggc gggcgcgggc ccggagatgg tccgcgggca ggtgttcgac      300

gtggggccgc gctacaccaa cctctcgtac atcggcgagg gcgcctacgg catggtgtgc      360

tctgcttatg ataatgtcaa caaagttcga gtagctatca agaaaatcag cccctttgag      420

caccagacct actgccagag aaccctgagg gagataaaaa tcttactgcg cttcagacat      480

gagaacatca ttggaatcaa tgacattatt cgagcaccaa ccatcgagca aatgaaagat      540

gtatatatag tacaggacct catggaaaca gatctttaca agctcttgaa gacacaacac      600

ctcagcaatg accatatctg ctattttctc taccagatcc tcagagggtt aaaatatatc      660

cattcagcta acgttctgca ccgtgacctc aagccttcca acctgctgct caacaccacc      720

tgtgatctca agatctgtga ctttggcctg gcccgtgttg cagatccaga ccatgatcac      780

acagggttcc tgacagaata tgtggccaca cgttggtaca gggctccaga aattatgttg      840

aattccaagg gctacaccaa gtccattgat atttggtctg taggctgcat tctggcagaa      900

atgctttcta acaggcccat ctttccaggg aagcattatc ttgaccagct gaaccacatt      960

ttgggtattc ttggatcccc atcacaagaa gacctgaatt gtataataaa tttaaaagct     1020

aggaactatt tgctttctct tccacacaaa aataaggtgc catggaacag gctgttccca     1080

aatgctgact ccaaagctct ggacttattg gacaaaatgt tgacattcaa cccacacaag     1140

aggattgaag tagaacaggc tctggcccac ccatatctgg agcagtatta cgacccgagt     1200

gacgagccca tcgccgaagc accattcaag ttcgacatgg aattggatga cttgcctaag     1260

gaaaagctca aagaactaat ttttgaagag actgctagat tccagccagg atacagatct     1320

taaatttgtc aggacaaggg ctcagaggac tggacgtgct cagacatcgg tgttcttctt     1380

cccagttctt gacccctggt cctgtctcca gcccgtcttg gcttatccac tttgactcct     1440

ttgagccgtt tggaggggcg gtttctggta gttgtggctt ttatgctttc aaagaatttc     1500

ttcagtccag agaattcctc ctggcagccc tgtgtgtgtc acccattggt gacctgcggc     1560

agtatgtact tcagtgcacc tactgcttac tgttgcttta gtcactaatt gctttctggt     1620

ttgaaagatg cagtggttcc tccctctcct gaatcctttt ctacatgatg ccctgctgac     1680

catgcagccg caccagagag agattcttcc ccaattggct ctagtcactg gcatctcact     1740

ttatgatagg gaaggctact acctagggca ctttaagtca gtgacagccc cttatttgca     1800

cttcaccttt tgaccataac tgtttcccca gagcaggagc ttgtggaaat accttggctg     1860

atgttgcagc ctgcagcaag tgcttccgtc tccggaatcc ttggggagca cttgtccacg     1920

tcttttctca tatcatggta gtcactaaca tatataaggt atgtgctatt ggcccagctt     1980

ttagaaaatg cagtcatttt tctaaataaa aaggaagtac tgcacccagc agtgtcactc     2040

tgtagttact gtggtcactt gtaccatata gaggtgtaac acttgtcaag aagcgttatg     2100

tgcagtactt aatgtttgta agacttacaa aaaaagattt aaagtggcag cttcactcga     2160

catttggtga gagaagtaca aaggttgcag tgctgagctg tgggcggttt ctggggatgt     2220

cccagggtgg aactccacat gctggtgcat atacgccctt gagctacttc aaatgtgggt     2280

gtttcagtaa ccacgttcca tgcctgagga tttagcagag aggaacactg cgtctttaaa     2340

tgagaaagta tacaattctt tttccttcta cagcatgtca gcatctcaag ttcatttttc     2400

aacctacagt ataacaattt gtaataaagc ctccaggagc tcatgacgtg aagcactgtt     2460

ctgtcctcaa gtactcaaat atttctgata ctgctgagtc agactgtcag aaaaagctag     2520

cactaactcg tgtttggagc tctatccata ttttactgat ctctttaagt atttgttcct     2580

gccactgtgt actgtggagt tgactcggtg ttctgtccca gtgcggtgcc tcctcttgac     2640

ttccccactg ctctctgtgg tgagaaattt gccttgttca ataattactg taccctcgca     2700

tgactgttac agctttctgt gcagagatga ctgtccaagt gccacatgcc tacgattgaa     2760

atgaaaactc tattgttacc tctgagttgt gttccacgga aaatgctatc cagcagatca     2820

tttaggaaaa ataattctat ttttagcttt tcatttctca gctgtccttt tttcttgttt     2880

gatttttgac agcaatggag aatgggttat ataaagactg cctgctaata tgaacagaaa     2940

tgcatttgta attcatgaaa ataaatgtac atcttctatc ttcacattca tgttaagatt     3000

cagtgttgct ttcctctgga tcagcgtgtc tgaatggaca gtcaggttca ggttgtgctg     3060

aacacagaaa tgctcacagg cctcactttg ccgcccaggc actggcccag cacttggatt     3120

tacataagat gagttagaaa ggtacttctg tagggtcctt tttacctctg ctcggcagag     3180

aatcgatgct gtcatgttcc tttattcaca atcttaggtc tcaaatattc tgtcaaaccc     3240

taacaaagaa gccccgacat ctcaggttgg attccctggt tctctctaaa gagggcctgc     3300

ccttgtgccc cagaggtgct gctgggcaca gccaagagtt gggaagggcc gccccacagt     3360

acgcagtcct caccacccag cccagggtgc tcacgctcac cactcctgtg gctgaggaag     3420

gatagctggc tcatcctcgg aaaacagacc cacatctcta ttcttgccct gaaatacgcg     3480

cttttcactt gcgtgctcag agctgccgtc tgaaggtcca cacagcattg acgggacaca     3540

gaaatgtgac tgttaccgga taacactgat tagtcagttt tcatttataa aaaagcattg     3600

acagttttat tactcttgtt tctttttaaa tggaaagtta ctattataag gttaatttgg     3660

agtcctcttc taaatagaaa accatatcct tggctactaa catctggaga ctgtgagctc     3720

cttcccattc cccttcctgg tactgtggag tcagattggc atgaaaccac taacttcatt     3780

ctagaatcat tgtagccata agttgtgtgc tttttattaa tcatgccaaa cataatgtaa     3840

ctgggcagag aatggtccta accaaggtac ctatgaaaag cgctagctat catgtgtagt     3900

agatgcatca ttttggctct tcttacattt gtaaaaatgt acagattagg tcatcttaat     3960

tcatattagt gacacggaac agcacctcca ctatttgtat gttcaaataa gctttcagac     4020

taatagcttt tttggtgtct aaaatgtaag caaaaaattc ctgctgaaac attccagtcc     4080

tttcatttag tataaaagaa atactgaaca agccagtggg atggaattga aagaactaat     4140

catgaggact ctgtcctgac acaggtcctc aaagctagca gagatacgca gacattgtgg     4200

catctgggta gaagaatact gtattgtgtg tgcagtgcac agtgtgtggt gtgtgcacac     4260

tcattccttc tgctcttggg cacaggcagt gggtgtagag gtaaccagta gctttgagaa     4320

gctacatgta gctcaccagt ggttttctct aaggaatcac aaaagtaaac tacccaacca     4380

catgccacgt aatatttcag ccattcagag gaaactgttt tctctttatt tgcttatatg     4440

ttaatatggt ttttaaattg gtaactttta tatagtatgg taacagtatg ttaatacaca     4500

catacatacg cacacatgct ttgggtcctt ccataatact tttatatttg taaatcaatg     4560

ttttggagca atcccaagtt taagggaaat atttttgtaa atgtaatggt tttgaaaatc     4620

tgagcaatcc ttttgcttat acatttttaa agcatttgtg ctttaaaatt gttatgctgg     4680

tgtttgaaac atgatactcc tgtggtgcag atgagaagct ataacagtga atatgtggtt     4740

tctcttacgt catccacctt gacatgatgg gtcagaaaca aatggaaatc cagagcaagt     4800

cctccagggt tgcaccaggt ttacctaaag cttgttgcct tttcttgtgc tgtttatgcg     4860

tgtagagcac tcaagaaagt tctgaaactg ctttgtatct gctttgtact gttggtgcct     4920

tcttggtatt gtaccccaaa attctgcata gattatttag tataatggta agttaaaaaa     4980

tgttaaagga agattttatt aagaatctga atgtttattc attatattgt tacaatttaa     5040

cattaacatt tatttgtggt atttgtgatt tggttaatct gtataaaaat tgtaagtaga     5100

aaggtttata tttcatctta attcttttga tgttgtaaac gtacttttta aaagatggat     5160

tatttgaatg tttatggcac ctgacttgta aaaaaaaaaa actacaaaaa aatccttaga     5220

atcattaaat tgtgtccctg tattaccaaa ataacacagc accgtgcatg tatagtttaa     5280

ttgcagtttc atctgtgaaa acgtgaaatt gtctagtcct tcgttatgtt ccccagatgt     5340

cttccagatt tgctctgcat gtggtaactt gtgttagggc tgtgagctgt tcctcgagtt     5400

gaatggggat gtcagtgctc ctagggttct ccaggtggtt cttcagacct tcacctgtgg     5460

gggggggggt aggcggtgcc cacgcccatc tcctcatcct cctgaacttc tgcaacccca     5520

ctgctgggca gacatcctgg gcaacccctt ttttcagagc aagaagtcat aaagatagga     5580

tttcttggac atttggttct tatcaatatt gggcattatg taatgactta tttacaaaac     5640

aaagatactg gaaaatgttt tggatgtggt gttatggaaa gagcacaggc cttggaccca     5700

tccagctggg ttcagaacta ccccctgctt ataactgcgg ctggctgtgg gccagtcatt     5760

ctgcgtctct gctttcttcc tctgcttcag actgtcagct gtaaagtgga agcaatatta     5820

cttgccttgt atatggtaaa gattataaaa atacatttca actgttcagc atagtacttc     5880

aaagcaagta ctcagtaaat agcaagtctt tttaaa                               5916


<210>  221
<211>  360
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human ERK2 (alias MAPK1) polypeptide encoded by mRNA transcript 
       variant 1 GenBank Accession No.: NP_002736.3  GI:66932916

<400>  221

Met Ala Ala Ala Ala Ala Ala Gly Ala Gly Pro Glu Met Val Arg Gly 
1               5                   10                  15      


Gln Val Phe Asp Val Gly Pro Arg Tyr Thr Asn Leu Ser Tyr Ile Gly 
            20                  25                  30          


Glu Gly Ala Tyr Gly Met Val Cys Ser Ala Tyr Asp Asn Val Asn Lys 
        35                  40                  45              


Val Arg Val Ala Ile Lys Lys Ile Ser Pro Phe Glu His Gln Thr Tyr 
    50                  55                  60                  


Cys Gln Arg Thr Leu Arg Glu Ile Lys Ile Leu Leu Arg Phe Arg His 
65                  70                  75                  80  


Glu Asn Ile Ile Gly Ile Asn Asp Ile Ile Arg Ala Pro Thr Ile Glu 
                85                  90                  95      


Gln Met Lys Asp Val Tyr Ile Val Gln Asp Leu Met Glu Thr Asp Leu 
            100                 105                 110         


Tyr Lys Leu Leu Lys Thr Gln His Leu Ser Asn Asp His Ile Cys Tyr 
        115                 120                 125             


Phe Leu Tyr Gln Ile Leu Arg Gly Leu Lys Tyr Ile His Ser Ala Asn 
    130                 135                 140                 


Val Leu His Arg Asp Leu Lys Pro Ser Asn Leu Leu Leu Asn Thr Thr 
145                 150                 155                 160 


Cys Asp Leu Lys Ile Cys Asp Phe Gly Leu Ala Arg Val Ala Asp Pro 
                165                 170                 175     


Asp His Asp His Thr Gly Phe Leu Thr Glu Tyr Val Ala Thr Arg Trp 
            180                 185                 190         


Tyr Arg Ala Pro Glu Ile Met Leu Asn Ser Lys Gly Tyr Thr Lys Ser 
        195                 200                 205             


Ile Asp Ile Trp Ser Val Gly Cys Ile Leu Ala Glu Met Leu Ser Asn 
    210                 215                 220                 


Arg Pro Ile Phe Pro Gly Lys His Tyr Leu Asp Gln Leu Asn His Ile 
225                 230                 235                 240 


Leu Gly Ile Leu Gly Ser Pro Ser Gln Glu Asp Leu Asn Cys Ile Ile 
                245                 250                 255     


Asn Leu Lys Ala Arg Asn Tyr Leu Leu Ser Leu Pro His Lys Asn Lys 
            260                 265                 270         


Val Pro Trp Asn Arg Leu Phe Pro Asn Ala Asp Ser Lys Ala Leu Asp 
        275                 280                 285             


Leu Leu Asp Lys Met Leu Thr Phe Asn Pro His Lys Arg Ile Glu Val 
    290                 295                 300                 


Glu Gln Ala Leu Ala His Pro Tyr Leu Glu Gln Tyr Tyr Asp Pro Ser 
305                 310                 315                 320 


Asp Glu Pro Ile Ala Glu Ala Pro Phe Lys Phe Asp Met Glu Leu Asp 
                325                 330                 335     


Asp Leu Pro Lys Glu Lys Leu Lys Glu Leu Ile Phe Glu Glu Thr Ala 
            340                 345                 350         


Arg Phe Gln Pro Gly Tyr Arg Ser 
        355                 360 


<210>  222
<211>  1514
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens mitogen-activated protein kinase 1 (MAPK1), 
       transcript variant 2, mRNA NCBI Reference Sequence: NM_138957.3

<400>  222
gcccctccct ccgcccgccc gccggcccgc ccgtcagtct ggcaggcagg caggcaatcg       60

gtccgagtgg ctgtcggctc ttcagctctc ccgctcggcg tcttccttcc tcctcccggt      120

cagcgtcggc ggctgcaccg gcggcggcgc agtccctgcg ggaggggcga caagagctga      180

gcggcggccg ccgagcgtcg agctcagcgc ggcggaggcg gcggcggccc ggcagccaac      240

atggcggcgg cggcggcggc gggcgcgggc ccggagatgg tccgcgggca ggtgttcgac      300

gtggggccgc gctacaccaa cctctcgtac atcggcgagg gcgcctacgg catggtgtgc      360

tctgcttatg ataatgtcaa caaagttcga gtagctatca agaaaatcag cccctttgag      420

caccagacct actgccagag aaccctgagg gagataaaaa tcttactgcg cttcagacat      480

gagaacatca ttggaatcaa tgacattatt cgagcaccaa ccatcgagca aatgaaagat      540

gtatatatag tacaggacct catggaaaca gatctttaca agctcttgaa gacacaacac      600

ctcagcaatg accatatctg ctattttctc taccagatcc tcagagggtt aaaatatatc      660

cattcagcta acgttctgca ccgtgacctc aagccttcca acctgctgct caacaccacc      720

tgtgatctca agatctgtga ctttggcctg gcccgtgttg cagatccaga ccatgatcac      780

acagggttcc tgacagaata tgtggccaca cgttggtaca gggctccaga aattatgttg      840

aattccaagg gctacaccaa gtccattgat atttggtctg taggctgcat tctggcagaa      900

atgctttcta acaggcccat ctttccaggg aagcattatc ttgaccagct gaaccacatt      960

ttgggtattc ttggatcccc atcacaagaa gacctgaatt gtataataaa tttaaaagct     1020

aggaactatt tgctttctct tccacacaaa aataaggtgc catggaacag gctgttccca     1080

aatgctgact ccaaagctct ggacttattg gacaaaatgt tgacattcaa cccacacaag     1140

aggattgaag tagaacaggc tctggcccac ccatatctgg agcagtatta cgacccgagt     1200

gacgagccca tcgccgaagc accattcaag ttcgacatgg aattggatga cttgcctaag     1260

gaaaagctca aagaactaat ttttgaagag actgctagat tccagccagg atacagatct     1320

taaatttgtc aggtacctgg agtttaatac agtgagctct agcaagggag gcgctgcctt     1380

ttgtttctag aatattatgt tcctcaaggt ccattatttt gtattctttt ccaagctcct     1440

tattggaagg tattttttta aatttagaat taaaaattat ttagaaagtt acatataaaa     1500

aaaaaaaaaa aaaa                                                       1514


<210>  223
<211>  360
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens mitogen-activated protein kinase 1 (MAPK1), 
       transcript variant 2, polypeptide NCBI Reference Sequence: 
       NM_138957.3

<400>  223

Met Ala Ala Ala Ala Ala Ala Gly Ala Gly Pro Glu Met Val Arg Gly 
1               5                   10                  15      


Gln Val Phe Asp Val Gly Pro Arg Tyr Thr Asn Leu Ser Tyr Ile Gly 
            20                  25                  30          


Glu Gly Ala Tyr Gly Met Val Cys Ser Ala Tyr Asp Asn Val Asn Lys 
        35                  40                  45              


Val Arg Val Ala Ile Lys Lys Ile Ser Pro Phe Glu His Gln Thr Tyr 
    50                  55                  60                  


Cys Gln Arg Thr Leu Arg Glu Ile Lys Ile Leu Leu Arg Phe Arg His 
65                  70                  75                  80  


Glu Asn Ile Ile Gly Ile Asn Asp Ile Ile Arg Ala Pro Thr Ile Glu 
                85                  90                  95      


Gln Met Lys Asp Val Tyr Ile Val Gln Asp Leu Met Glu Thr Asp Leu 
            100                 105                 110         


Tyr Lys Leu Leu Lys Thr Gln His Leu Ser Asn Asp His Ile Cys Tyr 
        115                 120                 125             


Phe Leu Tyr Gln Ile Leu Arg Gly Leu Lys Tyr Ile His Ser Ala Asn 
    130                 135                 140                 


Val Leu His Arg Asp Leu Lys Pro Ser Asn Leu Leu Leu Asn Thr Thr 
145                 150                 155                 160 


Cys Asp Leu Lys Ile Cys Asp Phe Gly Leu Ala Arg Val Ala Asp Pro 
                165                 170                 175     


Asp His Asp His Thr Gly Phe Leu Thr Glu Tyr Val Ala Thr Arg Trp 
            180                 185                 190         


Tyr Arg Ala Pro Glu Ile Met Leu Asn Ser Lys Gly Tyr Thr Lys Ser 
        195                 200                 205             


Ile Asp Ile Trp Ser Val Gly Cys Ile Leu Ala Glu Met Leu Ser Asn 
    210                 215                 220                 


Arg Pro Ile Phe Pro Gly Lys His Tyr Leu Asp Gln Leu Asn His Ile 
225                 230                 235                 240 


Leu Gly Ile Leu Gly Ser Pro Ser Gln Glu Asp Leu Asn Cys Ile Ile 
                245                 250                 255     


Asn Leu Lys Ala Arg Asn Tyr Leu Leu Ser Leu Pro His Lys Asn Lys 
            260                 265                 270         


Val Pro Trp Asn Arg Leu Phe Pro Asn Ala Asp Ser Lys Ala Leu Asp 
        275                 280                 285             


Leu Leu Asp Lys Met Leu Thr Phe Asn Pro His Lys Arg Ile Glu Val 
    290                 295                 300                 


Glu Gln Ala Leu Ala His Pro Tyr Leu Glu Gln Tyr Tyr Asp Pro Ser 
305                 310                 315                 320 


Asp Glu Pro Ile Ala Glu Ala Pro Phe Lys Phe Asp Met Glu Leu Asp 
                325                 330                 335     


Asp Leu Pro Lys Glu Lys Leu Lys Glu Leu Ile Phe Glu Glu Thr Ala 
            340                 345                 350         


Arg Phe Gln Pro Gly Tyr Arg Ser 
        355                 360 


<210>  224
<211>  4001
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human PARP1 mRNA GenBank Accession No.: NM_001618.3  GI:156523967

<400>  224
aggcatcagc aatctatcag ggaacggcgg tggccggtgc ggcgtgttcg gtggcggctc       60

tggccgctca ggcgcctgcg gctgggtgag cgcacgcgag gcggcgaggc ggcagcgtgt      120

ttctaggtcg tggcgtcggg cttccggagc tttggcggca gctaggggag gatggcggag      180

tcttcggata agctctatcg agtcgagtac gccaagagcg ggcgcgcctc ttgcaagaaa      240

tgcagcgaga gcatccccaa ggactcgctc cggatggcca tcatggtgca gtcgcccatg      300

tttgatggaa aagtcccaca ctggtaccac ttctcctgct tctggaaggt gggccactcc      360

atccggcacc ctgacgttga ggtggatggg ttctctgagc ttcggtggga tgaccagcag      420

aaagtcaaga agacagcgga agctggagga gtgacaggca aaggccagga tggaattggt      480

agcaaggcag agaagactct gggtgacttt gcagcagagt atgccaagtc caacagaagt      540

acgtgcaagg ggtgtatgga gaagatagaa aagggccagg tgcgcctgtc caagaagatg      600

gtggacccgg agaagccaca gctaggcatg attgaccgct ggtaccatcc aggctgcttt      660

gtcaagaaca gggaggagct gggtttccgg cccgagtaca gtgcgagtca gctcaagggc      720

ttcagcctcc ttgctacaga ggataaagaa gccctgaaga agcagctccc aggagtcaag      780

agtgaaggaa agagaaaagg cgatgaggtg gatggagtgg atgaagtggc gaagaagaaa      840

tctaaaaaag aaaaagacaa ggatagtaag cttgaaaaag ccctaaaggc tcagaacgac      900

ctgatctgga acatcaagga cgagctaaag aaagtgtgtt caactaatga cctgaaggag      960

ctactcatct tcaacaagca gcaagtgcct tctggggagt cggcgatctt ggaccgagta     1020

gctgatggca tggtgttcgg tgccctcctt ccctgcgagg aatgctcggg tcagctggtc     1080

ttcaagagcg atgcctatta ctgcactggg gacgtcactg cctggaccaa gtgtatggtc     1140

aagacacaga cacccaaccg gaaggagtgg gtaaccccaa aggaattccg agaaatctct     1200

tacctcaaga aattgaaggt taaaaaacag gaccgtatat tccccccaga aaccagcgcc     1260

tccgtggcgg ccacgcctcc gccctccaca gcctcggctc ctgctgctgt gaactcctct     1320

gcttcagcag ataagccatt atccaacatg aagatcctga ctctcgggaa gctgtcccgg     1380

aacaaggatg aagtgaaggc catgattgag aaactcgggg ggaagttgac ggggacggcc     1440

aacaaggctt ccctgtgcat cagcaccaaa aaggaggtgg aaaagatgaa taagaagatg     1500

gaggaagtaa aggaagccaa catccgagtt gtgtctgagg acttcctcca ggacgtctcc     1560

gcctccacca agagccttca ggagttgttc ttagcgcaca tcttgtcccc ttggggggca     1620

gaggtgaagg cagagcctgt tgaagttgtg gccccaagag ggaagtcagg ggctgcgctc     1680

tccaaaaaaa gcaagggcca ggtcaaggag gaaggtatca acaaatctga aaagagaatg     1740

aaattaactc ttaaaggagg agcagctgtg gatcctgatt ctggactgga acactctgcg     1800

catgtcctgg agaaaggtgg gaaggtcttc agtgccaccc ttggcctggt ggacatcgtt     1860

aaaggaacca actcctacta caagctgcag cttctggagg acgacaagga aaacaggtat     1920

tggatattca ggtcctgggg ccgtgtgggt acggtgatcg gtagcaacaa actggaacag     1980

atgccgtcca aggaggatgc cattgagcac ttcatgaaat tatatgaaga aaaaaccggg     2040

aacgcttggc actccaaaaa tttcacgaag tatcccaaaa agttctaccc cctggagatt     2100

gactatggcc aggatgaaga ggcagtgaag aagctgacag taaatcctgg caccaagtcc     2160

aagctcccca agccagttca ggacctcatc aagatgatct ttgatgtgga aagtatgaag     2220

aaagccatgg tggagtatga gatcgacctt cagaagatgc ccttggggaa gctgagcaaa     2280

aggcagatcc aggccgcata ctccatcctc agtgaggtcc agcaggcggt gtctcagggc     2340

agcagcgact ctcagatcct ggatctctca aatcgctttt acaccctgat cccccacgac     2400

tttgggatga agaagcctcc gctcctgaac aatgcagaca gtgtgcaggc caaggtggaa     2460

atgcttgaca acctgctgga catcgaggtg gcctacagtc tgctcagggg agggtctgat     2520

gatagcagca aggatcccat cgatgtcaac tatgagaagc tcaaaactga cattaaggtg     2580

gttgacagag attctgaaga agccgagatc atcaggaagt atgttaagaa cactcatgca     2640

accacacaca atgcgtatga cttggaagtc atcgatatct ttaagataga gcgtgaaggc     2700

gaatgccagc gttacaagcc ctttaagcag cttcataacc gaagattgct gtggcacggg     2760

tccaggacca ccaactttgc tgggatcctg tcccagggtc ttcggatagc cccgcctgaa     2820

gcgcccgtga caggctacat gtttggtaaa gggatctatt tcgctgacat ggtctccaag     2880

agtgccaact actgccatac gtctcaggga gacccaatag gcttaatcct gttgggagaa     2940

gttgcccttg gaaacatgta tgaactgaag cacgcttcac atatcagcaa gttacccaag     3000

ggcaagcaca gtgtcaaagg tttgggcaaa actacccctg atccttcagc taacattagt     3060

ctggatggtg tagacgttcc tcttgggacc gggatttcat ctggtgtgaa tgacacctct     3120

ctactatata acgagtacat tgtctatgat attgctcagg taaatctgaa gtatctgctg     3180

aaactgaaat tcaattttaa gacctccctg tggtaattgg gagaggtagc cgagtcacac     3240

ccggtggctc tggtatgaat tcacccgaag cgcttctgca ccaactcacc tggccgctaa     3300

gttgctgatg ggtagtacct gtactaaacc acctcagaaa ggattttaca gaaacgtgtt     3360

aaaggttttc tctaacttct caagtccctt gttttgtgtt gtgtctgtgg ggaggggttg     3420

ttttggggtt gtttttgttt tttcttgcca ggtagataaa actgacatag agaaaaggct     3480

ggagagagat tctgttgcat agactagtcc tatggaaaaa accaagcttc gttagaatgt     3540

ctgccttact ggtttcccca gggaaggaaa aatacacttc cacccttttt tctaagtgtt     3600

cgtctttagt tttgattttg gaaagatgtt aagcatttat ttttagttaa aaataaaaac     3660

taatttcata ctatttagat tttctttttt atcttgcact tattgtcccc tttttagttt     3720

tttttgtttg cctcttgtgg tgaggggtgt gggaagacca aaggaaggaa cgctaacaat     3780

ttctcatact tagaaacaaa aagagctttc cttctccagg aatactgaac atgggagctc     3840

ttgaaatatg tagtattaaa agttgcattt gaaattcttg actttcttat gggcactttt     3900

gtcttccaaa ttaaaactct accacaaata tacttaccca agggctaata gtaatactcg     3960

attaaaaatg cagatgcctt ctctaaaaaa aaaaaaaaaa a                         4001


<210>  225
<211>  1014
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human PARP1 polypeptide GenBank Accession No.: NP_001609.2  
       GI:156523968

<400>  225

Met Ala Glu Ser Ser Asp Lys Leu Tyr Arg Val Glu Tyr Ala Lys Ser 
1               5                   10                  15      


Gly Arg Ala Ser Cys Lys Lys Cys Ser Glu Ser Ile Pro Lys Asp Ser 
            20                  25                  30          


Leu Arg Met Ala Ile Met Val Gln Ser Pro Met Phe Asp Gly Lys Val 
        35                  40                  45              


Pro His Trp Tyr His Phe Ser Cys Phe Trp Lys Val Gly His Ser Ile 
    50                  55                  60                  


Arg His Pro Asp Val Glu Val Asp Gly Phe Ser Glu Leu Arg Trp Asp 
65                  70                  75                  80  


Asp Gln Gln Lys Val Lys Lys Thr Ala Glu Ala Gly Gly Val Thr Gly 
                85                  90                  95      


Lys Gly Gln Asp Gly Ile Gly Ser Lys Ala Glu Lys Thr Leu Gly Asp 
            100                 105                 110         


Phe Ala Ala Glu Tyr Ala Lys Ser Asn Arg Ser Thr Cys Lys Gly Cys 
        115                 120                 125             


Met Glu Lys Ile Glu Lys Gly Gln Val Arg Leu Ser Lys Lys Met Val 
    130                 135                 140                 


Asp Pro Glu Lys Pro Gln Leu Gly Met Ile Asp Arg Trp Tyr His Pro 
145                 150                 155                 160 


Gly Cys Phe Val Lys Asn Arg Glu Glu Leu Gly Phe Arg Pro Glu Tyr 
                165                 170                 175     


Ser Ala Ser Gln Leu Lys Gly Phe Ser Leu Leu Ala Thr Glu Asp Lys 
            180                 185                 190         


Glu Ala Leu Lys Lys Gln Leu Pro Gly Val Lys Ser Glu Gly Lys Arg 
        195                 200                 205             


Lys Gly Asp Glu Val Asp Gly Val Asp Glu Val Ala Lys Lys Lys Ser 
    210                 215                 220                 


Lys Lys Glu Lys Asp Lys Asp Ser Lys Leu Glu Lys Ala Leu Lys Ala 
225                 230                 235                 240 


Gln Asn Asp Leu Ile Trp Asn Ile Lys Asp Glu Leu Lys Lys Val Cys 
                245                 250                 255     


Ser Thr Asn Asp Leu Lys Glu Leu Leu Ile Phe Asn Lys Gln Gln Val 
            260                 265                 270         


Pro Ser Gly Glu Ser Ala Ile Leu Asp Arg Val Ala Asp Gly Met Val 
        275                 280                 285             


Phe Gly Ala Leu Leu Pro Cys Glu Glu Cys Ser Gly Gln Leu Val Phe 
    290                 295                 300                 


Lys Ser Asp Ala Tyr Tyr Cys Thr Gly Asp Val Thr Ala Trp Thr Lys 
305                 310                 315                 320 


Cys Met Val Lys Thr Gln Thr Pro Asn Arg Lys Glu Trp Val Thr Pro 
                325                 330                 335     


Lys Glu Phe Arg Glu Ile Ser Tyr Leu Lys Lys Leu Lys Val Lys Lys 
            340                 345                 350         


Gln Asp Arg Ile Phe Pro Pro Glu Thr Ser Ala Ser Val Ala Ala Thr 
        355                 360                 365             


Pro Pro Pro Ser Thr Ala Ser Ala Pro Ala Ala Val Asn Ser Ser Ala 
    370                 375                 380                 


Ser Ala Asp Lys Pro Leu Ser Asn Met Lys Ile Leu Thr Leu Gly Lys 
385                 390                 395                 400 


Leu Ser Arg Asn Lys Asp Glu Val Lys Ala Met Ile Glu Lys Leu Gly 
                405                 410                 415     


Gly Lys Leu Thr Gly Thr Ala Asn Lys Ala Ser Leu Cys Ile Ser Thr 
            420                 425                 430         


Lys Lys Glu Val Glu Lys Met Asn Lys Lys Met Glu Glu Val Lys Glu 
        435                 440                 445             


Ala Asn Ile Arg Val Val Ser Glu Asp Phe Leu Gln Asp Val Ser Ala 
    450                 455                 460                 


Ser Thr Lys Ser Leu Gln Glu Leu Phe Leu Ala His Ile Leu Ser Pro 
465                 470                 475                 480 


Trp Gly Ala Glu Val Lys Ala Glu Pro Val Glu Val Val Ala Pro Arg 
                485                 490                 495     


Gly Lys Ser Gly Ala Ala Leu Ser Lys Lys Ser Lys Gly Gln Val Lys 
            500                 505                 510         


Glu Glu Gly Ile Asn Lys Ser Glu Lys Arg Met Lys Leu Thr Leu Lys 
        515                 520                 525             


Gly Gly Ala Ala Val Asp Pro Asp Ser Gly Leu Glu His Ser Ala His 
    530                 535                 540                 


Val Leu Glu Lys Gly Gly Lys Val Phe Ser Ala Thr Leu Gly Leu Val 
545                 550                 555                 560 


Asp Ile Val Lys Gly Thr Asn Ser Tyr Tyr Lys Leu Gln Leu Leu Glu 
                565                 570                 575     


Asp Asp Lys Glu Asn Arg Tyr Trp Ile Phe Arg Ser Trp Gly Arg Val 
            580                 585                 590         


Gly Thr Val Ile Gly Ser Asn Lys Leu Glu Gln Met Pro Ser Lys Glu 
        595                 600                 605             


Asp Ala Ile Glu His Phe Met Lys Leu Tyr Glu Glu Lys Thr Gly Asn 
    610                 615                 620                 


Ala Trp His Ser Lys Asn Phe Thr Lys Tyr Pro Lys Lys Phe Tyr Pro 
625                 630                 635                 640 


Leu Glu Ile Asp Tyr Gly Gln Asp Glu Glu Ala Val Lys Lys Leu Thr 
                645                 650                 655     


Val Asn Pro Gly Thr Lys Ser Lys Leu Pro Lys Pro Val Gln Asp Leu 
            660                 665                 670         


Ile Lys Met Ile Phe Asp Val Glu Ser Met Lys Lys Ala Met Val Glu 
        675                 680                 685             


Tyr Glu Ile Asp Leu Gln Lys Met Pro Leu Gly Lys Leu Ser Lys Arg 
    690                 695                 700                 


Gln Ile Gln Ala Ala Tyr Ser Ile Leu Ser Glu Val Gln Gln Ala Val 
705                 710                 715                 720 


Ser Gln Gly Ser Ser Asp Ser Gln Ile Leu Asp Leu Ser Asn Arg Phe 
                725                 730                 735     


Tyr Thr Leu Ile Pro His Asp Phe Gly Met Lys Lys Pro Pro Leu Leu 
            740                 745                 750         


Asn Asn Ala Asp Ser Val Gln Ala Lys Val Glu Met Leu Asp Asn Leu 
        755                 760                 765             


Leu Asp Ile Glu Val Ala Tyr Ser Leu Leu Arg Gly Gly Ser Asp Asp 
    770                 775                 780                 


Ser Ser Lys Asp Pro Ile Asp Val Asn Tyr Glu Lys Leu Lys Thr Asp 
785                 790                 795                 800 


Ile Lys Val Val Asp Arg Asp Ser Glu Glu Ala Glu Ile Ile Arg Lys 
                805                 810                 815     


Tyr Val Lys Asn Thr His Ala Thr Thr His Asn Ala Tyr Asp Leu Glu 
            820                 825                 830         


Val Ile Asp Ile Phe Lys Ile Glu Arg Glu Gly Glu Cys Gln Arg Tyr 
        835                 840                 845             


Lys Pro Phe Lys Gln Leu His Asn Arg Arg Leu Leu Trp His Gly Ser 
    850                 855                 860                 


Arg Thr Thr Asn Phe Ala Gly Ile Leu Ser Gln Gly Leu Arg Ile Ala 
865                 870                 875                 880 


Pro Pro Glu Ala Pro Val Thr Gly Tyr Met Phe Gly Lys Gly Ile Tyr 
                885                 890                 895     


Phe Ala Asp Met Val Ser Lys Ser Ala Asn Tyr Cys His Thr Ser Gln 
            900                 905                 910         


Gly Asp Pro Ile Gly Leu Ile Leu Leu Gly Glu Val Ala Leu Gly Asn 
        915                 920                 925             


Met Tyr Glu Leu Lys His Ala Ser His Ile Ser Lys Leu Pro Lys Gly 
    930                 935                 940                 


Lys His Ser Val Lys Gly Leu Gly Lys Thr Thr Pro Asp Pro Ser Ala 
945                 950                 955                 960 


Asn Ile Ser Leu Asp Gly Val Asp Val Pro Leu Gly Thr Gly Ile Ser 
                965                 970                 975     


Ser Gly Val Asn Asp Thr Ser Leu Leu Tyr Asn Glu Tyr Ile Val Tyr 
            980                 985                 990         


Asp Ile Ala Gln Val Asn Leu Lys  Tyr Leu Leu Lys Leu  Lys Phe Asn 
        995                 1000                 1005             


Phe Lys  Thr Ser Leu Trp 
    1010                 


<210>  226
<211>  1904
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens poly (ADP-ribose) polymerase 2 (PARP2), transcript 
       variant 1, mRNA NCBI Reference Sequence: NM_005484.3

<400>  226
gggttgatga cgtcagcgtt cgaattccat ggcggcgcgg cggcgacgga gcaccggcgg       60

cggcagggcg agagcattaa atgaaagcaa aagagttaat aatggcaaca cggctccaga      120

agactcttcc cctgccaaga aaactcgtag atgccagaga caggagtcga aaaagatgcc      180

tgtggctgga ggaaaagcta ataaggacag gacagaagac aagcaagatg gtatgccagg      240

aaggtcatgg gccagcaaaa gggtctctga atctgtgaag gccttgctgt taaagggcaa      300

agctcctgtg gacccagagt gtacagccaa ggtggggaag gctcatgtgt attgtgaagg      360

aaatgatgtc tatgatgtca tgctaaatca gaccaatctc cagttcaaca acaacaagta      420

ctatctgatt cagctattag aagatgatgc ccagaggaac ttcagtgttt ggatgagatg      480

gggccgagtt gggaaaatgg gacagcacag cctggtggct tgttcaggca atctcaacaa      540

ggccaaggaa atctttcaga agaaattcct tgacaaaacg aaaaacaatt gggaagatcg      600

agaaaagttt gagaaggtgc ctggaaaata tgatatgcta cagatggact atgccaccaa      660

tactcaggat gaagaggaaa caaagaaaga ggaatctctt aaatctccct tgaagccaga      720

gtcacagcta gatcttcggg tacaggagtt aataaagttg atctgtaatg ttcaggccat      780

ggaagaaatg atgatggaaa tgaagtataa taccaagaaa gccccacttg ggaagctgac      840

agtggcacaa atcaaggcag gttaccagtc tcttaagaag attgaggatt gtattcgggc      900

tggccagcat ggacgagctc tcatggaagc atgcaatgaa ttctacacca ggattccgca      960

tgactttgga ctccgtactc ctccactaat ccggacacag aaggaactgt cagaaaaaat     1020

acaattacta gaggctttgg gagacattga aattgctatt aagctggtga aaacagagct     1080

acaaagccca gaacacccat tggaccaaca ctatagaaac ctacattgtg ccttgcgccc     1140

ccttgaccat gaaagttatg agttcaaagt gatttcccag tacctacaat ctacccatgc     1200

tcccacacac agcgactata ccatgacctt gctggatttg tttgaagtgg agaaggatgg     1260

tgagaaagaa gccttcagag aggaccttca taacaggatg cttctatggc atggttccag     1320

gatgagtaac tgggtgggaa tcttgagcca tgggcttcga attgccccac ctgaagctcc     1380

catcacaggt tacatgtttg ggaaaggaat ctactttgct gacatgtctt ccaagagtgc     1440

caattactgc tttgcctctc gcctaaagaa tacaggactg ctgctcttat cagaggtagc     1500

tctaggtcag tgtaatgaac tactagaggc caatcctaag gccgaaggat tgcttcaagg     1560

taaacatagc accaaggggc tgggcaagat ggctcccagt tctgcccact tcgtcaccct     1620

gaatgggagt acagtgccat taggaccagc aagtgacaca ggaattctga atccagatgg     1680

ttataccctc aactacaatg aatatattgt atataacccc aaccaggtcc gtatgcggta     1740

ccttttaaag gttcagttta atttccttca gctgtggtga atgttgatat taaataaacc     1800

agagatctga tcttcaagca agaaaataag cagtgttgta cttgtgaatt ttgtgatatt     1860

ttatgtaata aaaactgtac aggtctaaaa aaaaaaaaaa aaaa                      1904


<210>  227
<211>  583
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens poly (ADP-ribose) polymerase 2 (PARP2), transcript 
       variant 1, polypeptide NCBI Reference Sequence: NM_005484.3

<400>  227

Met Ala Ala Arg Arg Arg Arg Ser Thr Gly Gly Gly Arg Ala Arg Ala 
1               5                   10                  15      


Leu Asn Glu Ser Lys Arg Val Asn Asn Gly Asn Thr Ala Pro Glu Asp 
            20                  25                  30          


Ser Ser Pro Ala Lys Lys Thr Arg Arg Cys Gln Arg Gln Glu Ser Lys 
        35                  40                  45              


Lys Met Pro Val Ala Gly Gly Lys Ala Asn Lys Asp Arg Thr Glu Asp 
    50                  55                  60                  


Lys Gln Asp Gly Met Pro Gly Arg Ser Trp Ala Ser Lys Arg Val Ser 
65                  70                  75                  80  


Glu Ser Val Lys Ala Leu Leu Leu Lys Gly Lys Ala Pro Val Asp Pro 
                85                  90                  95      


Glu Cys Thr Ala Lys Val Gly Lys Ala His Val Tyr Cys Glu Gly Asn 
            100                 105                 110         


Asp Val Tyr Asp Val Met Leu Asn Gln Thr Asn Leu Gln Phe Asn Asn 
        115                 120                 125             


Asn Lys Tyr Tyr Leu Ile Gln Leu Leu Glu Asp Asp Ala Gln Arg Asn 
    130                 135                 140                 


Phe Ser Val Trp Met Arg Trp Gly Arg Val Gly Lys Met Gly Gln His 
145                 150                 155                 160 


Ser Leu Val Ala Cys Ser Gly Asn Leu Asn Lys Ala Lys Glu Ile Phe 
                165                 170                 175     


Gln Lys Lys Phe Leu Asp Lys Thr Lys Asn Asn Trp Glu Asp Arg Glu 
            180                 185                 190         


Lys Phe Glu Lys Val Pro Gly Lys Tyr Asp Met Leu Gln Met Asp Tyr 
        195                 200                 205             


Ala Thr Asn Thr Gln Asp Glu Glu Glu Thr Lys Lys Glu Glu Ser Leu 
    210                 215                 220                 


Lys Ser Pro Leu Lys Pro Glu Ser Gln Leu Asp Leu Arg Val Gln Glu 
225                 230                 235                 240 


Leu Ile Lys Leu Ile Cys Asn Val Gln Ala Met Glu Glu Met Met Met 
                245                 250                 255     


Glu Met Lys Tyr Asn Thr Lys Lys Ala Pro Leu Gly Lys Leu Thr Val 
            260                 265                 270         


Ala Gln Ile Lys Ala Gly Tyr Gln Ser Leu Lys Lys Ile Glu Asp Cys 
        275                 280                 285             


Ile Arg Ala Gly Gln His Gly Arg Ala Leu Met Glu Ala Cys Asn Glu 
    290                 295                 300                 


Phe Tyr Thr Arg Ile Pro His Asp Phe Gly Leu Arg Thr Pro Pro Leu 
305                 310                 315                 320 


Ile Arg Thr Gln Lys Glu Leu Ser Glu Lys Ile Gln Leu Leu Glu Ala 
                325                 330                 335     


Leu Gly Asp Ile Glu Ile Ala Ile Lys Leu Val Lys Thr Glu Leu Gln 
            340                 345                 350         


Ser Pro Glu His Pro Leu Asp Gln His Tyr Arg Asn Leu His Cys Ala 
        355                 360                 365             


Leu Arg Pro Leu Asp His Glu Ser Tyr Glu Phe Lys Val Ile Ser Gln 
    370                 375                 380                 


Tyr Leu Gln Ser Thr His Ala Pro Thr His Ser Asp Tyr Thr Met Thr 
385                 390                 395                 400 


Leu Leu Asp Leu Phe Glu Val Glu Lys Asp Gly Glu Lys Glu Ala Phe 
                405                 410                 415     


Arg Glu Asp Leu His Asn Arg Met Leu Leu Trp His Gly Ser Arg Met 
            420                 425                 430         


Ser Asn Trp Val Gly Ile Leu Ser His Gly Leu Arg Ile Ala Pro Pro 
        435                 440                 445             


Glu Ala Pro Ile Thr Gly Tyr Met Phe Gly Lys Gly Ile Tyr Phe Ala 
    450                 455                 460                 


Asp Met Ser Ser Lys Ser Ala Asn Tyr Cys Phe Ala Ser Arg Leu Lys 
465                 470                 475                 480 


Asn Thr Gly Leu Leu Leu Leu Ser Glu Val Ala Leu Gly Gln Cys Asn 
                485                 490                 495     


Glu Leu Leu Glu Ala Asn Pro Lys Ala Glu Gly Leu Leu Gln Gly Lys 
            500                 505                 510         


His Ser Thr Lys Gly Leu Gly Lys Met Ala Pro Ser Ser Ala His Phe 
        515                 520                 525             


Val Thr Leu Asn Gly Ser Thr Val Pro Leu Gly Pro Ala Ser Asp Thr 
    530                 535                 540                 


Gly Ile Leu Asn Pro Asp Gly Tyr Thr Leu Asn Tyr Asn Glu Tyr Ile 
545                 550                 555                 560 


Val Tyr Asn Pro Asn Gln Val Arg Met Arg Tyr Leu Leu Lys Val Gln 
                565                 570                 575     


Phe Asn Phe Leu Gln Leu Trp 
            580             


<210>  228
<211>  1865
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens poly (ADP-ribose) polymerase 2 (PARP2), transcript 
       variant 2, mRNA NCBI Reference Sequence: NM_001042618.1

<400>  228
gggttgatga cgtcagcgtt cgaattccat ggcggcgcgg cggcgacgga gcaccggcgg       60

cggcagggcg agagcattaa atgaaagcaa aagagttaat aatggcaaca cggctccaga      120

agactcttcc cctgccaaga aaactcgtag atgccagaga caggagtcga aaaagatgcc      180

tgtggctgga ggaaaagcta ataaggacag gacagaagac aagcaagatg aatctgtgaa      240

ggccttgctg ttaaagggca aagctcctgt ggacccagag tgtacagcca aggtggggaa      300

ggctcatgtg tattgtgaag gaaatgatgt ctatgatgtc atgctaaatc agaccaatct      360

ccagttcaac aacaacaagt actatctgat tcagctatta gaagatgatg cccagaggaa      420

cttcagtgtt tggatgagat ggggccgagt tgggaaaatg ggacagcaca gcctggtggc      480

ttgttcaggc aatctcaaca aggccaagga aatctttcag aagaaattcc ttgacaaaac      540

gaaaaacaat tgggaagatc gagaaaagtt tgagaaggtg cctggaaaat atgatatgct      600

acagatggac tatgccacca atactcagga tgaagaggaa acaaagaaag aggaatctct      660

taaatctccc ttgaagccag agtcacagct agatcttcgg gtacaggagt taataaagtt      720

gatctgtaat gttcaggcca tggaagaaat gatgatggaa atgaagtata ataccaagaa      780

agccccactt gggaagctga cagtggcaca aatcaaggca ggttaccagt ctcttaagaa      840

gattgaggat tgtattcggg ctggccagca tggacgagct ctcatggaag catgcaatga      900

attctacacc aggattccgc atgactttgg actccgtact cctccactaa tccggacaca      960

gaaggaactg tcagaaaaaa tacaattact agaggctttg ggagacattg aaattgctat     1020

taagctggtg aaaacagagc tacaaagccc agaacaccca ttggaccaac actatagaaa     1080

cctacattgt gccttgcgcc cccttgacca tgaaagttat gagttcaaag tgatttccca     1140

gtacctacaa tctacccatg ctcccacaca cagcgactat accatgacct tgctggattt     1200

gtttgaagtg gagaaggatg gtgagaaaga agccttcaga gaggaccttc ataacaggat     1260

gcttctatgg catggttcca ggatgagtaa ctgggtggga atcttgagcc atgggcttcg     1320

aattgcccca cctgaagctc ccatcacagg ttacatgttt gggaaaggaa tctactttgc     1380

tgacatgtct tccaagagtg ccaattactg ctttgcctct cgcctaaaga atacaggact     1440

gctgctctta tcagaggtag ctctaggtca gtgtaatgaa ctactagagg ccaatcctaa     1500

ggccgaagga ttgcttcaag gtaaacatag caccaagggg ctgggcaaga tggctcccag     1560

ttctgcccac ttcgtcaccc tgaatgggag tacagtgcca ttaggaccag caagtgacac     1620

aggaattctg aatccagatg gttataccct caactacaat gaatatattg tatataaccc     1680

caaccaggtc cgtatgcggt accttttaaa ggttcagttt aatttccttc agctgtggtg     1740

aatgttgata ttaaataaac cagagatctg atcttcaagc aagaaaataa gcagtgttgt     1800

acttgtgaat tttgtgatat tttatgtaat aaaaactgta caggtctaaa aaaaaaaaaa     1860

aaaaa                                                                 1865


<210>  229
<211>  570
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens poly (ADP-ribose) polymerase 2 (PARP2), transcript 
       variant 2, polypeptide NCBI Reference Sequence: NM_001042618.1

<400>  229

Met Ala Ala Arg Arg Arg Arg Ser Thr Gly Gly Gly Arg Ala Arg Ala 
1               5                   10                  15      


Leu Asn Glu Ser Lys Arg Val Asn Asn Gly Asn Thr Ala Pro Glu Asp 
            20                  25                  30          


Ser Ser Pro Ala Lys Lys Thr Arg Arg Cys Gln Arg Gln Glu Ser Lys 
        35                  40                  45              


Lys Met Pro Val Ala Gly Gly Lys Ala Asn Lys Asp Arg Thr Glu Asp 
    50                  55                  60                  


Lys Gln Asp Glu Ser Val Lys Ala Leu Leu Leu Lys Gly Lys Ala Pro 
65                  70                  75                  80  


Val Asp Pro Glu Cys Thr Ala Lys Val Gly Lys Ala His Val Tyr Cys 
                85                  90                  95      


Glu Gly Asn Asp Val Tyr Asp Val Met Leu Asn Gln Thr Asn Leu Gln 
            100                 105                 110         


Phe Asn Asn Asn Lys Tyr Tyr Leu Ile Gln Leu Leu Glu Asp Asp Ala 
        115                 120                 125             


Gln Arg Asn Phe Ser Val Trp Met Arg Trp Gly Arg Val Gly Lys Met 
    130                 135                 140                 


Gly Gln His Ser Leu Val Ala Cys Ser Gly Asn Leu Asn Lys Ala Lys 
145                 150                 155                 160 


Glu Ile Phe Gln Lys Lys Phe Leu Asp Lys Thr Lys Asn Asn Trp Glu 
                165                 170                 175     


Asp Arg Glu Lys Phe Glu Lys Val Pro Gly Lys Tyr Asp Met Leu Gln 
            180                 185                 190         


Met Asp Tyr Ala Thr Asn Thr Gln Asp Glu Glu Glu Thr Lys Lys Glu 
        195                 200                 205             


Glu Ser Leu Lys Ser Pro Leu Lys Pro Glu Ser Gln Leu Asp Leu Arg 
    210                 215                 220                 


Val Gln Glu Leu Ile Lys Leu Ile Cys Asn Val Gln Ala Met Glu Glu 
225                 230                 235                 240 


Met Met Met Glu Met Lys Tyr Asn Thr Lys Lys Ala Pro Leu Gly Lys 
                245                 250                 255     


Leu Thr Val Ala Gln Ile Lys Ala Gly Tyr Gln Ser Leu Lys Lys Ile 
            260                 265                 270         


Glu Asp Cys Ile Arg Ala Gly Gln His Gly Arg Ala Leu Met Glu Ala 
        275                 280                 285             


Cys Asn Glu Phe Tyr Thr Arg Ile Pro His Asp Phe Gly Leu Arg Thr 
    290                 295                 300                 


Pro Pro Leu Ile Arg Thr Gln Lys Glu Leu Ser Glu Lys Ile Gln Leu 
305                 310                 315                 320 


Leu Glu Ala Leu Gly Asp Ile Glu Ile Ala Ile Lys Leu Val Lys Thr 
                325                 330                 335     


Glu Leu Gln Ser Pro Glu His Pro Leu Asp Gln His Tyr Arg Asn Leu 
            340                 345                 350         


His Cys Ala Leu Arg Pro Leu Asp His Glu Ser Tyr Glu Phe Lys Val 
        355                 360                 365             


Ile Ser Gln Tyr Leu Gln Ser Thr His Ala Pro Thr His Ser Asp Tyr 
    370                 375                 380                 


Thr Met Thr Leu Leu Asp Leu Phe Glu Val Glu Lys Asp Gly Glu Lys 
385                 390                 395                 400 


Glu Ala Phe Arg Glu Asp Leu His Asn Arg Met Leu Leu Trp His Gly 
                405                 410                 415     


Ser Arg Met Ser Asn Trp Val Gly Ile Leu Ser His Gly Leu Arg Ile 
            420                 425                 430         


Ala Pro Pro Glu Ala Pro Ile Thr Gly Tyr Met Phe Gly Lys Gly Ile 
        435                 440                 445             


Tyr Phe Ala Asp Met Ser Ser Lys Ser Ala Asn Tyr Cys Phe Ala Ser 
    450                 455                 460                 


Arg Leu Lys Asn Thr Gly Leu Leu Leu Leu Ser Glu Val Ala Leu Gly 
465                 470                 475                 480 


Gln Cys Asn Glu Leu Leu Glu Ala Asn Pro Lys Ala Glu Gly Leu Leu 
                485                 490                 495     


Gln Gly Lys His Ser Thr Lys Gly Leu Gly Lys Met Ala Pro Ser Ser 
            500                 505                 510         


Ala His Phe Val Thr Leu Asn Gly Ser Thr Val Pro Leu Gly Pro Ala 
        515                 520                 525             


Ser Asp Thr Gly Ile Leu Asn Pro Asp Gly Tyr Thr Leu Asn Tyr Asn 
    530                 535                 540                 


Glu Tyr Ile Val Tyr Asn Pro Asn Gln Val Arg Met Arg Tyr Leu Leu 
545                 550                 555                 560 


Lys Val Gln Phe Asn Phe Leu Gln Leu Trp 
                565                 570 


<210>  230
<211>  2355
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens poly (ADP-ribose) polymerase family, member 3 
       (PARP3), transcript variant 1, mRNA

<400>  230
gtatccgggc ccaaggtcac cgcgcgaccg gcagatgcgt gctgcaggcc ccggccacat       60

gagcagcgct acggacgcga ctgccccggc cttggatatg ccagatcgag tgtccacccg      120

tccgtgggac tggtcgcctg actcggcctg ccccagcctc tgcttcaccc cactggtggc      180

caaatagccg atgtctaatc ccccacacaa gctcatcccc ggcctctggc gattgttggg      240

aattctctcc ctaattcacg cctgaggctc atggagagtt gctagacctg ggactgccct      300

gggaggcgca cacaaccagg ccgggtggca gccaggacct ctcccatgtc cctgcttttc      360

ttggccatgg ctccaaagcc gaagccctgg gtacagactg agggccctga gaagaagaag      420

ggccggcagg caggaaggga ggaggacccc ttccgctcca ccgctgaggc cctcaaggcc      480

atacccgcag agaagcgcat aatccgcgtg gatccaacat gtccactcag cagcaacccc      540

gggacccagg tgtatgagga ctacaactgc accctgaacc agaccaacat cgagaacaac      600

aacaacaagt tctacatcat ccagctgctc caagacagca accgcttctt cacctgctgg      660

aaccactggg gccgtgtggg agaggtcggc cagtcaaaga tcaaccactt cacaaggcta      720

gaagatgcaa agaaggactt tgagaagaaa tttcgggaaa agaccaagaa caactgggca      780

gagcgggacc actttgtgtc tcacccgggc aagtacacac ttatcgaagt acaggcagag      840

gatgaggccc aggaagctgt ggtgaaggtg gacagaggcc cagtgaggac tgtgactaag      900

cgggtgcagc cctgctccct ggacccagcc acgcagaagc tcatcactaa catcttcagc      960

aaggagatgt tcaagaacac catggccctc atggacctgg atgtgaagaa gatgcccctg     1020

ggaaagctga gcaagcaaca gattgcacgg ggtttcgagg ccttggaggc gctggaggag     1080

gccctgaaag gccccacgga tggtggccaa agcctggagg agctgtcctc acacttttac     1140

accgtcatcc cgcacaactt cggccacagc cagcccccgc ccatcaattc ccctgagctt     1200

ctgcaggcca agaaggacat gctgctggtg ctggcggaca tcgagctggc ccaggccctg     1260

caggcagtct ctgagcagga gaagacggtg gaggaggtgc cacaccccct ggaccgagac     1320

taccagcttc tcaagtgcca gctgcagctg ctagactctg gagcacctga gtacaaggtg     1380

atacagacct acttagaaca gactggcagc aaccacaggt gccctacact tcaacacatc     1440

tggaaagtaa accaagaagg ggaggaagac agattccagg cccactccaa actgggtaat     1500

cggaagctgc tgtggcatgg caccaacatg gccgtggtgg ccgccatcct cactagtggg     1560

ctccgcatca tgccacattc tggtgggcgt gttggcaagg gcatctactt tgcctcagag     1620

aacagcaagt cagctggata tgttattggc atgaagtgtg gggcccacca tgtcggctac     1680

atgttcctgg gtgaggtggc cctgggcaga gagcaccata tcaacacgga caaccccagc     1740

ttgaagagcc cacctcctgg cttcgacagt gtcattgccc gaggccacac cgagcctgat     1800

ccgacccagg acactgagtt ggagctggat ggccagcaag tggtggtgcc ccagggccag     1860

cctgtgccct gcccagagtt cagcagctcc acattctccc agagcgagta cctcatctac     1920

caggagagcc agtgtcgcct gcgctacctg ctggaggtcc acctctgagt gcccgccctg     1980

tcccccgggg tcctgcaagg ctggactgtg atcttcaatc atcctgccca tctctggtac     2040

ccctatatca ctcctttttt tcaagaatac aatacgttgt tgttaactat agtcaccatg     2100

ctgtacaaga tccctgaact tatgcctcct aactgaaatt ttgtattctt tgacacatct     2160

gcccagtccc tctcctccca gcccatggta accagcattt gactctttac ttgtataagg     2220

gcagctttta taggttccac atgtaagtga gatcatgcag tgtttgtctt tctgtgcctg     2280

gcttatttca ctcagcataa tgtgcaccgg gttcacccat gttttcataa atgacaagat     2340

ttcctccttt tttaa                                                      2355


<210>  231
<211>  540
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens poly (ADP-ribose) polymerase family, member 3 
       (PARP3), transcript variant 1, polypeptide NCBI Reference 
       Sequence: NM_001003931.2

<400>  231

Met Ser Leu Leu Phe Leu Ala Met Ala Pro Lys Pro Lys Pro Trp Val 
1               5                   10                  15      


Gln Thr Glu Gly Pro Glu Lys Lys Lys Gly Arg Gln Ala Gly Arg Glu 
            20                  25                  30          


Glu Asp Pro Phe Arg Ser Thr Ala Glu Ala Leu Lys Ala Ile Pro Ala 
        35                  40                  45              


Glu Lys Arg Ile Ile Arg Val Asp Pro Thr Cys Pro Leu Ser Ser Asn 
    50                  55                  60                  


Pro Gly Thr Gln Val Tyr Glu Asp Tyr Asn Cys Thr Leu Asn Gln Thr 
65                  70                  75                  80  


Asn Ile Glu Asn Asn Asn Asn Lys Phe Tyr Ile Ile Gln Leu Leu Gln 
                85                  90                  95      


Asp Ser Asn Arg Phe Phe Thr Cys Trp Asn His Trp Gly Arg Val Gly 
            100                 105                 110         


Glu Val Gly Gln Ser Lys Ile Asn His Phe Thr Arg Leu Glu Asp Ala 
        115                 120                 125             


Lys Lys Asp Phe Glu Lys Lys Phe Arg Glu Lys Thr Lys Asn Asn Trp 
    130                 135                 140                 


Ala Glu Arg Asp His Phe Val Ser His Pro Gly Lys Tyr Thr Leu Ile 
145                 150                 155                 160 


Glu Val Gln Ala Glu Asp Glu Ala Gln Glu Ala Val Val Lys Val Asp 
                165                 170                 175     


Arg Gly Pro Val Arg Thr Val Thr Lys Arg Val Gln Pro Cys Ser Leu 
            180                 185                 190         


Asp Pro Ala Thr Gln Lys Leu Ile Thr Asn Ile Phe Ser Lys Glu Met 
        195                 200                 205             


Phe Lys Asn Thr Met Ala Leu Met Asp Leu Asp Val Lys Lys Met Pro 
    210                 215                 220                 


Leu Gly Lys Leu Ser Lys Gln Gln Ile Ala Arg Gly Phe Glu Ala Leu 
225                 230                 235                 240 


Glu Ala Leu Glu Glu Ala Leu Lys Gly Pro Thr Asp Gly Gly Gln Ser 
                245                 250                 255     


Leu Glu Glu Leu Ser Ser His Phe Tyr Thr Val Ile Pro His Asn Phe 
            260                 265                 270         


Gly His Ser Gln Pro Pro Pro Ile Asn Ser Pro Glu Leu Leu Gln Ala 
        275                 280                 285             


Lys Lys Asp Met Leu Leu Val Leu Ala Asp Ile Glu Leu Ala Gln Ala 
    290                 295                 300                 


Leu Gln Ala Val Ser Glu Gln Glu Lys Thr Val Glu Glu Val Pro His 
305                 310                 315                 320 


Pro Leu Asp Arg Asp Tyr Gln Leu Leu Lys Cys Gln Leu Gln Leu Leu 
                325                 330                 335     


Asp Ser Gly Ala Pro Glu Tyr Lys Val Ile Gln Thr Tyr Leu Glu Gln 
            340                 345                 350         


Thr Gly Ser Asn His Arg Cys Pro Thr Leu Gln His Ile Trp Lys Val 
        355                 360                 365             


Asn Gln Glu Gly Glu Glu Asp Arg Phe Gln Ala His Ser Lys Leu Gly 
    370                 375                 380                 


Asn Arg Lys Leu Leu Trp His Gly Thr Asn Met Ala Val Val Ala Ala 
385                 390                 395                 400 


Ile Leu Thr Ser Gly Leu Arg Ile Met Pro His Ser Gly Gly Arg Val 
                405                 410                 415     


Gly Lys Gly Ile Tyr Phe Ala Ser Glu Asn Ser Lys Ser Ala Gly Tyr 
            420                 425                 430         


Val Ile Gly Met Lys Cys Gly Ala His His Val Gly Tyr Met Phe Leu 
        435                 440                 445             


Gly Glu Val Ala Leu Gly Arg Glu His His Ile Asn Thr Asp Asn Pro 
    450                 455                 460                 


Ser Leu Lys Ser Pro Pro Pro Gly Phe Asp Ser Val Ile Ala Arg Gly 
465                 470                 475                 480 


His Thr Glu Pro Asp Pro Thr Gln Asp Thr Glu Leu Glu Leu Asp Gly 
                485                 490                 495     


Gln Gln Val Val Val Pro Gln Gly Gln Pro Val Pro Cys Pro Glu Phe 
            500                 505                 510         


Ser Ser Ser Thr Phe Ser Gln Ser Glu Tyr Leu Ile Tyr Gln Glu Ser 
        515                 520                 525             


Gln Cys Arg Leu Arg Tyr Leu Leu Glu Val His Leu 
    530                 535                 540 


<210>  232
<211>  2360
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens poly (ADP-ribose) polymerase family, member 3 
       (PARP3), transcript variant 2, mRNA NCBI Reference Sequence: 
       NM_005485.4

<400>  232
gtatccgggc ccaaggtcac cgcgcgaccg gcagatgcgt gctgcaggcc ccggccacat       60

gagcagcgct acggacgcga ctgccccggc cttggatatg ccagatcgag tgtccacccg      120

tccgtgggac tggtcgcctg actcggcctg ccccagcctc tgcttcaccc cactggtggc      180

caaatagccg atgtctaatc ccccacacaa gctcatcccc ggcctctggc gattgttggg      240

aattctctcc ctaattcacg cctgaggctc atggagagtt gctagacctg ggactgccct      300

gggaggcgca cacaaccagg ccgggtggca gccaggacct ctcccatgtc cctgcttttc      360

ttgggacagc catggctcca aagccgaagc cctgggtaca gactgagggc cctgagaaga      420

agaagggccg gcaggcagga agggaggagg accccttccg ctccaccgct gaggccctca      480

aggccatacc cgcagagaag cgcataatcc gcgtggatcc aacatgtcca ctcagcagca      540

accccgggac ccaggtgtat gaggactaca actgcaccct gaaccagacc aacatcgaga      600

acaacaacaa caagttctac atcatccagc tgctccaaga cagcaaccgc ttcttcacct      660

gctggaacca ctggggccgt gtgggagagg tcggccagtc aaagatcaac cacttcacaa      720

ggctagaaga tgcaaagaag gactttgaga agaaatttcg ggaaaagacc aagaacaact      780

gggcagagcg ggaccacttt gtgtctcacc cgggcaagta cacacttatc gaagtacagg      840

cagaggatga ggcccaggaa gctgtggtga aggtggacag aggcccagtg aggactgtga      900

ctaagcgggt gcagccctgc tccctggacc cagccacgca gaagctcatc actaacatct      960

tcagcaagga gatgttcaag aacaccatgg ccctcatgga cctggatgtg aagaagatgc     1020

ccctgggaaa gctgagcaag caacagattg cacggggttt cgaggccttg gaggcgctgg     1080

aggaggccct gaaaggcccc acggatggtg gccaaagcct ggaggagctg tcctcacact     1140

tttacaccgt catcccgcac aacttcggcc acagccagcc cccgcccatc aattcccctg     1200

agcttctgca ggccaagaag gacatgctgc tggtgctggc ggacatcgag ctggcccagg     1260

ccctgcaggc agtctctgag caggagaaga cggtggagga ggtgccacac cccctggacc     1320

gagactacca gcttctcaag tgccagctgc agctgctaga ctctggagca cctgagtaca     1380

aggtgataca gacctactta gaacagactg gcagcaacca caggtgccct acacttcaac     1440

acatctggaa agtaaaccaa gaaggggagg aagacagatt ccaggcccac tccaaactgg     1500

gtaatcggaa gctgctgtgg catggcacca acatggccgt ggtggccgcc atcctcacta     1560

gtgggctccg catcatgcca cattctggtg ggcgtgttgg caagggcatc tactttgcct     1620

cagagaacag caagtcagct ggatatgtta ttggcatgaa gtgtggggcc caccatgtcg     1680

gctacatgtt cctgggtgag gtggccctgg gcagagagca ccatatcaac acggacaacc     1740

ccagcttgaa gagcccacct cctggcttcg acagtgtcat tgcccgaggc cacaccgagc     1800

ctgatccgac ccaggacact gagttggagc tggatggcca gcaagtggtg gtgccccagg     1860

gccagcctgt gccctgccca gagttcagca gctccacatt ctcccagagc gagtacctca     1920

tctaccagga gagccagtgt cgcctgcgct acctgctgga ggtccacctc tgagtgcccg     1980

ccctgtcccc cggggtcctg caaggctgga ctgtgatctt caatcatcct gcccatctct     2040

ggtaccccta tatcactcct ttttttcaag aatacaatac gttgttgtta actatagtca     2100

ccatgctgta caagatccct gaacttatgc ctcctaactg aaattttgta ttctttgaca     2160

catctgccca gtccctctcc tcccagccca tggtaaccag catttgactc tttacttgta     2220

taagggcagc ttttataggt tccacatgta agtgagatca tgcagtgttt gtctttctgt     2280

gcctggctta tttcactcag cataatgtgc accgggttca cccatgtttt cataaatgac     2340

aagatttcct ccttttttaa                                                 2360


<210>  233
<211>  533
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens poly (ADP-ribose) polymerase family, member 3 
       (PARP3), transcript variant 2, polypeptide NCBI Reference 
       Sequence: NM_005485.4

<400>  233

Met Ala Pro Lys Pro Lys Pro Trp Val Gln Thr Glu Gly Pro Glu Lys 
1               5                   10                  15      


Lys Lys Gly Arg Gln Ala Gly Arg Glu Glu Asp Pro Phe Arg Ser Thr 
            20                  25                  30          


Ala Glu Ala Leu Lys Ala Ile Pro Ala Glu Lys Arg Ile Ile Arg Val 
        35                  40                  45              


Asp Pro Thr Cys Pro Leu Ser Ser Asn Pro Gly Thr Gln Val Tyr Glu 
    50                  55                  60                  


Asp Tyr Asn Cys Thr Leu Asn Gln Thr Asn Ile Glu Asn Asn Asn Asn 
65                  70                  75                  80  


Lys Phe Tyr Ile Ile Gln Leu Leu Gln Asp Ser Asn Arg Phe Phe Thr 
                85                  90                  95      


Cys Trp Asn His Trp Gly Arg Val Gly Glu Val Gly Gln Ser Lys Ile 
            100                 105                 110         


Asn His Phe Thr Arg Leu Glu Asp Ala Lys Lys Asp Phe Glu Lys Lys 
        115                 120                 125             


Phe Arg Glu Lys Thr Lys Asn Asn Trp Ala Glu Arg Asp His Phe Val 
    130                 135                 140                 


Ser His Pro Gly Lys Tyr Thr Leu Ile Glu Val Gln Ala Glu Asp Glu 
145                 150                 155                 160 


Ala Gln Glu Ala Val Val Lys Val Asp Arg Gly Pro Val Arg Thr Val 
                165                 170                 175     


Thr Lys Arg Val Gln Pro Cys Ser Leu Asp Pro Ala Thr Gln Lys Leu 
            180                 185                 190         


Ile Thr Asn Ile Phe Ser Lys Glu Met Phe Lys Asn Thr Met Ala Leu 
        195                 200                 205             


Met Asp Leu Asp Val Lys Lys Met Pro Leu Gly Lys Leu Ser Lys Gln 
    210                 215                 220                 


Gln Ile Ala Arg Gly Phe Glu Ala Leu Glu Ala Leu Glu Glu Ala Leu 
225                 230                 235                 240 


Lys Gly Pro Thr Asp Gly Gly Gln Ser Leu Glu Glu Leu Ser Ser His 
                245                 250                 255     


Phe Tyr Thr Val Ile Pro His Asn Phe Gly His Ser Gln Pro Pro Pro 
            260                 265                 270         


Ile Asn Ser Pro Glu Leu Leu Gln Ala Lys Lys Asp Met Leu Leu Val 
        275                 280                 285             


Leu Ala Asp Ile Glu Leu Ala Gln Ala Leu Gln Ala Val Ser Glu Gln 
    290                 295                 300                 


Glu Lys Thr Val Glu Glu Val Pro His Pro Leu Asp Arg Asp Tyr Gln 
305                 310                 315                 320 


Leu Leu Lys Cys Gln Leu Gln Leu Leu Asp Ser Gly Ala Pro Glu Tyr 
                325                 330                 335     


Lys Val Ile Gln Thr Tyr Leu Glu Gln Thr Gly Ser Asn His Arg Cys 
            340                 345                 350         


Pro Thr Leu Gln His Ile Trp Lys Val Asn Gln Glu Gly Glu Glu Asp 
        355                 360                 365             


Arg Phe Gln Ala His Ser Lys Leu Gly Asn Arg Lys Leu Leu Trp His 
    370                 375                 380                 


Gly Thr Asn Met Ala Val Val Ala Ala Ile Leu Thr Ser Gly Leu Arg 
385                 390                 395                 400 


Ile Met Pro His Ser Gly Gly Arg Val Gly Lys Gly Ile Tyr Phe Ala 
                405                 410                 415     


Ser Glu Asn Ser Lys Ser Ala Gly Tyr Val Ile Gly Met Lys Cys Gly 
            420                 425                 430         


Ala His His Val Gly Tyr Met Phe Leu Gly Glu Val Ala Leu Gly Arg 
        435                 440                 445             


Glu His His Ile Asn Thr Asp Asn Pro Ser Leu Lys Ser Pro Pro Pro 
    450                 455                 460                 


Gly Phe Asp Ser Val Ile Ala Arg Gly His Thr Glu Pro Asp Pro Thr 
465                 470                 475                 480 


Gln Asp Thr Glu Leu Glu Leu Asp Gly Gln Gln Val Val Val Pro Gln 
                485                 490                 495     


Gly Gln Pro Val Pro Cys Pro Glu Phe Ser Ser Ser Thr Phe Ser Gln 
            500                 505                 510         


Ser Glu Tyr Leu Ile Tyr Gln Glu Ser Gln Cys Arg Leu Arg Tyr Leu 
        515                 520                 525             


Leu Glu Val His Leu 
    530             


<210>  234
<211>  2923
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens sequestosome 1 (SQSTM1), transcript variant 1, mRNA
       NCBI Reference Sequence: NM_003900.4

<400>  234
cctctcgagg cggggcgggg cctccgcgtt cgctacaaaa gccgcgcggc ggctgcgacc       60

gggacggccc gttttccgcc agctcgccgc tcgctatggc gtcgctcacc gtgaaggcct      120

accttctggg caaggaggac gcggcgcgcg agattcgccg cttcagcttc tgctgcagcc      180

ccgagcctga ggcggaagcc gaggctgcgg cgggtccggg accctgcgag cggctgctga      240

gccgggtggc cgccctgttc cccgcgctgc ggcctggcgg cttccaggcg cactaccgcg      300

atgaggacgg ggacttggtt gccttttcca gtgacgagga attgacaatg gccatgtcct      360

acgtgaagga tgacatcttc cgaatctaca ttaaagagaa aaaagagtgc cggcgggacc      420

accgcccacc gtgtgctcag gaggcgcccc gcaacatggt gcaccccaat gtgatctgcg      480

atggctgcaa tgggcctgtg gtaggaaccc gctacaagtg cagcgtctgc ccagactacg      540

acttgtgtag cgtctgcgag ggaaagggct tgcaccgggg gcacaccaag ctcgcattcc      600

ccagcccctt cgggcacctg tctgagggct tctcgcacag ccgctggctc cggaaggtga      660

aacacggaca cttcgggtgg ccaggatggg aaatgggtcc accaggaaac tggagcccac      720

gtcctcctcg tgcaggggag gcccgccctg gccccacggc agaatcagct tctggtccat      780

cggaggatcc gagtgtgaat ttcctgaaga acgttgggga gagtgtggca gctgccctta      840

gccctctggg cattgaagtt gatatcgatg tggagcacgg agggaaaaga agccgcctga      900

cccccgtctc tccagagagt tccagcacag aggagaagag cagctcacag ccaagcagct      960

gctgctctga ccccagcaag ccgggtggga atgttgaggg cgccacgcag tctctggcgg     1020

agcagatgag gaagatcgcc ttggagtccg aggggcgccc tgaggaacag atggagtcgg     1080

ataactgttc aggaggagat gatgactgga cccatctgtc ttcaaaagaa gtggacccgt     1140

ctacaggtga actccagtcc ctacagatgc cagaatccga agggccaagc tctctggacc     1200

cctcccagga gggacccaca gggctgaagg aagctgcctt gtacccacat ctcccgccag     1260

aggctgaccc gcggctgatt gagtccctct cccagatgct gtccatgggc ttctctgatg     1320

aaggcggctg gctcaccagg ctcctgcaga ccaagaacta tgacatcgga gcggctctgg     1380

acaccatcca gtattcaaag catcccccgc cgttgtgacc acttttgccc acctcttctg     1440

cgtgcccctc ttctgtctca tagttgtgtt aagcttgcgt agaattgcag gtctctgtac     1500

gggccagttt ctctgccttc ttccaggatc aggggttagg gtgcaagaag ccatttaggg     1560

cagcaaaaca agtgacatga agggagggtc cctgtgtgtg tgtgtgctga tgtttcctgg     1620

gtgccctggc tccttgcagc agggctgggc ctgcgagacc caaggctcac tgcagcgcgc     1680

tcctgacccc tccctgcagg ggctacgtta gcagcccagc acatagcttg cctaatggct     1740

ttcactttct cttttgtttt aaatgactca taggtccctg acatttagtt gattattttc     1800

tgctacagac ctggtacact ctgattttag ataaagtaag cctaggtgtt gtcagcaggc     1860

aggctgggga ggccagtgtt gtgggcttcc tgctgggact gagaaggctc acgaagggca     1920

tccgcaatgt tggtttcact gagagctgcc tcctggtctc ttcaccactg tagttctctc     1980

atttccaaac catcagctgc ttttaaaata agatctcttt gtagccatcc tgttaaattt     2040

gtaaacaatc taattaaatg gcatcagcac tttaaccaat gacgtttgca tagagagaaa     2100

tgattgacag taagtttatt gttaatggtt cttacagagt atctttaaaa gtgccttagg     2160

ggaaccctgt ccctcctaac aagtgtatct cgattaataa cctgccagtc ccagatcaca     2220

catcatcatc gaagtcttcc ccagttataa agaggtcaca tagtcgtgtg ggtcgaggat     2280

tctgtgcctc caggaccagg ggcccaccct ctgcccaggg agtccttgcg tcccatgagg     2340

tcttcccgca aggcctctca gacccagatg tgacggggtg tgtggcccga ggaagctgga     2400

cagcggcagt gggcctgctg aggccttctc ttgaggcctg tgctctgggg gtcccttgct     2460

tagcctgtgc tggaccagct ggcctggggt ccctctgaag agaccttggc tgctcactgt     2520

ccacatgtga actttttcta ggtggcagga caaattgcgc ccatttagag gatgtggctg     2580

taacctgctg gatgggactc catagctcct tcccaggacc cctcagctcc ccggcactgc     2640

agtctgcaga gttctcctgg aggcaggggc tgctgccttg tttcaccttc catgtcaggc     2700

cagcctgtcc ctgaaagaga agatggccat gccctccatg tgtaagaaca atgccagggc     2760

ccaggaggac cgcctgccct gcctgggcct tggctgggcc tctggttctg acactttctg     2820

ctggaagctg tcaggctggg acaggctttg attttgaggg ttagcaagac aaagcaaata     2880

aatgccttcc acctcaccgc aaaaaaaaaa aaaaaaaaaa aaa                       2923


<210>  235
<211>  440
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens sequestosome 1 (SQSTM1), transcript variant 1, 
       polypeptide NCBI Reference Sequence: NM_003900.4

<400>  235

Met Ala Ser Leu Thr Val Lys Ala Tyr Leu Leu Gly Lys Glu Asp Ala 
1               5                   10                  15      


Ala Arg Glu Ile Arg Arg Phe Ser Phe Cys Cys Ser Pro Glu Pro Glu 
            20                  25                  30          


Ala Glu Ala Glu Ala Ala Ala Gly Pro Gly Pro Cys Glu Arg Leu Leu 
        35                  40                  45              


Ser Arg Val Ala Ala Leu Phe Pro Ala Leu Arg Pro Gly Gly Phe Gln 
    50                  55                  60                  


Ala His Tyr Arg Asp Glu Asp Gly Asp Leu Val Ala Phe Ser Ser Asp 
65                  70                  75                  80  


Glu Glu Leu Thr Met Ala Met Ser Tyr Val Lys Asp Asp Ile Phe Arg 
                85                  90                  95      


Ile Tyr Ile Lys Glu Lys Lys Glu Cys Arg Arg Asp His Arg Pro Pro 
            100                 105                 110         


Cys Ala Gln Glu Ala Pro Arg Asn Met Val His Pro Asn Val Ile Cys 
        115                 120                 125             


Asp Gly Cys Asn Gly Pro Val Val Gly Thr Arg Tyr Lys Cys Ser Val 
    130                 135                 140                 


Cys Pro Asp Tyr Asp Leu Cys Ser Val Cys Glu Gly Lys Gly Leu His 
145                 150                 155                 160 


Arg Gly His Thr Lys Leu Ala Phe Pro Ser Pro Phe Gly His Leu Ser 
                165                 170                 175     


Glu Gly Phe Ser His Ser Arg Trp Leu Arg Lys Val Lys His Gly His 
            180                 185                 190         


Phe Gly Trp Pro Gly Trp Glu Met Gly Pro Pro Gly Asn Trp Ser Pro 
        195                 200                 205             


Arg Pro Pro Arg Ala Gly Glu Ala Arg Pro Gly Pro Thr Ala Glu Ser 
    210                 215                 220                 


Ala Ser Gly Pro Ser Glu Asp Pro Ser Val Asn Phe Leu Lys Asn Val 
225                 230                 235                 240 


Gly Glu Ser Val Ala Ala Ala Leu Ser Pro Leu Gly Ile Glu Val Asp 
                245                 250                 255     


Ile Asp Val Glu His Gly Gly Lys Arg Ser Arg Leu Thr Pro Val Ser 
            260                 265                 270         


Pro Glu Ser Ser Ser Thr Glu Glu Lys Ser Ser Ser Gln Pro Ser Ser 
        275                 280                 285             


Cys Cys Ser Asp Pro Ser Lys Pro Gly Gly Asn Val Glu Gly Ala Thr 
    290                 295                 300                 


Gln Ser Leu Ala Glu Gln Met Arg Lys Ile Ala Leu Glu Ser Glu Gly 
305                 310                 315                 320 


Arg Pro Glu Glu Gln Met Glu Ser Asp Asn Cys Ser Gly Gly Asp Asp 
                325                 330                 335     


Asp Trp Thr His Leu Ser Ser Lys Glu Val Asp Pro Ser Thr Gly Glu 
            340                 345                 350         


Leu Gln Ser Leu Gln Met Pro Glu Ser Glu Gly Pro Ser Ser Leu Asp 
        355                 360                 365             


Pro Ser Gln Glu Gly Pro Thr Gly Leu Lys Glu Ala Ala Leu Tyr Pro 
    370                 375                 380                 


His Leu Pro Pro Glu Ala Asp Pro Arg Leu Ile Glu Ser Leu Ser Gln 
385                 390                 395                 400 


Met Leu Ser Met Gly Phe Ser Asp Glu Gly Gly Trp Leu Thr Arg Leu 
                405                 410                 415     


Leu Gln Thr Lys Asn Tyr Asp Ile Gly Ala Ala Leu Asp Thr Ile Gln 
            420                 425                 430         


Tyr Ser Lys His Pro Pro Pro Leu 
        435                 440 


<210>  236
<211>  2931
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens sequestosome 1 (SQSTM1), transcript variant 2, mRNA
       NCBI Reference Sequence: NM_001142298.1

<400>  236
gcgtcggctt ccggccgcct tccgcggcca ccgccgggcc cgctcccgcc gccgacgccc       60

aggtgcgcca ggtgcgggcc gggcgggggt cgcgctcacc tttctggccg ctgagtgccg      120

cgtaccagga cagcgagagg aaggcgcaca ggcagaagag cagcagcgtc aggaaggtgc      180

cattgcggag cctcatctcc tcggtgtctg cgagattaat ctctcatggc cgctgcacaa      240

gaacctggct tttagctgaa ctaaggagaa agtcctacaa cagtttggcg tgcaacatgg      300

ggcttgagaa aggatgagga cggggacttg gttgcctttt ccagtgacga ggaattgaca      360

atggccatgt cctacgtgaa ggatgacatc ttccgaatct acattaaaga gaaaaaagag      420

tgccggcggg accaccgccc accgtgtgct caggaggcgc cccgcaacat ggtgcacccc      480

aatgtgatct gcgatggctg caatgggcct gtggtaggaa cccgctacaa gtgcagcgtc      540

tgcccagact acgacttgtg tagcgtctgc gagggaaagg gcttgcaccg ggggcacacc      600

aagctcgcat tccccagccc cttcgggcac ctgtctgagg gcttctcgca cagccgctgg      660

ctccggaagg tgaaacacgg acacttcggg tggccaggat gggaaatggg tccaccagga      720

aactggagcc cacgtcctcc tcgtgcaggg gaggcccgcc ctggccccac ggcagaatca      780

gcttctggtc catcggagga tccgagtgtg aatttcctga agaacgttgg ggagagtgtg      840

gcagctgccc ttagccctct gggcattgaa gttgatatcg atgtggagca cggagggaaa      900

agaagccgcc tgacccccgt ctctccagag agttccagca cagaggagaa gagcagctca      960

cagccaagca gctgctgctc tgaccccagc aagccgggtg ggaatgttga gggcgccacg     1020

cagtctctgg cggagcagat gaggaagatc gccttggagt ccgaggggcg ccctgaggaa     1080

cagatggagt cggataactg ttcaggagga gatgatgact ggacccatct gtcttcaaaa     1140

gaagtggacc cgtctacagg tgaactccag tccctacaga tgccagaatc cgaagggcca     1200

agctctctgg acccctccca ggagggaccc acagggctga aggaagctgc cttgtaccca     1260

catctcccgc cagaggctga cccgcggctg attgagtccc tctcccagat gctgtccatg     1320

ggcttctctg atgaaggcgg ctggctcacc aggctcctgc agaccaagaa ctatgacatc     1380

ggagcggctc tggacaccat ccagtattca aagcatcccc cgccgttgtg accacttttg     1440

cccacctctt ctgcgtgccc ctcttctgtc tcatagttgt gttaagcttg cgtagaattg     1500

caggtctctg tacgggccag tttctctgcc ttcttccagg atcaggggtt agggtgcaag     1560

aagccattta gggcagcaaa acaagtgaca tgaagggagg gtccctgtgt gtgtgtgtgc     1620

tgatgtttcc tgggtgccct ggctccttgc agcagggctg ggcctgcgag acccaaggct     1680

cactgcagcg cgctcctgac ccctccctgc aggggctacg ttagcagccc agcacatagc     1740

ttgcctaatg gctttcactt tctcttttgt tttaaatgac tcataggtcc ctgacattta     1800

gttgattatt ttctgctaca gacctggtac actctgattt tagataaagt aagcctaggt     1860

gttgtcagca ggcaggctgg ggaggccagt gttgtgggct tcctgctggg actgagaagg     1920

ctcacgaagg gcatccgcaa tgttggtttc actgagagct gcctcctggt ctcttcacca     1980

ctgtagttct ctcatttcca aaccatcagc tgcttttaaa ataagatctc tttgtagcca     2040

tcctgttaaa tttgtaaaca atctaattaa atggcatcag cactttaacc aatgacgttt     2100

gcatagagag aaatgattga cagtaagttt attgttaatg gttcttacag agtatcttta     2160

aaagtgcctt aggggaaccc tgtccctcct aacaagtgta tctcgattaa taacctgcca     2220

gtcccagatc acacatcatc atcgaagtct tccccagtta taaagaggtc acatagtcgt     2280

gtgggtcgag gattctgtgc ctccaggacc aggggcccac cctctgccca gggagtcctt     2340

gcgtcccatg aggtcttccc gcaaggcctc tcagacccag atgtgacggg gtgtgtggcc     2400

cgaggaagct ggacagcggc agtgggcctg ctgaggcctt ctcttgaggc ctgtgctctg     2460

ggggtccctt gcttagcctg tgctggacca gctggcctgg ggtccctctg aagagacctt     2520

ggctgctcac tgtccacatg tgaacttttt ctaggtggca ggacaaattg cgcccattta     2580

gaggatgtgg ctgtaacctg ctggatggga ctccatagct ccttcccagg acccctcagc     2640

tccccggcac tgcagtctgc agagttctcc tggaggcagg ggctgctgcc ttgtttcacc     2700

ttccatgtca ggccagcctg tccctgaaag agaagatggc catgccctcc atgtgtaaga     2760

acaatgccag ggcccaggag gaccgcctgc cctgcctggg ccttggctgg gcctctggtt     2820

ctgacacttt ctgctggaag ctgtcaggct gggacaggct ttgattttga gggttagcaa     2880

gacaaagcaa ataaatgcct tccacctcac cgcaaaaaaa aaaaaaaaaa a              2931


<210>  237
<211>  356
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens sequestosome 1 (SQSTM1), transcript variant 2, 
       polypeptide NCBI Reference Sequence: NM_001142298.1

<400>  237

Met Ala Met Ser Tyr Val Lys Asp Asp Ile Phe Arg Ile Tyr Ile Lys 
1               5                   10                  15      


Glu Lys Lys Glu Cys Arg Arg Asp His Arg Pro Pro Cys Ala Gln Glu 
            20                  25                  30          


Ala Pro Arg Asn Met Val His Pro Asn Val Ile Cys Asp Gly Cys Asn 
        35                  40                  45              


Gly Pro Val Val Gly Thr Arg Tyr Lys Cys Ser Val Cys Pro Asp Tyr 
    50                  55                  60                  


Asp Leu Cys Ser Val Cys Glu Gly Lys Gly Leu His Arg Gly His Thr 
65                  70                  75                  80  


Lys Leu Ala Phe Pro Ser Pro Phe Gly His Leu Ser Glu Gly Phe Ser 
                85                  90                  95      


His Ser Arg Trp Leu Arg Lys Val Lys His Gly His Phe Gly Trp Pro 
            100                 105                 110         


Gly Trp Glu Met Gly Pro Pro Gly Asn Trp Ser Pro Arg Pro Pro Arg 
        115                 120                 125             


Ala Gly Glu Ala Arg Pro Gly Pro Thr Ala Glu Ser Ala Ser Gly Pro 
    130                 135                 140                 


Ser Glu Asp Pro Ser Val Asn Phe Leu Lys Asn Val Gly Glu Ser Val 
145                 150                 155                 160 


Ala Ala Ala Leu Ser Pro Leu Gly Ile Glu Val Asp Ile Asp Val Glu 
                165                 170                 175     


His Gly Gly Lys Arg Ser Arg Leu Thr Pro Val Ser Pro Glu Ser Ser 
            180                 185                 190         


Ser Thr Glu Glu Lys Ser Ser Ser Gln Pro Ser Ser Cys Cys Ser Asp 
        195                 200                 205             


Pro Ser Lys Pro Gly Gly Asn Val Glu Gly Ala Thr Gln Ser Leu Ala 
    210                 215                 220                 


Glu Gln Met Arg Lys Ile Ala Leu Glu Ser Glu Gly Arg Pro Glu Glu 
225                 230                 235                 240 


Gln Met Glu Ser Asp Asn Cys Ser Gly Gly Asp Asp Asp Trp Thr His 
                245                 250                 255     


Leu Ser Ser Lys Glu Val Asp Pro Ser Thr Gly Glu Leu Gln Ser Leu 
            260                 265                 270         


Gln Met Pro Glu Ser Glu Gly Pro Ser Ser Leu Asp Pro Ser Gln Glu 
        275                 280                 285             


Gly Pro Thr Gly Leu Lys Glu Ala Ala Leu Tyr Pro His Leu Pro Pro 
    290                 295                 300                 


Glu Ala Asp Pro Arg Leu Ile Glu Ser Leu Ser Gln Met Leu Ser Met 
305                 310                 315                 320 


Gly Phe Ser Asp Glu Gly Gly Trp Leu Thr Arg Leu Leu Gln Thr Lys 
                325                 330                 335     


Asn Tyr Asp Ile Gly Ala Ala Leu Asp Thr Ile Gln Tyr Ser Lys His 
            340                 345                 350         


Pro Pro Pro Leu 
        355     


<210>  238
<211>  2848
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens sequestosome 1 (SQSTM1), transcript variant 3, mRNA
       NCBI Reference Sequence: NM_001142299.1

<400>  238
ggatttaaag gggccgcagc accgccgtcg ccggcgccgc gagggggtgg ggtgggggcc       60

ggcggccggg atcccgatcg gctcccgcag ccccgcgtgg gctcgtgcga gtcggcctca      120

gtgtctgcga gattaatctc tcatggccgc tgcacaagaa cctggctttt agctgaacta      180

aggagaaagt cctacaacag tttggcgtgc aacatggggc ttgagaaagg atgaggacgg      240

ggacttggtt gccttttcca gtgacgagga attgacaatg gccatgtcct acgtgaagga      300

tgacatcttc cgaatctaca ttaaagagaa aaaagagtgc cggcgggacc accgcccacc      360

gtgtgctcag gaggcgcccc gcaacatggt gcaccccaat gtgatctgcg atggctgcaa      420

tgggcctgtg gtaggaaccc gctacaagtg cagcgtctgc ccagactacg acttgtgtag      480

cgtctgcgag ggaaagggct tgcaccgggg gcacaccaag ctcgcattcc ccagcccctt      540

cgggcacctg tctgagggct tctcgcacag ccgctggctc cggaaggtga aacacggaca      600

cttcgggtgg ccaggatggg aaatgggtcc accaggaaac tggagcccac gtcctcctcg      660

tgcaggggag gcccgccctg gccccacggc agaatcagct tctggtccat cggaggatcc      720

gagtgtgaat ttcctgaaga acgttgggga gagtgtggca gctgccctta gccctctggg      780

cattgaagtt gatatcgatg tggagcacgg agggaaaaga agccgcctga cccccgtctc      840

tccagagagt tccagcacag aggagaagag cagctcacag ccaagcagct gctgctctga      900

ccccagcaag ccgggtggga atgttgaggg cgccacgcag tctctggcgg agcagatgag      960

gaagatcgcc ttggagtccg aggggcgccc tgaggaacag atggagtcgg ataactgttc     1020

aggaggagat gatgactgga cccatctgtc ttcaaaagaa gtggacccgt ctacaggtga     1080

actccagtcc ctacagatgc cagaatccga agggccaagc tctctggacc cctcccagga     1140

gggacccaca gggctgaagg aagctgcctt gtacccacat ctcccgccag aggctgaccc     1200

gcggctgatt gagtccctct cccagatgct gtccatgggc ttctctgatg aaggcggctg     1260

gctcaccagg ctcctgcaga ccaagaacta tgacatcgga gcggctctgg acaccatcca     1320

gtattcaaag catcccccgc cgttgtgacc acttttgccc acctcttctg cgtgcccctc     1380

ttctgtctca tagttgtgtt aagcttgcgt agaattgcag gtctctgtac gggccagttt     1440

ctctgccttc ttccaggatc aggggttagg gtgcaagaag ccatttaggg cagcaaaaca     1500

agtgacatga agggagggtc cctgtgtgtg tgtgtgctga tgtttcctgg gtgccctggc     1560

tccttgcagc agggctgggc ctgcgagacc caaggctcac tgcagcgcgc tcctgacccc     1620

tccctgcagg ggctacgtta gcagcccagc acatagcttg cctaatggct ttcactttct     1680

cttttgtttt aaatgactca taggtccctg acatttagtt gattattttc tgctacagac     1740

ctggtacact ctgattttag ataaagtaag cctaggtgtt gtcagcaggc aggctgggga     1800

ggccagtgtt gtgggcttcc tgctgggact gagaaggctc acgaagggca tccgcaatgt     1860

tggtttcact gagagctgcc tcctggtctc ttcaccactg tagttctctc atttccaaac     1920

catcagctgc ttttaaaata agatctcttt gtagccatcc tgttaaattt gtaaacaatc     1980

taattaaatg gcatcagcac tttaaccaat gacgtttgca tagagagaaa tgattgacag     2040

taagtttatt gttaatggtt cttacagagt atctttaaaa gtgccttagg ggaaccctgt     2100

ccctcctaac aagtgtatct cgattaataa cctgccagtc ccagatcaca catcatcatc     2160

gaagtcttcc ccagttataa agaggtcaca tagtcgtgtg ggtcgaggat tctgtgcctc     2220

caggaccagg ggcccaccct ctgcccaggg agtccttgcg tcccatgagg tcttcccgca     2280

aggcctctca gacccagatg tgacggggtg tgtggcccga ggaagctgga cagcggcagt     2340

gggcctgctg aggccttctc ttgaggcctg tgctctgggg gtcccttgct tagcctgtgc     2400

tggaccagct ggcctggggt ccctctgaag agaccttggc tgctcactgt ccacatgtga     2460

actttttcta ggtggcagga caaattgcgc ccatttagag gatgtggctg taacctgctg     2520

gatgggactc catagctcct tcccaggacc cctcagctcc ccggcactgc agtctgcaga     2580

gttctcctgg aggcaggggc tgctgccttg tttcaccttc catgtcaggc cagcctgtcc     2640

ctgaaagaga agatggccat gccctccatg tgtaagaaca atgccagggc ccaggaggac     2700

cgcctgccct gcctgggcct tggctgggcc tctggttctg acactttctg ctggaagctg     2760

tcaggctggg acaggctttg attttgaggg ttagcaagac aaagcaaata aatgccttcc     2820

acctcaccgc aaaaaaaaaa aaaaaaaa                                        2848


<210>  239
<211>  356
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens sequestosome 1 (SQSTM1), transcript variant 3, 
       polypeptide

       NCBI Reference Sequence: NM_001142299.1

<400>  239

Met Ala Met Ser Tyr Val Lys Asp Asp Ile Phe Arg Ile Tyr Ile Lys 
1               5                   10                  15      


Glu Lys Lys Glu Cys Arg Arg Asp His Arg Pro Pro Cys Ala Gln Glu 
            20                  25                  30          


Ala Pro Arg Asn Met Val His Pro Asn Val Ile Cys Asp Gly Cys Asn 
        35                  40                  45              


Gly Pro Val Val Gly Thr Arg Tyr Lys Cys Ser Val Cys Pro Asp Tyr 
    50                  55                  60                  


Asp Leu Cys Ser Val Cys Glu Gly Lys Gly Leu His Arg Gly His Thr 
65                  70                  75                  80  


Lys Leu Ala Phe Pro Ser Pro Phe Gly His Leu Ser Glu Gly Phe Ser 
                85                  90                  95      


His Ser Arg Trp Leu Arg Lys Val Lys His Gly His Phe Gly Trp Pro 
            100                 105                 110         


Gly Trp Glu Met Gly Pro Pro Gly Asn Trp Ser Pro Arg Pro Pro Arg 
        115                 120                 125             


Ala Gly Glu Ala Arg Pro Gly Pro Thr Ala Glu Ser Ala Ser Gly Pro 
    130                 135                 140                 


Ser Glu Asp Pro Ser Val Asn Phe Leu Lys Asn Val Gly Glu Ser Val 
145                 150                 155                 160 


Ala Ala Ala Leu Ser Pro Leu Gly Ile Glu Val Asp Ile Asp Val Glu 
                165                 170                 175     


His Gly Gly Lys Arg Ser Arg Leu Thr Pro Val Ser Pro Glu Ser Ser 
            180                 185                 190         


Ser Thr Glu Glu Lys Ser Ser Ser Gln Pro Ser Ser Cys Cys Ser Asp 
        195                 200                 205             


Pro Ser Lys Pro Gly Gly Asn Val Glu Gly Ala Thr Gln Ser Leu Ala 
    210                 215                 220                 


Glu Gln Met Arg Lys Ile Ala Leu Glu Ser Glu Gly Arg Pro Glu Glu 
225                 230                 235                 240 


Gln Met Glu Ser Asp Asn Cys Ser Gly Gly Asp Asp Asp Trp Thr His 
                245                 250                 255     


Leu Ser Ser Lys Glu Val Asp Pro Ser Thr Gly Glu Leu Gln Ser Leu 
            260                 265                 270         


Gln Met Pro Glu Ser Glu Gly Pro Ser Ser Leu Asp Pro Ser Gln Glu 
        275                 280                 285             


Gly Pro Thr Gly Leu Lys Glu Ala Ala Leu Tyr Pro His Leu Pro Pro 
    290                 295                 300                 


Glu Ala Asp Pro Arg Leu Ile Glu Ser Leu Ser Gln Met Leu Ser Met 
305                 310                 315                 320 


Gly Phe Ser Asp Glu Gly Gly Trp Leu Thr Arg Leu Leu Gln Thr Lys 
                325                 330                 335     


Asn Tyr Asp Ile Gly Ala Ala Leu Asp Thr Ile Gln Tyr Ser Lys His 
            340                 345                 350         


Pro Pro Pro Leu 
        355     


<210>  240
<211>  1245
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human ferritin (FTH1) mRNA GenBank Accession No.: NM_002032.2  
       GI:56682958

<400>  240
ataagagacc acaagcgacc cgcagggcca gacgttcttc gccgagagtc gtcggggttt       60

cctgcttcaa cagtgcttgg acggaacccg gcgctcgttc cccaccccgg ccggccgccc      120

atagccagcc ctccgtcacc tcttcaccgc accctcggac tgccccaagg cccccgccgc      180

cgctccagcg ccgcgcagcc accgccgccg ccgccgcctc tccttagtcg ccgccatgac      240

gaccgcgtcc acctcgcagg tgcgccagaa ctaccaccag gactcagagg ccgccatcaa      300

ccgccagatc aacctggagc tctacgcctc ctacgtttac ctgtccatgt cttactactt      360

tgaccgcgat gatgtggctt tgaagaactt tgccaaatac tttcttcacc aatctcatga      420

ggagagggaa catgctgaga aactgatgaa gctgcagaac caacgaggtg gccgaatctt      480

ccttcaggat atcaagaaac cagactgtga tgactgggag agcgggctga atgcaatgga      540

gtgtgcatta catttggaaa aaaatgtgaa tcagtcacta ctggaactgc acaaactggc      600

cactgacaaa aatgaccccc atttgtgtga cttcattgag acacattacc tgaatgagca      660

ggtgaaagcc atcaaagaat tgggtgacca cgtgaccaac ttgcgcaaga tgggagcgcc      720

cgaatctggc ttggcggaat atctctttga caagcacacc ctgggagaca gtgataatga      780

aagctaagcc tcgggctaat ttccccatag ccgtggggtg acttccctgg tcaccaaggc      840

agtgcatgca tgttggggtt tcctttacct tttctataag ttgtaccaaa acatccactt      900

aagttctttg atttgtacca ttccttcaaa taaagaaatt tggtacccag gtgttgtctt      960

tgaggtcttg ggatgaatca gaaatctatc caggctatct tccagattcc ttaagtgccg     1020

ttgttcagtt ctaatcacac taatcaaaaa gaaacgagta tttgtattta ttaaactcat     1080

tagtttgggc agtatactaa ggtgtggctg tcttggattc agatagaact aagggttccc     1140

gactctgaat ccagagtctg agttaaatgt ttccaatggt tcagtctagc tttcacagtt     1200

tttatgaata aaaggcatta aaggctgaaa aaaaaaaaaa aaaaa                     1245


<210>  241
<211>  183
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human ferritin (FTH1) polypeptide GenBank Accession No.: 
       NP_002023.2  GI:56682959

<400>  241

Met Thr Thr Ala Ser Thr Ser Gln Val Arg Gln Asn Tyr His Gln Asp 
1               5                   10                  15      


Ser Glu Ala Ala Ile Asn Arg Gln Ile Asn Leu Glu Leu Tyr Ala Ser 
            20                  25                  30          


Tyr Val Tyr Leu Ser Met Ser Tyr Tyr Phe Asp Arg Asp Asp Val Ala 
        35                  40                  45              


Leu Lys Asn Phe Ala Lys Tyr Phe Leu His Gln Ser His Glu Glu Arg 
    50                  55                  60                  


Glu His Ala Glu Lys Leu Met Lys Leu Gln Asn Gln Arg Gly Gly Arg 
65                  70                  75                  80  


Ile Phe Leu Gln Asp Ile Lys Lys Pro Asp Cys Asp Asp Trp Glu Ser 
                85                  90                  95      


Gly Leu Asn Ala Met Glu Cys Ala Leu His Leu Glu Lys Asn Val Asn 
            100                 105                 110         


Gln Ser Leu Leu Glu Leu His Lys Leu Ala Thr Asp Lys Asn Asp Pro 
        115                 120                 125             


His Leu Cys Asp Phe Ile Glu Thr His Tyr Leu Asn Glu Gln Val Lys 
    130                 135                 140                 


Ala Ile Lys Glu Leu Gly Asp His Val Thr Asn Leu Arg Lys Met Gly 
145                 150                 155                 160 


Ala Pro Glu Ser Gly Leu Ala Glu Tyr Leu Phe Asp Lys His Thr Leu 
                165                 170                 175     


Gly Asp Ser Asp Asn Glu Ser 
            180             


<210>  242
<211>  4815
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human E-cadherin (CDH1) mRNA GenBank Accession No.: NM_004360.3 
       GI:169790842

<400>  242
agtggcgtcg gaactgcaaa gcacctgtga gcttgcggaa gtcagttcag actccagccc       60

gctccagccc ggcccgaccc gaccgcaccc ggcgcctgcc ctcgctcggc gtccccggcc      120

agccatgggc ccttggagcc gcagcctctc ggcgctgctg ctgctgctgc aggtctcctc      180

ttggctctgc caggagccgg agccctgcca ccctggcttt gacgccgaga gctacacgtt      240

cacggtgccc cggcgccacc tggagagagg ccgcgtcctg ggcagagtga attttgaaga      300

ttgcaccggt cgacaaagga cagcctattt ttccctcgac acccgattca aagtgggcac      360

agatggtgtg attacagtca aaaggcctct acggtttcat aacccacaga tccatttctt      420

ggtctacgcc tgggactcca cctacagaaa gttttccacc aaagtcacgc tgaatacagt      480

ggggcaccac caccgccccc cgccccatca ggcctccgtt tctggaatcc aagcagaatt      540

gctcacattt cccaactcct ctcctggcct cagaagacag aagagagact gggttattcc      600

tcccatcagc tgcccagaaa atgaaaaagg cccatttcct aaaaacctgg ttcagatcaa      660

atccaacaaa gacaaagaag gcaaggtttt ctacagcatc actggccaag gagctgacac      720

accccctgtt ggtgtcttta ttattgaaag agaaacagga tggctgaagg tgacagagcc      780

tctggataga gaacgcattg ccacatacac tctcttctct cacgctgtgt catccaacgg      840

gaatgcagtt gaggatccaa tggagatttt gatcacggta accgatcaga atgacaacaa      900

gcccgaattc acccaggagg tctttaaggg gtctgtcatg gaaggtgctc ttccaggaac      960

ctctgtgatg gaggtcacag ccacagacgc ggacgatgat gtgaacacct acaatgccgc     1020

catcgcttac accatcctca gccaagatcc tgagctccct gacaaaaata tgttcaccat     1080

taacaggaac acaggagtca tcagtgtggt caccactggg ctggaccgag agagtttccc     1140

tacgtatacc ctggtggttc aagctgctga ccttcaaggt gaggggttaa gcacaacagc     1200

aacagctgtg atcacagtca ctgacaccaa cgataatcct ccgatcttca atcccaccac     1260

gtacaagggt caggtgcctg agaacgaggc taacgtcgta atcaccacac tgaaagtgac     1320

tgatgctgat gcccccaata ccccagcgtg ggaggctgta tacaccatat tgaatgatga     1380

tggtggacaa tttgtcgtca ccacaaatcc agtgaacaac gatggcattt tgaaaacagc     1440

aaagggcttg gattttgagg ccaagcagca gtacattcta cacgtagcag tgacgaatgt     1500

ggtacctttt gaggtctctc tcaccacctc cacagccacc gtcaccgtgg atgtgctgga     1560

tgtgaatgaa gcccccatct ttgtgcctcc tgaaaagaga gtggaagtgt ccgaggactt     1620

tggcgtgggc caggaaatca catcctacac tgcccaggag ccagacacat ttatggaaca     1680

gaaaataaca tatcggattt ggagagacac tgccaactgg ctggagatta atccggacac     1740

tggtgccatt tccactcggg ctgagctgga cagggaggat tttgagcacg tgaagaacag     1800

cacgtacaca gccctaatca tagctacaga caatggttct ccagttgcta ctggaacagg     1860

gacacttctg ctgatcctgt ctgatgtgaa tgacaacgcc cccataccag aacctcgaac     1920

tatattcttc tgtgagagga atccaaagcc tcaggtcata aacatcattg atgcagacct     1980

tcctcccaat acatctccct tcacagcaga actaacacac ggggcgagtg ccaactggac     2040

cattcagtac aacgacccaa cccaagaatc tatcattttg aagccaaaga tggccttaga     2100

ggtgggtgac tacaaaatca atctcaagct catggataac cagaataaag accaagtgac     2160

caccttagag gtcagcgtgt gtgactgtga aggggccgct ggcgtctgta ggaaggcaca     2220

gcctgtcgaa gcaggattgc aaattcctgc cattctgggg attcttggag gaattcttgc     2280

tttgctaatt ctgattctgc tgctcttgct gtttcttcgg aggagagcgg tggtcaaaga     2340

gcccttactg cccccagagg atgacacccg ggacaacgtt tattactatg atgaagaagg     2400

aggcggagaa gaggaccagg actttgactt gagccagctg cacaggggcc tggacgctcg     2460

gcctgaagtg actcgtaacg acgttgcacc aaccctcatg agtgtccccc ggtatcttcc     2520

ccgccctgcc aatcccgatg aaattggaaa ttttattgat gaaaatctga aagcggctga     2580

tactgacccc acagccccgc cttatgattc tctgctcgtg tttgactatg aaggaagcgg     2640

ttccgaagct gctagtctga gctccctgaa ctcctcagag tcagacaaag accaggacta     2700

tgactacttg aacgaatggg gcaatcgctt caagaagctg gctgacatgt acggaggcgg     2760

cgaggacgac taggggactc gagagaggcg ggccccagac ccatgtgctg ggaaatgcag     2820

aaatcacgtt gctggtggtt tttcagctcc cttcccttga gatgagtttc tggggaaaaa     2880

aaagagactg gttagtgatg cagttagtat agctttatac tctctccact ttatagctct     2940

aataagtttg tgttagaaaa gtttcgactt atttcttaaa gctttttttt ttttcccatc     3000

actctttaca tggtggtgat gtccaaaaga tacccaaatt ttaatattcc agaagaacaa     3060

ctttagcatc agaaggttca cccagcacct tgcagatttt cttaaggaat tttgtctcac     3120

ttttaaaaag aaggggagaa gtcagctact ctagttctgt tgttttgtgt atataatttt     3180

ttaaaaaaaa tttgtgtgct tctgctcatt actacactgg tgtgtccctc tgcctttttt     3240

ttttttttaa gacagggtct cattctatcg gccaggctgg agtgcagtgg tgcaatcaca     3300

gctcactgca gccttgtcct cccaggctca agctatcctt gcacctcagc ctcccaagta     3360

gctgggacca caggcatgca ccactacgca tgactaattt tttaaatatt tgagacgggg     3420

tctccctgtg ttacccaggc tggtctcaaa ctcctgggct caagtgatcc tcccatcttg     3480

gcctcccaga gtattgggat tacagacatg agccactgca cctgcccagc tccccaactc     3540

cctgccattt tttaagagac agtttcgctc catcgcccag gcctgggatg cagtgatgtg     3600

atcatagctc actgtaacct caaactctgg ggctcaagca gttctcccac cagcctcctt     3660

tttatttttt tgtacagatg gggtcttgct atgttgccca agctggtctt aaactcctgg     3720

cctcaagcaa tccttctgcc ttggcccccc aaagtgctgg gattgtgggc atgagctgct     3780

gtgcccagcc tccatgtttt aatatcaact ctcactcctg aattcagttg ctttgcccaa     3840

gataggagtt ctctgatgca gaaattattg ggctctttta gggtaagaag tttgtgtctt     3900

tgtctggcca catcttgact aggtattgtc tactctgaag acctttaatg gcttccctct     3960

ttcatctcct gagtatgtaa cttgcaatgg gcagctatcc agtgacttgt tctgagtaag     4020

tgtgttcatt aatgtttatt tagctctgaa gcaagagtga tatactccag gacttagaat     4080

agtgcctaaa gtgctgcagc caaagacaga gcggaactat gaaaagtggg cttggagatg     4140

gcaggagagc ttgtcattga gcctggcaat ttagcaaact gatgctgagg atgattgagg     4200

tgggtctacc tcatctctga aaattctgga aggaatggag gagtctcaac atgtgtttct     4260

gacacaagat ccgtggtttg tactcaaagc ccagaatccc caagtgcctg cttttgatga     4320

tgtctacaga aaatgctggc tgagctgaac acatttgccc aattccaggt gtgcacagaa     4380

aaccgagaat attcaaaatt ccaaattttt ttcttaggag caagaagaaa atgtggccct     4440

aaagggggtt agttgagggg tagggggtag tgaggatctt gatttggatc tctttttatt     4500

taaatgtgaa tttcaacttt tgacaatcaa agaaaagact tttgttgaaa tagctttact     4560

gtttctcaag tgttttggag aaaaaaatca accctgcaat cactttttgg aattgtcttg     4620

atttttcggc agttcaagct atatcgaata tagttctgtg tagagaatgt cactgtagtt     4680

ttgagtgtat acatgtgtgg gtgctgataa ttgtgtattt tctttggggg tggaaaagga     4740

aaacaattca agctgagaaa agtattctca aagatgcatt tttataaatt ttattaaaca     4800

attttgttaa accat                                                      4815


<210>  243
<211>  882
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human E-cadherin (CDH1) polypeptide GenBank Accession No.: 
       NP_004351.1  GI:4757960

<400>  243

Met Gly Pro Trp Ser Arg Ser Leu Ser Ala Leu Leu Leu Leu Leu Gln 
1               5                   10                  15      


Val Ser Ser Trp Leu Cys Gln Glu Pro Glu Pro Cys His Pro Gly Phe 
            20                  25                  30          


Asp Ala Glu Ser Tyr Thr Phe Thr Val Pro Arg Arg His Leu Glu Arg 
        35                  40                  45              


Gly Arg Val Leu Gly Arg Val Asn Phe Glu Asp Cys Thr Gly Arg Gln 
    50                  55                  60                  


Arg Thr Ala Tyr Phe Ser Leu Asp Thr Arg Phe Lys Val Gly Thr Asp 
65                  70                  75                  80  


Gly Val Ile Thr Val Lys Arg Pro Leu Arg Phe His Asn Pro Gln Ile 
                85                  90                  95      


His Phe Leu Val Tyr Ala Trp Asp Ser Thr Tyr Arg Lys Phe Ser Thr 
            100                 105                 110         


Lys Val Thr Leu Asn Thr Val Gly His His His Arg Pro Pro Pro His 
        115                 120                 125             


Gln Ala Ser Val Ser Gly Ile Gln Ala Glu Leu Leu Thr Phe Pro Asn 
    130                 135                 140                 


Ser Ser Pro Gly Leu Arg Arg Gln Lys Arg Asp Trp Val Ile Pro Pro 
145                 150                 155                 160 


Ile Ser Cys Pro Glu Asn Glu Lys Gly Pro Phe Pro Lys Asn Leu Val 
                165                 170                 175     


Gln Ile Lys Ser Asn Lys Asp Lys Glu Gly Lys Val Phe Tyr Ser Ile 
            180                 185                 190         


Thr Gly Gln Gly Ala Asp Thr Pro Pro Val Gly Val Phe Ile Ile Glu 
        195                 200                 205             


Arg Glu Thr Gly Trp Leu Lys Val Thr Glu Pro Leu Asp Arg Glu Arg 
    210                 215                 220                 


Ile Ala Thr Tyr Thr Leu Phe Ser His Ala Val Ser Ser Asn Gly Asn 
225                 230                 235                 240 


Ala Val Glu Asp Pro Met Glu Ile Leu Ile Thr Val Thr Asp Gln Asn 
                245                 250                 255     


Asp Asn Lys Pro Glu Phe Thr Gln Glu Val Phe Lys Gly Ser Val Met 
            260                 265                 270         


Glu Gly Ala Leu Pro Gly Thr Ser Val Met Glu Val Thr Ala Thr Asp 
        275                 280                 285             


Ala Asp Asp Asp Val Asn Thr Tyr Asn Ala Ala Ile Ala Tyr Thr Ile 
    290                 295                 300                 


Leu Ser Gln Asp Pro Glu Leu Pro Asp Lys Asn Met Phe Thr Ile Asn 
305                 310                 315                 320 


Arg Asn Thr Gly Val Ile Ser Val Val Thr Thr Gly Leu Asp Arg Glu 
                325                 330                 335     


Ser Phe Pro Thr Tyr Thr Leu Val Val Gln Ala Ala Asp Leu Gln Gly 
            340                 345                 350         


Glu Gly Leu Ser Thr Thr Ala Thr Ala Val Ile Thr Val Thr Asp Thr 
        355                 360                 365             


Asn Asp Asn Pro Pro Ile Phe Asn Pro Thr Thr Tyr Lys Gly Gln Val 
    370                 375                 380                 


Pro Glu Asn Glu Ala Asn Val Val Ile Thr Thr Leu Lys Val Thr Asp 
385                 390                 395                 400 


Ala Asp Ala Pro Asn Thr Pro Ala Trp Glu Ala Val Tyr Thr Ile Leu 
                405                 410                 415     


Asn Asp Asp Gly Gly Gln Phe Val Val Thr Thr Asn Pro Val Asn Asn 
            420                 425                 430         


Asp Gly Ile Leu Lys Thr Ala Lys Gly Leu Asp Phe Glu Ala Lys Gln 
        435                 440                 445             


Gln Tyr Ile Leu His Val Ala Val Thr Asn Val Val Pro Phe Glu Val 
    450                 455                 460                 


Ser Leu Thr Thr Ser Thr Ala Thr Val Thr Val Asp Val Leu Asp Val 
465                 470                 475                 480 


Asn Glu Ala Pro Ile Phe Val Pro Pro Glu Lys Arg Val Glu Val Ser 
                485                 490                 495     


Glu Asp Phe Gly Val Gly Gln Glu Ile Thr Ser Tyr Thr Ala Gln Glu 
            500                 505                 510         


Pro Asp Thr Phe Met Glu Gln Lys Ile Thr Tyr Arg Ile Trp Arg Asp 
        515                 520                 525             


Thr Ala Asn Trp Leu Glu Ile Asn Pro Asp Thr Gly Ala Ile Ser Thr 
    530                 535                 540                 


Arg Ala Glu Leu Asp Arg Glu Asp Phe Glu His Val Lys Asn Ser Thr 
545                 550                 555                 560 


Tyr Thr Ala Leu Ile Ile Ala Thr Asp Asn Gly Ser Pro Val Ala Thr 
                565                 570                 575     


Gly Thr Gly Thr Leu Leu Leu Ile Leu Ser Asp Val Asn Asp Asn Ala 
            580                 585                 590         


Pro Ile Pro Glu Pro Arg Thr Ile Phe Phe Cys Glu Arg Asn Pro Lys 
        595                 600                 605             


Pro Gln Val Ile Asn Ile Ile Asp Ala Asp Leu Pro Pro Asn Thr Ser 
    610                 615                 620                 


Pro Phe Thr Ala Glu Leu Thr His Gly Ala Ser Ala Asn Trp Thr Ile 
625                 630                 635                 640 


Gln Tyr Asn Asp Pro Thr Gln Glu Ser Ile Ile Leu Lys Pro Lys Met 
                645                 650                 655     


Ala Leu Glu Val Gly Asp Tyr Lys Ile Asn Leu Lys Leu Met Asp Asn 
            660                 665                 670         


Gln Asn Lys Asp Gln Val Thr Thr Leu Glu Val Ser Val Cys Asp Cys 
        675                 680                 685             


Glu Gly Ala Ala Gly Val Cys Arg Lys Ala Gln Pro Val Glu Ala Gly 
    690                 695                 700                 


Leu Gln Ile Pro Ala Ile Leu Gly Ile Leu Gly Gly Ile Leu Ala Leu 
705                 710                 715                 720 


Leu Ile Leu Ile Leu Leu Leu Leu Leu Phe Leu Arg Arg Arg Ala Val 
                725                 730                 735     


Val Lys Glu Pro Leu Leu Pro Pro Glu Asp Asp Thr Arg Asp Asn Val 
            740                 745                 750         


Tyr Tyr Tyr Asp Glu Glu Gly Gly Gly Glu Glu Asp Gln Asp Phe Asp 
        755                 760                 765             


Leu Ser Gln Leu His Arg Gly Leu Asp Ala Arg Pro Glu Val Thr Arg 
    770                 775                 780                 


Asn Asp Val Ala Pro Thr Leu Met Ser Val Pro Arg Tyr Leu Pro Arg 
785                 790                 795                 800 


Pro Ala Asn Pro Asp Glu Ile Gly Asn Phe Ile Asp Glu Asn Leu Lys 
                805                 810                 815     


Ala Ala Asp Thr Asp Pro Thr Ala Pro Pro Tyr Asp Ser Leu Leu Val 
            820                 825                 830         


Phe Asp Tyr Glu Gly Ser Gly Ser Glu Ala Ala Ser Leu Ser Ser Leu 
        835                 840                 845             


Asn Ser Ser Glu Ser Asp Lys Asp Gln Asp Tyr Asp Tyr Leu Asn Glu 
    850                 855                 860                 


Trp Gly Asn Arg Phe Lys Lys Leu Ala Asp Met Tyr Gly Gly Gly Glu 
865                 870                 875                 880 


Asp Asp 
        


<210>  244
<211>  4380
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human N-cadherin (CDH2) mRNA GenBank Accession No.: NM_001792.3 
       GI:215422305

<400>  244
ggggagcgcc atccgctcca cttccacctc cacatcctcc accggccaag gtccccgccg       60

ctgcatccct cgcggcttcc gctgcgctcc gggccggagc cgagccgcct gcgctgccac      120

agcagccgcc tccacacact cgcagacgct cacacgctct ccctccctgt tcccccgccc      180

cctccccagc tccttgatct ctgggtctgt tttattactc ctggtgcgag tcccgcggac      240

tccgcggccc gctatttgtc atcagctcgc tctccattgg cggggagcgg agagcagcga      300

agaagggggt ggggagggga ggggaaggga agggggtgga aactgcctgg agccgtttct      360

ccgcgccgct gttggtgctg ccgctgcctc ctcctcctcc gccgccgccg ccgccgccgc      420

cgcctcctcc ggctcttcgc tcggcccctc tccgcctcca tgtgccggat agcgggagcg      480

ctgcggaccc tgctgccgct gctggcggcc ctgcttcagg cgtctgtaga ggcttctggt      540

gaaatcgcat tatgcaagac tggatttcct gaagatgttt acagtgcagt cttatcgaag      600

gatgtgcatg aaggacagcc tcttctcaat gtgaagttta gcaactgcaa tggaaaaaga      660

aaagtacaat atgagagcag tgagcctgca gattttaagg tggatgaaga tggcatggtg      720

tatgccgtga gaagctttcc actctcttct gagcatgcca agttcctgat atatgcccaa      780

gacaaagaga cccaggaaaa gtggcaagtg gcagtaaaat tgagcctgaa gccaacctta      840

actgaggagt cagtgaagga gtcagcagaa gttgaagaaa tagtgttccc aagacaattc      900

agtaagcaca gtggccacct acaaaggcag aagagagact gggtcatccc tccaatcaac      960

ttgccagaaa actccagggg accttttcct caagagcttg tcaggatcag gtctgataga     1020

gataaaaacc tttcactgcg gtacagtgta actgggccag gagctgacca gcctccaact     1080

ggtatcttca ttatcaaccc catctcgggt cagctgtcgg tgacaaagcc cctggatcgc     1140

gagcagatag cccggtttca tttgagggca catgcagtag atattaatgg aaatcaagtg     1200

gagaacccca ttgacattgt catcaatgtt attgacatga atgacaacag acctgagttc     1260

ttacaccagg tttggaatgg gacagttcct gagggatcaa agcctggaac atatgtgatg     1320

accgtaacag caattgatgc tgacgatccc aatgccctca atgggatgtt gaggtacaga     1380

atcgtgtctc aggctccaag caccccttca cccaacatgt ttacaatcaa caatgagact     1440

ggtgacatca tcacagtggc agctggactt gatcgagaaa aagtgcaaca gtatacgtta     1500

ataattcaag ctacagacat ggaaggcaat cccacatatg gcctttcaaa cacagccacg     1560

gccgtcatca cagtgacaga tgtcaatgac aatcctccag agtttactgc catgacgttt     1620

tatggtgaag ttcctgagaa cagggtagac atcatagtag ctaatctaac tgtgaccgat     1680

aaggatcaac cccatacacc agcctggaac gcagtgtaca gaatcagtgg cggagatcct     1740

actggacggt tcgccatcca gaccgaccca aacagcaacg acgggttagt caccgtggtc     1800

aaaccaatcg actttgaaac aaataggatg tttgtcctta ctgttgctgc agaaaatcaa     1860

gtgccattag ccaagggaat tcagcacccg cctcagtcaa ctgcaaccgt gtctgttaca     1920

gttattgacg taaatgaaaa cccttatttt gcccccaatc ctaagatcat tcgccaagaa     1980

gaagggcttc atgccggtac catgttgaca acattcactg ctcaggaccc agatcgatat     2040

atgcagcaaa atattagata cactaaatta tctgatcctg ccaattggct aaaaatagat     2100

cctgtgaatg gacaaataac tacaattgct gttttggacc gagaatcacc aaatgtgaaa     2160

aacaatatat ataatgctac tttccttgct tctgacaatg gaattcctcc tatgagtgga     2220

acaggaacgc tgcagatcta tttacttgat attaatgaca atgcccctca agtgttacct     2280

caagaggcag agacttgcga aactccagac cccaattcaa ttaatattac agcacttgat     2340

tatgacattg atccaaatgc tggaccattt gcttttgatc ttcctttatc tccagtgact     2400

attaagagaa attggaccat cactcggctt aatggtgatt ttgctcagct taatttaaag     2460

ataaaatttc ttgaagctgg tatctatgaa gttcccatca taatcacaga ttcgggtaat     2520

cctcccaaat caaatatttc catcctgcgc gtgaaggttt gccagtgtga ctccaacggg     2580

gactgcacag atgtggacag gattgtgggt gcggggcttg gcaccggtgc catcattgcc     2640

atcctgctct gcatcatcat cctgcttatc cttgtgctga tgtttgtggt atggatgaaa     2700

cgccgggata aagaacgcca ggccaaacaa cttttaattg atccagaaga tgatgtaaga     2760

gataatattt taaaatatga tgaagaaggt ggaggagaag aagaccagga ctatgacttg     2820

agccagctgc agcagcctga cactgtggag cctgatgcca tcaagcctgt gggaatccga     2880

cgaatggatg aaagacccat ccacgccgag ccccagtatc cggtccgatc tgcagcccca     2940

caccctggag acattgggga cttcattaat gagggcctta aagcggctga caatgacccc     3000

acagctccac catatgactc cctgttagtg tttgactatg aaggcagtgg ctccactgct     3060

gggtccttga gctcccttaa ttcctcaagt agtggtggtg agcaggacta tgattacctg     3120

aacgactggg ggccacggtt caagaaactt gctgacatgt atggtggagg tgatgactga     3180

acttcagggt gaacttggtt tttggacaag tacaaacaat ttcaactgat attcccaaaa     3240

agcattcaga agctaggctt taactttgta gtctactagc acagtgcttg ctggaggctt     3300

tggcataggc tgcaaaccaa tttgggctca gagggaatat cagtgatcca tactgtttgg     3360

aaaaacactg agctcagtta cacttgaatt ttacagtaca gaagcactgg gattttatgt     3420

gcctttttgt acctttttca gattggaatt agttttctgt ttaaggcttt aatggtactg     3480

atttctgaaa cgataagtaa aagacaaaat attttgtggt gggagcagta agttaaacca     3540

tgatatgctt caacacgctt ttgttacatt gcatttgctt ttattaaaat acaaaattaa     3600

acaaacaaaa aaactcatgg agcgatttta ttatcttggg ggatgagacc atgagattgg     3660

aaaatgtaca ttacttctag ttttagactt tagtttgttt tttttttttt cactaaaatc     3720

ttaaaactta ctcagctggt tgcaaataaa gggagttttc atatcaccaa tttgtagcaa     3780

aattgaattt tttcataaac tagaatgtta gacacatttt ggtcttaatc catgtacact     3840

tttttatttc tgtatttttc cacttcactg taaaaatagt atgtgtacat aatgttttat     3900

tggcatagtc tatggagaag tgcagaaact tcagaacatg tgtatgtatt atttggacta     3960

tggattcagg ttttttgcat gtttatatct ttcgttatgg ataaagtatt tacaaaacag     4020

tgacatttga ttcaattgtt gagctgtagt tagaatactc aatttttaat ttttttaatt     4080

tttttatttt ttattttctt tttggtttgg ggagggagaa aagttcttag cacaaatgtt     4140

ttacataatt tgtaccaaaa aaaaaaaaaa aggaaaggaa agaaaggggt ggcctgacac     4200

tggtggcact actaagtgtg tgttttttta aaaaaaaaat ggaaaaaaaa aagcttttaa     4260

actggagaga cttctgacaa cagctttgcc tctgtattgt gtaccagaat ataaatgata     4320

cacctctgac cccagcgttc tgaataaaat gctaattttg gatctggaaa aaaaaaaaaa     4380


<210>  245
<211>  906
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human N-cadherin (CDH2) polypeptide GenBank Accession No.: 
       NP_001783.2  GI:14589889

<400>  245

Met Cys Arg Ile Ala Gly Ala Leu Arg Thr Leu Leu Pro Leu Leu Ala 
1               5                   10                  15      


Ala Leu Leu Gln Ala Ser Val Glu Ala Ser Gly Glu Ile Ala Leu Cys 
            20                  25                  30          


Lys Thr Gly Phe Pro Glu Asp Val Tyr Ser Ala Val Leu Ser Lys Asp 
        35                  40                  45              


Val His Glu Gly Gln Pro Leu Leu Asn Val Lys Phe Ser Asn Cys Asn 
    50                  55                  60                  


Gly Lys Arg Lys Val Gln Tyr Glu Ser Ser Glu Pro Ala Asp Phe Lys 
65                  70                  75                  80  


Val Asp Glu Asp Gly Met Val Tyr Ala Val Arg Ser Phe Pro Leu Ser 
                85                  90                  95      


Ser Glu His Ala Lys Phe Leu Ile Tyr Ala Gln Asp Lys Glu Thr Gln 
            100                 105                 110         


Glu Lys Trp Gln Val Ala Val Lys Leu Ser Leu Lys Pro Thr Leu Thr 
        115                 120                 125             


Glu Glu Ser Val Lys Glu Ser Ala Glu Val Glu Glu Ile Val Phe Pro 
    130                 135                 140                 


Arg Gln Phe Ser Lys His Ser Gly His Leu Gln Arg Gln Lys Arg Asp 
145                 150                 155                 160 


Trp Val Ile Pro Pro Ile Asn Leu Pro Glu Asn Ser Arg Gly Pro Phe 
                165                 170                 175     


Pro Gln Glu Leu Val Arg Ile Arg Ser Asp Arg Asp Lys Asn Leu Ser 
            180                 185                 190         


Leu Arg Tyr Ser Val Thr Gly Pro Gly Ala Asp Gln Pro Pro Thr Gly 
        195                 200                 205             


Ile Phe Ile Ile Asn Pro Ile Ser Gly Gln Leu Ser Val Thr Lys Pro 
    210                 215                 220                 


Leu Asp Arg Glu Gln Ile Ala Arg Phe His Leu Arg Ala His Ala Val 
225                 230                 235                 240 


Asp Ile Asn Gly Asn Gln Val Glu Asn Pro Ile Asp Ile Val Ile Asn 
                245                 250                 255     


Val Ile Asp Met Asn Asp Asn Arg Pro Glu Phe Leu His Gln Val Trp 
            260                 265                 270         


Asn Gly Thr Val Pro Glu Gly Ser Lys Pro Gly Thr Tyr Val Met Thr 
        275                 280                 285             


Val Thr Ala Ile Asp Ala Asp Asp Pro Asn Ala Leu Asn Gly Met Leu 
    290                 295                 300                 


Arg Tyr Arg Ile Val Ser Gln Ala Pro Ser Thr Pro Ser Pro Asn Met 
305                 310                 315                 320 


Phe Thr Ile Asn Asn Glu Thr Gly Asp Ile Ile Thr Val Ala Ala Gly 
                325                 330                 335     


Leu Asp Arg Glu Lys Val Gln Gln Tyr Thr Leu Ile Ile Gln Ala Thr 
            340                 345                 350         


Asp Met Glu Gly Asn Pro Thr Tyr Gly Leu Ser Asn Thr Ala Thr Ala 
        355                 360                 365             


Val Ile Thr Val Thr Asp Val Asn Asp Asn Pro Pro Glu Phe Thr Ala 
    370                 375                 380                 


Met Thr Phe Tyr Gly Glu Val Pro Glu Asn Arg Val Asp Ile Ile Val 
385                 390                 395                 400 


Ala Asn Leu Thr Val Thr Asp Lys Asp Gln Pro His Thr Pro Ala Trp 
                405                 410                 415     


Asn Ala Val Tyr Arg Ile Ser Gly Gly Asp Pro Thr Gly Arg Phe Ala 
            420                 425                 430         


Ile Gln Thr Asp Pro Asn Ser Asn Asp Gly Leu Val Thr Val Val Lys 
        435                 440                 445             


Pro Ile Asp Phe Glu Thr Asn Arg Met Phe Val Leu Thr Val Ala Ala 
    450                 455                 460                 


Glu Asn Gln Val Pro Leu Ala Lys Gly Ile Gln His Pro Pro Gln Ser 
465                 470                 475                 480 


Thr Ala Thr Val Ser Val Thr Val Ile Asp Val Asn Glu Asn Pro Tyr 
                485                 490                 495     


Phe Ala Pro Asn Pro Lys Ile Ile Arg Gln Glu Glu Gly Leu His Ala 
            500                 505                 510         


Gly Thr Met Leu Thr Thr Phe Thr Ala Gln Asp Pro Asp Arg Tyr Met 
        515                 520                 525             


Gln Gln Asn Ile Arg Tyr Thr Lys Leu Ser Asp Pro Ala Asn Trp Leu 
    530                 535                 540                 


Lys Ile Asp Pro Val Asn Gly Gln Ile Thr Thr Ile Ala Val Leu Asp 
545                 550                 555                 560 


Arg Glu Ser Pro Asn Val Lys Asn Asn Ile Tyr Asn Ala Thr Phe Leu 
                565                 570                 575     


Ala Ser Asp Asn Gly Ile Pro Pro Met Ser Gly Thr Gly Thr Leu Gln 
            580                 585                 590         


Ile Tyr Leu Leu Asp Ile Asn Asp Asn Ala Pro Gln Val Leu Pro Gln 
        595                 600                 605             


Glu Ala Glu Thr Cys Glu Thr Pro Asp Pro Asn Ser Ile Asn Ile Thr 
    610                 615                 620                 


Ala Leu Asp Tyr Asp Ile Asp Pro Asn Ala Gly Pro Phe Ala Phe Asp 
625                 630                 635                 640 


Leu Pro Leu Ser Pro Val Thr Ile Lys Arg Asn Trp Thr Ile Thr Arg 
                645                 650                 655     


Leu Asn Gly Asp Phe Ala Gln Leu Asn Leu Lys Ile Lys Phe Leu Glu 
            660                 665                 670         


Ala Gly Ile Tyr Glu Val Pro Ile Ile Ile Thr Asp Ser Gly Asn Pro 
        675                 680                 685             


Pro Lys Ser Asn Ile Ser Ile Leu Arg Val Lys Val Cys Gln Cys Asp 
    690                 695                 700                 


Ser Asn Gly Asp Cys Thr Asp Val Asp Arg Ile Val Gly Ala Gly Leu 
705                 710                 715                 720 


Gly Thr Gly Ala Ile Ile Ala Ile Leu Leu Cys Ile Ile Ile Leu Leu 
                725                 730                 735     


Ile Leu Val Leu Met Phe Val Val Trp Met Lys Arg Arg Asp Lys Glu 
            740                 745                 750         


Arg Gln Ala Lys Gln Leu Leu Ile Asp Pro Glu Asp Asp Val Arg Asp 
        755                 760                 765             


Asn Ile Leu Lys Tyr Asp Glu Glu Gly Gly Gly Glu Glu Asp Gln Asp 
    770                 775                 780                 


Tyr Asp Leu Ser Gln Leu Gln Gln Pro Asp Thr Val Glu Pro Asp Ala 
785                 790                 795                 800 


Ile Lys Pro Val Gly Ile Arg Arg Met Asp Glu Arg Pro Ile His Ala 
                805                 810                 815     


Glu Pro Gln Tyr Pro Val Arg Ser Ala Ala Pro His Pro Gly Asp Ile 
            820                 825                 830         


Gly Asp Phe Ile Asn Glu Gly Leu Lys Ala Ala Asp Asn Asp Pro Thr 
        835                 840                 845             


Ala Pro Pro Tyr Asp Ser Leu Leu Val Phe Asp Tyr Glu Gly Ser Gly 
    850                 855                 860                 


Ser Thr Ala Gly Ser Leu Ser Ser Leu Asn Ser Ser Ser Ser Gly Gly 
865                 870                 875                 880 


Glu Gln Asp Tyr Asp Tyr Leu Asn Asp Trp Gly Pro Arg Phe Lys Lys 
                885                 890                 895     


Leu Ala Asp Met Tyr Gly Gly Gly Asp Asp 
            900                 905     


<210>  246
<211>  4380
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens cadherin 2, type 1, N-cadherin (neuronal) (CDH2), 
       mRNA NCBI Reference Sequence: NM_001792.3

<400>  246
ggggagcgcc atccgctcca cttccacctc cacatcctcc accggccaag gtccccgccg       60

ctgcatccct cgcggcttcc gctgcgctcc gggccggagc cgagccgcct gcgctgccac      120

agcagccgcc tccacacact cgcagacgct cacacgctct ccctccctgt tcccccgccc      180

cctccccagc tccttgatct ctgggtctgt tttattactc ctggtgcgag tcccgcggac      240

tccgcggccc gctatttgtc atcagctcgc tctccattgg cggggagcgg agagcagcga      300

agaagggggt ggggagggga ggggaaggga agggggtgga aactgcctgg agccgtttct      360

ccgcgccgct gttggtgctg ccgctgcctc ctcctcctcc gccgccgccg ccgccgccgc      420

cgcctcctcc ggctcttcgc tcggcccctc tccgcctcca tgtgccggat agcgggagcg      480

ctgcggaccc tgctgccgct gctggcggcc ctgcttcagg cgtctgtaga ggcttctggt      540

gaaatcgcat tatgcaagac tggatttcct gaagatgttt acagtgcagt cttatcgaag      600

gatgtgcatg aaggacagcc tcttctcaat gtgaagttta gcaactgcaa tggaaaaaga      660

aaagtacaat atgagagcag tgagcctgca gattttaagg tggatgaaga tggcatggtg      720

tatgccgtga gaagctttcc actctcttct gagcatgcca agttcctgat atatgcccaa      780

gacaaagaga cccaggaaaa gtggcaagtg gcagtaaaat tgagcctgaa gccaacctta      840

actgaggagt cagtgaagga gtcagcagaa gttgaagaaa tagtgttccc aagacaattc      900

agtaagcaca gtggccacct acaaaggcag aagagagact gggtcatccc tccaatcaac      960

ttgccagaaa actccagggg accttttcct caagagcttg tcaggatcag gtctgataga     1020

gataaaaacc tttcactgcg gtacagtgta actgggccag gagctgacca gcctccaact     1080

ggtatcttca ttatcaaccc catctcgggt cagctgtcgg tgacaaagcc cctggatcgc     1140

gagcagatag cccggtttca tttgagggca catgcagtag atattaatgg aaatcaagtg     1200

gagaacccca ttgacattgt catcaatgtt attgacatga atgacaacag acctgagttc     1260

ttacaccagg tttggaatgg gacagttcct gagggatcaa agcctggaac atatgtgatg     1320

accgtaacag caattgatgc tgacgatccc aatgccctca atgggatgtt gaggtacaga     1380

atcgtgtctc aggctccaag caccccttca cccaacatgt ttacaatcaa caatgagact     1440

ggtgacatca tcacagtggc agctggactt gatcgagaaa aagtgcaaca gtatacgtta     1500

ataattcaag ctacagacat ggaaggcaat cccacatatg gcctttcaaa cacagccacg     1560

gccgtcatca cagtgacaga tgtcaatgac aatcctccag agtttactgc catgacgttt     1620

tatggtgaag ttcctgagaa cagggtagac atcatagtag ctaatctaac tgtgaccgat     1680

aaggatcaac cccatacacc agcctggaac gcagtgtaca gaatcagtgg cggagatcct     1740

actggacggt tcgccatcca gaccgaccca aacagcaacg acgggttagt caccgtggtc     1800

aaaccaatcg actttgaaac aaataggatg tttgtcctta ctgttgctgc agaaaatcaa     1860

gtgccattag ccaagggaat tcagcacccg cctcagtcaa ctgcaaccgt gtctgttaca     1920

gttattgacg taaatgaaaa cccttatttt gcccccaatc ctaagatcat tcgccaagaa     1980

gaagggcttc atgccggtac catgttgaca acattcactg ctcaggaccc agatcgatat     2040

atgcagcaaa atattagata cactaaatta tctgatcctg ccaattggct aaaaatagat     2100

cctgtgaatg gacaaataac tacaattgct gttttggacc gagaatcacc aaatgtgaaa     2160

aacaatatat ataatgctac tttccttgct tctgacaatg gaattcctcc tatgagtgga     2220

acaggaacgc tgcagatcta tttacttgat attaatgaca atgcccctca agtgttacct     2280

caagaggcag agacttgcga aactccagac cccaattcaa ttaatattac agcacttgat     2340

tatgacattg atccaaatgc tggaccattt gcttttgatc ttcctttatc tccagtgact     2400

attaagagaa attggaccat cactcggctt aatggtgatt ttgctcagct taatttaaag     2460

ataaaatttc ttgaagctgg tatctatgaa gttcccatca taatcacaga ttcgggtaat     2520

cctcccaaat caaatatttc catcctgcgc gtgaaggttt gccagtgtga ctccaacggg     2580

gactgcacag atgtggacag gattgtgggt gcggggcttg gcaccggtgc catcattgcc     2640

atcctgctct gcatcatcat cctgcttatc cttgtgctga tgtttgtggt atggatgaaa     2700

cgccgggata aagaacgcca ggccaaacaa cttttaattg atccagaaga tgatgtaaga     2760

gataatattt taaaatatga tgaagaaggt ggaggagaag aagaccagga ctatgacttg     2820

agccagctgc agcagcctga cactgtggag cctgatgcca tcaagcctgt gggaatccga     2880

cgaatggatg aaagacccat ccacgccgag ccccagtatc cggtccgatc tgcagcccca     2940

caccctggag acattgggga cttcattaat gagggcctta aagcggctga caatgacccc     3000

acagctccac catatgactc cctgttagtg tttgactatg aaggcagtgg ctccactgct     3060

gggtccttga gctcccttaa ttcctcaagt agtggtggtg agcaggacta tgattacctg     3120

aacgactggg ggccacggtt caagaaactt gctgacatgt atggtggagg tgatgactga     3180

acttcagggt gaacttggtt tttggacaag tacaaacaat ttcaactgat attcccaaaa     3240

agcattcaga agctaggctt taactttgta gtctactagc acagtgcttg ctggaggctt     3300

tggcataggc tgcaaaccaa tttgggctca gagggaatat cagtgatcca tactgtttgg     3360

aaaaacactg agctcagtta cacttgaatt ttacagtaca gaagcactgg gattttatgt     3420

gcctttttgt acctttttca gattggaatt agttttctgt ttaaggcttt aatggtactg     3480

atttctgaaa cgataagtaa aagacaaaat attttgtggt gggagcagta agttaaacca     3540

tgatatgctt caacacgctt ttgttacatt gcatttgctt ttattaaaat acaaaattaa     3600

acaaacaaaa aaactcatgg agcgatttta ttatcttggg ggatgagacc atgagattgg     3660

aaaatgtaca ttacttctag ttttagactt tagtttgttt tttttttttt cactaaaatc     3720

ttaaaactta ctcagctggt tgcaaataaa gggagttttc atatcaccaa tttgtagcaa     3780

aattgaattt tttcataaac tagaatgtta gacacatttt ggtcttaatc catgtacact     3840

tttttatttc tgtatttttc cacttcactg taaaaatagt atgtgtacat aatgttttat     3900

tggcatagtc tatggagaag tgcagaaact tcagaacatg tgtatgtatt atttggacta     3960

tggattcagg ttttttgcat gtttatatct ttcgttatgg ataaagtatt tacaaaacag     4020

tgacatttga ttcaattgtt gagctgtagt tagaatactc aatttttaat ttttttaatt     4080

tttttatttt ttattttctt tttggtttgg ggagggagaa aagttcttag cacaaatgtt     4140

ttacataatt tgtaccaaaa aaaaaaaaaa aggaaaggaa agaaaggggt ggcctgacac     4200

tggtggcact actaagtgtg tgttttttta aaaaaaaaat ggaaaaaaaa aagcttttaa     4260

actggagaga cttctgacaa cagctttgcc tctgtattgt gtaccagaat ataaatgata     4320

cacctctgac cccagcgttc tgaataaaat gctaattttg gatctggaaa aaaaaaaaaa     4380


<210>  247
<211>  906
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens cadherin 2, type 1, N-cadherin (neuronal) 
       (CDH2), polypeptide NCBI Reference Sequence: NM_001792.3

<400>  247

Met Cys Arg Ile Ala Gly Ala Leu Arg Thr Leu Leu Pro Leu Leu Ala 
1               5                   10                  15      


Ala Leu Leu Gln Ala Ser Val Glu Ala Ser Gly Glu Ile Ala Leu Cys 
            20                  25                  30          


Lys Thr Gly Phe Pro Glu Asp Val Tyr Ser Ala Val Leu Ser Lys Asp 
        35                  40                  45              


Val His Glu Gly Gln Pro Leu Leu Asn Val Lys Phe Ser Asn Cys Asn 
    50                  55                  60                  


Gly Lys Arg Lys Val Gln Tyr Glu Ser Ser Glu Pro Ala Asp Phe Lys 
65                  70                  75                  80  


Val Asp Glu Asp Gly Met Val Tyr Ala Val Arg Ser Phe Pro Leu Ser 
                85                  90                  95      


Ser Glu His Ala Lys Phe Leu Ile Tyr Ala Gln Asp Lys Glu Thr Gln 
            100                 105                 110         


Glu Lys Trp Gln Val Ala Val Lys Leu Ser Leu Lys Pro Thr Leu Thr 
        115                 120                 125             


Glu Glu Ser Val Lys Glu Ser Ala Glu Val Glu Glu Ile Val Phe Pro 
    130                 135                 140                 


Arg Gln Phe Ser Lys His Ser Gly His Leu Gln Arg Gln Lys Arg Asp 
145                 150                 155                 160 


Trp Val Ile Pro Pro Ile Asn Leu Pro Glu Asn Ser Arg Gly Pro Phe 
                165                 170                 175     


Pro Gln Glu Leu Val Arg Ile Arg Ser Asp Arg Asp Lys Asn Leu Ser 
            180                 185                 190         


Leu Arg Tyr Ser Val Thr Gly Pro Gly Ala Asp Gln Pro Pro Thr Gly 
        195                 200                 205             


Ile Phe Ile Ile Asn Pro Ile Ser Gly Gln Leu Ser Val Thr Lys Pro 
    210                 215                 220                 


Leu Asp Arg Glu Gln Ile Ala Arg Phe His Leu Arg Ala His Ala Val 
225                 230                 235                 240 


Asp Ile Asn Gly Asn Gln Val Glu Asn Pro Ile Asp Ile Val Ile Asn 
                245                 250                 255     


Val Ile Asp Met Asn Asp Asn Arg Pro Glu Phe Leu His Gln Val Trp 
            260                 265                 270         


Asn Gly Thr Val Pro Glu Gly Ser Lys Pro Gly Thr Tyr Val Met Thr 
        275                 280                 285             


Val Thr Ala Ile Asp Ala Asp Asp Pro Asn Ala Leu Asn Gly Met Leu 
    290                 295                 300                 


Arg Tyr Arg Ile Val Ser Gln Ala Pro Ser Thr Pro Ser Pro Asn Met 
305                 310                 315                 320 


Phe Thr Ile Asn Asn Glu Thr Gly Asp Ile Ile Thr Val Ala Ala Gly 
                325                 330                 335     


Leu Asp Arg Glu Lys Val Gln Gln Tyr Thr Leu Ile Ile Gln Ala Thr 
            340                 345                 350         


Asp Met Glu Gly Asn Pro Thr Tyr Gly Leu Ser Asn Thr Ala Thr Ala 
        355                 360                 365             


Val Ile Thr Val Thr Asp Val Asn Asp Asn Pro Pro Glu Phe Thr Ala 
    370                 375                 380                 


Met Thr Phe Tyr Gly Glu Val Pro Glu Asn Arg Val Asp Ile Ile Val 
385                 390                 395                 400 


Ala Asn Leu Thr Val Thr Asp Lys Asp Gln Pro His Thr Pro Ala Trp 
                405                 410                 415     


Asn Ala Val Tyr Arg Ile Ser Gly Gly Asp Pro Thr Gly Arg Phe Ala 
            420                 425                 430         


Ile Gln Thr Asp Pro Asn Ser Asn Asp Gly Leu Val Thr Val Val Lys 
        435                 440                 445             


Pro Ile Asp Phe Glu Thr Asn Arg Met Phe Val Leu Thr Val Ala Ala 
    450                 455                 460                 


Glu Asn Gln Val Pro Leu Ala Lys Gly Ile Gln His Pro Pro Gln Ser 
465                 470                 475                 480 


Thr Ala Thr Val Ser Val Thr Val Ile Asp Val Asn Glu Asn Pro Tyr 
                485                 490                 495     


Phe Ala Pro Asn Pro Lys Ile Ile Arg Gln Glu Glu Gly Leu His Ala 
            500                 505                 510         


Gly Thr Met Leu Thr Thr Phe Thr Ala Gln Asp Pro Asp Arg Tyr Met 
        515                 520                 525             


Gln Gln Asn Ile Arg Tyr Thr Lys Leu Ser Asp Pro Ala Asn Trp Leu 
    530                 535                 540                 


Lys Ile Asp Pro Val Asn Gly Gln Ile Thr Thr Ile Ala Val Leu Asp 
545                 550                 555                 560 


Arg Glu Ser Pro Asn Val Lys Asn Asn Ile Tyr Asn Ala Thr Phe Leu 
                565                 570                 575     


Ala Ser Asp Asn Gly Ile Pro Pro Met Ser Gly Thr Gly Thr Leu Gln 
            580                 585                 590         


Ile Tyr Leu Leu Asp Ile Asn Asp Asn Ala Pro Gln Val Leu Pro Gln 
        595                 600                 605             


Glu Ala Glu Thr Cys Glu Thr Pro Asp Pro Asn Ser Ile Asn Ile Thr 
    610                 615                 620                 


Ala Leu Asp Tyr Asp Ile Asp Pro Asn Ala Gly Pro Phe Ala Phe Asp 
625                 630                 635                 640 


Leu Pro Leu Ser Pro Val Thr Ile Lys Arg Asn Trp Thr Ile Thr Arg 
                645                 650                 655     


Leu Asn Gly Asp Phe Ala Gln Leu Asn Leu Lys Ile Lys Phe Leu Glu 
            660                 665                 670         


Ala Gly Ile Tyr Glu Val Pro Ile Ile Ile Thr Asp Ser Gly Asn Pro 
        675                 680                 685             


Pro Lys Ser Asn Ile Ser Ile Leu Arg Val Lys Val Cys Gln Cys Asp 
    690                 695                 700                 


Ser Asn Gly Asp Cys Thr Asp Val Asp Arg Ile Val Gly Ala Gly Leu 
705                 710                 715                 720 


Gly Thr Gly Ala Ile Ile Ala Ile Leu Leu Cys Ile Ile Ile Leu Leu 
                725                 730                 735     


Ile Leu Val Leu Met Phe Val Val Trp Met Lys Arg Arg Asp Lys Glu 
            740                 745                 750         


Arg Gln Ala Lys Gln Leu Leu Ile Asp Pro Glu Asp Asp Val Arg Asp 
        755                 760                 765             


Asn Ile Leu Lys Tyr Asp Glu Glu Gly Gly Gly Glu Glu Asp Gln Asp 
    770                 775                 780                 


Tyr Asp Leu Ser Gln Leu Gln Gln Pro Asp Thr Val Glu Pro Asp Ala 
785                 790                 795                 800 


Ile Lys Pro Val Gly Ile Arg Arg Met Asp Glu Arg Pro Ile His Ala 
                805                 810                 815     


Glu Pro Gln Tyr Pro Val Arg Ser Ala Ala Pro His Pro Gly Asp Ile 
            820                 825                 830         


Gly Asp Phe Ile Asn Glu Gly Leu Lys Ala Ala Asp Asn Asp Pro Thr 
        835                 840                 845             


Ala Pro Pro Tyr Asp Ser Leu Leu Val Phe Asp Tyr Glu Gly Ser Gly 
    850                 855                 860                 


Ser Thr Ala Gly Ser Leu Ser Ser Leu Asn Ser Ser Ser Ser Gly Gly 
865                 870                 875                 880 


Glu Gln Asp Tyr Asp Tyr Leu Asn Asp Trp Gly Pro Arg Phe Lys Lys 
                885                 890                 895     


Leu Ala Asp Met Tyr Gly Gly Gly Asp Asp 
            900                 905     


<210>  248
<211>  3435
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human p62 mRNA transcript variant 1 GenBank Accession No.: 
       NM_153719.3  GI:301069406

<400>  248
gtactacttc tgcgcctgcg cgaccgtgat tccccgctcg cgactcccca ccccccaggg       60

ctccctaaag agggccacga gctgcgaaag ggcgggaaag gcagttggag aagaggtaag      120

cggttactca ctccatggct gcagcaagga gaggcggcgg cggcctcggc tgaagaaaga      180

aggtgggagc ggagagcgca ggcgtgccga ggtggatgtc cgtcttttct ctgttgcaga      240

aacccacctt gtcccatcca catcaggaca tcccagctgg agttcaacct tcatcccttc      300

tgtggcagtt aggagactga atcaaggtcc agagaaggtg gaggaatcct gatactgagc      360

gaaatcttcc caaggctgca gacaccgacg gatttgcttt gggagccaga gtagctgccg      420

ccaccagagt ccggagccat gagcgggttt aattttggag gcactggggc ccctacaggc      480

gggttcacgt ttggcactgc aaagacggca acaaccacac ctgctacagg gttttctttc      540

tccacctctg gcactggagg gtttaatttt ggggctccct tccaaccagc cacaagtacc      600

ccttccaccg gcctgttctc acttgccacc cagactccgg ccacacagac gacaggcttc      660

acttttggaa cagcgactct tgcttcgggg ggaactggat tttctttggg gatcggtgct      720

tcaaagctca acttgagcaa cacagctgcc accccagcca tggcaaaccc cagcggcttt      780

gggctgggca gcagcaacct cactaatgcc atatcgagca ccgtcacctc cagccagggc      840

acagcaccca ccggctttgt gtttggcccc tccaccacct ctgtggctcc agctaccaca      900

tctggaggct tctcattcac tggtggaagc acggcccaac cctccggttt caacattggc      960

tcagcaggga attcagccca gcccacggca cctgccacgt tgcccttcac tccggccacg     1020

ccagcagcca ccacagcagg tgccacacag ccagctgctc ccacacccac agccaccatc     1080

accagcactg ggcccagcct ctttgcgtca atagcaactg ctccaacctc atctgccacc     1140

actggactct ccctctgtac ccctgtgacc acagcgggcg cccccactgc tgggacacag     1200

ggcttcagct taaaggcacc tggagcagct tccggcacct ccacaacaac atccaccgct     1260

gccaccgcca ccgccaccac caccagcagc agcagcacca ccggctttgc cttgaattta     1320

aaaccactgg cgccagccgg gatccccagc aatacagcag ctgccgtgac cgctccacct     1380

ggccctggcg cagctgcagg ggcggctgcc agctccgcca tgacctacgc gcagctggag     1440

agcctgatca acaaatggag cctggagcta gaggaccagg agcggcactt cctccagcag     1500

gccacccagg tcaacgcctg ggaccgcacg ctgatcgaga atggagaaaa gatcaccagc     1560

ctgcaccgcg aggtggagaa ggtgaagctg gaccagaaga ggctggacca ggagctcgac     1620

ttcatcctgt cccagcagaa ggagctggaa gacctgctga gcccactgga ggagttggtc     1680

aaggagcaga gcgggaccat ctacctgcag cacgcggatg aggagcgtga gaaaacctac     1740

aagctggctg agaacatcga tgcacagctc aagcgcatgg cccaggatct caaggacatc     1800

atcgagcacc tgaacacgtc cggggccccc gccgacacca gtgacccact gcagcagatc     1860

tgcaagatcc tcaatgcgca catggactca ctgcagtgga tcgaccagaa ctcggccctg     1920

ctgcagagga aggtggagga ggtgaccaag gtgtgcgagg gccggcgcaa ggagcaggag     1980

cgcagcttcc ggatcacctt tgactgagcg acagcagccc tggggcccgc aggtccctag     2040

ggagttcatg aggggaatgc gccctgttgt ctgtagtttg gggttgtggc aagatacttg     2100

tttgtttgtt tctttctttc acatgactgc ccttgacatg atcgctgtgt gctttgcgtt     2160

tttccattta ggagggtatt ctgggccttc tgcccaggca gcagcctcat gggtgtggct     2220

tctgtggctt tcatttgagt atctttggcc ccttttcacc tactgcgacc acccacctca     2280

tcctggctca gcctggtgat ggagaagtgc tgatggtctt ggtcccagcc agggtcgtgg     2340

gggcagccac tctctccaaa gcatagtcat aggtgtcatg aaaaaatacc aaatgtaaga     2400

gaacctccaa gtcagggcgc agtggctcac ccctgtaatc tcagcacttt gggtggccaa     2460

ggcgggcaga tgacttgagg tcaggagttc gagaccagcc tggccaacat ggtgaaaccc     2520

cgtctctact aaaaatacaa aaattagtca ggtgtggtgg acgcctgtga tctcaatctc     2580

agctactcgg gaggctgagg caggagaatc acttgaaccc aggaggtgtt gcagtgaacc     2640

aagatcacac cactgcactc cagcctaggc aacagagact ctgtctcaaa aaaaaaaaaa     2700

aaaaaaaaga aactcccagg agacagcagc ctagttttcg agtgtgagct tgtgcttgtg     2760

aaagctaacc atgctaacca ccaaggcaaa gcagcacagt gtgaatagaa cagagcggga     2820

tcaagaattt cacagaagac aggtcagctg aggggcctgc acacacaggg tgttgaggaa     2880

ccacagatgg gcgccgagag gcctgccttt tgcctggccc aggctcaccc ccaccttggg     2940

cctcacctcc tccaggaagc cttcccagct acccgaagct caggtggcct tcttgcaggt     3000

ccccgtagca ccctgagcct gtaccttggg tggcacttgt tatgctatcc tgtgctagcc     3060

gtttgtgcct cgtctcgctg ttagattgtg agttcccatg ggcagagacc cactgtcgtt     3120

ccccgtgtgt ccccagcccg gtccctgtca catttgttaa atgaaagaac aatgaagccc     3180

agtgtaacgt cagtccacag aaatagccac agcttccagt ggtggccgta gacttggctc     3240

ggaacttagt ggcaccagag taactctagt cagttacagt aaaatccact gtgtgtggaa     3300

ggcagaagct agcggttgta tcccaagcat cttttgtatt tgtctttata ctttgctgaa     3360

ttctctgaaa tacctattac tgtatgttgc ttttctaaat aaatgtattg tgaaaccaaa     3420

aaaaaaaaaa aaaaa                                                      3435


<210>  249
<211>  522
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human p62 polypeptide encoded by mRNA transcript variant 1 
       GenBank Accession No.: NP_714941.1  GI:24497609

<400>  249

Met Ser Gly Phe Asn Phe Gly Gly Thr Gly Ala Pro Thr Gly Gly Phe 
1               5                   10                  15      


Thr Phe Gly Thr Ala Lys Thr Ala Thr Thr Thr Pro Ala Thr Gly Phe 
            20                  25                  30          


Ser Phe Ser Thr Ser Gly Thr Gly Gly Phe Asn Phe Gly Ala Pro Phe 
        35                  40                  45              


Gln Pro Ala Thr Ser Thr Pro Ser Thr Gly Leu Phe Ser Leu Ala Thr 
    50                  55                  60                  


Gln Thr Pro Ala Thr Gln Thr Thr Gly Phe Thr Phe Gly Thr Ala Thr 
65                  70                  75                  80  


Leu Ala Ser Gly Gly Thr Gly Phe Ser Leu Gly Ile Gly Ala Ser Lys 
                85                  90                  95      


Leu Asn Leu Ser Asn Thr Ala Ala Thr Pro Ala Met Ala Asn Pro Ser 
            100                 105                 110         


Gly Phe Gly Leu Gly Ser Ser Asn Leu Thr Asn Ala Ile Ser Ser Thr 
        115                 120                 125             


Val Thr Ser Ser Gln Gly Thr Ala Pro Thr Gly Phe Val Phe Gly Pro 
    130                 135                 140                 


Ser Thr Thr Ser Val Ala Pro Ala Thr Thr Ser Gly Gly Phe Ser Phe 
145                 150                 155                 160 


Thr Gly Gly Ser Thr Ala Gln Pro Ser Gly Phe Asn Ile Gly Ser Ala 
                165                 170                 175     


Gly Asn Ser Ala Gln Pro Thr Ala Pro Ala Thr Leu Pro Phe Thr Pro 
            180                 185                 190         


Ala Thr Pro Ala Ala Thr Thr Ala Gly Ala Thr Gln Pro Ala Ala Pro 
        195                 200                 205             


Thr Pro Thr Ala Thr Ile Thr Ser Thr Gly Pro Ser Leu Phe Ala Ser 
    210                 215                 220                 


Ile Ala Thr Ala Pro Thr Ser Ser Ala Thr Thr Gly Leu Ser Leu Cys 
225                 230                 235                 240 


Thr Pro Val Thr Thr Ala Gly Ala Pro Thr Ala Gly Thr Gln Gly Phe 
                245                 250                 255     


Ser Leu Lys Ala Pro Gly Ala Ala Ser Gly Thr Ser Thr Thr Thr Ser 
            260                 265                 270         


Thr Ala Ala Thr Ala Thr Ala Thr Thr Thr Ser Ser Ser Ser Thr Thr 
        275                 280                 285             


Gly Phe Ala Leu Asn Leu Lys Pro Leu Ala Pro Ala Gly Ile Pro Ser 
    290                 295                 300                 


Asn Thr Ala Ala Ala Val Thr Ala Pro Pro Gly Pro Gly Ala Ala Ala 
305                 310                 315                 320 


Gly Ala Ala Ala Ser Ser Ala Met Thr Tyr Ala Gln Leu Glu Ser Leu 
                325                 330                 335     


Ile Asn Lys Trp Ser Leu Glu Leu Glu Asp Gln Glu Arg His Phe Leu 
            340                 345                 350         


Gln Gln Ala Thr Gln Val Asn Ala Trp Asp Arg Thr Leu Ile Glu Asn 
        355                 360                 365             


Gly Glu Lys Ile Thr Ser Leu His Arg Glu Val Glu Lys Val Lys Leu 
    370                 375                 380                 


Asp Gln Lys Arg Leu Asp Gln Glu Leu Asp Phe Ile Leu Ser Gln Gln 
385                 390                 395                 400 


Lys Glu Leu Glu Asp Leu Leu Ser Pro Leu Glu Glu Leu Val Lys Glu 
                405                 410                 415     


Gln Ser Gly Thr Ile Tyr Leu Gln His Ala Asp Glu Glu Arg Glu Lys 
            420                 425                 430         


Thr Tyr Lys Leu Ala Glu Asn Ile Asp Ala Gln Leu Lys Arg Met Ala 
        435                 440                 445             


Gln Asp Leu Lys Asp Ile Ile Glu His Leu Asn Thr Ser Gly Ala Pro 
    450                 455                 460                 


Ala Asp Thr Ser Asp Pro Leu Gln Gln Ile Cys Lys Ile Leu Asn Ala 
465                 470                 475                 480 


His Met Asp Ser Leu Gln Trp Ile Asp Gln Asn Ser Ala Leu Leu Gln 
                485                 490                 495     


Arg Lys Val Glu Glu Val Thr Lys Val Cys Glu Gly Arg Arg Lys Glu 
            500                 505                 510         


Gln Glu Arg Ser Phe Arg Ile Thr Phe Asp 
        515                 520         


<210>  250
<211>  2151
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human vimentin (VIM) mRNA GenBank Accession No.: NM_003380.3  
       GI:240849334

<400>  250
gcctctccaa aggctgcaga agtttcttgc taacaaaaag tccgcacatt cgagcaaaga       60

caggctttag cgagttatta aaaacttagg ggcgctcttg tcccccacag ggcccgaccg      120

cacacagcaa ggcgatggcc cagctgtaag ttggtagcac tgagaactag cagcgcgcgc      180

ggagcccgct gagacttgaa tcaatctggt ctaacggttt cccctaaacc gctaggagcc      240

ctcaatcggc gggacagcag ggcgcgtcct ctgccactct cgctccgagg tccccgcgcc      300

agagacgcag ccgcgctccc accacccaca cccaccgcgc cctcgttcgc ctcttctccg      360

ggagccagtc cgcgccaccg ccgccgccca ggccatcgcc accctccgca gccatgtcca      420

ccaggtccgt gtcctcgtcc tcctaccgca ggatgttcgg cggcccgggc accgcgagcc      480

ggccgagctc cagccggagc tacgtgacta cgtccacccg cacctacagc ctgggcagcg      540

cgctgcgccc cagcaccagc cgcagcctct acgcctcgtc cccgggcggc gtgtatgcca      600

cgcgctcctc tgccgtgcgc ctgcggagca gcgtgcccgg ggtgcggctc ctgcaggact      660

cggtggactt ctcgctggcc gacgccatca acaccgagtt caagaacacc cgcaccaacg      720

agaaggtgga gctgcaggag ctgaatgacc gcttcgccaa ctacatcgac aaggtgcgct      780

tcctggagca gcagaataag atcctgctgg ccgagctcga gcagctcaag ggccaaggca      840

agtcgcgcct gggggacctc tacgaggagg agatgcggga gctgcgccgg caggtggacc      900

agctaaccaa cgacaaagcc cgcgtcgagg tggagcgcga caacctggcc gaggacatca      960

tgcgcctccg ggagaaattg caggaggaga tgcttcagag agaggaagcc gaaaacaccc     1020

tgcaatcttt cagacaggat gttgacaatg cgtctctggc acgtcttgac cttgaacgca     1080

aagtggaatc tttgcaagaa gagattgcct ttttgaagaa actccacgaa gaggaaatcc     1140

aggagctgca ggctcagatt caggaacagc atgtccaaat cgatgtggat gtttccaagc     1200

ctgacctcac ggctgccctg cgtgacgtac gtcagcaata tgaaagtgtg gctgccaaga     1260

acctgcagga ggcagaagaa tggtacaaat ccaagtttgc tgacctctct gaggctgcca     1320

accggaacaa tgacgccctg cgccaggcaa agcaggagtc cactgagtac cggagacagg     1380

tgcagtccct cacctgtgaa gtggatgccc ttaaaggaac caatgagtcc ctggaacgcc     1440

agatgcgtga aatggaagag aactttgccg ttgaagctgc taactaccaa gacactattg     1500

gccgcctgca ggatgagatt cagaatatga aggaggaaat ggctcgtcac cttcgtgaat     1560

accaagacct gctcaatgtt aagatggccc ttgacattga gattgccacc tacaggaagc     1620

tgctggaagg cgaggagagc aggatttctc tgcctcttcc aaacttttcc tccctgaacc     1680

tgagggaaac taatctggat tcactccctc tggttgatac ccactcaaaa aggacacttc     1740

tgattaagac ggttgaaact agagatggac aggttatcaa cgaaacttct cagcatcacg     1800

atgaccttga ataaaaattg cacacactca gtgcagcaat atattaccag caagaataaa     1860

aaagaaatcc atatcttaaa gaaacagctt tcaagtgcct ttctgcagtt tttcaggagc     1920

gcaagataga tttggaatag gaataagctc tagttcttaa caaccgacac tcctacaaga     1980

tttagaaaaa agtttacaac ataatctagt ttacagaaaa atcttgtgct agaatacttt     2040

ttaaaaggta ttttgaatac cattaaaact gctttttttt ttccagcaag tatccaacca     2100

acttggttct gcttcaataa atctttggaa aaactcaaaa aaaaaaaaaa a              2151


<210>  251
<211>  466
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human vimentin (VIM) polypeptide GenBank Accession No.: 
       NP_003371.2  GI:62414289

<400>  251

Met Ser Thr Arg Ser Val Ser Ser Ser Ser Tyr Arg Arg Met Phe Gly 
1               5                   10                  15      


Gly Pro Gly Thr Ala Ser Arg Pro Ser Ser Ser Arg Ser Tyr Val Thr 
            20                  25                  30          


Thr Ser Thr Arg Thr Tyr Ser Leu Gly Ser Ala Leu Arg Pro Ser Thr 
        35                  40                  45              


Ser Arg Ser Leu Tyr Ala Ser Ser Pro Gly Gly Val Tyr Ala Thr Arg 
    50                  55                  60                  


Ser Ser Ala Val Arg Leu Arg Ser Ser Val Pro Gly Val Arg Leu Leu 
65                  70                  75                  80  


Gln Asp Ser Val Asp Phe Ser Leu Ala Asp Ala Ile Asn Thr Glu Phe 
                85                  90                  95      


Lys Asn Thr Arg Thr Asn Glu Lys Val Glu Leu Gln Glu Leu Asn Asp 
            100                 105                 110         


Arg Phe Ala Asn Tyr Ile Asp Lys Val Arg Phe Leu Glu Gln Gln Asn 
        115                 120                 125             


Lys Ile Leu Leu Ala Glu Leu Glu Gln Leu Lys Gly Gln Gly Lys Ser 
    130                 135                 140                 


Arg Leu Gly Asp Leu Tyr Glu Glu Glu Met Arg Glu Leu Arg Arg Gln 
145                 150                 155                 160 


Val Asp Gln Leu Thr Asn Asp Lys Ala Arg Val Glu Val Glu Arg Asp 
                165                 170                 175     


Asn Leu Ala Glu Asp Ile Met Arg Leu Arg Glu Lys Leu Gln Glu Glu 
            180                 185                 190         


Met Leu Gln Arg Glu Glu Ala Glu Asn Thr Leu Gln Ser Phe Arg Gln 
        195                 200                 205             


Asp Val Asp Asn Ala Ser Leu Ala Arg Leu Asp Leu Glu Arg Lys Val 
    210                 215                 220                 


Glu Ser Leu Gln Glu Glu Ile Ala Phe Leu Lys Lys Leu His Glu Glu 
225                 230                 235                 240 


Glu Ile Gln Glu Leu Gln Ala Gln Ile Gln Glu Gln His Val Gln Ile 
                245                 250                 255     


Asp Val Asp Val Ser Lys Pro Asp Leu Thr Ala Ala Leu Arg Asp Val 
            260                 265                 270         


Arg Gln Gln Tyr Glu Ser Val Ala Ala Lys Asn Leu Gln Glu Ala Glu 
        275                 280                 285             


Glu Trp Tyr Lys Ser Lys Phe Ala Asp Leu Ser Glu Ala Ala Asn Arg 
    290                 295                 300                 


Asn Asn Asp Ala Leu Arg Gln Ala Lys Gln Glu Ser Thr Glu Tyr Arg 
305                 310                 315                 320 


Arg Gln Val Gln Ser Leu Thr Cys Glu Val Asp Ala Leu Lys Gly Thr 
                325                 330                 335     


Asn Glu Ser Leu Glu Arg Gln Met Arg Glu Met Glu Glu Asn Phe Ala 
            340                 345                 350         


Val Glu Ala Ala Asn Tyr Gln Asp Thr Ile Gly Arg Leu Gln Asp Glu 
        355                 360                 365             


Ile Gln Asn Met Lys Glu Glu Met Ala Arg His Leu Arg Glu Tyr Gln 
    370                 375                 380                 


Asp Leu Leu Asn Val Lys Met Ala Leu Asp Ile Glu Ile Ala Thr Tyr 
385                 390                 395                 400 


Arg Lys Leu Leu Glu Gly Glu Glu Ser Arg Ile Ser Leu Pro Leu Pro 
                405                 410                 415     


Asn Phe Ser Ser Leu Asn Leu Arg Glu Thr Asn Leu Asp Ser Leu Pro 
            420                 425                 430         


Leu Val Asp Thr His Ser Lys Arg Thr Leu Leu Ile Lys Thr Val Glu 
        435                 440                 445             


Thr Arg Asp Gly Gln Val Ile Asn Glu Thr Ser Gln His His Asp Asp 
    450                 455                 460                 


Leu Glu 
465     


<210>  252
<211>  1485
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human cytokeratin-18 (KRT18) mRNA transcript variant 1 GenBank 
       Accession No.: NM_000224.2  GI:40354193

<400>  252
tccggggcgg gggcggggcc tcactctgcg atataactcg ggtcgcgcgg ctcgcgcagg       60

ccgccaccgt cgtccgcaaa gcctgagtcc tgtcctttct ctctccccgg acagcatgag      120

cttcaccact cgctccacct tctccaccaa ctaccggtcc ctgggctctg tccaggcgcc      180

cagctacggc gcccggccgg tcagcagcgc ggccagcgtc tatgcaggcg ctgggggctc      240

tggttcccgg atctccgtgt cccgctccac cagcttcagg ggcggcatgg ggtccggggg      300

cctggccacc gggatagccg ggggtctggc aggaatggga ggcatccaga acgagaagga      360

gaccatgcaa agcctgaacg accgcctggc ctcttacctg gacagagtga ggagcctgga      420

gaccgagaac cggaggctgg agagcaaaat ccgggagcac ttggagaaga agggacccca      480

ggtcagagac tggagccatt acttcaagat catcgaggac ctgagggctc agatcttcgc      540

aaatactgtg gacaatgccc gcatcgttct gcagattgac aatgcccgtc ttgctgctga      600

tgactttaga gtcaagtatg agacagagct ggccatgcgc cagtctgtgg agaacgacat      660

ccatgggctc cgcaaggtca ttgatgacac caatatcaca cgactgcagc tggagacaga      720

gatcgaggct ctcaaggagg agctgctctt catgaagaag aaccacgaag aggaagtaaa      780

aggcctacaa gcccagattg ccagctctgg gttgaccgtg gaggtagatg cccccaaatc      840

tcaggacctc gccaagatca tggcagacat ccgggcccaa tatgacgagc tggctcggaa      900

gaaccgagag gagctagaca agtactggtc tcagcagatt gaggagagca ccacagtggt      960

caccacacag tctgctgagg ttggagctgc tgagacgacg ctcacagagc tgagacgtac     1020

agtccagtcc ttggagatcg acctggactc catgagaaat ctgaaggcca gcttggagaa     1080

cagcctgagg gaggtggagg cccgctacgc cctacagatg gagcagctca acgggatcct     1140

gctgcacctt gagtcagagc tggcacagac ccgggcagag ggacagcgcc aggcccagga     1200

gtatgaggcc ctgctgaaca tcaaggtcaa gctggaggct gagatcgcca cctaccgccg     1260

cctgctggaa gatggcgagg actttaatct tggtgatgcc ttggacagca gcaactccat     1320

gcaaaccatc caaaagacca ccacccgccg gatagtggat ggcaaagtgg tgtctgagac     1380

caatgacacc aaagttctga ggcattaagc cagcagaagc agggtaccct ttggggagca     1440

ggaggccaat aaaaagttca gagttcaaaa aaaaaaaaaa aaaaa                     1485


<210>  253
<211>  430
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human cytokeratin 18 (KRT18) polypeptide encoded by mRNA 
       transcript variant 1 GenBank Accession No.: NP_000215.1  
       GI:4557888

<400>  253

Met Ser Phe Thr Thr Arg Ser Thr Phe Ser Thr Asn Tyr Arg Ser Leu 
1               5                   10                  15      


Gly Ser Val Gln Ala Pro Ser Tyr Gly Ala Arg Pro Val Ser Ser Ala 
            20                  25                  30          


Ala Ser Val Tyr Ala Gly Ala Gly Gly Ser Gly Ser Arg Ile Ser Val 
        35                  40                  45              


Ser Arg Ser Thr Ser Phe Arg Gly Gly Met Gly Ser Gly Gly Leu Ala 
    50                  55                  60                  


Thr Gly Ile Ala Gly Gly Leu Ala Gly Met Gly Gly Ile Gln Asn Glu 
65                  70                  75                  80  


Lys Glu Thr Met Gln Ser Leu Asn Asp Arg Leu Ala Ser Tyr Leu Asp 
                85                  90                  95      


Arg Val Arg Ser Leu Glu Thr Glu Asn Arg Arg Leu Glu Ser Lys Ile 
            100                 105                 110         


Arg Glu His Leu Glu Lys Lys Gly Pro Gln Val Arg Asp Trp Ser His 
        115                 120                 125             


Tyr Phe Lys Ile Ile Glu Asp Leu Arg Ala Gln Ile Phe Ala Asn Thr 
    130                 135                 140                 


Val Asp Asn Ala Arg Ile Val Leu Gln Ile Asp Asn Ala Arg Leu Ala 
145                 150                 155                 160 


Ala Asp Asp Phe Arg Val Lys Tyr Glu Thr Glu Leu Ala Met Arg Gln 
                165                 170                 175     


Ser Val Glu Asn Asp Ile His Gly Leu Arg Lys Val Ile Asp Asp Thr 
            180                 185                 190         


Asn Ile Thr Arg Leu Gln Leu Glu Thr Glu Ile Glu Ala Leu Lys Glu 
        195                 200                 205             


Glu Leu Leu Phe Met Lys Lys Asn His Glu Glu Glu Val Lys Gly Leu 
    210                 215                 220                 


Gln Ala Gln Ile Ala Ser Ser Gly Leu Thr Val Glu Val Asp Ala Pro 
225                 230                 235                 240 


Lys Ser Gln Asp Leu Ala Lys Ile Met Ala Asp Ile Arg Ala Gln Tyr 
                245                 250                 255     


Asp Glu Leu Ala Arg Lys Asn Arg Glu Glu Leu Asp Lys Tyr Trp Ser 
            260                 265                 270         


Gln Gln Ile Glu Glu Ser Thr Thr Val Val Thr Thr Gln Ser Ala Glu 
        275                 280                 285             


Val Gly Ala Ala Glu Thr Thr Leu Thr Glu Leu Arg Arg Thr Val Gln 
    290                 295                 300                 


Ser Leu Glu Ile Asp Leu Asp Ser Met Arg Asn Leu Lys Ala Ser Leu 
305                 310                 315                 320 


Glu Asn Ser Leu Arg Glu Val Glu Ala Arg Tyr Ala Leu Gln Met Glu 
                325                 330                 335     


Gln Leu Asn Gly Ile Leu Leu His Leu Glu Ser Glu Leu Ala Gln Thr 
            340                 345                 350         


Arg Ala Glu Gly Gln Arg Gln Ala Gln Glu Tyr Glu Ala Leu Leu Asn 
        355                 360                 365             


Ile Lys Val Lys Leu Glu Ala Glu Ile Ala Thr Tyr Arg Arg Leu Leu 
    370                 375                 380                 


Glu Asp Gly Glu Asp Phe Asn Leu Gly Asp Ala Leu Asp Ser Ser Asn 
385                 390                 395                 400 


Ser Met Gln Thr Ile Gln Lys Thr Thr Thr Arg Arg Ile Val Asp Gly 
                405                 410                 415     


Lys Val Val Ser Glu Thr Asn Asp Thr Lys Val Leu Arg His 
            420                 425                 430 


<210>  254
<211>  1439
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  human cytokeratin-18 (KRT18) mRNA transcript variant 2 GenBank 
       Accession No.: NM_199187.1  GI:40354194

<400>  254
gcagcctcga gggccaacaa cacctgctgt ccgtgtccat gcccggttgg ccaccccgtt       60

tctgggggca tgagcttcac cactcgctcc accttctcca ccaactaccg gtccctgggc      120

tctgtccagg cgcccagcta cggcgcccgg ccggtcagca gcgcggccag cgtctatgca      180

ggcgctgggg gctctggttc ccggatctcc gtgtcccgct ccaccagctt caggggcggc      240

atggggtccg ggggcctggc caccgggata gccgggggtc tggcaggaat gggaggcatc      300

cagaacgaga aggagaccat gcaaagcctg aacgaccgcc tggcctctta cctggacaga      360

gtgaggagcc tggagaccga gaaccggagg ctggagagca aaatccggga gcacttggag      420

aagaagggac cccaggtcag agactggagc cattacttca agatcatcga ggacctgagg      480

gctcagatct tcgcaaatac tgtggacaat gcccgcatcg ttctgcagat tgacaatgcc      540

cgtcttgctg ctgatgactt tagagtcaag tatgagacag agctggccat gcgccagtct      600

gtggagaacg acatccatgg gctccgcaag gtcattgatg acaccaatat cacacgactg      660

cagctggaga cagagatcga ggctctcaag gaggagctgc tcttcatgaa gaagaaccac      720

gaagaggaag taaaaggcct acaagcccag attgccagct ctgggttgac cgtggaggta      780

gatgccccca aatctcagga cctcgccaag atcatggcag acatccgggc ccaatatgac      840

gagctggctc ggaagaaccg agaggagcta gacaagtact ggtctcagca gattgaggag      900

agcaccacag tggtcaccac acagtctgct gaggttggag ctgctgagac gacgctcaca      960

gagctgagac gtacagtcca gtccttggag atcgacctgg actccatgag aaatctgaag     1020

gccagcttgg agaacagcct gagggaggtg gaggcccgct acgccctaca gatggagcag     1080

ctcaacggga tcctgctgca ccttgagtca gagctggcac agacccgggc agagggacag     1140

cgccaggccc aggagtatga ggccctgctg aacatcaagg tcaagctgga ggctgagatc     1200

gccacctacc gccgcctgct ggaagatggc gaggacttta atcttggtga tgccttggac     1260

agcagcaact ccatgcaaac catccaaaag accaccaccc gccggatagt ggatggcaaa     1320

gtggtgtctg agaccaatga caccaaagtt ctgaggcatt aagccagcag aagcagggta     1380

ccctttgggg agcaggaggc caataaaaag ttcagagttc aaaaaaaaaa aaaaaaaaa      1439


<210>  255
<211>  430
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  human cytokeratin 18 (KRT18) polypeptide encoded by mRNA 
       transcript variant 2 GenBank Accession No.: NP_954657.1 
       GI:40354195

<400>  255

Met Ser Phe Thr Thr Arg Ser Thr Phe Ser Thr Asn Tyr Arg Ser Leu 
1               5                   10                  15      


Gly Ser Val Gln Ala Pro Ser Tyr Gly Ala Arg Pro Val Ser Ser Ala 
            20                  25                  30          


Ala Ser Val Tyr Ala Gly Ala Gly Gly Ser Gly Ser Arg Ile Ser Val 
        35                  40                  45              


Ser Arg Ser Thr Ser Phe Arg Gly Gly Met Gly Ser Gly Gly Leu Ala 
    50                  55                  60                  


Thr Gly Ile Ala Gly Gly Leu Ala Gly Met Gly Gly Ile Gln Asn Glu 
65                  70                  75                  80  


Lys Glu Thr Met Gln Ser Leu Asn Asp Arg Leu Ala Ser Tyr Leu Asp 
                85                  90                  95      


Arg Val Arg Ser Leu Glu Thr Glu Asn Arg Arg Leu Glu Ser Lys Ile 
            100                 105                 110         


Arg Glu His Leu Glu Lys Lys Gly Pro Gln Val Arg Asp Trp Ser His 
        115                 120                 125             


Tyr Phe Lys Ile Ile Glu Asp Leu Arg Ala Gln Ile Phe Ala Asn Thr 
    130                 135                 140                 


Val Asp Asn Ala Arg Ile Val Leu Gln Ile Asp Asn Ala Arg Leu Ala 
145                 150                 155                 160 


Ala Asp Asp Phe Arg Val Lys Tyr Glu Thr Glu Leu Ala Met Arg Gln 
                165                 170                 175     


Ser Val Glu Asn Asp Ile His Gly Leu Arg Lys Val Ile Asp Asp Thr 
            180                 185                 190         


Asn Ile Thr Arg Leu Gln Leu Glu Thr Glu Ile Glu Ala Leu Lys Glu 
        195                 200                 205             


Glu Leu Leu Phe Met Lys Lys Asn His Glu Glu Glu Val Lys Gly Leu 
    210                 215                 220                 


Gln Ala Gln Ile Ala Ser Ser Gly Leu Thr Val Glu Val Asp Ala Pro 
225                 230                 235                 240 


Lys Ser Gln Asp Leu Ala Lys Ile Met Ala Asp Ile Arg Ala Gln Tyr 
                245                 250                 255     


Asp Glu Leu Ala Arg Lys Asn Arg Glu Glu Leu Asp Lys Tyr Trp Ser 
            260                 265                 270         


Gln Gln Ile Glu Glu Ser Thr Thr Val Val Thr Thr Gln Ser Ala Glu 
        275                 280                 285             


Val Gly Ala Ala Glu Thr Thr Leu Thr Glu Leu Arg Arg Thr Val Gln 
    290                 295                 300                 


Ser Leu Glu Ile Asp Leu Asp Ser Met Arg Asn Leu Lys Ala Ser Leu 
305                 310                 315                 320 


Glu Asn Ser Leu Arg Glu Val Glu Ala Arg Tyr Ala Leu Gln Met Glu 
                325                 330                 335     


Gln Leu Asn Gly Ile Leu Leu His Leu Glu Ser Glu Leu Ala Gln Thr 
            340                 345                 350         


Arg Ala Glu Gly Gln Arg Gln Ala Gln Glu Tyr Glu Ala Leu Leu Asn 
        355                 360                 365             


Ile Lys Val Lys Leu Glu Ala Glu Ile Ala Thr Tyr Arg Arg Leu Leu 
    370                 375                 380                 


Glu Asp Gly Glu Asp Phe Asn Leu Gly Asp Ala Leu Asp Ser Ser Asn 
385                 390                 395                 400 


Ser Met Gln Thr Ile Gln Lys Thr Thr Thr Arg Arg Ile Val Asp Gly 
                405                 410                 415     


Lys Val Val Ser Glu Thr Asn Asp Thr Lys Val Leu Arg His 
            420                 425                 430 


<210>  256
<211>  18
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  PI3KCA Exon 9 Fwd. Primer

<400>  256
gaatccagag gggaaaaa                                                     18


<210>  257
<211>  19
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  PI3KCA Exon 9 Rev. Primer

<400>  257
ccattttagc acttacctg                                                    19


<210>  258
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  PI3KCA Exon 12 Fwd. Primer

<400>  258
ttgatgacat tgcatacatt cg                                                22


<210>  259
<211>  19
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  PI3KCA Exon 12 Rev. Primer

<400>  259
acctgtgact ccatagaaa                                                    19


<210>  260
<211>  18
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  K-RAS Fwd. Primer

<400>  260
gcctgctgaa aatgactg                                                     18


<210>  261
<211>  19
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  K-RAS Rev. Primer

<400>  261
gttggatcat attcgtcca                                                    19


<210>  262
<211>  3878
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ITGAE integrin, alpha E (antigen CD103, human mucosal lymphocyte 
       antigen 1; alpha polypeptide) mRNA GenBank Accession No.: 
       NM_002208.4  GI:148728187

<400>  262
ccgcctcctg gcctcctggc tgaggggaag ctgagtgggc cacggcccat gtgtcgcact       60

cgcctcggct cccacacagc cgcctctgct ccagcaagga tgtggctctt ccacactctg      120

ctctgcatag ccagcctggc cctgctggcc gctttcaatg tggatgtggc ccggccctgg      180

ctcacgccca agggaggtgc ccctttcgtg ctcagctccc ttctgcacca agaccccagc      240

accaaccaga cctggctcct ggtcaccagc cccagaacca agaggacacc agggcccctc      300

catcgatgtt cccttgtcca ggatgaaatc ctttgccatc ctgtagagca tgtccccatc      360

cccaagggga ggcaccgggg agtgaccgtt gtccggagcc accacggtgt tttgatatgc      420

attcaagtgc tggtccggcg gcctcacagc ctcagctcag aactcacagg cacctgtagc      480

ctcctgggcc ctgacctccg tccccaggct caggccaact tcttcgacct tgaaaatctc      540

ctggatccag atgcacgtgt ggacactgga gactgctaca gcaacaaaga aggcggtgga      600

gaagacgatg tgaacacagc caggcagcgc cgggctctgg agaaggagga ggaggaagac      660

aaggaggagg aggaagacga ggaggaggag gaagctggca ccgagattgc catcatcctg      720

gatggctcag gaagcattga tcccccagac tttcagagag ccaaagactt catctccaac      780

atgatgagga acttctatga aaagtgtttt gagtgcaact ttgccttggt gcagtatgga      840

ggagtgatcc agactgagtt tgaccttcgg gacagccagg atgtgatggc ctccctcgcc      900

agagtccaga acatcactca agtggggagt gtcaccaaga ctgcctcagc catgcaacac      960

gtcttagaca gcatcttcac ctcaagccac ggctccagga gaaaggcatc caaggtcatg     1020

gtggtgctca ccgatggtgg catattcgag gaccccctca accttacgac agtcatcaac     1080

tcccccaaaa tgcagggtgt tgagcgcttt gccattgggg tgggagaaga atttaagagt     1140

gctaggactg cgagggaact gaacctgatc gcctcagacc cggatgagac ccatgctttc     1200

aaggtgacca actacatggc gctggatggg ctgctgagca aactgcggta caacatcatc     1260

agcatggaag gcacggttgg agacgccctt cactaccagc tggcacagat tggcttcagt     1320

gctcagatcc tggatgagcg gcaggtgctg ctcggcgccg tcggggcctt tgactggtcc     1380

ggaggggcgt tgctctacga cacacgcagc cgccggggcc gcttcctgaa ccagacagcg     1440

gcggcggcgg cagacgcgga ggctgcgcag tacagctacc tgggttacgc tgtggccgtg     1500

ctgcacaaga cctgcagcct ctcctacatc gcgggggctc cacggtacaa acatcatggg     1560

gccgtgtttg agctccagaa ggagggcaga gaggccagct tcctgccagt gctggaggga     1620

gagcagatgg ggtcctattt tggctctgag ctgtgccctg tggacattga catggatgga     1680

agcacggact tcttgctggt ggctgctcca ttttaccacg ttcatggaga agaaggcaga     1740

gtctacgtgt accgtctcag cgagcaggat ggttctttct ccttggcacg catactgagt     1800

gggcaccccg ggttcaccaa tgcccgcttt ggctttgcca tggcggctat gggggatctc     1860

agtcaggata agctcacaga tgtggccatc ggggcccccc tggaaggttt tggggcagat     1920

gatggtgcca gcttcggcag tgtgtatatc tacaatggac actgggacgg cctctccgcc     1980

agcccctcgc agcggatcag agcctccacg gtggccccag gactccagta cttcggcatg     2040

tccatggctg gtggctttga tattagtggc gacggccttg ccgacatcac cgtgggcact     2100

ctgggccagg cggttgtgtt ccgctcccgg cctgtggttc gcctgaaggt ctccatggcc     2160

ttcaccccca gcgcactgcc catcggcttc aacggcgtcg tgaatgtccg tttatgtttt     2220

gaaatcagct ctgtaaccac agcctctgag tcaggcctcc gcgaggcact tctcaacttc     2280

acgctggatg tggatgtggg gaagcagagg agacggctgc agtgttcaga cgtaagaagc     2340

tgtctgggct gcctgaggga gtggagcagc ggatcccagc tttgtgagga cctcctgctc     2400

atgcccacag agggagagct ctgtgaggag gactgcttct ccaatgccag tgtcaaagtc     2460

agctaccagc tccagacccc tgagggacag acggaccatc cccagcccat cctggaccgc     2520

tacactgagc cctttgccat cttccagctg ccctatgaga aggcctgcaa gaataagctg     2580

ttttgtgtcg cagaattaca gttggccacc accgtctctc agcaggagtt ggtggtgggt     2640

ctcacaaagg agctgaccct gaacattaac ctaactaact ccggggaaga ttcctacatg     2700

acaagcatgg ccttgaatta ccccagaaac ctgcagttga agaggatgca aaagcctccc     2760

tctccaaaca ttcagtgtga tgaccctcag ccggttgctt ctgtcctgat catgaactgc     2820

aggattggtc accccgtcct caagaggtca tctgctcatg tttcagtcgt ttggcagcta     2880

gaggagaatg cctttccaaa caggacagca gacatcactg tgactgtcac caattccaat     2940

gaaagacggt ctttggccaa cgagacccac acccttcaat tcaggcatgg cttcgttgca     3000

gttctgtcca aaccatccat aatgtacgtg aacacaggcc aggggctttc tcaccacaaa     3060

gaattcctct tccatgtaca tggggagaac ctctttggag cagaatacca gttgcaaatt     3120

tgcgtcccaa ccaaattacg aggtctccag gttgtagcag tgaagaagct gacgaggact     3180

caggcctcca cggtgtgcac ctggagtcag gagcgcgctt gtgcgtacag ttcggttcag     3240

catgtggaag aatggcattc agtgagctgt gtcatcgctt cagataaaga aaatgtcacc     3300

gtggctgcag agatctcctg ggatcactct gaggagttac taaaagatgt aactgaactg     3360

cagatccttg gtgaaatatc tttcaacaaa tctctatatg agggactgaa tgcagagaac     3420

cacagaacta agatcactgt cgtcttcctg aaagatgaga agtaccattc tttgcctatc     3480

atcattaaag gcagcgttgg tggacttctg gtgttgatcg tgattctggt catcctgttc     3540

aagtgtggct tttttaaaag aaaatatcaa caactgaact tggagagcat caggaaggcc     3600

cagctgaaat cagagaatct gctcgaagaa gagaattagg acctgctatc cactgggaga     3660

ggctatcagc cagtcctggg acttggagac ccagcatcct ttgcattact ttttccttca     3720

ggatgatcta gagcagcatg gagctgttgg tagaatatta gtttttaacc atacattgtc     3780

ccaaaagtgt ctgtgcattg tgcaaaaagt aaacttagga aacatttggt attaaataaa     3840

tttacacttt tctttgcagt aaaaaaaaaa aaaaaaaa                             3878


<210>  263
<211>  1179
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  ITGAE integrin, alpha E (antigen CD103, human mucosal lymphocyte 
       antigen 1; alpha polypeptide) polypeptide GenBank Accession No.:
       NM_002208.4  GI:148728187

<400>  263

Met Trp Leu Phe His Thr Leu Leu Cys Ile Ala Ser Leu Ala Leu Leu 
1               5                   10                  15      


Ala Ala Phe Asn Val Asp Val Ala Arg Pro Trp Leu Thr Pro Lys Gly 
            20                  25                  30          


Gly Ala Pro Phe Val Leu Ser Ser Leu Leu His Gln Asp Pro Ser Thr 
        35                  40                  45              


Asn Gln Thr Trp Leu Leu Val Thr Ser Pro Arg Thr Lys Arg Thr Pro 
    50                  55                  60                  


Gly Pro Leu His Arg Cys Ser Leu Val Gln Asp Glu Ile Leu Cys His 
65                  70                  75                  80  


Pro Val Glu His Val Pro Ile Pro Lys Gly Arg His Arg Gly Val Thr 
                85                  90                  95      


Val Val Arg Ser His His Gly Val Leu Ile Cys Ile Gln Val Leu Val 
            100                 105                 110         


Arg Arg Pro His Ser Leu Ser Ser Glu Leu Thr Gly Thr Cys Ser Leu 
        115                 120                 125             


Leu Gly Pro Asp Leu Arg Pro Gln Ala Gln Ala Asn Phe Phe Asp Leu 
    130                 135                 140                 


Glu Asn Leu Leu Asp Pro Asp Ala Arg Val Asp Thr Gly Asp Cys Tyr 
145                 150                 155                 160 


Ser Asn Lys Glu Gly Gly Gly Glu Asp Asp Val Asn Thr Ala Arg Gln 
                165                 170                 175     


Arg Arg Ala Leu Glu Lys Glu Glu Glu Glu Asp Lys Glu Glu Glu Glu 
            180                 185                 190         


Asp Glu Glu Glu Glu Glu Ala Gly Thr Glu Ile Ala Ile Ile Leu Asp 
        195                 200                 205             


Gly Ser Gly Ser Ile Asp Pro Pro Asp Phe Gln Arg Ala Lys Asp Phe 
    210                 215                 220                 


Ile Ser Asn Met Met Arg Asn Phe Tyr Glu Lys Cys Phe Glu Cys Asn 
225                 230                 235                 240 


Phe Ala Leu Val Gln Tyr Gly Gly Val Ile Gln Thr Glu Phe Asp Leu 
                245                 250                 255     


Arg Asp Ser Gln Asp Val Met Ala Ser Leu Ala Arg Val Gln Asn Ile 
            260                 265                 270         


Thr Gln Val Gly Ser Val Thr Lys Thr Ala Ser Ala Met Gln His Val 
        275                 280                 285             


Leu Asp Ser Ile Phe Thr Ser Ser His Gly Ser Arg Arg Lys Ala Ser 
    290                 295                 300                 


Lys Val Met Val Val Leu Thr Asp Gly Gly Ile Phe Glu Asp Pro Leu 
305                 310                 315                 320 


Asn Leu Thr Thr Val Ile Asn Ser Pro Lys Met Gln Gly Val Glu Arg 
                325                 330                 335     


Phe Ala Ile Gly Val Gly Glu Glu Phe Lys Ser Ala Arg Thr Ala Arg 
            340                 345                 350         


Glu Leu Asn Leu Ile Ala Ser Asp Pro Asp Glu Thr His Ala Phe Lys 
        355                 360                 365             


Val Thr Asn Tyr Met Ala Leu Asp Gly Leu Leu Ser Lys Leu Arg Tyr 
    370                 375                 380                 


Asn Ile Ile Ser Met Glu Gly Thr Val Gly Asp Ala Leu His Tyr Gln 
385                 390                 395                 400 


Leu Ala Gln Ile Gly Phe Ser Ala Gln Ile Leu Asp Glu Arg Gln Val 
                405                 410                 415     


Leu Leu Gly Ala Val Gly Ala Phe Asp Trp Ser Gly Gly Ala Leu Leu 
            420                 425                 430         


Tyr Asp Thr Arg Ser Arg Arg Gly Arg Phe Leu Asn Gln Thr Ala Ala 
        435                 440                 445             


Ala Ala Ala Asp Ala Glu Ala Ala Gln Tyr Ser Tyr Leu Gly Tyr Ala 
    450                 455                 460                 


Val Ala Val Leu His Lys Thr Cys Ser Leu Ser Tyr Ile Ala Gly Ala 
465                 470                 475                 480 


Pro Arg Tyr Lys His His Gly Ala Val Phe Glu Leu Gln Lys Glu Gly 
                485                 490                 495     


Arg Glu Ala Ser Phe Leu Pro Val Leu Glu Gly Glu Gln Met Gly Ser 
            500                 505                 510         


Tyr Phe Gly Ser Glu Leu Cys Pro Val Asp Ile Asp Met Asp Gly Ser 
        515                 520                 525             


Thr Asp Phe Leu Leu Val Ala Ala Pro Phe Tyr His Val His Gly Glu 
    530                 535                 540                 


Glu Gly Arg Val Tyr Val Tyr Arg Leu Ser Glu Gln Asp Gly Ser Phe 
545                 550                 555                 560 


Ser Leu Ala Arg Ile Leu Ser Gly His Pro Gly Phe Thr Asn Ala Arg 
                565                 570                 575     


Phe Gly Phe Ala Met Ala Ala Met Gly Asp Leu Ser Gln Asp Lys Leu 
            580                 585                 590         


Thr Asp Val Ala Ile Gly Ala Pro Leu Glu Gly Phe Gly Ala Asp Asp 
        595                 600                 605             


Gly Ala Ser Phe Gly Ser Val Tyr Ile Tyr Asn Gly His Trp Asp Gly 
    610                 615                 620                 


Leu Ser Ala Ser Pro Ser Gln Arg Ile Arg Ala Ser Thr Val Ala Pro 
625                 630                 635                 640 


Gly Leu Gln Tyr Phe Gly Met Ser Met Ala Gly Gly Phe Asp Ile Ser 
                645                 650                 655     


Gly Asp Gly Leu Ala Asp Ile Thr Val Gly Thr Leu Gly Gln Ala Val 
            660                 665                 670         


Val Phe Arg Ser Arg Pro Val Val Arg Leu Lys Val Ser Met Ala Phe 
        675                 680                 685             


Thr Pro Ser Ala Leu Pro Ile Gly Phe Asn Gly Val Val Asn Val Arg 
    690                 695                 700                 


Leu Cys Phe Glu Ile Ser Ser Val Thr Thr Ala Ser Glu Ser Gly Leu 
705                 710                 715                 720 


Arg Glu Ala Leu Leu Asn Phe Thr Leu Asp Val Asp Val Gly Lys Gln 
                725                 730                 735     


Arg Arg Arg Leu Gln Cys Ser Asp Val Arg Ser Cys Leu Gly Cys Leu 
            740                 745                 750         


Arg Glu Trp Ser Ser Gly Ser Gln Leu Cys Glu Asp Leu Leu Leu Met 
        755                 760                 765             


Pro Thr Glu Gly Glu Leu Cys Glu Glu Asp Cys Phe Ser Asn Ala Ser 
    770                 775                 780                 


Val Lys Val Ser Tyr Gln Leu Gln Thr Pro Glu Gly Gln Thr Asp His 
785                 790                 795                 800 


Pro Gln Pro Ile Leu Asp Arg Tyr Thr Glu Pro Phe Ala Ile Phe Gln 
                805                 810                 815     


Leu Pro Tyr Glu Lys Ala Cys Lys Asn Lys Leu Phe Cys Val Ala Glu 
            820                 825                 830         


Leu Gln Leu Ala Thr Thr Val Ser Gln Gln Glu Leu Val Val Gly Leu 
        835                 840                 845             


Thr Lys Glu Leu Thr Leu Asn Ile Asn Leu Thr Asn Ser Gly Glu Asp 
    850                 855                 860                 


Ser Tyr Met Thr Ser Met Ala Leu Asn Tyr Pro Arg Asn Leu Gln Leu 
865                 870                 875                 880 


Lys Arg Met Gln Lys Pro Pro Ser Pro Asn Ile Gln Cys Asp Asp Pro 
                885                 890                 895     


Gln Pro Val Ala Ser Val Leu Ile Met Asn Cys Arg Ile Gly His Pro 
            900                 905                 910         


Val Leu Lys Arg Ser Ser Ala His Val Ser Val Val Trp Gln Leu Glu 
        915                 920                 925             


Glu Asn Ala Phe Pro Asn Arg Thr Ala Asp Ile Thr Val Thr Val Thr 
    930                 935                 940                 


Asn Ser Asn Glu Arg Arg Ser Leu Ala Asn Glu Thr His Thr Leu Gln 
945                 950                 955                 960 


Phe Arg His Gly Phe Val Ala Val Leu Ser Lys Pro Ser Ile Met Tyr 
                965                 970                 975     


Val Asn Thr Gly Gln Gly Leu Ser His His Lys Glu Phe Leu Phe His 
            980                 985                 990         


Val His Gly Glu Asn Leu Phe Gly  Ala Glu Tyr Gln Leu  Gln Ile Cys 
        995                 1000                 1005             


Val Pro  Thr Lys Leu Arg Gly  Leu Gln Val Val Ala  Val Lys Lys 
    1010                 1015                 1020             


Leu Thr  Arg Thr Gln Ala Ser  Thr Val Cys Thr Trp  Ser Gln Glu 
    1025                 1030                 1035             


Arg Ala  Cys Ala Tyr Ser Ser  Val Gln His Val Glu  Glu Trp His 
    1040                 1045                 1050             


Ser Val  Ser Cys Val Ile Ala  Ser Asp Lys Glu Asn  Val Thr Val 
    1055                 1060                 1065             


Ala Ala  Glu Ile Ser Trp Asp  His Ser Glu Glu Leu  Leu Lys Asp 
    1070                 1075                 1080             


Val Thr  Glu Leu Gln Ile Leu  Gly Glu Ile Ser Phe  Asn Lys Ser 
    1085                 1090                 1095             


Leu Tyr  Glu Gly Leu Asn Ala  Glu Asn His Arg Thr  Lys Ile Thr 
    1100                 1105                 1110             


Val Val  Phe Leu Lys Asp Glu  Lys Tyr His Ser Leu  Pro Ile Ile 
    1115                 1120                 1125             


Ile Lys  Gly Ser Val Gly Gly  Leu Leu Val Leu Ile  Val Ile Leu 
    1130                 1135                 1140             


Val Ile  Leu Phe Lys Cys Gly  Phe Phe Lys Arg Lys  Tyr Gln Gln 
    1145                 1150                 1155             


Leu Asn  Leu Glu Ser Ile Arg  Lys Ala Gln Leu Lys  Ser Glu Asn 
    1160                 1165                 1170             


Leu Leu  Glu Glu Glu Asn 
    1175                 


<210>  264
<211>  5226
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ITGAL integrin, alpha L (antigen CD11A (p180), lymphocyte 
       function-associated antigen 1; alpha polypeptide; Human) mRNA for
       transcript variant 1 GenBank Accession No.: NM_002209.2  
       GI:167466214

<400>  264
acacctccct ccccgcctgc cagtgtcacc agcctgttgc ctctgtgaga aagtaccact       60

gtaagaggcc aaagggcatg atcattttcc tctttcaccc tgtctaggtt gccagcaaat      120

cccacgggcc tcctgacgct gcccctgggg ccacaggtcc ctcgagtgct ggaaggatga      180

aggattcctg catcactgtg atggccatgg cgctgctgtc tgggttcttt ttcttcgcgc      240

cggcctcgag ctacaacctg gacgtgcggg gcgcgcggag cttctcccca ccgcgcgccg      300

ggaggcactt tggataccgc gtcctgcagg tcggaaacgg ggtcatcgtg ggagctccag      360

gggaggggaa cagcacagga agcctctatc agtgccagtc gggcacagga cactgcctgc      420

cagtcaccct gagaggttcc aactatacct ccaagtactt gggaatgacc ttggcaacag      480

accccacaga tggaagcatt ttggcctgtg accctgggct gtctcgaacg tgtgaccaga      540

acacctatct gagtggcctg tgttacctct tccgccagaa tctgcagggt cccatgctgc      600

aggggcgccc tggttttcag gaatgtatca agggcaacgt agacctggta tttctgtttg      660

atggttcgat gagcttgcag ccagatgaat ttcagaaaat tctggacttc atgaaggatg      720

tgatgaagaa actcagcaac acttcgtacc agtttgctgc tgttcagttt tccacaagct      780

acaaaacaga atttgatttc tcagattatg ttaaacggaa ggaccctgat gctctgctga      840

agcatgtaaa gcacatgttg ctgttgacca atacctttgg tgccatcaat tatgtcgcga      900

cagaggtgtt ccgggaggag ctgggggccc ggccagatgc caccaaagtg cttatcatca      960

tcacggatgg ggaggccact gacagtggca acatcgatgc ggccaaagac atcatccgct     1020

acatcatcgg gattggaaag cattttcaga ccaaggagag tcaggagacc ctccacaaat     1080

ttgcatcaaa acccgcgagc gagtttgtga aaattctgga cacatttgag aagctgaaag     1140

atctattcac tgagctgcag aagaagatct atgtcattga gggcacaagc aaacaggacc     1200

tgacttcctt caacatggag ctgtcctcca gcggcatcag tgctgacctc agcaggggcc     1260

atgcagtcgt gggggcagta ggagccaagg actgggctgg gggctttctt gacctgaagg     1320

cagacctgca ggatgacaca tttattggga atgaaccatt gacaccagaa gtgagagcag     1380

gctatttggg ttacaccgtg acctggctgc cctcccggca aaagacttcg ttgctggcct     1440

cgggagcccc tcgataccag cacatgggcc gagtgctgct gttccaagag ccacagggcg     1500

gaggacactg gagccaggtc cagacaatcc atgggaccca gattggctct tatttcggtg     1560

gggagctgtg tggcgtcgac gtggaccaag atggggagac agagctgctg ctgattggtg     1620

ccccactgtt ctatggggag cagagaggag gccgggtgtt tatctaccag agaagacagt     1680

tggggtttga agaagtctca gagctgcagg gggaccccgg ctacccactc gggcggtttg     1740

gagaagccat cactgctctg acagacatca acggcgatgg gctggtagac gtggctgtgg     1800

gggcccctct ggaggagcag ggggctgtgt acatcttcaa tgggaggcac ggggggctta     1860

gtccccagcc aagtcagcgg atagaaggga cccaagtgct ctcaggaatt cagtggtttg     1920

gacgctccat ccatggggtg aaggaccttg aaggggatgg cttggcagat gtggctgtgg     1980

gggctgagag ccagatgatc gtgctgagct cccggcccgt ggtggatatg gtcaccctga     2040

tgtccttctc tccagctgag atcccagtgc atgaagtgga gtgctcctat tcaaccagta     2100

acaagatgaa agaaggagtt aatatcacaa tctgtttcca gatcaagtct ctcatccccc     2160

agttccaagg ccgcctggtt gccaatctca cttacactct gcagctggat ggccaccgga     2220

ccagaagacg ggggttgttc ccaggaggga gacatgaact cagaaggaat atagctgtca     2280

ccaccagcat gtcatgcact gacttctcat ttcatttccc ggtatgtgtt caagacctca     2340

tctcccccat caatgtttcc ctgaatttct ctctttggga ggaggaaggg acaccgaggg     2400

accaaagggc gcagggcaag gacataccgc ccatcctgag accctccctg cactcggaaa     2460

cctgggagat cccttttgag aagaactgtg gggaggacaa gaagtgtgag gcaaacttga     2520

gagtgtcctt ctctcctgca agatccagag ccctgcgtct aactgctttt gccagcctct     2580

ctgtggagct gagcctgagt aacttggaag aagatgctta ctgggtccag ctggacctgc     2640

acttcccccc gggactctcc ttccgcaagg tggagatgct gaagccccat agccagatac     2700

ctgtgagctg cgaggagctt cctgaagagt ccaggcttct gtccagggca ttatcttgca     2760

atgtgagctc tcccatcttc aaagcaggcc actcggttgc tctgcagatg atgtttaata     2820

cactggtaaa cagctcctgg ggggactcgg ttgaattgca cgccaatgtg acctgtaaca     2880

atgaggactc agacctcctg gaggacaact cagccactac catcatcccc atcctgtacc     2940

ccatcaacat cctcatccag gaccaagaag actccacact ctatgtcagt ttcaccccca     3000

aaggccccaa gatccaccaa gtcaagcaca tgtaccaggt gaggatccag ccttccatcc     3060

acgaccacaa catacccacc ctggaggctg tggttggggt gccacagcct cccagcgagg     3120

ggcccatcac acaccagtgg agcgtgcaga tggagcctcc cgtgccctgc cactatgagg     3180

atctggagag gctcccggat gcagctgagc cttgtctccc cggagccctg ttccgctgcc     3240

ctgttgtctt caggcaggag atcctcgtcc aagtgatcgg gactctggag ctggtgggag     3300

agatcgaggc ctcttccatg ttcagcctct gcagctccct ctccatctcc ttcaacagca     3360

gcaagcattt ccacctctat ggcagcaacg cctccctggc ccaggttgtc atgaaggttg     3420

acgtggtgta tgagaagcag atgctctacc tctacgtgct gagcggcatc ggggggctgc     3480

tgctgctgct gctcattttc atagtgctgt acaaggttgg tttcttcaaa cggaacctga     3540

aggagaagat ggaggctggc agaggtgtcc cgaatggaat ccctgcagaa gactctgagc     3600

agctggcatc tgggcaagag gctggggatc ccggctgcct gaagcccctc catgagaagg     3660

actctgagag tggtggtggc aaggactgag tccaggcctg tgaggtgcag agtgcccaga     3720

actggactca ggatgcccag ggccactctg cctctgcctg cattctgccg tgtgccctcg     3780

ggcgagtcac tgcctctccc tggccctcag tttccctatc tcgaacatgg aactcattcc     3840

tgcctgtctc ctttgcaggc tcatagggaa gacctgctga gggaccagcc aagagggctg     3900

caaaagtgag ggcttgtcat taccagacgg ttcaccagcc tctcttggtt tccttccttg     3960

gaagagaatg tctgatctaa atgtggagaa actgtagtct caggacctag ggatgttctg     4020

gccctcaccc ctgccctggg atgtccacag atgcctccac cccccagaac ctgtccttgc     4080

acactcccct gcactggagt ccagtctctt ctgctggcag aaagcaaatg tgacctgtgt     4140

cactacgtga ctgtggcaca cgccttgttc ttggccaaag accaaattcc ttggcatgcc     4200

ttccagcacc ctgcaaaatg agaccctcgt ggccttcccc agcctcttct agagccgtga     4260

tgcctccctg ttgaagctct ggtgacacca gcctttctcc caggccaggc tccttcctgt     4320

cttcctgcat tcacccagac agctccctct gcctgaacct tccatctcgc cacccctcct     4380

tccttgacca gcagatccca gctcacgtca cacttggttg ggtcctcaca tctttcacac     4440

ttccaccagc ctgcactact ccctcaaagc acacgtcatg tttcttcatc cggcagcctg     4500

gatgtttttt ccctgtttaa tgattgacgt acttagcagc tatctctcag tgaactgtga     4560

gggtaaaggc tatacttgtc ttgttcacct tgggatgatg cctcatgata tgtcagggcg     4620

tgggacatct agtaggtgct tgacataatt tcactgaatt aatgacagag ccagtgggaa     4680

gatacagaaa aagaggggct gggctgggcg cggtggttca cgcctgtaat cccagcactt     4740

tgggaggcca aggagggtgg atcacctgag gtcaggagtt agaggccagc ctggcgaaac     4800

cccatctcta ctaaaaatac aaaatccagg cgtggtggca cacacctgta gtcccagcta     4860

ctcaggaggt tgaggtagga gaattgcttg aacctgggag gtggaggttg cagtgagcca     4920

agattgcgcc attgcactcc agcctgggca acacagcgag actccgtctc aaggaaaaaa     4980

taaaaataaa aagcgggcac gggcccgtga catccccacc cttggaggct gtcttctcag     5040

gctctgccct gccctagctc cacaccctct cccaggaccc atcacgcctg tgcagtggcc     5100

cccacagaaa gactgagctc aaggtgggaa ccacgtctgc taacttggag ccccagtgcc     5160

aagcacagtg cctgcatgta tttatccaat aaatgtgaaa ttctgtccaa aaaaaaaaaa     5220

aaaaaa                                                                5226


<210>  265
<211>  1170
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  ITGAL integrin, alpha L (antigen CD11A (p180), lymphocyte 
       function-associated antigen 1; alpha polypeptide; Human) 
       polypeptide for transcript variant 1 GenBank Accession No: 
       NP_002200.2 GI:167466215

<400>  265

Met Lys Asp Ser Cys Ile Thr Val Met Ala Met Ala Leu Leu Ser Gly 
1               5                   10                  15      


Phe Phe Phe Phe Ala Pro Ala Ser Ser Tyr Asn Leu Asp Val Arg Gly 
            20                  25                  30          


Ala Arg Ser Phe Ser Pro Pro Arg Ala Gly Arg His Phe Gly Tyr Arg 
        35                  40                  45              


Val Leu Gln Val Gly Asn Gly Val Ile Val Gly Ala Pro Gly Glu Gly 
    50                  55                  60                  


Asn Ser Thr Gly Ser Leu Tyr Gln Cys Gln Ser Gly Thr Gly His Cys 
65                  70                  75                  80  


Leu Pro Val Thr Leu Arg Gly Ser Asn Tyr Thr Ser Lys Tyr Leu Gly 
                85                  90                  95      


Met Thr Leu Ala Thr Asp Pro Thr Asp Gly Ser Ile Leu Ala Cys Asp 
            100                 105                 110         


Pro Gly Leu Ser Arg Thr Cys Asp Gln Asn Thr Tyr Leu Ser Gly Leu 
        115                 120                 125             


Cys Tyr Leu Phe Arg Gln Asn Leu Gln Gly Pro Met Leu Gln Gly Arg 
    130                 135                 140                 


Pro Gly Phe Gln Glu Cys Ile Lys Gly Asn Val Asp Leu Val Phe Leu 
145                 150                 155                 160 


Phe Asp Gly Ser Met Ser Leu Gln Pro Asp Glu Phe Gln Lys Ile Leu 
                165                 170                 175     


Asp Phe Met Lys Asp Val Met Lys Lys Leu Ser Asn Thr Ser Tyr Gln 
            180                 185                 190         


Phe Ala Ala Val Gln Phe Ser Thr Ser Tyr Lys Thr Glu Phe Asp Phe 
        195                 200                 205             


Ser Asp Tyr Val Lys Arg Lys Asp Pro Asp Ala Leu Leu Lys His Val 
    210                 215                 220                 


Lys His Met Leu Leu Leu Thr Asn Thr Phe Gly Ala Ile Asn Tyr Val 
225                 230                 235                 240 


Ala Thr Glu Val Phe Arg Glu Glu Leu Gly Ala Arg Pro Asp Ala Thr 
                245                 250                 255     


Lys Val Leu Ile Ile Ile Thr Asp Gly Glu Ala Thr Asp Ser Gly Asn 
            260                 265                 270         


Ile Asp Ala Ala Lys Asp Ile Ile Arg Tyr Ile Ile Gly Ile Gly Lys 
        275                 280                 285             


His Phe Gln Thr Lys Glu Ser Gln Glu Thr Leu His Lys Phe Ala Ser 
    290                 295                 300                 


Lys Pro Ala Ser Glu Phe Val Lys Ile Leu Asp Thr Phe Glu Lys Leu 
305                 310                 315                 320 


Lys Asp Leu Phe Thr Glu Leu Gln Lys Lys Ile Tyr Val Ile Glu Gly 
                325                 330                 335     


Thr Ser Lys Gln Asp Leu Thr Ser Phe Asn Met Glu Leu Ser Ser Ser 
            340                 345                 350         


Gly Ile Ser Ala Asp Leu Ser Arg Gly His Ala Val Val Gly Ala Val 
        355                 360                 365             


Gly Ala Lys Asp Trp Ala Gly Gly Phe Leu Asp Leu Lys Ala Asp Leu 
    370                 375                 380                 


Gln Asp Asp Thr Phe Ile Gly Asn Glu Pro Leu Thr Pro Glu Val Arg 
385                 390                 395                 400 


Ala Gly Tyr Leu Gly Tyr Thr Val Thr Trp Leu Pro Ser Arg Gln Lys 
                405                 410                 415     


Thr Ser Leu Leu Ala Ser Gly Ala Pro Arg Tyr Gln His Met Gly Arg 
            420                 425                 430         


Val Leu Leu Phe Gln Glu Pro Gln Gly Gly Gly His Trp Ser Gln Val 
        435                 440                 445             


Gln Thr Ile His Gly Thr Gln Ile Gly Ser Tyr Phe Gly Gly Glu Leu 
    450                 455                 460                 


Cys Gly Val Asp Val Asp Gln Asp Gly Glu Thr Glu Leu Leu Leu Ile 
465                 470                 475                 480 


Gly Ala Pro Leu Phe Tyr Gly Glu Gln Arg Gly Gly Arg Val Phe Ile 
                485                 490                 495     


Tyr Gln Arg Arg Gln Leu Gly Phe Glu Glu Val Ser Glu Leu Gln Gly 
            500                 505                 510         


Asp Pro Gly Tyr Pro Leu Gly Arg Phe Gly Glu Ala Ile Thr Ala Leu 
        515                 520                 525             


Thr Asp Ile Asn Gly Asp Gly Leu Val Asp Val Ala Val Gly Ala Pro 
    530                 535                 540                 


Leu Glu Glu Gln Gly Ala Val Tyr Ile Phe Asn Gly Arg His Gly Gly 
545                 550                 555                 560 


Leu Ser Pro Gln Pro Ser Gln Arg Ile Glu Gly Thr Gln Val Leu Ser 
                565                 570                 575     


Gly Ile Gln Trp Phe Gly Arg Ser Ile His Gly Val Lys Asp Leu Glu 
            580                 585                 590         


Gly Asp Gly Leu Ala Asp Val Ala Val Gly Ala Glu Ser Gln Met Ile 
        595                 600                 605             


Val Leu Ser Ser Arg Pro Val Val Asp Met Val Thr Leu Met Ser Phe 
    610                 615                 620                 


Ser Pro Ala Glu Ile Pro Val His Glu Val Glu Cys Ser Tyr Ser Thr 
625                 630                 635                 640 


Ser Asn Lys Met Lys Glu Gly Val Asn Ile Thr Ile Cys Phe Gln Ile 
                645                 650                 655     


Lys Ser Leu Ile Pro Gln Phe Gln Gly Arg Leu Val Ala Asn Leu Thr 
            660                 665                 670         


Tyr Thr Leu Gln Leu Asp Gly His Arg Thr Arg Arg Arg Gly Leu Phe 
        675                 680                 685             


Pro Gly Gly Arg His Glu Leu Arg Arg Asn Ile Ala Val Thr Thr Ser 
    690                 695                 700                 


Met Ser Cys Thr Asp Phe Ser Phe His Phe Pro Val Cys Val Gln Asp 
705                 710                 715                 720 


Leu Ile Ser Pro Ile Asn Val Ser Leu Asn Phe Ser Leu Trp Glu Glu 
                725                 730                 735     


Glu Gly Thr Pro Arg Asp Gln Arg Ala Gln Gly Lys Asp Ile Pro Pro 
            740                 745                 750         


Ile Leu Arg Pro Ser Leu His Ser Glu Thr Trp Glu Ile Pro Phe Glu 
        755                 760                 765             


Lys Asn Cys Gly Glu Asp Lys Lys Cys Glu Ala Asn Leu Arg Val Ser 
    770                 775                 780                 


Phe Ser Pro Ala Arg Ser Arg Ala Leu Arg Leu Thr Ala Phe Ala Ser 
785                 790                 795                 800 


Leu Ser Val Glu Leu Ser Leu Ser Asn Leu Glu Glu Asp Ala Tyr Trp 
                805                 810                 815     


Val Gln Leu Asp Leu His Phe Pro Pro Gly Leu Ser Phe Arg Lys Val 
            820                 825                 830         


Glu Met Leu Lys Pro His Ser Gln Ile Pro Val Ser Cys Glu Glu Leu 
        835                 840                 845             


Pro Glu Glu Ser Arg Leu Leu Ser Arg Ala Leu Ser Cys Asn Val Ser 
    850                 855                 860                 


Ser Pro Ile Phe Lys Ala Gly His Ser Val Ala Leu Gln Met Met Phe 
865                 870                 875                 880 


Asn Thr Leu Val Asn Ser Ser Trp Gly Asp Ser Val Glu Leu His Ala 
                885                 890                 895     


Asn Val Thr Cys Asn Asn Glu Asp Ser Asp Leu Leu Glu Asp Asn Ser 
            900                 905                 910         


Ala Thr Thr Ile Ile Pro Ile Leu Tyr Pro Ile Asn Ile Leu Ile Gln 
        915                 920                 925             


Asp Gln Glu Asp Ser Thr Leu Tyr Val Ser Phe Thr Pro Lys Gly Pro 
    930                 935                 940                 


Lys Ile His Gln Val Lys His Met Tyr Gln Val Arg Ile Gln Pro Ser 
945                 950                 955                 960 


Ile His Asp His Asn Ile Pro Thr Leu Glu Ala Val Val Gly Val Pro 
                965                 970                 975     


Gln Pro Pro Ser Glu Gly Pro Ile Thr His Gln Trp Ser Val Gln Met 
            980                 985                 990         


Glu Pro Pro Val Pro Cys His Tyr  Glu Asp Leu Glu Arg  Leu Pro Asp 
        995                 1000                 1005             


Ala Ala  Glu Pro Cys Leu Pro  Gly Ala Leu Phe Arg  Cys Pro Val 
    1010                 1015                 1020             


Val Phe  Arg Gln Glu Ile Leu  Val Gln Val Ile Gly  Thr Leu Glu 
    1025                 1030                 1035             


Leu Val  Gly Glu Ile Glu Ala  Ser Ser Met Phe Ser  Leu Cys Ser 
    1040                 1045                 1050             


Ser Leu  Ser Ile Ser Phe Asn  Ser Ser Lys His Phe  His Leu Tyr 
    1055                 1060                 1065             


Gly Ser  Asn Ala Ser Leu Ala  Gln Val Val Met Lys  Val Asp Val 
    1070                 1075                 1080             


Val Tyr  Glu Lys Gln Met Leu  Tyr Leu Tyr Val Leu  Ser Gly Ile 
    1085                 1090                 1095             


Gly Gly  Leu Leu Leu Leu Leu  Leu Ile Phe Ile Val  Leu Tyr Lys 
    1100                 1105                 1110             


Val Gly  Phe Phe Lys Arg Asn  Leu Lys Glu Lys Met  Glu Ala Gly 
    1115                 1120                 1125             


Arg Gly  Val Pro Asn Gly Ile  Pro Ala Glu Asp Ser  Glu Gln Leu 
    1130                 1135                 1140             


Ala Ser  Gly Gln Glu Ala Gly  Asp Pro Gly Cys Leu  Lys Pro Leu 
    1145                 1150                 1155             


His Glu  Lys Asp Ser Glu Ser  Gly Gly Gly Lys Asp  
    1160                 1165                 1170 


<210>  266
<211>  4974
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  ITGAL integrin, alpha L (antigen CD11A (p180), lymphocyte 
       function-associated antigen 1; alpha polypeptide; Human) mRNA for
       transcript variant 2 GenBank Accession No: NM_001114380.1  
       GI:167466216

<400>  266
acacctccct ccccgcctgc cagtgtcacc agcctgttgc ctctgtgaga aagtaccact       60

gtaagaggcc aaagggcatg atcattttcc tctttcaccc tgtctaggtt gccagcaaat      120

cccacgggcc tcctgacgct gcccctgggg ccacaggtcc ctcgagtgct ggaaggatga      180

aggattcctg catcactgtg atggccatgg cgctgctgtc tgggttcttt ttcttcgcgc      240

cggcctcgag ctacaacctg gacgtgcggg gcgcgcggag cttctcccca ccgcgcgccg      300

ggaggcactt tggataccgc gtcctgcagg tcggaaacgg ggtcatcgtg ggagctccag      360

gggaggggaa cagcacagga agcctctatc agtgccagtc gggcacagga cactgcctgc      420

cagtcaccct gagaggttcc aactatacct ccaagtactt gggaatgacc ttggcaacag      480

accccacaga tggaagcatt ttgtttgctg ctgttcagtt ttccacaagc tacaaaacag      540

aatttgattt ctcagattat gttaaacgga aggaccctga tgctctgctg aagcatgtaa      600

agcacatgtt gctgttgacc aatacctttg gtgccatcaa ttatgtcgcg acagaggtgt      660

tccgggagga gctgggggcc cggccagatg ccaccaaagt gcttatcatc atcacggatg      720

gggaggccac tgacagtggc aacatcgatg cggccaaaga catcatccgc tacatcatcg      780

ggattggaaa gcattttcag accaaggaga gtcaggagac cctccacaaa tttgcatcaa      840

aacccgcgag cgagtttgtg aaaattctgg acacatttga gaagctgaaa gatctattca      900

ctgagctgca gaagaagatc tatgtcattg agggcacaag caaacaggac ctgacttcct      960

tcaacatgga gctgtcctcc agcggcatca gtgctgacct cagcaggggc catgcagtcg     1020

tgggggcagt aggagccaag gactgggctg ggggctttct tgacctgaag gcagacctgc     1080

aggatgacac atttattggg aatgaaccat tgacaccaga agtgagagca ggctatttgg     1140

gttacaccgt gacctggctg ccctcccggc aaaagacttc gttgctggcc tcgggagccc     1200

ctcgatacca gcacatgggc cgagtgctgc tgttccaaga gccacagggc ggaggacact     1260

ggagccaggt ccagacaatc catgggaccc agattggctc ttatttcggt ggggagctgt     1320

gtggcgtcga cgtggaccaa gatggggaga cagagctgct gctgattggt gccccactgt     1380

tctatgggga gcagagagga ggccgggtgt ttatctacca gagaagacag ttggggtttg     1440

aagaagtctc agagctgcag ggggaccccg gctacccact cgggcggttt ggagaagcca     1500

tcactgctct gacagacatc aacggcgatg ggctggtaga cgtggctgtg ggggcccctc     1560

tggaggagca gggggctgtg tacatcttca atgggaggca cggggggctt agtccccagc     1620

caagtcagcg gatagaaggg acccaagtgc tctcaggaat tcagtggttt ggacgctcca     1680

tccatggggt gaaggacctt gaaggggatg gcttggcaga tgtggctgtg ggggctgaga     1740

gccagatgat cgtgctgagc tcccggcccg tggtggatat ggtcaccctg atgtccttct     1800

ctccagctga gatcccagtg catgaagtgg agtgctccta ttcaaccagt aacaagatga     1860

aagaaggagt taatatcaca atctgtttcc agatcaagtc tctcatcccc cagttccaag     1920

gccgcctggt tgccaatctc acttacactc tgcagctgga tggccaccgg accagaagac     1980

gggggttgtt cccaggaggg agacatgaac tcagaaggaa tatagctgtc accaccagca     2040

tgtcatgcac tgacttctca tttcatttcc cggtatgtgt tcaagacctc atctccccca     2100

tcaatgtttc cctgaatttc tctctttggg aggaggaagg gacaccgagg gaccaaaggg     2160

cgggcaagga cataccgccc atcctgagac cctccctgca ctcggaaacc tgggagatcc     2220

cttttgagaa gaactgtggg gaggacaaga agtgtgaggc aaacttgaga gtgtccttct     2280

ctcctgcaag atccagagcc ctgcgtctaa ctgcttttgc cagcctctct gtggagctga     2340

gcctgagtaa cttggaagaa gatgcttact gggtccagct ggacctgcac ttccccccgg     2400

gactctcctt ccgcaaggtg gagatgctga agccccatag ccagatacct gtgagctgcg     2460

aggagcttcc tgaagagtcc aggcttctgt ccagggcatt atcttgcaat gtgagctctc     2520

ccatcttcaa agcaggccac tcggttgctc tgcagatgat gtttaataca ctggtaaaca     2580

gctcctgggg ggactcggtt gaattgcacg ccaatgtgac ctgtaacaat gaggactcag     2640

acctcctgga ggacaactca gccactacca tcatccccat cctgtacccc atcaacatcc     2700

tcatccagga ccaagaagac tccacactct atgtcagttt cacccccaaa ggccccaaga     2760

tccaccaagt caagcacatg taccaggtga ggatccagcc ttccatccac gaccacaaca     2820

tacccaccct ggaggctgtg gttggggtgc cacagcctcc cagcgagggg cccatcacac     2880

accagtggag cgtgcagatg gagcctcccg tgccctgcca ctatgaggat ctggagaggc     2940

tcccggatgc agctgagcct tgtctccccg gagccctgtt ccgctgccct gttgtcttca     3000

ggcaggagat cctcgtccaa gtgatcggga ctctggagct ggtgggagag atcgaggcct     3060

cttccatgtt cagcctctgc agctccctct ccatctcctt caacagcagc aagcatttcc     3120

acctctatgg cagcaacgcc tccctggccc aggttgtcat gaaggttgac gtggtgtatg     3180

agaagcagat gctctacctc tacgtgctga gcggcatcgg ggggctgctg ctgctgctgc     3240

tcattttcat agtgctgtac aaggttggtt tcttcaaacg gaacctgaag gagaagatgg     3300

aggctggcag aggtgtcccg aatggaatcc ctgcagaaga ctctgagcag ctggcatctg     3360

ggcaagaggc tggggatccc ggctgcctga agcccctcca tgagaaggac tctgagagtg     3420

gtggtggcaa ggactgagtc caggcctgtg aggtgcagag tgcccagaac tggactcagg     3480

atgcccaggg ccactctgcc tctgcctgca ttctgccgtg tgccctcggg cgagtcactg     3540

cctctccctg gccctcagtt tccctatctc gaacatggaa ctcattcctg cctgtctcct     3600

ttgcaggctc atagggaaga cctgctgagg gaccagccaa gagggctgca aaagtgaggg     3660

cttgtcatta ccagacggtt caccagcctc tcttggtttc cttccttgga agagaatgtc     3720

tgatctaaat gtggagaaac tgtagtctca ggacctaggg atgttctggc cctcacccct     3780

gccctgggat gtccacagat gcctccaccc cccagaacct gtccttgcac actcccctgc     3840

actggagtcc agtctcttct gctggcagaa agcaaatgtg acctgtgtca ctacgtgact     3900

gtggcacacg ccttgttctt ggccaaagac caaattcctt ggcatgcctt ccagcaccct     3960

gcaaaatgag accctcgtgg ccttccccag cctcttctag agccgtgatg cctccctgtt     4020

gaagctctgg tgacaccagc ctttctccca ggccaggctc cttcctgtct tcctgcattc     4080

acccagacag ctccctctgc ctgaaccttc catctcgcca cccctccttc cttgaccagc     4140

agatcccagc tcacgtcaca cttggttggg tcctcacatc tttcacactt ccaccagcct     4200

gcactactcc ctcaaagcac acgtcatgtt tcttcatccg gcagcctgga tgttttttcc     4260

ctgtttaatg attgacgtac ttagcagcta tctctcagtg aactgtgagg gtaaaggcta     4320

tacttgtctt gttcaccttg ggatgatgcc tcatgatatg tcagggcgtg ggacatctag     4380

taggtgcttg acataatttc actgaattaa tgacagagcc agtgggaaga tacagaaaaa     4440

gaggggctgg gctgggcgcg gtggttcacg cctgtaatcc cagcactttg ggaggccaag     4500

gagggtggat cacctgaggt caggagttag aggccagcct ggcgaaaccc catctctact     4560

aaaaatacaa aatccaggcg tggtggcaca cacctgtagt cccagctact caggaggttg     4620

aggtaggaga attgcttgaa cctgggaggt ggaggttgca gtgagccaag attgcgccat     4680

tgcactccag cctgggcaac acagcgagac tccgtctcaa ggaaaaaata aaaataaaaa     4740

gcgggcacgg gcccgtgaca tccccaccct tggaggctgt cttctcaggc tctgccctgc     4800

cctagctcca caccctctcc caggacccat cacgcctgtg cagtggcccc cacagaaaga     4860

ctgagctcaa ggtgggaacc acgtctgcta acttggagcc ccagtgccaa gcacagtgcc     4920

tgcatgtatt tatccaataa atgtgaaatt ctgtccaaaa aaaaaaaaaa aaaa           4974


<210>  267
<211>  1086
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  ITGAL integrin, alpha L (antigen CD11A (p180), lymphocyte 
       function-associated antigen 1; alpha polypeptide; Human) 
       polypeptide for transcript variant 2 GenBank Accession No: 
       NP_001107852.1 GI:167466217

<400>  267

Met Lys Asp Ser Cys Ile Thr Val Met Ala Met Ala Leu Leu Ser Gly 
1               5                   10                  15      


Phe Phe Phe Phe Ala Pro Ala Ser Ser Tyr Asn Leu Asp Val Arg Gly 
            20                  25                  30          


Ala Arg Ser Phe Ser Pro Pro Arg Ala Gly Arg His Phe Gly Tyr Arg 
        35                  40                  45              


Val Leu Gln Val Gly Asn Gly Val Ile Val Gly Ala Pro Gly Glu Gly 
    50                  55                  60                  


Asn Ser Thr Gly Ser Leu Tyr Gln Cys Gln Ser Gly Thr Gly His Cys 
65                  70                  75                  80  


Leu Pro Val Thr Leu Arg Gly Ser Asn Tyr Thr Ser Lys Tyr Leu Gly 
                85                  90                  95      


Met Thr Leu Ala Thr Asp Pro Thr Asp Gly Ser Ile Leu Phe Ala Ala 
            100                 105                 110         


Val Gln Phe Ser Thr Ser Tyr Lys Thr Glu Phe Asp Phe Ser Asp Tyr 
        115                 120                 125             


Val Lys Arg Lys Asp Pro Asp Ala Leu Leu Lys His Val Lys His Met 
    130                 135                 140                 


Leu Leu Leu Thr Asn Thr Phe Gly Ala Ile Asn Tyr Val Ala Thr Glu 
145                 150                 155                 160 


Val Phe Arg Glu Glu Leu Gly Ala Arg Pro Asp Ala Thr Lys Val Leu 
                165                 170                 175     


Ile Ile Ile Thr Asp Gly Glu Ala Thr Asp Ser Gly Asn Ile Asp Ala 
            180                 185                 190         


Ala Lys Asp Ile Ile Arg Tyr Ile Ile Gly Ile Gly Lys His Phe Gln 
        195                 200                 205             


Thr Lys Glu Ser Gln Glu Thr Leu His Lys Phe Ala Ser Lys Pro Ala 
    210                 215                 220                 


Ser Glu Phe Val Lys Ile Leu Asp Thr Phe Glu Lys Leu Lys Asp Leu 
225                 230                 235                 240 


Phe Thr Glu Leu Gln Lys Lys Ile Tyr Val Ile Glu Gly Thr Ser Lys 
                245                 250                 255     


Gln Asp Leu Thr Ser Phe Asn Met Glu Leu Ser Ser Ser Gly Ile Ser 
            260                 265                 270         


Ala Asp Leu Ser Arg Gly His Ala Val Val Gly Ala Val Gly Ala Lys 
        275                 280                 285             


Asp Trp Ala Gly Gly Phe Leu Asp Leu Lys Ala Asp Leu Gln Asp Asp 
    290                 295                 300                 


Thr Phe Ile Gly Asn Glu Pro Leu Thr Pro Glu Val Arg Ala Gly Tyr 
305                 310                 315                 320 


Leu Gly Tyr Thr Val Thr Trp Leu Pro Ser Arg Gln Lys Thr Ser Leu 
                325                 330                 335     


Leu Ala Ser Gly Ala Pro Arg Tyr Gln His Met Gly Arg Val Leu Leu 
            340                 345                 350         


Phe Gln Glu Pro Gln Gly Gly Gly His Trp Ser Gln Val Gln Thr Ile 
        355                 360                 365             


His Gly Thr Gln Ile Gly Ser Tyr Phe Gly Gly Glu Leu Cys Gly Val 
    370                 375                 380                 


Asp Val Asp Gln Asp Gly Glu Thr Glu Leu Leu Leu Ile Gly Ala Pro 
385                 390                 395                 400 


Leu Phe Tyr Gly Glu Gln Arg Gly Gly Arg Val Phe Ile Tyr Gln Arg 
                405                 410                 415     


Arg Gln Leu Gly Phe Glu Glu Val Ser Glu Leu Gln Gly Asp Pro Gly 
            420                 425                 430         


Tyr Pro Leu Gly Arg Phe Gly Glu Ala Ile Thr Ala Leu Thr Asp Ile 
        435                 440                 445             


Asn Gly Asp Gly Leu Val Asp Val Ala Val Gly Ala Pro Leu Glu Glu 
    450                 455                 460                 


Gln Gly Ala Val Tyr Ile Phe Asn Gly Arg His Gly Gly Leu Ser Pro 
465                 470                 475                 480 


Gln Pro Ser Gln Arg Ile Glu Gly Thr Gln Val Leu Ser Gly Ile Gln 
                485                 490                 495     


Trp Phe Gly Arg Ser Ile His Gly Val Lys Asp Leu Glu Gly Asp Gly 
            500                 505                 510         


Leu Ala Asp Val Ala Val Gly Ala Glu Ser Gln Met Ile Val Leu Ser 
        515                 520                 525             


Ser Arg Pro Val Val Asp Met Val Thr Leu Met Ser Phe Ser Pro Ala 
    530                 535                 540                 


Glu Ile Pro Val His Glu Val Glu Cys Ser Tyr Ser Thr Ser Asn Lys 
545                 550                 555                 560 


Met Lys Glu Gly Val Asn Ile Thr Ile Cys Phe Gln Ile Lys Ser Leu 
                565                 570                 575     


Ile Pro Gln Phe Gln Gly Arg Leu Val Ala Asn Leu Thr Tyr Thr Leu 
            580                 585                 590         


Gln Leu Asp Gly His Arg Thr Arg Arg Arg Gly Leu Phe Pro Gly Gly 
        595                 600                 605             


Arg His Glu Leu Arg Arg Asn Ile Ala Val Thr Thr Ser Met Ser Cys 
    610                 615                 620                 


Thr Asp Phe Ser Phe His Phe Pro Val Cys Val Gln Asp Leu Ile Ser 
625                 630                 635                 640 


Pro Ile Asn Val Ser Leu Asn Phe Ser Leu Trp Glu Glu Glu Gly Thr 
                645                 650                 655     


Pro Arg Asp Gln Arg Ala Gly Lys Asp Ile Pro Pro Ile Leu Arg Pro 
            660                 665                 670         


Ser Leu His Ser Glu Thr Trp Glu Ile Pro Phe Glu Lys Asn Cys Gly 
        675                 680                 685             


Glu Asp Lys Lys Cys Glu Ala Asn Leu Arg Val Ser Phe Ser Pro Ala 
    690                 695                 700                 


Arg Ser Arg Ala Leu Arg Leu Thr Ala Phe Ala Ser Leu Ser Val Glu 
705                 710                 715                 720 


Leu Ser Leu Ser Asn Leu Glu Glu Asp Ala Tyr Trp Val Gln Leu Asp 
                725                 730                 735     


Leu His Phe Pro Pro Gly Leu Ser Phe Arg Lys Val Glu Met Leu Lys 
            740                 745                 750         


Pro His Ser Gln Ile Pro Val Ser Cys Glu Glu Leu Pro Glu Glu Ser 
        755                 760                 765             


Arg Leu Leu Ser Arg Ala Leu Ser Cys Asn Val Ser Ser Pro Ile Phe 
    770                 775                 780                 


Lys Ala Gly His Ser Val Ala Leu Gln Met Met Phe Asn Thr Leu Val 
785                 790                 795                 800 


Asn Ser Ser Trp Gly Asp Ser Val Glu Leu His Ala Asn Val Thr Cys 
                805                 810                 815     


Asn Asn Glu Asp Ser Asp Leu Leu Glu Asp Asn Ser Ala Thr Thr Ile 
            820                 825                 830         


Ile Pro Ile Leu Tyr Pro Ile Asn Ile Leu Ile Gln Asp Gln Glu Asp 
        835                 840                 845             


Ser Thr Leu Tyr Val Ser Phe Thr Pro Lys Gly Pro Lys Ile His Gln 
    850                 855                 860                 


Val Lys His Met Tyr Gln Val Arg Ile Gln Pro Ser Ile His Asp His 
865                 870                 875                 880 


Asn Ile Pro Thr Leu Glu Ala Val Val Gly Val Pro Gln Pro Pro Ser 
                885                 890                 895     


Glu Gly Pro Ile Thr His Gln Trp Ser Val Gln Met Glu Pro Pro Val 
            900                 905                 910         


Pro Cys His Tyr Glu Asp Leu Glu Arg Leu Pro Asp Ala Ala Glu Pro 
        915                 920                 925             


Cys Leu Pro Gly Ala Leu Phe Arg Cys Pro Val Val Phe Arg Gln Glu 
    930                 935                 940                 


Ile Leu Val Gln Val Ile Gly Thr Leu Glu Leu Val Gly Glu Ile Glu 
945                 950                 955                 960 


Ala Ser Ser Met Phe Ser Leu Cys Ser Ser Leu Ser Ile Ser Phe Asn 
                965                 970                 975     


Ser Ser Lys His Phe His Leu Tyr Gly Ser Asn Ala Ser Leu Ala Gln 
            980                 985                 990         


Val Val Met Lys Val Asp Val Val  Tyr Glu Lys Gln Met  Leu Tyr Leu 
        995                 1000                 1005             


Tyr Val  Leu Ser Gly Ile Gly  Gly Leu Leu Leu Leu  Leu Leu Ile 
    1010                 1015                 1020             


Phe Ile  Val Leu Tyr Lys Val  Gly Phe Phe Lys Arg  Asn Leu Lys 
    1025                 1030                 1035             


Glu Lys  Met Glu Ala Gly Arg  Gly Val Pro Asn Gly  Ile Pro Ala 
    1040                 1045                 1050             


Glu Asp  Ser Glu Gln Leu Ala  Ser Gly Gln Glu Ala  Gly Asp Pro 
    1055                 1060                 1065             


Gly Cys  Leu Lys Pro Leu His  Glu Lys Asp Ser Glu  Ser Gly Gly 
    1070                 1075                 1080             


Gly Lys  Asp 
    1085     


<210>  268
<211>  4745
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha M (complement component 3 receptor 3
       subunit) (ITGAM), transcript variant 1, mRNA GenBank Accession 
       No: NM_001145808.1  GI:224831238

<400>  268
ttttctgccc ttctttgctt tggtggcttc cttgtggttc ctcagtggtg cctgcaaccc       60

ctggttcacc tccttccagg ttctggctcc ttccagccat ggctctcaga gtccttctgt      120

taacagcctt gaccttatgt catgggttca acttggacac tgaaaacgca atgaccttcc      180

aagagaacgc aaggggcttc gggcagagcg tggtccagct tcagggatcc agggtggtgg      240

ttggagcccc ccaggagata gtggctgcca accaaagggg cagcctctac cagtgcgact      300

acagcacagg ctcatgcgag cccatccgcc tgcaggtccc cgtggaggcc gtgaacatgt      360

ccctgggcct gtccctggca gccaccacca gcccccctca gctgctggcc tgtggtccca      420

ccgtgcacca gacttgcagt gagaacacgt atgtgaaagg gctctgcttc ctgtttggat      480

ccaacctacg gcagcagccc cagaagttcc cagaggccct ccgagggtgt cctcaagagg      540

atagtgacat tgccttcttg attgatggct ctggtagcat catcccacat gactttcggc      600

ggatgaagga gtttgtctca actgtgatgg agcaattaaa aaagtccaaa accttgttct      660

ctttgatgca gtactctgaa gaattccgga ttcactttac cttcaaagag ttccagaaca      720

accctaaccc aagatcactg gtgaagccaa taacgcagct gcttgggcgg acacacacgg      780

ccacgggcat ccgcaaagtg gtacgagagc tgtttaacat caccaacgga gcccgaaaga      840

atgcctttaa gatcctagtt gtcatcacgg atggagaaaa gtttggcgat cccttgggat      900

atgaggatgt catccctgag gcagacagag agggagtcat tcgctacgtc attggggtgg      960

gagatgcctt ccgcagtgag aaatcccgcc aagagcttaa taccatcgca tccaagccgc     1020

ctcgtgatca cgtgttccag gtgaataact ttgaggctct gaagaccatt cagaaccagc     1080

ttcgggagaa gatctttgcg atcgagggta ctcagacagg aagtagcagc tcctttgagc     1140

atgagatgtc tcaggaaggc ttcagcgctg ccatcacctc taatggcccc ttgctgagca     1200

ctgtggggag ctatgactgg gctggtggag tctttctata tacatcaaag gagaaaagca     1260

ccttcatcaa catgaccaga gtggattcag acatgaatga tgcttacttg ggttatgctg     1320

ccgccatcat cttacggaac cgggtgcaaa gcctggttct gggggcacct cgatatcagc     1380

acatcggcct ggtagcgatg ttcaggcaga acactggcat gtgggagtcc aacgctaatg     1440

tcaagggcac ccagatcggc gcctacttcg gggcctccct ctgctccgtg gacgtggaca     1500

gcaacggcag caccgacctg gtcctcatcg gggcccccca ttactacgag cagacccgag     1560

ggggccaggt gtccgtgtgc cccttgccca gggggcagag ggctcggtgg cagtgtgatg     1620

ctgttctcta cggggagcag ggccaaccct ggggccgctt tggggcagcc ctaacagtgc     1680

tgggggacgt aaatggggac aagctgacgg acgtggccat tggggcccca ggagaggagg     1740

acaaccgggg tgctgtttac ctgtttcacg gaacctcagg atctggcatc agcccctccc     1800

atagccagcg gatagcaggc tccaagctct ctcccaggct ccagtatttt ggtcagtcac     1860

tgagtggggg ccaggacctc acaatggatg gactggtaga cctgactgta ggagcccagg     1920

ggcacgtgct gctgctcagg tcccagccag tactgagagt caaggcaatc atggagttca     1980

atcccaggga agtggcaagg aatgtatttg agtgtaatga tcaggtggtg aaaggcaagg     2040

aagccggaga ggtcagagtc tgcctccatg tccagaagag cacacgggat cggctaagag     2100

aaggacagat ccagagtgtt gtgacttatg acctggctct ggactccggc cgcccacatt     2160

cccgcgccgt cttcaatgag acaaagaaca gcacacgcag acagacacag gtcttggggc     2220

tgacccagac ttgtgagacc ctgaaactac agttgccgaa ttgcatcgag gacccagtga     2280

gccccattgt gctgcgcctg aacttctctc tggtgggaac gccattgtct gctttcggga     2340

acctccggcc agtgctggcg gaggatgctc agagactctt cacagccttg tttccctttg     2400

agaagaattg tggcaatgac aacatctgcc aggatgacct cagcatcacc ttcagtttca     2460

tgagcctgga ctgcctcgtg gtgggtgggc cccgggagtt caacgtgaca gtgactgtga     2520

gaaatgatgg tgaggactcc tacaggacac aggtcacctt cttcttcccg cttgacctgt     2580

cctaccggaa ggtgtccacg ctccagaacc agcgctcaca gcgatcctgg cgcctggcct     2640

gtgagtctgc ctcctccacc gaagtgtctg gggccttgaa gagcaccagc tgcagcataa     2700

accaccccat cttcccggaa aactcagagg tcacctttaa tatcacgttt gatgtagact     2760

ctaaggcttc ccttggaaac aaactgctcc tcaaggccaa tgtgaccagt gagaacaaca     2820

tgcccagaac caacaaaacc gaattccaac tggagctgcc ggtgaaatat gctgtctaca     2880

tggtggtcac cagccatggg gtctccacta aatatctcaa cttcacggcc tcagagaata     2940

ccagtcgggt catgcagcat caatatcagg tcagcaacct ggggcagagg agcctcccca     3000

tcagcctggt gttcttggtg cccgtccggc tgaaccagac tgtcatatgg gaccgccccc     3060

aggtcacctt ctccgagaac ctctcgagta cgtgccacac caaggagcgc ttgccctctc     3120

actccgactt tctggctgag cttcggaagg cccccgtggt gaactgctcc atcgctgtct     3180

gccagagaat ccagtgtgac atcccgttct ttggcatcca ggaagaattc aatgctaccc     3240

tcaaaggcaa cctctcgttt gactggtaca tcaagacctc gcataaccac ctcctgatcg     3300

tgagcacagc tgagatcttg tttaacgatt ccgtgttcac cctgctgccg ggacaggggg     3360

cgtttgtgag gtcccagacg gagaccaaag tggagccgtt cgaggtcccc aaccccctgc     3420

cgctcatcgt gggcagctct gtcgggggac tgctgctcct ggccctcatc accgccgcgc     3480

tgtacaagct cggcttcttc aagcggcaat acaaggacat gatgagtgaa gggggtcccc     3540

cgggggccga accccagtag cggctccttc ccgacagagc tgcctctcgg tggccagcag     3600

gactctgccc agaccacacg tagcccccag gctgctggac acgtcggaca gcgaagtatc     3660

cccgacagga cgggcttggg cttccatttg tgtgtgtgca agtgtgtatg tgcgtgtgtg     3720

caagtgtctg tgtgcaagtg tgtgcacatg tgtgcgtgtg cgtgcatgtg cacttgcacg     3780

cccatgtgtg agtgtgtgca agtatgtgag tgtgtccaag tgtgtgtgcg tgtgtccatg     3840

tgtgtgcaag tgtgtgcatg tgtgcgagtg tgtgcatgtg tgtgctcagg ggcgtgtggc     3900

tcacgtgtgt gactcagatg tctctggcgt gtgggtaggt gacggcagcg tagcctctcc     3960

ggcagaaggg aactgcctgg gctcccttgt gcgtgggtga agccgctgct gggttttcct     4020

ccgggagagg ggacggtcaa tcctgtgggt gaagacagag ggaaacacag cagcttctct     4080

ccactgaaag aagtgggact tcccgtcgcc tgcgagcctg cggcctgctg gagcctgcgc     4140

agcttggatg gagactccat gagaagccgt gggtggaacc aggaacctcc tccacaccag     4200

cgctgatgcc caataaagat gcccactgag gaatgatgaa gcttcctttc tggattcatt     4260

tattatttca atgtgacttt aattttttgg atggataagc ttgtctatgg tacaaaaatc     4320

acaaggcatt caagtgtaca gtgaaaagtc tccctttcca gatattcaag tcacctcctt     4380

aaaggtagtc aagattgtgt tttgaggttt ccttcagaca gattccaggc gatgtgcaag     4440

tgtatgcacg tgtgcacaca caccacacat acacacacac aagctttttt acacaaatgg     4500

tagcatactt tatattggtc tgtatcttgc tttttttcac caatatttct cagacatcgg     4560

ttcatattaa gacataaatt actttttcat tcttttatac cgctgcatag tattccattg     4620

tgtgagtgta ccataatgta tttaaccagt cttcttttga tatactattt tcattctctt     4680

gttattgcat caatgctgag ttaataaatc aaatatatgt catttttgca tatatgtaag     4740

gataa                                                                 4745


<210>  269
<211>  1153
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha M (complement component 3 receptor 3
       subunit) (ITGAM), transcript variant 1, polypeptide GenBank 
       Accession No: NP_001139280.1 GI:224831239

<400>  269

Met Ala Leu Arg Val Leu Leu Leu Thr Ala Leu Thr Leu Cys His Gly 
1               5                   10                  15      


Phe Asn Leu Asp Thr Glu Asn Ala Met Thr Phe Gln Glu Asn Ala Arg 
            20                  25                  30          


Gly Phe Gly Gln Ser Val Val Gln Leu Gln Gly Ser Arg Val Val Val 
        35                  40                  45              


Gly Ala Pro Gln Glu Ile Val Ala Ala Asn Gln Arg Gly Ser Leu Tyr 
    50                  55                  60                  


Gln Cys Asp Tyr Ser Thr Gly Ser Cys Glu Pro Ile Arg Leu Gln Val 
65                  70                  75                  80  


Pro Val Glu Ala Val Asn Met Ser Leu Gly Leu Ser Leu Ala Ala Thr 
                85                  90                  95      


Thr Ser Pro Pro Gln Leu Leu Ala Cys Gly Pro Thr Val His Gln Thr 
            100                 105                 110         


Cys Ser Glu Asn Thr Tyr Val Lys Gly Leu Cys Phe Leu Phe Gly Ser 
        115                 120                 125             


Asn Leu Arg Gln Gln Pro Gln Lys Phe Pro Glu Ala Leu Arg Gly Cys 
    130                 135                 140                 


Pro Gln Glu Asp Ser Asp Ile Ala Phe Leu Ile Asp Gly Ser Gly Ser 
145                 150                 155                 160 


Ile Ile Pro His Asp Phe Arg Arg Met Lys Glu Phe Val Ser Thr Val 
                165                 170                 175     


Met Glu Gln Leu Lys Lys Ser Lys Thr Leu Phe Ser Leu Met Gln Tyr 
            180                 185                 190         


Ser Glu Glu Phe Arg Ile His Phe Thr Phe Lys Glu Phe Gln Asn Asn 
        195                 200                 205             


Pro Asn Pro Arg Ser Leu Val Lys Pro Ile Thr Gln Leu Leu Gly Arg 
    210                 215                 220                 


Thr His Thr Ala Thr Gly Ile Arg Lys Val Val Arg Glu Leu Phe Asn 
225                 230                 235                 240 


Ile Thr Asn Gly Ala Arg Lys Asn Ala Phe Lys Ile Leu Val Val Ile 
                245                 250                 255     


Thr Asp Gly Glu Lys Phe Gly Asp Pro Leu Gly Tyr Glu Asp Val Ile 
            260                 265                 270         


Pro Glu Ala Asp Arg Glu Gly Val Ile Arg Tyr Val Ile Gly Val Gly 
        275                 280                 285             


Asp Ala Phe Arg Ser Glu Lys Ser Arg Gln Glu Leu Asn Thr Ile Ala 
    290                 295                 300                 


Ser Lys Pro Pro Arg Asp His Val Phe Gln Val Asn Asn Phe Glu Ala 
305                 310                 315                 320 


Leu Lys Thr Ile Gln Asn Gln Leu Arg Glu Lys Ile Phe Ala Ile Glu 
                325                 330                 335     


Gly Thr Gln Thr Gly Ser Ser Ser Ser Phe Glu His Glu Met Ser Gln 
            340                 345                 350         


Glu Gly Phe Ser Ala Ala Ile Thr Ser Asn Gly Pro Leu Leu Ser Thr 
        355                 360                 365             


Val Gly Ser Tyr Asp Trp Ala Gly Gly Val Phe Leu Tyr Thr Ser Lys 
    370                 375                 380                 


Glu Lys Ser Thr Phe Ile Asn Met Thr Arg Val Asp Ser Asp Met Asn 
385                 390                 395                 400 


Asp Ala Tyr Leu Gly Tyr Ala Ala Ala Ile Ile Leu Arg Asn Arg Val 
                405                 410                 415     


Gln Ser Leu Val Leu Gly Ala Pro Arg Tyr Gln His Ile Gly Leu Val 
            420                 425                 430         


Ala Met Phe Arg Gln Asn Thr Gly Met Trp Glu Ser Asn Ala Asn Val 
        435                 440                 445             


Lys Gly Thr Gln Ile Gly Ala Tyr Phe Gly Ala Ser Leu Cys Ser Val 
    450                 455                 460                 


Asp Val Asp Ser Asn Gly Ser Thr Asp Leu Val Leu Ile Gly Ala Pro 
465                 470                 475                 480 


His Tyr Tyr Glu Gln Thr Arg Gly Gly Gln Val Ser Val Cys Pro Leu 
                485                 490                 495     


Pro Arg Gly Gln Arg Ala Arg Trp Gln Cys Asp Ala Val Leu Tyr Gly 
            500                 505                 510         


Glu Gln Gly Gln Pro Trp Gly Arg Phe Gly Ala Ala Leu Thr Val Leu 
        515                 520                 525             


Gly Asp Val Asn Gly Asp Lys Leu Thr Asp Val Ala Ile Gly Ala Pro 
    530                 535                 540                 


Gly Glu Glu Asp Asn Arg Gly Ala Val Tyr Leu Phe His Gly Thr Ser 
545                 550                 555                 560 


Gly Ser Gly Ile Ser Pro Ser His Ser Gln Arg Ile Ala Gly Ser Lys 
                565                 570                 575     


Leu Ser Pro Arg Leu Gln Tyr Phe Gly Gln Ser Leu Ser Gly Gly Gln 
            580                 585                 590         


Asp Leu Thr Met Asp Gly Leu Val Asp Leu Thr Val Gly Ala Gln Gly 
        595                 600                 605             


His Val Leu Leu Leu Arg Ser Gln Pro Val Leu Arg Val Lys Ala Ile 
    610                 615                 620                 


Met Glu Phe Asn Pro Arg Glu Val Ala Arg Asn Val Phe Glu Cys Asn 
625                 630                 635                 640 


Asp Gln Val Val Lys Gly Lys Glu Ala Gly Glu Val Arg Val Cys Leu 
                645                 650                 655     


His Val Gln Lys Ser Thr Arg Asp Arg Leu Arg Glu Gly Gln Ile Gln 
            660                 665                 670         


Ser Val Val Thr Tyr Asp Leu Ala Leu Asp Ser Gly Arg Pro His Ser 
        675                 680                 685             


Arg Ala Val Phe Asn Glu Thr Lys Asn Ser Thr Arg Arg Gln Thr Gln 
    690                 695                 700                 


Val Leu Gly Leu Thr Gln Thr Cys Glu Thr Leu Lys Leu Gln Leu Pro 
705                 710                 715                 720 


Asn Cys Ile Glu Asp Pro Val Ser Pro Ile Val Leu Arg Leu Asn Phe 
                725                 730                 735     


Ser Leu Val Gly Thr Pro Leu Ser Ala Phe Gly Asn Leu Arg Pro Val 
            740                 745                 750         


Leu Ala Glu Asp Ala Gln Arg Leu Phe Thr Ala Leu Phe Pro Phe Glu 
        755                 760                 765             


Lys Asn Cys Gly Asn Asp Asn Ile Cys Gln Asp Asp Leu Ser Ile Thr 
    770                 775                 780                 


Phe Ser Phe Met Ser Leu Asp Cys Leu Val Val Gly Gly Pro Arg Glu 
785                 790                 795                 800 


Phe Asn Val Thr Val Thr Val Arg Asn Asp Gly Glu Asp Ser Tyr Arg 
                805                 810                 815     


Thr Gln Val Thr Phe Phe Phe Pro Leu Asp Leu Ser Tyr Arg Lys Val 
            820                 825                 830         


Ser Thr Leu Gln Asn Gln Arg Ser Gln Arg Ser Trp Arg Leu Ala Cys 
        835                 840                 845             


Glu Ser Ala Ser Ser Thr Glu Val Ser Gly Ala Leu Lys Ser Thr Ser 
    850                 855                 860                 


Cys Ser Ile Asn His Pro Ile Phe Pro Glu Asn Ser Glu Val Thr Phe 
865                 870                 875                 880 


Asn Ile Thr Phe Asp Val Asp Ser Lys Ala Ser Leu Gly Asn Lys Leu 
                885                 890                 895     


Leu Leu Lys Ala Asn Val Thr Ser Glu Asn Asn Met Pro Arg Thr Asn 
            900                 905                 910         


Lys Thr Glu Phe Gln Leu Glu Leu Pro Val Lys Tyr Ala Val Tyr Met 
        915                 920                 925             


Val Val Thr Ser His Gly Val Ser Thr Lys Tyr Leu Asn Phe Thr Ala 
    930                 935                 940                 


Ser Glu Asn Thr Ser Arg Val Met Gln His Gln Tyr Gln Val Ser Asn 
945                 950                 955                 960 


Leu Gly Gln Arg Ser Leu Pro Ile Ser Leu Val Phe Leu Val Pro Val 
                965                 970                 975     


Arg Leu Asn Gln Thr Val Ile Trp Asp Arg Pro Gln Val Thr Phe Ser 
            980                 985                 990         


Glu Asn Leu Ser Ser Thr Cys His  Thr Lys Glu Arg Leu  Pro Ser His 
        995                 1000                 1005             


Ser Asp  Phe Leu Ala Glu Leu  Arg Lys Ala Pro Val  Val Asn Cys 
    1010                 1015                 1020             


Ser Ile  Ala Val Cys Gln Arg  Ile Gln Cys Asp Ile  Pro Phe Phe 
    1025                 1030                 1035             


Gly Ile  Gln Glu Glu Phe Asn  Ala Thr Leu Lys Gly  Asn Leu Ser 
    1040                 1045                 1050             


Phe Asp  Trp Tyr Ile Lys Thr  Ser His Asn His Leu  Leu Ile Val 
    1055                 1060                 1065             


Ser Thr  Ala Glu Ile Leu Phe  Asn Asp Ser Val Phe  Thr Leu Leu 
    1070                 1075                 1080             


Pro Gly  Gln Gly Ala Phe Val  Arg Ser Gln Thr Glu  Thr Lys Val 
    1085                 1090                 1095             


Glu Pro  Phe Glu Val Pro Asn  Pro Leu Pro Leu Ile  Val Gly Ser 
    1100                 1105                 1110             


Ser Val  Gly Gly Leu Leu Leu  Leu Ala Leu Ile Thr  Ala Ala Leu 
    1115                 1120                 1125             


Tyr Lys  Leu Gly Phe Phe Lys  Arg Gln Tyr Lys Asp  Met Met Ser 
    1130                 1135                 1140             


Glu Gly  Gly Pro Pro Gly Ala  Glu Pro Gln 
    1145                 1150             


<210>  270
<211>  4742
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha M (complement component 3 receptor 3
       subunit) (ITGAM), transcript variant 2, mRNA GenBank Accession 
       No: NM_000632.3  GI:88501733

<400>  270
ttttctgccc ttctttgctt tggtggcttc cttgtggttc ctcagtggtg cctgcaaccc       60

ctggttcacc tccttccagg ttctggctcc ttccagccat ggctctcaga gtccttctgt      120

taacagcctt gaccttatgt catgggttca acttggacac tgaaaacgca atgaccttcc      180

aagagaacgc aaggggcttc gggcagagcg tggtccagct tcagggatcc agggtggtgg      240

ttggagcccc ccaggagata gtggctgcca accaaagggg cagcctctac cagtgcgact      300

acagcacagg ctcatgcgag cccatccgcc tgcaggtccc cgtggaggcc gtgaacatgt      360

ccctgggcct gtccctggca gccaccacca gcccccctca gctgctggcc tgtggtccca      420

ccgtgcacca gacttgcagt gagaacacgt atgtgaaagg gctctgcttc ctgtttggat      480

ccaacctacg gcagcagccc cagaagttcc cagaggccct ccgagggtgt cctcaagagg      540

atagtgacat tgccttcttg attgatggct ctggtagcat catcccacat gactttcggc      600

ggatgaagga gtttgtctca actgtgatgg agcaattaaa aaagtccaaa accttgttct      660

ctttgatgca gtactctgaa gaattccgga ttcactttac cttcaaagag ttccagaaca      720

accctaaccc aagatcactg gtgaagccaa taacgcagct gcttgggcgg acacacacgg      780

ccacgggcat ccgcaaagtg gtacgagagc tgtttaacat caccaacgga gcccgaaaga      840

atgcctttaa gatcctagtt gtcatcacgg atggagaaaa gtttggcgat cccttgggat      900

atgaggatgt catccctgag gcagacagag agggagtcat tcgctacgtc attggggtgg      960

gagatgcctt ccgcagtgag aaatcccgcc aagagcttaa taccatcgca tccaagccgc     1020

ctcgtgatca cgtgttccag gtgaataact ttgaggctct gaagaccatt cagaaccagc     1080

ttcgggagaa gatctttgcg atcgagggta ctcagacagg aagtagcagc tcctttgagc     1140

atgagatgtc tcaggaaggc ttcagcgctg ccatcacctc taatggcccc ttgctgagca     1200

ctgtggggag ctatgactgg gctggtggag tctttctata tacatcaaag gagaaaagca     1260

ccttcatcaa catgaccaga gtggattcag acatgaatga tgcttacttg ggttatgctg     1320

ccgccatcat cttacggaac cgggtgcaaa gcctggttct gggggcacct cgatatcagc     1380

acatcggcct ggtagcgatg ttcaggcaga acactggcat gtgggagtcc aacgctaatg     1440

tcaagggcac ccagatcggc gcctacttcg gggcctccct ctgctccgtg gacgtggaca     1500

gcaacggcag caccgacctg gtcctcatcg gggcccccca ttactacgag cagacccgag     1560

ggggccaggt gtccgtgtgc cccttgccca gggggagggc tcggtggcag tgtgatgctg     1620

ttctctacgg ggagcagggc caaccctggg gccgctttgg ggcagcccta acagtgctgg     1680

gggacgtaaa tggggacaag ctgacggacg tggccattgg ggccccagga gaggaggaca     1740

accggggtgc tgtttacctg tttcacggaa cctcaggatc tggcatcagc ccctcccata     1800

gccagcggat agcaggctcc aagctctctc ccaggctcca gtattttggt cagtcactga     1860

gtgggggcca ggacctcaca atggatggac tggtagacct gactgtagga gcccaggggc     1920

acgtgctgct gctcaggtcc cagccagtac tgagagtcaa ggcaatcatg gagttcaatc     1980

ccagggaagt ggcaaggaat gtatttgagt gtaatgatca ggtggtgaaa ggcaaggaag     2040

ccggagaggt cagagtctgc ctccatgtcc agaagagcac acgggatcgg ctaagagaag     2100

gacagatcca gagtgttgtg acttatgacc tggctctgga ctccggccgc ccacattccc     2160

gcgccgtctt caatgagaca aagaacagca cacgcagaca gacacaggtc ttggggctga     2220

cccagacttg tgagaccctg aaactacagt tgccgaattg catcgaggac ccagtgagcc     2280

ccattgtgct gcgcctgaac ttctctctgg tgggaacgcc attgtctgct ttcgggaacc     2340

tccggccagt gctggcggag gatgctcaga gactcttcac agccttgttt ccctttgaga     2400

agaattgtgg caatgacaac atctgccagg atgacctcag catcaccttc agtttcatga     2460

gcctggactg cctcgtggtg ggtgggcccc gggagttcaa cgtgacagtg actgtgagaa     2520

atgatggtga ggactcctac aggacacagg tcaccttctt cttcccgctt gacctgtcct     2580

accggaaggt gtccacgctc cagaaccagc gctcacagcg atcctggcgc ctggcctgtg     2640

agtctgcctc ctccaccgaa gtgtctgggg ccttgaagag caccagctgc agcataaacc     2700

accccatctt cccggaaaac tcagaggtca cctttaatat cacgtttgat gtagactcta     2760

aggcttccct tggaaacaaa ctgctcctca aggccaatgt gaccagtgag aacaacatgc     2820

ccagaaccaa caaaaccgaa ttccaactgg agctgccggt gaaatatgct gtctacatgg     2880

tggtcaccag ccatggggtc tccactaaat atctcaactt cacggcctca gagaatacca     2940

gtcgggtcat gcagcatcaa tatcaggtca gcaacctggg gcagaggagc ctccccatca     3000

gcctggtgtt cttggtgccc gtccggctga accagactgt catatgggac cgcccccagg     3060

tcaccttctc cgagaacctc tcgagtacgt gccacaccaa ggagcgcttg ccctctcact     3120

ccgactttct ggctgagctt cggaaggccc ccgtggtgaa ctgctccatc gctgtctgcc     3180

agagaatcca gtgtgacatc ccgttctttg gcatccagga agaattcaat gctaccctca     3240

aaggcaacct ctcgtttgac tggtacatca agacctcgca taaccacctc ctgatcgtga     3300

gcacagctga gatcttgttt aacgattccg tgttcaccct gctgccggga cagggggcgt     3360

ttgtgaggtc ccagacggag accaaagtgg agccgttcga ggtccccaac cccctgccgc     3420

tcatcgtggg cagctctgtc gggggactgc tgctcctggc cctcatcacc gccgcgctgt     3480

acaagctcgg cttcttcaag cggcaataca aggacatgat gagtgaaggg ggtcccccgg     3540

gggccgaacc ccagtagcgg ctccttcccg acagagctgc ctctcggtgg ccagcaggac     3600

tctgcccaga ccacacgtag cccccaggct gctggacacg tcggacagcg aagtatcccc     3660

gacaggacgg gcttgggctt ccatttgtgt gtgtgcaagt gtgtatgtgc gtgtgtgcaa     3720

gtgtctgtgt gcaagtgtgt gcacatgtgt gcgtgtgcgt gcatgtgcac ttgcacgccc     3780

atgtgtgagt gtgtgcaagt atgtgagtgt gtccaagtgt gtgtgcgtgt gtccatgtgt     3840

gtgcaagtgt gtgcatgtgt gcgagtgtgt gcatgtgtgt gctcaggggc gtgtggctca     3900

cgtgtgtgac tcagatgtct ctggcgtgtg ggtaggtgac ggcagcgtag cctctccggc     3960

agaagggaac tgcctgggct cccttgtgcg tgggtgaagc cgctgctggg ttttcctccg     4020

ggagagggga cggtcaatcc tgtgggtgaa gacagaggga aacacagcag cttctctcca     4080

ctgaaagaag tgggacttcc cgtcgcctgc gagcctgcgg cctgctggag cctgcgcagc     4140

ttggatggag actccatgag aagccgtggg tggaaccagg aacctcctcc acaccagcgc     4200

tgatgcccaa taaagatgcc cactgaggaa tgatgaagct tcctttctgg attcatttat     4260

tatttcaatg tgactttaat tttttggatg gataagcttg tctatggtac aaaaatcaca     4320

aggcattcaa gtgtacagtg aaaagtctcc ctttccagat attcaagtca cctccttaaa     4380

ggtagtcaag attgtgtttt gaggtttcct tcagacagat tccaggcgat gtgcaagtgt     4440

atgcacgtgt gcacacacac cacacataca cacacacaag cttttttaca caaatggtag     4500

catactttat attggtctgt atcttgcttt ttttcaccaa tatttctcag acatcggttc     4560

atattaagac ataaattact ttttcattct tttataccgc tgcatagtat tccattgtgt     4620

gagtgtacca taatgtattt aaccagtctt cttttgatat actattttca ttctcttgtt     4680

attgcatcaa tgctgagtta ataaatcaaa tatatgtcat ttttgcatat atgtaaggat     4740

aa                                                                    4742


<210>  271
<211>  1152
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha M (complement component 3 receptor 3
       subunit) (ITGAM), transcript variant 2, polypeptide GenBank 
       Accession No: NP_000623.2 GI:88501734

<400>  271

Met Ala Leu Arg Val Leu Leu Leu Thr Ala Leu Thr Leu Cys His Gly 
1               5                   10                  15      


Phe Asn Leu Asp Thr Glu Asn Ala Met Thr Phe Gln Glu Asn Ala Arg 
            20                  25                  30          


Gly Phe Gly Gln Ser Val Val Gln Leu Gln Gly Ser Arg Val Val Val 
        35                  40                  45              


Gly Ala Pro Gln Glu Ile Val Ala Ala Asn Gln Arg Gly Ser Leu Tyr 
    50                  55                  60                  


Gln Cys Asp Tyr Ser Thr Gly Ser Cys Glu Pro Ile Arg Leu Gln Val 
65                  70                  75                  80  


Pro Val Glu Ala Val Asn Met Ser Leu Gly Leu Ser Leu Ala Ala Thr 
                85                  90                  95      


Thr Ser Pro Pro Gln Leu Leu Ala Cys Gly Pro Thr Val His Gln Thr 
            100                 105                 110         


Cys Ser Glu Asn Thr Tyr Val Lys Gly Leu Cys Phe Leu Phe Gly Ser 
        115                 120                 125             


Asn Leu Arg Gln Gln Pro Gln Lys Phe Pro Glu Ala Leu Arg Gly Cys 
    130                 135                 140                 


Pro Gln Glu Asp Ser Asp Ile Ala Phe Leu Ile Asp Gly Ser Gly Ser 
145                 150                 155                 160 


Ile Ile Pro His Asp Phe Arg Arg Met Lys Glu Phe Val Ser Thr Val 
                165                 170                 175     


Met Glu Gln Leu Lys Lys Ser Lys Thr Leu Phe Ser Leu Met Gln Tyr 
            180                 185                 190         


Ser Glu Glu Phe Arg Ile His Phe Thr Phe Lys Glu Phe Gln Asn Asn 
        195                 200                 205             


Pro Asn Pro Arg Ser Leu Val Lys Pro Ile Thr Gln Leu Leu Gly Arg 
    210                 215                 220                 


Thr His Thr Ala Thr Gly Ile Arg Lys Val Val Arg Glu Leu Phe Asn 
225                 230                 235                 240 


Ile Thr Asn Gly Ala Arg Lys Asn Ala Phe Lys Ile Leu Val Val Ile 
                245                 250                 255     


Thr Asp Gly Glu Lys Phe Gly Asp Pro Leu Gly Tyr Glu Asp Val Ile 
            260                 265                 270         


Pro Glu Ala Asp Arg Glu Gly Val Ile Arg Tyr Val Ile Gly Val Gly 
        275                 280                 285             


Asp Ala Phe Arg Ser Glu Lys Ser Arg Gln Glu Leu Asn Thr Ile Ala 
    290                 295                 300                 


Ser Lys Pro Pro Arg Asp His Val Phe Gln Val Asn Asn Phe Glu Ala 
305                 310                 315                 320 


Leu Lys Thr Ile Gln Asn Gln Leu Arg Glu Lys Ile Phe Ala Ile Glu 
                325                 330                 335     


Gly Thr Gln Thr Gly Ser Ser Ser Ser Phe Glu His Glu Met Ser Gln 
            340                 345                 350         


Glu Gly Phe Ser Ala Ala Ile Thr Ser Asn Gly Pro Leu Leu Ser Thr 
        355                 360                 365             


Val Gly Ser Tyr Asp Trp Ala Gly Gly Val Phe Leu Tyr Thr Ser Lys 
    370                 375                 380                 


Glu Lys Ser Thr Phe Ile Asn Met Thr Arg Val Asp Ser Asp Met Asn 
385                 390                 395                 400 


Asp Ala Tyr Leu Gly Tyr Ala Ala Ala Ile Ile Leu Arg Asn Arg Val 
                405                 410                 415     


Gln Ser Leu Val Leu Gly Ala Pro Arg Tyr Gln His Ile Gly Leu Val 
            420                 425                 430         


Ala Met Phe Arg Gln Asn Thr Gly Met Trp Glu Ser Asn Ala Asn Val 
        435                 440                 445             


Lys Gly Thr Gln Ile Gly Ala Tyr Phe Gly Ala Ser Leu Cys Ser Val 
    450                 455                 460                 


Asp Val Asp Ser Asn Gly Ser Thr Asp Leu Val Leu Ile Gly Ala Pro 
465                 470                 475                 480 


His Tyr Tyr Glu Gln Thr Arg Gly Gly Gln Val Ser Val Cys Pro Leu 
                485                 490                 495     


Pro Arg Gly Arg Ala Arg Trp Gln Cys Asp Ala Val Leu Tyr Gly Glu 
            500                 505                 510         


Gln Gly Gln Pro Trp Gly Arg Phe Gly Ala Ala Leu Thr Val Leu Gly 
        515                 520                 525             


Asp Val Asn Gly Asp Lys Leu Thr Asp Val Ala Ile Gly Ala Pro Gly 
    530                 535                 540                 


Glu Glu Asp Asn Arg Gly Ala Val Tyr Leu Phe His Gly Thr Ser Gly 
545                 550                 555                 560 


Ser Gly Ile Ser Pro Ser His Ser Gln Arg Ile Ala Gly Ser Lys Leu 
                565                 570                 575     


Ser Pro Arg Leu Gln Tyr Phe Gly Gln Ser Leu Ser Gly Gly Gln Asp 
            580                 585                 590         


Leu Thr Met Asp Gly Leu Val Asp Leu Thr Val Gly Ala Gln Gly His 
        595                 600                 605             


Val Leu Leu Leu Arg Ser Gln Pro Val Leu Arg Val Lys Ala Ile Met 
    610                 615                 620                 


Glu Phe Asn Pro Arg Glu Val Ala Arg Asn Val Phe Glu Cys Asn Asp 
625                 630                 635                 640 


Gln Val Val Lys Gly Lys Glu Ala Gly Glu Val Arg Val Cys Leu His 
                645                 650                 655     


Val Gln Lys Ser Thr Arg Asp Arg Leu Arg Glu Gly Gln Ile Gln Ser 
            660                 665                 670         


Val Val Thr Tyr Asp Leu Ala Leu Asp Ser Gly Arg Pro His Ser Arg 
        675                 680                 685             


Ala Val Phe Asn Glu Thr Lys Asn Ser Thr Arg Arg Gln Thr Gln Val 
    690                 695                 700                 


Leu Gly Leu Thr Gln Thr Cys Glu Thr Leu Lys Leu Gln Leu Pro Asn 
705                 710                 715                 720 


Cys Ile Glu Asp Pro Val Ser Pro Ile Val Leu Arg Leu Asn Phe Ser 
                725                 730                 735     


Leu Val Gly Thr Pro Leu Ser Ala Phe Gly Asn Leu Arg Pro Val Leu 
            740                 745                 750         


Ala Glu Asp Ala Gln Arg Leu Phe Thr Ala Leu Phe Pro Phe Glu Lys 
        755                 760                 765             


Asn Cys Gly Asn Asp Asn Ile Cys Gln Asp Asp Leu Ser Ile Thr Phe 
    770                 775                 780                 


Ser Phe Met Ser Leu Asp Cys Leu Val Val Gly Gly Pro Arg Glu Phe 
785                 790                 795                 800 


Asn Val Thr Val Thr Val Arg Asn Asp Gly Glu Asp Ser Tyr Arg Thr 
                805                 810                 815     


Gln Val Thr Phe Phe Phe Pro Leu Asp Leu Ser Tyr Arg Lys Val Ser 
            820                 825                 830         


Thr Leu Gln Asn Gln Arg Ser Gln Arg Ser Trp Arg Leu Ala Cys Glu 
        835                 840                 845             


Ser Ala Ser Ser Thr Glu Val Ser Gly Ala Leu Lys Ser Thr Ser Cys 
    850                 855                 860                 


Ser Ile Asn His Pro Ile Phe Pro Glu Asn Ser Glu Val Thr Phe Asn 
865                 870                 875                 880 


Ile Thr Phe Asp Val Asp Ser Lys Ala Ser Leu Gly Asn Lys Leu Leu 
                885                 890                 895     


Leu Lys Ala Asn Val Thr Ser Glu Asn Asn Met Pro Arg Thr Asn Lys 
            900                 905                 910         


Thr Glu Phe Gln Leu Glu Leu Pro Val Lys Tyr Ala Val Tyr Met Val 
        915                 920                 925             


Val Thr Ser His Gly Val Ser Thr Lys Tyr Leu Asn Phe Thr Ala Ser 
    930                 935                 940                 


Glu Asn Thr Ser Arg Val Met Gln His Gln Tyr Gln Val Ser Asn Leu 
945                 950                 955                 960 


Gly Gln Arg Ser Leu Pro Ile Ser Leu Val Phe Leu Val Pro Val Arg 
                965                 970                 975     


Leu Asn Gln Thr Val Ile Trp Asp Arg Pro Gln Val Thr Phe Ser Glu 
            980                 985                 990         


Asn Leu Ser Ser Thr Cys His Thr  Lys Glu Arg Leu Pro  Ser His Ser 
        995                 1000                 1005             


Asp Phe  Leu Ala Glu Leu Arg  Lys Ala Pro Val Val  Asn Cys Ser 
    1010                 1015                 1020             


Ile Ala  Val Cys Gln Arg Ile  Gln Cys Asp Ile Pro  Phe Phe Gly 
    1025                 1030                 1035             


Ile Gln  Glu Glu Phe Asn Ala  Thr Leu Lys Gly Asn  Leu Ser Phe 
    1040                 1045                 1050             


Asp Trp  Tyr Ile Lys Thr Ser  His Asn His Leu Leu  Ile Val Ser 
    1055                 1060                 1065             


Thr Ala  Glu Ile Leu Phe Asn  Asp Ser Val Phe Thr  Leu Leu Pro 
    1070                 1075                 1080             


Gly Gln  Gly Ala Phe Val Arg  Ser Gln Thr Glu Thr  Lys Val Glu 
    1085                 1090                 1095             


Pro Phe  Glu Val Pro Asn Pro  Leu Pro Leu Ile Val  Gly Ser Ser 
    1100                 1105                 1110             


Val Gly  Gly Leu Leu Leu Leu  Ala Leu Ile Thr Ala  Ala Leu Tyr 
    1115                 1120                 1125             


Lys Leu  Gly Phe Phe Lys Arg  Gln Tyr Lys Asp Met  Met Ser Glu 
    1130                 1135                 1140             


Gly Gly  Pro Pro Gly Ala Glu  Pro Gln 
    1145                 1150         


<210>  272
<211>  7053
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha V (ITGAV), transcript variant 1, 
       mRNA GenBank Accession No. NM_002210.4  GI:495529074

<400>  272
agcctcagac gctgcgtgga gcggcggagc cggagggaag caaaggaccg tctgcgctgc       60

tgtccccgcc ccgcgcgctc tgcgcccctc gtccctggcg gtcgctccga agctcagccc      120

tcttgcctgc cccggagctg tcccgggcta gccgagaaga gagcggccgg caagtttggg      180

cgcgcgcagg cggcgggccg cgggcactgg gcgcctcgct ggggcggggg gaggtggcta      240

ccgctcccgg cttggcgtcc cgcgcgcact tcggcgatgg cttttccgcc gcggcgacgg      300

ctgcgcctcg gtccccgcgg cctcccgctt cttctctcgg gactcctgct acctctgtgc      360

cgcgccttca acctagacgt ggacagtcct gccgagtact ctggccccga gggaagttac      420

ttcggcttcg ccgtggattt cttcgtgccc agcgcgtctt cccggatgtt tcttctcgtg      480

ggagctccca aagcaaacac cacccagcct gggattgtgg aaggagggca ggtcctcaaa      540

tgtgactggt cttctacccg ccggtgccag ccaattgaat ttgatgcaac aggcaataga      600

gattatgcca aggatgatcc attggaattt aagtcccatc agtggtttgg agcatctgtg      660

aggtcgaaac aggataaaat tttggcctgt gccccattgt accattggag aactgagatg      720

aaacaggagc gagagcctgt tggaacatgc tttcttcaag atggaacaaa gactgttgag      780

tatgctccat gtagatcaca agatattgat gctgatggac agggattttg tcaaggagga      840

ttcagcattg attttactaa agctgacaga gtacttcttg gtggtcctgg tagcttttat      900

tggcaaggtc agcttatttc ggatcaagtg gcagaaatcg tatctaaata cgaccccaat      960

gtttacagca tcaagtataa taaccaatta gcaactcgga ctgcacaagc tatttttgat     1020

gacagctatt tgggttattc tgtggctgtc ggagatttca atggtgatgg catagatgac     1080

tttgtttcag gagttccaag agcagcaagg actttgggaa tggtttatat ttatgatggg     1140

aagaacatgt cctccttata caattttact ggcgagcaga tggctgcata tttcggattt     1200

tctgtagctg ccactgacat taatggagat gattatgcag atgtgtttat tggagcacct     1260

ctcttcatgg atcgtggctc tgatggcaaa ctccaagagg tggggcaggt ctcagtgtct     1320

ctacagagag cttcaggaga cttccagacg acaaagctga atggatttga ggtctttgca     1380

cggtttggca gtgccatagc tcctttggga gatctggacc aggatggttt caatgatatt     1440

gcaattgctg ctccatatgg gggtgaagat aaaaaaggaa ttgtttatat cttcaatgga     1500

agatcaacag gcttgaacgc agtcccatct caaatccttg aagggcagtg ggctgctcga     1560

agcatgccac caagctttgg ctattcaatg aaaggagcca cagatataga caaaaatgga     1620

tatccagact taattgtagg agcttttggt gtagatcgag ctatcttata cagggccaga     1680

ccagttatca ctgtaaatgc tggtcttgaa gtgtacccta gcattttaaa tcaagacaat     1740

aaaacctgct cactgcctgg aacagctctc aaagtttcct gttttaatgt taggttctgc     1800

ttaaaggcag atggcaaagg agtacttccc aggaaactta atttccaggt ggaacttctt     1860

ttggataaac tcaagcaaaa gggagcaatt cgacgagcac tgtttctcta cagcaggtcc     1920

ccaagtcact ccaagaacat gactatttca agggggggac tgatgcagtg tgaggaattg     1980

atagcgtatc tgcgggatga atctgaattt agagacaaac tcactccaat tactattttt     2040

atggaatatc ggttggatta tagaacagct gctgatacaa caggcttgca acccattctt     2100

aaccagttca cgcctgctaa cattagtcga caggctcaca ttctacttga ctgtggtgaa     2160

gacaatgtct gtaaacccaa gctggaagtt tctgtagata gtgatcaaaa gaagatctat     2220

attggggatg acaaccctct gacattgatt gttaaggctc agaatcaagg agaaggtgcc     2280

tacgaagctg agctcatcgt ttccattcca ctgcaggctg atttcatcgg ggttgtccga     2340

aacaatgaag ccttagcaag actttcctgt gcatttaaga cagaaaacca aactcgccag     2400

gtggtatgtg accttggaaa cccaatgaag gctggaactc aactcttagc tggtcttcgt     2460

ttcagtgtgc accagcagtc agagatggat acttctgtga aatttgactt acaaatccaa     2520

agctcaaatc tatttgacaa agtaagccca gttgtatctc acaaagttga tcttgctgtt     2580

ttagctgcag ttgagataag aggagtctcg agtcctgatc atatctttct tccgattcca     2640

aactgggagc acaaggagaa ccctgagact gaagaagatg ttgggccagt tgttcagcac     2700

atctatgagc tgagaaacaa tggtccaagt tcattcagca aggcaatgct ccatcttcag     2760

tggccttaca aatataataa taacactctg ttgtatatcc ttcattatga tattgatgga     2820

ccaatgaact gcacttcaga tatggagatc aaccctttga gaattaagat ctcatctttg     2880

caaacaactg aaaagaatga cacggttgcc gggcaaggtg agcgggacca tctcatcact     2940

aagcgggatc ttgccctcag tgaaggagat attcacactt tgggttgtgg agttgctcag     3000

tgcttgaaga ttgtctgcca agttgggaga ttagacagag gaaagagtgc aatcttgtac     3060

gtaaagtcat tactgtggac tgagactttt atgaataaag aaaatcagaa tcattcctat     3120

tctctgaagt cgtctgcttc atttaatgtc atagagtttc cttataagaa tcttccaatt     3180

gaggatatca ccaactccac attggttacc actaatgtca cctggggcat tcagccagcg     3240

cccatgcctg tgcctgtgtg ggtgatcatt ttagcagttc tagcaggatt gttgctactg     3300

gctgttttgg tatttgtaat gtacaggatg ggctttttta aacgggtccg gccacctcaa     3360

gaagaacaag aaagggagca gcttcaacct catgaaaatg gtgaaggaaa ctcagaaact     3420

taactgcagt ttttaagtta tgctacatct tgacccacta gaattagcaa ctttattata     3480

gatttaaact ttcttcatga ggagtaaaaa tccaaggctt tactgctgat agtgctaatt     3540

ggcattaacc acaaaatgag aattatattt gtcaaccttc tccttataaa taagttcaga     3600

catacattta ataacatagg gtgacttgtg tttttaggta tttaaataat aaaatttcaa     3660

gggatagttt ttattcaatg tatataagac aggtagtgcc tgatttacta ctttatataa     3720

aatagtacct ccttcagtta ctgtttctga tttaatgtac ggaactttat ttgttgttgt     3780

tgttgttgtt gttgttgttg ttttaaagca gtccaaattt ggaccttagc aatcatgtct     3840

tttgtatagg tacttaatgt taatacatat tacactacag tttacttttc agaatactaa     3900

agactttata actgcatgaa cttggatttt tttaatcact catatggtag aattttataa     3960

acacatacat gataccatcc aaattcttgc ttttaataac aaaggtacaa tattttgttt     4020

tagtatgaaa atctggtaga tcctattaca cttctgttta tattaaatcc acaatatttt     4080

attacatttt taacttgtat aaattttagg tcaaatcctt caagccaacc tatactaaaa     4140

attagttcca taatcacaaa tggctctttt gtgtaattgt ttaatttcac ctgaatatca     4200

taatgcttaa agccatatgg agttggaaat tatttccaaa gcatatttat tccattgttt     4260

tagtctggct atttacagta taaaaaaagc atttttatta aaatactgtg tagttctttg     4320

agatagttgc ttatgcatat agtaagtatt acattcttag agtagagcag agtttttagt     4380

tagtattaat ttattttcct ccattcatgt acttttcctt atatttccaa aactgttact     4440

gagaatgggt caagatcagt gagaaatctt tacagttgac aggaacctgg accccttacc     4500

ccaactttat gagtaatgct tggaataaaa actcttaagg caactcactg atttacttct     4560

agcaatagca tgatgttaca ggaatattac ctctgtttaa gcaaggtaat gtgtaaaatc     4620

agtctcggct gtcagaataa cttctaaaag gtatttttat aagcagttca agttactgaa     4680

aaccttttaa acctttctga agttcgttag tataaattac ttttctagga ttattaataa     4740

aagccacata ggtggcaagt tgtagtttta tatggctctg tagagtggtg aaccttctag     4800

aggaatatat gatttattca cagttcctca aggcctgggg atgatgatca gttataccta     4860

tttttgtgca attacatcat gttgtacatt agaaatggag agtttaatag ctctttaact     4920

gctgtcctca ttaggtaatg ataaatattt cccttaaata attgactatt ttgctgtgtt     4980

ttaaaaatga ttgaaattta tcttgccata tctcataatt tcatgcacaa gttgactgag     5040

ctaatcttga gaatatattc gtaaaatagg agcacattta gttgaggtat acaaggtagg     5100

actctagaca aaaccttcta ttttagcttt agtgaatttc aaaagtaatg ggtcttggag     5160

tatagatttt tattagtagc ttgaaagagc ttaatcatat gcagtaagta tttttattac     5220

caataaattt aaaatttttt aagaaaaata tttttatcct agggccaagt gttgcctgcc     5280

accaatcagt aagttagtct ataacaaatt ttaccctaac agttttacca cctagtaaca     5340

gtcatttctg aaaatatgtt ggatagaaag tcactctttg gcaaaagtgt tagaatttgc     5400

ttttgtgcca tctattcctt ttatggcatc tatcttgaaa gtaatcttgt attggagatt     5460

gaaagatgct gtaatttaga aattaacatg atatcttaaa ttacctttat gaaatatagt     5520

tttgtataat agcatagatt ttccttcaaa aaatgaacat ttatatatct acaaaaatat     5580

ggagaagagt aatttgaaag cctactttct gaagaaaatg gtgggatttt tttttatcat     5640

gattaaatat caaaaaattg ccctatgaaa actttaaatc tctaaaacat ttgaaatact     5700

accatatttg tgatttattg agaataaaaa tccattttga aatgtaaaat ttttatgatc     5760

tgattcagtt ttaagaaaac atgaatgaac tagaagatat taaaaacatt tgacattggt     5820

aagaaatatt gatactgata ttgattttta tataggtatt tatttcagaa ttgatatttt     5880

gagaaaaata catgtgagtc attttttctg tttctctttt ctcttaacga ttatcactgt     5940

aattctgaat ctgaaaggta aaacaattag tcaaaatatt attgccatca ttctacctgt     6000

gttatgaaac tacttattca tagttaattc tcattaacac ttacatttcc ataaagaaaa     6060

ctcaagtatt aataaaagag actttactgg cttaagaggg ctgtgaaaga tttttgatag     6120

tgaatcatga ccctaaggga gagatttgtg tgataaaagt attgtatata atagatcagc     6180

gatttttgta aggcaaacag aatttgtaag ttggcagatc ttcctaagtt gcaaaatgta     6240

atgatgagct tggtggagaa gaatgagtcg ttcttggaat acctatgtgc agccactacc     6300

catctcaatg tcaccttgtt tgcattcttg gatagcttgt atatgtagta gtttgatgaa     6360

taatttaaag aaaaacacct aaaatttgaa aaatgattgt aggatcaaaa aaggcagatg     6420

aaattactta atactcagtg ttttggagag tattcctttt agtttgttgg ttggctggtt     6480

tgaacgatag aaatatgcag catgcaatat atgcttatat ttcattttaa tttctgatat     6540

ataatgaact tcttgggaga ggtactgaat ctttgatgtt ttttgtcatt gttctcaagt     6600

gcaatataac aatgtaacca aatctagata atttcaaagt tgtcattaat ttagtaagcc     6660

taatataaac aaatatttgt attatttttg ttagcaggaa agagtgatta agtgaggtta     6720

tttaccccta aatggtccat tctgcattgt atttcaggct ggaaatgaat tattctttac     6780

cagttttgaa acactttgaa atatcctaag gtaacttgga agctgtgtag tatatcaaat     6840

taatttgcta cctaataaca tagaaagtaa atatctttgt ggtcacccac attgggtgag     6900

acagaaaatg aatctgttct aaaatttgta atttgctaac ttgatttgag ttagtgaaaa     6960

ctggtacagt gttctgcttg atttacaaca tgtaacttgt gactgtacaa taaacataag     7020

catatggtac cacaaaaaaa aaaaaaaaaa aaa                                  7053


<210>  273
<211>  1048
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha V (ITGAV), transcript variant 1, 
       polypeptide GenBank Accession No. NP_002201.1 GI:4504763

<400>  273

Met Ala Phe Pro Pro Arg Arg Arg Leu Arg Leu Gly Pro Arg Gly Leu 
1               5                   10                  15      


Pro Leu Leu Leu Ser Gly Leu Leu Leu Pro Leu Cys Arg Ala Phe Asn 
            20                  25                  30          


Leu Asp Val Asp Ser Pro Ala Glu Tyr Ser Gly Pro Glu Gly Ser Tyr 
        35                  40                  45              


Phe Gly Phe Ala Val Asp Phe Phe Val Pro Ser Ala Ser Ser Arg Met 
    50                  55                  60                  


Phe Leu Leu Val Gly Ala Pro Lys Ala Asn Thr Thr Gln Pro Gly Ile 
65                  70                  75                  80  


Val Glu Gly Gly Gln Val Leu Lys Cys Asp Trp Ser Ser Thr Arg Arg 
                85                  90                  95      


Cys Gln Pro Ile Glu Phe Asp Ala Thr Gly Asn Arg Asp Tyr Ala Lys 
            100                 105                 110         


Asp Asp Pro Leu Glu Phe Lys Ser His Gln Trp Phe Gly Ala Ser Val 
        115                 120                 125             


Arg Ser Lys Gln Asp Lys Ile Leu Ala Cys Ala Pro Leu Tyr His Trp 
    130                 135                 140                 


Arg Thr Glu Met Lys Gln Glu Arg Glu Pro Val Gly Thr Cys Phe Leu 
145                 150                 155                 160 


Gln Asp Gly Thr Lys Thr Val Glu Tyr Ala Pro Cys Arg Ser Gln Asp 
                165                 170                 175     


Ile Asp Ala Asp Gly Gln Gly Phe Cys Gln Gly Gly Phe Ser Ile Asp 
            180                 185                 190         


Phe Thr Lys Ala Asp Arg Val Leu Leu Gly Gly Pro Gly Ser Phe Tyr 
        195                 200                 205             


Trp Gln Gly Gln Leu Ile Ser Asp Gln Val Ala Glu Ile Val Ser Lys 
    210                 215                 220                 


Tyr Asp Pro Asn Val Tyr Ser Ile Lys Tyr Asn Asn Gln Leu Ala Thr 
225                 230                 235                 240 


Arg Thr Ala Gln Ala Ile Phe Asp Asp Ser Tyr Leu Gly Tyr Ser Val 
                245                 250                 255     


Ala Val Gly Asp Phe Asn Gly Asp Gly Ile Asp Asp Phe Val Ser Gly 
            260                 265                 270         


Val Pro Arg Ala Ala Arg Thr Leu Gly Met Val Tyr Ile Tyr Asp Gly 
        275                 280                 285             


Lys Asn Met Ser Ser Leu Tyr Asn Phe Thr Gly Glu Gln Met Ala Ala 
    290                 295                 300                 


Tyr Phe Gly Phe Ser Val Ala Ala Thr Asp Ile Asn Gly Asp Asp Tyr 
305                 310                 315                 320 


Ala Asp Val Phe Ile Gly Ala Pro Leu Phe Met Asp Arg Gly Ser Asp 
                325                 330                 335     


Gly Lys Leu Gln Glu Val Gly Gln Val Ser Val Ser Leu Gln Arg Ala 
            340                 345                 350         


Ser Gly Asp Phe Gln Thr Thr Lys Leu Asn Gly Phe Glu Val Phe Ala 
        355                 360                 365             


Arg Phe Gly Ser Ala Ile Ala Pro Leu Gly Asp Leu Asp Gln Asp Gly 
    370                 375                 380                 


Phe Asn Asp Ile Ala Ile Ala Ala Pro Tyr Gly Gly Glu Asp Lys Lys 
385                 390                 395                 400 


Gly Ile Val Tyr Ile Phe Asn Gly Arg Ser Thr Gly Leu Asn Ala Val 
                405                 410                 415     


Pro Ser Gln Ile Leu Glu Gly Gln Trp Ala Ala Arg Ser Met Pro Pro 
            420                 425                 430         


Ser Phe Gly Tyr Ser Met Lys Gly Ala Thr Asp Ile Asp Lys Asn Gly 
        435                 440                 445             


Tyr Pro Asp Leu Ile Val Gly Ala Phe Gly Val Asp Arg Ala Ile Leu 
    450                 455                 460                 


Tyr Arg Ala Arg Pro Val Ile Thr Val Asn Ala Gly Leu Glu Val Tyr 
465                 470                 475                 480 


Pro Ser Ile Leu Asn Gln Asp Asn Lys Thr Cys Ser Leu Pro Gly Thr 
                485                 490                 495     


Ala Leu Lys Val Ser Cys Phe Asn Val Arg Phe Cys Leu Lys Ala Asp 
            500                 505                 510         


Gly Lys Gly Val Leu Pro Arg Lys Leu Asn Phe Gln Val Glu Leu Leu 
        515                 520                 525             


Leu Asp Lys Leu Lys Gln Lys Gly Ala Ile Arg Arg Ala Leu Phe Leu 
    530                 535                 540                 


Tyr Ser Arg Ser Pro Ser His Ser Lys Asn Met Thr Ile Ser Arg Gly 
545                 550                 555                 560 


Gly Leu Met Gln Cys Glu Glu Leu Ile Ala Tyr Leu Arg Asp Glu Ser 
                565                 570                 575     


Glu Phe Arg Asp Lys Leu Thr Pro Ile Thr Ile Phe Met Glu Tyr Arg 
            580                 585                 590         


Leu Asp Tyr Arg Thr Ala Ala Asp Thr Thr Gly Leu Gln Pro Ile Leu 
        595                 600                 605             


Asn Gln Phe Thr Pro Ala Asn Ile Ser Arg Gln Ala His Ile Leu Leu 
    610                 615                 620                 


Asp Cys Gly Glu Asp Asn Val Cys Lys Pro Lys Leu Glu Val Ser Val 
625                 630                 635                 640 


Asp Ser Asp Gln Lys Lys Ile Tyr Ile Gly Asp Asp Asn Pro Leu Thr 
                645                 650                 655     


Leu Ile Val Lys Ala Gln Asn Gln Gly Glu Gly Ala Tyr Glu Ala Glu 
            660                 665                 670         


Leu Ile Val Ser Ile Pro Leu Gln Ala Asp Phe Ile Gly Val Val Arg 
        675                 680                 685             


Asn Asn Glu Ala Leu Ala Arg Leu Ser Cys Ala Phe Lys Thr Glu Asn 
    690                 695                 700                 


Gln Thr Arg Gln Val Val Cys Asp Leu Gly Asn Pro Met Lys Ala Gly 
705                 710                 715                 720 


Thr Gln Leu Leu Ala Gly Leu Arg Phe Ser Val His Gln Gln Ser Glu 
                725                 730                 735     


Met Asp Thr Ser Val Lys Phe Asp Leu Gln Ile Gln Ser Ser Asn Leu 
            740                 745                 750         


Phe Asp Lys Val Ser Pro Val Val Ser His Lys Val Asp Leu Ala Val 
        755                 760                 765             


Leu Ala Ala Val Glu Ile Arg Gly Val Ser Ser Pro Asp His Ile Phe 
    770                 775                 780                 


Leu Pro Ile Pro Asn Trp Glu His Lys Glu Asn Pro Glu Thr Glu Glu 
785                 790                 795                 800 


Asp Val Gly Pro Val Val Gln His Ile Tyr Glu Leu Arg Asn Asn Gly 
                805                 810                 815     


Pro Ser Ser Phe Ser Lys Ala Met Leu His Leu Gln Trp Pro Tyr Lys 
            820                 825                 830         


Tyr Asn Asn Asn Thr Leu Leu Tyr Ile Leu His Tyr Asp Ile Asp Gly 
        835                 840                 845             


Pro Met Asn Cys Thr Ser Asp Met Glu Ile Asn Pro Leu Arg Ile Lys 
    850                 855                 860                 


Ile Ser Ser Leu Gln Thr Thr Glu Lys Asn Asp Thr Val Ala Gly Gln 
865                 870                 875                 880 


Gly Glu Arg Asp His Leu Ile Thr Lys Arg Asp Leu Ala Leu Ser Glu 
                885                 890                 895     


Gly Asp Ile His Thr Leu Gly Cys Gly Val Ala Gln Cys Leu Lys Ile 
            900                 905                 910         


Val Cys Gln Val Gly Arg Leu Asp Arg Gly Lys Ser Ala Ile Leu Tyr 
        915                 920                 925             


Val Lys Ser Leu Leu Trp Thr Glu Thr Phe Met Asn Lys Glu Asn Gln 
    930                 935                 940                 


Asn His Ser Tyr Ser Leu Lys Ser Ser Ala Ser Phe Asn Val Ile Glu 
945                 950                 955                 960 


Phe Pro Tyr Lys Asn Leu Pro Ile Glu Asp Ile Thr Asn Ser Thr Leu 
                965                 970                 975     


Val Thr Thr Asn Val Thr Trp Gly Ile Gln Pro Ala Pro Met Pro Val 
            980                 985                 990         


Pro Val Trp Val Ile Ile Leu Ala  Val Leu Ala Gly Leu  Leu Leu Leu 
        995                 1000                 1005             


Ala Val  Leu Val Phe Val Met  Tyr Arg Met Gly Phe  Phe Lys Arg 
    1010                 1015                 1020             


Val Arg  Pro Pro Gln Glu Glu  Gln Glu Arg Glu Gln  Leu Gln Pro 
    1025                 1030                 1035             


His Glu  Asn Gly Glu Gly Asn  Ser Glu Thr 
    1040                 1045             


<210>  274
<211>  6769
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha V (ITGAV), transcript variant 2, 
       mRNA GenBank Accession No. NM_001144999.2  GI:495529073

<400>  274
ctttgtagct cctggagatt caattctttc ttgccagggt ctttctacct ctgcctcaca       60

tattccctcc atctcttcat tccacttccc tcatccccac cccccactcc cctccatcct      120

caatacacaa atgctcctag gcaccctcct tctgatcctg tacatcttaa tgttgtgccg      180

gatgtttctt ctcgtgggag ctcccaaagc aaacaccacc cagcctggga ttgtggaagg      240

agggcaggtc ctcaaatgtg actggtcttc tacccgccgg tgccagccaa ttgaatttga      300

tgcaacaggc aatagagatt atgccaagga tgatccattg gaatttaagt cccatcagtg      360

gtttggagca tctgtgaggt cgaaacagga taaaattttg gcctgtgccc cattgtacca      420

ttggagaact gagatgaaac aggagcgaga gcctgttgga acatgctttc ttcaagatgg      480

aacaaagact gttgagtatg ctccatgtag atcacaagat attgatgctg atggacaggg      540

attttgtcaa ggaggattca gcattgattt tactaaagct gacagagtac ttcttggtgg      600

tcctggtagc ttttattggc aaggtcagct tatttcggat caagtggcag aaatcgtatc      660

taaatacgac cccaatgttt acagcatcaa gtataataac caattagcaa ctcggactgc      720

acaagctatt tttgatgaca gctatttggg ttattctgtg gctgtcggag atttcaatgg      780

tgatggcata gatgactttg tttcaggagt tccaagagca gcaaggactt tgggaatggt      840

ttatatttat gatgggaaga acatgtcctc cttatacaat tttactggcg agcagatggc      900

tgcatatttc ggattttctg tagctgccac tgacattaat ggagatgatt atgcagatgt      960

gtttattgga gcacctctct tcatggatcg tggctctgat ggcaaactcc aagaggtggg     1020

gcaggtctca gtgtctctac agagagcttc aggagacttc cagacgacaa agctgaatgg     1080

atttgaggtc tttgcacggt ttggcagtgc catagctcct ttgggagatc tggaccagga     1140

tggtttcaat gatattgcaa ttgctgctcc atatgggggt gaagataaaa aaggaattgt     1200

ttatatcttc aatggaagat caacaggctt gaacgcagtc ccatctcaaa tccttgaagg     1260

gcagtgggct gctcgaagca tgccaccaag ctttggctat tcaatgaaag gagccacaga     1320

tatagacaaa aatggatatc cagacttaat tgtaggagct tttggtgtag atcgagctat     1380

cttatacagg gccagaccag ttatcactgt aaatgctggt cttgaagtgt accctagcat     1440

tttaaatcaa gacaataaaa cctgctcact gcctggaaca gctctcaaag tttcctgttt     1500

taatgttagg ttctgcttaa aggcagatgg caaaggagta cttcccagga aacttaattt     1560

ccaggtggaa cttcttttgg ataaactcaa gcaaaaggga gcaattcgac gagcactgtt     1620

tctctacagc aggtccccaa gtcactccaa gaacatgact atttcaaggg ggggactgat     1680

gcagtgtgag gaattgatag cgtatctgcg ggatgaatct gaatttagag acaaactcac     1740

tccaattact atttttatgg aatatcggtt ggattataga acagctgctg atacaacagg     1800

cttgcaaccc attcttaacc agttcacgcc tgctaacatt agtcgacagg ctcacattct     1860

acttgactgt ggtgaagaca atgtctgtaa acccaagctg gaagtttctg tagatagtga     1920

tcaaaagaag atctatattg gggatgacaa ccctctgaca ttgattgtta aggctcagaa     1980

tcaaggagaa ggtgcctacg aagctgagct catcgtttcc attccactgc aggctgattt     2040

catcggggtt gtccgaaaca atgaagcctt agcaagactt tcctgtgcat ttaagacaga     2100

aaaccaaact cgccaggtgg tatgtgacct tggaaaccca atgaaggctg gaactcaact     2160

cttagctggt cttcgtttca gtgtgcacca gcagtcagag atggatactt ctgtgaaatt     2220

tgacttacaa atccaaagct caaatctatt tgacaaagta agcccagttg tatctcacaa     2280

agttgatctt gctgttttag ctgcagttga gataagagga gtctcgagtc ctgatcatat     2340

ctttcttccg attccaaact gggagcacaa ggagaaccct gagactgaag aagatgttgg     2400

gccagttgtt cagcacatct atgagctgag aaacaatggt ccaagttcat tcagcaaggc     2460

aatgctccat cttcagtggc cttacaaata taataataac actctgttgt atatccttca     2520

ttatgatatt gatggaccaa tgaactgcac ttcagatatg gagatcaacc ctttgagaat     2580

taagatctca tctttgcaaa caactgaaaa gaatgacacg gttgccgggc aaggtgagcg     2640

ggaccatctc atcactaagc gggatcttgc cctcagtgaa ggagatattc acactttggg     2700

ttgtggagtt gctcagtgct tgaagattgt ctgccaagtt gggagattag acagaggaaa     2760

gagtgcaatc ttgtacgtaa agtcattact gtggactgag acttttatga ataaagaaaa     2820

tcagaatcat tcctattctc tgaagtcgtc tgcttcattt aatgtcatag agtttcctta     2880

taagaatctt ccaattgagg atatcaccaa ctccacattg gttaccacta atgtcacctg     2940

gggcattcag ccagcgccca tgcctgtgcc tgtgtgggtg atcattttag cagttctagc     3000

aggattgttg ctactggctg ttttggtatt tgtaatgtac aggatgggct tttttaaacg     3060

ggtccggcca cctcaagaag aacaagaaag ggagcagctt caacctcatg aaaatggtga     3120

aggaaactca gaaacttaac tgcagttttt aagttatgct acatcttgac ccactagaat     3180

tagcaacttt attatagatt taaactttct tcatgaggag taaaaatcca aggctttact     3240

gctgatagtg ctaattggca ttaaccacaa aatgagaatt atatttgtca accttctcct     3300

tataaataag ttcagacata catttaataa catagggtga cttgtgtttt taggtattta     3360

aataataaaa tttcaaggga tagtttttat tcaatgtata taagacaggt agtgcctgat     3420

ttactacttt atataaaata gtacctcctt cagttactgt ttctgattta atgtacggaa     3480

ctttatttgt tgttgttgtt gttgttgttg ttgttgtttt aaagcagtcc aaatttggac     3540

cttagcaatc atgtcttttg tataggtact taatgttaat acatattaca ctacagttta     3600

cttttcagaa tactaaagac tttataactg catgaacttg gattttttta atcactcata     3660

tggtagaatt ttataaacac atacatgata ccatccaaat tcttgctttt aataacaaag     3720

gtacaatatt ttgttttagt atgaaaatct ggtagatcct attacacttc tgtttatatt     3780

aaatccacaa tattttatta catttttaac ttgtataaat tttaggtcaa atccttcaag     3840

ccaacctata ctaaaaatta gttccataat cacaaatggc tcttttgtgt aattgtttaa     3900

tttcacctga atatcataat gcttaaagcc atatggagtt ggaaattatt tccaaagcat     3960

atttattcca ttgttttagt ctggctattt acagtataaa aaaagcattt ttattaaaat     4020

actgtgtagt tctttgagat agttgcttat gcatatagta agtattacat tcttagagta     4080

gagcagagtt tttagttagt attaatttat tttcctccat tcatgtactt ttccttatat     4140

ttccaaaact gttactgaga atgggtcaag atcagtgaga aatctttaca gttgacagga     4200

acctggaccc cttaccccaa ctttatgagt aatgcttgga ataaaaactc ttaaggcaac     4260

tcactgattt acttctagca atagcatgat gttacaggaa tattacctct gtttaagcaa     4320

ggtaatgtgt aaaatcagtc tcggctgtca gaataacttc taaaaggtat ttttataagc     4380

agttcaagtt actgaaaacc ttttaaacct ttctgaagtt cgttagtata aattactttt     4440

ctaggattat taataaaagc cacataggtg gcaagttgta gttttatatg gctctgtaga     4500

gtggtgaacc ttctagagga atatatgatt tattcacagt tcctcaaggc ctggggatga     4560

tgatcagtta tacctatttt tgtgcaatta catcatgttg tacattagaa atggagagtt     4620

taatagctct ttaactgctg tcctcattag gtaatgataa atatttccct taaataattg     4680

actattttgc tgtgttttaa aaatgattga aatttatctt gccatatctc ataatttcat     4740

gcacaagttg actgagctaa tcttgagaat atattcgtaa aataggagca catttagttg     4800

aggtatacaa ggtaggactc tagacaaaac cttctatttt agctttagtg aatttcaaaa     4860

gtaatgggtc ttggagtata gatttttatt agtagcttga aagagcttaa tcatatgcag     4920

taagtatttt tattaccaat aaatttaaaa ttttttaaga aaaatatttt tatcctaggg     4980

ccaagtgttg cctgccacca atcagtaagt tagtctataa caaattttac cctaacagtt     5040

ttaccaccta gtaacagtca tttctgaaaa tatgttggat agaaagtcac tctttggcaa     5100

aagtgttaga atttgctttt gtgccatcta ttccttttat ggcatctatc ttgaaagtaa     5160

tcttgtattg gagattgaaa gatgctgtaa tttagaaatt aacatgatat cttaaattac     5220

ctttatgaaa tatagttttg tataatagca tagattttcc ttcaaaaaat gaacatttat     5280

atatctacaa aaatatggag aagagtaatt tgaaagccta ctttctgaag aaaatggtgg     5340

gatttttttt tatcatgatt aaatatcaaa aaattgccct atgaaaactt taaatctcta     5400

aaacatttga aatactacca tatttgtgat ttattgagaa taaaaatcca ttttgaaatg     5460

taaaattttt atgatctgat tcagttttaa gaaaacatga atgaactaga agatattaaa     5520

aacatttgac attggtaaga aatattgata ctgatattga tttttatata ggtatttatt     5580

tcagaattga tattttgaga aaaatacatg tgagtcattt tttctgtttc tcttttctct     5640

taacgattat cactgtaatt ctgaatctga aaggtaaaac aattagtcaa aatattattg     5700

ccatcattct acctgtgtta tgaaactact tattcatagt taattctcat taacacttac     5760

atttccataa agaaaactca agtattaata aaagagactt tactggctta agagggctgt     5820

gaaagatttt tgatagtgaa tcatgaccct aagggagaga tttgtgtgat aaaagtattg     5880

tatataatag atcagcgatt tttgtaaggc aaacagaatt tgtaagttgg cagatcttcc     5940

taagttgcaa aatgtaatga tgagcttggt ggagaagaat gagtcgttct tggaatacct     6000

atgtgcagcc actacccatc tcaatgtcac cttgtttgca ttcttggata gcttgtatat     6060

gtagtagttt gatgaataat ttaaagaaaa acacctaaaa tttgaaaaat gattgtagga     6120

tcaaaaaagg cagatgaaat tacttaatac tcagtgtttt ggagagtatt ccttttagtt     6180

tgttggttgg ctggtttgaa cgatagaaat atgcagcatg caatatatgc ttatatttca     6240

ttttaatttc tgatatataa tgaacttctt gggagaggta ctgaatcttt gatgtttttt     6300

gtcattgttc tcaagtgcaa tataacaatg taaccaaatc tagataattt caaagttgtc     6360

attaatttag taagcctaat ataaacaaat atttgtatta tttttgttag caggaaagag     6420

tgattaagtg aggttattta cccctaaatg gtccattctg cattgtattt caggctggaa     6480

atgaattatt ctttaccagt tttgaaacac tttgaaatat cctaaggtaa cttggaagct     6540

gtgtagtata tcaaattaat ttgctaccta ataacataga aagtaaatat ctttgtggtc     6600

acccacattg ggtgagacag aaaatgaatc tgttctaaaa tttgtaattt gctaacttga     6660

tttgagttag tgaaaactgg tacagtgttc tgcttgattt acaacatgta acttgtgact     6720

gtacaataaa cataagcata tggtaccaca aaaaaaaaaa aaaaaaaaa                 6769


<210>  275
<211>  1002
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha V (ITGAV), transcript variant 2, 
       polypeptide GenBank Accession No. NM_001144999.2  GI:495529073

<400>  275

Met Leu Leu Gly Thr Leu Leu Leu Ile Leu Tyr Ile Leu Met Leu Cys 
1               5                   10                  15      


Arg Met Phe Leu Leu Val Gly Ala Pro Lys Ala Asn Thr Thr Gln Pro 
            20                  25                  30          


Gly Ile Val Glu Gly Gly Gln Val Leu Lys Cys Asp Trp Ser Ser Thr 
        35                  40                  45              


Arg Arg Cys Gln Pro Ile Glu Phe Asp Ala Thr Gly Asn Arg Asp Tyr 
    50                  55                  60                  


Ala Lys Asp Asp Pro Leu Glu Phe Lys Ser His Gln Trp Phe Gly Ala 
65                  70                  75                  80  


Ser Val Arg Ser Lys Gln Asp Lys Ile Leu Ala Cys Ala Pro Leu Tyr 
                85                  90                  95      


His Trp Arg Thr Glu Met Lys Gln Glu Arg Glu Pro Val Gly Thr Cys 
            100                 105                 110         


Phe Leu Gln Asp Gly Thr Lys Thr Val Glu Tyr Ala Pro Cys Arg Ser 
        115                 120                 125             


Gln Asp Ile Asp Ala Asp Gly Gln Gly Phe Cys Gln Gly Gly Phe Ser 
    130                 135                 140                 


Ile Asp Phe Thr Lys Ala Asp Arg Val Leu Leu Gly Gly Pro Gly Ser 
145                 150                 155                 160 


Phe Tyr Trp Gln Gly Gln Leu Ile Ser Asp Gln Val Ala Glu Ile Val 
                165                 170                 175     


Ser Lys Tyr Asp Pro Asn Val Tyr Ser Ile Lys Tyr Asn Asn Gln Leu 
            180                 185                 190         


Ala Thr Arg Thr Ala Gln Ala Ile Phe Asp Asp Ser Tyr Leu Gly Tyr 
        195                 200                 205             


Ser Val Ala Val Gly Asp Phe Asn Gly Asp Gly Ile Asp Asp Phe Val 
    210                 215                 220                 


Ser Gly Val Pro Arg Ala Ala Arg Thr Leu Gly Met Val Tyr Ile Tyr 
225                 230                 235                 240 


Asp Gly Lys Asn Met Ser Ser Leu Tyr Asn Phe Thr Gly Glu Gln Met 
                245                 250                 255     


Ala Ala Tyr Phe Gly Phe Ser Val Ala Ala Thr Asp Ile Asn Gly Asp 
            260                 265                 270         


Asp Tyr Ala Asp Val Phe Ile Gly Ala Pro Leu Phe Met Asp Arg Gly 
        275                 280                 285             


Ser Asp Gly Lys Leu Gln Glu Val Gly Gln Val Ser Val Ser Leu Gln 
    290                 295                 300                 


Arg Ala Ser Gly Asp Phe Gln Thr Thr Lys Leu Asn Gly Phe Glu Val 
305                 310                 315                 320 


Phe Ala Arg Phe Gly Ser Ala Ile Ala Pro Leu Gly Asp Leu Asp Gln 
                325                 330                 335     


Asp Gly Phe Asn Asp Ile Ala Ile Ala Ala Pro Tyr Gly Gly Glu Asp 
            340                 345                 350         


Lys Lys Gly Ile Val Tyr Ile Phe Asn Gly Arg Ser Thr Gly Leu Asn 
        355                 360                 365             


Ala Val Pro Ser Gln Ile Leu Glu Gly Gln Trp Ala Ala Arg Ser Met 
    370                 375                 380                 


Pro Pro Ser Phe Gly Tyr Ser Met Lys Gly Ala Thr Asp Ile Asp Lys 
385                 390                 395                 400 


Asn Gly Tyr Pro Asp Leu Ile Val Gly Ala Phe Gly Val Asp Arg Ala 
                405                 410                 415     


Ile Leu Tyr Arg Ala Arg Pro Val Ile Thr Val Asn Ala Gly Leu Glu 
            420                 425                 430         


Val Tyr Pro Ser Ile Leu Asn Gln Asp Asn Lys Thr Cys Ser Leu Pro 
        435                 440                 445             


Gly Thr Ala Leu Lys Val Ser Cys Phe Asn Val Arg Phe Cys Leu Lys 
    450                 455                 460                 


Ala Asp Gly Lys Gly Val Leu Pro Arg Lys Leu Asn Phe Gln Val Glu 
465                 470                 475                 480 


Leu Leu Leu Asp Lys Leu Lys Gln Lys Gly Ala Ile Arg Arg Ala Leu 
                485                 490                 495     


Phe Leu Tyr Ser Arg Ser Pro Ser His Ser Lys Asn Met Thr Ile Ser 
            500                 505                 510         


Arg Gly Gly Leu Met Gln Cys Glu Glu Leu Ile Ala Tyr Leu Arg Asp 
        515                 520                 525             


Glu Ser Glu Phe Arg Asp Lys Leu Thr Pro Ile Thr Ile Phe Met Glu 
    530                 535                 540                 


Tyr Arg Leu Asp Tyr Arg Thr Ala Ala Asp Thr Thr Gly Leu Gln Pro 
545                 550                 555                 560 


Ile Leu Asn Gln Phe Thr Pro Ala Asn Ile Ser Arg Gln Ala His Ile 
                565                 570                 575     


Leu Leu Asp Cys Gly Glu Asp Asn Val Cys Lys Pro Lys Leu Glu Val 
            580                 585                 590         


Ser Val Asp Ser Asp Gln Lys Lys Ile Tyr Ile Gly Asp Asp Asn Pro 
        595                 600                 605             


Leu Thr Leu Ile Val Lys Ala Gln Asn Gln Gly Glu Gly Ala Tyr Glu 
    610                 615                 620                 


Ala Glu Leu Ile Val Ser Ile Pro Leu Gln Ala Asp Phe Ile Gly Val 
625                 630                 635                 640 


Val Arg Asn Asn Glu Ala Leu Ala Arg Leu Ser Cys Ala Phe Lys Thr 
                645                 650                 655     


Glu Asn Gln Thr Arg Gln Val Val Cys Asp Leu Gly Asn Pro Met Lys 
            660                 665                 670         


Ala Gly Thr Gln Leu Leu Ala Gly Leu Arg Phe Ser Val His Gln Gln 
        675                 680                 685             


Ser Glu Met Asp Thr Ser Val Lys Phe Asp Leu Gln Ile Gln Ser Ser 
    690                 695                 700                 


Asn Leu Phe Asp Lys Val Ser Pro Val Val Ser His Lys Val Asp Leu 
705                 710                 715                 720 


Ala Val Leu Ala Ala Val Glu Ile Arg Gly Val Ser Ser Pro Asp His 
                725                 730                 735     


Ile Phe Leu Pro Ile Pro Asn Trp Glu His Lys Glu Asn Pro Glu Thr 
            740                 745                 750         


Glu Glu Asp Val Gly Pro Val Val Gln His Ile Tyr Glu Leu Arg Asn 
        755                 760                 765             


Asn Gly Pro Ser Ser Phe Ser Lys Ala Met Leu His Leu Gln Trp Pro 
    770                 775                 780                 


Tyr Lys Tyr Asn Asn Asn Thr Leu Leu Tyr Ile Leu His Tyr Asp Ile 
785                 790                 795                 800 


Asp Gly Pro Met Asn Cys Thr Ser Asp Met Glu Ile Asn Pro Leu Arg 
                805                 810                 815     


Ile Lys Ile Ser Ser Leu Gln Thr Thr Glu Lys Asn Asp Thr Val Ala 
            820                 825                 830         


Gly Gln Gly Glu Arg Asp His Leu Ile Thr Lys Arg Asp Leu Ala Leu 
        835                 840                 845             


Ser Glu Gly Asp Ile His Thr Leu Gly Cys Gly Val Ala Gln Cys Leu 
    850                 855                 860                 


Lys Ile Val Cys Gln Val Gly Arg Leu Asp Arg Gly Lys Ser Ala Ile 
865                 870                 875                 880 


Leu Tyr Val Lys Ser Leu Leu Trp Thr Glu Thr Phe Met Asn Lys Glu 
                885                 890                 895     


Asn Gln Asn His Ser Tyr Ser Leu Lys Ser Ser Ala Ser Phe Asn Val 
            900                 905                 910         


Ile Glu Phe Pro Tyr Lys Asn Leu Pro Ile Glu Asp Ile Thr Asn Ser 
        915                 920                 925             


Thr Leu Val Thr Thr Asn Val Thr Trp Gly Ile Gln Pro Ala Pro Met 
    930                 935                 940                 


Pro Val Pro Val Trp Val Ile Ile Leu Ala Val Leu Ala Gly Leu Leu 
945                 950                 955                 960 


Leu Leu Ala Val Leu Val Phe Val Met Tyr Arg Met Gly Phe Phe Lys 
                965                 970                 975     


Arg Val Arg Pro Pro Gln Glu Glu Gln Glu Arg Glu Gln Leu Gln Pro 
            980                 985                 990         


His Glu Asn Gly Glu Gly Asn Ser  Glu Thr 
        995                 1000         


<210>  276
<211>  6945
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha V (ITGAV), transcript variant 3, 
       mRNA GenBank Accession No. NM_001145000.2 GI:495529075

<400>  276
agcctcagac gctgcgtgga gcggcggagc cggagggaag caaaggaccg tctgcgctgc       60

tgtccccgcc ccgcgcgctc tgcgcccctc gtccctggcg gtcgctccga agctcagccc      120

tcttgcctgc cccggagctg tcccgggcta gccgagaaga gagcggccgg caagtttggg      180

cgcgcgcagg cggcgggccg cgggcactgg gcgcctcgct ggggcggggg gaggtggcta      240

ccgctcccgg cttggcgtcc cgcgcgcact tcggcgatgg cttttccgcc gcggcgacgg      300

ctgcgcctcg gtccccgcgg cctcccgctt cttctctcgg gactcctgct acctctgtgc      360

cgcgccttca acctagacgt ggacagtcct gccgagtact ctggccccga gggaagttac      420

ttcggcttcg ccgtggattt cttcgtgccc agcgcgtctt cccggatgtt tcttctcgtg      480

ggagctccca aagcaaacac cacccagcct gggattgtgg aaggagggca ggtcctcaaa      540

tgtgactggt cttctacccg ccggtgccag ccaattgaat ttgatgcaac aggcaataga      600

gattatgcca aggatgatcc attggaattt aagtcccatc agtggtttgg agcatctgtg      660

aggtcgaaac aggataaaat tttggcctgt gccccattgt accattggag aactgagatg      720

aaacaggagc gagagcctgt tggaacatgc tttcttcaag atggaacaaa gactgttgag      780

tatgctccat gtagatcacg tcagcttatt tcggatcaag tggcagaaat cgtatctaaa      840

tacgacccca atgtttacag catcaagtat aataaccaat tagcaactcg gactgcacaa      900

gctatttttg atgacagcta tttgggttat tctgtggctg tcggagattt caatggtgat      960

ggcatagatg actttgtttc aggagttcca agagcagcaa ggactttggg aatggtttat     1020

atttatgatg ggaagaacat gtcctcctta tacaatttta ctggcgagca gatggctgca     1080

tatttcggat tttctgtagc tgccactgac attaatggag atgattatgc agatgtgttt     1140

attggagcac ctctcttcat ggatcgtggc tctgatggca aactccaaga ggtggggcag     1200

gtctcagtgt ctctacagag agcttcagga gacttccaga cgacaaagct gaatggattt     1260

gaggtctttg cacggtttgg cagtgccata gctcctttgg gagatctgga ccaggatggt     1320

ttcaatgata ttgcaattgc tgctccatat gggggtgaag ataaaaaagg aattgtttat     1380

atcttcaatg gaagatcaac aggcttgaac gcagtcccat ctcaaatcct tgaagggcag     1440

tgggctgctc gaagcatgcc accaagcttt ggctattcaa tgaaaggagc cacagatata     1500

gacaaaaatg gatatccaga cttaattgta ggagcttttg gtgtagatcg agctatctta     1560

tacagggcca gaccagttat cactgtaaat gctggtcttg aagtgtaccc tagcatttta     1620

aatcaagaca ataaaacctg ctcactgcct ggaacagctc tcaaagtttc ctgttttaat     1680

gttaggttct gcttaaaggc agatggcaaa ggagtacttc ccaggaaact taatttccag     1740

gtggaacttc ttttggataa actcaagcaa aagggagcaa ttcgacgagc actgtttctc     1800

tacagcaggt ccccaagtca ctccaagaac atgactattt caaggggggg actgatgcag     1860

tgtgaggaat tgatagcgta tctgcgggat gaatctgaat ttagagacaa actcactcca     1920

attactattt ttatggaata tcggttggat tatagaacag ctgctgatac aacaggcttg     1980

caacccattc ttaaccagtt cacgcctgct aacattagtc gacaggctca cattctactt     2040

gactgtggtg aagacaatgt ctgtaaaccc aagctggaag tttctgtaga tagtgatcaa     2100

aagaagatct atattgggga tgacaaccct ctgacattga ttgttaaggc tcagaatcaa     2160

ggagaaggtg cctacgaagc tgagctcatc gtttccattc cactgcaggc tgatttcatc     2220

ggggttgtcc gaaacaatga agccttagca agactttcct gtgcatttaa gacagaaaac     2280

caaactcgcc aggtggtatg tgaccttgga aacccaatga aggctggaac tcaactctta     2340

gctggtcttc gtttcagtgt gcaccagcag tcagagatgg atacttctgt gaaatttgac     2400

ttacaaatcc aaagctcaaa tctatttgac aaagtaagcc cagttgtatc tcacaaagtt     2460

gatcttgctg ttttagctgc agttgagata agaggagtct cgagtcctga tcatatcttt     2520

cttccgattc caaactggga gcacaaggag aaccctgaga ctgaagaaga tgttgggcca     2580

gttgttcagc acatctatga gctgagaaac aatggtccaa gttcattcag caaggcaatg     2640

ctccatcttc agtggcctta caaatataat aataacactc tgttgtatat ccttcattat     2700

gatattgatg gaccaatgaa ctgcacttca gatatggaga tcaacccttt gagaattaag     2760

atctcatctt tgcaaacaac tgaaaagaat gacacggttg ccgggcaagg tgagcgggac     2820

catctcatca ctaagcggga tcttgccctc agtgaaggag atattcacac tttgggttgt     2880

ggagttgctc agtgcttgaa gattgtctgc caagttggga gattagacag aggaaagagt     2940

gcaatcttgt acgtaaagtc attactgtgg actgagactt ttatgaataa agaaaatcag     3000

aatcattcct attctctgaa gtcgtctgct tcatttaatg tcatagagtt tccttataag     3060

aatcttccaa ttgaggatat caccaactcc acattggtta ccactaatgt cacctggggc     3120

attcagccag cgcccatgcc tgtgcctgtg tgggtgatca ttttagcagt tctagcagga     3180

ttgttgctac tggctgtttt ggtatttgta atgtacagga tgggcttttt taaacgggtc     3240

cggccacctc aagaagaaca agaaagggag cagcttcaac ctcatgaaaa tggtgaagga     3300

aactcagaaa cttaactgca gtttttaagt tatgctacat cttgacccac tagaattagc     3360

aactttatta tagatttaaa ctttcttcat gaggagtaaa aatccaaggc tttactgctg     3420

atagtgctaa ttggcattaa ccacaaaatg agaattatat ttgtcaacct tctccttata     3480

aataagttca gacatacatt taataacata gggtgacttg tgtttttagg tatttaaata     3540

ataaaatttc aagggatagt ttttattcaa tgtatataag acaggtagtg cctgatttac     3600

tactttatat aaaatagtac ctccttcagt tactgtttct gatttaatgt acggaacttt     3660

atttgttgtt gttgttgttg ttgttgttgt tgttttaaag cagtccaaat ttggacctta     3720

gcaatcatgt cttttgtata ggtacttaat gttaatacat attacactac agtttacttt     3780

tcagaatact aaagacttta taactgcatg aacttggatt tttttaatca ctcatatggt     3840

agaattttat aaacacatac atgataccat ccaaattctt gcttttaata acaaaggtac     3900

aatattttgt tttagtatga aaatctggta gatcctatta cacttctgtt tatattaaat     3960

ccacaatatt ttattacatt tttaacttgt ataaatttta ggtcaaatcc ttcaagccaa     4020

cctatactaa aaattagttc cataatcaca aatggctctt ttgtgtaatt gtttaatttc     4080

acctgaatat cataatgctt aaagccatat ggagttggaa attatttcca aagcatattt     4140

attccattgt tttagtctgg ctatttacag tataaaaaaa gcatttttat taaaatactg     4200

tgtagttctt tgagatagtt gcttatgcat atagtaagta ttacattctt agagtagagc     4260

agagttttta gttagtatta atttattttc ctccattcat gtacttttcc ttatatttcc     4320

aaaactgtta ctgagaatgg gtcaagatca gtgagaaatc tttacagttg acaggaacct     4380

ggacccctta ccccaacttt atgagtaatg cttggaataa aaactcttaa ggcaactcac     4440

tgatttactt ctagcaatag catgatgtta caggaatatt acctctgttt aagcaaggta     4500

atgtgtaaaa tcagtctcgg ctgtcagaat aacttctaaa aggtattttt ataagcagtt     4560

caagttactg aaaacctttt aaacctttct gaagttcgtt agtataaatt acttttctag     4620

gattattaat aaaagccaca taggtggcaa gttgtagttt tatatggctc tgtagagtgg     4680

tgaaccttct agaggaatat atgatttatt cacagttcct caaggcctgg ggatgatgat     4740

cagttatacc tatttttgtg caattacatc atgttgtaca ttagaaatgg agagtttaat     4800

agctctttaa ctgctgtcct cattaggtaa tgataaatat ttcccttaaa taattgacta     4860

ttttgctgtg ttttaaaaat gattgaaatt tatcttgcca tatctcataa tttcatgcac     4920

aagttgactg agctaatctt gagaatatat tcgtaaaata ggagcacatt tagttgaggt     4980

atacaaggta ggactctaga caaaaccttc tattttagct ttagtgaatt tcaaaagtaa     5040

tgggtcttgg agtatagatt tttattagta gcttgaaaga gcttaatcat atgcagtaag     5100

tatttttatt accaataaat ttaaaatttt ttaagaaaaa tatttttatc ctagggccaa     5160

gtgttgcctg ccaccaatca gtaagttagt ctataacaaa ttttacccta acagttttac     5220

cacctagtaa cagtcatttc tgaaaatatg ttggatagaa agtcactctt tggcaaaagt     5280

gttagaattt gcttttgtgc catctattcc ttttatggca tctatcttga aagtaatctt     5340

gtattggaga ttgaaagatg ctgtaattta gaaattaaca tgatatctta aattaccttt     5400

atgaaatata gttttgtata atagcataga ttttccttca aaaaatgaac atttatatat     5460

ctacaaaaat atggagaaga gtaatttgaa agcctacttt ctgaagaaaa tggtgggatt     5520

tttttttatc atgattaaat atcaaaaaat tgccctatga aaactttaaa tctctaaaac     5580

atttgaaata ctaccatatt tgtgatttat tgagaataaa aatccatttt gaaatgtaaa     5640

atttttatga tctgattcag ttttaagaaa acatgaatga actagaagat attaaaaaca     5700

tttgacattg gtaagaaata ttgatactga tattgatttt tatataggta tttatttcag     5760

aattgatatt ttgagaaaaa tacatgtgag tcattttttc tgtttctctt ttctcttaac     5820

gattatcact gtaattctga atctgaaagg taaaacaatt agtcaaaata ttattgccat     5880

cattctacct gtgttatgaa actacttatt catagttaat tctcattaac acttacattt     5940

ccataaagaa aactcaagta ttaataaaag agactttact ggcttaagag ggctgtgaaa     6000

gatttttgat agtgaatcat gaccctaagg gagagatttg tgtgataaaa gtattgtata     6060

taatagatca gcgatttttg taaggcaaac agaatttgta agttggcaga tcttcctaag     6120

ttgcaaaatg taatgatgag cttggtggag aagaatgagt cgttcttgga atacctatgt     6180

gcagccacta cccatctcaa tgtcaccttg tttgcattct tggatagctt gtatatgtag     6240

tagtttgatg aataatttaa agaaaaacac ctaaaatttg aaaaatgatt gtaggatcaa     6300

aaaaggcaga tgaaattact taatactcag tgttttggag agtattcctt ttagtttgtt     6360

ggttggctgg tttgaacgat agaaatatgc agcatgcaat atatgcttat atttcatttt     6420

aatttctgat atataatgaa cttcttggga gaggtactga atctttgatg ttttttgtca     6480

ttgttctcaa gtgcaatata acaatgtaac caaatctaga taatttcaaa gttgtcatta     6540

atttagtaag cctaatataa acaaatattt gtattatttt tgttagcagg aaagagtgat     6600

taagtgaggt tatttacccc taaatggtcc attctgcatt gtatttcagg ctggaaatga     6660

attattcttt accagttttg aaacactttg aaatatccta aggtaacttg gaagctgtgt     6720

agtatatcaa attaatttgc tacctaataa catagaaagt aaatatcttt gtggtcaccc     6780

acattgggtg agacagaaaa tgaatctgtt ctaaaatttg taatttgcta acttgatttg     6840

agttagtgaa aactggtaca gtgttctgct tgatttacaa catgtaactt gtgactgtac     6900

aataaacata agcatatggt accacaaaaa aaaaaaaaaa aaaaa                     6945


<210>  277
<211>  1012
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha V (ITGAV), transcript variant 3, 
       polypeptide GenBank Accession No. NP_001138472.1 GI:223468597

<400>  277

Met Ala Phe Pro Pro Arg Arg Arg Leu Arg Leu Gly Pro Arg Gly Leu 
1               5                   10                  15      


Pro Leu Leu Leu Ser Gly Leu Leu Leu Pro Leu Cys Arg Ala Phe Asn 
            20                  25                  30          


Leu Asp Val Asp Ser Pro Ala Glu Tyr Ser Gly Pro Glu Gly Ser Tyr 
        35                  40                  45              


Phe Gly Phe Ala Val Asp Phe Phe Val Pro Ser Ala Ser Ser Arg Met 
    50                  55                  60                  


Phe Leu Leu Val Gly Ala Pro Lys Ala Asn Thr Thr Gln Pro Gly Ile 
65                  70                  75                  80  


Val Glu Gly Gly Gln Val Leu Lys Cys Asp Trp Ser Ser Thr Arg Arg 
                85                  90                  95      


Cys Gln Pro Ile Glu Phe Asp Ala Thr Gly Asn Arg Asp Tyr Ala Lys 
            100                 105                 110         


Asp Asp Pro Leu Glu Phe Lys Ser His Gln Trp Phe Gly Ala Ser Val 
        115                 120                 125             


Arg Ser Lys Gln Asp Lys Ile Leu Ala Cys Ala Pro Leu Tyr His Trp 
    130                 135                 140                 


Arg Thr Glu Met Lys Gln Glu Arg Glu Pro Val Gly Thr Cys Phe Leu 
145                 150                 155                 160 


Gln Asp Gly Thr Lys Thr Val Glu Tyr Ala Pro Cys Arg Ser Arg Gln 
                165                 170                 175     


Leu Ile Ser Asp Gln Val Ala Glu Ile Val Ser Lys Tyr Asp Pro Asn 
            180                 185                 190         


Val Tyr Ser Ile Lys Tyr Asn Asn Gln Leu Ala Thr Arg Thr Ala Gln 
        195                 200                 205             


Ala Ile Phe Asp Asp Ser Tyr Leu Gly Tyr Ser Val Ala Val Gly Asp 
    210                 215                 220                 


Phe Asn Gly Asp Gly Ile Asp Asp Phe Val Ser Gly Val Pro Arg Ala 
225                 230                 235                 240 


Ala Arg Thr Leu Gly Met Val Tyr Ile Tyr Asp Gly Lys Asn Met Ser 
                245                 250                 255     


Ser Leu Tyr Asn Phe Thr Gly Glu Gln Met Ala Ala Tyr Phe Gly Phe 
            260                 265                 270         


Ser Val Ala Ala Thr Asp Ile Asn Gly Asp Asp Tyr Ala Asp Val Phe 
        275                 280                 285             


Ile Gly Ala Pro Leu Phe Met Asp Arg Gly Ser Asp Gly Lys Leu Gln 
    290                 295                 300                 


Glu Val Gly Gln Val Ser Val Ser Leu Gln Arg Ala Ser Gly Asp Phe 
305                 310                 315                 320 


Gln Thr Thr Lys Leu Asn Gly Phe Glu Val Phe Ala Arg Phe Gly Ser 
                325                 330                 335     


Ala Ile Ala Pro Leu Gly Asp Leu Asp Gln Asp Gly Phe Asn Asp Ile 
            340                 345                 350         


Ala Ile Ala Ala Pro Tyr Gly Gly Glu Asp Lys Lys Gly Ile Val Tyr 
        355                 360                 365             


Ile Phe Asn Gly Arg Ser Thr Gly Leu Asn Ala Val Pro Ser Gln Ile 
    370                 375                 380                 


Leu Glu Gly Gln Trp Ala Ala Arg Ser Met Pro Pro Ser Phe Gly Tyr 
385                 390                 395                 400 


Ser Met Lys Gly Ala Thr Asp Ile Asp Lys Asn Gly Tyr Pro Asp Leu 
                405                 410                 415     


Ile Val Gly Ala Phe Gly Val Asp Arg Ala Ile Leu Tyr Arg Ala Arg 
            420                 425                 430         


Pro Val Ile Thr Val Asn Ala Gly Leu Glu Val Tyr Pro Ser Ile Leu 
        435                 440                 445             


Asn Gln Asp Asn Lys Thr Cys Ser Leu Pro Gly Thr Ala Leu Lys Val 
    450                 455                 460                 


Ser Cys Phe Asn Val Arg Phe Cys Leu Lys Ala Asp Gly Lys Gly Val 
465                 470                 475                 480 


Leu Pro Arg Lys Leu Asn Phe Gln Val Glu Leu Leu Leu Asp Lys Leu 
                485                 490                 495     


Lys Gln Lys Gly Ala Ile Arg Arg Ala Leu Phe Leu Tyr Ser Arg Ser 
            500                 505                 510         


Pro Ser His Ser Lys Asn Met Thr Ile Ser Arg Gly Gly Leu Met Gln 
        515                 520                 525             


Cys Glu Glu Leu Ile Ala Tyr Leu Arg Asp Glu Ser Glu Phe Arg Asp 
    530                 535                 540                 


Lys Leu Thr Pro Ile Thr Ile Phe Met Glu Tyr Arg Leu Asp Tyr Arg 
545                 550                 555                 560 


Thr Ala Ala Asp Thr Thr Gly Leu Gln Pro Ile Leu Asn Gln Phe Thr 
                565                 570                 575     


Pro Ala Asn Ile Ser Arg Gln Ala His Ile Leu Leu Asp Cys Gly Glu 
            580                 585                 590         


Asp Asn Val Cys Lys Pro Lys Leu Glu Val Ser Val Asp Ser Asp Gln 
        595                 600                 605             


Lys Lys Ile Tyr Ile Gly Asp Asp Asn Pro Leu Thr Leu Ile Val Lys 
    610                 615                 620                 


Ala Gln Asn Gln Gly Glu Gly Ala Tyr Glu Ala Glu Leu Ile Val Ser 
625                 630                 635                 640 


Ile Pro Leu Gln Ala Asp Phe Ile Gly Val Val Arg Asn Asn Glu Ala 
                645                 650                 655     


Leu Ala Arg Leu Ser Cys Ala Phe Lys Thr Glu Asn Gln Thr Arg Gln 
            660                 665                 670         


Val Val Cys Asp Leu Gly Asn Pro Met Lys Ala Gly Thr Gln Leu Leu 
        675                 680                 685             


Ala Gly Leu Arg Phe Ser Val His Gln Gln Ser Glu Met Asp Thr Ser 
    690                 695                 700                 


Val Lys Phe Asp Leu Gln Ile Gln Ser Ser Asn Leu Phe Asp Lys Val 
705                 710                 715                 720 


Ser Pro Val Val Ser His Lys Val Asp Leu Ala Val Leu Ala Ala Val 
                725                 730                 735     


Glu Ile Arg Gly Val Ser Ser Pro Asp His Ile Phe Leu Pro Ile Pro 
            740                 745                 750         


Asn Trp Glu His Lys Glu Asn Pro Glu Thr Glu Glu Asp Val Gly Pro 
        755                 760                 765             


Val Val Gln His Ile Tyr Glu Leu Arg Asn Asn Gly Pro Ser Ser Phe 
    770                 775                 780                 


Ser Lys Ala Met Leu His Leu Gln Trp Pro Tyr Lys Tyr Asn Asn Asn 
785                 790                 795                 800 


Thr Leu Leu Tyr Ile Leu His Tyr Asp Ile Asp Gly Pro Met Asn Cys 
                805                 810                 815     


Thr Ser Asp Met Glu Ile Asn Pro Leu Arg Ile Lys Ile Ser Ser Leu 
            820                 825                 830         


Gln Thr Thr Glu Lys Asn Asp Thr Val Ala Gly Gln Gly Glu Arg Asp 
        835                 840                 845             


His Leu Ile Thr Lys Arg Asp Leu Ala Leu Ser Glu Gly Asp Ile His 
    850                 855                 860                 


Thr Leu Gly Cys Gly Val Ala Gln Cys Leu Lys Ile Val Cys Gln Val 
865                 870                 875                 880 


Gly Arg Leu Asp Arg Gly Lys Ser Ala Ile Leu Tyr Val Lys Ser Leu 
                885                 890                 895     


Leu Trp Thr Glu Thr Phe Met Asn Lys Glu Asn Gln Asn His Ser Tyr 
            900                 905                 910         


Ser Leu Lys Ser Ser Ala Ser Phe Asn Val Ile Glu Phe Pro Tyr Lys 
        915                 920                 925             


Asn Leu Pro Ile Glu Asp Ile Thr Asn Ser Thr Leu Val Thr Thr Asn 
    930                 935                 940                 


Val Thr Trp Gly Ile Gln Pro Ala Pro Met Pro Val Pro Val Trp Val 
945                 950                 955                 960 


Ile Ile Leu Ala Val Leu Ala Gly Leu Leu Leu Leu Ala Val Leu Val 
                965                 970                 975     


Phe Val Met Tyr Arg Met Gly Phe Phe Lys Arg Val Arg Pro Pro Gln 
            980                 985                 990         


Glu Glu Gln Glu Arg Glu Gln Leu  Gln Pro His Glu Asn  Gly Glu Gly 
        995                 1000                 1005             


Asn Ser  Glu Thr 
    1010         


<210>  278
<211>  4149
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha X (complement component 3 receptor 4
       subunit) (ITGAX), transcript variant 1, mRNA GenBank Accession 
       No. NM_001286375.1  GI:556503453

<400>  278
atgctgacaa tcttcttcct tcccctggcc acctctctgc ccacttgctt cctcagtacc       60

ttggtccagc tcttcctgca acggcccagg agctcagagc tccacatctg accttctagt      120

catgaccagg accagggcag cactcctcct gttcacagcc ttagcaactt ctctaggttt      180

caacttggac acagaggagc tgacagcctt ccgtgtggac agcgctgggt ttggagacag      240

cgtggtccag tatgccaact cctgggtggt ggttggagcc ccccaaaaga taacagctgc      300

caaccaaacg ggtggcctct accagtgtgg ctacagcact ggtgcctgtg agcccatcgg      360

cctgcaggtg cccccggagg ccgtgaacat gtccctgggc ctgtccctgg cgtctaccac      420

cagcccttcc cagctgctgg cctgcggccc caccgtgcac cacgagtgcg ggaggaacat      480

gtacctcacc ggactctgct tcctcctggg ccccacccag ctcacccaga ggctcccggt      540

gtccaggcag gagtgcccaa gacaggagca ggacattgtg ttcctgatcg atggctcagg      600

cagcatctcc tcccgcaact ttgccacgat gatgaacttc gtgagagctg tgataagcca      660

gttccagaga cccagcaccc agttttccct gatgcagttc tccaacaaat tccaaacaca      720

cttcactttc gaggaattca ggcgcagctc aaaccccctc agcctgttgg cttctgttca      780

ccagctgcaa gggtttacat acacggccac cgccatccaa aatgtcgtgc accgattgtt      840

ccatgcctca tatggggccc gtagggatgc cgccaaaatt ctcattgtca tcactgatgg      900

gaagaaagaa ggcgacagcc tggattataa ggatgtcatc cccatggctg atgcagcagg      960

catcatccgc tatgcaattg gggttggatt agcttttcaa aacagaaatt cttggaaaga     1020

attaaatgac attgcatcga agccctccca ggaacacata tttaaagtgg aggactttga     1080

tgctctgaaa gatattcaaa accaactgaa ggagaagatc tttgccattg agggtacgga     1140

gaccacaagc agtagctcct tcgaattgga gatggcacag gagggcttca gcgctgtgtt     1200

cacacctgat ggccccgttc tgggggctgt ggggagcttc acctggtctg gaggtgcctt     1260

cctgtacccc ccaaatatga gccctacctt catcaacatg tctcaggaga atgtggacat     1320

gagggactct tacctgggtt actccaccga gctggccctc tggaaagggg tgcagagcct     1380

ggtcctgggg gccccccgct accagcacac cgggaaggct gtcatcttca cccaggtgtc     1440

caggcaatgg aggatgaagg ccgaagtcac ggggactcag atcggctcct acttcggggc     1500

ctccctctgc tccgtggacg tagacagcga cggcagcacc gacctggtcc tcatcggggc     1560

cccccattac tacgagcaga cccgaggggg ccaggtgtct gtgtgtccct tgcccagggg     1620

gtggagaagg tggtggtgtg atgctgttct ctacggggag cagggccacc cctggggtcg     1680

ctttggggcg gctctgacag tgctggggga tgtgaatggg gacaagctga cagacgtggt     1740

catcggggcc ccaggagagg aggagaaccg gggtgctgtc tacctgtttc acggagtctt     1800

gggacccagc atcagcccct cccacagcca gcggatcgcg ggctcccagc tctcctccag     1860

gctgcagtat tttgggcagg cactgagcgg gggtcaagac ctcacccagg atggactggt     1920

ggacctggct gtgggggccc ggggccaggt gctcctgctc aggaccagac ctgtgctctg     1980

ggtgggggtg agcatgcagt tcatacctgc cgagatcccc aggtctgcgt ttgagtgtcg     2040

ggagcaggtg gtctctgagc agaccctggt acagtccaac atctgccttt acattgacaa     2100

acgttctaag aacctgcttg ggagccgtga cctccaaagc tctgtgacct tggacctggc     2160

cctcgaccct ggccgcctga gtccccgtgc caccttccag gaaacaaaga accggagtct     2220

gagccgagtc cgagtcctcg ggctgaaggc acactgtgaa aacttcaacc tgctgctccc     2280

gagctgcgtg gaggactctg tgacccccat taccttgcgt ctgaacttca cgctggtggg     2340

caagcccctc cttgccttca gaaacctgcg gcctatgctg gccgccgatg ctcagagata     2400

cttcacggcc tccctaccct ttgagaagaa ctgtggagcc gaccatatct gccaggacaa     2460

tctcggcatc tccttcagct tcccaggctt gaagtccctg ctggtgggga gtaacctgga     2520

gctgaacgca gaagtgatgg tgtggaatga cggggaagac tcctacggaa ccaccatcac     2580

cttctcccac cccgcaggac tgtcctaccg ctacgtggca gagggccaga aacaagggca     2640

gctgcgttcc ctgcacctga catgtgacag cgccccagtt gggagccagg gcacctggag     2700

caccagctgc agaatcaacc acctcatctt ccgtggcggc gcccagatca ccttcttggc     2760

tacctttgac gtctccccca aggctgtcct gggagaccgg ctgcttctga cagccaatgt     2820

gagcagtgag aacaacactc ccaggaccag caagaccacc ttccagctgg agctcccggt     2880

gaagtatgct gtctacactg tggttagcag ccacgaacaa ttcaccaaat acctcaactt     2940

ctcagagtct gaggagaagg aaagccatgt ggccatgcac agataccagg tcaataacct     3000

gggacagagg gacctgcctg tcagcatcaa cttctgggtg cctgtggagc tgaaccagga     3060

ggctgtgtgg atggatgtgg aggtctccca cccccagaac ccatcccttc ggtgctcctc     3120

agagaaaatc gcacccccag catctgactt cctggcgcac attcagaaga atcccgtgct     3180

ggactgctcc attgctggct gcctgcggtt ccgctgtgac gtcccctcct tcagcgtcca     3240

ggaggagctg gatttcaccc tgaagggcaa cctcagcttt ggctgggtcc gccagatatt     3300

gcagaagaag gtgtcggtcg tgagtgtggc tgaaattacg ttcgacacat ccgtgtactc     3360

ccagcttcca ggacaggagg catttatgag agctcagacg acaacggtgc tggagaagta     3420

caaggtccac aaccccaccc ccctcatcgt aggcagctcc attgggggtc tgttgctgct     3480

ggcactcatc acagcggtac tgtacaaagt tggcttcttc aagcgtcagt acaaggaaat     3540

gatggaggag gcaaatggac aaattgcccc agaaaacggg acacagaccc ccagcccgcc     3600

cactccccat taccctcagg acaatgtctg aactctccag cttcgcgtga gaagtcccct     3660

tccatcccag agggtgggct tcagggcgca cagcatgaga ggctctgtgc ccccatcacc     3720

ctcgtttcca gtgaattagt gtcatgtcag catcagctca gggcttcatc gtggggctct     3780

cagttccgat ttcccaggct gaattgggag tgagatgcct gcatgctggg ttctgcacag     3840

ctggcctccc gcgttgggca acattgctgg ctggaaggga ggagcgccct ctagggaggg     3900

acatggcccc ggtgcggctg cagctcaccc agccccaggg gcagaagaga cccaaccact     3960

tctatttttt gaggctatga atatagtacc tgaaaaaatg ccaagacatg attatttttt     4020

taaaaagcgt actttaaatg tttgtgttaa taaattaaaa catgcacaaa aagatgcatc     4080

taccgctctt gggaaatatg tcaaaggtct aaaaataaaa aagccttctg tgaaaaaaaa     4140

aaaaaaaaa                                                             4149


<210>  279
<211>  1169
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha X (complement component 3 receptor 4
       subunit) (ITGAX), transcript variant 1, polypeptide GenBank 
       Accession No. NP_001273304.1 GI:556503454

<400>  279

Met Thr Arg Thr Arg Ala Ala Leu Leu Leu Phe Thr Ala Leu Ala Thr 
1               5                   10                  15      


Ser Leu Gly Phe Asn Leu Asp Thr Glu Glu Leu Thr Ala Phe Arg Val 
            20                  25                  30          


Asp Ser Ala Gly Phe Gly Asp Ser Val Val Gln Tyr Ala Asn Ser Trp 
        35                  40                  45              


Val Val Val Gly Ala Pro Gln Lys Ile Thr Ala Ala Asn Gln Thr Gly 
    50                  55                  60                  


Gly Leu Tyr Gln Cys Gly Tyr Ser Thr Gly Ala Cys Glu Pro Ile Gly 
65                  70                  75                  80  


Leu Gln Val Pro Pro Glu Ala Val Asn Met Ser Leu Gly Leu Ser Leu 
                85                  90                  95      


Ala Ser Thr Thr Ser Pro Ser Gln Leu Leu Ala Cys Gly Pro Thr Val 
            100                 105                 110         


His His Glu Cys Gly Arg Asn Met Tyr Leu Thr Gly Leu Cys Phe Leu 
        115                 120                 125             


Leu Gly Pro Thr Gln Leu Thr Gln Arg Leu Pro Val Ser Arg Gln Glu 
    130                 135                 140                 


Cys Pro Arg Gln Glu Gln Asp Ile Val Phe Leu Ile Asp Gly Ser Gly 
145                 150                 155                 160 


Ser Ile Ser Ser Arg Asn Phe Ala Thr Met Met Asn Phe Val Arg Ala 
                165                 170                 175     


Val Ile Ser Gln Phe Gln Arg Pro Ser Thr Gln Phe Ser Leu Met Gln 
            180                 185                 190         


Phe Ser Asn Lys Phe Gln Thr His Phe Thr Phe Glu Glu Phe Arg Arg 
        195                 200                 205             


Ser Ser Asn Pro Leu Ser Leu Leu Ala Ser Val His Gln Leu Gln Gly 
    210                 215                 220                 


Phe Thr Tyr Thr Ala Thr Ala Ile Gln Asn Val Val His Arg Leu Phe 
225                 230                 235                 240 


His Ala Ser Tyr Gly Ala Arg Arg Asp Ala Ala Lys Ile Leu Ile Val 
                245                 250                 255     


Ile Thr Asp Gly Lys Lys Glu Gly Asp Ser Leu Asp Tyr Lys Asp Val 
            260                 265                 270         


Ile Pro Met Ala Asp Ala Ala Gly Ile Ile Arg Tyr Ala Ile Gly Val 
        275                 280                 285             


Gly Leu Ala Phe Gln Asn Arg Asn Ser Trp Lys Glu Leu Asn Asp Ile 
    290                 295                 300                 


Ala Ser Lys Pro Ser Gln Glu His Ile Phe Lys Val Glu Asp Phe Asp 
305                 310                 315                 320 


Ala Leu Lys Asp Ile Gln Asn Gln Leu Lys Glu Lys Ile Phe Ala Ile 
                325                 330                 335     


Glu Gly Thr Glu Thr Thr Ser Ser Ser Ser Phe Glu Leu Glu Met Ala 
            340                 345                 350         


Gln Glu Gly Phe Ser Ala Val Phe Thr Pro Asp Gly Pro Val Leu Gly 
        355                 360                 365             


Ala Val Gly Ser Phe Thr Trp Ser Gly Gly Ala Phe Leu Tyr Pro Pro 
    370                 375                 380                 


Asn Met Ser Pro Thr Phe Ile Asn Met Ser Gln Glu Asn Val Asp Met 
385                 390                 395                 400 


Arg Asp Ser Tyr Leu Gly Tyr Ser Thr Glu Leu Ala Leu Trp Lys Gly 
                405                 410                 415     


Val Gln Ser Leu Val Leu Gly Ala Pro Arg Tyr Gln His Thr Gly Lys 
            420                 425                 430         


Ala Val Ile Phe Thr Gln Val Ser Arg Gln Trp Arg Met Lys Ala Glu 
        435                 440                 445             


Val Thr Gly Thr Gln Ile Gly Ser Tyr Phe Gly Ala Ser Leu Cys Ser 
    450                 455                 460                 


Val Asp Val Asp Ser Asp Gly Ser Thr Asp Leu Val Leu Ile Gly Ala 
465                 470                 475                 480 


Pro His Tyr Tyr Glu Gln Thr Arg Gly Gly Gln Val Ser Val Cys Pro 
                485                 490                 495     


Leu Pro Arg Gly Trp Arg Arg Trp Trp Cys Asp Ala Val Leu Tyr Gly 
            500                 505                 510         


Glu Gln Gly His Pro Trp Gly Arg Phe Gly Ala Ala Leu Thr Val Leu 
        515                 520                 525             


Gly Asp Val Asn Gly Asp Lys Leu Thr Asp Val Val Ile Gly Ala Pro 
    530                 535                 540                 


Gly Glu Glu Glu Asn Arg Gly Ala Val Tyr Leu Phe His Gly Val Leu 
545                 550                 555                 560 


Gly Pro Ser Ile Ser Pro Ser His Ser Gln Arg Ile Ala Gly Ser Gln 
                565                 570                 575     


Leu Ser Ser Arg Leu Gln Tyr Phe Gly Gln Ala Leu Ser Gly Gly Gln 
            580                 585                 590         


Asp Leu Thr Gln Asp Gly Leu Val Asp Leu Ala Val Gly Ala Arg Gly 
        595                 600                 605             


Gln Val Leu Leu Leu Arg Thr Arg Pro Val Leu Trp Val Gly Val Ser 
    610                 615                 620                 


Met Gln Phe Ile Pro Ala Glu Ile Pro Arg Ser Ala Phe Glu Cys Arg 
625                 630                 635                 640 


Glu Gln Val Val Ser Glu Gln Thr Leu Val Gln Ser Asn Ile Cys Leu 
                645                 650                 655     


Tyr Ile Asp Lys Arg Ser Lys Asn Leu Leu Gly Ser Arg Asp Leu Gln 
            660                 665                 670         


Ser Ser Val Thr Leu Asp Leu Ala Leu Asp Pro Gly Arg Leu Ser Pro 
        675                 680                 685             


Arg Ala Thr Phe Gln Glu Thr Lys Asn Arg Ser Leu Ser Arg Val Arg 
    690                 695                 700                 


Val Leu Gly Leu Lys Ala His Cys Glu Asn Phe Asn Leu Leu Leu Pro 
705                 710                 715                 720 


Ser Cys Val Glu Asp Ser Val Thr Pro Ile Thr Leu Arg Leu Asn Phe 
                725                 730                 735     


Thr Leu Val Gly Lys Pro Leu Leu Ala Phe Arg Asn Leu Arg Pro Met 
            740                 745                 750         


Leu Ala Ala Asp Ala Gln Arg Tyr Phe Thr Ala Ser Leu Pro Phe Glu 
        755                 760                 765             


Lys Asn Cys Gly Ala Asp His Ile Cys Gln Asp Asn Leu Gly Ile Ser 
    770                 775                 780                 


Phe Ser Phe Pro Gly Leu Lys Ser Leu Leu Val Gly Ser Asn Leu Glu 
785                 790                 795                 800 


Leu Asn Ala Glu Val Met Val Trp Asn Asp Gly Glu Asp Ser Tyr Gly 
                805                 810                 815     


Thr Thr Ile Thr Phe Ser His Pro Ala Gly Leu Ser Tyr Arg Tyr Val 
            820                 825                 830         


Ala Glu Gly Gln Lys Gln Gly Gln Leu Arg Ser Leu His Leu Thr Cys 
        835                 840                 845             


Asp Ser Ala Pro Val Gly Ser Gln Gly Thr Trp Ser Thr Ser Cys Arg 
    850                 855                 860                 


Ile Asn His Leu Ile Phe Arg Gly Gly Ala Gln Ile Thr Phe Leu Ala 
865                 870                 875                 880 


Thr Phe Asp Val Ser Pro Lys Ala Val Leu Gly Asp Arg Leu Leu Leu 
                885                 890                 895     


Thr Ala Asn Val Ser Ser Glu Asn Asn Thr Pro Arg Thr Ser Lys Thr 
            900                 905                 910         


Thr Phe Gln Leu Glu Leu Pro Val Lys Tyr Ala Val Tyr Thr Val Val 
        915                 920                 925             


Ser Ser His Glu Gln Phe Thr Lys Tyr Leu Asn Phe Ser Glu Ser Glu 
    930                 935                 940                 


Glu Lys Glu Ser His Val Ala Met His Arg Tyr Gln Val Asn Asn Leu 
945                 950                 955                 960 


Gly Gln Arg Asp Leu Pro Val Ser Ile Asn Phe Trp Val Pro Val Glu 
                965                 970                 975     


Leu Asn Gln Glu Ala Val Trp Met Asp Val Glu Val Ser His Pro Gln 
            980                 985                 990         


Asn Pro Ser Leu Arg Cys Ser Ser  Glu Lys Ile Ala Pro  Pro Ala Ser 
        995                 1000                 1005             


Asp Phe  Leu Ala His Ile Gln  Lys Asn Pro Val Leu  Asp Cys Ser 
    1010                 1015                 1020             


Ile Ala  Gly Cys Leu Arg Phe  Arg Cys Asp Val Pro  Ser Phe Ser 
    1025                 1030                 1035             


Val Gln  Glu Glu Leu Asp Phe  Thr Leu Lys Gly Asn  Leu Ser Phe 
    1040                 1045                 1050             


Gly Trp  Val Arg Gln Ile Leu  Gln Lys Lys Val Ser  Val Val Ser 
    1055                 1060                 1065             


Val Ala  Glu Ile Thr Phe Asp  Thr Ser Val Tyr Ser  Gln Leu Pro 
    1070                 1075                 1080             


Gly Gln  Glu Ala Phe Met Arg  Ala Gln Thr Thr Thr  Val Leu Glu 
    1085                 1090                 1095             


Lys Tyr  Lys Val His Asn Pro  Thr Pro Leu Ile Val  Gly Ser Ser 
    1100                 1105                 1110             


Ile Gly  Gly Leu Leu Leu Leu  Ala Leu Ile Thr Ala  Val Leu Tyr 
    1115                 1120                 1125             


Lys Val  Gly Phe Phe Lys Arg  Gln Tyr Lys Glu Met  Met Glu Glu 
    1130                 1135                 1140             


Ala Asn  Gly Gln Ile Ala Pro  Glu Asn Gly Thr Gln  Thr Pro Ser 
    1145                 1150                 1155             


Pro Pro  Thr Pro His Tyr Pro  Gln Asp Asn Val 
    1160                 1165                 


<210>  280
<211>  4720
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha X (complement component 3 receptor 4
       subunit) (ITGAX), transcript variant 2, mRNA GenBank Accession 
       No. NM_000887.4  GI:556503455

<400>  280
atgctgacaa tcttcttcct tcccctggcc acctctctgc ccacttgctt cctcagtacc       60

ttggtccagc tcttcctgca acggcccagg agctcagagc tccacatctg accttctagt      120

catgaccagg accagggcag cactcctcct gttcacagcc ttagcaactt ctctaggttt      180

caacttggac acagaggagc tgacagcctt ccgtgtggac agcgctgggt ttggagacag      240

cgtggtccag tatgccaact cctgggtggt ggttggagcc ccccaaaaga taacagctgc      300

caaccaaacg ggtggcctct accagtgtgg ctacagcact ggtgcctgtg agcccatcgg      360

cctgcaggtg cccccggagg ccgtgaacat gtccctgggc ctgtccctgg cgtctaccac      420

cagcccttcc cagctgctgg cctgcggccc caccgtgcac cacgagtgcg ggaggaacat      480

gtacctcacc ggactctgct tcctcctggg ccccacccag ctcacccaga ggctcccggt      540

gtccaggcag gagtgcccaa gacaggagca ggacattgtg ttcctgatcg atggctcagg      600

cagcatctcc tcccgcaact ttgccacgat gatgaacttc gtgagagctg tgataagcca      660

gttccagaga cccagcaccc agttttccct gatgcagttc tccaacaaat tccaaacaca      720

cttcactttc gaggaattca ggcgcagctc aaaccccctc agcctgttgg cttctgttca      780

ccagctgcaa gggtttacat acacggccac cgccatccaa aatgtcgtgc accgattgtt      840

ccatgcctca tatggggccc gtagggatgc cgccaaaatt ctcattgtca tcactgatgg      900

gaagaaagaa ggcgacagcc tggattataa ggatgtcatc cccatggctg atgcagcagg      960

catcatccgc tatgcaattg gggttggatt agcttttcaa aacagaaatt cttggaaaga     1020

attaaatgac attgcatcga agccctccca ggaacacata tttaaagtgg aggactttga     1080

tgctctgaaa gatattcaaa accaactgaa ggagaagatc tttgccattg agggtacgga     1140

gaccacaagc agtagctcct tcgaattgga gatggcacag gagggcttca gcgctgtgtt     1200

cacacctgat ggccccgttc tgggggctgt ggggagcttc acctggtctg gaggtgcctt     1260

cctgtacccc ccaaatatga gccctacctt catcaacatg tctcaggaga atgtggacat     1320

gagggactct tacctgggtt actccaccga gctggccctc tggaaagggg tgcagagcct     1380

ggtcctgggg gccccccgct accagcacac cgggaaggct gtcatcttca cccaggtgtc     1440

caggcaatgg aggatgaagg ccgaagtcac ggggactcag atcggctcct acttcggggc     1500

ctccctctgc tccgtggacg tagacagcga cggcagcacc gacctggtcc tcatcggggc     1560

cccccattac tacgagcaga cccgaggggg ccaggtgtct gtgtgtccct tgcccagggg     1620

gtggagaagg tggtggtgtg atgctgttct ctacggggag cagggccacc cctggggtcg     1680

ctttggggcg gctctgacag tgctggggga tgtgaatggg gacaagctga cagacgtggt     1740

catcggggcc ccaggagagg aggagaaccg gggtgctgtc tacctgtttc acggagtctt     1800

gggacccagc atcagcccct cccacagcca gcggatcgcg ggctcccagc tctcctccag     1860

gctgcagtat tttgggcagg cactgagcgg gggtcaagac ctcacccagg atggactggt     1920

ggacctggct gtgggggccc ggggccaggt gctcctgctc aggaccagac ctgtgctctg     1980

ggtgggggtg agcatgcagt tcatacctgc cgagatcccc aggtctgcgt ttgagtgtcg     2040

ggagcaggtg gtctctgagc agaccctggt acagtccaac atctgccttt acattgacaa     2100

acgttctaag aacctgcttg ggagccgtga cctccaaagc tctgtgacct tggacctggc     2160

cctcgaccct ggccgcctga gtccccgtgc caccttccag gaaacaaaga accggagtct     2220

gagccgagtc cgagtcctcg ggctgaaggc acactgtgaa aacttcaacc tgctgctccc     2280

gagctgcgtg gaggactctg tgacccccat taccttgcgt ctgaacttca cgctggtggg     2340

caagcccctc cttgccttca gaaacctgcg gcctatgctg gccgccgatg ctcagagata     2400

cttcacggcc tccctaccct ttgagaagaa ctgtggagcc gaccatatct gccaggacaa     2460

tctcggcatc tccttcagct tcccaggctt gaagtccctg ctggtgggga gtaacctgga     2520

gctgaacgca gaagtgatgg tgtggaatga cggggaagac tcctacggaa ccaccatcac     2580

cttctcccac cccgcaggac tgtcctaccg ctacgtggca gagggccaga aacaagggca     2640

gctgcgttcc ctgcacctga catgtgacag cgccccagtt gggagccagg gcacctggag     2700

caccagctgc agaatcaacc acctcatctt ccgtggcggc gcccagatca ccttcttggc     2760

tacctttgac gtctccccca aggctgtcct gggagaccgg ctgcttctga cagccaatgt     2820

gagcagtgag aacaacactc ccaggaccag caagaccacc ttccagctgg agctcccggt     2880

gaagtatgct gtctacactg tggttagcag ccacgaacaa ttcaccaaat acctcaactt     2940

ctcagagtct gaggagaagg aaagccatgt ggccatgcac agataccagg tcaataacct     3000

gggacagagg gacctgcctg tcagcatcaa cttctgggtg cctgtggagc tgaaccagga     3060

ggctgtgtgg atggatgtgg aggtctccca cccccagaac ccatcccttc ggtgctcctc     3120

agagaaaatc gcacccccag catctgactt cctggcgcac attcagaaga atcccgtgct     3180

ggactgctcc attgctggct gcctgcggtt ccgctgtgac gtcccctcct tcagcgtcca     3240

ggaggagctg gatttcaccc tgaagggcaa cctcagcttt ggctgggtcc gccagatatt     3300

gcagaagaag gtgtcggtcg tgagtgtggc tgaaattacg ttcgacacat ccgtgtactc     3360

ccagcttcca ggacaggagg catttatgag agctcagacg acaacggtgc tggagaagta     3420

caaggtccac aaccccaccc ccctcatcgt aggcagctcc attgggggtc tgttgctgct     3480

ggcactcatc acagcggtac tgtacaaagt tggcttcttc aagcgtcagt acaaggaaat     3540

gatggaggag gcaaatggac aaattgcccc agaaaacggg acacagaccc ccagcccgcc     3600

cagtgagaaa tgatcccctc tttgccttgg acttcttctc ccccgcgagt tttccccact     3660

tacttaccct cacctgtcag gcctgacggg gaggaaccac tgcaccaccg agagaggctg     3720

ggatgggcct gcttcctgtc tttgggagaa aacgtcttgc ttgggaaggg gcctttgtct     3780

tgtcaaggtt ccaactggaa acccttagga cagggtccct gctgtgttcc ccaaaggact     3840

tgacttgcaa tttctaccta gaaatacatg gacaataccc ccaggcctca gtctcccttc     3900

tcccatgagg cacgaatgat ctttctttcc tttctttttt tttttttttc ttttcttttt     3960

tttttttttg agacggagtc tcgctctgtc acccaggctg gagtgcaatg gcgtgatctc     4020

ggctcactgc aacctccgcc tcccgggttc aagtaattct gctgtctcag cctcctgagt     4080

agctgggact acaggcacac gccacctcgc ccggcccgat ctttctaaaa tacagttctg     4140

aatatgctgc tcatccccac ctgtcttcaa cagctcccca ttaccctcag gacaatgtct     4200

gaactctcca gcttcgcgtg agaagtcccc ttccatccca gagggtgggc ttcagggcgc     4260

acagcatgag aggctctgtg cccccatcac cctcgtttcc agtgaattag tgtcatgtca     4320

gcatcagctc agggcttcat cgtggggctc tcagttccga tttcccaggc tgaattggga     4380

gtgagatgcc tgcatgctgg gttctgcaca gctggcctcc cgcgttgggc aacattgctg     4440

gctggaaggg aggagcgccc tctagggagg gacatggccc cggtgcggct gcagctcacc     4500

cagccccagg ggcagaagag acccaaccac ttctattttt tgaggctatg aatatagtac     4560

ctgaaaaaat gccaagacat gattattttt ttaaaaagcg tactttaaat gtttgtgtta     4620

ataaattaaa acatgcacaa aaagatgcat ctaccgctct tgggaaatat gtcaaaggtc     4680

taaaaataaa aaagccttct gtgaaaaaaa aaaaaaaaaa                           4720


<210>  281
<211>  1163
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha X (complement component 3 receptor 4
       subunit) (ITGAX), transcript variant 2, polypeptide GenBank 
       Accession No. NP_000878.2 GI:34452173

<400>  281

Met Thr Arg Thr Arg Ala Ala Leu Leu Leu Phe Thr Ala Leu Ala Thr 
1               5                   10                  15      


Ser Leu Gly Phe Asn Leu Asp Thr Glu Glu Leu Thr Ala Phe Arg Val 
            20                  25                  30          


Asp Ser Ala Gly Phe Gly Asp Ser Val Val Gln Tyr Ala Asn Ser Trp 
        35                  40                  45              


Val Val Val Gly Ala Pro Gln Lys Ile Thr Ala Ala Asn Gln Thr Gly 
    50                  55                  60                  


Gly Leu Tyr Gln Cys Gly Tyr Ser Thr Gly Ala Cys Glu Pro Ile Gly 
65                  70                  75                  80  


Leu Gln Val Pro Pro Glu Ala Val Asn Met Ser Leu Gly Leu Ser Leu 
                85                  90                  95      


Ala Ser Thr Thr Ser Pro Ser Gln Leu Leu Ala Cys Gly Pro Thr Val 
            100                 105                 110         


His His Glu Cys Gly Arg Asn Met Tyr Leu Thr Gly Leu Cys Phe Leu 
        115                 120                 125             


Leu Gly Pro Thr Gln Leu Thr Gln Arg Leu Pro Val Ser Arg Gln Glu 
    130                 135                 140                 


Cys Pro Arg Gln Glu Gln Asp Ile Val Phe Leu Ile Asp Gly Ser Gly 
145                 150                 155                 160 


Ser Ile Ser Ser Arg Asn Phe Ala Thr Met Met Asn Phe Val Arg Ala 
                165                 170                 175     


Val Ile Ser Gln Phe Gln Arg Pro Ser Thr Gln Phe Ser Leu Met Gln 
            180                 185                 190         


Phe Ser Asn Lys Phe Gln Thr His Phe Thr Phe Glu Glu Phe Arg Arg 
        195                 200                 205             


Ser Ser Asn Pro Leu Ser Leu Leu Ala Ser Val His Gln Leu Gln Gly 
    210                 215                 220                 


Phe Thr Tyr Thr Ala Thr Ala Ile Gln Asn Val Val His Arg Leu Phe 
225                 230                 235                 240 


His Ala Ser Tyr Gly Ala Arg Arg Asp Ala Ala Lys Ile Leu Ile Val 
                245                 250                 255     


Ile Thr Asp Gly Lys Lys Glu Gly Asp Ser Leu Asp Tyr Lys Asp Val 
            260                 265                 270         


Ile Pro Met Ala Asp Ala Ala Gly Ile Ile Arg Tyr Ala Ile Gly Val 
        275                 280                 285             


Gly Leu Ala Phe Gln Asn Arg Asn Ser Trp Lys Glu Leu Asn Asp Ile 
    290                 295                 300                 


Ala Ser Lys Pro Ser Gln Glu His Ile Phe Lys Val Glu Asp Phe Asp 
305                 310                 315                 320 


Ala Leu Lys Asp Ile Gln Asn Gln Leu Lys Glu Lys Ile Phe Ala Ile 
                325                 330                 335     


Glu Gly Thr Glu Thr Thr Ser Ser Ser Ser Phe Glu Leu Glu Met Ala 
            340                 345                 350         


Gln Glu Gly Phe Ser Ala Val Phe Thr Pro Asp Gly Pro Val Leu Gly 
        355                 360                 365             


Ala Val Gly Ser Phe Thr Trp Ser Gly Gly Ala Phe Leu Tyr Pro Pro 
    370                 375                 380                 


Asn Met Ser Pro Thr Phe Ile Asn Met Ser Gln Glu Asn Val Asp Met 
385                 390                 395                 400 


Arg Asp Ser Tyr Leu Gly Tyr Ser Thr Glu Leu Ala Leu Trp Lys Gly 
                405                 410                 415     


Val Gln Ser Leu Val Leu Gly Ala Pro Arg Tyr Gln His Thr Gly Lys 
            420                 425                 430         


Ala Val Ile Phe Thr Gln Val Ser Arg Gln Trp Arg Met Lys Ala Glu 
        435                 440                 445             


Val Thr Gly Thr Gln Ile Gly Ser Tyr Phe Gly Ala Ser Leu Cys Ser 
    450                 455                 460                 


Val Asp Val Asp Ser Asp Gly Ser Thr Asp Leu Val Leu Ile Gly Ala 
465                 470                 475                 480 


Pro His Tyr Tyr Glu Gln Thr Arg Gly Gly Gln Val Ser Val Cys Pro 
                485                 490                 495     


Leu Pro Arg Gly Trp Arg Arg Trp Trp Cys Asp Ala Val Leu Tyr Gly 
            500                 505                 510         


Glu Gln Gly His Pro Trp Gly Arg Phe Gly Ala Ala Leu Thr Val Leu 
        515                 520                 525             


Gly Asp Val Asn Gly Asp Lys Leu Thr Asp Val Val Ile Gly Ala Pro 
    530                 535                 540                 


Gly Glu Glu Glu Asn Arg Gly Ala Val Tyr Leu Phe His Gly Val Leu 
545                 550                 555                 560 


Gly Pro Ser Ile Ser Pro Ser His Ser Gln Arg Ile Ala Gly Ser Gln 
                565                 570                 575     


Leu Ser Ser Arg Leu Gln Tyr Phe Gly Gln Ala Leu Ser Gly Gly Gln 
            580                 585                 590         


Asp Leu Thr Gln Asp Gly Leu Val Asp Leu Ala Val Gly Ala Arg Gly 
        595                 600                 605             


Gln Val Leu Leu Leu Arg Thr Arg Pro Val Leu Trp Val Gly Val Ser 
    610                 615                 620                 


Met Gln Phe Ile Pro Ala Glu Ile Pro Arg Ser Ala Phe Glu Cys Arg 
625                 630                 635                 640 


Glu Gln Val Val Ser Glu Gln Thr Leu Val Gln Ser Asn Ile Cys Leu 
                645                 650                 655     


Tyr Ile Asp Lys Arg Ser Lys Asn Leu Leu Gly Ser Arg Asp Leu Gln 
            660                 665                 670         


Ser Ser Val Thr Leu Asp Leu Ala Leu Asp Pro Gly Arg Leu Ser Pro 
        675                 680                 685             


Arg Ala Thr Phe Gln Glu Thr Lys Asn Arg Ser Leu Ser Arg Val Arg 
    690                 695                 700                 


Val Leu Gly Leu Lys Ala His Cys Glu Asn Phe Asn Leu Leu Leu Pro 
705                 710                 715                 720 


Ser Cys Val Glu Asp Ser Val Thr Pro Ile Thr Leu Arg Leu Asn Phe 
                725                 730                 735     


Thr Leu Val Gly Lys Pro Leu Leu Ala Phe Arg Asn Leu Arg Pro Met 
            740                 745                 750         


Leu Ala Ala Asp Ala Gln Arg Tyr Phe Thr Ala Ser Leu Pro Phe Glu 
        755                 760                 765             


Lys Asn Cys Gly Ala Asp His Ile Cys Gln Asp Asn Leu Gly Ile Ser 
    770                 775                 780                 


Phe Ser Phe Pro Gly Leu Lys Ser Leu Leu Val Gly Ser Asn Leu Glu 
785                 790                 795                 800 


Leu Asn Ala Glu Val Met Val Trp Asn Asp Gly Glu Asp Ser Tyr Gly 
                805                 810                 815     


Thr Thr Ile Thr Phe Ser His Pro Ala Gly Leu Ser Tyr Arg Tyr Val 
            820                 825                 830         


Ala Glu Gly Gln Lys Gln Gly Gln Leu Arg Ser Leu His Leu Thr Cys 
        835                 840                 845             


Asp Ser Ala Pro Val Gly Ser Gln Gly Thr Trp Ser Thr Ser Cys Arg 
    850                 855                 860                 


Ile Asn His Leu Ile Phe Arg Gly Gly Ala Gln Ile Thr Phe Leu Ala 
865                 870                 875                 880 


Thr Phe Asp Val Ser Pro Lys Ala Val Leu Gly Asp Arg Leu Leu Leu 
                885                 890                 895     


Thr Ala Asn Val Ser Ser Glu Asn Asn Thr Pro Arg Thr Ser Lys Thr 
            900                 905                 910         


Thr Phe Gln Leu Glu Leu Pro Val Lys Tyr Ala Val Tyr Thr Val Val 
        915                 920                 925             


Ser Ser His Glu Gln Phe Thr Lys Tyr Leu Asn Phe Ser Glu Ser Glu 
    930                 935                 940                 


Glu Lys Glu Ser His Val Ala Met His Arg Tyr Gln Val Asn Asn Leu 
945                 950                 955                 960 


Gly Gln Arg Asp Leu Pro Val Ser Ile Asn Phe Trp Val Pro Val Glu 
                965                 970                 975     


Leu Asn Gln Glu Ala Val Trp Met Asp Val Glu Val Ser His Pro Gln 
            980                 985                 990         


Asn Pro Ser Leu Arg Cys Ser Ser  Glu Lys Ile Ala Pro  Pro Ala Ser 
        995                 1000                 1005             


Asp Phe  Leu Ala His Ile Gln  Lys Asn Pro Val Leu  Asp Cys Ser 
    1010                 1015                 1020             


Ile Ala  Gly Cys Leu Arg Phe  Arg Cys Asp Val Pro  Ser Phe Ser 
    1025                 1030                 1035             


Val Gln  Glu Glu Leu Asp Phe  Thr Leu Lys Gly Asn  Leu Ser Phe 
    1040                 1045                 1050             


Gly Trp  Val Arg Gln Ile Leu  Gln Lys Lys Val Ser  Val Val Ser 
    1055                 1060                 1065             


Val Ala  Glu Ile Thr Phe Asp  Thr Ser Val Tyr Ser  Gln Leu Pro 
    1070                 1075                 1080             


Gly Gln  Glu Ala Phe Met Arg  Ala Gln Thr Thr Thr  Val Leu Glu 
    1085                 1090                 1095             


Lys Tyr  Lys Val His Asn Pro  Thr Pro Leu Ile Val  Gly Ser Ser 
    1100                 1105                 1110             


Ile Gly  Gly Leu Leu Leu Leu  Ala Leu Ile Thr Ala  Val Leu Tyr 
    1115                 1120                 1125             


Lys Val  Gly Phe Phe Lys Arg  Gln Tyr Lys Glu Met  Met Glu Glu 
    1130                 1135                 1140             


Ala Asn  Gly Gln Ile Ala Pro  Glu Asn Gly Thr Gln  Thr Pro Ser 
    1145                 1150                 1155             


Pro Pro  Ser Glu Lys 
    1160             


<210>  282
<211>  7878
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 
       receptor) (ITGA2), transcript variant 1, mRNA GenBank Accession 
       No. NM_002203.3  GI:116295257

<400>  282
ttttccctgc tctcaccggg cgggggagag aagccctctg gacagcttct agagtgtgca       60

ggttctcgta tccctcggcc aagggtatcc tctgcaaacc tctgcaaacc cagcgcaact      120

acggtccccc ggtcagaccc aggatggggc cagaacggac aggggccgcg ccgctgccgc      180

tgctgctggt gttagcgctc agtcaaggca ttttaaattg ttgtttggcc tacaatgttg      240

gtctcccaga agcaaaaata ttttccggtc cttcaagtga acagtttggc tatgcagtgc      300

agcagtttat aaatccaaaa ggcaactggt tactggttgg ttcaccctgg agtggctttc      360

ctgagaaccg aatgggagat gtgtataaat gtcctgttga cctatccact gccacatgtg      420

aaaaactaaa tttgcaaact tcaacaagca ttccaaatgt tactgagatg aaaaccaaca      480

tgagcctcgg cttgatcctc accaggaaca tgggaactgg aggttttctc acatgtggtc      540

ctctgtgggc acagcaatgt gggaatcagt attacacaac gggtgtgtgt tctgacatca      600

gtcctgattt tcagctctca gccagcttct cacctgcaac tcagccctgc ccttccctca      660

tagatgttgt ggttgtgtgt gatgaatcaa atagtattta tccttgggat gcagtaaaga      720

attttttgga aaaatttgta caaggcctgg atataggccc cacaaagaca caggtggggt      780

taattcagta tgccaataat ccaagagttg tgtttaactt gaacacatat aaaaccaaag      840

aagaaatgat tgtagcaaca tcccagacat cccaatatgg tggggacctc acaaacacat      900

tcggagcaat tcaatatgca agaaaatatg cttattcagc agcttctggt gggcgacgaa      960

gtgctacgaa agtaatggta gttgtaactg acggtgaatc acatgatggt tcaatgttga     1020

aagctgtgat tgatcaatgc aaccatgaca atatactgag gtttggcata gcagttcttg     1080

ggtacttaaa cagaaacgcc cttgatacta aaaatttaat aaaagaaata aaagcaatcg     1140

ctagtattcc aacagaaaga tactttttca atgtgtctga tgaagcagct ctactagaaa     1200

aggctgggac attaggagaa caaattttca gcattgaagg tactgttcaa ggaggagaca     1260

actttcagat ggaaatgtca caagtgggat tcagtgcaga ttactcttct caaaatgata     1320

ttctgatgct gggtgcagtg ggagcttttg gctggagtgg gaccattgtc cagaagacat     1380

ctcatggcca tttgatcttt cctaaacaag cctttgacca aattctgcag gacagaaatc     1440

acagttcata tttaggttac tctgtggctg caatttctac tggagaaagc actcactttg     1500

ttgctggtgc tcctcgggca aattataccg gccagatagt gctatatagt gtgaatgaga     1560

atggcaatat cacggttatt caggctcacc gaggtgacca gattggctcc tattttggta     1620

gtgtgctgtg ttcagttgat gtggataaag acaccattac agacgtgctc ttggtaggtg     1680

caccaatgta catgagtgac ctaaagaaag aggaaggaag agtctacctg tttactatca     1740

aagagggcat tttgggtcag caccaatttc ttgaaggccc cgagggcatt gaaaacactc     1800

gatttggttc agcaattgca gctctttcag acatcaacat ggatggcttt aatgatgtga     1860

ttgttggttc accactagaa aatcagaatt ctggagctgt atacatttac aatggtcatc     1920

agggcactat ccgcacaaag tattcccaga aaatcttggg atccgatgga gcctttagga     1980

gccatctcca gtactttggg aggtccttgg atggctatgg agatttaaat ggggattcca     2040

tcaccgatgt gtctattggt gcctttggac aagtggttca actctggtca caaagtattg     2100

ctgatgtagc tatagaagct tcattcacac cagaaaaaat cactttggtc aacaagaatg     2160

ctcagataat tctcaaactc tgcttcagtg caaagttcag acctactaag caaaacaatc     2220

aagtggccat tgtatataac atcacacttg atgcagatgg attttcatcc agagtaacct     2280

ccagggggtt atttaaagaa aacaatgaaa ggtgcctgca gaagaatatg gtagtaaatc     2340

aagcacagag ttgccccgag cacatcattt atatacagga gccctctgat gttgtcaact     2400

ctttggattt gcgtgtggac atcagtctgg aaaaccctgg cactagccct gcccttgaag     2460

cctattctga gactgccaag gtcttcagta ttcctttcca caaagactgt ggtgaggacg     2520

gactttgcat ttctgatcta gtcctagatg tccgacaaat accagctgct caagaacaac     2580

cctttattgt cagcaaccaa aacaaaaggt taacattttc agtaacgctg aaaaataaaa     2640

gggaaagtgc atacaacact ggaattgttg ttgatttttc agaaaacttg ttttttgcat     2700

cattctccct gccggttgat gggacagaag taacatgcca ggtggctgca tctcagaagt     2760

ctgttgcctg cgatgtaggc taccctgctt taaagagaga acaacaggtg acttttacta     2820

ttaactttga cttcaatctt caaaaccttc agaatcaggc gtctctcagt ttccaagcct     2880

taagtgaaag ccaagaagaa aacaaggctg ataatttggt caacctcaaa attcctctcc     2940

tgtatgatgc tgaaattcac ttaacaagat ctaccaacat aaatttttat gaaatctctt     3000

cggatgggaa tgttccttca atcgtgcaca gttttgaaga tgttggtcca aaattcatct     3060

tctccctgaa ggtaacaaca ggaagtgttc cagtaagcat ggcaactgta atcatccaca     3120

tccctcagta taccaaagaa aagaacccac tgatgtacct aactggggtg caaacagaca     3180

aggctggtga catcagttgt aatgcagata tcaatccact gaaaatagga caaacatctt     3240

cttctgtatc tttcaaaagt gaaaatttca ggcacaccaa agaattgaac tgcagaactg     3300

cttcctgtag taatgttacc tgctggttga aagacgttca catgaaagga gaatactttg     3360

ttaatgtgac taccagaatt tggaacggga ctttcgcatc atcaacgttc cagacagtac     3420

agctaacggc agctgcagaa atcaacacct ataaccctga gatatatgtg attgaagata     3480

acactgttac gattcccctg atgataatga aacctgatga gaaagccgaa gtaccaacag     3540

gagttataat aggaagtata attgctggaa tccttttgct gttagctctg gttgcaattt     3600

tatggaagct cggcttcttc aaaagaaaat atgaaaagat gaccaaaaat ccagatgaga     3660

ttgatgagac cacagagctc agtagctgaa ccagcagacc tacctgcagt gggaaccggc     3720

agcatcccag ccagggtttg ctgtttgcgt gaatggattt ctttttaaat cccatatttt     3780

ttttatcatg tcgtaggtaa actaacctgg tattttaaga gaaaactgca ggtcagtttg     3840

gaatgaagaa attgtggggg gtgggggagg tgcggggggc aggtagggaa ataataggga     3900

aaatacctat tttatatgat gggggaaaaa aagtaatctt taaactggct ggcccagagt     3960

ttacattcta atttgcattg tgtcagaaac atgaaatgct tccaagcatg acaactttta     4020

aagaaaaata tgatactctc agattttaag ggggaaaact gttctcttta aaatatttgt     4080

ctttaaacag caactacaga agtggaagtg cttgatatgt aagtacttcc acttgtgtat     4140

attttaatga atattgatgt taacaagagg ggaaaacaaa acacaggttt tttcaattta     4200

tgctgctcat ccaaagttgc cacagatgat acttccaagt gataatttta tttataaact     4260

aggtaaaatt tgttgttggt tccttttaga ccacggctgc cccttccaca ccccatcttg     4320

ctctaatgat caaaacatgc ttgaataact gagcttagag tatacctcct atatgtccat     4380

ttaagttagg agagggggcg atatagagaa taaggcacaa aattttgttt aaaactcaga     4440

atataacatg taaaatccca tctgctagaa gcccatcctg tgccagagga aggaaaagga     4500

ggaaatttcc tttctctttt aggaggcaca acagttctct tctaggattt gtttggctga     4560

ctggcagtaa cctagtgaat ttctgaaaga tgagtaattt ctttggcaac cttcctcctc     4620

ccttactgaa ccactctccc acctcctggt ggtaccatta ttatagaagc cctctacagc     4680

ctgactttct ctccagcggt ccaaagttat cccctccttt acccctcatc caaagttccc     4740

actccttcag gacagctgct gtgcattaga tattaggggg gaaagtcatc tgtttaattt     4800

acacacttgc atgaattact gtatataaac tccttaactt cagggagcta ttttcattta     4860

gtgctaaaca agtaagaaaa ataagctcga gtgaatttct aaatgttgga atgttatggg     4920

atgtaaacaa tgtaaagtaa gacatctcag gatttcacca gaagttacag atgaggcact     4980

ggaagccacc aaattagcag gtgcaccttc tgtggctgtc ttgtttctga agtacttaaa     5040

cttccacaag agtgaatttg acctaggcaa gtttgttcaa aaggtagatc ctgagatgat     5100

ttggtcagat tgggataagg cccagcaatc tgcattttaa caagcacccc agtcactagg     5160

atgcagatgg accacacttt gagaaacacc acccatttct actttttgca ccttattttc     5220

tctgttcctg agcccccaca ttctctagga gaaacttaga ggaaaagggc acagacacta     5280

catatctaaa gctttggaca agtccttgac ctctataaac ttcagagtcc tcattataaa     5340

atgggaagac tgagctggag ttcagcagtg atgcttttag ttttaaaagt ctatgatctg     5400

gacttcctat aatacaaata cacaatcctc caagaatttg acttggaaaa aaatgtcaaa     5460

ggaaaacagg ttatctgccc atgtgcatat ggacaacctt gactaccctg gcctggcccg     5520

tggtggcagt ccagggctat ctgtactgtt tacagaatta ctttgtagtt gacaacacaa     5580

aacaaacaaa aaaggcataa aatgccagcg gtttatagaa aaaacagcat ggtattctcc     5640

agttaggtat gccagagtcc aattctttta acagctgtga gaatttgctg cttcattcca     5700

acaaaatttt atttaaaaaa aaaaaaaaaa gactggagaa actagtcatt agcttgataa     5760

agaatattta acagctagtg gtgctggtgt gtacctgaag ctccagctac ttgagagact     5820

gagacaggaa gatcgcttga gcccaggagt tcaagtccag cctaagcaac atagcaagac     5880

cctgtctcaa aaaaatgact atttaaaaag acaatgtggc caggcacggt ggctcacacc     5940

tgtaatccca acactttggg aggctgaggc cggtggatca cgaggtcagg agtttgagac     6000

tagcctggcc aacatggtga aaccccatct ctaataatat aaaaattagc tgggcgtagt     6060

agcaggtgcc tgtaatccca gttactcggg aagctgaggc aggagaatca cttgaacccg     6120

ggaggcagag gtttcagtga gccgagatcg cgccactgca ctccagcctg ggtgacaggg     6180

caagactctg tctcaaacaa acaaacaaaa aaaaagttag tactgtatat gtaaatacta     6240

gcttttcaat gtgctataca aacaattata gcacatcctt ccttttactc tgtctcacct     6300

cctttaggtg agtacttcct taaataagtg ctaaacatac atatacggaa cttgaaagct     6360

ttggttagcc ttgccttagg taatcagcct agtttacact gtttccaggg agtagttgaa     6420

ttactataaa ccattagcca cttgtctctg caccatttat cacaccagga cagggtctct     6480

caacctgggc gctactgtca tttggggcca ggtgattctt ccttgcaggg gctgtcctgt     6540

accttgtagg acagcagccc tgtcctagaa ggtatgttta gcagcattcc tggcctctag     6600

ctacccgatg ccagagcatg ctccccccgc agtcatgaca atcaaaaaat gtctccagac     6660

attgtcaaat gcctcctggg gggcagtatt tctcaagcac ttttaagcaa aggtaagtat     6720

tcatacaaga aatttagggg gaaaaaacat tgtttaaata aaagctatgt gttcctattc     6780

aacaatattt ttgctttaaa agtaagtaga gggcataaaa gatgtcatat tcaaatttcc     6840

atttcataaa tggtgtacag acaaggtcta tagaatgtgg taaaaacttg actgcaacac     6900

aaggcttata aaatagtaag atagtaaaat agcttatgaa gaaactacag agatttaaaa     6960

ttgtgcatga ctcatttcag cagcaaaata agaactccta actgaacaga aatttttcta     7020

cctagcaatg ttattcttgt aaaatagtta cctattaaaa ctgtgaagag taaaactaaa     7080

gccaatttat tatagtcaca caagtgatta tactaaaaat tattataaag gttataattt     7140

tataatgtat ttacctgtcc tgatatatag ctataaccca atatatgaaa atctcaaaaa     7200

ttaagacatc atcatacaga aggcaggatt ccttaaactg agatccctga tccatcttta     7260

atatttcaat ttgcacacat aaaacaatgc ccttttgtgt acattcaggc atacccattt     7320

taatcaattt gaaaggttaa tttaaacctc tagaggtgaa tgagaaacat gggggaaaag     7380

tatgaaatag gtgaaaatct taactatttc tttgaactct aaagactgaa actgtagcca     7440

ttatgtaaat aaagtttcat atgtacctgt ttattttggc agattaagtc aaaatatgaa     7500

tgtatatatt gcataactat gttagaattg tatatatttt aaagaaattg tcttggatat     7560

tttcctttat acataataga taagtctttt ttcaaatgtg gtgtttgatg tttttgatta     7620

aatgtgtttt gcctctttcc acaaaaactg taaaaataaa tgcatgtttg tacaaaaagt     7680

tgcagaattc atttgattta tgagaaacaa aaattaaatt gtagtcaaca gttagtagtt     7740

tttctcatat ccaagtataa caaacagaaa agtttcatta ttgtaaccca cttttttcat     7800

accacattat tgaatattgt tacaattgtt ttgaaaataa agccattttc tttgggcttt     7860

tataagttaa aaaaaaaa                                                   7878


<210>  283
<211>  1181
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 
       receptor) (ITGA2), transcript variant 1, polypeptide GenBank 
       Accession No. NP_002194.2 GI:116295258

<400>  283

Met Gly Pro Glu Arg Thr Gly Ala Ala Pro Leu Pro Leu Leu Leu Val 
1               5                   10                  15      


Leu Ala Leu Ser Gln Gly Ile Leu Asn Cys Cys Leu Ala Tyr Asn Val 
            20                  25                  30          


Gly Leu Pro Glu Ala Lys Ile Phe Ser Gly Pro Ser Ser Glu Gln Phe 
        35                  40                  45              


Gly Tyr Ala Val Gln Gln Phe Ile Asn Pro Lys Gly Asn Trp Leu Leu 
    50                  55                  60                  


Val Gly Ser Pro Trp Ser Gly Phe Pro Glu Asn Arg Met Gly Asp Val 
65                  70                  75                  80  


Tyr Lys Cys Pro Val Asp Leu Ser Thr Ala Thr Cys Glu Lys Leu Asn 
                85                  90                  95      


Leu Gln Thr Ser Thr Ser Ile Pro Asn Val Thr Glu Met Lys Thr Asn 
            100                 105                 110         


Met Ser Leu Gly Leu Ile Leu Thr Arg Asn Met Gly Thr Gly Gly Phe 
        115                 120                 125             


Leu Thr Cys Gly Pro Leu Trp Ala Gln Gln Cys Gly Asn Gln Tyr Tyr 
    130                 135                 140                 


Thr Thr Gly Val Cys Ser Asp Ile Ser Pro Asp Phe Gln Leu Ser Ala 
145                 150                 155                 160 


Ser Phe Ser Pro Ala Thr Gln Pro Cys Pro Ser Leu Ile Asp Val Val 
                165                 170                 175     


Val Val Cys Asp Glu Ser Asn Ser Ile Tyr Pro Trp Asp Ala Val Lys 
            180                 185                 190         


Asn Phe Leu Glu Lys Phe Val Gln Gly Leu Asp Ile Gly Pro Thr Lys 
        195                 200                 205             


Thr Gln Val Gly Leu Ile Gln Tyr Ala Asn Asn Pro Arg Val Val Phe 
    210                 215                 220                 


Asn Leu Asn Thr Tyr Lys Thr Lys Glu Glu Met Ile Val Ala Thr Ser 
225                 230                 235                 240 


Gln Thr Ser Gln Tyr Gly Gly Asp Leu Thr Asn Thr Phe Gly Ala Ile 
                245                 250                 255     


Gln Tyr Ala Arg Lys Tyr Ala Tyr Ser Ala Ala Ser Gly Gly Arg Arg 
            260                 265                 270         


Ser Ala Thr Lys Val Met Val Val Val Thr Asp Gly Glu Ser His Asp 
        275                 280                 285             


Gly Ser Met Leu Lys Ala Val Ile Asp Gln Cys Asn His Asp Asn Ile 
    290                 295                 300                 


Leu Arg Phe Gly Ile Ala Val Leu Gly Tyr Leu Asn Arg Asn Ala Leu 
305                 310                 315                 320 


Asp Thr Lys Asn Leu Ile Lys Glu Ile Lys Ala Ile Ala Ser Ile Pro 
                325                 330                 335     


Thr Glu Arg Tyr Phe Phe Asn Val Ser Asp Glu Ala Ala Leu Leu Glu 
            340                 345                 350         


Lys Ala Gly Thr Leu Gly Glu Gln Ile Phe Ser Ile Glu Gly Thr Val 
        355                 360                 365             


Gln Gly Gly Asp Asn Phe Gln Met Glu Met Ser Gln Val Gly Phe Ser 
    370                 375                 380                 


Ala Asp Tyr Ser Ser Gln Asn Asp Ile Leu Met Leu Gly Ala Val Gly 
385                 390                 395                 400 


Ala Phe Gly Trp Ser Gly Thr Ile Val Gln Lys Thr Ser His Gly His 
                405                 410                 415     


Leu Ile Phe Pro Lys Gln Ala Phe Asp Gln Ile Leu Gln Asp Arg Asn 
            420                 425                 430         


His Ser Ser Tyr Leu Gly Tyr Ser Val Ala Ala Ile Ser Thr Gly Glu 
        435                 440                 445             


Ser Thr His Phe Val Ala Gly Ala Pro Arg Ala Asn Tyr Thr Gly Gln 
    450                 455                 460                 


Ile Val Leu Tyr Ser Val Asn Glu Asn Gly Asn Ile Thr Val Ile Gln 
465                 470                 475                 480 


Ala His Arg Gly Asp Gln Ile Gly Ser Tyr Phe Gly Ser Val Leu Cys 
                485                 490                 495     


Ser Val Asp Val Asp Lys Asp Thr Ile Thr Asp Val Leu Leu Val Gly 
            500                 505                 510         


Ala Pro Met Tyr Met Ser Asp Leu Lys Lys Glu Glu Gly Arg Val Tyr 
        515                 520                 525             


Leu Phe Thr Ile Lys Glu Gly Ile Leu Gly Gln His Gln Phe Leu Glu 
    530                 535                 540                 


Gly Pro Glu Gly Ile Glu Asn Thr Arg Phe Gly Ser Ala Ile Ala Ala 
545                 550                 555                 560 


Leu Ser Asp Ile Asn Met Asp Gly Phe Asn Asp Val Ile Val Gly Ser 
                565                 570                 575     


Pro Leu Glu Asn Gln Asn Ser Gly Ala Val Tyr Ile Tyr Asn Gly His 
            580                 585                 590         


Gln Gly Thr Ile Arg Thr Lys Tyr Ser Gln Lys Ile Leu Gly Ser Asp 
        595                 600                 605             


Gly Ala Phe Arg Ser His Leu Gln Tyr Phe Gly Arg Ser Leu Asp Gly 
    610                 615                 620                 


Tyr Gly Asp Leu Asn Gly Asp Ser Ile Thr Asp Val Ser Ile Gly Ala 
625                 630                 635                 640 


Phe Gly Gln Val Val Gln Leu Trp Ser Gln Ser Ile Ala Asp Val Ala 
                645                 650                 655     


Ile Glu Ala Ser Phe Thr Pro Glu Lys Ile Thr Leu Val Asn Lys Asn 
            660                 665                 670         


Ala Gln Ile Ile Leu Lys Leu Cys Phe Ser Ala Lys Phe Arg Pro Thr 
        675                 680                 685             


Lys Gln Asn Asn Gln Val Ala Ile Val Tyr Asn Ile Thr Leu Asp Ala 
    690                 695                 700                 


Asp Gly Phe Ser Ser Arg Val Thr Ser Arg Gly Leu Phe Lys Glu Asn 
705                 710                 715                 720 


Asn Glu Arg Cys Leu Gln Lys Asn Met Val Val Asn Gln Ala Gln Ser 
                725                 730                 735     


Cys Pro Glu His Ile Ile Tyr Ile Gln Glu Pro Ser Asp Val Val Asn 
            740                 745                 750         


Ser Leu Asp Leu Arg Val Asp Ile Ser Leu Glu Asn Pro Gly Thr Ser 
        755                 760                 765             


Pro Ala Leu Glu Ala Tyr Ser Glu Thr Ala Lys Val Phe Ser Ile Pro 
    770                 775                 780                 


Phe His Lys Asp Cys Gly Glu Asp Gly Leu Cys Ile Ser Asp Leu Val 
785                 790                 795                 800 


Leu Asp Val Arg Gln Ile Pro Ala Ala Gln Glu Gln Pro Phe Ile Val 
                805                 810                 815     


Ser Asn Gln Asn Lys Arg Leu Thr Phe Ser Val Thr Leu Lys Asn Lys 
            820                 825                 830         


Arg Glu Ser Ala Tyr Asn Thr Gly Ile Val Val Asp Phe Ser Glu Asn 
        835                 840                 845             


Leu Phe Phe Ala Ser Phe Ser Leu Pro Val Asp Gly Thr Glu Val Thr 
    850                 855                 860                 


Cys Gln Val Ala Ala Ser Gln Lys Ser Val Ala Cys Asp Val Gly Tyr 
865                 870                 875                 880 


Pro Ala Leu Lys Arg Glu Gln Gln Val Thr Phe Thr Ile Asn Phe Asp 
                885                 890                 895     


Phe Asn Leu Gln Asn Leu Gln Asn Gln Ala Ser Leu Ser Phe Gln Ala 
            900                 905                 910         


Leu Ser Glu Ser Gln Glu Glu Asn Lys Ala Asp Asn Leu Val Asn Leu 
        915                 920                 925             


Lys Ile Pro Leu Leu Tyr Asp Ala Glu Ile His Leu Thr Arg Ser Thr 
    930                 935                 940                 


Asn Ile Asn Phe Tyr Glu Ile Ser Ser Asp Gly Asn Val Pro Ser Ile 
945                 950                 955                 960 


Val His Ser Phe Glu Asp Val Gly Pro Lys Phe Ile Phe Ser Leu Lys 
                965                 970                 975     


Val Thr Thr Gly Ser Val Pro Val Ser Met Ala Thr Val Ile Ile His 
            980                 985                 990         


Ile Pro Gln Tyr Thr Lys Glu Lys  Asn Pro Leu Met Tyr  Leu Thr Gly 
        995                 1000                 1005             


Val Gln  Thr Asp Lys Ala Gly  Asp Ile Ser Cys Asn  Ala Asp Ile 
    1010                 1015                 1020             


Asn Pro  Leu Lys Ile Gly Gln  Thr Ser Ser Ser Val  Ser Phe Lys 
    1025                 1030                 1035             


Ser Glu  Asn Phe Arg His Thr  Lys Glu Leu Asn Cys  Arg Thr Ala 
    1040                 1045                 1050             


Ser Cys  Ser Asn Val Thr Cys  Trp Leu Lys Asp Val  His Met Lys 
    1055                 1060                 1065             


Gly Glu  Tyr Phe Val Asn Val  Thr Thr Arg Ile Trp  Asn Gly Thr 
    1070                 1075                 1080             


Phe Ala  Ser Ser Thr Phe Gln  Thr Val Gln Leu Thr  Ala Ala Ala 
    1085                 1090                 1095             


Glu Ile  Asn Thr Tyr Asn Pro  Glu Ile Tyr Val Ile  Glu Asp Asn 
    1100                 1105                 1110             


Thr Val  Thr Ile Pro Leu Met  Ile Met Lys Pro Asp  Glu Lys Ala 
    1115                 1120                 1125             


Glu Val  Pro Thr Gly Val Ile  Ile Gly Ser Ile Ile  Ala Gly Ile 
    1130                 1135                 1140             


Leu Leu  Leu Leu Ala Leu Val  Ala Ile Leu Trp Lys  Leu Gly Phe 
    1145                 1150                 1155             


Phe Lys  Arg Lys Tyr Glu Lys  Met Thr Lys Asn Pro  Asp Glu Ile 
    1160                 1165                 1170             


Asp Glu  Thr Thr Glu Leu Ser  Ser 
    1175                 1180     


<210>  284
<211>  3334
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha 2b (platelet glycoprotein IIb of 
       IIb/IIIa complex, antigen CD41) (ITGA2B), mRNA GenBank Accession
       No. NM_000419.3  GI:88758614

<400>  284
attcctgcct gggaggttgt ggaagaagga agatggccag agctttgtgt ccactgcaag       60

ccctctggct tctggagtgg gtgctgctgc tcttgggacc ttgtgctgcc cctccagcct      120

gggccttgaa cctggaccca gtgcagctca ccttctatgc aggccccaat ggcagccagt      180

ttggattttc actggacttc cacaaggaca gccatgggag agtggccatc gtggtgggcg      240

ccccgcggac cctgggcccc agccaggagg agacgggcgg cgtgttcctg tgcccctgga      300

gggccgaggg cggccagtgc ccctcgctgc tctttgacct ccgtgatgag acccgaaatg      360

taggctccca aactttacaa accttcaagg cccgccaagg actgggggcg tcggtcgtca      420

gctggagcga cgtcattgtg gcctgcgccc cctggcagca ctggaacgtc ctagaaaaga      480

ctgaggaggc tgagaagacg cccgtaggta gctgcttttt ggctcagcca gagagcggcc      540

gccgcgccga gtactccccc tgtcgcggga acaccctgag ccgcatttac gtggaaaatg      600

attttagctg ggacaagcgt tactgtgaag cgggcttcag ctccgtggtc actcaggccg      660

gagagctggt gcttggggct cctggcggct attatttctt aggtctcctg gcccaggctc      720

cagttgcgga tattttctcg agttaccgcc caggcatcct tttgtggcac gtgtcctccc      780

agagcctctc ctttgactcc agcaacccag agtacttcga cggctactgg gggtactcgg      840

tggccgtggg cgagttcgac ggggatctca acactacaga atatgtcgtc ggtgccccca      900

cttggagctg gaccctggga gcggtggaaa ttttggattc ctactaccag aggctgcatc      960

ggctgcgcgg agagcagatg gcgtcgtatt ttgggcattc agtggctgtc actgacgtca     1020

acggggatgg gaggcatgat ctgctggtgg gcgctccact gtatatggag agccgggcag     1080

accgaaaact ggccgaagtg gggcgtgtgt atttgttcct gcagccgcga ggcccccacg     1140

cgctgggtgc ccccagcctc ctgctgactg gcacacagct ctatgggcga ttcggctctg     1200

ccatcgcacc cctgggcgac ctcgaccggg atggctacaa tgacattgca gtggctgccc     1260

cctacggggg tcccagtggc cggggccaag tgctggtgtt cctgggtcag agtgaggggc     1320

tgaggtcacg tccctcccag gtcctggaca gccccttccc cacaggctct gcctttggct     1380

tctcccttcg aggtgccgta gacatcgatg acaacggata cccagacctg atcgtgggag     1440

cttacggggc caaccaggtg gctgtgtaca gagctcagcc agtggtgaag gcctctgtcc     1500

agctactggt gcaagattca ctgaatcctg ctgtgaagag ctgtgtccta cctcagacca     1560

agacacccgt gagctgcttc aacatccaga tgtgtgttgg agccactggg cacaacattc     1620

ctcagaagct atccctaaat gccgagctgc agctggaccg gcagaagccc cgccagggcc     1680

ggcgggtgct gctgctgggc tctcaacagg caggcaccac cctgaacctg gatctgggcg     1740

gaaagcacag ccccatctgc cacaccacca tggccttcct tcgagatgag gcagacttcc     1800

gggacaagct gagccccatt gtgctcagcc tcaatgtgtc cctaccgccc acggaggctg     1860

gaatggcccc tgctgtcgtg ctgcatggag acacccatgt gcaggagcag acacgaatcg     1920

tcctggactg tggggaagat gacgtatgtg tgccccagct tcagctcact gccagcgtga     1980

cgggctcccc gctcctagtt ggggcagata atgtcctgga gctgcagatg gacgcagcca     2040

acgagggcga gggggcctat gaagcagagc tggccgtgca cctgccccag ggcgcccact     2100

acatgcgggc cctaagcaat gtcgagggct ttgagagact catctgtaat cagaagaagg     2160

agaatgagac cagggtggtg ctgtgtgagc tgggcaaccc catgaagaag aacgcccaga     2220

taggaatcgc gatgttggtg agcgtgggga atctggaaga ggctggggag tctgtgtcct     2280

tccagctgca gatacggagc aagaacagcc agaatccaaa cagcaagatt gtgctgctgg     2340

acgtgccggt ccgggcagag gcccaagtgg agctgcgagg gaactccttt ccagcctccc     2400

tggtggtggc agcagaagaa ggtgagaggg agcagaacag cttggacagc tggggaccca     2460

aagtggagca cacctatgag ctccacaaca atggccctgg gactgtgaat ggtcttcacc     2520

tcagcatcca ccttccggga cagtcccagc cctccgacct gctctacatc ctggatatac     2580

agccccaggg gggccttcag tgcttcccac agcctcctgt caaccctctc aaggtggact     2640

gggggctgcc catccccagc ccctccccca ttcacccggc ccatcacaag cgggatcgca     2700

gacagatctt cctgccagag cccgagcagc cctcgaggct tcaggatcca gttctcgtaa     2760

gctgcgactc ggcgccctgt actgtggtgc agtgtgacct gcaggagatg gcgcgcgggc     2820

agcgggccat ggtcacggtg ctggccttcc tgtggctgcc cagcctctac cagaggcctc     2880

tggatcagtt tgtgctgcag tcgcacgcat ggttcaacgt gtcctccctc ccctatgcgg     2940

tgcccccgct cagcctgccc cgaggggaag ctcaggtgtg gacacagctg ctccgggcct     3000

tggaggagag ggccattcca atctggtggg tgctggtggg tgtgctgggt ggcctgctgc     3060

tgctcaccat cctggtcctg gccatgtgga aggtcggctt cttcaagcgg aaccggccac     3120

ccctggaaga agatgatgaa gagggggagt gatggtgcag cctacactat tctagcagga     3180

gggttgggcg tgctacctgc accgcccctt ctccaacaag ttgcctccaa gctttgggtt     3240

ggagctgttc cattgggtcc tcttggtgtc gtttccctcc caacagagct gggctacccc     3300

ccctcctgct gcctaataaa gagactgagc cctg                                 3334


<210>  285
<211>  1039
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha 2b (platelet glycoprotein IIb of 
       IIb/IIIa complex, antigen CD41) (ITGA2B), polypeptide GenBank 
       Accession No. NP_000410.2 GI:88758615

<400>  285

Met Ala Arg Ala Leu Cys Pro Leu Gln Ala Leu Trp Leu Leu Glu Trp 
1               5                   10                  15      


Val Leu Leu Leu Leu Gly Pro Cys Ala Ala Pro Pro Ala Trp Ala Leu 
            20                  25                  30          


Asn Leu Asp Pro Val Gln Leu Thr Phe Tyr Ala Gly Pro Asn Gly Ser 
        35                  40                  45              


Gln Phe Gly Phe Ser Leu Asp Phe His Lys Asp Ser His Gly Arg Val 
    50                  55                  60                  


Ala Ile Val Val Gly Ala Pro Arg Thr Leu Gly Pro Ser Gln Glu Glu 
65                  70                  75                  80  


Thr Gly Gly Val Phe Leu Cys Pro Trp Arg Ala Glu Gly Gly Gln Cys 
                85                  90                  95      


Pro Ser Leu Leu Phe Asp Leu Arg Asp Glu Thr Arg Asn Val Gly Ser 
            100                 105                 110         


Gln Thr Leu Gln Thr Phe Lys Ala Arg Gln Gly Leu Gly Ala Ser Val 
        115                 120                 125             


Val Ser Trp Ser Asp Val Ile Val Ala Cys Ala Pro Trp Gln His Trp 
    130                 135                 140                 


Asn Val Leu Glu Lys Thr Glu Glu Ala Glu Lys Thr Pro Val Gly Ser 
145                 150                 155                 160 


Cys Phe Leu Ala Gln Pro Glu Ser Gly Arg Arg Ala Glu Tyr Ser Pro 
                165                 170                 175     


Cys Arg Gly Asn Thr Leu Ser Arg Ile Tyr Val Glu Asn Asp Phe Ser 
            180                 185                 190         


Trp Asp Lys Arg Tyr Cys Glu Ala Gly Phe Ser Ser Val Val Thr Gln 
        195                 200                 205             


Ala Gly Glu Leu Val Leu Gly Ala Pro Gly Gly Tyr Tyr Phe Leu Gly 
    210                 215                 220                 


Leu Leu Ala Gln Ala Pro Val Ala Asp Ile Phe Ser Ser Tyr Arg Pro 
225                 230                 235                 240 


Gly Ile Leu Leu Trp His Val Ser Ser Gln Ser Leu Ser Phe Asp Ser 
                245                 250                 255     


Ser Asn Pro Glu Tyr Phe Asp Gly Tyr Trp Gly Tyr Ser Val Ala Val 
            260                 265                 270         


Gly Glu Phe Asp Gly Asp Leu Asn Thr Thr Glu Tyr Val Val Gly Ala 
        275                 280                 285             


Pro Thr Trp Ser Trp Thr Leu Gly Ala Val Glu Ile Leu Asp Ser Tyr 
    290                 295                 300                 


Tyr Gln Arg Leu His Arg Leu Arg Gly Glu Gln Met Ala Ser Tyr Phe 
305                 310                 315                 320 


Gly His Ser Val Ala Val Thr Asp Val Asn Gly Asp Gly Arg His Asp 
                325                 330                 335     


Leu Leu Val Gly Ala Pro Leu Tyr Met Glu Ser Arg Ala Asp Arg Lys 
            340                 345                 350         


Leu Ala Glu Val Gly Arg Val Tyr Leu Phe Leu Gln Pro Arg Gly Pro 
        355                 360                 365             


His Ala Leu Gly Ala Pro Ser Leu Leu Leu Thr Gly Thr Gln Leu Tyr 
    370                 375                 380                 


Gly Arg Phe Gly Ser Ala Ile Ala Pro Leu Gly Asp Leu Asp Arg Asp 
385                 390                 395                 400 


Gly Tyr Asn Asp Ile Ala Val Ala Ala Pro Tyr Gly Gly Pro Ser Gly 
                405                 410                 415     


Arg Gly Gln Val Leu Val Phe Leu Gly Gln Ser Glu Gly Leu Arg Ser 
            420                 425                 430         


Arg Pro Ser Gln Val Leu Asp Ser Pro Phe Pro Thr Gly Ser Ala Phe 
        435                 440                 445             


Gly Phe Ser Leu Arg Gly Ala Val Asp Ile Asp Asp Asn Gly Tyr Pro 
    450                 455                 460                 


Asp Leu Ile Val Gly Ala Tyr Gly Ala Asn Gln Val Ala Val Tyr Arg 
465                 470                 475                 480 


Ala Gln Pro Val Val Lys Ala Ser Val Gln Leu Leu Val Gln Asp Ser 
                485                 490                 495     


Leu Asn Pro Ala Val Lys Ser Cys Val Leu Pro Gln Thr Lys Thr Pro 
            500                 505                 510         


Val Ser Cys Phe Asn Ile Gln Met Cys Val Gly Ala Thr Gly His Asn 
        515                 520                 525             


Ile Pro Gln Lys Leu Ser Leu Asn Ala Glu Leu Gln Leu Asp Arg Gln 
    530                 535                 540                 


Lys Pro Arg Gln Gly Arg Arg Val Leu Leu Leu Gly Ser Gln Gln Ala 
545                 550                 555                 560 


Gly Thr Thr Leu Asn Leu Asp Leu Gly Gly Lys His Ser Pro Ile Cys 
                565                 570                 575     


His Thr Thr Met Ala Phe Leu Arg Asp Glu Ala Asp Phe Arg Asp Lys 
            580                 585                 590         


Leu Ser Pro Ile Val Leu Ser Leu Asn Val Ser Leu Pro Pro Thr Glu 
        595                 600                 605             


Ala Gly Met Ala Pro Ala Val Val Leu His Gly Asp Thr His Val Gln 
    610                 615                 620                 


Glu Gln Thr Arg Ile Val Leu Asp Cys Gly Glu Asp Asp Val Cys Val 
625                 630                 635                 640 


Pro Gln Leu Gln Leu Thr Ala Ser Val Thr Gly Ser Pro Leu Leu Val 
                645                 650                 655     


Gly Ala Asp Asn Val Leu Glu Leu Gln Met Asp Ala Ala Asn Glu Gly 
            660                 665                 670         


Glu Gly Ala Tyr Glu Ala Glu Leu Ala Val His Leu Pro Gln Gly Ala 
        675                 680                 685             


His Tyr Met Arg Ala Leu Ser Asn Val Glu Gly Phe Glu Arg Leu Ile 
    690                 695                 700                 


Cys Asn Gln Lys Lys Glu Asn Glu Thr Arg Val Val Leu Cys Glu Leu 
705                 710                 715                 720 


Gly Asn Pro Met Lys Lys Asn Ala Gln Ile Gly Ile Ala Met Leu Val 
                725                 730                 735     


Ser Val Gly Asn Leu Glu Glu Ala Gly Glu Ser Val Ser Phe Gln Leu 
            740                 745                 750         


Gln Ile Arg Ser Lys Asn Ser Gln Asn Pro Asn Ser Lys Ile Val Leu 
        755                 760                 765             


Leu Asp Val Pro Val Arg Ala Glu Ala Gln Val Glu Leu Arg Gly Asn 
    770                 775                 780                 


Ser Phe Pro Ala Ser Leu Val Val Ala Ala Glu Glu Gly Glu Arg Glu 
785                 790                 795                 800 


Gln Asn Ser Leu Asp Ser Trp Gly Pro Lys Val Glu His Thr Tyr Glu 
                805                 810                 815     


Leu His Asn Asn Gly Pro Gly Thr Val Asn Gly Leu His Leu Ser Ile 
            820                 825                 830         


His Leu Pro Gly Gln Ser Gln Pro Ser Asp Leu Leu Tyr Ile Leu Asp 
        835                 840                 845             


Ile Gln Pro Gln Gly Gly Leu Gln Cys Phe Pro Gln Pro Pro Val Asn 
    850                 855                 860                 


Pro Leu Lys Val Asp Trp Gly Leu Pro Ile Pro Ser Pro Ser Pro Ile 
865                 870                 875                 880 


His Pro Ala His His Lys Arg Asp Arg Arg Gln Ile Phe Leu Pro Glu 
                885                 890                 895     


Pro Glu Gln Pro Ser Arg Leu Gln Asp Pro Val Leu Val Ser Cys Asp 
            900                 905                 910         


Ser Ala Pro Cys Thr Val Val Gln Cys Asp Leu Gln Glu Met Ala Arg 
        915                 920                 925             


Gly Gln Arg Ala Met Val Thr Val Leu Ala Phe Leu Trp Leu Pro Ser 
    930                 935                 940                 


Leu Tyr Gln Arg Pro Leu Asp Gln Phe Val Leu Gln Ser His Ala Trp 
945                 950                 955                 960 


Phe Asn Val Ser Ser Leu Pro Tyr Ala Val Pro Pro Leu Ser Leu Pro 
                965                 970                 975     


Arg Gly Glu Ala Gln Val Trp Thr Gln Leu Leu Arg Ala Leu Glu Glu 
            980                 985                 990         


Arg Ala Ile Pro Ile Trp Trp Val  Leu Val Gly Val Leu  Gly Gly Leu 
        995                 1000                 1005             


Leu Leu  Leu Thr Ile Leu Val  Leu Ala Met Trp Lys  Val Gly Phe 
    1010                 1015                 1020             


Phe Lys  Arg Asn Arg Pro Pro  Leu Glu Glu Asp Asp  Glu Glu Gly 
    1025                 1030                 1035             


Glu 
    


<210>  286
<211>  5044
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha 3 (antigen CD49C, alpha 3 subunit of
       VLA-3 receptor) (ITGA3), transcript variant a, mRNA GenBank 
       Accession No. NM_002204.2  GI:171846266

<400>  286
tttccccgga aggaaagcgc agcccgggct gggctcgcaa ggtggggagg tgcgggactg       60

ggcgtgggga ggcggggcgc gcgccggggg acccctccct cctgtcctcc ttgcggtcga      120

ccggtgcgct tgccagatcc gccgcgaagc cgggatcgaa ggcgacagcg cggccaaggg      180

ggcgcggccg ggacaagctg ggggccggtt gcccggggca gggacggcgg cgacccggcc      240

gctggggagg caggaagata gacccacgga tcttaggaag ggatccgaga gcgcagctgt      300

gaaactggct ggggctgggg gcacgaaacc gatcagcgct acggagcgca gcggccggcg      360

ggttccagtg tcctccggcg gcgcggggag caggtgaaca ggtcctcacg cccagctccg      420

cgccctcacg cgctctcgcc gggaccccgc ttccgctggc agccatgggc cccggcccca      480

gccgcgcgcc ccgcgcccca cgcctgatgc tctgtgcgct cgccttgatg gtggcggccg      540

gcggctgcgt cgtctccgcc ttcaacctgg atacccgatt cctggtagtg aaggaggccg      600

ggaacccggg cagcctcttc ggctactcgg tcgccctcca tcggcagaca gagcggcagc      660

agcgctacct gctcctggct ggtgcccccc gggagctcgc tgtgcccgat ggctacacca      720

accggactgg tgctgtgtac ctgtgcccac tcactgccca caaggatgac tgtgagcgga      780

tgaacatcac agtgaaaaat gaccctggcc atcacattat tgaggacatg tggcttggag      840

tgactgtggc cagccagggc cctgcaggca gagttctggt ctgtgcccac cgctacaccc      900

aggtgctgtg gtcagggtca gaagaccagc ggcgcatggt gggcaagtgc tacgtgcgag      960

gcaatgacct agagctggac tccagtgatg actggcagac ctaccacaac gagatgtgca     1020

atagcaacac agactacctg gagacgggca tgtgccagct gggcaccagc ggtggcttca     1080

cccagaacac tgtgtacttc ggcgcccccg gtgcctacaa ctggaaagga aacagctaca     1140

tgattcagcg caaggagtgg gacttatctg agtatagtta caaggaccca gaggaccaag     1200

gaaacctcta tattgggtac acgatgcagg taggcagctt catcctgcac cccaaaaaca     1260

tcaccattgt gacaggtgcc ccacggcacc gacatatggg cgcggtgttc ttgctgagcc     1320

aggaggcagg cggagacctg cggaggaggc aggtgctgga gggctcgcag gtgggcgcct     1380

attttggcag cgccattgcc ctggcagacc tgaacaatga tgggtggcag gacctcctgg     1440

tgggcgcccc ctactacttc gagaggaaag aggaagtagg gggtgccatc tatgtcttca     1500

tgaaccaggc gggaacctcc ttccctgctc acccctcact ccttcttcat ggccccagtg     1560

gctctgcctt tggtttatct gtggccagca ttggtgacat caaccaggat ggatttcagg     1620

atattgctgt gggagctccg tttgaaggct tgggcaaagt gtacatctat cacagtagct     1680

ctaaggggct ccttagacag ccccagcagg taatccatgg agagaagctg ggactgcctg     1740

ggttggccac cttcggctat tccctcagtg ggcagatgga tgtggatgag aacttctacc     1800

cagaccttct agtgggaagc ctgtcagacc acattgtgct gctgcgggcc cggcccgtca     1860

tcaacatcgt ccacaagacc ttggtgccca ggccagctgt gctggaccct gcactttgca     1920

cggccacctc ttgtgtgcaa gtggagctgt gctttgctta caaccagagt gccgggaacc     1980

ccaactacag gcgaaacatc accctggcct acactctgga ggctgacagg gaccgccggc     2040

cgccccggct ccgctttgcc ggcagtgagt ccgctgtctt ccacggcttc ttctccatgc     2100

ccgagatgcg ctgccagaag ctggagctgc tcctgatgga caacctccgt gacaaactcc     2160

gccccatcat catctccatg aactactctt tacctttgcg gatgcccgat cgcccccggc     2220

tggggctgcg gtccctggac gcctacccga tcctcaacca ggcacaggct ctggagaacc     2280

acactgaggt ccagttccag aaggagtgcg ggcctgacaa caagtgtgag agcaacttgc     2340

agatgcgggc agccttcgtg tcagagcagc agcagaagct gagcaggctc cagtacagca     2400

gagacgtccg gaaattgctc ctgagcatca acgtgacgaa cacccggacc tcggagcgct     2460

ccggggagga cgcccacgag gcgctgctca ccctggtggt gcctcccgcc ctgctgctgt     2520

cctcagtgcg cccccccggg gcctgccaag ctaatgagac catcttttgc gagctgggga     2580

accccttcaa acggaaccag aggatggagc tgctcatcgc ctttgaggtc atcggggtga     2640

ccctgcacac aagggacctt caggtgcagc tgcagctctc cacgtcgagt caccaggaca     2700

acctgtggcc catgatcctc actctgctgg tggactatac actccagacc tcgcttagca     2760

tggtaaatca ccggctacaa agcttctttg gggggacagt gatgggtgag tctggcatga     2820

aaactgtgga ggatgtagga agccccctca agtatgaatt ccaggtgggc ccaatggggg     2880

aggggctggt gggcctgggg accctggtcc taggtctgga gtggccctac gaagtcagca     2940

atggcaagtg gctgctgtat cccacggaga tcaccgtcca tggcaatggg tcctggccct     3000

gccgaccacc tggagacctt atcaaccctc tcaacctcac tctttctgac cctggggaca     3060

ggccatcatc cccacagcgc aggcggcgac agctggatcc agggggaggc cagggccccc     3120

cacctgtcac tctggctgct gccaaaaaag ccaagtctga gactgtgctg acctgtgcca     3180

cagggcgtgc ccactgtgtg tggctagagt gccccatccc tgatgccccc gttgtcacca     3240

acgtgactgt gaaggcacga gtgtggaaca gcaccttcat cgaggattac agagactttg     3300

accgagtccg ggtaaatggc tgggctaccc tattcctccg aaccagcatc cccaccatca     3360

acatggagaa caagaccacg tggttctctg tggacattga ctcggagctg gtggaggagc     3420

tgccggccga aatcgagctg tggctggtgc tggtggccgt gggtgcaggg ctgctgctgc     3480

tggggctgat catcctcctg ctgtggaagt gcggcttctt caagcgagcc cgcactcgcg     3540

ccctgtatga agctaagagg cagaaggcgg agatgaagag ccagccgtca gagacagaga     3600

ggctgaccga cgactactga gggggcagcc ccccgccccc ggcccacctg gtgtgacttc     3660

tttaagcgga cccgctatta tcagatcatg cccaagtacc acgcagtgcg gatccgggag     3720

gaggagcgct acccacctcc agggagcacc ctgcccacca agaagcactg ggtgaccagc     3780

tggcagactc gggaccaata ctactgacgt cctccctgat cccaccccct cctcccccag     3840

tgtccccttt cttcctattt atcataagtt atgcctctga cagtccacag gggccaccac     3900

ctttggctgg tagcagcagg ctcaggcaca tacacctcgt caagagcatg cacatgctgt     3960

ctggccctgg ggatcttccc acaggagggc cagcgctgtg gaccttacaa cgccgagtgc     4020

actgcattcc tgtgccctag atgcacgtgg ggcccactgc tcgtggactg tgctggtgca     4080

tcacggatgg tgcatgggct cgccgtgtct cagcctctgc cagcgccaaa acaagccaaa     4140

gagcctccca ccagagccgg gaggaaaagg cccctgcaat gtggtgacac ctcccccttt     4200

cacactggat ccatcttgag ccacagtcac tggattgact ttgctgtcaa aactactgac     4260

agggagcagc ccccgggccg ctggctggtg ggcccccaat gacacccatg ccagagaggt     4320

ggggatcctg cctaaggttg tctacggggg cacttggagg acctggcgtg ctcagaccca     4380

acagcaaagg aactagaaag aaggacccag aacggcttgc tttcctgcat ctctgtgaag     4440

cctctctcct tggccacaga ctgaactcgc agggaatgca gcaggaagga acaaagacag     4500

gcaaacggca acgtagcctg ggctcactgt gctggggcac ggcgggatcc tccacagaga     4560

ggaggggacc aattctggac agacagatgt tgggaggata cagaggagat gccacttctc     4620

actcaccact accagccagc ctcagaaggc cccagagaga ccctgcaaga ccacggaggg     4680

agcgacactt gaatgtagaa taggcagggg gccctgcccc accccatcca gccagacccc     4740

acgctgacca tgcgtcaggg gcctagaggt ggagttctta gctatccttg gctttcagag     4800

ccagcctggc tctgccccct cccccatggg ctgtgtccta aggcccattt gagaagctga     4860

ggctagttcc agaaaacctc tcctgacccc tgcctgttgg caggcccact ccccagcccc     4920

agccccttcc atggtactgt agcaggggaa ttccctcccc ctccttgtgc cttctttgta     4980

tataggcttc tcacggcgac caataaacag ctcccagttt gtatgcaaaa aaaaaaaaaa     5040

aaaa                                                                  5044


<210>  287
<211>  1051
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha 3 (antigen CD49C, alpha 3 subunit of
       VLA-3 receptor) (ITGA3), transcript variant a, polypeptide 
       GenBank Accession No. NP_002195.1 GI:4504747

<400>  287

Met Gly Pro Gly Pro Ser Arg Ala Pro Arg Ala Pro Arg Leu Met Leu 
1               5                   10                  15      


Cys Ala Leu Ala Leu Met Val Ala Ala Gly Gly Cys Val Val Ser Ala 
            20                  25                  30          


Phe Asn Leu Asp Thr Arg Phe Leu Val Val Lys Glu Ala Gly Asn Pro 
        35                  40                  45              


Gly Ser Leu Phe Gly Tyr Ser Val Ala Leu His Arg Gln Thr Glu Arg 
    50                  55                  60                  


Gln Gln Arg Tyr Leu Leu Leu Ala Gly Ala Pro Arg Glu Leu Ala Val 
65                  70                  75                  80  


Pro Asp Gly Tyr Thr Asn Arg Thr Gly Ala Val Tyr Leu Cys Pro Leu 
                85                  90                  95      


Thr Ala His Lys Asp Asp Cys Glu Arg Met Asn Ile Thr Val Lys Asn 
            100                 105                 110         


Asp Pro Gly His His Ile Ile Glu Asp Met Trp Leu Gly Val Thr Val 
        115                 120                 125             


Ala Ser Gln Gly Pro Ala Gly Arg Val Leu Val Cys Ala His Arg Tyr 
    130                 135                 140                 


Thr Gln Val Leu Trp Ser Gly Ser Glu Asp Gln Arg Arg Met Val Gly 
145                 150                 155                 160 


Lys Cys Tyr Val Arg Gly Asn Asp Leu Glu Leu Asp Ser Ser Asp Asp 
                165                 170                 175     


Trp Gln Thr Tyr His Asn Glu Met Cys Asn Ser Asn Thr Asp Tyr Leu 
            180                 185                 190         


Glu Thr Gly Met Cys Gln Leu Gly Thr Ser Gly Gly Phe Thr Gln Asn 
        195                 200                 205             


Thr Val Tyr Phe Gly Ala Pro Gly Ala Tyr Asn Trp Lys Gly Asn Ser 
    210                 215                 220                 


Tyr Met Ile Gln Arg Lys Glu Trp Asp Leu Ser Glu Tyr Ser Tyr Lys 
225                 230                 235                 240 


Asp Pro Glu Asp Gln Gly Asn Leu Tyr Ile Gly Tyr Thr Met Gln Val 
                245                 250                 255     


Gly Ser Phe Ile Leu His Pro Lys Asn Ile Thr Ile Val Thr Gly Ala 
            260                 265                 270         


Pro Arg His Arg His Met Gly Ala Val Phe Leu Leu Ser Gln Glu Ala 
        275                 280                 285             


Gly Gly Asp Leu Arg Arg Arg Gln Val Leu Glu Gly Ser Gln Val Gly 
    290                 295                 300                 


Ala Tyr Phe Gly Ser Ala Ile Ala Leu Ala Asp Leu Asn Asn Asp Gly 
305                 310                 315                 320 


Trp Gln Asp Leu Leu Val Gly Ala Pro Tyr Tyr Phe Glu Arg Lys Glu 
                325                 330                 335     


Glu Val Gly Gly Ala Ile Tyr Val Phe Met Asn Gln Ala Gly Thr Ser 
            340                 345                 350         


Phe Pro Ala His Pro Ser Leu Leu Leu His Gly Pro Ser Gly Ser Ala 
        355                 360                 365             


Phe Gly Leu Ser Val Ala Ser Ile Gly Asp Ile Asn Gln Asp Gly Phe 
    370                 375                 380                 


Gln Asp Ile Ala Val Gly Ala Pro Phe Glu Gly Leu Gly Lys Val Tyr 
385                 390                 395                 400 


Ile Tyr His Ser Ser Ser Lys Gly Leu Leu Arg Gln Pro Gln Gln Val 
                405                 410                 415     


Ile His Gly Glu Lys Leu Gly Leu Pro Gly Leu Ala Thr Phe Gly Tyr 
            420                 425                 430         


Ser Leu Ser Gly Gln Met Asp Val Asp Glu Asn Phe Tyr Pro Asp Leu 
        435                 440                 445             


Leu Val Gly Ser Leu Ser Asp His Ile Val Leu Leu Arg Ala Arg Pro 
    450                 455                 460                 


Val Ile Asn Ile Val His Lys Thr Leu Val Pro Arg Pro Ala Val Leu 
465                 470                 475                 480 


Asp Pro Ala Leu Cys Thr Ala Thr Ser Cys Val Gln Val Glu Leu Cys 
                485                 490                 495     


Phe Ala Tyr Asn Gln Ser Ala Gly Asn Pro Asn Tyr Arg Arg Asn Ile 
            500                 505                 510         


Thr Leu Ala Tyr Thr Leu Glu Ala Asp Arg Asp Arg Arg Pro Pro Arg 
        515                 520                 525             


Leu Arg Phe Ala Gly Ser Glu Ser Ala Val Phe His Gly Phe Phe Ser 
    530                 535                 540                 


Met Pro Glu Met Arg Cys Gln Lys Leu Glu Leu Leu Leu Met Asp Asn 
545                 550                 555                 560 


Leu Arg Asp Lys Leu Arg Pro Ile Ile Ile Ser Met Asn Tyr Ser Leu 
                565                 570                 575     


Pro Leu Arg Met Pro Asp Arg Pro Arg Leu Gly Leu Arg Ser Leu Asp 
            580                 585                 590         


Ala Tyr Pro Ile Leu Asn Gln Ala Gln Ala Leu Glu Asn His Thr Glu 
        595                 600                 605             


Val Gln Phe Gln Lys Glu Cys Gly Pro Asp Asn Lys Cys Glu Ser Asn 
    610                 615                 620                 


Leu Gln Met Arg Ala Ala Phe Val Ser Glu Gln Gln Gln Lys Leu Ser 
625                 630                 635                 640 


Arg Leu Gln Tyr Ser Arg Asp Val Arg Lys Leu Leu Leu Ser Ile Asn 
                645                 650                 655     


Val Thr Asn Thr Arg Thr Ser Glu Arg Ser Gly Glu Asp Ala His Glu 
            660                 665                 670         


Ala Leu Leu Thr Leu Val Val Pro Pro Ala Leu Leu Leu Ser Ser Val 
        675                 680                 685             


Arg Pro Pro Gly Ala Cys Gln Ala Asn Glu Thr Ile Phe Cys Glu Leu 
    690                 695                 700                 


Gly Asn Pro Phe Lys Arg Asn Gln Arg Met Glu Leu Leu Ile Ala Phe 
705                 710                 715                 720 


Glu Val Ile Gly Val Thr Leu His Thr Arg Asp Leu Gln Val Gln Leu 
                725                 730                 735     


Gln Leu Ser Thr Ser Ser His Gln Asp Asn Leu Trp Pro Met Ile Leu 
            740                 745                 750         


Thr Leu Leu Val Asp Tyr Thr Leu Gln Thr Ser Leu Ser Met Val Asn 
        755                 760                 765             


His Arg Leu Gln Ser Phe Phe Gly Gly Thr Val Met Gly Glu Ser Gly 
    770                 775                 780                 


Met Lys Thr Val Glu Asp Val Gly Ser Pro Leu Lys Tyr Glu Phe Gln 
785                 790                 795                 800 


Val Gly Pro Met Gly Glu Gly Leu Val Gly Leu Gly Thr Leu Val Leu 
                805                 810                 815     


Gly Leu Glu Trp Pro Tyr Glu Val Ser Asn Gly Lys Trp Leu Leu Tyr 
            820                 825                 830         


Pro Thr Glu Ile Thr Val His Gly Asn Gly Ser Trp Pro Cys Arg Pro 
        835                 840                 845             


Pro Gly Asp Leu Ile Asn Pro Leu Asn Leu Thr Leu Ser Asp Pro Gly 
    850                 855                 860                 


Asp Arg Pro Ser Ser Pro Gln Arg Arg Arg Arg Gln Leu Asp Pro Gly 
865                 870                 875                 880 


Gly Gly Gln Gly Pro Pro Pro Val Thr Leu Ala Ala Ala Lys Lys Ala 
                885                 890                 895     


Lys Ser Glu Thr Val Leu Thr Cys Ala Thr Gly Arg Ala His Cys Val 
            900                 905                 910         


Trp Leu Glu Cys Pro Ile Pro Asp Ala Pro Val Val Thr Asn Val Thr 
        915                 920                 925             


Val Lys Ala Arg Val Trp Asn Ser Thr Phe Ile Glu Asp Tyr Arg Asp 
    930                 935                 940                 


Phe Asp Arg Val Arg Val Asn Gly Trp Ala Thr Leu Phe Leu Arg Thr 
945                 950                 955                 960 


Ser Ile Pro Thr Ile Asn Met Glu Asn Lys Thr Thr Trp Phe Ser Val 
                965                 970                 975     


Asp Ile Asp Ser Glu Leu Val Glu Glu Leu Pro Ala Glu Ile Glu Leu 
            980                 985                 990         


Trp Leu Val Leu Val Ala Val Gly  Ala Gly Leu Leu Leu  Leu Gly Leu 
        995                 1000                 1005             


Ile Ile  Leu Leu Leu Trp Lys  Cys Gly Phe Phe Lys  Arg Ala Arg 
    1010                 1015                 1020             


Thr Arg  Ala Leu Tyr Glu Ala  Lys Arg Gln Lys Ala  Glu Met Lys 
    1025                 1030                 1035             


Ser Gln  Pro Ser Glu Thr Glu  Arg Leu Thr Asp Asp  Tyr 
    1040                 1045                 1050     


<210>  288
<211>  4902
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha 3 (antigen CD49C, alpha 3 subunit of
       VLA-3 receptor) (ITGA3), transcript variant b, mRNA GenBank
       Accession No. NM_005501.2 GI:171846264

<400>  288
tttccccgga aggaaagcgc agcccgggct gggctcgcaa ggtggggagg tgcgggactg       60

ggcgtgggga ggcggggcgc gcgccggggg acccctccct cctgtcctcc ttgcggtcga      120

ccggtgcgct tgccagatcc gccgcgaagc cgggatcgaa ggcgacagcg cggccaaggg      180

ggcgcggccg ggacaagctg ggggccggtt gcccggggca gggacggcgg cgacccggcc      240

gctggggagg caggaagata gacccacgga tcttaggaag ggatccgaga gcgcagctgt      300

gaaactggct ggggctgggg gcacgaaacc gatcagcgct acggagcgca gcggccggcg      360

ggttccagtg tcctccggcg gcgcggggag caggtgaaca ggtcctcacg cccagctccg      420

cgccctcacg cgctctcgcc gggaccccgc ttccgctggc agccatgggc cccggcccca      480

gccgcgcgcc ccgcgcccca cgcctgatgc tctgtgcgct cgccttgatg gtggcggccg      540

gcggctgcgt cgtctccgcc ttcaacctgg atacccgatt cctggtagtg aaggaggccg      600

ggaacccggg cagcctcttc ggctactcgg tcgccctcca tcggcagaca gagcggcagc      660

agcgctacct gctcctggct ggtgcccccc gggagctcgc tgtgcccgat ggctacacca      720

accggactgg tgctgtgtac ctgtgcccac tcactgccca caaggatgac tgtgagcgga      780

tgaacatcac agtgaaaaat gaccctggcc atcacattat tgaggacatg tggcttggag      840

tgactgtggc cagccagggc cctgcaggca gagttctggt ctgtgcccac cgctacaccc      900

aggtgctgtg gtcagggtca gaagaccagc ggcgcatggt gggcaagtgc tacgtgcgag      960

gcaatgacct agagctggac tccagtgatg actggcagac ctaccacaac gagatgtgca     1020

atagcaacac agactacctg gagacgggca tgtgccagct gggcaccagc ggtggcttca     1080

cccagaacac tgtgtacttc ggcgcccccg gtgcctacaa ctggaaagga aacagctaca     1140

tgattcagcg caaggagtgg gacttatctg agtatagtta caaggaccca gaggaccaag     1200

gaaacctcta tattgggtac acgatgcagg taggcagctt catcctgcac cccaaaaaca     1260

tcaccattgt gacaggtgcc ccacggcacc gacatatggg cgcggtgttc ttgctgagcc     1320

aggaggcagg cggagacctg cggaggaggc aggtgctgga gggctcgcag gtgggcgcct     1380

attttggcag cgccattgcc ctggcagacc tgaacaatga tgggtggcag gacctcctgg     1440

tgggcgcccc ctactacttc gagaggaaag aggaagtagg gggtgccatc tatgtcttca     1500

tgaaccaggc gggaacctcc ttccctgctc acccctcact ccttcttcat ggccccagtg     1560

gctctgcctt tggtttatct gtggccagca ttggtgacat caaccaggat ggatttcagg     1620

atattgctgt gggagctccg tttgaaggct tgggcaaagt gtacatctat cacagtagct     1680

ctaaggggct ccttagacag ccccagcagg taatccatgg agagaagctg ggactgcctg     1740

ggttggccac cttcggctat tccctcagtg ggcagatgga tgtggatgag aacttctacc     1800

cagaccttct agtgggaagc ctgtcagacc acattgtgct gctgcgggcc cggcccgtca     1860

tcaacatcgt ccacaagacc ttggtgccca ggccagctgt gctggaccct gcactttgca     1920

cggccacctc ttgtgtgcaa gtggagctgt gctttgctta caaccagagt gccgggaacc     1980

ccaactacag gcgaaacatc accctggcct acactctgga ggctgacagg gaccgccggc     2040

cgccccggct ccgctttgcc ggcagtgagt ccgctgtctt ccacggcttc ttctccatgc     2100

ccgagatgcg ctgccagaag ctggagctgc tcctgatgga caacctccgt gacaaactcc     2160

gccccatcat catctccatg aactactctt tacctttgcg gatgcccgat cgcccccggc     2220

tggggctgcg gtccctggac gcctacccga tcctcaacca ggcacaggct ctggagaacc     2280

acactgaggt ccagttccag aaggagtgcg ggcctgacaa caagtgtgag agcaacttgc     2340

agatgcgggc agccttcgtg tcagagcagc agcagaagct gagcaggctc cagtacagca     2400

gagacgtccg gaaattgctc ctgagcatca acgtgacgaa cacccggacc tcggagcgct     2460

ccggggagga cgcccacgag gcgctgctca ccctggtggt gcctcccgcc ctgctgctgt     2520

cctcagtgcg cccccccggg gcctgccaag ctaatgagac catcttttgc gagctgggga     2580

accccttcaa acggaaccag aggatggagc tgctcatcgc ctttgaggtc atcggggtga     2640

ccctgcacac aagggacctt caggtgcagc tgcagctctc cacgtcgagt caccaggaca     2700

acctgtggcc catgatcctc actctgctgg tggactatac actccagacc tcgcttagca     2760

tggtaaatca ccggctacaa agcttctttg gggggacagt gatgggtgag tctggcatga     2820

aaactgtgga ggatgtagga agccccctca agtatgaatt ccaggtgggc ccaatggggg     2880

aggggctggt gggcctgggg accctggtcc taggtctgga gtggccctac gaagtcagca     2940

atggcaagtg gctgctgtat cccacggaga tcaccgtcca tggcaatggg tcctggccct     3000

gccgaccacc tggagacctt atcaaccctc tcaacctcac tctttctgac cctggggaca     3060

ggccatcatc cccacagcgc aggcggcgac agctggatcc agggggaggc cagggccccc     3120

cacctgtcac tctggctgct gccaaaaaag ccaagtctga gactgtgctg acctgtgcca     3180

cagggcgtgc ccactgtgtg tggctagagt gccccatccc tgatgccccc gttgtcacca     3240

acgtgactgt gaaggcacga gtgtggaaca gcaccttcat cgaggattac agagactttg     3300

accgagtccg ggtaaatggc tgggctaccc tattcctccg aaccagcatc cccaccatca     3360

acatggagaa caagaccacg tggttctctg tggacattga ctcggagctg gtggaggagc     3420

tgccggccga aatcgagctg tggctggtgc tggtggccgt gggtgcaggg ctgctgctgc     3480

tggggctgat catcctcctg ctgtggaagt gtgacttctt taagcggacc cgctattatc     3540

agatcatgcc caagtaccac gcagtgcgga tccgggagga ggagcgctac ccacctccag     3600

ggagcaccct gcccaccaag aagcactggg tgaccagctg gcagactcgg gaccaatact     3660

actgacgtcc tccctgatcc caccccctcc tcccccagtg tcccctttct tcctatttat     3720

cataagttat gcctctgaca gtccacaggg gccaccacct ttggctggta gcagcaggct     3780

caggcacata cacctcgtca agagcatgca catgctgtct ggccctgggg atcttcccac     3840

aggagggcca gcgctgtgga ccttacaacg ccgagtgcac tgcattcctg tgccctagat     3900

gcacgtgggg cccactgctc gtggactgtg ctggtgcatc acggatggtg catgggctcg     3960

ccgtgtctca gcctctgcca gcgccaaaac aagccaaaga gcctcccacc agagccggga     4020

ggaaaaggcc cctgcaatgt ggtgacacct ccccctttca cactggatcc atcttgagcc     4080

acagtcactg gattgacttt gctgtcaaaa ctactgacag ggagcagccc ccgggccgct     4140

ggctggtggg cccccaatga cacccatgcc agagaggtgg ggatcctgcc taaggttgtc     4200

tacgggggca cttggaggac ctggcgtgct cagacccaac agcaaaggaa ctagaaagaa     4260

ggacccagaa cggcttgctt tcctgcatct ctgtgaagcc tctctccttg gccacagact     4320

gaactcgcag ggaatgcagc aggaaggaac aaagacaggc aaacggcaac gtagcctggg     4380

ctcactgtgc tggggcacgg cgggatcctc cacagagagg aggggaccaa ttctggacag     4440

acagatgttg ggaggataca gaggagatgc cacttctcac tcaccactac cagccagcct     4500

cagaaggccc cagagagacc ctgcaagacc acggagggag cgacacttga atgtagaata     4560

ggcagggggc cctgccccac cccatccagc cagaccccac gctgaccatg cgtcaggggc     4620

ctagaggtgg agttcttagc tatccttggc tttcagagcc agcctggctc tgccccctcc     4680

cccatgggct gtgtcctaag gcccatttga gaagctgagg ctagttccag aaaacctctc     4740

ctgacccctg cctgttggca ggcccactcc ccagccccag ccccttccat ggtactgtag     4800

caggggaatt ccctccccct ccttgtgcct tctttgtata taggcttctc acggcgacca     4860

ataaacagct cccagtttgt atgcaaaaaa aaaaaaaaaa aa                        4902


<210>  289
<211>  1066
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha 3 (antigen CD49C, alpha 3 subunit of
       VLA-3 receptor) (ITGA3), transcript variant b, polypeptide 
       GenBank Accession No. NP_005492.1 GI:6006011

<400>  289

Met Gly Pro Gly Pro Ser Arg Ala Pro Arg Ala Pro Arg Leu Met Leu 
1               5                   10                  15      


Cys Ala Leu Ala Leu Met Val Ala Ala Gly Gly Cys Val Val Ser Ala 
            20                  25                  30          


Phe Asn Leu Asp Thr Arg Phe Leu Val Val Lys Glu Ala Gly Asn Pro 
        35                  40                  45              


Gly Ser Leu Phe Gly Tyr Ser Val Ala Leu His Arg Gln Thr Glu Arg 
    50                  55                  60                  


Gln Gln Arg Tyr Leu Leu Leu Ala Gly Ala Pro Arg Glu Leu Ala Val 
65                  70                  75                  80  


Pro Asp Gly Tyr Thr Asn Arg Thr Gly Ala Val Tyr Leu Cys Pro Leu 
                85                  90                  95      


Thr Ala His Lys Asp Asp Cys Glu Arg Met Asn Ile Thr Val Lys Asn 
            100                 105                 110         


Asp Pro Gly His His Ile Ile Glu Asp Met Trp Leu Gly Val Thr Val 
        115                 120                 125             


Ala Ser Gln Gly Pro Ala Gly Arg Val Leu Val Cys Ala His Arg Tyr 
    130                 135                 140                 


Thr Gln Val Leu Trp Ser Gly Ser Glu Asp Gln Arg Arg Met Val Gly 
145                 150                 155                 160 


Lys Cys Tyr Val Arg Gly Asn Asp Leu Glu Leu Asp Ser Ser Asp Asp 
                165                 170                 175     


Trp Gln Thr Tyr His Asn Glu Met Cys Asn Ser Asn Thr Asp Tyr Leu 
            180                 185                 190         


Glu Thr Gly Met Cys Gln Leu Gly Thr Ser Gly Gly Phe Thr Gln Asn 
        195                 200                 205             


Thr Val Tyr Phe Gly Ala Pro Gly Ala Tyr Asn Trp Lys Gly Asn Ser 
    210                 215                 220                 


Tyr Met Ile Gln Arg Lys Glu Trp Asp Leu Ser Glu Tyr Ser Tyr Lys 
225                 230                 235                 240 


Asp Pro Glu Asp Gln Gly Asn Leu Tyr Ile Gly Tyr Thr Met Gln Val 
                245                 250                 255     


Gly Ser Phe Ile Leu His Pro Lys Asn Ile Thr Ile Val Thr Gly Ala 
            260                 265                 270         


Pro Arg His Arg His Met Gly Ala Val Phe Leu Leu Ser Gln Glu Ala 
        275                 280                 285             


Gly Gly Asp Leu Arg Arg Arg Gln Val Leu Glu Gly Ser Gln Val Gly 
    290                 295                 300                 


Ala Tyr Phe Gly Ser Ala Ile Ala Leu Ala Asp Leu Asn Asn Asp Gly 
305                 310                 315                 320 


Trp Gln Asp Leu Leu Val Gly Ala Pro Tyr Tyr Phe Glu Arg Lys Glu 
                325                 330                 335     


Glu Val Gly Gly Ala Ile Tyr Val Phe Met Asn Gln Ala Gly Thr Ser 
            340                 345                 350         


Phe Pro Ala His Pro Ser Leu Leu Leu His Gly Pro Ser Gly Ser Ala 
        355                 360                 365             


Phe Gly Leu Ser Val Ala Ser Ile Gly Asp Ile Asn Gln Asp Gly Phe 
    370                 375                 380                 


Gln Asp Ile Ala Val Gly Ala Pro Phe Glu Gly Leu Gly Lys Val Tyr 
385                 390                 395                 400 


Ile Tyr His Ser Ser Ser Lys Gly Leu Leu Arg Gln Pro Gln Gln Val 
                405                 410                 415     


Ile His Gly Glu Lys Leu Gly Leu Pro Gly Leu Ala Thr Phe Gly Tyr 
            420                 425                 430         


Ser Leu Ser Gly Gln Met Asp Val Asp Glu Asn Phe Tyr Pro Asp Leu 
        435                 440                 445             


Leu Val Gly Ser Leu Ser Asp His Ile Val Leu Leu Arg Ala Arg Pro 
    450                 455                 460                 


Val Ile Asn Ile Val His Lys Thr Leu Val Pro Arg Pro Ala Val Leu 
465                 470                 475                 480 


Asp Pro Ala Leu Cys Thr Ala Thr Ser Cys Val Gln Val Glu Leu Cys 
                485                 490                 495     


Phe Ala Tyr Asn Gln Ser Ala Gly Asn Pro Asn Tyr Arg Arg Asn Ile 
            500                 505                 510         


Thr Leu Ala Tyr Thr Leu Glu Ala Asp Arg Asp Arg Arg Pro Pro Arg 
        515                 520                 525             


Leu Arg Phe Ala Gly Ser Glu Ser Ala Val Phe His Gly Phe Phe Ser 
    530                 535                 540                 


Met Pro Glu Met Arg Cys Gln Lys Leu Glu Leu Leu Leu Met Asp Asn 
545                 550                 555                 560 


Leu Arg Asp Lys Leu Arg Pro Ile Ile Ile Ser Met Asn Tyr Ser Leu 
                565                 570                 575     


Pro Leu Arg Met Pro Asp Arg Pro Arg Leu Gly Leu Arg Ser Leu Asp 
            580                 585                 590         


Ala Tyr Pro Ile Leu Asn Gln Ala Gln Ala Leu Glu Asn His Thr Glu 
        595                 600                 605             


Val Gln Phe Gln Lys Glu Cys Gly Pro Asp Asn Lys Cys Glu Ser Asn 
    610                 615                 620                 


Leu Gln Met Arg Ala Ala Phe Val Ser Glu Gln Gln Gln Lys Leu Ser 
625                 630                 635                 640 


Arg Leu Gln Tyr Ser Arg Asp Val Arg Lys Leu Leu Leu Ser Ile Asn 
                645                 650                 655     


Val Thr Asn Thr Arg Thr Ser Glu Arg Ser Gly Glu Asp Ala His Glu 
            660                 665                 670         


Ala Leu Leu Thr Leu Val Val Pro Pro Ala Leu Leu Leu Ser Ser Val 
        675                 680                 685             


Arg Pro Pro Gly Ala Cys Gln Ala Asn Glu Thr Ile Phe Cys Glu Leu 
    690                 695                 700                 


Gly Asn Pro Phe Lys Arg Asn Gln Arg Met Glu Leu Leu Ile Ala Phe 
705                 710                 715                 720 


Glu Val Ile Gly Val Thr Leu His Thr Arg Asp Leu Gln Val Gln Leu 
                725                 730                 735     


Gln Leu Ser Thr Ser Ser His Gln Asp Asn Leu Trp Pro Met Ile Leu 
            740                 745                 750         


Thr Leu Leu Val Asp Tyr Thr Leu Gln Thr Ser Leu Ser Met Val Asn 
        755                 760                 765             


His Arg Leu Gln Ser Phe Phe Gly Gly Thr Val Met Gly Glu Ser Gly 
    770                 775                 780                 


Met Lys Thr Val Glu Asp Val Gly Ser Pro Leu Lys Tyr Glu Phe Gln 
785                 790                 795                 800 


Val Gly Pro Met Gly Glu Gly Leu Val Gly Leu Gly Thr Leu Val Leu 
                805                 810                 815     


Gly Leu Glu Trp Pro Tyr Glu Val Ser Asn Gly Lys Trp Leu Leu Tyr 
            820                 825                 830         


Pro Thr Glu Ile Thr Val His Gly Asn Gly Ser Trp Pro Cys Arg Pro 
        835                 840                 845             


Pro Gly Asp Leu Ile Asn Pro Leu Asn Leu Thr Leu Ser Asp Pro Gly 
    850                 855                 860                 


Asp Arg Pro Ser Ser Pro Gln Arg Arg Arg Arg Gln Leu Asp Pro Gly 
865                 870                 875                 880 


Gly Gly Gln Gly Pro Pro Pro Val Thr Leu Ala Ala Ala Lys Lys Ala 
                885                 890                 895     


Lys Ser Glu Thr Val Leu Thr Cys Ala Thr Gly Arg Ala His Cys Val 
            900                 905                 910         


Trp Leu Glu Cys Pro Ile Pro Asp Ala Pro Val Val Thr Asn Val Thr 
        915                 920                 925             


Val Lys Ala Arg Val Trp Asn Ser Thr Phe Ile Glu Asp Tyr Arg Asp 
    930                 935                 940                 


Phe Asp Arg Val Arg Val Asn Gly Trp Ala Thr Leu Phe Leu Arg Thr 
945                 950                 955                 960 


Ser Ile Pro Thr Ile Asn Met Glu Asn Lys Thr Thr Trp Phe Ser Val 
                965                 970                 975     


Asp Ile Asp Ser Glu Leu Val Glu Glu Leu Pro Ala Glu Ile Glu Leu 
            980                 985                 990         


Trp Leu Val Leu Val Ala Val Gly  Ala Gly Leu Leu Leu  Leu Gly Leu 
        995                 1000                 1005             


Ile Ile  Leu Leu Leu Trp Lys  Cys Asp Phe Phe Lys  Arg Thr Arg 
    1010                 1015                 1020             


Tyr Tyr  Gln Ile Met Pro Lys  Tyr His Ala Val Arg  Ile Arg Glu 
    1025                 1030                 1035             


Glu Glu  Arg Tyr Pro Pro Pro  Gly Ser Thr Leu Pro  Thr Lys Lys 
    1040                 1045                 1050             


His Trp  Val Thr Ser Trp Gln  Thr Arg Asp Gln Tyr  Tyr 
    1055                 1060                 1065     


<210>  290
<211>  6082
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha 4 (antigen CD49D, alpha 4 subunit of
       VLA-4 receptor) (ITGA4), mRNA GenBank Accession No. NM_000885.4
       GI:67191026

<400>  290
ataacgtctt tgtcactaaa atgttcccca ggggccttcg gcgagtcttt ttgtttggtt       60

ttttgttttt aatctgtggc tcttgataat ttatctagtg gttgcctaca cctgaaaaac      120

aagacacagt gtttaactat caacgaaaga actggacggc tccccgccgc agtcccactc      180

cccgagtttg tggctggcat ttgggccacg ccgggctggg cggtcacagc gaggggcgcg      240

cagtttgggg tcacacagct ccgcttctag gccccaacca ccgttaaaag gggaagcccg      300

tgccccatca ggtccgctct tgctgagccc agagccatcc cgcgctctgc gggctgggag      360

gcccgggcca ggacgcgagt cctgcgcagc cgaggttccc cagcgccccc tgcagccgcg      420

cgtaggcaga gacggagccc ggccctgcgc ctccgcacca cgcccgggac cccacccagc      480

ggcccgtacc cggagaagca gcgcgagcac ccgaagctcc cggctggcgg cagaaaccgg      540

gagtggggcc gggcgagtgc gcggcatccc aggccggccc gaacgctccg cccgcggtgg      600

gccgacttcc cctcctcttc cctctctcct tcctttagcc cgctggcgcc ggacacgctg      660

cgcctcatct cttggggcgt tcttccccgt tggccaaccg tcgcatcccg tgcaactttg      720

gggtagtggc cgtttagtgt tgaatgttcc ccaccgagag cgcatggctt gggaagcgag      780

gcgcgaaccc ggcccccgaa gggccgccgt ccgggagacg gtgatgctgt tgctgtgcct      840

gggggtcccg accggccgcc cctacaacgt ggacactgag agcgcgctgc tttaccaggg      900

cccccacaac acgctgttcg gctactcggt cgtgctgcac agccacgggg cgaaccgatg      960

gctcctagtg ggtgcgccca ctgccaactg gctcgccaac gcttcagtga tcaatcccgg     1020

ggcgatttac agatgcagga tcggaaagaa tcccggccag acgtgcgaac agctccagct     1080

gggtagccct aatggagaac cttgtggaaa gacttgtttg gaagagagag acaatcagtg     1140

gttgggggtc acactttcca gacagccagg agaaaatgga tccatcgtga cttgtgggca     1200

tagatggaaa aatatatttt acataaagaa tgaaaataag ctccccactg gtggttgcta     1260

tggagtgccc cctgatttac gaacagaact gagtaaaaga atagctccgt gttatcaaga     1320

ttatgtgaaa aaatttggag aaaattttgc atcatgtcaa gctggaatat ccagttttta     1380

cacaaaggat ttaattgtga tgggggcccc aggatcatct tactggactg gctctctttt     1440

tgtctacaat ataactacaa ataaatacaa ggctttttta gacaaacaaa atcaagtaaa     1500

atttggaagt tatttaggat attcagtcgg agctggtcat tttcggagcc agcatactac     1560

cgaagtagtc ggaggagctc ctcaacatga gcagattggt aaggcatata tattcagcat     1620

tgatgaaaaa gaactaaata tcttacatga aatgaaaggt aaaaagcttg gatcgtactt     1680

tggagcttct gtctgtgctg tggacctcaa tgcagatggc ttctcagatc tgctcgtggg     1740

agcacccatg cagagcacca tcagagagga aggaagagtg tttgtgtaca tcaactctgg     1800

ctcgggagca gtaatgaatg caatggaaac aaacctcgtt ggaagtgaca aatatgctgc     1860

aagatttggg gaatctatag ttaatcttgg cgacattgac aatgatggct ttgaagatgt     1920

tgctatcgga gctccacaag aagatgactt gcaaggtgct atttatattt acaatggccg     1980

tgcagatggg atctcgtcaa ccttctcaca gagaattgaa ggacttcaga tcagcaaatc     2040

gttaagtatg tttggacagt ctatatcagg acaaattgat gcagataata atggctatgt     2100

agatgtagca gttggtgctt ttcggtctga ttctgctgtc ttgctaagga caagacctgt     2160

agtaattgtt gacgcttctt taagccaccc tgagtcagta aatagaacga aatttgactg     2220

tgttgaaaat ggatggcctt ctgtgtgcat agatctaaca ctttgtttct catataaggg     2280

caaggaagtt ccaggttaca ttgttttgtt ttataacatg agtttggatg tgaacagaaa     2340

ggcagagtct ccaccaagat tctatttctc ttctaatgga acttctgacg tgattacagg     2400

aagcatacag gtgtccagca gagaagctaa ctgtagaaca catcaagcat ttatgcggaa     2460

agatgtgcgg gacatcctca ccccaattca gattgaagct gcttaccacc ttggtcctca     2520

tgtcatcagt aaacgaagta cagaggaatt cccaccactt cagccaattc ttcagcagaa     2580

gaaagaaaaa gacataatga aaaaaacaat aaactttgca aggttttgtg cccatgaaaa     2640

ttgttctgct gatttacagg tttctgcaaa gattgggttt ttgaagcccc atgaaaataa     2700

aacatatctt gctgttggga gtatgaagac attgatgttg aatgtgtcct tgtttaatgc     2760

tggagatgat gcatatgaaa cgactctaca tgtcaaacta cccgtgggtc tttatttcat     2820

taagatttta gagctggaag agaagcaaat aaactgtgaa gtcacagata actctggcgt     2880

ggtacaactt gactgcagta ttggctatat atatgtagat catctctcaa ggatagatat     2940

tagctttctc ctggatgtga gctcactcag cagagcggaa gaggacctca gtatcacagt     3000

gcatgctacc tgtgaaaatg aagaggaaat ggacaatcta aagcacagca gagtgactgt     3060

agcaatacct ttaaaatatg aggttaagct gactgttcat gggtttgtaa acccaacttc     3120

atttgtgtat ggatcaaatg atgaaaatga gcctgaaacg tgcatggtgg agaaaatgaa     3180

cttaactttc catgttatca acactggcaa tagtatggct cccaatgtta gtgtggaaat     3240

aatggtacca aattctttta gcccccaaac tgataagctg ttcaacattt tggatgtcca     3300

gactactact ggagaatgcc actttgaaaa ttatcaaaga gtgtgtgcat tagagcagca     3360

aaagagtgca atgcagacct tgaaaggcat agtccggttc ttgtccaaga ctgataagag     3420

gctattgtac tgcataaaag ctgatccaca ttgtttaaat ttcttgtgta attttgggaa     3480

aatggaaagt ggaaaagaag ccagtgttca tatccaactg gaaggccggc catccatttt     3540

agaaatggat gagacttcag cactcaagtt tgaaataaga gcaacaggtt ttccagagcc     3600

aaatccaaga gtaattgaac taaacaagga tgagaatgtt gcgcatgttc tactggaagg     3660

actacatcat caaagaccca aacgttattt caccatagtg attatttcaa gtagcttgct     3720

acttggactt attgtacttc tgttgatctc atatgttatg tggaaggctg gcttctttaa     3780

aagacaatac aaatctatcc tacaagaaga aaacagaaga gacagttgga gttatatcaa     3840

cagtaaaagc aatgatgatt aaggacttct ttcaaattga gagaatggaa aacagactca     3900

ggttgtagta aagaaattta aaagacactg tttacaagaa aaaatgaatt ttgtttggac     3960

ttcttttact catgatcttg tgacatatta tgtcttcatg caaggggaaa atctcagcaa     4020

tgattactct ttgagataga agaactgcaa aggtaataat acagccaaag ataatctctc     4080

agcttttaaa tgggtagaga aacactaaag cattcaattt attcaagaaa agtaagccct     4140

tgaagatatc ttgaaatgaa agtataactg agttaaatta tactggagaa gtcttagact     4200

tgaaatacta cttaccatat gtgcttgcct cagtaaaatg aaccccactg ggtgggcaga     4260

ggttcatttc aaatacatct ttgatacttg ttcaaaatat gttctttaaa aatataattt     4320

tttagagagc tgttcccaaa ttttctaacg agtggaccat tatcacttta aagcccttta     4380

tttataatac atttcctacg ggctgtgttc caacaaccat tttttttcag cagactatga     4440

atattatagt attataggcc aaactggcaa acttcagact gaacatgtac actggtttga     4500

gcttagtgaa attacttctg gataattatt tttttataat tatggatttc accatctttc     4560

tttctgtata tatacatgtg tttttatgta ggtatatatt taccattctt cctatctatt     4620

cttcctataa cacaccttta tcaagcatac ccaggagtaa tcttcaaatc ttttgttata     4680

ttctgaaaca aaagattgtg agtgttgcac tttacctgat acacgctgat ttagaaaata     4740

cagaaaccat acctcactaa taactttaaa atcaaagctg tgcaaagact agggggccta     4800

tacttcatat gtattatgta ctatgtaaaa tattgactat cacacaacta tttccttgga     4860

tgtaattctt tgttaccctt tacaagtata agtgttacct tacatggaaa cgaagaaaca     4920

aaattcataa atttaaattc ataaatttag ctgaaagata ctgattcaat ttgtatacag     4980

tgaatataaa tgagacgaca gcaaaatttt catgaaatgt aaaatatttt tatagtttgt     5040

tcatactata tgaggttcta ttttaaatga ctttctggat tttaaaaaat ttctttaaat     5100

acaatcattt ttgtaatatt tattttatgc ttatgatcta gataattgca gaatatcatt     5160

ttatctgact ctgccttcat aagagagctg tggccgaatt ttgaacatct gttataggga     5220

gtgatcaaat tagaaggcaa tgtggaaaaa caattctggg aaagatttct ttatatgaag     5280

tccctgccac tagccagcca tcctaattga tgaaagttat ctgttcacag gcctgcagtg     5340

atggtgagga atgttctgag atttgcgaag gcatttgagt agtgaaatgt aagcacaaaa     5400

cctcctgaac ccagagtgtg tatacacagg aataaacttt atgacattta tgtattttta     5460

aaaaactttg tatcgttata aaaaggctag tcattctttc aggagaacat ctaggatcat     5520

agatgaaaaa tcaagccccg atttagaact gtcttctcca ggatggtctc taaggaaatt     5580

tacatttggt tctttcctac tcagaactac tcagaaacaa ctatatattt caggttatct     5640

gagcacagtg aaagcagagt actatggttg tccaacacag gcctctcaga tacaagggga     5700

acacaattac atattgggct agattttgcc cagttcaaaa tagtatttgt tatcaactta     5760

ctttgttact tgtatcatga attttaaaac cctaccactt taagaagaca gggatgggtt     5820

attctttttt ggcaggtagg ctatataact atgtgatttt gaaatttaac tgctctggat     5880

tagggagcag tgaatcaagg cagacttatg aaatctgtat tatatttgta acagaatata     5940

ggaaatttaa cataattgat gagctcaaat cctgaaaaat gaaagaatcc aaattatttc     6000

agaattatct aggttaaata ttgatgtatt atgatggttg caaagttttt ttgtgtgtcc     6060

aataaacaca ttgtaaaaaa aa                                              6082


<210>  291
<211>  1032
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha 4 (antigen CD49D, alpha 4 subunit of
       VLA-4 receptor) (ITGA4), polypeptide GenBank Accession No.
       NP_000876.3 GI:67191027

<400>  291

Met Ala Trp Glu Ala Arg Arg Glu Pro Gly Pro Arg Arg Ala Ala Val 
1               5                   10                  15      


Arg Glu Thr Val Met Leu Leu Leu Cys Leu Gly Val Pro Thr Gly Arg 
            20                  25                  30          


Pro Tyr Asn Val Asp Thr Glu Ser Ala Leu Leu Tyr Gln Gly Pro His 
        35                  40                  45              


Asn Thr Leu Phe Gly Tyr Ser Val Val Leu His Ser His Gly Ala Asn 
    50                  55                  60                  


Arg Trp Leu Leu Val Gly Ala Pro Thr Ala Asn Trp Leu Ala Asn Ala 
65                  70                  75                  80  


Ser Val Ile Asn Pro Gly Ala Ile Tyr Arg Cys Arg Ile Gly Lys Asn 
                85                  90                  95      


Pro Gly Gln Thr Cys Glu Gln Leu Gln Leu Gly Ser Pro Asn Gly Glu 
            100                 105                 110         


Pro Cys Gly Lys Thr Cys Leu Glu Glu Arg Asp Asn Gln Trp Leu Gly 
        115                 120                 125             


Val Thr Leu Ser Arg Gln Pro Gly Glu Asn Gly Ser Ile Val Thr Cys 
    130                 135                 140                 


Gly His Arg Trp Lys Asn Ile Phe Tyr Ile Lys Asn Glu Asn Lys Leu 
145                 150                 155                 160 


Pro Thr Gly Gly Cys Tyr Gly Val Pro Pro Asp Leu Arg Thr Glu Leu 
                165                 170                 175     


Ser Lys Arg Ile Ala Pro Cys Tyr Gln Asp Tyr Val Lys Lys Phe Gly 
            180                 185                 190         


Glu Asn Phe Ala Ser Cys Gln Ala Gly Ile Ser Ser Phe Tyr Thr Lys 
        195                 200                 205             


Asp Leu Ile Val Met Gly Ala Pro Gly Ser Ser Tyr Trp Thr Gly Ser 
    210                 215                 220                 


Leu Phe Val Tyr Asn Ile Thr Thr Asn Lys Tyr Lys Ala Phe Leu Asp 
225                 230                 235                 240 


Lys Gln Asn Gln Val Lys Phe Gly Ser Tyr Leu Gly Tyr Ser Val Gly 
                245                 250                 255     


Ala Gly His Phe Arg Ser Gln His Thr Thr Glu Val Val Gly Gly Ala 
            260                 265                 270         


Pro Gln His Glu Gln Ile Gly Lys Ala Tyr Ile Phe Ser Ile Asp Glu 
        275                 280                 285             


Lys Glu Leu Asn Ile Leu His Glu Met Lys Gly Lys Lys Leu Gly Ser 
    290                 295                 300                 


Tyr Phe Gly Ala Ser Val Cys Ala Val Asp Leu Asn Ala Asp Gly Phe 
305                 310                 315                 320 


Ser Asp Leu Leu Val Gly Ala Pro Met Gln Ser Thr Ile Arg Glu Glu 
                325                 330                 335     


Gly Arg Val Phe Val Tyr Ile Asn Ser Gly Ser Gly Ala Val Met Asn 
            340                 345                 350         


Ala Met Glu Thr Asn Leu Val Gly Ser Asp Lys Tyr Ala Ala Arg Phe 
        355                 360                 365             


Gly Glu Ser Ile Val Asn Leu Gly Asp Ile Asp Asn Asp Gly Phe Glu 
    370                 375                 380                 


Asp Val Ala Ile Gly Ala Pro Gln Glu Asp Asp Leu Gln Gly Ala Ile 
385                 390                 395                 400 


Tyr Ile Tyr Asn Gly Arg Ala Asp Gly Ile Ser Ser Thr Phe Ser Gln 
                405                 410                 415     


Arg Ile Glu Gly Leu Gln Ile Ser Lys Ser Leu Ser Met Phe Gly Gln 
            420                 425                 430         


Ser Ile Ser Gly Gln Ile Asp Ala Asp Asn Asn Gly Tyr Val Asp Val 
        435                 440                 445             


Ala Val Gly Ala Phe Arg Ser Asp Ser Ala Val Leu Leu Arg Thr Arg 
    450                 455                 460                 


Pro Val Val Ile Val Asp Ala Ser Leu Ser His Pro Glu Ser Val Asn 
465                 470                 475                 480 


Arg Thr Lys Phe Asp Cys Val Glu Asn Gly Trp Pro Ser Val Cys Ile 
                485                 490                 495     


Asp Leu Thr Leu Cys Phe Ser Tyr Lys Gly Lys Glu Val Pro Gly Tyr 
            500                 505                 510         


Ile Val Leu Phe Tyr Asn Met Ser Leu Asp Val Asn Arg Lys Ala Glu 
        515                 520                 525             


Ser Pro Pro Arg Phe Tyr Phe Ser Ser Asn Gly Thr Ser Asp Val Ile 
    530                 535                 540                 


Thr Gly Ser Ile Gln Val Ser Ser Arg Glu Ala Asn Cys Arg Thr His 
545                 550                 555                 560 


Gln Ala Phe Met Arg Lys Asp Val Arg Asp Ile Leu Thr Pro Ile Gln 
                565                 570                 575     


Ile Glu Ala Ala Tyr His Leu Gly Pro His Val Ile Ser Lys Arg Ser 
            580                 585                 590         


Thr Glu Glu Phe Pro Pro Leu Gln Pro Ile Leu Gln Gln Lys Lys Glu 
        595                 600                 605             


Lys Asp Ile Met Lys Lys Thr Ile Asn Phe Ala Arg Phe Cys Ala His 
    610                 615                 620                 


Glu Asn Cys Ser Ala Asp Leu Gln Val Ser Ala Lys Ile Gly Phe Leu 
625                 630                 635                 640 


Lys Pro His Glu Asn Lys Thr Tyr Leu Ala Val Gly Ser Met Lys Thr 
                645                 650                 655     


Leu Met Leu Asn Val Ser Leu Phe Asn Ala Gly Asp Asp Ala Tyr Glu 
            660                 665                 670         


Thr Thr Leu His Val Lys Leu Pro Val Gly Leu Tyr Phe Ile Lys Ile 
        675                 680                 685             


Leu Glu Leu Glu Glu Lys Gln Ile Asn Cys Glu Val Thr Asp Asn Ser 
    690                 695                 700                 


Gly Val Val Gln Leu Asp Cys Ser Ile Gly Tyr Ile Tyr Val Asp His 
705                 710                 715                 720 


Leu Ser Arg Ile Asp Ile Ser Phe Leu Leu Asp Val Ser Ser Leu Ser 
                725                 730                 735     


Arg Ala Glu Glu Asp Leu Ser Ile Thr Val His Ala Thr Cys Glu Asn 
            740                 745                 750         


Glu Glu Glu Met Asp Asn Leu Lys His Ser Arg Val Thr Val Ala Ile 
        755                 760                 765             


Pro Leu Lys Tyr Glu Val Lys Leu Thr Val His Gly Phe Val Asn Pro 
    770                 775                 780                 


Thr Ser Phe Val Tyr Gly Ser Asn Asp Glu Asn Glu Pro Glu Thr Cys 
785                 790                 795                 800 


Met Val Glu Lys Met Asn Leu Thr Phe His Val Ile Asn Thr Gly Asn 
                805                 810                 815     


Ser Met Ala Pro Asn Val Ser Val Glu Ile Met Val Pro Asn Ser Phe 
            820                 825                 830         


Ser Pro Gln Thr Asp Lys Leu Phe Asn Ile Leu Asp Val Gln Thr Thr 
        835                 840                 845             


Thr Gly Glu Cys His Phe Glu Asn Tyr Gln Arg Val Cys Ala Leu Glu 
    850                 855                 860                 


Gln Gln Lys Ser Ala Met Gln Thr Leu Lys Gly Ile Val Arg Phe Leu 
865                 870                 875                 880 


Ser Lys Thr Asp Lys Arg Leu Leu Tyr Cys Ile Lys Ala Asp Pro His 
                885                 890                 895     


Cys Leu Asn Phe Leu Cys Asn Phe Gly Lys Met Glu Ser Gly Lys Glu 
            900                 905                 910         


Ala Ser Val His Ile Gln Leu Glu Gly Arg Pro Ser Ile Leu Glu Met 
        915                 920                 925             


Asp Glu Thr Ser Ala Leu Lys Phe Glu Ile Arg Ala Thr Gly Phe Pro 
    930                 935                 940                 


Glu Pro Asn Pro Arg Val Ile Glu Leu Asn Lys Asp Glu Asn Val Ala 
945                 950                 955                 960 


His Val Leu Leu Glu Gly Leu His His Gln Arg Pro Lys Arg Tyr Phe 
                965                 970                 975     


Thr Ile Val Ile Ile Ser Ser Ser Leu Leu Leu Gly Leu Ile Val Leu 
            980                 985                 990         


Leu Leu Ile Ser Tyr Val Met Trp  Lys Ala Gly Phe Phe  Lys Arg Gln 
        995                 1000                 1005             


Tyr Lys  Ser Ile Leu Gln Glu  Glu Asn Arg Arg Asp  Ser Trp Ser 
    1010                 1015                 1020             


Tyr Ile  Asn Ser Lys Ser Asn  Asp Asp 
    1025                 1030         


<210>  292
<211>  4267
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha 5 (fibronectin receptor, alpha 
       polypeptide) (ITGA5), mRNA GenBank Accession No. NM_002205.2  
       GI:56237028

<400>  292
attcgcctct gggaggttta ggaagcggct ccgggtcggt ggccccagga cagggaagag       60

cgggcgctat ggggagccgg acgccagagt cccctctcca cgccgtgcag ctgcgctggg      120

gcccccggcg ccgacccccg ctgctgccgc tgctgttgct gctgctgccg ccgccaccca      180

gggtcggggg cttcaactta gacgcggagg ccccagcagt actctcgggg cccccgggct      240

ccttcttcgg attctcagtg gagttttacc ggccgggaac agacggggtc agtgtgctgg      300

tgggagcacc caaggctaat accagccagc caggagtgct gcagggtggt gctgtctacc      360

tctgtccttg gggtgccagc cccacacagt gcacccccat tgaatttgac agcaaaggct      420

ctcggctcct ggagtcctca ctgtccagct cagagggaga ggagcctgtg gagtacaagt      480

ccttgcagtg gttcggggca acagttcgag cccatggctc ctccatcttg gcatgcgctc      540

cactgtacag ctggcgcaca gagaaggagc cactgagcga ccccgtgggc acctgctacc      600

tctccacaga taacttcacc cgaattctgg agtatgcacc ctgccgctca gatttcagct      660

gggcagcagg acagggttac tgccaaggag gcttcagtgc cgagttcacc aagactggcc      720

gtgtggtttt aggtggacca ggaagctatt tctggcaagg ccagatcctg tctgccactc      780

aggagcagat tgcagaatct tattaccccg agtacctgat caacctggtt caggggcagc      840

tgcagactcg ccaggccagt tccatctatg atgacagcta cctaggatac tctgtggctg      900

ttggtgaatt cagtggtgat gacacagaag actttgttgc tggtgtgccc aaagggaacc      960

tcacttacgg ctatgtcacc atccttaatg gctcagacat tcgatccctc tacaacttct     1020

caggggaaca gatggcctcc tactttggct atgcagtggc cgccacagac gtcaatgggg     1080

acgggctgga tgacttgctg gtgggggcac ccctgctcat ggatcggacc cctgacgggc     1140

ggcctcagga ggtgggcagg gtctacgtct acctgcagca cccagccggc atagagccca     1200

cgcccaccct taccctcact ggccatgatg agtttggccg atttggcagc tccttgaccc     1260

ccctggggga cctggaccag gatggctaca atgatgtggc catcggggct ccctttggtg     1320

gggagaccca gcagggagta gtgtttgtat ttcctggggg cccaggaggg ctgggctcta     1380

agccttccca ggttctgcag cccctgtggg cagccagcca caccccagac ttctttggct     1440

ctgcccttcg aggaggccga gacctggatg gcaatggata tcctgatctg attgtggggt     1500

cctttggtgt ggacaaggct gtggtataca ggggccgccc catcgtgtcc gctagtgcct     1560

ccctcaccat cttccccgcc atgttcaacc cagaggagcg gagctgcagc ttagagggga     1620

accctgtggc ctgcatcaac cttagcttct gcctcaatgc ttctggaaaa cacgttgctg     1680

actccattgg tttcacagtg gaacttcagc tggactggca gaagcagaag ggaggggtac     1740

ggcgggcact gttcctggcc tccaggcagg caaccctgac ccagaccctg ctcatccaga     1800

atggggctcg agaggattgc agagagatga agatctacct caggaacgag tcagaatttc     1860

gagacaaact ctcgccgatt cacatcgctc tcaacttctc cttggacccc caagccccag     1920

tggacagcca cggcctcagg ccagccctac attatcagag caagagccgg atagaggaca     1980

aggctcagat cttgctggac tgtggagaag acaacatctg tgtgcctgac ctgcagctgg     2040

aagtgtttgg ggagcagaac catgtgtacc tgggtgacaa gaatgccctg aacctcactt     2100

tccatgccca gaatgtgggt gagggtggcg cctatgaggc tgagcttcgg gtcaccgccc     2160

ctccagaggc tgagtactca ggactcgtca gacacccagg gaacttctcc agcctgagct     2220

gtgactactt tgccgtgaac cagagccgcc tgctggtgtg tgacctgggc aaccccatga     2280

aggcaggagc cagtctgtgg ggtggccttc ggtttacagt ccctcatctc cgggacacta     2340

agaaaaccat ccagtttgac ttccagatcc tcagcaagaa tctcaacaac tcgcaaagcg     2400

acgtggtttc ctttcggctc tccgtggagg ctcaggccca ggtcaccctg aacggtgtct     2460

ccaagcctga ggcagtgcta ttcccagtaa gcgactggca tccccgagac cagcctcaga     2520

aggaggagga cctgggacct gctgtccacc atgtctatga gctcatcaac caaggcccca     2580

gctccattag ccagggtgtg ctggaactca gctgtcccca ggctctggaa ggtcagcagc     2640

tcctatatgt gaccagagtt acgggactca actgcaccac caatcacccc attaacccaa     2700

agggcctgga gttggatccc gagggttccc tgcaccacca gcaaaaacgg gaagctccaa     2760

gccgcagctc tgcttcctcg ggacctcaga tcctgaaatg cccggaggct gagtgtttca     2820

ggctgcgctg tgagctcggg cccctgcacc aacaagagag ccaaagtctg cagttgcatt     2880

tccgagtctg ggccaagact ttcttgcagc gggagcacca gccatttagc ctgcagtgtg     2940

aggctgtgta caaagccctg aagatgccct accgaatcct gcctcggcag ctgccccaaa     3000

aagagcgtca ggtggccaca gctgtgcaat ggaccaaggc agaaggcagc tatggcgtcc     3060

cactgtggat catcatccta gccatcctgt ttggcctcct gctcctaggt ctactcatct     3120

acatcctcta caagcttgga ttcttcaaac gctccctccc atatggcacc gccatggaaa     3180

aagctcagct caagcctcca gccacctctg atgcctgagt cctcccaatt tcagactccc     3240

attcctgaag aaccagtccc cccaccctca ttctactgaa aaggaggggt ctgggtactt     3300

cttgaaggtg ctgacggcca gggagaagct cctctcccca gcccagagac atacttgaag     3360

ggccagagcc aggggggtga ggagctgggg atccctcccc cccatgcact gtgaaggacc     3420

cttgtttaca cataccctct tcatggatgg gggaactcag atccagggac agaggcccca     3480

gcctccctga agcctttgca ttttggagag tttcctgaaa caacttggaa agataactag     3540

gaaatccatt cacagttctt tgggccagac atgccacaag gacttcctgt ccagctccaa     3600

cctgcaaaga tctgtcctca gccttgccag agatccaaaa gaagccccca gctaagaacc     3660

tggaacttgg ggagttaaga cctggcagct ctggacagcc ccaccctggt gggccaacaa     3720

agaacactaa ctatgcatgg tgccccagga ccagctcagg acagatgcca cacaaggata     3780

gatgctggcc cagggcccag agcccagctc caaggggaat cagaactcaa atggggccag     3840

atccagcctg gggtctggag ttgatctgga acccagactc agacattggc acctaatcca     3900

ggcagatcca ggactatatt tgggcctgct ccagacctga tcctggaggc ccagttcacc     3960

ctgatttagg agaagccagg aatttcccag gaccctgaag gggccatgat ggcaacagat     4020

ctggaacctc agcctggcca gacacaggcc ctccctgttc cccagagaaa ggggagccca     4080

ctgtcctggg cctgcagaat ttgggttctg cctgccagct gcactgatgc tgcccctcat     4140

ctctctgccc aacccttccc tcaccttggc accagacacc caggacttat ttaaactctg     4200

ttgcaagtgc aataaatctg acccagtgcc cccactgacc agaactagaa aaaaaaaaaa     4260

aaaaaaa                                                               4267


<210>  293
<211>  1049
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha 5 (fibronectin receptor, alpha 
       polypeptide) (ITGA5), polypeptide GenBank Accession No. 
       NP_002196.2 GI:56237029

<400>  293

Met Gly Ser Arg Thr Pro Glu Ser Pro Leu His Ala Val Gln Leu Arg 
1               5                   10                  15      


Trp Gly Pro Arg Arg Arg Pro Pro Leu Leu Pro Leu Leu Leu Leu Leu 
            20                  25                  30          


Leu Pro Pro Pro Pro Arg Val Gly Gly Phe Asn Leu Asp Ala Glu Ala 
        35                  40                  45              


Pro Ala Val Leu Ser Gly Pro Pro Gly Ser Phe Phe Gly Phe Ser Val 
    50                  55                  60                  


Glu Phe Tyr Arg Pro Gly Thr Asp Gly Val Ser Val Leu Val Gly Ala 
65                  70                  75                  80  


Pro Lys Ala Asn Thr Ser Gln Pro Gly Val Leu Gln Gly Gly Ala Val 
                85                  90                  95      


Tyr Leu Cys Pro Trp Gly Ala Ser Pro Thr Gln Cys Thr Pro Ile Glu 
            100                 105                 110         


Phe Asp Ser Lys Gly Ser Arg Leu Leu Glu Ser Ser Leu Ser Ser Ser 
        115                 120                 125             


Glu Gly Glu Glu Pro Val Glu Tyr Lys Ser Leu Gln Trp Phe Gly Ala 
    130                 135                 140                 


Thr Val Arg Ala His Gly Ser Ser Ile Leu Ala Cys Ala Pro Leu Tyr 
145                 150                 155                 160 


Ser Trp Arg Thr Glu Lys Glu Pro Leu Ser Asp Pro Val Gly Thr Cys 
                165                 170                 175     


Tyr Leu Ser Thr Asp Asn Phe Thr Arg Ile Leu Glu Tyr Ala Pro Cys 
            180                 185                 190         


Arg Ser Asp Phe Ser Trp Ala Ala Gly Gln Gly Tyr Cys Gln Gly Gly 
        195                 200                 205             


Phe Ser Ala Glu Phe Thr Lys Thr Gly Arg Val Val Leu Gly Gly Pro 
    210                 215                 220                 


Gly Ser Tyr Phe Trp Gln Gly Gln Ile Leu Ser Ala Thr Gln Glu Gln 
225                 230                 235                 240 


Ile Ala Glu Ser Tyr Tyr Pro Glu Tyr Leu Ile Asn Leu Val Gln Gly 
                245                 250                 255     


Gln Leu Gln Thr Arg Gln Ala Ser Ser Ile Tyr Asp Asp Ser Tyr Leu 
            260                 265                 270         


Gly Tyr Ser Val Ala Val Gly Glu Phe Ser Gly Asp Asp Thr Glu Asp 
        275                 280                 285             


Phe Val Ala Gly Val Pro Lys Gly Asn Leu Thr Tyr Gly Tyr Val Thr 
    290                 295                 300                 


Ile Leu Asn Gly Ser Asp Ile Arg Ser Leu Tyr Asn Phe Ser Gly Glu 
305                 310                 315                 320 


Gln Met Ala Ser Tyr Phe Gly Tyr Ala Val Ala Ala Thr Asp Val Asn 
                325                 330                 335     


Gly Asp Gly Leu Asp Asp Leu Leu Val Gly Ala Pro Leu Leu Met Asp 
            340                 345                 350         


Arg Thr Pro Asp Gly Arg Pro Gln Glu Val Gly Arg Val Tyr Val Tyr 
        355                 360                 365             


Leu Gln His Pro Ala Gly Ile Glu Pro Thr Pro Thr Leu Thr Leu Thr 
    370                 375                 380                 


Gly His Asp Glu Phe Gly Arg Phe Gly Ser Ser Leu Thr Pro Leu Gly 
385                 390                 395                 400 


Asp Leu Asp Gln Asp Gly Tyr Asn Asp Val Ala Ile Gly Ala Pro Phe 
                405                 410                 415     


Gly Gly Glu Thr Gln Gln Gly Val Val Phe Val Phe Pro Gly Gly Pro 
            420                 425                 430         


Gly Gly Leu Gly Ser Lys Pro Ser Gln Val Leu Gln Pro Leu Trp Ala 
        435                 440                 445             


Ala Ser His Thr Pro Asp Phe Phe Gly Ser Ala Leu Arg Gly Gly Arg 
    450                 455                 460                 


Asp Leu Asp Gly Asn Gly Tyr Pro Asp Leu Ile Val Gly Ser Phe Gly 
465                 470                 475                 480 


Val Asp Lys Ala Val Val Tyr Arg Gly Arg Pro Ile Val Ser Ala Ser 
                485                 490                 495     


Ala Ser Leu Thr Ile Phe Pro Ala Met Phe Asn Pro Glu Glu Arg Ser 
            500                 505                 510         


Cys Ser Leu Glu Gly Asn Pro Val Ala Cys Ile Asn Leu Ser Phe Cys 
        515                 520                 525             


Leu Asn Ala Ser Gly Lys His Val Ala Asp Ser Ile Gly Phe Thr Val 
    530                 535                 540                 


Glu Leu Gln Leu Asp Trp Gln Lys Gln Lys Gly Gly Val Arg Arg Ala 
545                 550                 555                 560 


Leu Phe Leu Ala Ser Arg Gln Ala Thr Leu Thr Gln Thr Leu Leu Ile 
                565                 570                 575     


Gln Asn Gly Ala Arg Glu Asp Cys Arg Glu Met Lys Ile Tyr Leu Arg 
            580                 585                 590         


Asn Glu Ser Glu Phe Arg Asp Lys Leu Ser Pro Ile His Ile Ala Leu 
        595                 600                 605             


Asn Phe Ser Leu Asp Pro Gln Ala Pro Val Asp Ser His Gly Leu Arg 
    610                 615                 620                 


Pro Ala Leu His Tyr Gln Ser Lys Ser Arg Ile Glu Asp Lys Ala Gln 
625                 630                 635                 640 


Ile Leu Leu Asp Cys Gly Glu Asp Asn Ile Cys Val Pro Asp Leu Gln 
                645                 650                 655     


Leu Glu Val Phe Gly Glu Gln Asn His Val Tyr Leu Gly Asp Lys Asn 
            660                 665                 670         


Ala Leu Asn Leu Thr Phe His Ala Gln Asn Val Gly Glu Gly Gly Ala 
        675                 680                 685             


Tyr Glu Ala Glu Leu Arg Val Thr Ala Pro Pro Glu Ala Glu Tyr Ser 
    690                 695                 700                 


Gly Leu Val Arg His Pro Gly Asn Phe Ser Ser Leu Ser Cys Asp Tyr 
705                 710                 715                 720 


Phe Ala Val Asn Gln Ser Arg Leu Leu Val Cys Asp Leu Gly Asn Pro 
                725                 730                 735     


Met Lys Ala Gly Ala Ser Leu Trp Gly Gly Leu Arg Phe Thr Val Pro 
            740                 745                 750         


His Leu Arg Asp Thr Lys Lys Thr Ile Gln Phe Asp Phe Gln Ile Leu 
        755                 760                 765             


Ser Lys Asn Leu Asn Asn Ser Gln Ser Asp Val Val Ser Phe Arg Leu 
    770                 775                 780                 


Ser Val Glu Ala Gln Ala Gln Val Thr Leu Asn Gly Val Ser Lys Pro 
785                 790                 795                 800 


Glu Ala Val Leu Phe Pro Val Ser Asp Trp His Pro Arg Asp Gln Pro 
                805                 810                 815     


Gln Lys Glu Glu Asp Leu Gly Pro Ala Val His His Val Tyr Glu Leu 
            820                 825                 830         


Ile Asn Gln Gly Pro Ser Ser Ile Ser Gln Gly Val Leu Glu Leu Ser 
        835                 840                 845             


Cys Pro Gln Ala Leu Glu Gly Gln Gln Leu Leu Tyr Val Thr Arg Val 
    850                 855                 860                 


Thr Gly Leu Asn Cys Thr Thr Asn His Pro Ile Asn Pro Lys Gly Leu 
865                 870                 875                 880 


Glu Leu Asp Pro Glu Gly Ser Leu His His Gln Gln Lys Arg Glu Ala 
                885                 890                 895     


Pro Ser Arg Ser Ser Ala Ser Ser Gly Pro Gln Ile Leu Lys Cys Pro 
            900                 905                 910         


Glu Ala Glu Cys Phe Arg Leu Arg Cys Glu Leu Gly Pro Leu His Gln 
        915                 920                 925             


Gln Glu Ser Gln Ser Leu Gln Leu His Phe Arg Val Trp Ala Lys Thr 
    930                 935                 940                 


Phe Leu Gln Arg Glu His Gln Pro Phe Ser Leu Gln Cys Glu Ala Val 
945                 950                 955                 960 


Tyr Lys Ala Leu Lys Met Pro Tyr Arg Ile Leu Pro Arg Gln Leu Pro 
                965                 970                 975     


Gln Lys Glu Arg Gln Val Ala Thr Ala Val Gln Trp Thr Lys Ala Glu 
            980                 985                 990         


Gly Ser Tyr Gly Val Pro Leu Trp  Ile Ile Ile Leu Ala  Ile Leu Phe 
        995                 1000                 1005             


Gly Leu  Leu Leu Leu Gly Leu  Leu Ile Tyr Ile Leu  Tyr Lys Leu 
    1010                 1015                 1020             


Gly Phe  Phe Lys Arg Ser Leu  Pro Tyr Gly Thr Ala  Met Glu Lys 
    1025                 1030                 1035             


Ala Gln  Leu Lys Pro Pro Ala  Thr Ser Asp Ala 
    1040                 1045                 


<210>  294
<211>  5680
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha 6 (ITGA6), transcript variant 1, 
       mRNA GenBank Accession No. NM_001079818.1  GI:119395739

<400>  294
aacgggctca ttcagcggtc gcgagctgcc cgcgaggggg agcggccgga cggagagcgc       60

gacccgtccc gggggtgggg ccgggcgcag cggcgagagg aggcgaaggt ggctgcggta      120

gcagcagcgc ggcagcctcg gacccagccc ggagcgcagg gcggccgctg caggtccccg      180

ctcccctccc cgtgcgtccg cccatggccg ccgccgggca gctgtgcttg ctctacctgt      240

cggcggggct cctgtcccgg ctcggcgcag ccttcaactt ggacactcgg gaggacaacg      300

tgatccggaa atatggagac cccgggagcc tcttcggctt ctcgctggcc atgcactggc      360

aactgcagcc cgaggacaag cggctgttgc tcgtgggggc cccgcgggca gaagcgcttc      420

cactgcagag agccaacaga acgggagggc tgtacagctg cgacatcacc gcccgggggc      480

catgcacgcg gatcgagttt gataacgatg ctgaccccac gtcagaaagc aaggaagatc      540

agtggatggg ggtcaccgtc cagagccaag gtccaggggg caaggtcgtg acatgtgctc      600

accgatatga aaaaaggcag catgttaata cgaagcagga atcccgagac atctttgggc      660

ggtgttatgt cctgagtcag aatctcagga ttgaagacga tatggatggg ggagattgga      720

gcttttgtga tgggcgattg agaggccatg agaaatttgg ctcttgccag caaggtgtag      780

cagctacttt tactaaagac tttcattaca ttgtatttgg agccccgggt acttataact      840

ggaaagggat tgttcgtgta gagcaaaaga ataacacttt ttttgacatg aacatctttg      900

aagatgggcc ttatgaagtt ggtggagaga ctgagcatga tgaaagtctc gttcctgttc      960

ctgctaacag ttacttaggt ttttctttgg actcagggaa aggtattgtt tctaaagatg     1020

agatcacttt tgtatctggt gctcccagag ccaatcacag tggagccgtg gttttgctga     1080

agagagacat gaagtctgca catctcctcc ctgagcacat attcgatgga gaaggtctgg     1140

cctcttcatt tggctatgat gtggcggtgg tggacctcaa caaggatggg tggcaagata     1200

tagttattgg agccccacag tattttgata gagatggaga agttggaggt gcagtgtatg     1260

tctacatgaa ccagcaaggc agatggaata atgtgaagcc aattcgtctt aatggaacca     1320

aagattctat gtttggcatt gcagtaaaaa atattggaga tattaatcaa gatggctacc     1380

cagatattgc agttggagct ccgtatgatg acttgggaaa ggtttttatc tatcatggat     1440

ctgcaaatgg aataaatacc aaaccaacac aggttctcaa gggtatatca ccttattttg     1500

gatattcaat tgctggaaac atggaccttg atcgaaattc ctaccctgat gttgctgttg     1560

gttccctctc agattcagta actattttca gatcccggcc tgtgattaat attcagaaaa     1620

ccatcacagt aactcctaac agaattgacc tccgccagaa aacagcgtgt ggggcgccta     1680

gtgggatatg cctccaggtt aaatcctgtt ttgaatatac tgctaacccc gctggttata     1740

atccttcaat atcaattgtg ggcacacttg aagctgaaaa agaaagaaga aaatctgggc     1800

tatcctcaag agttcagttt cgaaaccaag gttctgagcc caaatatact caagaactaa     1860

ctctgaagag gcagaaacag aaagtgtgca tggaggaaac cctgtggcta caggataata     1920

tcagagataa actgcgtccc attcccataa ctgcctcagt ggagatccaa gagccaagct     1980

ctcgtaggcg agtgaattca cttccagaag ttcttccaat tctgaattca gatgaaccca     2040

agacagctca tattgatgtt cacttcttaa aagagggatg tggagacgac aatgtatgta     2100

acagcaacct taaactagaa tataaatttt gcacccgaga aggaaatcaa gacaaatttt     2160

cttatttacc aattcaaaaa ggtgtaccag aactagttct aaaagatcag aaggatattg     2220

ctttagaaat aacagtgaca aacagccctt ccaacccaag gaatcccaca aaagatggcg     2280

atgacgccca tgaggctaaa ctgattgcaa cgtttccaga cactttaacc tattctgcat     2340

atagagaact gagggctttc cctgagaaac agttgagttg tgttgccaac cagaatggct     2400

cgcaagctga ctgtgagctc ggaaatcctt ttaaaagaaa ttcaaatgtc actttttatt     2460

tggttttaag tacaactgaa gtcacctttg acaccccaga tctggatatt aatctgaagt     2520

tagaaacaac aagcaatcaa gataatttgg ctccaattac agctaaagca aaagtggtta     2580

ttgaactgct tttatcggtc tcgggagttg ctaaaccttc ccaggtgtat tttggaggta     2640

cagttgttgg cgagcaagct atgaaatctg aagatgaagt gggaagttta atagagtatg     2700

aattcagggt aataaactta ggtaaacctc ttacaaacct cggcacagca accttgaaca     2760

ttcagtggcc aaaagaaatt agcaatggga aatggttgct ttatttggtg aaagtagaat     2820

ccaaaggatt ggaaaaggta acttgtgagc cacaaaagga gataaactcc ctgaacctaa     2880

cggagtctca caactcaaga aagaaacggg aaattactga aaaacagata gatgataaca     2940

gaaaattttc tttatttgct gaaagaaaat accagactct taactgtagc gtgaacgtga     3000

actgtgtgaa catcagatgc ccgctgcggg ggctggacag caaggcgtct cttattttgc     3060

gctcgaggtt atggaacagc acatttctag aggaatattc caaactgaac tacttggaca     3120

ttctcatgcg agccttcatt gatgtgactg ctgctgccga aaatatcagg ctgccaaatg     3180

caggcactca ggttcgagtg actgtgtttc cctcaaagac tgtagctcag tattcgggag     3240

taccttggtg gatcatccta gtggctattc tcgctgggat cttgatgctt gctttattag     3300

tgtttatact atggaagtgt ggattcttta aacgctctag gtacgatgac agtgttcccc     3360

gataccatgc tgtaaggatc cggaaagaag agcgagagat caaagatgaa aagtatattg     3420

ataaccttga aaaaaaacag tggatcacaa agtggaacga aaatgaaagc tactcatagc     3480

gggggcctaa aaaaaaaaag cttcacagta cccaaactgc tttttccaac tcagaaattc     3540

aatttggatt taaaagcctg ctcaatccct gaggactgat ttcagagtga ctacacacag     3600

tacgaaccta cagttttaac tgtggatatt gttacgtagc ctaaggctcc tgttttgcac     3660

agccaaattt aaaactgttg gaatggattt ttctttaact gccgtaattt aactttctgg     3720

gttgccttta tttttggcgt ggctgactta catcatgtgt tggggaaggg cctgcccagt     3780

tgcactcagg tgacatcctc cagatagtgt agctgaggag gcacctacac tcacctgcac     3840

taacagagtg gccgtcctaa cctcgggcct gctgcgcaga cgtccatcac gttagctgtc     3900

ccacatcaca agactatgcc attggggtag ttgtgtttca acggaaagtg ctgtcttaaa     3960

ctaaatgtgc aatagaaggt gatgttgcca tcctaccgtc ttttcctgtt tcctagctgt     4020

gtgaatacct gctcacgtca aatgcataca agtttcattc tccctttcac taaaacacac     4080

aggtgcaaca gacttgaatg ctagttatac ttatttgtat atggtattta ttttttcttt     4140

tctttacaaa ccattttgtt attgactaac aggccaaaga gtctccagtt tacccttcag     4200

gttggtttaa tcaatcagaa ttagagcatg ggaggtcatc actttgacct aaattattta     4260

ctgcaaaaag aaaatcttta taaatgtacc agagagagtt gttttaataa cttatctata     4320

aactataacc tctccttcat gacagcctcc accccacaac ccaaaaggtt taagaaatag     4380

aattataact gtaaagatgt ttatttcagg cattggatat tttttacttt agaagcctgc     4440

ataatgtttc tggatttcat actgtaacat tcaggaattc ttggagaaaa tgggtttatt     4500

cactgaactc tagtgcggtt tactcactgc tgcaaatact gtatattcag gacttgaaag     4560

aaatggtgaa tgcctatggt ggatccaaac tgatccagta taagactact gaatctgcta     4620

ccaaaacagt taatcagtga gtcgatgttc tattttttgt tttgtttcct cccctatctg     4680

tattcccaaa aattactttg gggctaattt aacaagaact ttaaattgtg ttttaattgt     4740

aaaaatggca gggggtggaa ttattactct atacattcaa cagagactga atagatatga     4800

aagctgattt tttttaatta ccatgcttca caatgttaag ttatatgggg agcaacagca     4860

aacaggtgct aatttgtttt ggatatagta taagcagtgt ctgtgttttg aaagaataga     4920

acacagtttg tagtgccact gttgttttgg gggggctttt ttcttttcgg aaatcttaaa     4980

ccttaagata ctaaggacgt tgttttggtt gtactttgga attcttagtc acaaaatata     5040

ttttgtttac aaaaatttct gtaaaacagg ttataacagt gtttaaagtc tcagtttctt     5100

gcttggggaa cttgtgtccc taatgtgttt agattgctag attgctaagg agctgatact     5160

ttgacagtgt ttttagacct gtgttactaa aaaaaagatg aatgtcctga aaagggtgtt     5220

gggagggtgg ttcaacaaag aaacaaagat gttatggtgt ttagatttat ggttgttaaa     5280

aatgtcatct caagtcaagt cactggtctg tttgcatttg atacattttt gtactaacta     5340

gcattgtaaa attatttcat gattagaaat tacctgtgga tatttgtata aaagtgtgaa     5400

ataaattttt tataaaagtg ttcattgttt cgtaacacag cattgtatat gtgaagcaaa     5460

ctctaaaatt ataaatgaca acctgaatta tctatttcat caaaccaaag ttcagtgttt     5520

ttatttttgg tgtctcatgt aatctcagat cagccaaaga tactagtgcc aaagcaatgg     5580

gattcggggt ttttttctgt tttcgctcta tgtaggtgat cctcaagtct ttcattttcc     5640

ttctttatga ttaaaagaaa cctacaggta tttaacaacc                           5680


<210>  295
<211>  1091
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha 6 (ITGA6), transcript variant 1, 
       polypeptide GenBank Accession No. NP_001073286.1 GI:119395740

<400>  295

Met Ala Ala Ala Gly Gln Leu Cys Leu Leu Tyr Leu Ser Ala Gly Leu 
1               5                   10                  15      


Leu Ser Arg Leu Gly Ala Ala Phe Asn Leu Asp Thr Arg Glu Asp Asn 
            20                  25                  30          


Val Ile Arg Lys Tyr Gly Asp Pro Gly Ser Leu Phe Gly Phe Ser Leu 
        35                  40                  45              


Ala Met His Trp Gln Leu Gln Pro Glu Asp Lys Arg Leu Leu Leu Val 
    50                  55                  60                  


Gly Ala Pro Arg Ala Glu Ala Leu Pro Leu Gln Arg Ala Asn Arg Thr 
65                  70                  75                  80  


Gly Gly Leu Tyr Ser Cys Asp Ile Thr Ala Arg Gly Pro Cys Thr Arg 
                85                  90                  95      


Ile Glu Phe Asp Asn Asp Ala Asp Pro Thr Ser Glu Ser Lys Glu Asp 
            100                 105                 110         


Gln Trp Met Gly Val Thr Val Gln Ser Gln Gly Pro Gly Gly Lys Val 
        115                 120                 125             


Val Thr Cys Ala His Arg Tyr Glu Lys Arg Gln His Val Asn Thr Lys 
    130                 135                 140                 


Gln Glu Ser Arg Asp Ile Phe Gly Arg Cys Tyr Val Leu Ser Gln Asn 
145                 150                 155                 160 


Leu Arg Ile Glu Asp Asp Met Asp Gly Gly Asp Trp Ser Phe Cys Asp 
                165                 170                 175     


Gly Arg Leu Arg Gly His Glu Lys Phe Gly Ser Cys Gln Gln Gly Val 
            180                 185                 190         


Ala Ala Thr Phe Thr Lys Asp Phe His Tyr Ile Val Phe Gly Ala Pro 
        195                 200                 205             


Gly Thr Tyr Asn Trp Lys Gly Ile Val Arg Val Glu Gln Lys Asn Asn 
    210                 215                 220                 


Thr Phe Phe Asp Met Asn Ile Phe Glu Asp Gly Pro Tyr Glu Val Gly 
225                 230                 235                 240 


Gly Glu Thr Glu His Asp Glu Ser Leu Val Pro Val Pro Ala Asn Ser 
                245                 250                 255     


Tyr Leu Gly Phe Ser Leu Asp Ser Gly Lys Gly Ile Val Ser Lys Asp 
            260                 265                 270         


Glu Ile Thr Phe Val Ser Gly Ala Pro Arg Ala Asn His Ser Gly Ala 
        275                 280                 285             


Val Val Leu Leu Lys Arg Asp Met Lys Ser Ala His Leu Leu Pro Glu 
    290                 295                 300                 


His Ile Phe Asp Gly Glu Gly Leu Ala Ser Ser Phe Gly Tyr Asp Val 
305                 310                 315                 320 


Ala Val Val Asp Leu Asn Lys Asp Gly Trp Gln Asp Ile Val Ile Gly 
                325                 330                 335     


Ala Pro Gln Tyr Phe Asp Arg Asp Gly Glu Val Gly Gly Ala Val Tyr 
            340                 345                 350         


Val Tyr Met Asn Gln Gln Gly Arg Trp Asn Asn Val Lys Pro Ile Arg 
        355                 360                 365             


Leu Asn Gly Thr Lys Asp Ser Met Phe Gly Ile Ala Val Lys Asn Ile 
    370                 375                 380                 


Gly Asp Ile Asn Gln Asp Gly Tyr Pro Asp Ile Ala Val Gly Ala Pro 
385                 390                 395                 400 


Tyr Asp Asp Leu Gly Lys Val Phe Ile Tyr His Gly Ser Ala Asn Gly 
                405                 410                 415     


Ile Asn Thr Lys Pro Thr Gln Val Leu Lys Gly Ile Ser Pro Tyr Phe 
            420                 425                 430         


Gly Tyr Ser Ile Ala Gly Asn Met Asp Leu Asp Arg Asn Ser Tyr Pro 
        435                 440                 445             


Asp Val Ala Val Gly Ser Leu Ser Asp Ser Val Thr Ile Phe Arg Ser 
    450                 455                 460                 


Arg Pro Val Ile Asn Ile Gln Lys Thr Ile Thr Val Thr Pro Asn Arg 
465                 470                 475                 480 


Ile Asp Leu Arg Gln Lys Thr Ala Cys Gly Ala Pro Ser Gly Ile Cys 
                485                 490                 495     


Leu Gln Val Lys Ser Cys Phe Glu Tyr Thr Ala Asn Pro Ala Gly Tyr 
            500                 505                 510         


Asn Pro Ser Ile Ser Ile Val Gly Thr Leu Glu Ala Glu Lys Glu Arg 
        515                 520                 525             


Arg Lys Ser Gly Leu Ser Ser Arg Val Gln Phe Arg Asn Gln Gly Ser 
    530                 535                 540                 


Glu Pro Lys Tyr Thr Gln Glu Leu Thr Leu Lys Arg Gln Lys Gln Lys 
545                 550                 555                 560 


Val Cys Met Glu Glu Thr Leu Trp Leu Gln Asp Asn Ile Arg Asp Lys 
                565                 570                 575     


Leu Arg Pro Ile Pro Ile Thr Ala Ser Val Glu Ile Gln Glu Pro Ser 
            580                 585                 590         


Ser Arg Arg Arg Val Asn Ser Leu Pro Glu Val Leu Pro Ile Leu Asn 
        595                 600                 605             


Ser Asp Glu Pro Lys Thr Ala His Ile Asp Val His Phe Leu Lys Glu 
    610                 615                 620                 


Gly Cys Gly Asp Asp Asn Val Cys Asn Ser Asn Leu Lys Leu Glu Tyr 
625                 630                 635                 640 


Lys Phe Cys Thr Arg Glu Gly Asn Gln Asp Lys Phe Ser Tyr Leu Pro 
                645                 650                 655     


Ile Gln Lys Gly Val Pro Glu Leu Val Leu Lys Asp Gln Lys Asp Ile 
            660                 665                 670         


Ala Leu Glu Ile Thr Val Thr Asn Ser Pro Ser Asn Pro Arg Asn Pro 
        675                 680                 685             


Thr Lys Asp Gly Asp Asp Ala His Glu Ala Lys Leu Ile Ala Thr Phe 
    690                 695                 700                 


Pro Asp Thr Leu Thr Tyr Ser Ala Tyr Arg Glu Leu Arg Ala Phe Pro 
705                 710                 715                 720 


Glu Lys Gln Leu Ser Cys Val Ala Asn Gln Asn Gly Ser Gln Ala Asp 
                725                 730                 735     


Cys Glu Leu Gly Asn Pro Phe Lys Arg Asn Ser Asn Val Thr Phe Tyr 
            740                 745                 750         


Leu Val Leu Ser Thr Thr Glu Val Thr Phe Asp Thr Pro Asp Leu Asp 
        755                 760                 765             


Ile Asn Leu Lys Leu Glu Thr Thr Ser Asn Gln Asp Asn Leu Ala Pro 
    770                 775                 780                 


Ile Thr Ala Lys Ala Lys Val Val Ile Glu Leu Leu Leu Ser Val Ser 
785                 790                 795                 800 


Gly Val Ala Lys Pro Ser Gln Val Tyr Phe Gly Gly Thr Val Val Gly 
                805                 810                 815     


Glu Gln Ala Met Lys Ser Glu Asp Glu Val Gly Ser Leu Ile Glu Tyr 
            820                 825                 830         


Glu Phe Arg Val Ile Asn Leu Gly Lys Pro Leu Thr Asn Leu Gly Thr 
        835                 840                 845             


Ala Thr Leu Asn Ile Gln Trp Pro Lys Glu Ile Ser Asn Gly Lys Trp 
    850                 855                 860                 


Leu Leu Tyr Leu Val Lys Val Glu Ser Lys Gly Leu Glu Lys Val Thr 
865                 870                 875                 880 


Cys Glu Pro Gln Lys Glu Ile Asn Ser Leu Asn Leu Thr Glu Ser His 
                885                 890                 895     


Asn Ser Arg Lys Lys Arg Glu Ile Thr Glu Lys Gln Ile Asp Asp Asn 
            900                 905                 910         


Arg Lys Phe Ser Leu Phe Ala Glu Arg Lys Tyr Gln Thr Leu Asn Cys 
        915                 920                 925             


Ser Val Asn Val Asn Cys Val Asn Ile Arg Cys Pro Leu Arg Gly Leu 
    930                 935                 940                 


Asp Ser Lys Ala Ser Leu Ile Leu Arg Ser Arg Leu Trp Asn Ser Thr 
945                 950                 955                 960 


Phe Leu Glu Glu Tyr Ser Lys Leu Asn Tyr Leu Asp Ile Leu Met Arg 
                965                 970                 975     


Ala Phe Ile Asp Val Thr Ala Ala Ala Glu Asn Ile Arg Leu Pro Asn 
            980                 985                 990         


Ala Gly Thr Gln Val Arg Val Thr  Val Phe Pro Ser Lys  Thr Val Ala 
        995                 1000                 1005             


Gln Tyr  Ser Gly Val Pro Trp  Trp Ile Ile Leu Val  Ala Ile Leu 
    1010                 1015                 1020             


Ala Gly  Ile Leu Met Leu Ala  Leu Leu Val Phe Ile  Leu Trp Lys 
    1025                 1030                 1035             


Cys Gly  Phe Phe Lys Arg Ser  Arg Tyr Asp Asp Ser  Val Pro Arg 
    1040                 1045                 1050             


Tyr His  Ala Val Arg Ile Arg  Lys Glu Glu Arg Glu  Ile Lys Asp 
    1055                 1060                 1065             


Glu Lys  Tyr Ile Asp Asn Leu  Glu Lys Lys Gln Trp  Ile Thr Lys 
    1070                 1075                 1080             


Trp Asn  Glu Asn Glu Ser Tyr  Ser 
    1085                 1090     


<210>  296
<211>  5810
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha 6 (ITGA6), transcript variant 2, 
       mRNA GenBank Accession No. NM_000210.2  GI:119395741

<400>  296
aacgggctca ttcagcggtc gcgagctgcc cgcgaggggg agcggccgga cggagagcgc       60

gacccgtccc gggggtgggg ccgggcgcag cggcgagagg aggcgaaggt ggctgcggta      120

gcagcagcgc ggcagcctcg gacccagccc ggagcgcagg gcggccgctg caggtccccg      180

ctcccctccc cgtgcgtccg cccatggccg ccgccgggca gctgtgcttg ctctacctgt      240

cggcggggct cctgtcccgg ctcggcgcag ccttcaactt ggacactcgg gaggacaacg      300

tgatccggaa atatggagac cccgggagcc tcttcggctt ctcgctggcc atgcactggc      360

aactgcagcc cgaggacaag cggctgttgc tcgtgggggc cccgcgggca gaagcgcttc      420

cactgcagag agccaacaga acgggagggc tgtacagctg cgacatcacc gcccgggggc      480

catgcacgcg gatcgagttt gataacgatg ctgaccccac gtcagaaagc aaggaagatc      540

agtggatggg ggtcaccgtc cagagccaag gtccaggggg caaggtcgtg acatgtgctc      600

accgatatga aaaaaggcag catgttaata cgaagcagga atcccgagac atctttgggc      660

ggtgttatgt cctgagtcag aatctcagga ttgaagacga tatggatggg ggagattgga      720

gcttttgtga tgggcgattg agaggccatg agaaatttgg ctcttgccag caaggtgtag      780

cagctacttt tactaaagac tttcattaca ttgtatttgg agccccgggt acttataact      840

ggaaagggat tgttcgtgta gagcaaaaga ataacacttt ttttgacatg aacatctttg      900

aagatgggcc ttatgaagtt ggtggagaga ctgagcatga tgaaagtctc gttcctgttc      960

ctgctaacag ttacttaggt ttttctttgg actcagggaa aggtattgtt tctaaagatg     1020

agatcacttt tgtatctggt gctcccagag ccaatcacag tggagccgtg gttttgctga     1080

agagagacat gaagtctgca catctcctcc ctgagcacat attcgatgga gaaggtctgg     1140

cctcttcatt tggctatgat gtggcggtgg tggacctcaa caaggatggg tggcaagata     1200

tagttattgg agccccacag tattttgata gagatggaga agttggaggt gcagtgtatg     1260

tctacatgaa ccagcaaggc agatggaata atgtgaagcc aattcgtctt aatggaacca     1320

aagattctat gtttggcatt gcagtaaaaa atattggaga tattaatcaa gatggctacc     1380

cagatattgc agttggagct ccgtatgatg acttgggaaa ggtttttatc tatcatggat     1440

ctgcaaatgg aataaatacc aaaccaacac aggttctcaa gggtatatca ccttattttg     1500

gatattcaat tgctggaaac atggaccttg atcgaaattc ctaccctgat gttgctgttg     1560

gttccctctc agattcagta actattttca gatcccggcc tgtgattaat attcagaaaa     1620

ccatcacagt aactcctaac agaattgacc tccgccagaa aacagcgtgt ggggcgccta     1680

gtgggatatg cctccaggtt aaatcctgtt ttgaatatac tgctaacccc gctggttata     1740

atccttcaat atcaattgtg ggcacacttg aagctgaaaa agaaagaaga aaatctgggc     1800

tatcctcaag agttcagttt cgaaaccaag gttctgagcc caaatatact caagaactaa     1860

ctctgaagag gcagaaacag aaagtgtgca tggaggaaac cctgtggcta caggataata     1920

tcagagataa actgcgtccc attcccataa ctgcctcagt ggagatccaa gagccaagct     1980

ctcgtaggcg agtgaattca cttccagaag ttcttccaat tctgaattca gatgaaccca     2040

agacagctca tattgatgtt cacttcttaa aagagggatg tggagacgac aatgtatgta     2100

acagcaacct taaactagaa tataaatttt gcacccgaga aggaaatcaa gacaaatttt     2160

cttatttacc aattcaaaaa ggtgtaccag aactagttct aaaagatcag aaggatattg     2220

ctttagaaat aacagtgaca aacagccctt ccaacccaag gaatcccaca aaagatggcg     2280

atgacgccca tgaggctaaa ctgattgcaa cgtttccaga cactttaacc tattctgcat     2340

atagagaact gagggctttc cctgagaaac agttgagttg tgttgccaac cagaatggct     2400

cgcaagctga ctgtgagctc ggaaatcctt ttaaaagaaa ttcaaatgtc actttttatt     2460

tggttttaag tacaactgaa gtcacctttg acaccccaga tctggatatt aatctgaagt     2520

tagaaacaac aagcaatcaa gataatttgg ctccaattac agctaaagca aaagtggtta     2580

ttgaactgct tttatcggtc tcgggagttg ctaaaccttc ccaggtgtat tttggaggta     2640

cagttgttgg cgagcaagct atgaaatctg aagatgaagt gggaagttta atagagtatg     2700

aattcagggt aataaactta ggtaaacctc ttacaaacct cggcacagca accttgaaca     2760

ttcagtggcc aaaagaaatt agcaatggga aatggttgct ttatttggtg aaagtagaat     2820

ccaaaggatt ggaaaaggta acttgtgagc cacaaaagga gataaactcc ctgaacctaa     2880

cggagtctca caactcaaga aagaaacggg aaattactga aaaacagata gatgataaca     2940

gaaaattttc tttatttgct gaaagaaaat accagactct taactgtagc gtgaacgtga     3000

actgtgtgaa catcagatgc ccgctgcggg ggctggacag caaggcgtct cttattttgc     3060

gctcgaggtt atggaacagc acatttctag aggaatattc caaactgaac tacttggaca     3120

ttctcatgcg agccttcatt gatgtgactg ctgctgccga aaatatcagg ctgccaaatg     3180

caggcactca ggttcgagtg actgtgtttc cctcaaagac tgtagctcag tattcgggag     3240

taccttggtg gatcatccta gtggctattc tcgctgggat cttgatgctt gctttattag     3300

tgtttatact atggaagtgt ggtttcttca agagaaataa gaaagatcat tatgatgcca     3360

catatcacaa ggctgagatc catgctcagc catctgataa agagaggctt acttctgatg     3420

catagtattg atctacttct gtaattgtgt ggattcttta aacgctctag gtacgatgac     3480

agtgttcccc gataccatgc tgtaaggatc cggaaagaag agcgagagat caaagatgaa     3540

aagtatattg ataaccttga aaaaaaacag tggatcacaa agtggaacga aaatgaaagc     3600

tactcatagc gggggcctaa aaaaaaaaag cttcacagta cccaaactgc tttttccaac     3660

tcagaaattc aatttggatt taaaagcctg ctcaatccct gaggactgat ttcagagtga     3720

ctacacacag tacgaaccta cagttttaac tgtggatatt gttacgtagc ctaaggctcc     3780

tgttttgcac agccaaattt aaaactgttg gaatggattt ttctttaact gccgtaattt     3840

aactttctgg gttgccttta tttttggcgt ggctgactta catcatgtgt tggggaaggg     3900

cctgcccagt tgcactcagg tgacatcctc cagatagtgt agctgaggag gcacctacac     3960

tcacctgcac taacagagtg gccgtcctaa cctcgggcct gctgcgcaga cgtccatcac     4020

gttagctgtc ccacatcaca agactatgcc attggggtag ttgtgtttca acggaaagtg     4080

ctgtcttaaa ctaaatgtgc aatagaaggt gatgttgcca tcctaccgtc ttttcctgtt     4140

tcctagctgt gtgaatacct gctcacgtca aatgcataca agtttcattc tccctttcac     4200

taaaacacac aggtgcaaca gacttgaatg ctagttatac ttatttgtat atggtattta     4260

ttttttcttt tctttacaaa ccattttgtt attgactaac aggccaaaga gtctccagtt     4320

tacccttcag gttggtttaa tcaatcagaa ttagagcatg ggaggtcatc actttgacct     4380

aaattattta ctgcaaaaag aaaatcttta taaatgtacc agagagagtt gttttaataa     4440

cttatctata aactataacc tctccttcat gacagcctcc accccacaac ccaaaaggtt     4500

taagaaatag aattataact gtaaagatgt ttatttcagg cattggatat tttttacttt     4560

agaagcctgc ataatgtttc tggatttcat actgtaacat tcaggaattc ttggagaaaa     4620

tgggtttatt cactgaactc tagtgcggtt tactcactgc tgcaaatact gtatattcag     4680

gacttgaaag aaatggtgaa tgcctatggt ggatccaaac tgatccagta taagactact     4740

gaatctgcta ccaaaacagt taatcagtga gtcgatgttc tattttttgt tttgtttcct     4800

cccctatctg tattcccaaa aattactttg gggctaattt aacaagaact ttaaattgtg     4860

ttttaattgt aaaaatggca gggggtggaa ttattactct atacattcaa cagagactga     4920

atagatatga aagctgattt tttttaatta ccatgcttca caatgttaag ttatatgggg     4980

agcaacagca aacaggtgct aatttgtttt ggatatagta taagcagtgt ctgtgttttg     5040

aaagaataga acacagtttg tagtgccact gttgttttgg gggggctttt ttcttttcgg     5100

aaatcttaaa ccttaagata ctaaggacgt tgttttggtt gtactttgga attcttagtc     5160

acaaaatata ttttgtttac aaaaatttct gtaaaacagg ttataacagt gtttaaagtc     5220

tcagtttctt gcttggggaa cttgtgtccc taatgtgttt agattgctag attgctaagg     5280

agctgatact ttgacagtgt ttttagacct gtgttactaa aaaaaagatg aatgtcctga     5340

aaagggtgtt gggagggtgg ttcaacaaag aaacaaagat gttatggtgt ttagatttat     5400

ggttgttaaa aatgtcatct caagtcaagt cactggtctg tttgcatttg atacattttt     5460

gtactaacta gcattgtaaa attatttcat gattagaaat tacctgtgga tatttgtata     5520

aaagtgtgaa ataaattttt tataaaagtg ttcattgttt cgtaacacag cattgtatat     5580

gtgaagcaaa ctctaaaatt ataaatgaca acctgaatta tctatttcat caaaccaaag     5640

ttcagtgttt ttatttttgg tgtctcatgt aatctcagat cagccaaaga tactagtgcc     5700

aaagcaatgg gattcggggt ttttttctgt tttcgctcta tgtaggtgat cctcaagtct     5760

ttcattttcc ttctttatga ttaaaagaaa cctacaggta tttaacaacc                5810


<210>  297
<211>  1073
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha 6 (ITGA6), transcript variant 2, 
       polypeptide GenBank Accession No. NP_000201.2 GI:119395742

<400>  297

Met Ala Ala Ala Gly Gln Leu Cys Leu Leu Tyr Leu Ser Ala Gly Leu 
1               5                   10                  15      


Leu Ser Arg Leu Gly Ala Ala Phe Asn Leu Asp Thr Arg Glu Asp Asn 
            20                  25                  30          


Val Ile Arg Lys Tyr Gly Asp Pro Gly Ser Leu Phe Gly Phe Ser Leu 
        35                  40                  45              


Ala Met His Trp Gln Leu Gln Pro Glu Asp Lys Arg Leu Leu Leu Val 
    50                  55                  60                  


Gly Ala Pro Arg Ala Glu Ala Leu Pro Leu Gln Arg Ala Asn Arg Thr 
65                  70                  75                  80  


Gly Gly Leu Tyr Ser Cys Asp Ile Thr Ala Arg Gly Pro Cys Thr Arg 
                85                  90                  95      


Ile Glu Phe Asp Asn Asp Ala Asp Pro Thr Ser Glu Ser Lys Glu Asp 
            100                 105                 110         


Gln Trp Met Gly Val Thr Val Gln Ser Gln Gly Pro Gly Gly Lys Val 
        115                 120                 125             


Val Thr Cys Ala His Arg Tyr Glu Lys Arg Gln His Val Asn Thr Lys 
    130                 135                 140                 


Gln Glu Ser Arg Asp Ile Phe Gly Arg Cys Tyr Val Leu Ser Gln Asn 
145                 150                 155                 160 


Leu Arg Ile Glu Asp Asp Met Asp Gly Gly Asp Trp Ser Phe Cys Asp 
                165                 170                 175     


Gly Arg Leu Arg Gly His Glu Lys Phe Gly Ser Cys Gln Gln Gly Val 
            180                 185                 190         


Ala Ala Thr Phe Thr Lys Asp Phe His Tyr Ile Val Phe Gly Ala Pro 
        195                 200                 205             


Gly Thr Tyr Asn Trp Lys Gly Ile Val Arg Val Glu Gln Lys Asn Asn 
    210                 215                 220                 


Thr Phe Phe Asp Met Asn Ile Phe Glu Asp Gly Pro Tyr Glu Val Gly 
225                 230                 235                 240 


Gly Glu Thr Glu His Asp Glu Ser Leu Val Pro Val Pro Ala Asn Ser 
                245                 250                 255     


Tyr Leu Gly Phe Ser Leu Asp Ser Gly Lys Gly Ile Val Ser Lys Asp 
            260                 265                 270         


Glu Ile Thr Phe Val Ser Gly Ala Pro Arg Ala Asn His Ser Gly Ala 
        275                 280                 285             


Val Val Leu Leu Lys Arg Asp Met Lys Ser Ala His Leu Leu Pro Glu 
    290                 295                 300                 


His Ile Phe Asp Gly Glu Gly Leu Ala Ser Ser Phe Gly Tyr Asp Val 
305                 310                 315                 320 


Ala Val Val Asp Leu Asn Lys Asp Gly Trp Gln Asp Ile Val Ile Gly 
                325                 330                 335     


Ala Pro Gln Tyr Phe Asp Arg Asp Gly Glu Val Gly Gly Ala Val Tyr 
            340                 345                 350         


Val Tyr Met Asn Gln Gln Gly Arg Trp Asn Asn Val Lys Pro Ile Arg 
        355                 360                 365             


Leu Asn Gly Thr Lys Asp Ser Met Phe Gly Ile Ala Val Lys Asn Ile 
    370                 375                 380                 


Gly Asp Ile Asn Gln Asp Gly Tyr Pro Asp Ile Ala Val Gly Ala Pro 
385                 390                 395                 400 


Tyr Asp Asp Leu Gly Lys Val Phe Ile Tyr His Gly Ser Ala Asn Gly 
                405                 410                 415     


Ile Asn Thr Lys Pro Thr Gln Val Leu Lys Gly Ile Ser Pro Tyr Phe 
            420                 425                 430         


Gly Tyr Ser Ile Ala Gly Asn Met Asp Leu Asp Arg Asn Ser Tyr Pro 
        435                 440                 445             


Asp Val Ala Val Gly Ser Leu Ser Asp Ser Val Thr Ile Phe Arg Ser 
    450                 455                 460                 


Arg Pro Val Ile Asn Ile Gln Lys Thr Ile Thr Val Thr Pro Asn Arg 
465                 470                 475                 480 


Ile Asp Leu Arg Gln Lys Thr Ala Cys Gly Ala Pro Ser Gly Ile Cys 
                485                 490                 495     


Leu Gln Val Lys Ser Cys Phe Glu Tyr Thr Ala Asn Pro Ala Gly Tyr 
            500                 505                 510         


Asn Pro Ser Ile Ser Ile Val Gly Thr Leu Glu Ala Glu Lys Glu Arg 
        515                 520                 525             


Arg Lys Ser Gly Leu Ser Ser Arg Val Gln Phe Arg Asn Gln Gly Ser 
    530                 535                 540                 


Glu Pro Lys Tyr Thr Gln Glu Leu Thr Leu Lys Arg Gln Lys Gln Lys 
545                 550                 555                 560 


Val Cys Met Glu Glu Thr Leu Trp Leu Gln Asp Asn Ile Arg Asp Lys 
                565                 570                 575     


Leu Arg Pro Ile Pro Ile Thr Ala Ser Val Glu Ile Gln Glu Pro Ser 
            580                 585                 590         


Ser Arg Arg Arg Val Asn Ser Leu Pro Glu Val Leu Pro Ile Leu Asn 
        595                 600                 605             


Ser Asp Glu Pro Lys Thr Ala His Ile Asp Val His Phe Leu Lys Glu 
    610                 615                 620                 


Gly Cys Gly Asp Asp Asn Val Cys Asn Ser Asn Leu Lys Leu Glu Tyr 
625                 630                 635                 640 


Lys Phe Cys Thr Arg Glu Gly Asn Gln Asp Lys Phe Ser Tyr Leu Pro 
                645                 650                 655     


Ile Gln Lys Gly Val Pro Glu Leu Val Leu Lys Asp Gln Lys Asp Ile 
            660                 665                 670         


Ala Leu Glu Ile Thr Val Thr Asn Ser Pro Ser Asn Pro Arg Asn Pro 
        675                 680                 685             


Thr Lys Asp Gly Asp Asp Ala His Glu Ala Lys Leu Ile Ala Thr Phe 
    690                 695                 700                 


Pro Asp Thr Leu Thr Tyr Ser Ala Tyr Arg Glu Leu Arg Ala Phe Pro 
705                 710                 715                 720 


Glu Lys Gln Leu Ser Cys Val Ala Asn Gln Asn Gly Ser Gln Ala Asp 
                725                 730                 735     


Cys Glu Leu Gly Asn Pro Phe Lys Arg Asn Ser Asn Val Thr Phe Tyr 
            740                 745                 750         


Leu Val Leu Ser Thr Thr Glu Val Thr Phe Asp Thr Pro Asp Leu Asp 
        755                 760                 765             


Ile Asn Leu Lys Leu Glu Thr Thr Ser Asn Gln Asp Asn Leu Ala Pro 
    770                 775                 780                 


Ile Thr Ala Lys Ala Lys Val Val Ile Glu Leu Leu Leu Ser Val Ser 
785                 790                 795                 800 


Gly Val Ala Lys Pro Ser Gln Val Tyr Phe Gly Gly Thr Val Val Gly 
                805                 810                 815     


Glu Gln Ala Met Lys Ser Glu Asp Glu Val Gly Ser Leu Ile Glu Tyr 
            820                 825                 830         


Glu Phe Arg Val Ile Asn Leu Gly Lys Pro Leu Thr Asn Leu Gly Thr 
        835                 840                 845             


Ala Thr Leu Asn Ile Gln Trp Pro Lys Glu Ile Ser Asn Gly Lys Trp 
    850                 855                 860                 


Leu Leu Tyr Leu Val Lys Val Glu Ser Lys Gly Leu Glu Lys Val Thr 
865                 870                 875                 880 


Cys Glu Pro Gln Lys Glu Ile Asn Ser Leu Asn Leu Thr Glu Ser His 
                885                 890                 895     


Asn Ser Arg Lys Lys Arg Glu Ile Thr Glu Lys Gln Ile Asp Asp Asn 
            900                 905                 910         


Arg Lys Phe Ser Leu Phe Ala Glu Arg Lys Tyr Gln Thr Leu Asn Cys 
        915                 920                 925             


Ser Val Asn Val Asn Cys Val Asn Ile Arg Cys Pro Leu Arg Gly Leu 
    930                 935                 940                 


Asp Ser Lys Ala Ser Leu Ile Leu Arg Ser Arg Leu Trp Asn Ser Thr 
945                 950                 955                 960 


Phe Leu Glu Glu Tyr Ser Lys Leu Asn Tyr Leu Asp Ile Leu Met Arg 
                965                 970                 975     


Ala Phe Ile Asp Val Thr Ala Ala Ala Glu Asn Ile Arg Leu Pro Asn 
            980                 985                 990         


Ala Gly Thr Gln Val Arg Val Thr  Val Phe Pro Ser Lys  Thr Val Ala 
        995                 1000                 1005             


Gln Tyr  Ser Gly Val Pro Trp  Trp Ile Ile Leu Val  Ala Ile Leu 
    1010                 1015                 1020             


Ala Gly  Ile Leu Met Leu Ala  Leu Leu Val Phe Ile  Leu Trp Lys 
    1025                 1030                 1035             


Cys Gly  Phe Phe Lys Arg Asn  Lys Lys Asp His Tyr  Asp Ala Thr 
    1040                 1045                 1050             


Tyr His  Lys Ala Glu Ile His  Ala Gln Pro Ser Asp  Lys Glu Arg 
    1055                 1060                 1065             


Leu Thr  Ser Asp Ala 
    1070             


<210>  298
<211>  4150
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha 7 (ITGA7), transcript variant 1, 
       mRNA GenBank Accession No. NM_001144996.1  GI:222418612

<400>  298
gggcgccgga gctgcggctg ctgtagttgt cctagccggt gctggggcgg cggggtggcg       60

gagcggcggg cgggcgggag ggctggcggg gcgaacgtct gggagacgtc tgaaagacca      120

acgagacttt ggagaccaga gacgcgcctg gggggacctg gggcttgggg cgtgcgagat      180

ttcccttgca ttcgctggga gctcgcgcag ggatcgtccc atggccgggg ctcggagccg      240

cgacccttgg ggggcctccg ggatttgcta cctttttggc tccctgctcg tcgaactgct      300

cttctcacgg gctgtcgcct tcaatctgga cgtgatgggt gccttgcgca aggagggcga      360

gccaggcagc ctcttcggct tctctgtggc cctgcaccgg cagttgcagc cccgacccca      420

gagctggctg ctggtgggtg ctccccaggc cctggctctt cctgggcagc aggcgaatcg      480

cactggaggc ctcttcgctt gcccgttgag cctggaggag actgactgct acagagtgga      540

catcgaccag ggagctgata tgcaaaagga aagcaaggag aaccagtggt tgggagtcag      600

tgttcggagc caggggcctg ggggcaagat tgttacctgt gcacaccgat atgaggcaag      660

gcagcgagtg gaccagatcc tggagacgcg ggatatgatt ggtcgctgct ttgtgctcag      720

ccaggacctg gccatccggg atgagttgga tggtggggaa tggaagttct gtgagggacg      780

cccccaaggc catgaacaat ttgggttctg ccagcagggc acagctgccg ccttctcccc      840

tgatagccac tacctcctct ttggggcccc aggaacctat aattggaagg gcacggccag      900

ggtggagctc tgtgcacagg gctcagcgga cctggcacac ctggacgacg gtccctacga      960

ggcgggggga gagaaggagc aggacccccg cctcatcccg gtccctgcca acagctactt     1020

tggcttctct attgactcgg ggaaaggtct ggtgcgtgca gaagagctga gctttgtggc     1080

tggagccccc cgcgccaacc acaagggtgc tgtggtcatc ctgcgcaagg acagcgccag     1140

tcgcctggtg cccgaggtta tgctgtctgg ggagcgcctg acctccggct ttggctactc     1200

actggctgtg gctgacctca acagtgatgg ctggccagac ctgatagtgg gtgcccccta     1260

cttctttgag cgccaagaag agctgggggg tgctgtgtat gtgtacttga accagggggg     1320

tcactgggct gggatctccc ctctccggct ctgcggctcc cctgactcca tgttcgggat     1380

cagcctggct gtcctggggg acctcaacca agatggcttt ccagatattg cagtgggtgc     1440

cccctttgat ggtgatggga aagtcttcat ctaccatggg agcagcctgg gggttgtcgc     1500

caaaccttca caggtgctgg agggcgaggc tgtgggcatc aagagcttcg gctactccct     1560

gtcaggcagc ttggatatgg atgggaacca ataccctgac ctgctggtgg gctccctggc     1620

tgacaccgca gtgctcttca gggccagacc catcctccat gtctcccatg aggtctctat     1680

tgctccacga agcatcgacc tggagcagcc caactgtgct ggcggccact cggtctgtgt     1740

ggacctaagg gtctgtttca gctacattgc agtccccagc agctatagcc ctactgtggc     1800

cctggactat gtgttagatg cggacacaga ccggaggctc cggggccagg ttccccgtgt     1860

gacgttcctg agccgtaacc tggaagaacc caagcaccag gcctcgggca ccgtgtggct     1920

gaagcaccag catgaccgag tctgtggaga cgccatgttc cagctccagg aaaatgtcaa     1980

agacaagctt cgggccattg tagtgacctt gtcctacagt ctccagaccc ctcggctccg     2040

gcgacaggct cctggccagg ggctgcctcc agtggccccc atcctcaatg cccaccagcc     2100

cagcacccag cgggcagaga tccacttcct gaagcaaggc tgtggtgaag acaagatctg     2160

ccagagcaat ctgcagctgg tccgcgcccg cttctgtacc cgggtcagcg acacggaatt     2220

ccaacctctg cccatggatg tggatggaac aacagccctg tttgcactga gtgggcagcc     2280

agtcattggc ctggagctga tggtcaccaa cctgccatcg gacccagccc agccccaggc     2340

tgatggggat gatgcccatg aagcccagct cctggtcatg cttcctgact cactgcacta     2400

ctcaggggtc cgggccctgg accctgcgga gaagccactc tgcctgtcca atgagaatgc     2460

ctcccatgtt gagtgtgagc tggggaaccc catgaagaga ggtgcccagg tcaccttcta     2520

cctcatcctt agcacctccg ggatcagcat tgagaccacg gaactggagg tagagctgct     2580

gttggccacg atcagtgagc aggagctgca tccagtctct gcacgagccc gtgtcttcat     2640

tgagctgcca ctgtccattg caggaatggc cattccccag caactcttct tctctggtgt     2700

ggtgaggggc gagagagcca tgcagtctga gcgggatgtg ggcagcaagg tcaagtatga     2760

ggtcacggtt tccaaccaag gccagtcgct cagaaccctg ggctctgcct tcctcaacat     2820

catgtggcct catgagattg ccaatgggaa gtggttgctg tacccaatgc aggttgagct     2880

ggagggcggg caggggcctg ggcagaaagg gctttgctct cccaggccca acatcctcca     2940

cctggatgtg gacagtaggg ataggaggcg gcgggagctg gagccacctg agcagcagga     3000

gcctggtgag cggcaggagc ccagcatgtc ctggtggcca gtgtcctctg ctgagaagaa     3060

gaaaaacatc accctggact gcgcccgggg cacggccaac tgtgtggtgt tcagctgccc     3120

actctacagc tttgaccgcg cggctgtgct gcatgtctgg ggccgtctct ggaacagcac     3180

ctttctggag gagtactcag ctgtgaagtc cctggaagtg attgtccggg ccaacatcac     3240

agtgaagtcc tccataaaga acttgatgct ccgagatgcc tccacagtga tcccagtgat     3300

ggtatacttg gaccccatgg ctgtggtggc agaaggagtg ccctggtggg tcatcctcct     3360

ggctgtactg gctgggctgc tggtgctagc actgctggtg ctgctcctgt ggaagatggg     3420

attcttcaaa cgggcgaagc accccgaggc caccgtgccc cagtaccatg cggtgaagat     3480

tcctcgggaa gaccgacagc agttcaagga ggagaagacg ggcaccatcc tgaggaacaa     3540

ctggggcagc ccccggcggg agggcccgga tgcacacccc atcctggctg ctgacgggca     3600

tcccgagctg ggccccgatg ggcatccagg gccaggcacc gcctaggttc ccatgtccca     3660

gcctggcctg tggctgccct ccatcccttc cccagagatg gctccttggg atgaagaggg     3720

tagagtgggc tgctggtgtc gcatcaagat ttggcaggat cggcttcctc aggggcacag     3780

acctctccca cccacaagaa ctcctcccac ccaacttccc cttagagtgc tgtgagatga     3840

gagtgggtaa atcagggaca gggccatggg gtagggtgag aagggcaggg gtgtcctgat     3900

gcaaaggtgg ggagaaggga tcctaatccc ttcctctccc attcaccctg tgtaacagga     3960

ccccaaggac ctgcctcccc ggaagtgcct taacctagag ggtcggggag gaggttgtgt     4020

cactgactca ggctgctcct tctctagttt cccctctcat ctgaccttag tttgctgcca     4080

tcagtctagt ggtttcgtgg tttcgtctat ttattaaaaa atatttgaga acaaaaaaaa     4140

aaaaaaaaaa                                                            4150


<210>  299
<211>  4138
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha 7 (ITGA7), transcript variant 2, 
       mRNA GenBank Accession No. NM_002206.2  GI:222418610

<400>  299
gggcgccgga gctgcggctg ctgtagttgt cctagccggt gctggggcgg cggggtggcg       60

gagcggcggg cgggcgggag ggctggcggg gcgaacgtct gggagacgtc tgaaagacca      120

acgagacttt ggagaccaga gacgcgcctg gggggacctg gggcttgggg cgtgcgagat      180

ttcccttgca ttcgctggga gctcgcgcag ggatcgtccc atggccgggg ctcggagccg      240

cgacccttgg ggggcctccg ggatttgcta cctttttggc tccctgctcg tcgaactgct      300

cttctcacgg gctgtcgcct tcaatctgga cgtgatgggt gccttgcgca aggagggcga      360

gccaggcagc ctcttcggct tctctgtggc cctgcaccgg cagttgcagc cccgacccca      420

gagctggctg ctggtgggtg ctccccaggc cctggctctt cctgggcagc aggcgaatcg      480

cactggaggc ctcttcgctt gcccgttgag cctggaggag actgactgct acagagtgga      540

catcgaccag ggagctgata tgcaaaagga aagcaaggag aaccagtggt tgggagtcag      600

tgttcggagc caggggcctg ggggcaagat tgttacctgt gcacaccgat atgaggcaag      660

gcagcgagtg gaccagatcc tggagacgcg ggatatgatt ggtcgctgct ttgtgctcag      720

ccaggacctg gccatccggg atgagttgga tggtggggaa tggaagttct gtgagggacg      780

cccccaaggc catgaacaat ttgggttctg ccagcagggc acagctgccg ccttctcccc      840

tgatagccac tacctcctct ttggggcccc aggaacctat aattggaagg ggttgctttt      900

tgtgaccaac attgatagct cagaccccga ccagctggtg tataaaactt tggaccctgc      960

tgaccggctc ccaggaccag ccggagactt ggccctcaat agctacttag gcttctctat     1020

tgactcgggg aaaggtctgg tgcgtgcaga agagctgagc tttgtggctg gagccccccg     1080

cgccaaccac aagggtgctg tggtcatcct gcgcaaggac agcgccagtc gcctggtgcc     1140

cgaggttatg ctgtctgggg agcgcctgac ctccggcttt ggctactcac tggctgtggc     1200

tgacctcaac agtgatggct ggccagacct gatagtgggt gccccctact tctttgagcg     1260

ccaagaagag ctggggggtg ctgtgtatgt gtacttgaac caggggggtc actgggctgg     1320

gatctcccct ctccggctct gcggctcccc tgactccatg ttcgggatca gcctggctgt     1380

cctgggggac ctcaaccaag atggctttcc agatattgca gtgggtgccc cctttgatgg     1440

tgatgggaaa gtcttcatct accatgggag cagcctgggg gttgtcgcca aaccttcaca     1500

ggtgctggag ggcgaggctg tgggcatcaa gagcttcggc tactccctgt caggcagctt     1560

ggatatggat gggaaccaat accctgacct gctggtgggc tccctggctg acaccgcagt     1620

gctcttcagg gccagaccca tcctccatgt ctcccatgag gtctctattg ctccacgaag     1680

catcgacctg gagcagccca actgtgctgg cggccactcg gtctgtgtgg acctaagggt     1740

ctgtttcagc tacattgcag tccccagcag ctatagccct actgtggccc tggactatgt     1800

gttagatgcg gacacagacc ggaggctccg gggccaggtt ccccgtgtga cgttcctgag     1860

ccgtaacctg gaagaaccca agcaccaggc ctcgggcacc gtgtggctga agcaccagca     1920

tgaccgagtc tgtggagacg ccatgttcca gctccaggaa aatgtcaaag acaagcttcg     1980

ggccattgta gtgaccttgt cctacagtct ccagacccct cggctccggc gacaggctcc     2040

tggccagggg ctgcctccag tggcccccat cctcaatgcc caccagccca gcacccagcg     2100

ggcagagatc cacttcctga agcaaggctg tggtgaagac aagatctgcc agagcaatct     2160

gcagctggtc cgcgcccgct tctgtacccg ggtcagcgac acggaattcc aacctctgcc     2220

catggatgtg gatggaacaa cagccctgtt tgcactgagt gggcagccag tcattggcct     2280

ggagctgatg gtcaccaacc tgccatcgga cccagcccag ccccaggctg atggggatga     2340

tgcccatgaa gcccagctcc tggtcatgct tcctgactca ctgcactact caggggtccg     2400

ggccctggac cctgcggaga agccactctg cctgtccaat gagaatgcct cccatgttga     2460

gtgtgagctg gggaacccca tgaagagagg tgcccaggtc accttctacc tcatccttag     2520

cacctccggg atcagcattg agaccacgga actggaggta gagctgctgt tggccacgat     2580

cagtgagcag gagctgcatc cagtctctgc acgagcccgt gtcttcattg agctgccact     2640

gtccattgca ggaatggcca ttccccagca actcttcttc tctggtgtgg tgaggggcga     2700

gagagccatg cagtctgagc gggatgtggg cagcaaggtc aagtatgagg tcacggtttc     2760

caaccaaggc cagtcgctca gaaccctggg ctctgccttc ctcaacatca tgtggcctca     2820

tgagattgcc aatgggaagt ggttgctgta cccaatgcag gttgagctgg agggcgggca     2880

ggggcctggg cagaaagggc tttgctctcc caggcccaac atcctccacc tggatgtgga     2940

cagtagggat aggaggcggc gggagctgga gccacctgag cagcaggagc ctggtgagcg     3000

gcaggagccc agcatgtcct ggtggccagt gtcctctgct gagaagaaga aaaacatcac     3060

cctggactgc gcccggggca cggccaactg tgtggtgttc agctgcccac tctacagctt     3120

tgaccgcgcg gctgtgctgc atgtctgggg ccgtctctgg aacagcacct ttctggagga     3180

gtactcagct gtgaagtccc tggaagtgat tgtccgggcc aacatcacag tgaagtcctc     3240

cataaagaac ttgatgctcc gagatgcctc cacagtgatc ccagtgatgg tatacttgga     3300

ccccatggct gtggtggcag aaggagtgcc ctggtgggtc atcctcctgg ctgtactggc     3360

tgggctgctg gtgctagcac tgctggtgct gctcctgtgg aagatgggat tcttcaaacg     3420

ggcgaagcac cccgaggcca ccgtgcccca gtaccatgcg gtgaagattc ctcgggaaga     3480

ccgacagcag ttcaaggagg agaagacggg caccatcctg aggaacaact ggggcagccc     3540

ccggcgggag ggcccggatg cacaccccat cctggctgct gacgggcatc ccgagctggg     3600

ccccgatggg catccagggc caggcaccgc ctaggttccc atgtcccagc ctggcctgtg     3660

gctgccctcc atcccttccc cagagatggc tccttgggat gaagagggta gagtgggctg     3720

ctggtgtcgc atcaagattt ggcaggatcg gcttcctcag gggcacagac ctctcccacc     3780

cacaagaact cctcccaccc aacttcccct tagagtgctg tgagatgaga gtgggtaaat     3840

cagggacagg gccatggggt agggtgagaa gggcaggggt gtcctgatgc aaaggtgggg     3900

agaagggatc ctaatccctt cctctcccat tcaccctgtg taacaggacc ccaaggacct     3960

gcctccccgg aagtgcctta acctagaggg tcggggagga ggttgtgtca ctgactcagg     4020

ctgctccttc tctagtttcc cctctcatct gaccttagtt tgctgccatc agtctagtgg     4080

tttcgtggtt tcgtctattt attaaaaaat atttgagaac aaaaaaaaaa aaaaaaaa       4138


<210>  300
<211>  1137
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha 7 (ITGA7), transcript variant 2, 
       polypeptide GenBank Accession No. NP_002197.2 GI:222418611

<400>  300

Met Ala Gly Ala Arg Ser Arg Asp Pro Trp Gly Ala Ser Gly Ile Cys 
1               5                   10                  15      


Tyr Leu Phe Gly Ser Leu Leu Val Glu Leu Leu Phe Ser Arg Ala Val 
            20                  25                  30          


Ala Phe Asn Leu Asp Val Met Gly Ala Leu Arg Lys Glu Gly Glu Pro 
        35                  40                  45              


Gly Ser Leu Phe Gly Phe Ser Val Ala Leu His Arg Gln Leu Gln Pro 
    50                  55                  60                  


Arg Pro Gln Ser Trp Leu Leu Val Gly Ala Pro Gln Ala Leu Ala Leu 
65                  70                  75                  80  


Pro Gly Gln Gln Ala Asn Arg Thr Gly Gly Leu Phe Ala Cys Pro Leu 
                85                  90                  95      


Ser Leu Glu Glu Thr Asp Cys Tyr Arg Val Asp Ile Asp Gln Gly Ala 
            100                 105                 110         


Asp Met Gln Lys Glu Ser Lys Glu Asn Gln Trp Leu Gly Val Ser Val 
        115                 120                 125             


Arg Ser Gln Gly Pro Gly Gly Lys Ile Val Thr Cys Ala His Arg Tyr 
    130                 135                 140                 


Glu Ala Arg Gln Arg Val Asp Gln Ile Leu Glu Thr Arg Asp Met Ile 
145                 150                 155                 160 


Gly Arg Cys Phe Val Leu Ser Gln Asp Leu Ala Ile Arg Asp Glu Leu 
                165                 170                 175     


Asp Gly Gly Glu Trp Lys Phe Cys Glu Gly Arg Pro Gln Gly His Glu 
            180                 185                 190         


Gln Phe Gly Phe Cys Gln Gln Gly Thr Ala Ala Ala Phe Ser Pro Asp 
        195                 200                 205             


Ser His Tyr Leu Leu Phe Gly Ala Pro Gly Thr Tyr Asn Trp Lys Gly 
    210                 215                 220                 


Leu Leu Phe Val Thr Asn Ile Asp Ser Ser Asp Pro Asp Gln Leu Val 
225                 230                 235                 240 


Tyr Lys Thr Leu Asp Pro Ala Asp Arg Leu Pro Gly Pro Ala Gly Asp 
                245                 250                 255     


Leu Ala Leu Asn Ser Tyr Leu Gly Phe Ser Ile Asp Ser Gly Lys Gly 
            260                 265                 270         


Leu Val Arg Ala Glu Glu Leu Ser Phe Val Ala Gly Ala Pro Arg Ala 
        275                 280                 285             


Asn His Lys Gly Ala Val Val Ile Leu Arg Lys Asp Ser Ala Ser Arg 
    290                 295                 300                 


Leu Val Pro Glu Val Met Leu Ser Gly Glu Arg Leu Thr Ser Gly Phe 
305                 310                 315                 320 


Gly Tyr Ser Leu Ala Val Ala Asp Leu Asn Ser Asp Gly Trp Pro Asp 
                325                 330                 335     


Leu Ile Val Gly Ala Pro Tyr Phe Phe Glu Arg Gln Glu Glu Leu Gly 
            340                 345                 350         


Gly Ala Val Tyr Val Tyr Leu Asn Gln Gly Gly His Trp Ala Gly Ile 
        355                 360                 365             


Ser Pro Leu Arg Leu Cys Gly Ser Pro Asp Ser Met Phe Gly Ile Ser 
    370                 375                 380                 


Leu Ala Val Leu Gly Asp Leu Asn Gln Asp Gly Phe Pro Asp Ile Ala 
385                 390                 395                 400 


Val Gly Ala Pro Phe Asp Gly Asp Gly Lys Val Phe Ile Tyr His Gly 
                405                 410                 415     


Ser Ser Leu Gly Val Val Ala Lys Pro Ser Gln Val Leu Glu Gly Glu 
            420                 425                 430         


Ala Val Gly Ile Lys Ser Phe Gly Tyr Ser Leu Ser Gly Ser Leu Asp 
        435                 440                 445             


Met Asp Gly Asn Gln Tyr Pro Asp Leu Leu Val Gly Ser Leu Ala Asp 
    450                 455                 460                 


Thr Ala Val Leu Phe Arg Ala Arg Pro Ile Leu His Val Ser His Glu 
465                 470                 475                 480 


Val Ser Ile Ala Pro Arg Ser Ile Asp Leu Glu Gln Pro Asn Cys Ala 
                485                 490                 495     


Gly Gly His Ser Val Cys Val Asp Leu Arg Val Cys Phe Ser Tyr Ile 
            500                 505                 510         


Ala Val Pro Ser Ser Tyr Ser Pro Thr Val Ala Leu Asp Tyr Val Leu 
        515                 520                 525             


Asp Ala Asp Thr Asp Arg Arg Leu Arg Gly Gln Val Pro Arg Val Thr 
    530                 535                 540                 


Phe Leu Ser Arg Asn Leu Glu Glu Pro Lys His Gln Ala Ser Gly Thr 
545                 550                 555                 560 


Val Trp Leu Lys His Gln His Asp Arg Val Cys Gly Asp Ala Met Phe 
                565                 570                 575     


Gln Leu Gln Glu Asn Val Lys Asp Lys Leu Arg Ala Ile Val Val Thr 
            580                 585                 590         


Leu Ser Tyr Ser Leu Gln Thr Pro Arg Leu Arg Arg Gln Ala Pro Gly 
        595                 600                 605             


Gln Gly Leu Pro Pro Val Ala Pro Ile Leu Asn Ala His Gln Pro Ser 
    610                 615                 620                 


Thr Gln Arg Ala Glu Ile His Phe Leu Lys Gln Gly Cys Gly Glu Asp 
625                 630                 635                 640 


Lys Ile Cys Gln Ser Asn Leu Gln Leu Val Arg Ala Arg Phe Cys Thr 
                645                 650                 655     


Arg Val Ser Asp Thr Glu Phe Gln Pro Leu Pro Met Asp Val Asp Gly 
            660                 665                 670         


Thr Thr Ala Leu Phe Ala Leu Ser Gly Gln Pro Val Ile Gly Leu Glu 
        675                 680                 685             


Leu Met Val Thr Asn Leu Pro Ser Asp Pro Ala Gln Pro Gln Ala Asp 
    690                 695                 700                 


Gly Asp Asp Ala His Glu Ala Gln Leu Leu Val Met Leu Pro Asp Ser 
705                 710                 715                 720 


Leu His Tyr Ser Gly Val Arg Ala Leu Asp Pro Ala Glu Lys Pro Leu 
                725                 730                 735     


Cys Leu Ser Asn Glu Asn Ala Ser His Val Glu Cys Glu Leu Gly Asn 
            740                 745                 750         


Pro Met Lys Arg Gly Ala Gln Val Thr Phe Tyr Leu Ile Leu Ser Thr 
        755                 760                 765             


Ser Gly Ile Ser Ile Glu Thr Thr Glu Leu Glu Val Glu Leu Leu Leu 
    770                 775                 780                 


Ala Thr Ile Ser Glu Gln Glu Leu His Pro Val Ser Ala Arg Ala Arg 
785                 790                 795                 800 


Val Phe Ile Glu Leu Pro Leu Ser Ile Ala Gly Met Ala Ile Pro Gln 
                805                 810                 815     


Gln Leu Phe Phe Ser Gly Val Val Arg Gly Glu Arg Ala Met Gln Ser 
            820                 825                 830         


Glu Arg Asp Val Gly Ser Lys Val Lys Tyr Glu Val Thr Val Ser Asn 
        835                 840                 845             


Gln Gly Gln Ser Leu Arg Thr Leu Gly Ser Ala Phe Leu Asn Ile Met 
    850                 855                 860                 


Trp Pro His Glu Ile Ala Asn Gly Lys Trp Leu Leu Tyr Pro Met Gln 
865                 870                 875                 880 


Val Glu Leu Glu Gly Gly Gln Gly Pro Gly Gln Lys Gly Leu Cys Ser 
                885                 890                 895     


Pro Arg Pro Asn Ile Leu His Leu Asp Val Asp Ser Arg Asp Arg Arg 
            900                 905                 910         


Arg Arg Glu Leu Glu Pro Pro Glu Gln Gln Glu Pro Gly Glu Arg Gln 
        915                 920                 925             


Glu Pro Ser Met Ser Trp Trp Pro Val Ser Ser Ala Glu Lys Lys Lys 
    930                 935                 940                 


Asn Ile Thr Leu Asp Cys Ala Arg Gly Thr Ala Asn Cys Val Val Phe 
945                 950                 955                 960 


Ser Cys Pro Leu Tyr Ser Phe Asp Arg Ala Ala Val Leu His Val Trp 
                965                 970                 975     


Gly Arg Leu Trp Asn Ser Thr Phe Leu Glu Glu Tyr Ser Ala Val Lys 
            980                 985                 990         


Ser Leu Glu Val Ile Val Arg Ala  Asn Ile Thr Val Lys  Ser Ser Ile 
        995                 1000                 1005             


Lys Asn  Leu Met Leu Arg Asp  Ala Ser Thr Val Ile  Pro Val Met 
    1010                 1015                 1020             


Val Tyr  Leu Asp Pro Met Ala  Val Val Ala Glu Gly  Val Pro Trp 
    1025                 1030                 1035             


Trp Val  Ile Leu Leu Ala Val  Leu Ala Gly Leu Leu  Val Leu Ala 
    1040                 1045                 1050             


Leu Leu  Val Leu Leu Leu Trp  Lys Met Gly Phe Phe  Lys Arg Ala 
    1055                 1060                 1065             


Lys His  Pro Glu Ala Thr Val  Pro Gln Tyr His Ala  Val Lys Ile 
    1070                 1075                 1080             


Pro Arg  Glu Asp Arg Gln Gln  Phe Lys Glu Glu Lys  Thr Gly Thr 
    1085                 1090                 1095             


Ile Leu  Arg Asn Asn Trp Gly  Ser Pro Arg Arg Glu  Gly Pro Asp 
    1100                 1105                 1110             


Ala His  Pro Ile Leu Ala Ala  Asp Gly His Pro Glu  Leu Gly Pro 
    1115                 1120                 1125             


Asp Gly  His Pro Gly Pro Gly  Thr Ala 
    1130                 1135         


<210>  301
<211>  3675
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha 7 (ITGA7), transcript variant 3, 
       mRNA GenBank Accession No. NM_001144997.1 GI:222418614

<400>  301
agtgctcaaa gaaagggggc ccttgagaca gtccaaatgg ctccctttgc cactcccatg       60

gttcaagctt tgactacaac cagaattcag aggcaggcag aaggattcca gtgctggaga      120

gaatgtggaa caaggagatc tccatttgag ggcaaggaaa cctgtgcaca ccgatatgag      180

gcaaggcagc gagtggacca gatcctggag acgcgggata tgattggtcg ctgctttgtg      240

ctcagccagg acctggccat ccgggatgag ttggatggtg gggaatggaa gttctgtgag      300

ggacgccccc aaggccatga acaatttggg ttctgccagc agggcacagc tgccgccttc      360

tcccctgata gccactacct cctctttggg gccccaggaa cctataattg gaagggcacg      420

gccagggtgg agctctgtgc acagggctca gcggacctgg cacacctgga cgacggtccc      480

tacgaggcgg ggggagagaa ggagcaggac ccccgcctca tcccggtccc tgccaacagc      540

tactttggct tctctattga ctcggggaaa ggtctggtgc gtgcagaaga gctgagcttt      600

gtggctggag ccccccgcgc caaccacaag ggtgctgtgg tcatcctgcg caaggacagc      660

gccagtcgcc tggtgcccga ggttatgctg tctggggagc gcctgacctc cggctttggc      720

tactcactgg ctgtggctga cctcaacagt gatggctggc cagacctgat agtgggtgcc      780

ccctacttct ttgagcgcca agaagagctg gggggtgctg tgtatgtgta cttgaaccag      840

gggggtcact gggctgggat ctcccctctc cggctctgcg gctcccctga ctccatgttc      900

gggatcagcc tggctgtcct gggggacctc aaccaagatg gctttccaga tattgcagtg      960

ggtgccccct ttgatggtga tgggaaagtc ttcatctacc atgggagcag cctgggggtt     1020

gtcgccaaac cttcacaggt gctggagggc gaggctgtgg gcatcaagag cttcggctac     1080

tccctgtcag gcagcttgga tatggatggg aaccaatacc ctgacctgct ggtgggctcc     1140

ctggctgaca ccgcagtgct cttcagggcc agacccatcc tccatgtctc ccatgaggtc     1200

tctattgctc cacgaagcat cgacctggag cagcccaact gtgctggcgg ccactcggtc     1260

tgtgtggacc taagggtctg tttcagctac attgcagtcc ccagcagcta tagccctact     1320

gtggccctgg actatgtgtt agatgcggac acagaccgga ggctccgggg ccaggttccc     1380

cgtgtgacgt tcctgagccg taacctggaa gaacccaagc accaggcctc gggcaccgtg     1440

tggctgaagc accagcatga ccgagtctgt ggagacgcca tgttccagct ccaggaaaat     1500

gtcaaagaca agcttcgggc cattgtagtg accttgtcct acagtctcca gacccctcgg     1560

ctccggcgac aggctcctgg ccaggggctg cctccagtgg cccccatcct caatgcccac     1620

cagcccagca cccagcgggc agagatccac ttcctgaagc aaggctgtgg tgaagacaag     1680

atctgccaga gcaatctgca gctggtccgc gcccgcttct gtacccgggt cagcgacacg     1740

gaattccaac ctctgcccat ggatgtggat ggaacaacag ccctgtttgc actgagtggg     1800

cagccagtca ttggcctgga gctgatggtc accaacctgc catcggaccc agcccagccc     1860

caggctgatg gggatgatgc ccatgaagcc cagctcctgg tcatgcttcc tgactcactg     1920

cactactcag gggtccgggc cctggaccct gcggagaagc cactctgcct gtccaatgag     1980

aatgcctccc atgttgagtg tgagctgggg aaccccatga agagaggtgc ccaggtcacc     2040

ttctacctca tccttagcac ctccgggatc agcattgaga ccacggaact ggaggtagag     2100

ctgctgttgg ccacgatcag tgagcaggag ctgcatccag tctctgcacg agcccgtgtc     2160

ttcattgagc tgccactgtc cattgcagga atggccattc cccagcaact cttcttctct     2220

ggtgtggtga ggggcgagag agccatgcag tctgagcggg atgtgggcag caaggtcaag     2280

tatgaggtca cggtttccaa ccaaggccag tcgctcagaa ccctgggctc tgccttcctc     2340

aacatcatgt ggcctcatga gattgccaat gggaagtggt tgctgtaccc aatgcaggtt     2400

gagctggagg gcgggcaggg gcctgggcag aaagggcttt gctctcccag gcccaacatc     2460

ctccacctgg atgtggacag tagggatagg aggcggcggg agctggagcc acctgagcag     2520

caggagcctg gtgagcggca ggagcccagc atgtcctggt ggccagtgtc ctctgctgag     2580

aagaagaaaa acatcaccct ggactgcgcc cggggcacgg ccaactgtgt ggtgttcagc     2640

tgcccactct acagctttga ccgcgcggct gtgctgcatg tctggggccg tctctggaac     2700

agcacctttc tggaggagta ctcagctgtg aagtccctgg aagtgattgt ccgggccaac     2760

atcacagtga agtcctccat aaagaacttg atgctccgag atgcctccac agtgatccca     2820

gtgatggtat acttggaccc catggctgtg gtggcagaag gagtgccctg gtgggtcatc     2880

ctcctggctg tactggctgg gctgctggtg ctagcactgc tggtgctgct cctgtggaag     2940

atgggattct tcaaacgggc gaagcacccc gaggccaccg tgccccagta ccatgcggtg     3000

aagattcctc gggaagaccg acagcagttc aaggaggaga agacgggcac catcctgagg     3060

aacaactggg gcagcccccg gcgggagggc ccggatgcac accccatcct ggctgctgac     3120

gggcatcccg agctgggccc cgatgggcat ccagggccag gcaccgccta ggttcccatg     3180

tcccagcctg gcctgtggct gccctccatc ccttccccag agatggctcc ttgggatgaa     3240

gagggtagag tgggctgctg gtgtcgcatc aagatttggc aggatcggct tcctcagggg     3300

cacagacctc tcccacccac aagaactcct cccacccaac ttccccttag agtgctgtga     3360

gatgagagtg ggtaaatcag ggacagggcc atggggtagg gtgagaaggg caggggtgtc     3420

ctgatgcaaa ggtggggaga agggatccta atcccttcct ctcccattca ccctgtgtaa     3480

caggacccca aggacctgcc tccccggaag tgccttaacc tagagggtcg gggaggaggt     3540

tgtgtcactg actcaggctg ctccttctct agtttcccct ctcatctgac cttagtttgc     3600

tgccatcagt ctagtggttt cgtggtttcg tctatttatt aaaaaatatt tgagaacaaa     3660

aaaaaaaaaa aaaaa                                                      3675


<210>  302
<211>  1044
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, alpha 7 (ITGA7), transcript variant 3, 
       polypeptide GenBank Accession No. NP_001138469.1 GI:222418615

<400>  302

Met Ala Pro Phe Ala Thr Pro Met Val Gln Ala Leu Thr Thr Thr Arg 
1               5                   10                  15      


Ile Gln Arg Gln Ala Glu Gly Phe Gln Cys Trp Arg Glu Cys Gly Thr 
            20                  25                  30          


Arg Arg Ser Pro Phe Glu Gly Lys Glu Thr Cys Ala His Arg Tyr Glu 
        35                  40                  45              


Ala Arg Gln Arg Val Asp Gln Ile Leu Glu Thr Arg Asp Met Ile Gly 
    50                  55                  60                  


Arg Cys Phe Val Leu Ser Gln Asp Leu Ala Ile Arg Asp Glu Leu Asp 
65                  70                  75                  80  


Gly Gly Glu Trp Lys Phe Cys Glu Gly Arg Pro Gln Gly His Glu Gln 
                85                  90                  95      


Phe Gly Phe Cys Gln Gln Gly Thr Ala Ala Ala Phe Ser Pro Asp Ser 
            100                 105                 110         


His Tyr Leu Leu Phe Gly Ala Pro Gly Thr Tyr Asn Trp Lys Gly Thr 
        115                 120                 125             


Ala Arg Val Glu Leu Cys Ala Gln Gly Ser Ala Asp Leu Ala His Leu 
    130                 135                 140                 


Asp Asp Gly Pro Tyr Glu Ala Gly Gly Glu Lys Glu Gln Asp Pro Arg 
145                 150                 155                 160 


Leu Ile Pro Val Pro Ala Asn Ser Tyr Phe Gly Phe Ser Ile Asp Ser 
                165                 170                 175     


Gly Lys Gly Leu Val Arg Ala Glu Glu Leu Ser Phe Val Ala Gly Ala 
            180                 185                 190         


Pro Arg Ala Asn His Lys Gly Ala Val Val Ile Leu Arg Lys Asp Ser 
        195                 200                 205             


Ala Ser Arg Leu Val Pro Glu Val Met Leu Ser Gly Glu Arg Leu Thr 
    210                 215                 220                 


Ser Gly Phe Gly Tyr Ser Leu Ala Val Ala Asp Leu Asn Ser Asp Gly 
225                 230                 235                 240 


Trp Pro Asp Leu Ile Val Gly Ala Pro Tyr Phe Phe Glu Arg Gln Glu 
                245                 250                 255     


Glu Leu Gly Gly Ala Val Tyr Val Tyr Leu Asn Gln Gly Gly His Trp 
            260                 265                 270         


Ala Gly Ile Ser Pro Leu Arg Leu Cys Gly Ser Pro Asp Ser Met Phe 
        275                 280                 285             


Gly Ile Ser Leu Ala Val Leu Gly Asp Leu Asn Gln Asp Gly Phe Pro 
    290                 295                 300                 


Asp Ile Ala Val Gly Ala Pro Phe Asp Gly Asp Gly Lys Val Phe Ile 
305                 310                 315                 320 


Tyr His Gly Ser Ser Leu Gly Val Val Ala Lys Pro Ser Gln Val Leu 
                325                 330                 335     


Glu Gly Glu Ala Val Gly Ile Lys Ser Phe Gly Tyr Ser Leu Ser Gly 
            340                 345                 350         


Ser Leu Asp Met Asp Gly Asn Gln Tyr Pro Asp Leu Leu Val Gly Ser 
        355                 360                 365             


Leu Ala Asp Thr Ala Val Leu Phe Arg Ala Arg Pro Ile Leu His Val 
    370                 375                 380                 


Ser His Glu Val Ser Ile Ala Pro Arg Ser Ile Asp Leu Glu Gln Pro 
385                 390                 395                 400 


Asn Cys Ala Gly Gly His Ser Val Cys Val Asp Leu Arg Val Cys Phe 
                405                 410                 415     


Ser Tyr Ile Ala Val Pro Ser Ser Tyr Ser Pro Thr Val Ala Leu Asp 
            420                 425                 430         


Tyr Val Leu Asp Ala Asp Thr Asp Arg Arg Leu Arg Gly Gln Val Pro 
        435                 440                 445             


Arg Val Thr Phe Leu Ser Arg Asn Leu Glu Glu Pro Lys His Gln Ala 
    450                 455                 460                 


Ser Gly Thr Val Trp Leu Lys His Gln His Asp Arg Val Cys Gly Asp 
465                 470                 475                 480 


Ala Met Phe Gln Leu Gln Glu Asn Val Lys Asp Lys Leu Arg Ala Ile 
                485                 490                 495     


Val Val Thr Leu Ser Tyr Ser Leu Gln Thr Pro Arg Leu Arg Arg Gln 
            500                 505                 510         


Ala Pro Gly Gln Gly Leu Pro Pro Val Ala Pro Ile Leu Asn Ala His 
        515                 520                 525             


Gln Pro Ser Thr Gln Arg Ala Glu Ile His Phe Leu Lys Gln Gly Cys 
    530                 535                 540                 


Gly Glu Asp Lys Ile Cys Gln Ser Asn Leu Gln Leu Val Arg Ala Arg 
545                 550                 555                 560 


Phe Cys Thr Arg Val Ser Asp Thr Glu Phe Gln Pro Leu Pro Met Asp 
                565                 570                 575     


Val Asp Gly Thr Thr Ala Leu Phe Ala Leu Ser Gly Gln Pro Val Ile 
            580                 585                 590         


Gly Leu Glu Leu Met Val Thr Asn Leu Pro Ser Asp Pro Ala Gln Pro 
        595                 600                 605             


Gln Ala Asp Gly Asp Asp Ala His Glu Ala Gln Leu Leu Val Met Leu 
    610                 615                 620                 


Pro Asp Ser Leu His Tyr Ser Gly Val Arg Ala Leu Asp Pro Ala Glu 
625                 630                 635                 640 


Lys Pro Leu Cys Leu Ser Asn Glu Asn Ala Ser His Val Glu Cys Glu 
                645                 650                 655     


Leu Gly Asn Pro Met Lys Arg Gly Ala Gln Val Thr Phe Tyr Leu Ile 
            660                 665                 670         


Leu Ser Thr Ser Gly Ile Ser Ile Glu Thr Thr Glu Leu Glu Val Glu 
        675                 680                 685             


Leu Leu Leu Ala Thr Ile Ser Glu Gln Glu Leu His Pro Val Ser Ala 
    690                 695                 700                 


Arg Ala Arg Val Phe Ile Glu Leu Pro Leu Ser Ile Ala Gly Met Ala 
705                 710                 715                 720 


Ile Pro Gln Gln Leu Phe Phe Ser Gly Val Val Arg Gly Glu Arg Ala 
                725                 730                 735     


Met Gln Ser Glu Arg Asp Val Gly Ser Lys Val Lys Tyr Glu Val Thr 
            740                 745                 750         


Val Ser Asn Gln Gly Gln Ser Leu Arg Thr Leu Gly Ser Ala Phe Leu 
        755                 760                 765             


Asn Ile Met Trp Pro His Glu Ile Ala Asn Gly Lys Trp Leu Leu Tyr 
    770                 775                 780                 


Pro Met Gln Val Glu Leu Glu Gly Gly Gln Gly Pro Gly Gln Lys Gly 
785                 790                 795                 800 


Leu Cys Ser Pro Arg Pro Asn Ile Leu His Leu Asp Val Asp Ser Arg 
                805                 810                 815     


Asp Arg Arg Arg Arg Glu Leu Glu Pro Pro Glu Gln Gln Glu Pro Gly 
            820                 825                 830         


Glu Arg Gln Glu Pro Ser Met Ser Trp Trp Pro Val Ser Ser Ala Glu 
        835                 840                 845             


Lys Lys Lys Asn Ile Thr Leu Asp Cys Ala Arg Gly Thr Ala Asn Cys 
    850                 855                 860                 


Val Val Phe Ser Cys Pro Leu Tyr Ser Phe Asp Arg Ala Ala Val Leu 
865                 870                 875                 880 


His Val Trp Gly Arg Leu Trp Asn Ser Thr Phe Leu Glu Glu Tyr Ser 
                885                 890                 895     


Ala Val Lys Ser Leu Glu Val Ile Val Arg Ala Asn Ile Thr Val Lys 
            900                 905                 910         


Ser Ser Ile Lys Asn Leu Met Leu Arg Asp Ala Ser Thr Val Ile Pro 
        915                 920                 925             


Val Met Val Tyr Leu Asp Pro Met Ala Val Val Ala Glu Gly Val Pro 
    930                 935                 940                 


Trp Trp Val Ile Leu Leu Ala Val Leu Ala Gly Leu Leu Val Leu Ala 
945                 950                 955                 960 


Leu Leu Val Leu Leu Leu Trp Lys Met Gly Phe Phe Lys Arg Ala Lys 
                965                 970                 975     


His Pro Glu Ala Thr Val Pro Gln Tyr His Ala Val Lys Ile Pro Arg 
            980                 985                 990         


Glu Asp Arg Gln Gln Phe Lys Glu  Glu Lys Thr Gly Thr  Ile Leu Arg 
        995                 1000                 1005             


Asn Asn  Trp Gly Ser Pro Arg  Arg Glu Gly Pro Asp  Ala His Pro 
    1010                 1015                 1020             


Ile Leu  Ala Ala Asp Gly His  Pro Glu Leu Gly Pro  Asp Gly His 
    1025                 1030                 1035             


Pro Gly  Pro Gly Thr Ala 
    1040                 


<210>  303
<211>  3879
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, beta 1 (fibronectin receptor, beta 
       polypeptide, antigen CD29 includes MDF2, MSK12) (ITGB1), 
       transcript variant 1A, mRNA GenBank Accession No. NM_002211.3
       GI:182519230

<400>  303
atcagacgcg cagaggaggc ggggccgcgg ctggtttcct gccggggggc ggctctgggc       60

cgccgagtcc cctcctcccg cccctgagga ggaggagccg ccgccacccg ccgcgcccga      120

cacccgggag gccccgccag cccgcgggag aggcccagcg ggagtcgcgg aacagcaggc      180

ccgagcccac cgcgccgggc cccggacgcc gcgcggaaaa gatgaattta caaccaattt      240

tctggattgg actgatcagt tcagtttgct gtgtgtttgc tcaaacagat gaaaatagat      300

gtttaaaagc aaatgccaaa tcatgtggag aatgtataca agcagggcca aattgtgggt      360

ggtgcacaaa ttcaacattt ttacaggaag gaatgcctac ttctgcacga tgtgatgatt      420

tagaagcctt aaaaaagaag ggttgccctc cagatgacat agaaaatccc agaggctcca      480

aagatataaa gaaaaataaa aatgtaacca accgtagcaa aggaacagca gagaagctca      540

agccagagga tattactcag atccaaccac agcagttggt tttgcgatta agatcagggg      600

agccacagac atttacatta aaattcaaga gagctgaaga ctatcccatt gacctctact      660

accttatgga cctgtcttac tcaatgaaag acgatttgga gaatgtaaaa agtcttggaa      720

cagatctgat gaatgaaatg aggaggatta cttcggactt cagaattgga tttggctcat      780

ttgtggaaaa gactgtgatg ccttacatta gcacaacacc agctaagctc aggaaccctt      840

gcacaagtga acagaactgc accagcccat ttagctacaa aaatgtgctc agtcttacta      900

ataaaggaga agtatttaat gaacttgttg gaaaacagcg catatctgga aatttggatt      960

ctccagaagg tggtttcgat gccatcatgc aagttgcagt ttgtggatca ctgattggct     1020

ggaggaatgt tacacggctg ctggtgtttt ccacagatgc cgggtttcac tttgctggag     1080

atgggaaact tggtggcatt gttttaccaa atgatggaca atgtcacctg gaaaataata     1140

tgtacacaat gagccattat tatgattatc cttctattgc tcaccttgtc cagaaactga     1200

gtgaaaataa tattcagaca atttttgcag ttactgaaga atttcagcct gtttacaagg     1260

agctgaaaaa cttgatccct aagtcagcag taggaacatt atctgcaaat tctagcaatg     1320

taattcagtt gatcattgat gcatacaatt ccctttcctc agaagtcatt ttggaaaacg     1380

gcaaattgtc agaaggcgta acaataagtt acaaatctta ctgcaagaac ggggtgaatg     1440

gaacagggga aaatggaaga aaatgttcca atatttccat tggagatgag gttcaatttg     1500

aaattagcat aacttcaaat aagtgtccaa aaaaggattc tgacagcttt aaaattaggc     1560

ctctgggctt tacggaggaa gtagaggtta ttcttcagta catctgtgaa tgtgaatgcc     1620

aaagcgaagg catccctgaa agtcccaagt gtcatgaagg aaatgggaca tttgagtgtg     1680

gcgcgtgcag gtgcaatgaa gggcgtgttg gtagacattg tgaatgcagc acagatgaag     1740

ttaacagtga agacatggat gcttactgca ggaaagaaaa cagttcagaa atctgcagta     1800

acaatggaga gtgcgtctgc ggacagtgtg tttgtaggaa gagggataat acaaatgaaa     1860

tttattctgg caaattctgc gagtgtgata atttcaactg tgatagatcc aatggcttaa     1920

tttgtggagg aaatggtgtt tgcaagtgtc gtgtgtgtga gtgcaacccc aactacactg     1980

gcagtgcatg tgactgttct ttggatacta gtacttgtga agccagcaac ggacagatct     2040

gcaatggccg gggcatctgc gagtgtggtg tctgtaagtg tacagatccg aagtttcaag     2100

ggcaaacgtg tgagatgtgt cagacctgcc ttggtgtctg tgctgagcat aaagaatgtg     2160

ttcagtgcag agccttcaat aaaggagaaa agaaagacac atgcacacag gaatgttcct     2220

attttaacat taccaaggta gaaagtcggg acaaattacc ccagccggtc caacctgatc     2280

ctgtgtccca ttgtaaggag aaggatgttg acgactgttg gttctatttt acgtattcag     2340

tgaatgggaa caacgaggtc atggttcatg ttgtggagaa tccagagtgt cccactggtc     2400

cagacatcat tccaattgta gctggtgtgg ttgctggaat tgttcttatt ggccttgcat     2460

tactgctgat atggaagctt ttaatgataa ttcatgacag aagggagttt gctaaatttg     2520

aaaaggagaa aatgaatgcc aaatgggaca cgggtgaaaa tcctatttat aagagtgccg     2580

taacaactgt ggtcaatccg aagtatgagg gaaaatgagt actgcccgtg caaatcccac     2640

aacactgaat gcaaagtagc aatttccata gtcacagtta ggtagcttta gggcaatatt     2700

gccatggttt tactcatgtg caggttttga aaatgtacaa tatgtataat ttttaaaatg     2760

ttttattatt ttgaaaataa tgttgtaatt catgccaggg actgacaaaa gacttgagac     2820

aggatggtta ctcttgtcag ctaaggtcac attgtgcctt tttgaccttt tcttcctgga     2880

ctattgaaat caagcttatt ggattaagtg atatttctat agcgattgaa agggcaatag     2940

ttaaagtaat gagcatgatg agagtttctg ttaatcatgt attaaaactg atttttagct     3000

ttacaaatat gtcagtttgc agttatgcag aatccaaagt aaatgtcctg ctagctagtt     3060

aaggattgtt ttaaatctgt tattttgcta tttgcctgtt agacatgact gatgacatat     3120

ctgaaagaca agtatgttga gagttgctgg tgtaaaatac gtttgaaata gttgatctac     3180

aaaggccatg ggaaaaattc agagagttag gaaggaaaaa ccaatagctt taaaacctgt     3240

gtgccatttt aagagttact taatgtttgg taacttttat gccttcactt tacaaattca     3300

agccttagat aaaagaaccg agcaattttc tgctaaaaag tccttgattt agcactattt     3360

acatacaggc catactttac aaagtatttg ctgaatgggg accttttgag ttgaatttat     3420

tttattattt ttattttgtt taatgtctgg tgctttctgt cacctcttct aatcttttaa     3480

tgtatttgtt tgcaattttg gggtaagact ttttttatga gtactttttc tttgaagttt     3540

tagcggtcaa tttgcctttt taatgaacat gtgaagttat actgtggcta tgcaacagct     3600

ctcacctacg cgagtcttac tttgagttag tgccataaca gaccactgta tgtttacttc     3660

tcaccatttg agttgcccat cttgtttcac actagtcaca ttcttgtttt aagtgccttt     3720

agttttaaca gttcactttt tacagtgcta tttactgaag ttatttatta aatatgccta     3780

aaatacttaa atcggatgtc ttgactctga tgtattttat caggttgtgt gcatgaaatt     3840

tttatagatt aaagaagttg aggaaaagca aaaaaaaaa                            3879


<210>  304
<211>  798
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, beta 1 (fibronectin receptor, beta 
       polypeptide, antigen CD29 includes MDF2, MSK12) (ITGB1), 
       transcript variant 1A, polypeptide GenBank Accession No.
       NP_002202.2 GI:19743813

<400>  304

Met Asn Leu Gln Pro Ile Phe Trp Ile Gly Leu Ile Ser Ser Val Cys 
1               5                   10                  15      


Cys Val Phe Ala Gln Thr Asp Glu Asn Arg Cys Leu Lys Ala Asn Ala 
            20                  25                  30          


Lys Ser Cys Gly Glu Cys Ile Gln Ala Gly Pro Asn Cys Gly Trp Cys 
        35                  40                  45              


Thr Asn Ser Thr Phe Leu Gln Glu Gly Met Pro Thr Ser Ala Arg Cys 
    50                  55                  60                  


Asp Asp Leu Glu Ala Leu Lys Lys Lys Gly Cys Pro Pro Asp Asp Ile 
65                  70                  75                  80  


Glu Asn Pro Arg Gly Ser Lys Asp Ile Lys Lys Asn Lys Asn Val Thr 
                85                  90                  95      


Asn Arg Ser Lys Gly Thr Ala Glu Lys Leu Lys Pro Glu Asp Ile Thr 
            100                 105                 110         


Gln Ile Gln Pro Gln Gln Leu Val Leu Arg Leu Arg Ser Gly Glu Pro 
        115                 120                 125             


Gln Thr Phe Thr Leu Lys Phe Lys Arg Ala Glu Asp Tyr Pro Ile Asp 
    130                 135                 140                 


Leu Tyr Tyr Leu Met Asp Leu Ser Tyr Ser Met Lys Asp Asp Leu Glu 
145                 150                 155                 160 


Asn Val Lys Ser Leu Gly Thr Asp Leu Met Asn Glu Met Arg Arg Ile 
                165                 170                 175     


Thr Ser Asp Phe Arg Ile Gly Phe Gly Ser Phe Val Glu Lys Thr Val 
            180                 185                 190         


Met Pro Tyr Ile Ser Thr Thr Pro Ala Lys Leu Arg Asn Pro Cys Thr 
        195                 200                 205             


Ser Glu Gln Asn Cys Thr Ser Pro Phe Ser Tyr Lys Asn Val Leu Ser 
    210                 215                 220                 


Leu Thr Asn Lys Gly Glu Val Phe Asn Glu Leu Val Gly Lys Gln Arg 
225                 230                 235                 240 


Ile Ser Gly Asn Leu Asp Ser Pro Glu Gly Gly Phe Asp Ala Ile Met 
                245                 250                 255     


Gln Val Ala Val Cys Gly Ser Leu Ile Gly Trp Arg Asn Val Thr Arg 
            260                 265                 270         


Leu Leu Val Phe Ser Thr Asp Ala Gly Phe His Phe Ala Gly Asp Gly 
        275                 280                 285             


Lys Leu Gly Gly Ile Val Leu Pro Asn Asp Gly Gln Cys His Leu Glu 
    290                 295                 300                 


Asn Asn Met Tyr Thr Met Ser His Tyr Tyr Asp Tyr Pro Ser Ile Ala 
305                 310                 315                 320 


His Leu Val Gln Lys Leu Ser Glu Asn Asn Ile Gln Thr Ile Phe Ala 
                325                 330                 335     


Val Thr Glu Glu Phe Gln Pro Val Tyr Lys Glu Leu Lys Asn Leu Ile 
            340                 345                 350         


Pro Lys Ser Ala Val Gly Thr Leu Ser Ala Asn Ser Ser Asn Val Ile 
        355                 360                 365             


Gln Leu Ile Ile Asp Ala Tyr Asn Ser Leu Ser Ser Glu Val Ile Leu 
    370                 375                 380                 


Glu Asn Gly Lys Leu Ser Glu Gly Val Thr Ile Ser Tyr Lys Ser Tyr 
385                 390                 395                 400 


Cys Lys Asn Gly Val Asn Gly Thr Gly Glu Asn Gly Arg Lys Cys Ser 
                405                 410                 415     


Asn Ile Ser Ile Gly Asp Glu Val Gln Phe Glu Ile Ser Ile Thr Ser 
            420                 425                 430         


Asn Lys Cys Pro Lys Lys Asp Ser Asp Ser Phe Lys Ile Arg Pro Leu 
        435                 440                 445             


Gly Phe Thr Glu Glu Val Glu Val Ile Leu Gln Tyr Ile Cys Glu Cys 
    450                 455                 460                 


Glu Cys Gln Ser Glu Gly Ile Pro Glu Ser Pro Lys Cys His Glu Gly 
465                 470                 475                 480 


Asn Gly Thr Phe Glu Cys Gly Ala Cys Arg Cys Asn Glu Gly Arg Val 
                485                 490                 495     


Gly Arg His Cys Glu Cys Ser Thr Asp Glu Val Asn Ser Glu Asp Met 
            500                 505                 510         


Asp Ala Tyr Cys Arg Lys Glu Asn Ser Ser Glu Ile Cys Ser Asn Asn 
        515                 520                 525             


Gly Glu Cys Val Cys Gly Gln Cys Val Cys Arg Lys Arg Asp Asn Thr 
    530                 535                 540                 


Asn Glu Ile Tyr Ser Gly Lys Phe Cys Glu Cys Asp Asn Phe Asn Cys 
545                 550                 555                 560 


Asp Arg Ser Asn Gly Leu Ile Cys Gly Gly Asn Gly Val Cys Lys Cys 
                565                 570                 575     


Arg Val Cys Glu Cys Asn Pro Asn Tyr Thr Gly Ser Ala Cys Asp Cys 
            580                 585                 590         


Ser Leu Asp Thr Ser Thr Cys Glu Ala Ser Asn Gly Gln Ile Cys Asn 
        595                 600                 605             


Gly Arg Gly Ile Cys Glu Cys Gly Val Cys Lys Cys Thr Asp Pro Lys 
    610                 615                 620                 


Phe Gln Gly Gln Thr Cys Glu Met Cys Gln Thr Cys Leu Gly Val Cys 
625                 630                 635                 640 


Ala Glu His Lys Glu Cys Val Gln Cys Arg Ala Phe Asn Lys Gly Glu 
                645                 650                 655     


Lys Lys Asp Thr Cys Thr Gln Glu Cys Ser Tyr Phe Asn Ile Thr Lys 
            660                 665                 670         


Val Glu Ser Arg Asp Lys Leu Pro Gln Pro Val Gln Pro Asp Pro Val 
        675                 680                 685             


Ser His Cys Lys Glu Lys Asp Val Asp Asp Cys Trp Phe Tyr Phe Thr 
    690                 695                 700                 


Tyr Ser Val Asn Gly Asn Asn Glu Val Met Val His Val Val Glu Asn 
705                 710                 715                 720 


Pro Glu Cys Pro Thr Gly Pro Asp Ile Ile Pro Ile Val Ala Gly Val 
                725                 730                 735     


Val Ala Gly Ile Val Leu Ile Gly Leu Ala Leu Leu Leu Ile Trp Lys 
            740                 745                 750         


Leu Leu Met Ile Ile His Asp Arg Arg Glu Phe Ala Lys Phe Glu Lys 
        755                 760                 765             


Glu Lys Met Asn Ala Lys Trp Asp Thr Gly Glu Asn Pro Ile Tyr Lys 
    770                 775                 780                 


Ser Ala Val Thr Thr Val Val Asn Pro Lys Tyr Glu Gly Lys 
785                 790                 795             


<210>  305
<211>  3739
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, beta 1 (fibronectin receptor, beta 
       polypeptide, antigen CD29 includes MDF2, MSK12) (ITGB1), 
       transcript variant 1D, mRNA GenBank Accession No. NM_033668.2  
       GI:182507160

<400>  305
atgaatttac aaccaatttt ctggattgga ctgatcagtt cagtttgctg tgtgtttgct       60

caaacagatg aaaatagatg tttaaaagca aatgccaaat catgtggaga atgtatacaa      120

gcagggccaa attgtgggtg gtgcacaaat tcaacatttt tacaggaagg aatgcctact      180

tctgcacgat gtgatgattt agaagcctta aaaaagaagg gttgccctcc agatgacata      240

gaaaatccca gaggctccaa agatataaag aaaaataaaa atgtaaccaa ccgtagcaaa      300

ggaacagcag agaagctcaa gccagaggat attactcaga tccaaccaca gcagttggtt      360

ttgcgattaa gatcagggga gccacagaca tttacattaa aattcaagag agctgaagac      420

tatcccattg acctctacta ccttatggac ctgtcttact caatgaaaga cgatttggag      480

aatgtaaaaa gtcttggaac agatctgatg aatgaaatga ggaggattac ttcggacttc      540

agaattggat ttggctcatt tgtggaaaag actgtgatgc cttacattag cacaacacca      600

gctaagctca ggaacccttg cacaagtgaa cagaactgca ccagcccatt tagctacaaa      660

aatgtgctca gtcttactaa taaaggagaa gtatttaatg aacttgttgg aaaacagcgc      720

atatctggaa atttggattc tccagaaggt ggtttcgatg ccatcatgca agttgcagtt      780

tgtggatcac tgattggctg gaggaatgtt acacggctgc tggtgttttc cacagatgcc      840

gggtttcact ttgctggaga tgggaaactt ggtggcattg ttttaccaaa tgatggacaa      900

tgtcacctgg aaaataatat gtacacaatg agccattatt atgattatcc ttctattgct      960

caccttgtcc agaaactgag tgaaaataat attcagacaa tttttgcagt tactgaagaa     1020

tttcagcctg tttacaagga gctgaaaaac ttgatcccta agtcagcagt aggaacatta     1080

tctgcaaatt ctagcaatgt aattcagttg atcattgatg catacaattc cctttcctca     1140

gaagtcattt tggaaaacgg caaattgtca gaaggcgtaa caataagtta caaatcttac     1200

tgcaagaacg gggtgaatgg aacaggggaa aatggaagaa aatgttccaa tatttccatt     1260

ggagatgagg ttcaatttga aattagcata acttcaaata agtgtccaaa aaaggattct     1320

gacagcttta aaattaggcc tctgggcttt acggaggaag tagaggttat tcttcagtac     1380

atctgtgaat gtgaatgcca aagcgaaggc atccctgaaa gtcccaagtg tcatgaagga     1440

aatgggacat ttgagtgtgg cgcgtgcagg tgcaatgaag ggcgtgttgg tagacattgt     1500

gaatgcagca cagatgaagt taacagtgaa gacatggatg cttactgcag gaaagaaaac     1560

agttcagaaa tctgcagtaa caatggagag tgcgtctgcg gacagtgtgt ttgtaggaag     1620

agggataata caaatgaaat ttattctggc aaattctgcg agtgtgataa tttcaactgt     1680

gatagatcca atggcttaat ttgtggagga aatggtgttt gcaagtgtcg tgtgtgtgag     1740

tgcaacccca actacactgg cagtgcatgt gactgttctt tggatactag tacttgtgaa     1800

gccagcaacg gacagatctg caatggccgg ggcatctgcg agtgtggtgt ctgtaagtgt     1860

acagatccga agtttcaagg gcaaacgtgt gagatgtgtc agacctgcct tggtgtctgt     1920

gctgagcata aagaatgtgt tcagtgcaga gccttcaata aaggagaaaa gaaagacaca     1980

tgcacacagg aatgttccta ttttaacatt accaaggtag aaagtcggga caaattaccc     2040

cagccggtcc aacctgatcc tgtgtcccat tgtaaggaga aggatgttga cgactgttgg     2100

ttctatttta cgtattcagt gaatgggaac aacgaggtca tggttcatgt tgtggagaat     2160

ccagagtgtc ccactggtcc agacatcatt ccaattgtag ctggtgtggt tgctggaatt     2220

gttcttattg gccttgcatt actgctgata tggaagcttt taatgataat tcatgacaga     2280

agggagtttg ctaaatttga aaaggagaaa atgaatgcca aatgggacac gcaagaaaat     2340

ccgatttaca agagtcctat taataatttc aagaatccaa actacggacg taaagctggt     2400

ctctaaattg ccggtgaaaa tcctatttat aagagtgccg taacaactgt ggtcaatccg     2460

aagtatgagg gaaaatgagt actgcccgtg caaatcccac aacactgaat gcaaagtagc     2520

aatttccata gtcacagtta ggtagcttta gggcaatatt gccatggttt tactcatgtg     2580

caggttttga aaatgtacaa tatgtataat ttttaaaatg ttttattatt ttgaaaataa     2640

tgttgtaatt catgccaggg actgacaaaa gacttgagac aggatggtta ctcttgtcag     2700

ctaaggtcac attgtgcctt tttgaccttt tcttcctgga ctattgaaat caagcttatt     2760

ggattaagtg atatttctat agcgattgaa agggcaatag ttaaagtaat gagcatgatg     2820

agagtttctg ttaatcatgt attaaaactg atttttagct ttacaaatat gtcagtttgc     2880

agttatgcag aatccaaagt aaatgtcctg ctagctagtt aaggattgtt ttaaatctgt     2940

tattttgcta tttgcctgtt agacatgact gatgacatat ctgaaagaca agtatgttga     3000

gagttgctgg tgtaaaatac gtttgaaata gttgatctac aaaggccatg ggaaaaattc     3060

agagagttag gaaggaaaaa ccaatagctt taaaacctgt gtgccatttt aagagttact     3120

taatgtttgg taacttttat gccttcactt tacaaattca agccttagat aaaagaaccg     3180

agcaattttc tgctaaaaag tccttgattt agcactattt acatacaggc catactttac     3240

aaagtatttg ctgaatgggg accttttgag ttgaatttat tttattattt ttattttgtt     3300

taatgtctgg tgctttctgt cacctcttct aatcttttaa tgtatttgtt tgcaattttg     3360

gggtaagact ttttttatga gtactttttc tttgaagttt tagcggtcaa tttgcctttt     3420

taatgaacat gtgaagttat actgtggcta tgcaacagct ctcacctacg cgagtcttac     3480

tttgagttag tgccataaca gaccactgta tgtttacttc tcaccatttg agttgcccat     3540

cttgtttcac actagtcaca ttcttgtttt aagtgccttt agttttaaca gttcactttt     3600

tacagtgcta tttactgaag ttatttatta aatatgccta aaatacttaa atcggatgtc     3660

ttgactctga tgtattttat caggttgtgt gcatgaaatt tttatagatt aaagaagttg     3720

aggaaaagca aaaaaaaaa                                                  3739


<210>  306
<211>  801
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, beta 1 (fibronectin receptor, beta 
       polypeptide, antigen CD29 includes MDF2, MSK12) (ITGB1), 
       transcript variant 1D, polypeptide GenBank Accession No. 
       NM_033668.2  GI:182507160

<400>  306

Met Asn Leu Gln Pro Ile Phe Trp Ile Gly Leu Ile Ser Ser Val Cys 
1               5                   10                  15      


Cys Val Phe Ala Gln Thr Asp Glu Asn Arg Cys Leu Lys Ala Asn Ala 
            20                  25                  30          


Lys Ser Cys Gly Glu Cys Ile Gln Ala Gly Pro Asn Cys Gly Trp Cys 
        35                  40                  45              


Thr Asn Ser Thr Phe Leu Gln Glu Gly Met Pro Thr Ser Ala Arg Cys 
    50                  55                  60                  


Asp Asp Leu Glu Ala Leu Lys Lys Lys Gly Cys Pro Pro Asp Asp Ile 
65                  70                  75                  80  


Glu Asn Pro Arg Gly Ser Lys Asp Ile Lys Lys Asn Lys Asn Val Thr 
                85                  90                  95      


Asn Arg Ser Lys Gly Thr Ala Glu Lys Leu Lys Pro Glu Asp Ile Thr 
            100                 105                 110         


Gln Ile Gln Pro Gln Gln Leu Val Leu Arg Leu Arg Ser Gly Glu Pro 
        115                 120                 125             


Gln Thr Phe Thr Leu Lys Phe Lys Arg Ala Glu Asp Tyr Pro Ile Asp 
    130                 135                 140                 


Leu Tyr Tyr Leu Met Asp Leu Ser Tyr Ser Met Lys Asp Asp Leu Glu 
145                 150                 155                 160 


Asn Val Lys Ser Leu Gly Thr Asp Leu Met Asn Glu Met Arg Arg Ile 
                165                 170                 175     


Thr Ser Asp Phe Arg Ile Gly Phe Gly Ser Phe Val Glu Lys Thr Val 
            180                 185                 190         


Met Pro Tyr Ile Ser Thr Thr Pro Ala Lys Leu Arg Asn Pro Cys Thr 
        195                 200                 205             


Ser Glu Gln Asn Cys Thr Ser Pro Phe Ser Tyr Lys Asn Val Leu Ser 
    210                 215                 220                 


Leu Thr Asn Lys Gly Glu Val Phe Asn Glu Leu Val Gly Lys Gln Arg 
225                 230                 235                 240 


Ile Ser Gly Asn Leu Asp Ser Pro Glu Gly Gly Phe Asp Ala Ile Met 
                245                 250                 255     


Gln Val Ala Val Cys Gly Ser Leu Ile Gly Trp Arg Asn Val Thr Arg 
            260                 265                 270         


Leu Leu Val Phe Ser Thr Asp Ala Gly Phe His Phe Ala Gly Asp Gly 
        275                 280                 285             


Lys Leu Gly Gly Ile Val Leu Pro Asn Asp Gly Gln Cys His Leu Glu 
    290                 295                 300                 


Asn Asn Met Tyr Thr Met Ser His Tyr Tyr Asp Tyr Pro Ser Ile Ala 
305                 310                 315                 320 


His Leu Val Gln Lys Leu Ser Glu Asn Asn Ile Gln Thr Ile Phe Ala 
                325                 330                 335     


Val Thr Glu Glu Phe Gln Pro Val Tyr Lys Glu Leu Lys Asn Leu Ile 
            340                 345                 350         


Pro Lys Ser Ala Val Gly Thr Leu Ser Ala Asn Ser Ser Asn Val Ile 
        355                 360                 365             


Gln Leu Ile Ile Asp Ala Tyr Asn Ser Leu Ser Ser Glu Val Ile Leu 
    370                 375                 380                 


Glu Asn Gly Lys Leu Ser Glu Gly Val Thr Ile Ser Tyr Lys Ser Tyr 
385                 390                 395                 400 


Cys Lys Asn Gly Val Asn Gly Thr Gly Glu Asn Gly Arg Lys Cys Ser 
                405                 410                 415     


Asn Ile Ser Ile Gly Asp Glu Val Gln Phe Glu Ile Ser Ile Thr Ser 
            420                 425                 430         


Asn Lys Cys Pro Lys Lys Asp Ser Asp Ser Phe Lys Ile Arg Pro Leu 
        435                 440                 445             


Gly Phe Thr Glu Glu Val Glu Val Ile Leu Gln Tyr Ile Cys Glu Cys 
    450                 455                 460                 


Glu Cys Gln Ser Glu Gly Ile Pro Glu Ser Pro Lys Cys His Glu Gly 
465                 470                 475                 480 


Asn Gly Thr Phe Glu Cys Gly Ala Cys Arg Cys Asn Glu Gly Arg Val 
                485                 490                 495     


Gly Arg His Cys Glu Cys Ser Thr Asp Glu Val Asn Ser Glu Asp Met 
            500                 505                 510         


Asp Ala Tyr Cys Arg Lys Glu Asn Ser Ser Glu Ile Cys Ser Asn Asn 
        515                 520                 525             


Gly Glu Cys Val Cys Gly Gln Cys Val Cys Arg Lys Arg Asp Asn Thr 
    530                 535                 540                 


Asn Glu Ile Tyr Ser Gly Lys Phe Cys Glu Cys Asp Asn Phe Asn Cys 
545                 550                 555                 560 


Asp Arg Ser Asn Gly Leu Ile Cys Gly Gly Asn Gly Val Cys Lys Cys 
                565                 570                 575     


Arg Val Cys Glu Cys Asn Pro Asn Tyr Thr Gly Ser Ala Cys Asp Cys 
            580                 585                 590         


Ser Leu Asp Thr Ser Thr Cys Glu Ala Ser Asn Gly Gln Ile Cys Asn 
        595                 600                 605             


Gly Arg Gly Ile Cys Glu Cys Gly Val Cys Lys Cys Thr Asp Pro Lys 
    610                 615                 620                 


Phe Gln Gly Gln Thr Cys Glu Met Cys Gln Thr Cys Leu Gly Val Cys 
625                 630                 635                 640 


Ala Glu His Lys Glu Cys Val Gln Cys Arg Ala Phe Asn Lys Gly Glu 
                645                 650                 655     


Lys Lys Asp Thr Cys Thr Gln Glu Cys Ser Tyr Phe Asn Ile Thr Lys 
            660                 665                 670         


Val Glu Ser Arg Asp Lys Leu Pro Gln Pro Val Gln Pro Asp Pro Val 
        675                 680                 685             


Ser His Cys Lys Glu Lys Asp Val Asp Asp Cys Trp Phe Tyr Phe Thr 
    690                 695                 700                 


Tyr Ser Val Asn Gly Asn Asn Glu Val Met Val His Val Val Glu Asn 
705                 710                 715                 720 


Pro Glu Cys Pro Thr Gly Pro Asp Ile Ile Pro Ile Val Ala Gly Val 
                725                 730                 735     


Val Ala Gly Ile Val Leu Ile Gly Leu Ala Leu Leu Leu Ile Trp Lys 
            740                 745                 750         


Leu Leu Met Ile Ile His Asp Arg Arg Glu Phe Ala Lys Phe Glu Lys 
        755                 760                 765             


Glu Lys Met Asn Ala Lys Trp Asp Thr Gln Glu Asn Pro Ile Tyr Lys 
    770                 775                 780                 


Ser Pro Ile Asn Asn Phe Lys Asn Pro Asn Tyr Gly Arg Lys Ala Gly 
785                 790                 795                 800 


Leu 
    


<210>  307
<211>  3794
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, beta 1 (fibronectin receptor, beta 
       polypeptide, antigen CD29 includes MDF2, MSK12) (ITGB1), 
       transcript variant 1E, mRNA GenBank Accession No. NM_133376.2  
       GI:182507162

<400>  307
gagccagccc agccgcgttc cgaacgtgag ggtcgccggc ctgggcgctg tcacgtcggg       60

gctgccggag ctgcggggga ccgggcccga acggcccctg acacctgcgg tctcccgccg      120

ggctgggcaa gcgcagatga atttacaacc aattttctgg attggactga tcagttcagt      180

ttgctgtgtg tttgctcaaa cagatgaaaa tagatgttta aaagcaaatg ccaaatcatg      240

tggagaatgt atacaagcag ggccaaattg tgggtggtgc acaaattcaa catttttaca      300

ggaaggaatg cctacttctg cacgatgtga tgatttagaa gccttaaaaa agaagggttg      360

ccctccagat gacatagaaa atcccagagg ctccaaagat ataaagaaaa ataaaaatgt      420

aaccaaccgt agcaaaggaa cagcagagaa gctcaagcca gaggatatta ctcagatcca      480

accacagcag ttggttttgc gattaagatc aggggagcca cagacattta cattaaaatt      540

caagagagct gaagactatc ccattgacct ctactacctt atggacctgt cttactcaat      600

gaaagacgat ttggagaatg taaaaagtct tggaacagat ctgatgaatg aaatgaggag      660

gattacttcg gacttcagaa ttggatttgg ctcatttgtg gaaaagactg tgatgcctta      720

cattagcaca acaccagcta agctcaggaa cccttgcaca agtgaacaga actgcaccag      780

cccatttagc tacaaaaatg tgctcagtct tactaataaa ggagaagtat ttaatgaact      840

tgttggaaaa cagcgcatat ctggaaattt ggattctcca gaaggtggtt tcgatgccat      900

catgcaagtt gcagtttgtg gatcactgat tggctggagg aatgttacac ggctgctggt      960

gttttccaca gatgccgggt ttcactttgc tggagatggg aaacttggtg gcattgtttt     1020

accaaatgat ggacaatgtc acctggaaaa taatatgtac acaatgagcc attattatga     1080

ttatccttct attgctcacc ttgtccagaa actgagtgaa aataatattc agacaatttt     1140

tgcagttact gaagaatttc agcctgttta caaggagctg aaaaacttga tccctaagtc     1200

agcagtagga acattatctg caaattctag caatgtaatt cagttgatca ttgatgcata     1260

caattccctt tcctcagaag tcattttgga aaacggcaaa ttgtcagaag gcgtaacaat     1320

aagttacaaa tcttactgca agaacggggt gaatggaaca ggggaaaatg gaagaaaatg     1380

ttccaatatt tccattggag atgaggttca atttgaaatt agcataactt caaataagtg     1440

tccaaaaaag gattctgaca gctttaaaat taggcctctg ggctttacgg aggaagtaga     1500

ggttattctt cagtacatct gtgaatgtga atgccaaagc gaaggcatcc ctgaaagtcc     1560

caagtgtcat gaaggaaatg ggacatttga gtgtggcgcg tgcaggtgca atgaagggcg     1620

tgttggtaga cattgtgaat gcagcacaga tgaagttaac agtgaagaca tggatgctta     1680

ctgcaggaaa gaaaacagtt cagaaatctg cagtaacaat ggagagtgcg tctgcggaca     1740

gtgtgtttgt aggaagaggg ataatacaaa tgaaatttat tctggcaaat tctgcgagtg     1800

tgataatttc aactgtgata gatccaatgg cttaatttgt ggaggaaatg gtgtttgcaa     1860

gtgtcgtgtg tgtgagtgca accccaacta cactggcagt gcatgtgact gttctttgga     1920

tactagtact tgtgaagcca gcaacggaca gatctgcaat ggccggggca tctgcgagtg     1980

tggtgtctgt aagtgtacag atccgaagtt tcaagggcaa acgtgtgaga tgtgtcagac     2040

ctgccttggt gtctgtgctg agcataaaga atgtgttcag tgcagagcct tcaataaagg     2100

agaaaagaaa gacacatgca cacaggaatg ttcctatttt aacattacca aggtagaaag     2160

tcgggacaaa ttaccccagc cggtccaacc tgatcctgtg tcccattgta aggagaagga     2220

tgttgacgac tgttggttct attttacgta ttcagtgaat gggaacaacg aggtcatggt     2280

tcatgttgtg gagaatccag agtgtcccac tggtccagac atcattccaa ttgtagctgg     2340

tgtggttgct ggaattgttc ttattggcct tgcattactg ctgatatgga agcttttaat     2400

gataattcat gacagaaggg agtttgctaa atttgaaaag gagaaaatga atgccaaatg     2460

ggacacgggt gaaaatccta tttataagag tgccgtaaca actgtggtca atccgaagta     2520

tgagggaaaa tgagtactgc ccgtgcaaat cccacaacac tgaatgcaaa gtagcaattt     2580

ccatagtcac agttaggtag ctttagggca atattgccat ggttttactc atgtgcaggt     2640

tttgaaaatg tacaatatgt ataattttta aaatgtttta ttattttgaa aataatgttg     2700

taattcatgc cagggactga caaaagactt gagacaggat ggttactctt gtcagctaag     2760

gtcacattgt gcctttttga ccttttcttc ctggactatt gaaatcaagc ttattggatt     2820

aagtgatatt tctatagcga ttgaaagggc aatagttaaa gtaatgagca tgatgagagt     2880

ttctgttaat catgtattaa aactgatttt tagctttaca aatatgtcag tttgcagtta     2940

tgcagaatcc aaagtaaatg tcctgctagc tagttaagga ttgttttaaa tctgttattt     3000

tgctatttgc ctgttagaca tgactgatga catatctgaa agacaagtat gttgagagtt     3060

gctggtgtaa aatacgtttg aaatagttga tctacaaagg ccatgggaaa aattcagaga     3120

gttaggaagg aaaaaccaat agctttaaaa cctgtgtgcc attttaagag ttacttaatg     3180

tttggtaact tttatgcctt cactttacaa attcaagcct tagataaaag aaccgagcaa     3240

ttttctgcta aaaagtcctt gatttagcac tatttacata caggccatac tttacaaagt     3300

atttgctgaa tggggacctt ttgagttgaa tttattttat tatttttatt ttgtttaatg     3360

tctggtgctt tctgtcacct cttctaatct tttaatgtat ttgtttgcaa ttttggggta     3420

agactttttt tatgagtact ttttctttga agttttagcg gtcaatttgc ctttttaatg     3480

aacatgtgaa gttatactgt ggctatgcaa cagctctcac ctacgcgagt cttactttga     3540

gttagtgcca taacagacca ctgtatgttt acttctcacc atttgagttg cccatcttgt     3600

ttcacactag tcacattctt gttttaagtg cctttagttt taacagttca ctttttacag     3660

tgctatttac tgaagttatt tattaaatat gcctaaaata cttaaatcgg atgtcttgac     3720

tctgatgtat tttatcaggt tgtgtgcatg aaatttttat agattaaaga agttgaggaa     3780

aagcaaaaaa aaaa                                                       3794


<210>  308
<211>  798
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, beta 1 (fibronectin receptor, beta 
       polypeptide, antigen CD29 includes MDF2, MSK12) (ITGB1), 
       transcript variant 1E, polypeptide GenBank Accession No. 
       NP_596867.1 GI:19743823

<400>  308

Met Asn Leu Gln Pro Ile Phe Trp Ile Gly Leu Ile Ser Ser Val Cys 
1               5                   10                  15      


Cys Val Phe Ala Gln Thr Asp Glu Asn Arg Cys Leu Lys Ala Asn Ala 
            20                  25                  30          


Lys Ser Cys Gly Glu Cys Ile Gln Ala Gly Pro Asn Cys Gly Trp Cys 
        35                  40                  45              


Thr Asn Ser Thr Phe Leu Gln Glu Gly Met Pro Thr Ser Ala Arg Cys 
    50                  55                  60                  


Asp Asp Leu Glu Ala Leu Lys Lys Lys Gly Cys Pro Pro Asp Asp Ile 
65                  70                  75                  80  


Glu Asn Pro Arg Gly Ser Lys Asp Ile Lys Lys Asn Lys Asn Val Thr 
                85                  90                  95      


Asn Arg Ser Lys Gly Thr Ala Glu Lys Leu Lys Pro Glu Asp Ile Thr 
            100                 105                 110         


Gln Ile Gln Pro Gln Gln Leu Val Leu Arg Leu Arg Ser Gly Glu Pro 
        115                 120                 125             


Gln Thr Phe Thr Leu Lys Phe Lys Arg Ala Glu Asp Tyr Pro Ile Asp 
    130                 135                 140                 


Leu Tyr Tyr Leu Met Asp Leu Ser Tyr Ser Met Lys Asp Asp Leu Glu 
145                 150                 155                 160 


Asn Val Lys Ser Leu Gly Thr Asp Leu Met Asn Glu Met Arg Arg Ile 
                165                 170                 175     


Thr Ser Asp Phe Arg Ile Gly Phe Gly Ser Phe Val Glu Lys Thr Val 
            180                 185                 190         


Met Pro Tyr Ile Ser Thr Thr Pro Ala Lys Leu Arg Asn Pro Cys Thr 
        195                 200                 205             


Ser Glu Gln Asn Cys Thr Ser Pro Phe Ser Tyr Lys Asn Val Leu Ser 
    210                 215                 220                 


Leu Thr Asn Lys Gly Glu Val Phe Asn Glu Leu Val Gly Lys Gln Arg 
225                 230                 235                 240 


Ile Ser Gly Asn Leu Asp Ser Pro Glu Gly Gly Phe Asp Ala Ile Met 
                245                 250                 255     


Gln Val Ala Val Cys Gly Ser Leu Ile Gly Trp Arg Asn Val Thr Arg 
            260                 265                 270         


Leu Leu Val Phe Ser Thr Asp Ala Gly Phe His Phe Ala Gly Asp Gly 
        275                 280                 285             


Lys Leu Gly Gly Ile Val Leu Pro Asn Asp Gly Gln Cys His Leu Glu 
    290                 295                 300                 


Asn Asn Met Tyr Thr Met Ser His Tyr Tyr Asp Tyr Pro Ser Ile Ala 
305                 310                 315                 320 


His Leu Val Gln Lys Leu Ser Glu Asn Asn Ile Gln Thr Ile Phe Ala 
                325                 330                 335     


Val Thr Glu Glu Phe Gln Pro Val Tyr Lys Glu Leu Lys Asn Leu Ile 
            340                 345                 350         


Pro Lys Ser Ala Val Gly Thr Leu Ser Ala Asn Ser Ser Asn Val Ile 
        355                 360                 365             


Gln Leu Ile Ile Asp Ala Tyr Asn Ser Leu Ser Ser Glu Val Ile Leu 
    370                 375                 380                 


Glu Asn Gly Lys Leu Ser Glu Gly Val Thr Ile Ser Tyr Lys Ser Tyr 
385                 390                 395                 400 


Cys Lys Asn Gly Val Asn Gly Thr Gly Glu Asn Gly Arg Lys Cys Ser 
                405                 410                 415     


Asn Ile Ser Ile Gly Asp Glu Val Gln Phe Glu Ile Ser Ile Thr Ser 
            420                 425                 430         


Asn Lys Cys Pro Lys Lys Asp Ser Asp Ser Phe Lys Ile Arg Pro Leu 
        435                 440                 445             


Gly Phe Thr Glu Glu Val Glu Val Ile Leu Gln Tyr Ile Cys Glu Cys 
    450                 455                 460                 


Glu Cys Gln Ser Glu Gly Ile Pro Glu Ser Pro Lys Cys His Glu Gly 
465                 470                 475                 480 


Asn Gly Thr Phe Glu Cys Gly Ala Cys Arg Cys Asn Glu Gly Arg Val 
                485                 490                 495     


Gly Arg His Cys Glu Cys Ser Thr Asp Glu Val Asn Ser Glu Asp Met 
            500                 505                 510         


Asp Ala Tyr Cys Arg Lys Glu Asn Ser Ser Glu Ile Cys Ser Asn Asn 
        515                 520                 525             


Gly Glu Cys Val Cys Gly Gln Cys Val Cys Arg Lys Arg Asp Asn Thr 
    530                 535                 540                 


Asn Glu Ile Tyr Ser Gly Lys Phe Cys Glu Cys Asp Asn Phe Asn Cys 
545                 550                 555                 560 


Asp Arg Ser Asn Gly Leu Ile Cys Gly Gly Asn Gly Val Cys Lys Cys 
                565                 570                 575     


Arg Val Cys Glu Cys Asn Pro Asn Tyr Thr Gly Ser Ala Cys Asp Cys 
            580                 585                 590         


Ser Leu Asp Thr Ser Thr Cys Glu Ala Ser Asn Gly Gln Ile Cys Asn 
        595                 600                 605             


Gly Arg Gly Ile Cys Glu Cys Gly Val Cys Lys Cys Thr Asp Pro Lys 
    610                 615                 620                 


Phe Gln Gly Gln Thr Cys Glu Met Cys Gln Thr Cys Leu Gly Val Cys 
625                 630                 635                 640 


Ala Glu His Lys Glu Cys Val Gln Cys Arg Ala Phe Asn Lys Gly Glu 
                645                 650                 655     


Lys Lys Asp Thr Cys Thr Gln Glu Cys Ser Tyr Phe Asn Ile Thr Lys 
            660                 665                 670         


Val Glu Ser Arg Asp Lys Leu Pro Gln Pro Val Gln Pro Asp Pro Val 
        675                 680                 685             


Ser His Cys Lys Glu Lys Asp Val Asp Asp Cys Trp Phe Tyr Phe Thr 
    690                 695                 700                 


Tyr Ser Val Asn Gly Asn Asn Glu Val Met Val His Val Val Glu Asn 
705                 710                 715                 720 


Pro Glu Cys Pro Thr Gly Pro Asp Ile Ile Pro Ile Val Ala Gly Val 
                725                 730                 735     


Val Ala Gly Ile Val Leu Ile Gly Leu Ala Leu Leu Leu Ile Trp Lys 
            740                 745                 750         


Leu Leu Met Ile Ile His Asp Arg Arg Glu Phe Ala Lys Phe Glu Lys 
        755                 760                 765             


Glu Lys Met Asn Ala Lys Trp Asp Thr Gly Glu Asn Pro Ile Tyr Lys 
    770                 775                 780                 


Ser Ala Val Thr Thr Val Val Asn Pro Lys Tyr Glu Gly Lys 
785                 790                 795             


<210>  309
<211>  2977
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, beta 2 (complement component 3 receptor 3 
       and 4 subunit) (ITGB2), transcript variant 1, mRNA GenBank 
       Accession No. NM_000211.3  GI:188595673

<400>  309
gggccgctct ctgacatcag agctgctgta gagcggagag gggcaggggt gaagggccac       60

ggtggtgcaa cccaccactt cctccaagga ggagctgaga ggaacaggaa gtgtcaggac      120

tttacgaccc gcgcctccag ctgaggtttc tagacgtgac ccagggcaga ctggtagcaa      180

agcccccacg cccagccagg agcaccgccg aggactccag cacaccgagg gacatgctgg      240

gcctgcgccc cccactgctc gccctggtgg ggctgctctc cctcgggtgc gtcctctctc      300

aggagtgcac gaagttcaag gtcagcagct gccgggaatg catcgagtcg gggcccggct      360

gcacctggtg ccagaagctg aacttcacag ggccggggga tcctgactcc attcgctgcg      420

acacccggcc acagctgctc atgaggggct gtgcggctga cgacatcatg gaccccacaa      480

gcctcgctga aacccaggaa gaccacaatg ggggccagaa gcagctgtcc ccacaaaaag      540

tgacgcttta cctgcgacca ggccaggcag cagcgttcaa cgtgaccttc cggcgggcca      600

agggctaccc catcgacctg tactatctga tggacctctc ctactccatg cttgatgacc      660

tcaggaatgt caagaagcta ggtggcgacc tgctccgggc cctcaacgag atcaccgagt      720

ccggccgcat tggcttcggg tccttcgtgg acaagaccgt gctgccgttc gtgaacacgc      780

accctgataa gctgcgaaac ccatgcccca acaaggagaa agagtgccag cccccgtttg      840

ccttcaggca cgtgctgaag ctgaccaaca actccaacca gtttcagacc gaggtcggga      900

agcagctgat ttccggaaac ctggatgcac ccgagggtgg gctggacgcc atgatgcagg      960

tcgccgcctg cccggaggaa atcggctggc gcaacgtcac gcggctgctg gtgtttgcca     1020

ctgatgacgg cttccatttc gcgggcgacg ggaagctggg cgccatcctg acccccaacg     1080

acggccgctg tcacctggag gacaacttgt acaagaggag caacgaattc gactacccat     1140

cggtgggcca gctggcgcac aagctggctg aaaacaacat ccagcccatc ttcgcggtga     1200

ccagtaggat ggtgaagacc tacgagaaac tcaccgagat catccccaag tcagccgtgg     1260

gggagctgtc tgaggactcc agcaatgtgg tccaactcat taagaatgct tacaataaac     1320

tctcctccag ggtcttcctg gatcacaacg ccctccccga caccctgaaa gtcacctacg     1380

actccttctg cagcaatgga gtgacgcaca ggaaccagcc cagaggtgac tgtgatggcg     1440

tgcagatcaa tgtcccgatc accttccagg tgaaggtcac ggccacagag tgcatccagg     1500

agcagtcgtt tgtcatccgg gcgctgggct tcacggacat agtgaccgtg caggttcttc     1560

cccagtgtga gtgccggtgc cgggaccaga gcagagaccg cagcctctgc catggcaagg     1620

gcttcttgga gtgcggcatc tgcaggtgtg acactggcta cattgggaaa aactgtgagt     1680

gccagacaca gggccggagc agccaggagc tggaaggaag ctgccggaag gacaacaact     1740

ccatcatctg ctcagggctg ggggactgtg tctgcgggca gtgcctgtgc cacaccagcg     1800

acgtccccgg caagctgata tacgggcagt actgcgagtg tgacaccatc aactgtgagc     1860

gctacaacgg ccaggtctgc ggcggcccgg ggagggggct ctgcttctgc gggaagtgcc     1920

gctgccaccc gggctttgag ggctcagcgt gccagtgcga gaggaccact gagggctgcc     1980

tgaacccgcg gcgtgttgag tgtagtggtc gtggccggtg ccgctgcaac gtatgcgagt     2040

gccattcagg ctaccagctg cctctgtgcc aggagtgccc cggctgcccc tcaccctgtg     2100

gcaagtacat ctcctgcgcc gagtgcctga agttcgaaaa gggccccttt gggaagaact     2160

gcagcgcggc gtgtccgggc ctgcagctgt cgaacaaccc cgtgaagggc aggacctgca     2220

aggagaggga ctcagagggc tgctgggtgg cctacacgct ggagcagcag gacgggatgg     2280

accgctacct catctatgtg gatgagagcc gagagtgtgt ggcaggcccc aacatcgccg     2340

ccatcgtcgg gggcaccgtg gcaggcatcg tgctgatcgg cattctcctg ctggtcatct     2400

ggaaggctct gatccacctg agcgacctcc gggagtacag gcgctttgag aaggagaagc     2460

tcaagtccca gtggaacaat gataatcccc ttttcaagag cgccaccacg acggtcatga     2520

accccaagtt tgctgagagt taggagcact tggtgaagac aaggccgtca ggacccacca     2580

tgtctgcccc atcacgcggc cgagacatgg cttgccacag ctcttgagga tgtcaccaat     2640

taaccagaaa tccagttatt ttccgccctc aaaatgacag ccatggccgg ccgggtgctt     2700

ctgggggctc gtcgggggga cagctccact ctgactggca cagtctttgc atggagactt     2760

gaggagggag ggcttgaggt tggtgaggtt aggtgcgtgt ttcctgtgca agtcaggaca     2820

tcagtctgat taaaggtggt gccaatttat ttacatttaa acttgtcagg gtataaaatg     2880

acatcccatt aattatattg ttaatcaatc acgtgtatag aaaaaaaata aaacttcaat     2940

acaggctgtc catggaaaaa aaaaaaaaaa aaaaaaa                              2977


<210>  310
<211>  769
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, beta 2 (complement component 3 receptor 3 
       and 4 subunit) (ITGB2), transcript variant 1, polypeptide 
       GenBank Accession No. NP_000202.2 GI:89191865

<400>  310

Met Leu Gly Leu Arg Pro Pro Leu Leu Ala Leu Val Gly Leu Leu Ser 
1               5                   10                  15      


Leu Gly Cys Val Leu Ser Gln Glu Cys Thr Lys Phe Lys Val Ser Ser 
            20                  25                  30          


Cys Arg Glu Cys Ile Glu Ser Gly Pro Gly Cys Thr Trp Cys Gln Lys 
        35                  40                  45              


Leu Asn Phe Thr Gly Pro Gly Asp Pro Asp Ser Ile Arg Cys Asp Thr 
    50                  55                  60                  


Arg Pro Gln Leu Leu Met Arg Gly Cys Ala Ala Asp Asp Ile Met Asp 
65                  70                  75                  80  


Pro Thr Ser Leu Ala Glu Thr Gln Glu Asp His Asn Gly Gly Gln Lys 
                85                  90                  95      


Gln Leu Ser Pro Gln Lys Val Thr Leu Tyr Leu Arg Pro Gly Gln Ala 
            100                 105                 110         


Ala Ala Phe Asn Val Thr Phe Arg Arg Ala Lys Gly Tyr Pro Ile Asp 
        115                 120                 125             


Leu Tyr Tyr Leu Met Asp Leu Ser Tyr Ser Met Leu Asp Asp Leu Arg 
    130                 135                 140                 


Asn Val Lys Lys Leu Gly Gly Asp Leu Leu Arg Ala Leu Asn Glu Ile 
145                 150                 155                 160 


Thr Glu Ser Gly Arg Ile Gly Phe Gly Ser Phe Val Asp Lys Thr Val 
                165                 170                 175     


Leu Pro Phe Val Asn Thr His Pro Asp Lys Leu Arg Asn Pro Cys Pro 
            180                 185                 190         


Asn Lys Glu Lys Glu Cys Gln Pro Pro Phe Ala Phe Arg His Val Leu 
        195                 200                 205             


Lys Leu Thr Asn Asn Ser Asn Gln Phe Gln Thr Glu Val Gly Lys Gln 
    210                 215                 220                 


Leu Ile Ser Gly Asn Leu Asp Ala Pro Glu Gly Gly Leu Asp Ala Met 
225                 230                 235                 240 


Met Gln Val Ala Ala Cys Pro Glu Glu Ile Gly Trp Arg Asn Val Thr 
                245                 250                 255     


Arg Leu Leu Val Phe Ala Thr Asp Asp Gly Phe His Phe Ala Gly Asp 
            260                 265                 270         


Gly Lys Leu Gly Ala Ile Leu Thr Pro Asn Asp Gly Arg Cys His Leu 
        275                 280                 285             


Glu Asp Asn Leu Tyr Lys Arg Ser Asn Glu Phe Asp Tyr Pro Ser Val 
    290                 295                 300                 


Gly Gln Leu Ala His Lys Leu Ala Glu Asn Asn Ile Gln Pro Ile Phe 
305                 310                 315                 320 


Ala Val Thr Ser Arg Met Val Lys Thr Tyr Glu Lys Leu Thr Glu Ile 
                325                 330                 335     


Ile Pro Lys Ser Ala Val Gly Glu Leu Ser Glu Asp Ser Ser Asn Val 
            340                 345                 350         


Val Gln Leu Ile Lys Asn Ala Tyr Asn Lys Leu Ser Ser Arg Val Phe 
        355                 360                 365             


Leu Asp His Asn Ala Leu Pro Asp Thr Leu Lys Val Thr Tyr Asp Ser 
    370                 375                 380                 


Phe Cys Ser Asn Gly Val Thr His Arg Asn Gln Pro Arg Gly Asp Cys 
385                 390                 395                 400 


Asp Gly Val Gln Ile Asn Val Pro Ile Thr Phe Gln Val Lys Val Thr 
                405                 410                 415     


Ala Thr Glu Cys Ile Gln Glu Gln Ser Phe Val Ile Arg Ala Leu Gly 
            420                 425                 430         


Phe Thr Asp Ile Val Thr Val Gln Val Leu Pro Gln Cys Glu Cys Arg 
        435                 440                 445             


Cys Arg Asp Gln Ser Arg Asp Arg Ser Leu Cys His Gly Lys Gly Phe 
    450                 455                 460                 


Leu Glu Cys Gly Ile Cys Arg Cys Asp Thr Gly Tyr Ile Gly Lys Asn 
465                 470                 475                 480 


Cys Glu Cys Gln Thr Gln Gly Arg Ser Ser Gln Glu Leu Glu Gly Ser 
                485                 490                 495     


Cys Arg Lys Asp Asn Asn Ser Ile Ile Cys Ser Gly Leu Gly Asp Cys 
            500                 505                 510         


Val Cys Gly Gln Cys Leu Cys His Thr Ser Asp Val Pro Gly Lys Leu 
        515                 520                 525             


Ile Tyr Gly Gln Tyr Cys Glu Cys Asp Thr Ile Asn Cys Glu Arg Tyr 
    530                 535                 540                 


Asn Gly Gln Val Cys Gly Gly Pro Gly Arg Gly Leu Cys Phe Cys Gly 
545                 550                 555                 560 


Lys Cys Arg Cys His Pro Gly Phe Glu Gly Ser Ala Cys Gln Cys Glu 
                565                 570                 575     


Arg Thr Thr Glu Gly Cys Leu Asn Pro Arg Arg Val Glu Cys Ser Gly 
            580                 585                 590         


Arg Gly Arg Cys Arg Cys Asn Val Cys Glu Cys His Ser Gly Tyr Gln 
        595                 600                 605             


Leu Pro Leu Cys Gln Glu Cys Pro Gly Cys Pro Ser Pro Cys Gly Lys 
    610                 615                 620                 


Tyr Ile Ser Cys Ala Glu Cys Leu Lys Phe Glu Lys Gly Pro Phe Gly 
625                 630                 635                 640 


Lys Asn Cys Ser Ala Ala Cys Pro Gly Leu Gln Leu Ser Asn Asn Pro 
                645                 650                 655     


Val Lys Gly Arg Thr Cys Lys Glu Arg Asp Ser Glu Gly Cys Trp Val 
            660                 665                 670         


Ala Tyr Thr Leu Glu Gln Gln Asp Gly Met Asp Arg Tyr Leu Ile Tyr 
        675                 680                 685             


Val Asp Glu Ser Arg Glu Cys Val Ala Gly Pro Asn Ile Ala Ala Ile 
    690                 695                 700                 


Val Gly Gly Thr Val Ala Gly Ile Val Leu Ile Gly Ile Leu Leu Leu 
705                 710                 715                 720 


Val Ile Trp Lys Ala Leu Ile His Leu Ser Asp Leu Arg Glu Tyr Arg 
                725                 730                 735     


Arg Phe Glu Lys Glu Lys Leu Lys Ser Gln Trp Asn Asn Asp Asn Pro 
            740                 745                 750         


Leu Phe Lys Ser Ala Thr Thr Thr Val Met Asn Pro Lys Phe Ala Glu 
        755                 760                 765             


Ser 
    


<210>  311
<211>  2932
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, beta 2 (complement component 3 receptor 3 
       and 4 subunit) (ITGB2), transcript variant 2, mRNA GenBank 
       Accession No. NM_001127491.1  GI:188595676

<400>  311
atccagggtg aggaaggcag cccacacttt tcttggagac acatccccaa agaagtcctc       60

acgtggctcc gtttgggcag aaaccatgaa ttgaacggga aaagaaatat gtcaagtatc      120

agaaagaaga gtggcatgct ttgacagcaa gtggactccg agtccagggc agagcctcag      180

ttagggacat gctgggcctg cgccccccac tgctcgccct ggtggggctg ctctccctcg      240

ggtgcgtcct ctctcaggag tgcacgaagt tcaaggtcag cagctgccgg gaatgcatcg      300

agtcggggcc cggctgcacc tggtgccaga agctgaactt cacagggccg ggggatcctg      360

actccattcg ctgcgacacc cggccacagc tgctcatgag gggctgtgcg gctgacgaca      420

tcatggaccc cacaagcctc gctgaaaccc aggaagacca caatgggggc cagaagcagc      480

tgtccccaca aaaagtgacg ctttacctgc gaccaggcca ggcagcagcg ttcaacgtga      540

ccttccggcg ggccaagggc taccccatcg acctgtacta tctgatggac ctctcctact      600

ccatgcttga tgacctcagg aatgtcaaga agctaggtgg cgacctgctc cgggccctca      660

acgagatcac cgagtccggc cgcattggct tcgggtcctt cgtggacaag accgtgctgc      720

cgttcgtgaa cacgcaccct gataagctgc gaaacccatg ccccaacaag gagaaagagt      780

gccagccccc gtttgccttc aggcacgtgc tgaagctgac caacaactcc aaccagtttc      840

agaccgaggt cgggaagcag ctgatttccg gaaacctgga tgcacccgag ggtgggctgg      900

acgccatgat gcaggtcgcc gcctgcccgg aggaaatcgg ctggcgcaac gtcacgcggc      960

tgctggtgtt tgccactgat gacggcttcc atttcgcggg cgacgggaag ctgggcgcca     1020

tcctgacccc caacgacggc cgctgtcacc tggaggacaa cttgtacaag aggagcaacg     1080

aattcgacta cccatcggtg ggccagctgg cgcacaagct ggctgaaaac aacatccagc     1140

ccatcttcgc ggtgaccagt aggatggtga agacctacga gaaactcacc gagatcatcc     1200

ccaagtcagc cgtgggggag ctgtctgagg actccagcaa tgtggtccaa ctcattaaga     1260

atgcttacaa taaactctcc tccagggtct tcctggatca caacgccctc cccgacaccc     1320

tgaaagtcac ctacgactcc ttctgcagca atggagtgac gcacaggaac cagcccagag     1380

gtgactgtga tggcgtgcag atcaatgtcc cgatcacctt ccaggtgaag gtcacggcca     1440

cagagtgcat ccaggagcag tcgtttgtca tccgggcgct gggcttcacg gacatagtga     1500

ccgtgcaggt tcttccccag tgtgagtgcc ggtgccggga ccagagcaga gaccgcagcc     1560

tctgccatgg caagggcttc ttggagtgcg gcatctgcag gtgtgacact ggctacattg     1620

ggaaaaactg tgagtgccag acacagggcc ggagcagcca ggagctggaa ggaagctgcc     1680

ggaaggacaa caactccatc atctgctcag ggctggggga ctgtgtctgc gggcagtgcc     1740

tgtgccacac cagcgacgtc cccggcaagc tgatatacgg gcagtactgc gagtgtgaca     1800

ccatcaactg tgagcgctac aacggccagg tctgcggcgg cccggggagg gggctctgct     1860

tctgcgggaa gtgccgctgc cacccgggct ttgagggctc agcgtgccag tgcgagagga     1920

ccactgaggg ctgcctgaac ccgcggcgtg ttgagtgtag tggtcgtggc cggtgccgct     1980

gcaacgtatg cgagtgccat tcaggctacc agctgcctct gtgccaggag tgccccggct     2040

gcccctcacc ctgtggcaag tacatctcct gcgccgagtg cctgaagttc gaaaagggcc     2100

cctttgggaa gaactgcagc gcggcgtgtc cgggcctgca gctgtcgaac aaccccgtga     2160

agggcaggac ctgcaaggag agggactcag agggctgctg ggtggcctac acgctggagc     2220

agcaggacgg gatggaccgc tacctcatct atgtggatga gagccgagag tgtgtggcag     2280

gccccaacat cgccgccatc gtcgggggca ccgtggcagg catcgtgctg atcggcattc     2340

tcctgctggt catctggaag gctctgatcc acctgagcga cctccgggag tacaggcgct     2400

ttgagaagga gaagctcaag tcccagtgga acaatgataa tccccttttc aagagcgcca     2460

ccacgacggt catgaacccc aagtttgctg agagttagga gcacttggtg aagacaaggc     2520

cgtcaggacc caccatgtct gccccatcac gcggccgaga catggcttgc cacagctctt     2580

gaggatgtca ccaattaacc agaaatccag ttattttccg ccctcaaaat gacagccatg     2640

gccggccggg tgcttctggg ggctcgtcgg ggggacagct ccactctgac tggcacagtc     2700

tttgcatgga gacttgagga gggagggctt gaggttggtg aggttaggtg cgtgtttcct     2760

gtgcaagtca ggacatcagt ctgattaaag gtggtgccaa tttatttaca tttaaacttg     2820

tcagggtata aaatgacatc ccattaatta tattgttaat caatcacgtg tatagaaaaa     2880

aaataaaact tcaatacagg ctgtccatgg aaaaaaaaaa aaaaaaaaaa aa             2932


<210>  312
<211>  769
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, beta 2 (complement component 3 receptor 3 
       and 4 subunit) (ITGB2), transcript variant 2, polypeptide 
       GenBank Accession No. NP_001120963.1 GI:188595677

<400>  312

Met Leu Gly Leu Arg Pro Pro Leu Leu Ala Leu Val Gly Leu Leu Ser 
1               5                   10                  15      


Leu Gly Cys Val Leu Ser Gln Glu Cys Thr Lys Phe Lys Val Ser Ser 
            20                  25                  30          


Cys Arg Glu Cys Ile Glu Ser Gly Pro Gly Cys Thr Trp Cys Gln Lys 
        35                  40                  45              


Leu Asn Phe Thr Gly Pro Gly Asp Pro Asp Ser Ile Arg Cys Asp Thr 
    50                  55                  60                  


Arg Pro Gln Leu Leu Met Arg Gly Cys Ala Ala Asp Asp Ile Met Asp 
65                  70                  75                  80  


Pro Thr Ser Leu Ala Glu Thr Gln Glu Asp His Asn Gly Gly Gln Lys 
                85                  90                  95      


Gln Leu Ser Pro Gln Lys Val Thr Leu Tyr Leu Arg Pro Gly Gln Ala 
            100                 105                 110         


Ala Ala Phe Asn Val Thr Phe Arg Arg Ala Lys Gly Tyr Pro Ile Asp 
        115                 120                 125             


Leu Tyr Tyr Leu Met Asp Leu Ser Tyr Ser Met Leu Asp Asp Leu Arg 
    130                 135                 140                 


Asn Val Lys Lys Leu Gly Gly Asp Leu Leu Arg Ala Leu Asn Glu Ile 
145                 150                 155                 160 


Thr Glu Ser Gly Arg Ile Gly Phe Gly Ser Phe Val Asp Lys Thr Val 
                165                 170                 175     


Leu Pro Phe Val Asn Thr His Pro Asp Lys Leu Arg Asn Pro Cys Pro 
            180                 185                 190         


Asn Lys Glu Lys Glu Cys Gln Pro Pro Phe Ala Phe Arg His Val Leu 
        195                 200                 205             


Lys Leu Thr Asn Asn Ser Asn Gln Phe Gln Thr Glu Val Gly Lys Gln 
    210                 215                 220                 


Leu Ile Ser Gly Asn Leu Asp Ala Pro Glu Gly Gly Leu Asp Ala Met 
225                 230                 235                 240 


Met Gln Val Ala Ala Cys Pro Glu Glu Ile Gly Trp Arg Asn Val Thr 
                245                 250                 255     


Arg Leu Leu Val Phe Ala Thr Asp Asp Gly Phe His Phe Ala Gly Asp 
            260                 265                 270         


Gly Lys Leu Gly Ala Ile Leu Thr Pro Asn Asp Gly Arg Cys His Leu 
        275                 280                 285             


Glu Asp Asn Leu Tyr Lys Arg Ser Asn Glu Phe Asp Tyr Pro Ser Val 
    290                 295                 300                 


Gly Gln Leu Ala His Lys Leu Ala Glu Asn Asn Ile Gln Pro Ile Phe 
305                 310                 315                 320 


Ala Val Thr Ser Arg Met Val Lys Thr Tyr Glu Lys Leu Thr Glu Ile 
                325                 330                 335     


Ile Pro Lys Ser Ala Val Gly Glu Leu Ser Glu Asp Ser Ser Asn Val 
            340                 345                 350         


Val Gln Leu Ile Lys Asn Ala Tyr Asn Lys Leu Ser Ser Arg Val Phe 
        355                 360                 365             


Leu Asp His Asn Ala Leu Pro Asp Thr Leu Lys Val Thr Tyr Asp Ser 
    370                 375                 380                 


Phe Cys Ser Asn Gly Val Thr His Arg Asn Gln Pro Arg Gly Asp Cys 
385                 390                 395                 400 


Asp Gly Val Gln Ile Asn Val Pro Ile Thr Phe Gln Val Lys Val Thr 
                405                 410                 415     


Ala Thr Glu Cys Ile Gln Glu Gln Ser Phe Val Ile Arg Ala Leu Gly 
            420                 425                 430         


Phe Thr Asp Ile Val Thr Val Gln Val Leu Pro Gln Cys Glu Cys Arg 
        435                 440                 445             


Cys Arg Asp Gln Ser Arg Asp Arg Ser Leu Cys His Gly Lys Gly Phe 
    450                 455                 460                 


Leu Glu Cys Gly Ile Cys Arg Cys Asp Thr Gly Tyr Ile Gly Lys Asn 
465                 470                 475                 480 


Cys Glu Cys Gln Thr Gln Gly Arg Ser Ser Gln Glu Leu Glu Gly Ser 
                485                 490                 495     


Cys Arg Lys Asp Asn Asn Ser Ile Ile Cys Ser Gly Leu Gly Asp Cys 
            500                 505                 510         


Val Cys Gly Gln Cys Leu Cys His Thr Ser Asp Val Pro Gly Lys Leu 
        515                 520                 525             


Ile Tyr Gly Gln Tyr Cys Glu Cys Asp Thr Ile Asn Cys Glu Arg Tyr 
    530                 535                 540                 


Asn Gly Gln Val Cys Gly Gly Pro Gly Arg Gly Leu Cys Phe Cys Gly 
545                 550                 555                 560 


Lys Cys Arg Cys His Pro Gly Phe Glu Gly Ser Ala Cys Gln Cys Glu 
                565                 570                 575     


Arg Thr Thr Glu Gly Cys Leu Asn Pro Arg Arg Val Glu Cys Ser Gly 
            580                 585                 590         


Arg Gly Arg Cys Arg Cys Asn Val Cys Glu Cys His Ser Gly Tyr Gln 
        595                 600                 605             


Leu Pro Leu Cys Gln Glu Cys Pro Gly Cys Pro Ser Pro Cys Gly Lys 
    610                 615                 620                 


Tyr Ile Ser Cys Ala Glu Cys Leu Lys Phe Glu Lys Gly Pro Phe Gly 
625                 630                 635                 640 


Lys Asn Cys Ser Ala Ala Cys Pro Gly Leu Gln Leu Ser Asn Asn Pro 
                645                 650                 655     


Val Lys Gly Arg Thr Cys Lys Glu Arg Asp Ser Glu Gly Cys Trp Val 
            660                 665                 670         


Ala Tyr Thr Leu Glu Gln Gln Asp Gly Met Asp Arg Tyr Leu Ile Tyr 
        675                 680                 685             


Val Asp Glu Ser Arg Glu Cys Val Ala Gly Pro Asn Ile Ala Ala Ile 
    690                 695                 700                 


Val Gly Gly Thr Val Ala Gly Ile Val Leu Ile Gly Ile Leu Leu Leu 
705                 710                 715                 720 


Val Ile Trp Lys Ala Leu Ile His Leu Ser Asp Leu Arg Glu Tyr Arg 
                725                 730                 735     


Arg Phe Glu Lys Glu Lys Leu Lys Ser Gln Trp Asn Asn Asp Asn Pro 
            740                 745                 750         


Leu Phe Lys Ser Ala Thr Thr Thr Val Met Asn Pro Lys Phe Ala Glu 
        755                 760                 765             


Ser 
    


<210>  313
<211>  4894
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, beta 3 (platelet glycoprotein IIIa, 
       antigen CD61) (ITGB3), mRNA GenBank Accession No. NM_000212.2  
       GI:47078291

<400>  313
cgccgcggga ggcggacgag atgcgagcgc ggccgcggcc ccggccgctc tgggcgactg       60

tgctggcgct gggggcgctg gcgggcgttg gcgtaggagg gcccaacatc tgtaccacgc      120

gaggtgtgag ctcctgccag cagtgcctgg ctgtgagccc catgtgtgcc tggtgctctg      180

atgaggccct gcctctgggc tcacctcgct gtgacctgaa ggagaatctg ctgaaggata      240

actgtgcccc agaatccatc gagttcccag tgagtgaggc ccgagtacta gaggacaggc      300

ccctcagcga caagggctct ggagacagct cccaggtcac tcaagtcagt ccccagagga      360

ttgcactccg gctccggcca gatgattcga agaatttctc catccaagtg cggcaggtgg      420

aggattaccc tgtggacatc tactacttga tggacctgtc ttactccatg aaggatgatc      480

tgtggagcat ccagaacctg ggtaccaagc tggccaccca gatgcgaaag ctcaccagta      540

acctgcggat tggcttcggg gcatttgtgg acaagcctgt gtcaccatac atgtatatct      600

ccccaccaga ggccctcgaa aacccctgct atgatatgaa gaccacctgc ttgcccatgt      660

ttggctacaa acacgtgctg acgctaactg accaggtgac ccgcttcaat gaggaagtga      720

agaagcagag tgtgtcacgg aaccgagatg ccccagaggg tggctttgat gccatcatgc      780

aggctacagt ctgtgatgaa aagattggct ggaggaatga tgcatcccac ttgctggtgt      840

ttaccactga tgccaagact catatagcat tggacggaag gctggcaggc attgtccagc      900

ctaatgacgg gcagtgtcat gttggtagtg acaatcatta ctctgcctcc actaccatgg      960

attatccctc tttggggctg atgactgaga agctatccca gaaaaacatc aatttgatct     1020

ttgcagtgac tgaaaatgta gtcaatctct atcagaacta tagtgagctc atcccaggga     1080

ccacagttgg ggttctgtcc atggattcca gcaatgtcct ccagctcatt gttgatgctt     1140

atgggaaaat ccgttctaaa gtagagctgg aagtgcgtga cctccctgaa gagttgtctc     1200

tatccttcaa tgccacctgc ctcaacaatg aggtcatccc tggcctcaag tcttgtatgg     1260

gactcaagat tggagacacg gtgagcttca gcattgaggc caaggtgcga ggctgtcccc     1320

aggagaagga gaagtccttt accataaagc ccgtgggctt caaggacagc ctgatcgtcc     1380

aggtcacctt tgattgtgac tgtgcctgcc aggcccaagc tgaacctaat agccatcgct     1440

gcaacaatgg caatgggacc tttgagtgtg gggtatgccg ttgtgggcct ggctggctgg     1500

gatcccagtg tgagtgctca gaggaggact atcgcccttc ccagcaggac gaatgcagcc     1560

cccgggaggg tcagcccgtc tgcagccagc ggggcgagtg cctctgtggt caatgtgtct     1620

gccacagcag tgactttggc aagatcacgg gcaagtactg cgagtgtgac gacttctcct     1680

gtgtccgcta caagggggag atgtgctcag gccatggcca gtgcagctgt ggggactgcc     1740

tgtgtgactc cgactggacc ggctactact gcaactgtac cacgcgtact gacacctgca     1800

tgtccagcaa tgggctgctg tgcagcggcc gcggcaagtg tgaatgtggc agctgtgtct     1860

gtatccagcc gggctcctat ggggacacct gtgagaagtg ccccacctgc ccagatgcct     1920

gcacctttaa gaaagaatgt gtggagtgta agaagtttga ccggggagcc ctacatgacg     1980

aaaatacctg caaccgttac tgccgtgacg agattgagtc agtgaaagag cttaaggaca     2040

ctggcaagga tgcagtgaat tgtacctata agaatgagga tgactgtgtc gtcagattcc     2100

agtactatga agattctagt ggaaagtcca tcctgtatgt ggtagaagag ccagagtgtc     2160

ccaagggccc tgacatcctg gtggtcctgc tctcagtgat gggggccatt ctgctcattg     2220

gccttgccgc cctgctcatc tggaaactcc tcatcaccat ccacgaccga aaagaattcg     2280

ctaaatttga ggaagaacgc gccagagcaa aatgggacac agccaacaac ccactgtata     2340

aagaggccac gtctaccttc accaatatca cgtaccgggg cacttaatga taagcagtca     2400

tcctcagatc attatcagcc tgtgccacga ttgcaggagt ccctgccatc atgtttacag     2460

aggacagtat ttgtggggag ggatttgggg ctcagagtgg ggtaggttgg gagaatgtca     2520

gtatgtggaa gtgtgggtct gtgtgtgtgt atgtgggggt ctgtgtgttt atgtgtgtgt     2580

gttgtgtgtg ggagtgtgta atttaaaatt gtgatgtgtc ctgataagct gagctcctta     2640

gcctttgtcc cagaatgcct cctgcaggga ttcttcctgc ttagcttgag ggtgactatg     2700

gagctgagca ggtgttcttc attacctcag tgagaagcca gctttcctca tcaggccatt     2760

gtccctgaag agaagggcag ggctgaggcc tctcattcca gaggaaggga caccaagcct     2820

tggctctacc ctgagttcat aaatttatgg ttctcaggcc tgactctcag cagctatggt     2880

aggaactgct gggcttggca gcccgggtca tctgtacctc tgcctccttt cccctccctc     2940

aggccgaagg aggagtcagg gagagctgaa ctattagagc tgcctgtgcc ttttgccatc     3000

ccctcaaccc agctatggtt ctctcgcaag ggaagtcctt gcaagctaat tctttgacct     3060

gttgggagtg aggatgtctg ggccactcag gggtcattca tggcctgggg gatgtaccag     3120

catctcccag ttcataatca caacccttca gatttgcctt attggcagct ctactctgga     3180

ggtttgttta gaagaagtgt gtcaccctta ggccagcacc atctctttac ctcctaattc     3240

cacaccctca ctgctgtaga catttgctat gagctgggga tgtctctcat gaccaaatgc     3300

ttttcctcaa agggagagag tgctattgta gagccagagg tctggcccta tgcttccggc     3360

ctcctgtccc tcatccatag cacctccaca tacctggccc tgtgccttgg tgtgctgtat     3420

ccatccatgg ggctgattgt atttaccttc tacctcttgg ctgccttgtg aaggaattat     3480

tcccatgagt tggctgggaa taagtgccag gatggaatga tgggtcagtt gtatcagcac     3540

gtgtggcctg ttcttctatg ggttggacaa cctcatttta actcagtctt taatctgaga     3600

ggccacagtg caattttatt ttatttttct catgatgagg ttttcttaac ttaaaagaac     3660

atgtatataa acatgcttgc attatatttg taaatttatg tgatggcaaa gaaggagagc     3720

ataggaaacc acacagactt gggcagggta cagacactcc cacttggcat cattcacagc     3780

aagtcactgg ccagtggctg gatctgtgag gggctctctc atgatagaag gctatgggga     3840

tagatgtgtg gacacattgg acctttcctg aggaagaggg actgttcttt tgtcccagaa     3900

aagcagtggc tccattggtg ttgacataca tccaacatta aaagccaccc ccaaatgccc     3960

aagaaaaaaa gaaagactta tcaacatttg ttccatgagc agaaaactgg agctctggcc     4020

tcagtgttac agctaaataa tctttaatta aggcaagtca ctttcttctt cttaaagctg     4080

ttttctagtt tgagaaatga tgggatttta gcagccagtc ttgaaggtct ctttcagtat     4140

caacattcta agatgctggg acttactgtg tcatcaaatg tgcggttaag attctctggg     4200

atattgatac tgtttgtgtt tttagttggg agatctgaga gacctggctt tggcaagagc     4260

agatgtcatt ccatatcacc tttctcaatg aaagtctcat tctatcctct ctccaaaccc     4320

gttttccaac atttgttaat agttacgtct ctcctgatgt agcacttaag cttcatttag     4380

ttattatttc tttcttcact ttgcacacat ttgcatccac atattaggga agaggaatcc     4440

ataagtagct gaaatatcta ttctgtatta ttgtgttaac attgagaata agccttggaa     4500

ttagatatgg ggcaatgact gagccctgtc tcacccatgg attactcctt actgtaggga     4560

atggcagtat ggtagaggga taaatagggg gcggggaggg atagtcatgg atccaagaag     4620

tccttagaaa tagtggcagg gaacaggtgt ggaagctcat gcctgtaatt ataaccttca     4680

gctactaaga caggtgtggt ggctcacgcc tgtgattata atcttcagtt actaagacag     4740

agtccatgag agtgttaatg ggacattttc tttagataag atgttttata tgaagaaact     4800

gtatcaaagg gggaagaaaa tgtatttaac aggtgaatca aatcaggaat cttgtctgag     4860

ctactggaat gaagttcaca ggtcttgaag acca                                 4894


<210>  314
<211>  788
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Homo sapiens integrin, beta 3 (platelet glycoprotein IIIa, 
       antigen CD61) (ITGB3), polypeptide GenBank Accession No. 
       NP_000203.2 GI:47078292

<400>  314

Met Arg Ala Arg Pro Arg Pro Arg Pro Leu Trp Ala Thr Val Leu Ala 
1               5                   10                  15      


Leu Gly Ala Leu Ala Gly Val Gly Val Gly Gly Pro Asn Ile Cys Thr 
            20                  25                  30          


Thr Arg Gly Val Ser Ser Cys Gln Gln Cys Leu Ala Val Ser Pro Met 
        35                  40                  45              


Cys Ala Trp Cys Ser Asp Glu Ala Leu Pro Leu Gly Ser Pro Arg Cys 
    50                  55                  60                  


Asp Leu Lys Glu Asn Leu Leu Lys Asp Asn Cys Ala Pro Glu Ser Ile 
65                  70                  75                  80  


Glu Phe Pro Val Ser Glu Ala Arg Val Leu Glu Asp Arg Pro Leu Ser 
                85                  90                  95      


Asp Lys Gly Ser Gly Asp Ser Ser Gln Val Thr Gln Val Ser Pro Gln 
            100                 105                 110         


Arg Ile Ala Leu Arg Leu Arg Pro Asp Asp Ser Lys Asn Phe Ser Ile 
        115                 120                 125             


Gln Val Arg Gln Val Glu Asp Tyr Pro Val Asp Ile Tyr Tyr Leu Met 
    130                 135                 140                 


Asp Leu Ser Tyr Ser Met Lys Asp Asp Leu Trp Ser Ile Gln Asn Leu 
145                 150                 155                 160 


Gly Thr Lys Leu Ala Thr Gln Met Arg Lys Leu Thr Ser Asn Leu Arg 
                165                 170                 175     


Ile Gly Phe Gly Ala Phe Val Asp Lys Pro Val Ser Pro Tyr Met Tyr 
            180                 185                 190         


Ile Ser Pro Pro Glu Ala Leu Glu Asn Pro Cys Tyr Asp Met Lys Thr 
        195                 200                 205             


Thr Cys Leu Pro Met Phe Gly Tyr Lys His Val Leu Thr Leu Thr Asp 
    210                 215                 220                 


Gln Val Thr Arg Phe Asn Glu Glu Val Lys Lys Gln Ser Val Ser Arg 
225                 230                 235                 240 


Asn Arg Asp Ala Pro Glu Gly Gly Phe Asp Ala Ile Met Gln Ala Thr 
                245                 250                 255     


Val Cys Asp Glu Lys Ile Gly Trp Arg Asn Asp Ala Ser His Leu Leu 
            260                 265                 270         


Val Phe Thr Thr Asp Ala Lys Thr His Ile Ala Leu Asp Gly Arg Leu 
        275                 280                 285             


Ala Gly Ile Val Gln Pro Asn Asp Gly Gln Cys His Val Gly Ser Asp 
    290                 295                 300                 


Asn His Tyr Ser Ala Ser Thr Thr Met Asp Tyr Pro Ser Leu Gly Leu 
305                 310                 315                 320 


Met Thr Glu Lys Leu Ser Gln Lys Asn Ile Asn Leu Ile Phe Ala Val 
                325                 330                 335     


Thr Glu Asn Val Val Asn Leu Tyr Gln Asn Tyr Ser Glu Leu Ile Pro 
            340                 345                 350         


Gly Thr Thr Val Gly Val Leu Ser Met Asp Ser Ser Asn Val Leu Gln 
        355                 360                 365             


Leu Ile Val Asp Ala Tyr Gly Lys Ile Arg Ser Lys Val Glu Leu Glu 
    370                 375                 380                 


Val Arg Asp Leu Pro Glu Glu Leu Ser Leu Ser Phe Asn Ala Thr Cys 
385                 390                 395                 400 


Leu Asn Asn Glu Val Ile Pro Gly Leu Lys Ser Cys Met Gly Leu Lys 
                405                 410                 415     


Ile Gly Asp Thr Val Ser Phe Ser Ile Glu Ala Lys Val Arg Gly Cys 
            420                 425                 430         


Pro Gln Glu Lys Glu Lys Ser Phe Thr Ile Lys Pro Val Gly Phe Lys 
        435                 440                 445             


Asp Ser Leu Ile Val Gln Val Thr Phe Asp Cys Asp Cys Ala Cys Gln 
    450                 455                 460                 


Ala Gln Ala Glu Pro Asn Ser His Arg Cys Asn Asn Gly Asn Gly Thr 
465                 470                 475                 480 


Phe Glu Cys Gly Val Cys Arg Cys Gly Pro Gly Trp Leu Gly Ser Gln 
                485                 490                 495     


Cys Glu Cys Ser Glu Glu Asp Tyr Arg Pro Ser Gln Gln Asp Glu Cys 
            500                 505                 510         


Ser Pro Arg Glu Gly Gln Pro Val Cys Ser Gln Arg Gly Glu Cys Leu 
        515                 520                 525             


Cys Gly Gln Cys Val Cys His Ser Ser Asp Phe Gly Lys Ile Thr Gly 
    530                 535                 540                 


Lys Tyr Cys Glu Cys Asp Asp Phe Ser Cys Val Arg Tyr Lys Gly Glu 
545                 550                 555                 560 


Met Cys Ser Gly His Gly Gln Cys Ser Cys Gly Asp Cys Leu Cys Asp 
                565                 570                 575     


Ser Asp Trp Thr Gly Tyr Tyr Cys Asn Cys Thr Thr Arg Thr Asp Thr 
            580                 585                 590         


Cys Met Ser Ser Asn Gly Leu Leu Cys Ser Gly Arg Gly Lys Cys Glu 
        595                 600                 605             


Cys Gly Ser Cys Val Cys Ile Gln Pro Gly Ser Tyr Gly Asp Thr Cys 
    610                 615                 620                 


Glu Lys Cys Pro Thr Cys Pro Asp Ala Cys Thr Phe Lys Lys Glu Cys 
625                 630                 635                 640 


Val Glu Cys Lys Lys Phe Asp Arg Gly Ala Leu His Asp Glu Asn Thr 
                645                 650                 655     


Cys Asn Arg Tyr Cys Arg Asp Glu Ile Glu Ser Val Lys Glu Leu Lys 
            660                 665                 670         


Asp Thr Gly Lys Asp Ala Val Asn Cys Thr Tyr Lys Asn Glu Asp Asp 
        675                 680                 685             


Cys Val Val Arg Phe Gln Tyr Tyr Glu Asp Ser Ser Gly Lys Ser Ile 
    690                 695                 700                 


Leu Tyr Val Val Glu Glu Pro Glu Cys Pro Lys Gly Pro Asp Ile Leu 
705                 710                 715                 720 


Val Val Leu Leu Ser Val Met Gly Ala Ile Leu Leu Ile Gly Leu Ala 
                725                 730                 735     


Ala Leu Leu Ile Trp Lys Leu Leu Ile Thr Ile His Asp Arg Lys Glu 
            740                 745                 750         


Phe Ala Lys Phe Glu Glu Glu Arg Ala Arg Ala Lys Trp Asp Thr Ala 
        755                 760                 765             


Asn Asn Pro Leu Tyr Lys Glu Ala Thr Ser Thr Phe Thr Asn Ile Thr 
    770                 775                 780                 


Tyr Arg Gly Thr 
785             


