                               SEQUENCE LISTING

<110> RESEARCH INSTITUTE AT NATIONWIDE CHILDREN'S HOSPITAL
 
<120> GENE THERAPY FOR THE TREATMENT OF GALACTOSEMIA

<130> 106887-7751

<140> PCT/US2019/049157
<141> 2019-08-30

<150> 62/725,225
<151> 2018-08-30

<160> 13

<170> PatentIn version 3.5

<210> 1
<211> 1140
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 1
atgagcagaa gcggcaccga ccctcagcag agacagcagg cctctgaagc cgatgccgcc       60

gctgccacct tcagagccaa tgaccaccag cacatccggt acaaccccct gcaggacgag      120

tgggtgctgg tgtccgccca cagaatgaag aggccttggc agggccaggt ggaaccccag      180

ctgctgaaaa ccgtgcccag acacgacccc ctgaaccctc tgtgtcctgg cgccattaga      240

gccaacggcg aagtgaaccc ccagtacgac agcaccttcc tgttcgacaa cgacttcccc      300

gccctgcagc ctgatgcccc atctcctgga cctagcgacc accctctgtt ccaggccaag      360

tctgccagag gcgtgtgcaa agtgatgtgc ttccaccctt ggagcgacgt gaccctgccc      420

ctgatgagcg tgccagagat cagagccgtg gtggatgcct gggccagcgt gacagaagaa      480

ctgggagccc agtacccctg ggtgcagatc ttcgagaaca agggcgccat gatgggctgc      540

agcaaccccc accctcactg tcaagtgtgg gccagcagct tcctgcccga tatcgcccag      600

cgggaagaga gaagccagca ggcttacaag agccagcacg gcgagcccct gctgatggaa      660

tactccagac aggaactgct gcggaaagaa cggctggtgc tgaccagcga gcactggctg      720

gtgctggtgc ctttttgggc cacatggccc taccagaccc tgctgctgcc tagaaggcac      780

gtgcggagac tgcctgagct gacacccgcc gagagagatg acctggccag catcatgaag      840

aaactgctga ccaaatacga caacctgttc gagaccagct tcccctacag catgggctgg      900

cacggcgctc ctacaggatc tgaggctggc gccaactgga accactggca gctgcacgcc      960

cactactacc ccccactgct gagatctgcc accgtgcgga agttcatggt gggatacgag     1020

atgctggctc aggcccagag agatctgacc cctgaacagg ccgccgaacg gctgagagca     1080

ctgcccgaag tgcactacca cctgggacag aaggacagag agacagccac aatcgcctga     1140


<210> 2
<211> 6237
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 2
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggaatt cacgcgtgga      120

tctgaattca attcacgcgt ggtacctctg gtcgttacat aacttacggt aaatggcccg      180

cctggctgac cgcccaacga cccccgccca ttgacgtcaa taatgacgta tgttcccata      240

gtaacgccaa tagggacttt ccattgacgt caatgggtgg agtatttacg gtaaactgcc      300

cacttggcag tacatcaagt gtatcatatg ccaagtacgc cccctattga cgtcaatgac      360

ggtaaatggc ccgcctggca ttatgcccag tacatgacct tatgggactt tcctacttgg      420

cagtacatct actcgaggcc acgttctgct tcactctccc catctccccc ccctccccac      480

ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg      540

ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga      600

gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc      660

ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcgggagcg ggatcagcca      720

ccgcggtggc ggcctagagt cgacgaggaa ctgaaaaacc agaaagttaa ctggtaagtt      780

tagtcttttt gtcttttatt tcaggtcccg gatccggtgg tggtgcaaat caaagaactg      840

ctcctcagtg gatgttgcct ttacttctag gcctgtacgg aagtgttact tctgctctaa      900

aagctgcgga attgtacccg cggccgatcc accggtctta agggccgagg cggccagatc      960

tttcgaagat atcggcgccg ctagcgcggc cgcagctgcc accatgagca gaagcggcac     1020

cgaccctcag cagagacagc aggcctctga agccgatgcc gccgctgcca ccttcagagc     1080

caatgaccac cagcacatcc ggtacaaccc cctgcaggac gagtgggtgc tggtgtccgc     1140

ccacagaatg aagaggcctt ggcagggcca ggtggaaccc cagctgctga aaaccgtgcc     1200

cagacacgac cccctgaacc ctctgtgtcc tggcgccatt agagccaacg gcgaagtgaa     1260

cccccagtac gacagcacct tcctgttcga caacgacttc cccgccctgc agcctgatgc     1320

cccatctcct ggacctagcg accaccctct gttccaggcc aagtctgcca gaggcgtgtg     1380

caaagtgatg tgcttccacc cttggagcga cgtgaccctg cccctgatga gcgtgccaga     1440

gatcagagcc gtggtggatg cctgggccag cgtgacagaa gaactgggag cccagtaccc     1500

ctgggtgcag atcttcgaga acaagggcgc catgatgggc tgcagcaacc cccaccctca     1560

ctgtcaagtg tgggccagca gcttcctgcc cgatatcgcc cagcgggaag agagaagcca     1620

gcaggcttac aagagccagc acggcgagcc cctgctgatg gaatactcca gacaggaact     1680

gctgcggaaa gaacggctgg tgctgaccag cgagcactgg ctggtgctgg tgcctttttg     1740

ggccacatgg ccctaccaga ccctgctgct gcctagaagg cacgtgcgga gactgcctga     1800

gctgacaccc gccgagagag atgacctggc cagcatcatg aagaaactgc tgaccaaata     1860

cgacaacctg ttcgagacca gcttccccta cagcatgggc tggcacggcg ctcctacagg     1920

atctgaggct ggcgccaact ggaaccactg gcagctgcac gcccactact accccccact     1980

gctgagatct gccaccgtgc ggaagttcat ggtgggatac gagatgctgg ctcaggccca     2040

gagagatctg acccctgaac aggccgccga acggctgaga gcactgcccg aagtgcacta     2100

ccacctggga cagaaggaca gagagacagc cacaatcgcc tgaagtcaag cttatcgata     2160

ccgtcgacta gagctcgctg atcagcctcg actgtgcctt ctagttgcca gccatctgtt     2220

gtttgcccct cccccgtgcc ttccttgacc ctggaaggtg ccactcccac tgtcctttcc     2280

taataaaatg aggaaattgc atcgcattgt ctgagtaggt gtcattctat tctggggggt     2340

ggggtggggc aggacagcaa gggggaggat tgggaagaca atagcaggca tgctggggag     2400

agatcgatct gaggaacccc tagtgatgga gttggccact ccctctctgc gcgctcgctc     2460

gctcactgag gccgggcgac caaaggtcgc ccgacgcccg ggctttgccc gggcggcctc     2520

agtgagcgag cgagcgcgca gagagggagt ggcccccccc cccccccccc cggcgattct     2580

cttgtttgct ccagactctc aggcaatgac ctgatagcct ttgtagagac ctctcaaaaa     2640

tagctaccct ctccggcatg aatttatcag ctagaacggt tgaatatcat attgatggtg     2700

atttgactgt ctccggcctt tctcacccgt ttgaatcttt acctacacat tactcaggca     2760

ttgcatttaa aatatatgag ggttctaaaa atttttatcc ttgcgttgaa ataaaggctt     2820

ctcccgcaaa agtattacag ggtcataatg tttttggtac aaccgattta gctttatgct     2880

ctgaggcttt attgcttaat tttgctaatt ctttgccttg cctgtatgat ttattggatg     2940

ttggaatcgc ctgatgcggt attttctcct tacgcatctg tgcggtattt cacaccgcat     3000

atggtgcact ctcagtacaa tctgctctga tgccgcatag ttaagccagc cccgacaccc     3060

gccaacacta tggtgcactc tcagtacaat ctgctctgat gccgcatagt taagccagcc     3120

ccgacacccg ccaacacccg ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc     3180

ttacagacaa gctgtgaccg tctccgggag ctgcatgtgt cagaggtttt caccgtcatc     3240

accgaaacgc gcgagacgaa agggcctcgt gatacgccta tttttatagg ttaatgtcat     3300

gataataatg gtttcttaga cgtcaggtgg cacttttcgg ggaaatgtgc gcggaacccc     3360

tatttgttta tttttctaaa tacattcaaa tatgtatccg ctcatgagac aataaccctg     3420

ataaatgctt caataatatt gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc     3480

ccttattccc ttttttgcgg cattttgcct tcctgttttt gctcacccag aaacgctggt     3540

gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg ggttacatcg aactggatct     3600

caacagcggt aagatccttg agagttttcg ccccgaagaa cgttttccaa tgatgagcac     3660

ttttaaagtt ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc aagagcaact     3720

cggtcgccgc atacactatt ctcagaatga cttggttgag tactcaccag tcacagaaaa     3780

gcatcttacg gatggcatga cagtaagaga attatgcagt gctgccataa ccatgagtga     3840

taacactgcg gccaacttac ttctgacaac gatcggagga ccgaaggagc taaccgcttt     3900

tttgcacaac atgggggatc atgtaactcg ccttgatcgt tgggaaccgg agctgaatga     3960

agccatacca aacgacgagc gtgacaccac gatgcctgta gcaatggcaa caacgttgcg     4020

caaactatta actggcgaac tacttactct agcttcccgg caacaattaa tagactggat     4080

ggaggcggat aaagttgcag gaccacttct gcgctcggcc cttccggctg gctggtttat     4140

tgctgataaa tctggagccg gtgagcgtgg gtctcgcggt atcattgcag cactggggcc     4200

agatggtaag ccctcccgta tcgtagttat ctacacgacg gggagtcagg caactatgga     4260

tgaacgaaat agacagatcg ctgagatagg tgcctcactg attaagcatt ggtaactgtc     4320

agaccaagtt tactcatata tactttagat tgatttaaaa cttcattttt aatttaaaag     4380

gatctaggtg aagatccttt ttgataatct catgaccaaa atcccttaac gtgagttttc     4440

gttccactga gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt     4500

tctgcgcgta atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt     4560

gccggatcaa gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat     4620

accaaatact gttcttctag tgtagccgta gttaggccac cacttcaaga actctgtagc     4680

accgcctaca tacctcgctc tgctaatcct gttaccagtg gctgctgcca gtggcgataa     4740

gtcgtgtctt accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg     4800

ctgaacgggg ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag     4860

atacctacag cgtgagctat gagaaagcgc cacgcttccc gaagggagaa aggcggacag     4920

gtatccggta agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa     4980

cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt     5040

gtgatgctcg tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg     5100

gttcctggcc ttttgctggc cttttgctca catgttcttt cctgcgttat cccctgattc     5160

tgtggataac cgtattaccg cctttgagtg agctgatacc gctcgccgca gccgaacgac     5220

cgagcgcagc gagtcagtga gcgaggaagc ggaagagcgc ccaatacgca aaccgcctct     5280

ccccgcgcgt tggccgattc attaatgcag ctggcgtaat agcgaagagg cccgcaccga     5340

tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg cgattccgtt gcaatggctg     5400

gcggtaatat tgttctggat attaccagca aggccgatag tttgagttct tctactcagg     5460

caagtgatgt tattactaat caaagaagta ttgcgacaac ggttaatttg cgtgatggac     5520

agactctttt actcggtggc ctcactgatt ataaaaacac ttctcaggat tctggcgtac     5580

cgttcctgtc taaaatccct ttaatcggcc tcctgtttag ctcccgctct gattctaacg     5640

aggaaagcac gttatacgtg ctcgtcaaag caaccatagt acgcgccctg tagcggcgca     5700

ttaagcgcgg cgggtgtggt ggttacgcgc agcgtgaccg ctacacttgc cagcgcccta     5760

gcgcccgctc ctttcgcttt cttcccttcc tttctcgcca cgttcgccgg ctttccccgt     5820

caagctctaa atcgggggct ccctttaggg ttccgattta gtgctttacg gcacctcgac     5880

cccaaaaaac ttgattaggg tgatggttca cgtagtgggc catcgccctg atagacggtt     5940

tttcgccctt tgacgttgga gtccacgttc tttaatagtg gactcttgtt ccaaactgga     6000

acaacactca accctatctc ggtctattct tttgatttat aagggatttt gccgatttcg     6060

gcctattggt taaaaaatga gctgatttaa caaaaattta acgcgaattt taacaaaata     6120

ttaacgctta caatttaaat atttgcttat acaatcttcc tgtttttggg gcttttctga     6180

ttatcaaccg gggtacatat gattgacatg ctagttttac gattaccgtt catcgcc        6237


<210> 3
<211> 6782
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 3
gcccaatacg caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcgcg       60

ctcgctcgct cactgaggcc gcccgggcaa agcccgggcg tcgggcgacc tttggtcgcc      120

cggcctcagt gagcgagcga gcgcgcagag agggagtggc caactccatc actaggggtt      180

ccttgtagtt aatgattaac ccgccatgct aattatctac gtagccatgt ctagacagcc      240

actatgggtc taggctgccc atgtaaggag gcaaggccta gttattaata gtaatcaatt      300

acggggtcat tagttcatag cccatatatg gagttccgcg ttacataact tacggtaaat      360

ggcccgcctg gctgaccgcc caacgacccc cgcccattga cgtcaataat gacgtatgtt      420

cccatagtaa cgccaatagg gactttccat tgacgtcaat gggtggacta tttacggtaa      480

actgcccact tggcagtaca tcaagtgtat catatgccaa gtacgccccc tattgacgtc      540

aatgacggta aatggcccgc ctggcattat gcccagtaca tgaccttatg ggactttcct      600

acttggcagt acatctacgt attagtcatc gctattacca tggtcgaggt gagccccacg      660

ttctgcttca ctctccccat ctcccccccc tccccacccc caattttgta tttatttatt      720

ttttaattat tttgtgcagc gatgggggcg gggggggggg gggggcgcgc gccaggcggg      780

gcggggcggg gcgaggggcg gggcggggcg aggcggagag gtgcggcggc agccaatcag      840

agcggcgcgc tccgaaagtt tccttttatg gcgaggcggc ggcggcggcg gccctataaa      900

aagcgaagcg cgcggcgggc gggagtcgct gcgacgctgc cttcgccccg tgccccgctc      960

cgccgccgcc tcgcgccgcc cgccccggct ctgactgacc gcgttactcc cacaggtgag     1020

cgggcgggac ggcccttctc ctccgggctg taattagcgc ttggtttaat gacggcttgt     1080

ttcttttctg tggctgcgtg aaagccttga ggggctccgg gagctagagc ctctgctaac     1140

catgttcatg ccttcttctt tttcctacag ctcctgggca acgtgctggt tattgtgctg     1200

tctcatcatt ttggcaaaga attctagcgc ggccgcagct gccaccatga gcagaagcgg     1260

caccgaccct cagcagagac agcaggcctc tgaagccgat gccgccgctg ccaccttcag     1320

agccaatgac caccagcaca tccggtacaa ccccctgcag gacgagtggg tgctggtgtc     1380

cgcccacaga atgaagaggc cttggcaggg ccaggtggaa ccccagctgc tgaaaaccgt     1440

gcccagacac gaccccctga accctctgtg tcctggcgcc attagagcca acggcgaagt     1500

gaacccccag tacgacagca ccttcctgtt cgacaacgac ttccccgccc tgcagcctga     1560

tgccccatct cctggaccta gcgaccaccc tctgttccag gccaagtctg ccagaggcgt     1620

gtgcaaagtg atgtgcttcc acccttggag cgacgtgacc ctgcccctga tgagcgtgcc     1680

agagatcaga gccgtggtgg atgcctgggc cagcgtgaca gaagaactgg gagcccagta     1740

cccctgggtg cagatcttcg agaacaaggg cgccatgatg ggctgcagca acccccaccc     1800

tcactgtcaa gtgtgggcca gcagcttcct gcccgatatc gcccagcggg aagagagaag     1860

ccagcaggct tacaagagcc agcacggcga gcccctgctg atggaatact ccagacagga     1920

actgctgcgg aaagaacggc tggtgctgac cagcgagcac tggctggtgc tggtgccttt     1980

ttgggccaca tggccctacc agaccctgct gctgcctaga aggcacgtgc ggagactgcc     2040

tgagctgaca cccgccgaga gagatgacct ggccagcatc atgaagaaac tgctgaccaa     2100

atacgacaac ctgttcgaga ccagcttccc ctacagcatg ggctggcacg gcgctcctac     2160

aggatctgag gctggcgcca actggaacca ctggcagctg cacgcccact actacccccc     2220

actgctgaga tctgccaccg tgcggaagtt catggtggga tacgagatgc tggctcaggc     2280

ccagagagat ctgacccctg aacaggccgc cgaacggctg agagcactgc ccgaagtgca     2340

ctaccacctg ggacagaagg acagagagac agccacaatc gcctgaagtc aagcttatcg     2400

ataatcaacc tctggattac aaaatttgtg aaagattgac tggtattctt aactatgttg     2460

ctccttttac gctatgtgga tacgctgctt taatgccttt gtatcatgct attgcttccc     2520

gtatggcttt cattttctcc tccttgtata aatcctggtt gctgtctctt tatgaggagt     2580

tgtggcccgt tgtcaggcaa cgtggcgtgg tgtgcactgt gtttgctgac gcaaccccca     2640

ctggttgggg cattgccacc acctgtcagc tcctttccgg gactttcgct ttccccctcc     2700

ctattgccac ggcggaactc atcgccgcct gccttgcccg ctgctggaca ggggctcggc     2760

tgttgggcac tgacaattcc gtggtgttgt cggggaaatc atcgtccttt ccttggctgc     2820

tcgcctgtgt tgccacctgg attctgcgcg ggacgtcctt ctgctacgtc ccttcggccc     2880

tcaatccagc ggaccttcct tcccgcggcc tgctgccggc tctgcggcct cttccgcgtc     2940

ttcgccttcg ccctcagacg agtcggatct ccctttgggc cgcctccccg catcgatacc     3000

gtcgaggccg caataaaaga tctttatttt cattagatct gtgtgttggt tttttgtgtg     3060

tctagacatg gctacgtaga taattagcat ggcgggttaa tcattaacta caaggaaccc     3120

ctagtgatgg agttggccac tccctctctg cgcgctcgct cgctcactga ggccgggcga     3180

ccaaaggtcg cccgacgccc gggctttgcc cgggcggcct cagtgagcga gcgagcgcgc     3240

cagctggcgt aatagcgaag aggcccgcac cgatcgccct tcccaacagt tgcgcagcct     3300

gaatggcgaa tggaagttcc gttgcaatgg ctggcggtaa tattgttctg gatattacca     3360

gcaaggccga tagtttgagt tcttctactc aggcaagtga tgttattact aatcaaagaa     3420

gtattgcgac aacggttaat ttgcgtgatg gacagactct tttactcggt ggcctcactg     3480

attataaaaa cacttctcag gattctggcg taccgttcct gtctaaaatc cctttaatcg     3540

gcctcctgtt tagctcccgc tctgattcta acgaggaaag cacgttatac gtgctcgtca     3600

aagcaaccat agtacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg     3660

cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct     3720

tcctttctcg ccacgttcgc cggctttccc cgtcaagctc taaatcgggg gctcccttta     3780

gggttccgat ttagtgattt acggcacctc gaccccaaaa aacttgatta gggtgatggt     3840

tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg     3900

ttctttaata gtggactctt gttccaaact ggaacaacac tcaaccctat ctcggtctat     3960

tcttttgatt tataagggat tttgccgatt tcggcctatt ggttaaaaaa tgagctgatt     4020

taacaaaaat ttaacgcgaa ttttaacaaa atattaacgt ttacaattta aatatttgct     4080

tatacaatct tcctgttttt ggggcttttc tgattatcaa ccggggtaca tatgattgac     4140

atgctagttt tacgattacc gttcatcgat tctcttgttt gctccagact ctcaggcaat     4200

gacctgatag cctttgtaga gacctctcaa aaatagctac cctctccggc atgaatttat     4260

cagctagaac ggttgaatat catattgatg gtgatttgac tgtctccggc ctttctcacc     4320

cgtttgaatc tttacctaca cattactcag gcattgcatt taaaatatat gagggttcta     4380

aaaattttta tccttgcgtt gaaataaagg cttctcccgc aaaagtatta cagggtcata     4440

atgtttttgg tacaaccgat ttagctttat gctctgaggc tttattgctt aattttgcta     4500

attctttgcc ttgcctgtat gatttattgg atgttggaag ttcctgatgc ggtattttct     4560

ccttacgcat ctgtgcggta tttcacaccg catatggtgc actctcagta caatctgctc     4620

tgatgccgca tagttaagcc agccccgaca cccgccaaca cccgctgacg cgccctgacg     4680

ggcttgtctg ctcccggcat ccgcttacag acaagctgtg accgtctccg ggagctgcat     4740

gtgtcagagg ttttcaccgt catcaccgaa acgcgcgaga cgaaagggcc tcgtgatacg     4800

cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt     4860

tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta     4920

tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat     4980

gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt     5040

ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg     5100

agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga     5160

agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg     5220

tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt     5280

tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg     5340

cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg     5400

aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga     5460

tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc     5520

tgtagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc     5580

ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc     5640

ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg     5700

cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac     5760

gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc     5820

actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt     5880

aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac     5940

caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa     6000

aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc     6060

accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt     6120

aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg     6180

ccaccacttc aagaactctg tagcaccgcg tacatacctc gctctgctaa tcctgttacc     6240

agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt     6300

accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga     6360

gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct     6420

tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg     6480

cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca     6540

cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa     6600

cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt     6660

ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgggtttg agtgagctga     6720

taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgacc aagcggaaga     6780

gc                                                                    6782


<210> 4
<211> 379
<212> PRT
<213> Homo sapiens

<400> 4
Met Ser Arg Ser Gly Thr Asp Pro Gln Gln Arg Gln Gln Ala Ser Glu 
1               5                   10                  15      


Ala Asp Ala Ala Ala Ala Thr Phe Arg Ala Asn Asp His Gln His Ile 
            20                  25                  30          


Arg Tyr Asn Pro Leu Gln Asp Glu Trp Val Leu Val Ser Ala His Arg 
        35                  40                  45              


Met Lys Arg Pro Trp Gln Gly Gln Val Glu Pro Gln Leu Leu Lys Thr 
    50                  55                  60                  


Val Pro Arg His Asp Pro Leu Asn Pro Leu Cys Pro Gly Ala Ile Arg 
65                  70                  75                  80  


Ala Asn Gly Glu Val Asn Pro Gln Tyr Asp Ser Thr Phe Leu Phe Asp 
                85                  90                  95      


Asn Asp Phe Pro Ala Leu Gln Pro Asp Ala Pro Ser Pro Gly Pro Ser 
            100                 105                 110         


Asp His Pro Leu Phe Gln Ala Lys Ser Ala Arg Gly Val Cys Lys Val 
        115                 120                 125             


Met Cys Phe His Pro Trp Ser Asp Val Thr Leu Pro Leu Met Ser Val 
    130                 135                 140                 


Pro Glu Ile Arg Ala Val Val Asp Ala Trp Ala Ser Val Thr Glu Glu 
145                 150                 155                 160 


Leu Gly Ala Gln Tyr Pro Trp Val Gln Ile Phe Glu Asn Lys Gly Ala 
                165                 170                 175     


Met Met Gly Cys Ser Asn Pro His Pro His Cys Gln Val Trp Ala Ser 
            180                 185                 190         


Ser Phe Leu Pro Asp Ile Ala Gln Arg Glu Glu Arg Ser Gln Gln Ala 
        195                 200                 205             


Tyr Lys Ser Gln His Gly Glu Pro Leu Leu Met Glu Tyr Ser Arg Gln 
    210                 215                 220                 


Glu Leu Leu Arg Lys Glu Arg Leu Val Leu Thr Ser Glu His Trp Leu 
225                 230                 235                 240 


Val Leu Val Pro Phe Trp Ala Thr Trp Pro Tyr Gln Thr Leu Leu Leu 
                245                 250                 255     


Pro Arg Arg His Val Arg Arg Leu Pro Glu Leu Thr Pro Ala Glu Arg 
            260                 265                 270         


Asp Asp Leu Ala Ser Ile Met Lys Lys Leu Leu Thr Lys Tyr Asp Asn 
        275                 280                 285             


Leu Phe Glu Thr Ser Phe Pro Tyr Ser Met Gly Trp His Gly Ala Pro 
    290                 295                 300                 


Thr Gly Ser Glu Ala Gly Ala Asn Trp Asn His Trp Gln Leu His Ala 
305                 310                 315                 320 


His Tyr Tyr Pro Pro Leu Leu Arg Ser Ala Thr Val Arg Lys Phe Met 
                325                 330                 335     


Val Gly Tyr Glu Met Leu Ala Gln Ala Gln Arg Asp Leu Thr Pro Glu 
            340                 345                 350         


Gln Ala Ala Glu Arg Leu Arg Ala Leu Pro Glu Val His Tyr His Leu 
        355                 360                 365             


Gly Gln Lys Asp Arg Glu Thr Ala Thr Ile Ala 
    370                 375                 


<210> 5
<211> 736
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 5
Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asn Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Gln Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Ser Pro Val Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 
145                 150                 155                 160 


Gly Lys Lys Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln 
                165                 170                 175     


Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro 
            180                 185                 190         


Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
        195                 200                 205             


Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
    210                 215                 220                 


Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225                 230                 235                 240 


Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
                245                 250                 255     


Leu Tyr Lys Gln Ile Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
            260                 265                 270         


Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
        275                 280                 285             


Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn 
    290                 295                 300                 


Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305                 310                 315                 320 


Ile Gln Val Lys Glu Val Thr Gln Asn Glu Gly Thr Lys Thr Ile Ala 
                325                 330                 335     


Asn Asn Leu Thr Ser Thr Ile Gln Val Phe Thr Asp Ser Glu Tyr Gln 
            340                 345                 350         


Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe 
        355                 360                 365             


Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn 
    370                 375                 380                 


Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385                 390                 395                 400 


Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
                405                 410                 415     


Asn Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser 
            420                 425                 430         


Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu 
        435                 440                 445             


Ser Arg Thr Gln Ser Thr Gly Gly Thr Ala Gly Thr Gln Gln Leu Leu 
    450                 455                 460                 


Phe Ser Gln Ala Gly Pro Asn Asn Met Ser Ala Gln Ala Lys Asn Trp 
465                 470                 475                 480 


Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Leu Ser 
                485                 490                 495     


Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
            500                 505                 510         


Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
        515                 520                 525             


His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
    530                 535                 540                 


Phe Gly Lys Gln Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545                 550                 555                 560 


Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr 
                565                 570                 575     


Glu Gln Tyr Gly Val Val Ala Asp Asn Leu Gln Gln Gln Asn Ala Ala 
            580                 585                 590         


Pro Val Gly Ala Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val Trp 
        595                 600                 605             


Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Thr Phe Asn Gln Ala Lys Leu Ala Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu Gly Thr 
705                 710                 715                 720 


Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
                725                 730                 735     


<210> 6
<211> 2215
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polynucleotide

<400> 6
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc       60

gagtggtggg acctgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac      120

aacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac      180

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac      240

cagcagctcc aagcgggtga caatccgtac ctgcggtata atcacgccga cgccgagttt      300

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgcgc agtcttccag      360

gccaaaaagc gggttctcga acctctgggc ctggttgaat cgccggttaa gacggctcct      420

ggaaagaaga gaccggtaga gccatcaccc cagcgctctc cagactcctc tacgggcatc      480

ggcaagaaag gccagcagcc cgcaaaaaag agactcaatt ttgggcagac tggcgactca      540

gagtcagtcc ccgaccctca accaatcgga gaaccaccag caggcccctc tggtctggga      600

tctggtacaa tggctgcagg cggtggcgct ccaatggcag acaataacga aggcgccgac      660

ggagtgggta gttcctcagg aaattggcat tgcgattcca catggctggg cgacagagtc      720

atcaccacca gcacccgcac ctgggccctg cccacctaca acaaccacct ctacaagcaa      780

atctccaacg ggacctcggg aggaagcacc aacgacaaca cctacttcgg ctacagcacc      840

ccctgggggt attttgactt caacagattc cactgccact tttcaccacg tgactggagc      900

gactcatcaa caacaactgg ggattccggc ccaagaggct caacttcaag ctcttcaaca      960

tccaagtcaa ggaggtcacg cagaatgaag gcaccaagag catcgccaat aaccttacca     1020

gcaggattca ggtctttacg gactcggaat accagctccc gtacgtgctc ggctcggcgc     1080

accagggctg cctgcctccg ttcccggcgg acgtcttcat gattcctcag tacgggtacc     1140

tgactctgaa caatggcagt caggctgtgg gccggtcgtc cttctactgc ctggagtact     1200

ttccttctca aatgctgaga acgggcaaca actttgaatt cagctacaac ttcgaggacg     1260

tgcccttcca cagcagctac gcgcacagcc agagcctgga ccggctgatg aaccctctca     1320

tcgaccagta cttgtactac ctgtcccgga ctcaaagcac gggcggtact gcaggaactc     1380

agcagttgct attttctcag gccgggccta acaacatgtc ggctcaggcc aagaactggc     1440

tacccggtcc ctgctaccgg cagcacgcgt ctccacgaca ctgtcgcaga acaacaacag     1500

caactttgcc tggacgggtg ccaccaagta tcatctgaat ggcagagact ctctggtgaa     1560

tcctggcgtt gccatggcta cccacaagga cgacgaagag cgattttttc catccagcgg     1620

agtcttaatg tttgggaaac agggagctgg aaaagacaac gtggactata gcagcgtgat     1680

gctaaccagc gaggaagaaa taaagaccac caacccagtg gccacagaac agtacggcgt     1740

ggtggccgat aacctgcaac agcaaaacgc cgctcctatt gtaggggccg tcaatagtca     1800

aggagcctta cctggcatgg tgtggcagaa ccgggacgtg tacctgcagg gtcccatctg     1860

ggccaagatt cctcatacgg acggcaactt tcatccctcg ccgctgatgg gaggctttgg     1920

actgaagcat ccgcctcctc agatcctgat taaaaacaca cctgttcccg cggatcctcc     1980

gaccaccttc aatcaggcca agctggcttc tttcatcacg cagtacagta ccggccaggt     2040

cagcgtggag atcgagtggg agctgcagaa ggagaacagc aaacgctgga acccagagat     2100

tcagtacact tccaactact acaaatctac aaatgtggac tttgctgtca atactgaggg     2160

tacttattcc gagcctcgcc ccattggcac ccgttacctc acccgtaatc tgtaa          2215


<210> 7
<211> 271
<212> PRT
<213> Unknown

<220>
<223> Description of Unknown: 
      KAN Gene Translation sequence

<400> 7
Met Ser His Ile Gln Arg Glu Thr Ser Cys Ser Arg Pro Arg Leu Asn 
1               5                   10                  15      


Ser Asn Met Asp Ala Asp Leu Tyr Gly Tyr Lys Trp Ala Arg Asp Asn 
            20                  25                  30          


Val Gly Gln Ser Gly Ala Thr Ile Tyr Arg Leu Tyr Gly Lys Pro Asp 
        35                  40                  45              


Ala Pro Glu Leu Phe Leu Lys His Gly Lys Gly Ser Val Ala Asn Asp 
    50                  55                  60                  


Val Thr Asp Glu Met Val Arg Leu Asn Trp Leu Thr Glu Phe Met Pro 
65                  70                  75                  80  


Leu Pro Thr Ile Lys His Phe Ile Arg Thr Pro Asp Asp Ala Trp Leu 
                85                  90                  95      


Leu Thr Thr Ala Ile Pro Gly Lys Thr Ala Phe Gln Val Leu Glu Glu 
            100                 105                 110         


Tyr Pro Asp Ser Gly Glu Asn Ile Val Asp Ala Leu Ala Val Phe Leu 
        115                 120                 125             


Arg Arg Leu His Ser Ile Pro Val Cys Asn Cys Pro Phe Asn Ser Asp 
    130                 135                 140                 


Arg Val Phe Arg Leu Ala Gln Ala Gln Ser Arg Met Asn Asn Gly Leu 
145                 150                 155                 160 


Val Asp Ala Ser Asp Phe Asp Asp Glu Arg Asn Gly Trp Pro Val Glu 
                165                 170                 175     


Gln Val Trp Lys Glu Met His Lys Leu Leu Pro Phe Ser Pro Asp Ser 
            180                 185                 190         


Val Val Thr His Gly Asp Phe Ser Leu Asp Asn Leu Ile Phe Asp Glu 
        195                 200                 205             


Gly Lys Leu Ile Gly Cys Ile Asp Val Gly Arg Val Gly Ile Ala Asp 
    210                 215                 220                 


Arg Tyr Gln Asp Leu Ala Ile Leu Trp Asn Cys Leu Gly Glu Phe Ser 
225                 230                 235                 240 


Pro Ser Leu Gln Lys Arg Leu Phe Gln Lys Tyr Gly Ile Asp Asn Pro 
                245                 250                 255     


Asp Met Asn Lys Leu Gln Phe His Leu Met Leu Asp Glu Phe Phe 
            260                 265                 270     


<210> 8
<211> 5
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      peptide

<400> 8
Tyr Ile Gly Ser Arg 
1               5   


<210> 9
<211> 738
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 9
Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asn Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Gln Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Ser Pro Val Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 
145                 150                 155                 160 


Gly Lys Lys Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln 
                165                 170                 175     


Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro 
            180                 185                 190         


Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
        195                 200                 205             


Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
    210                 215                 220                 


Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225                 230                 235                 240 


Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
                245                 250                 255     


Leu Tyr Lys Gln Ile Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
            260                 265                 270         


Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
        275                 280                 285             


Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn 
    290                 295                 300                 


Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305                 310                 315                 320 


Ile Gln Val Lys Glu Val Thr Gln Asn Glu Gly Thr Lys Thr Ile Ala 
                325                 330                 335     


Asn Asn Leu Thr Ser Thr Ile Gln Val Phe Thr Asp Ser Glu Tyr Gln 
            340                 345                 350         


Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe 
        355                 360                 365             


Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn 
    370                 375                 380                 


Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385                 390                 395                 400 


Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
                405                 410                 415     


Asn Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser 
            420                 425                 430         


Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu 
        435                 440                 445             


Ser Arg Thr Gln Ser Thr Gly Gly Thr Ala Gly Thr Gln Gln Leu Leu 
    450                 455                 460                 


Phe Ser Gln Ala Gly Pro Asn Asn Met Ser Ala Gln Ala Lys Asn Trp 
465                 470                 475                 480 


Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Leu Ser 
                485                 490                 495     


Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
            500                 505                 510         


Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
        515                 520                 525             


His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
    530                 535                 540                 


Phe Gly Lys Gln Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545                 550                 555                 560 


Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr 
                565                 570                 575     


Glu Gln Tyr Gly Val Val Ala Asp Asn Leu Gln Gln Gln Asn Ala Ala 
            580                 585                 590         


Pro Ile Val Gly Ala Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val 
        595                 600                 605             


Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile 
    610                 615                 620                 


Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625                 630                 635                 640 


Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val 
                645                 650                 655     


Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ala Lys Leu Ala Ser Phe 
            660                 665                 670         


Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu 
        675                 680                 685             


Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr 
    690                 695                 700                 


Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu 
705                 710                 715                 720 


Gly Thr Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg 
                725                 730                 735     


Asn Leu 
        


<210> 10
<211> 738
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 10
Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asn Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Gln Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Ser Pro Val Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 
145                 150                 155                 160 


Gly Lys Lys Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln 
                165                 170                 175     


Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro 
            180                 185                 190         


Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
        195                 200                 205             


Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
    210                 215                 220                 


Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225                 230                 235                 240 


Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
                245                 250                 255     


Leu Tyr Lys Gln Ile Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
            260                 265                 270         


Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
        275                 280                 285             


Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn 
    290                 295                 300                 


Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305                 310                 315                 320 


Ile Gln Val Lys Glu Val Thr Gln Asn Glu Gly Thr Lys Thr Ile Ala 
                325                 330                 335     


Asn Asn Leu Thr Ser Thr Ile Gln Val Phe Thr Asp Ser Glu Tyr Gln 
            340                 345                 350         


Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe 
        355                 360                 365             


Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn 
    370                 375                 380                 


Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385                 390                 395                 400 


Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
                405                 410                 415     


Asn Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser 
            420                 425                 430         


Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu 
        435                 440                 445             


Ser Arg Thr Gln Ser Thr Gly Gly Thr Ala Gly Thr Gln Gln Leu Leu 
    450                 455                 460                 


Phe Ser Gln Ala Gly Pro Asn Asn Met Ser Ala Gln Ala Lys Asn Trp 
465                 470                 475                 480 


Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Leu Ser 
                485                 490                 495     


Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
            500                 505                 510         


Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
        515                 520                 525             


His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
    530                 535                 540                 


Phe Gly Lys Gln Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545                 550                 555                 560 


Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr 
                565                 570                 575     


Glu Gln Tyr Gly Val Val Ala Asp Asn Leu Gln Gln Gln Asn Ala Ala 
            580                 585                 590         


Pro Ile Val Gly Ala Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val 
        595                 600                 605             


Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile 
    610                 615                 620                 


Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625                 630                 635                 640 


Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val 
                645                 650                 655     


Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ala Lys Leu Ala Ser Phe 
            660                 665                 670         


Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu 
        675                 680                 685             


Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr 
    690                 695                 700                 


Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu 
705                 710                 715                 720 


Gly Thr Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg 
                725                 730                 735     


Asn Leu 
        


<210> 11
<211> 738
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 11
Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asn Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Gln Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Ser Pro Val Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 
145                 150                 155                 160 


Gly Lys Lys Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln 
                165                 170                 175     


Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro 
            180                 185                 190         


Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
        195                 200                 205             


Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
    210                 215                 220                 


Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225                 230                 235                 240 


Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
                245                 250                 255     


Leu Tyr Lys Gln Ile Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
            260                 265                 270         


Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
        275                 280                 285             


Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn 
    290                 295                 300                 


Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305                 310                 315                 320 


Ile Gln Val Lys Glu Val Thr Gln Asn Glu Gly Thr Lys Thr Ile Ala 
                325                 330                 335     


Asn Asn Leu Thr Ser Thr Ile Gln Val Phe Thr Asp Ser Glu Tyr Gln 
            340                 345                 350         


Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe 
        355                 360                 365             


Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn 
    370                 375                 380                 


Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385                 390                 395                 400 


Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
                405                 410                 415     


Asn Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser 
            420                 425                 430         


Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu 
        435                 440                 445             


Ser Arg Thr Gln Ser Thr Gly Gly Thr Ala Gly Thr Gln Gln Leu Leu 
    450                 455                 460                 


Phe Ser Gln Ala Gly Pro Asn Asn Met Ser Ala Gln Ala Lys Asn Trp 
465                 470                 475                 480 


Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Leu Ser 
                485                 490                 495     


Gln Asn Asn Asn Ser Ile Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
            500                 505                 510         


Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
        515                 520                 525             


His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
    530                 535                 540                 


Phe Gly Lys Gln Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545                 550                 555                 560 


Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr 
                565                 570                 575     


Glu Gln Tyr Gly Val Val Ala Asp Asn Leu Gln Gln Gln Asn Ala Ala 
            580                 585                 590         


Pro Ile Val Gly Ala Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val 
        595                 600                 605             


Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile 
    610                 615                 620                 


Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625                 630                 635                 640 


Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val 
                645                 650                 655     


Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ala Lys Leu Ala Ser Phe 
            660                 665                 670         


Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu 
        675                 680                 685             


Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr 
    690                 695                 700                 


Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu 
705                 710                 715                 720 


Gly Thr Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg 
                725                 730                 735     


Asn Leu 
        


<210> 12
<211> 738
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 12
Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln Lys Gln Asp Asn Gly Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Gln Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Gln Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Val Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Ser Pro Val Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Pro Ser Pro Gln Arg Ser Pro Asp Ser Ser Thr Gly Ile 
145                 150                 155                 160 


Gly Lys Lys Gly Gln Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln 
                165                 170                 175     


Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro 
            180                 185                 190         


Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
        195                 200                 205             


Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
    210                 215                 220                 


Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225                 230                 235                 240 


Ile Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
                245                 250                 255     


Leu Tyr Lys Gln Ile Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
            260                 265                 270         


Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
        275                 280                 285             


Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn 
    290                 295                 300                 


Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305                 310                 315                 320 


Ile Gln Val Lys Glu Val Thr Gln Asn Glu Gly Thr Lys Thr Ile Ala 
                325                 330                 335     


Asn Asn Leu Thr Ser Thr Ile Gln Val Phe Thr Asp Ser Glu Tyr Gln 
            340                 345                 350         


Leu Pro Tyr Val Leu Gly Ser Ala His Gln Gly Cys Leu Pro Pro Phe 
        355                 360                 365             


Pro Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn 
    370                 375                 380                 


Asn Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385                 390                 395                 400 


Phe Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
                405                 410                 415     


Asn Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser 
            420                 425                 430         


Leu Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu 
        435                 440                 445             


Ser Arg Thr Gln Ser Thr Gly Gly Thr Ala Gly Thr Gln Gln Leu Leu 
    450                 455                 460                 


Phe Ser Gln Ala Gly Pro Asn Asn Met Ser Ala Gln Ala Lys Asn Trp 
465                 470                 475                 480 


Leu Pro Gly Pro Cys Tyr Arg Gln Gln Arg Val Ser Thr Thr Leu Ser 
                485                 490                 495     


Gln Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
            500                 505                 510         


Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
        515                 520                 525             


His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
    530                 535                 540                 


Phe Gly Lys Gln Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545                 550                 555                 560 


Met Leu Thr Ser Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr 
                565                 570                 575     


Glu Gln Tyr Gly Val Val Ala Asp Asn Leu Gln Gln Gln Asn Tyr Ile 
            580                 585                 590         


Gly Ser Arg Gly Ala Val Asn Ser Gln Gly Ala Leu Pro Gly Met Val 
        595                 600                 605             


Trp Gln Asn Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile 
    610                 615                 620                 


Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625                 630                 635                 640 


Gly Leu Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val 
                645                 650                 655     


Pro Ala Asp Pro Pro Thr Thr Phe Asn Gln Ala Lys Leu Ala Ser Phe 
            660                 665                 670         


Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu 
        675                 680                 685             


Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr 
    690                 695                 700                 


Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu 
705                 710                 715                 720 


Gly Thr Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg 
                725                 730                 735     


Asn Leu 
        


<210> 13
<211> 1140
<212> DNA
<213> Homo sapiens

<400> 13
atgtcgcgca gtggaaccga tcctcagcaa cgccagcagg cgtcagaggc ggacgccgca       60

gcagcaacct tccgggcaaa cgaccatcag catatccgct acaacccgct gcaggatgag      120

tgggtgctgg tgtcagctca ccgcatgaag cggccctggc agggtcaagt ggagccccag      180

cttctgaaga cagtgccccg ccatgaccct ctcaaccctc tgtgtcctgg ggccatccga      240

gccaacggag aggtgaatcc ccagtacgat agcaccttcc tgtttgacaa cgacttccca      300

gctctgcagc ctgatgcccc cagtccagga cccagtgatc atcccctttt ccaagcaaag      360

tctgctcgag gagtctgtaa ggtcatgtgc ttccacccct ggtcggatgt aacgctgcca      420

ctcatgtcgg tccctgagat ccgggctgtt gttgatgcat gggcctcagt cacagaggag      480

ctgggtgccc agtacccttg ggtgcagatc tttgaaaaca aaggtgccat gatgggctgt      540

tctaaccccc acccccactg ccaggtatgg gccagcagtt tcctgccaga tattgcccag      600

cgtgaggagc gatctcagca ggcctataag agtcagcatg gagagcccct gctaatggag      660

tacagccgcc aggagctact caggaaggaa cgtctggtcc taaccagtga gcactggtta      720

gtactggtcc ccttctgggc aacatggccc taccagacac tgctgctgcc ccgtcggcat      780

gtgcggcggc tacctgagct gacccctgct gagcgtgatg atctagcctc catcatgaag      840

aagctcttga ccaagtatga caacctcttt gagacgtcct ttccctactc catgggctgg      900

catggggctc ccacaggatc agaggctggg gccaactgga accattggca gctgcacgct      960

cattactacc ctccgctcct gcgctctgcc actgtccgga aattcatggt tggctacgaa     1020

atgcttgctc aggctcagag ggacctcacc cctgagcagg ctgcagagag actaagggca     1080

cttcctgagg ttcattacca cctggggcag aaggacaggg agacagcaac catcgcctga     1140


